powered by
Center for Curriculum and Transfer Articulation
Bioinformatics and Scientific Computing
Course: CSC283

First Term: 2021 Fall
Lec + Lab   3.0 Credit(s)   4.0 Period(s)   4.0 Load  
Subject Type: Academic
Load Formula: T - Lab Load


Description: Introduction to bioinformatics, including history, concepts, major genetic databases and access tools. Computer software and techniques for analyzing one nucleotide or protein sequence, searching for similar sequences, and aligning and comparing two or multiple sequences. Microarray analysis and phylogenetic trees. Application of standard software to bioinformatic computing tasks, including word processing of reports, and use of spreadsheets for statistical analysis and graphing. Text editors, Unix, Internet web site searching and construction, and ethics.



MCCCD Official Course Competencies
1. Identify major genetic databases and access tools and describe special features and strengths of each. (I)
2. Use software to perform basic statistical analyses and characterize sequences. (II)
3. Use software to locate open reading frames. (II)
4. Use available software to search genetic databases for similar sequences. (III)
5. Use available software to align sequences and determine similarities. (III, IV)
6. Select appropriate multiple sequences, align them, and evaluate the results. (V)
7. Construct various types of phylogenetic trees from multiple aligned sequences. (V)
8. Perform normalization and statistical analysis on microarray gene expression data. (VI)
9. Use a word processor to create bioinformatic reports and forms. (VII)
10. Use a spreadsheet to perform statistical analysis and create graphs of bioinformatic data. (VII)
MCCCD Official Course Competencies must be coordinated with the content outline so that each major point in the outline serves one or more competencies. MCCCD faculty retains authority in determining the pedagogical approach, methodology, content sequencing, and assessment metrics for student work. Please see individual course syllabi for additional information, including specific course requirements.
 
MCCCD Official Course Outline
I. Introduction
   A. What are bioinformatics and scientific computing?
   B. History
   C. Definitions and concepts
      1. Review of molecular biology
         a. Frequencies and abbreviations for amino acids
         b. Reading protein and DNA sequences correctly
      2. Computer applications to molecular biology
   D. Uses of bioinformatics
   E. Levels of focus
      1. Single sequences or genes
      2. Domains and families
      3. Metabolic pathways
      4. Genomes
   F. Major genetic databases and access tools
      1. National Center for Biotechnology Information (NCBI)
         a. GenBank database
            (1). Organisms and types of data stored
            (2). Organization of the data, accession numbers
            (3). Understanding a GenBank entry
         b. Entrez to search across databases
         c. PubMed literature research
            (1). Basic searching
            (2). Searching using fields
            (3). Searching using limits
            (4). MeSH terminology
         d. BLAST searching programs for DNA and protein sequences
         e. Online Mendelian Inheritance in Man
         f. Other NCBI databases and tools
      2. Other nucleotide sequence databases and software
         a. Additional NCBI databases and software
         b. Internet sites around the world
      3. Other protein databases and software
         a. Additional NCBI databases and software
         b. Internet sites around the world
II. Analyzing one sequence
   A. Nucleotide sequences
      1. Detecting vector contamination
      2. Restriction maps
      3. Analyzing DNA/RNA composition
         a. Basic statistics and probability
         b. Counting words and repeats
         c. GC content: percentage of guanine and cytosine nucleotides
         d. Other analyses
         e. Analysis software
      4. Finding protein coding regions
         a. Open reading frames
         b. Exons
         c. Assembly
   B. Protein sequences
      1. Predicting physical properties
      2. Predicting structure
      3. Finding domains
III. Searching for similar sequences
   A. Homology, similarity, and identity concepts
   B. The BLAST family of programs for nucleotides and proteins
      1. Choosing parameters
      2. Masking
      3. Bit scores and E-values
      4. Formatting output
      5. Evaluating results
IV. Alignment and comparison of two sequences
   A. Issues
      1. Sequence lengths
      2. Scoring algorithms
      3. Manual evaluation and tweaking
   B. Dot Plots
   C. Local and global alignments
V. Selection, alignment and comparison of multiple sequences
   A. Selecting sequences to align
      1. How many to align at once
      2. How close in identity
      3. Evaluating quality of and interpreting alignments
   B. Clustal family of alignment programs
      1. Progressive alignment is influenced by order of sequence entry
      2. Parameter refinements
         a. Substitution matrices
         b. Gap-opening penalties
         c. Gap-extension penalties
   C. Other alignment programs
   D. Phylogenetic trees for comparison of multiple dequences
      1. Types of trees
         a. Binary trees
         b. Consensus trees
         c. Phylograms
      2. Construction algorithms
         a. Maximum parsimony
         b. Neighbor joining
         c. Maximum likelihood
      3. Validation using bootstrapping
VI. Microarrays for gene expression analysis
   A. Target selection and experimental design
   B. Image analysis
   C. Data validation and normalization
   D. Statistical analysis
VII. Scientific computing
   A. Text editors
      1. Creating, editing, saving, and retrieving data files
      2. Converting data files to different file formats
         a. Operating system file formats
         b. FASTA (Fast-All) and other sequence file formats
   B. Word processing
      1. Creating scientific reports
         a. Text layouts and headings
         b. Tables
         c. Charts or graphs
         d. Diagrams
      2. Creating forms for scientific or lab use
   C. Spreadsheets
      1. Bioinformatic data computation and statistical analysis
      2. Editing rows and columns, entering calculation formulas
      3. Converting microarray or other bioinformatics data to graphs
   D. Databases
      1. Searching and retrieval in major online databases
      2. Organizing lab data for local storage and retrieval
      3. Organizing lab data for submission to online databases
   E. Web page development and searching
      1. Effective scientific web page design
      2. Uploading web pages to a server
      3. Using internet search engines to locate scientific information
      4. Evaluating integrity of websites
   F. Electronic mail
      1. Effective use of email
      2. Professional etiquette
      3. Attaching files to emails
   G. Privacy, safety, and security
      1. Downloading files - size issues
      2. Privacy and security issues
      3. Virus dangers
   H. Introduction to Unix/Linux
      1. Unix file system commands
      2. Uploading and downloading files
      3. Running local and/or remote bioinformatics programs
      4. Introduction to scripting
   I. Ethics
      1. Copyrights and infringement
      2. Professional responsibility and integrity
      3. Obligation to protect data edit
 
MCCCD Governing Board Approval Date: February 23, 2021

All information published is subject to change without notice. Every effort has been made to ensure the accuracy of information presented, but based on the dynamic nature of the curricular process, course and program information is subject to change in order to reflect the most current information available.