Genome 373: Genomic Informatics

Instructors:
   Elhanan Borenstein, elbo [ a t ] @uw.edu
   Douglas Fowler, dfowler [ a t ] uw.edu

Teaching Assistant:
   Hannah Pliner, hpliner [ a t ] uw.edu

Schedule: MWF, 1:30PM-2:20PM, Foege S110.

Links:

This course is intended to introduce students to the breadth of problems and methods in computational analysis of genomes, arguably the single most important new area in biological research. The specific subjects will include large-scale comparative genome structure, sequence alignment and search methods, gene prediction, evolutionary relationships among genes, and next-generation sequencing. The course will include one mid-term exam and a final exam. Other graded assignments will be problem sets, due most weeks. Grades: 50% home assignments, 20% midterm, 30% final exam.

News:

» Midterm: Wednesday, April 27, 2018. *** Note, you are allowed to bring a two-sided cheat sheet ***


Lectures and Resources:
(Note: Links to resources will become live as the course progresses)

WeekDatesTopicsLecturesQuiz section
1 (Borenstein)Mar. 26, 28, 30 Welcome; syllabus; Intro to bioinformatics; Sequence alignment; Dynamic programming; Global alignment; Lecture 1, Lecture 2, Lecture 3 Slides
2 (Borenstein)Apr. 2, 4, 6 Local alignment; Score matrices; Lecture 1, Lecture 2 Slides
3 (Borenstein)Apr. 9, 11, 13 Trees; Distance trees; UPGMA; NJ; Parsimony (small and large); Search heuristics; Lecture 1, Lecture 2, Lecture 3, Slides
4 (Borenstein)Apr. 16, 18, 20 Clustering algorithms; Hierarchical clustering; K-mean clustering; Biological networks; Lecture 1, Lecture 2, Lecture 3, Slides
5 (Borenstein)Apr. 23, 25, 27 Dijkstra's algorithm; Network motifs; Midterm; Lecture 1, Lecture 2 Slides
6 (Fowler)Apr. 30, May 2, 4 Gene finding with Hidden Markov Models Lecture 1, Lecture 2, Lecture 3 Slides
7 (Fowler)May 7, 9, 11 Intro to machine learning, decision trees, variant effect prediction Lecture 1, Lecture 2, Lecture 3 Slides
8 (Fowler)May 14, 16, 18 High throughput DNA sequencing Lecture 1, Lecture 2, Lecture 3 Slides
9 (Fowler)May 21, 23, 25 Contemporary sequence alignment Lecture 1, Lecture 2, Lecture 3 Slides
10 (Fowler)May 28, 30, Jun. 1 Genome assembly and wrap up Lecture 1, Lecture 2, Lecture 3 Slides

References:

Electronic access to journals is generally free from on-campus computers. For off-campus access, follow the "[offcampus]" links or look at the library "proxy server" instructions.

  1. Noble, WS, "A quick guide to organizing computational biology projects." PLoS Comput. Biol. 5 (2009) e1000424. Pmid: 19649301 [Offcampus]
  2. Dudley, JT and Butte, AJ, "A quick guide for developing effective bioinformatics programming skills." PLoS Comput. Biol. 5 (2009) e1000589. Pmid: 20041221 [Offcampus]
  3. How dictionaries work (aka hash tables or hash maps)
  4. Subramanian et al., "Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles"PNAS 102(43) (2005)

Python Resources:

   General
Regular Expressions
"RegExPal" (For Javascript rather than Python, but similar and quite handy. Try it!)
Biopython
Python Books
Python for Software Design: How to Think Like a Computer Scientist by Allen B. Downey. (Includes early drafts of our text book; cheaper than the published version, but less polished...)
Learning Python by Mark Lutz. O'Reilly (Very comprehensive. Much is accessible to beginners.)

Bioinformatics Books

» Biological sequence analysis: probabilistic models of proteins and nucleic acids, R. Durbin, S. Eddy, A. Krogh, and G. Mitchison, Cambridge. (Excellent reference, classics)
» Inferring Phylogenies, Joseph Felsenstein, Sinauer, 2004. (Excellent reference on this topic.)
» Introduction to Computational Genomics: A Case Studies Approach, Cristianini, Nello & Hahn, Matthew, Cambridge, 2007.
» An Introduction to Bioinformatics Algorithms, Neil C. Jones & Pavel A. Pevzner, 2004.
» Bioinformatics: Sequence and Genome Analysis, David W. Mount, Cold Spring Harbor Laboratory Press.
» Python for Bioinformatics, Sebastian Bassi, CRC Press, 2010. (A little too advanced as a progamming book for beginners, but fine now that you're experienced.)
» Python for Bioinformatics, Jason Kinser, Jones and Bartlett, 2009. (Ditto.)



Elhanan Borenstein
Department of Genome Sciences
University of Washington
Douglas Fowler
Departments of Genome Sciences
University of Washington