Page GenomeComparisonP4

This is the project page of the Genome-Comparison group.

Students

Mail an alle Gruppenmitglieder: AA2010SS-GenComp bei lists.spline.de

NameSorted descending email
Svenja Specovius specoviu@mi.fu-berlin.de
Sabrina Krakau krakau@mi.fu-berlin.de
Max Homilius homilius@mi.fu-berlin.de
John Wiedenhoeft johnwie@mi.fu-berlin.de
Ivan Kel kel@mi.fu-berlin.de
Felix Heeger fheeger@mi.fu-berlin.de

Literature

Initial reading (this should be read by all)

You can start by reading a recent review review article about genome comparison.

For the core lecture we will deal with the representation of large (kollinear) alignments as graphs. For this refer to

Further reading

For the other two lecture topics you migh find the further reading helpful:

The project

Slides

Joint presentation of the projects. Please see svn for updated versions of the lecture slides.

Exercises

exercise on colinear alignment for Wednesday 12th May

corrected pseudo code of Segment Match Refinement Algorithm

exercise on assembly comparison (OSLay)

Discussion Forum

Please use this space to interface between each other and with the instructors

Proposal outline

  • Colinear Alignment
    • overview
    • match refinment

  • Non-Colinear Alignment

  • Assembly Layout and Comparison

Analysis/Programming Projects

Reannotation of the Genome of Carsonella Ruddii (Ivan Kel).

C.Ruddii is a g-proteobacteria with the smallest genome known so far (approx 214 genes), as well as a very small CG content of 16%. The first complete sequencing and annotation has been done 2006. The goal of this project is the reannotation of the genome using MSA methods and a comparison of the results with the existing annotation. (original paper) and (Moya et al. paper from 2007) Here you can find the details.

Glocal Alignment: SLAGAN and SuperMap (Felix Heeger & Svenja Specovius) (Partners in Instructor team: Manuel Holtgrewe, Birte Kehr)

The first idea of this project was to reimplement the SuperMap-Algorithm presented in the Dubchak paper using SeqAn. Since this would be a lot of work for 60 hours and it needs the SLAGAN-Algorithm, we would like to try and re-implement the SLAGAN-Algorithm and after that, depending on how much time is left, continue to use this for SuperMap. Here you find the details.

Studying sex determination systems using SuperMap (John Wiedenhoeft)

I would like to study the evolutionary relationships of chromosomal sex-determination systems (see http://dx.doi.org/10.1016/j.cub.2006.08.021) using SuperMap. Especially the transitions between systems (XY <-> WZ) would be interesting, since gene mapping suggests that Z in one snake species is not homologous to bird-Z-chromosomes. Here you find the details.

Assembly comparison (Sabrina Krakau) (David Weese and Knut Reinert)

In this project we assemble a set of reads with two different assemblers,compute a layout of contigs and compare the resulting linear sequences with genome comparison programs. Here you find the details.

Non-collinear alignments of viral genomes (Max Homilius)

I would like to study genome rearrangements, horizontal gene transfer etc. in viruses using non-collinear alignments programs. This might be useful to infer phylogenies, and to depict relations. Here will be more details soon.

Possible projects

Please note, that if you are interested in one of those we still have to work out the details. These are some ideas me Manuel and Birte had.

  1. Local alignments Local alignments are often input for genome comparison algorithms. Different local aligners can be used to compute local matches between a number of comparable genomes (e.g. Drosophila chromosomes and shorter sequences, e.g. Adenoviruses). Local aligners are for example BLAT, BlastZ, Blast, Local-Swift, lalign, chaos. In the project we will vary the parameters to obtain local matches of different length and quality. We report on the run time, coverage of each genome, and statistics regarding the length and error rates of the local matches.
  2. Analysis of precomputed alignments Download precomputed alignments from VISTA, UCSC and the Ensemble Genome Browser and analyze key statistics (length, number of identical positions, indel length, etc.
  3. Test the ABA program Use different inputs from project A) and run ABA with it. Report on the differences in output using different local alignments as input.
  4. Evolutionary conservation Download genomes of a number of species with well known evolutionary tree. Use different, suitable genome aligners to compute distances between the genomes as well as gencompress (Chen, Kwong, Li). Construct phylogenetic trees from those differences and report the results.
  5. Repeats in genomes Genomes contain many repeats. Tandem repeats are two or more contiguous, approximate copies of the same repetitive sequence. There are several programs available to compute tandem repeats. Compare their results on a human chromosome.
  6. Assembly comparison Download the read from a fairly well finished genome and assemble it using two or more standard assemblers. Compare the results using layout software and genome comparison programs.
  7. Difference between collinear and non-kollinear aligners Take two well studied genome sequences and compare them with a collinear aligner and a non-collinear aligner. How much difference do you see? (use similar statistics as in 1. and 2.

 

This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback