You are here: ABI » ThesesHome » LastImplementation

Implementation of LAST with gapped seeds and applications to read mapping and local alignment

Area

Substring Indices, read mapping, local alignment

Topic

The goal of this thesis is to reimplement and extend parts of LAST [1], which in essence uses a string index to adaptively search for sequence seeds that do not occur too often. The thesis can be split into 5 larger parts.

  • Implementation of a gapped suffix array using the method described in [2] or a similar method.
  • Implementation of LAST [1], i.e. finding adaptive seeds (gapped and ungapped)
  • Using LAST to find seeds in replacement of epsilon-cores for STELLAR. Develop strategies to handle overlapping seeds.
  • Using LAST to find seeds to verify using Myers bitvector algorithm.
  • Evaluating both tools.

Timeline

References

  1. Kiełbasa, Szymon M, Raymond Wan, Kengo Sato, Paul Horton, and Martin C Frith. 2011. “Adaptive Seeds Tame Genomic Sequence Comparison.” Genome Research 21 (3) (March): 487–493. doi:10.1101/gr.113985.110.
  2. Horton, P, and SM Kiełbasa. 2008. “DisLex: a Transformation for Discontiguous Suffix Array Construction.” In.
  3. Kehr, B., David Weese, and Knut Reinert. 2011. “STELLAR: Fast and Exact Local Alignments.” BMC Bioinformatics 12 (Suppl 9): S15.
  4. Siragusa, Enrico, David Weese, and Knut Reinert. 2013. “Fast and Accurate Read Mapping with Approximate Seeds and Multiple Backtracking.” Nucleic Acids Research (January 28). doi:10.1093/nar/gkt005.
This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback