I am interested in high-throughput bioinformatics, especially short-read analysis. I am also broadly interested in genome sequencing, annotation, and comparative genomics.

My advisors are Steven Salzberg and Lior Pachter

Interests

Short-read sequencing technologies produce a huge volume of data. Many of the researchers interested in analyzing that data do not have large computing budgets. I am interested in building short-read analysis software that runs on common workstations.

Ben Langmead, Mihai Pop, Steven Salzberg and I recently released Bowtie, an ultrafast short read mapping program. Bowtie can map around 25 million reads to the human genome per CPU hour. Bowtie is meant to be used with whole-genome resequencing projects as well as new short-read applications like RNA-Seq, ChIP-Seq, and others.

My current research focus is improving Bowtie and building tools on top of it to help analyze RNA-Seq and ChIP-Seq datasets. TopHat is the first such tool. With TopHat, you can identify novel splice junctions from your RNA-Seq data, without needing any annotations on the reference genome. From the spliced read alignments produced by TopHat, you can assemble full length transcripts and estimate their abundances using Cufflinks.


Publications

High-throughput bioinformatics

Genomics

Former interests - Artificial Immune Systems


Presentations


Software