High-quality draft assemblies of mammalian genomes from massively parallel sequence data
- PMID: 21187386
- PMCID: PMC3029755
- DOI: 10.1073/pnas.1017351108
High-quality draft assemblies of mammalian genomes from massively parallel sequence data
Abstract (V体育官网)
Massively parallel DNA sequencing technologies are revolutionizing genomics by making it possible to generate billions of relatively short (~100-base) sequence reads at very low cost. Whereas such data can be readily used for a wide range of biomedical applications, it has proven difficult to use them to generate high-quality de novo genome assemblies of large, repeat-rich vertebrate genomes. To date, the genome assemblies generated from such data have fallen far short of those obtained with the older (but much more expensive) capillary-based sequencing approach. Here, we report the development of an algorithm for genome assembly, ALLPATHS-LG, and its application to massively parallel DNA sequence data from the human and mouse genomes, generated on the Illumina platform. The resulting draft genome assemblies have good accuracy, short-range contiguity, long-range connectivity, and coverage of the genome. In particular, the base accuracy is high (≥99. 95%) and the scaffold sizes (N50 size = 11. 5 Mb for human and 7. 2 Mb for mouse) approach those obtained with capillary-based sequencing. The combination of improved sequencing technology and improved computational methods should now make it possible to increase dramatically the de novo sequencing of large genomes. The ALLPATHS-LG program is available at http://www VSports手机版. broadinstitute. org/science/programs/genome-biology/crd. .
Conflict of interest statement
The authors declare no conflict of interest.
References
- 
    - International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature. 2004;431:931–945. - PubMed
 
- 
    - Church DM, et al. Mouse Genome Sequencing Consortium. Lineage-specific biology revealed by a finished genome assembly of the mouse. PLoS Biol. 2009;7:e1000112. - PMC (V体育安卓版) - PubMed
 
- 
    - Waterston RH, et al. Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. - "VSports注册入口" PubMed
 
- 
    - Lindblad-Toh K, et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature. 2005;438:803–819. - "VSports在线直播" PubMed
 
- 
    - Warren WC, et al. Genome analysis of the platypus reveals unique signatures of evolution. Nature. 2008;453:175–183. - "VSports注册入口" PMC - PubMed
 
Publication types
- "V体育官网" Actions
MeSH terms
- VSports在线直播 - Actions
- "VSports" Actions
- Actions (V体育ios版)
- "VSports最新版本" Actions
- Actions (V体育2025版)
- "VSports app下载" Actions
- Actions (VSports手机版)
Grants and funding
LinkOut - more resources
- Full Text Sources
- Other Literature Sources
- Molecular Biology Databases
- Research Materials
- Miscellaneous
 
        