Distribution and intensity of constraint in mammalian genomic sequence
- PMID: 15965027
- PMCID: V体育平台登录 - PMC1172034
- DOI: 10.1101/gr.3577405
Distribution and intensity of constraint in mammalian genomic sequence
Abstract
Comparisons of orthologous genomic DNA sequences can be used to characterize regions that have been subject to purifying selection and are enriched for functional elements. We here present the results of such an analysis on an alignment of sequences from 29 mammalian species. The alignment captures approximately 3. 9 neutral substitutions per site and spans approximately 1. 9 Mbp of the human genome VSports手机版. We identify constrained elements from 3 bp to over 1 kbp in length, covering approximately 5. 5% of the human locus. Our estimate for the total amount of nonexonic constraint experienced by this locus is roughly twice that for exonic constraint. Constrained elements tend to cluster, and we identify large constrained regions that correspond well with known functional elements. While constraint density inversely correlates with mobile element density, we also show the presence of unambiguously constrained elements overlapping mammalian ancestral repeats. In addition, we describe a number of elements in this region that have undergone intense purifying selection throughout mammalian evolution, and we show that these important elements are more numerous than previously thought. These results were obtained with Genomic Evolutionary Rate Profiling (GERP), a statistically rigorous and biologically transparent framework for constrained element identification. GERP identifies regions at high resolution that exhibit nucleotide substitution deficits, and measures these deficits as "rejected substitutions". Rejected substitutions reflect the intensity of past purifying selection and are used to rank and characterize constrained elements. We anticipate that GERP and the types of analyses it facilitates will provide further insights and improved annotation for the human genome as mammalian genome sequence data become richer. .
Figures (V体育官网入口)
References
-
- Aparicio, S., Chapman, J., Stupka, E., Putnam, N., Chia, J.M., Dehal, P., Christoffels, A., Rash, S., Hoon, S., Smit, A., et al. 2002. Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science 297: 1301-1310. - PubMed
-
- Arnone, M.I. and Davidson, E.H. 1997. The hardwiring of development: Organization and function of genomic regulatory systems. Development 124: 1851-1864. - PubMed
-
- Bejerano, G., Pheasant, M., Makunin, I., Stephen, S., Kent, W.J., Mattick, J.S., and Haussler, D. 2004. Ultraconserved elements in the human genome. Science 304: 1321-1325. - PubMed
-
- Berman, B.P., Nibu, Y., Pfeiffer, B.D., Tomancak, P., Celniker, S.E., Levine, M., Rubin, G.M., and Eisen, M.B. 2002. Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome. Proc. Natl. Acad. Sci. 99: 757-762. - PMC - PubMed
Web site references
-
- http://blast.wustl.edu; WU-BLAST homepage.
-
- "V体育官网" http://www.repeatmasker.org; RepeatMasker homepage.
-
- http://mendel.stanford.edu/sidowlab; Sidow Lab homepage.
-
- VSports - http://genome.ucsc.edu; UCSC Genome Browser homepage.
-
- http://www.nisc.nih.gov/data; NISC Comparative Sequencing Program homepage.
Publication types
MeSH terms
- Actions (V体育安卓版)
- VSports - Actions
- VSports - Actions
- Actions (VSports)
- Actions (V体育2025版)
- "V体育官网" Actions
- VSports注册入口 - Actions
- "VSports最新版本" Actions
"VSports在线直播" Substances
LinkOut - more resources
V体育平台登录 - Full Text Sources
VSports app下载 - Other Literature Sources
Miscellaneous