Abstract
Understanding regulatory mechanisms of protein synthesis in eukaryotes is essential for the accurate annotation of genome sequences. Kozak reported that the nucleotide sequence GCCGCC(A/G)CCAUGG (AUG is the initiation codon) was frequently observed in vertebrate genes and that this 'consensus' sequence enhanced translation initiation. However, later studies using invertebrate, fungal and plant genes reported different 'consensus' sequences. In this study, we conducted extensive comparative analyses of nucleotide sequences around the initiation codon by using genomic data from 47 eukaryote species including animals, fungi, plants and protists. The analyses revealed that preferred nucleotide sequences are quite diverse among different species, but differences between patterns of nucleotide bias roughly reflect the evolutionary relationships of the species. We also found strong biases of A/G at position -3, A/C at position -2 and C at position +5 that were commonly observed in all species examined. Genes with higher expression levels showed stronger signals, suggesting that these nucleotides are responsible for the regulation of translation initiation. The diversity of preferred nucleotide sequences around the initiation codon might be explained by differences in relative contributions from two distinct patterns, GCCGCCAUG and AAAAAAAUG, which implies the presence of multiple molecular mechanisms for controlling translation initiation.
Original language | English (US) |
---|---|
Pages (from-to) | 861-871 |
Number of pages | 11 |
Journal | NUCLEIC ACIDS RESEARCH |
Volume | 36 |
Issue number | 3 |
DOIs | |
State | Published - Feb 2008 |
Externally published | Yes |
Bibliographical note
Funding Information:We would like to thank Tadashi Imanishi, Motohiko Tanino, Kaoru Mogushi, Takeshi Fukuhara, Emilio Campos, Takeshi Hase, Yutaka Fukuoka, Tadashi Masuda, Soichi Ogishima, and Fengrong Ren for their helpful comments and discussion. Funding for this work was provided by the Ministry of Education, Culture, Sports, Science and Technology of Japan, the Genome Information Integration Project of the Ministry of Economy, Trade and Industry of Japan, and the Japan Biological Informatics Consortium (17710162 to Y.N.). Funding to pay the Open Access publication charges for this article was provided by Tokyo Medical and Dental University.
ASJC Scopus subject areas
- Genetics