Multiple reference genomes and transcriptomes for Arabidopsis thaliana

Xiangchao Gan, Oliver Stegle, Jonas Behr, Joshua G. Steffen, Philipp Drewe, Katie L. Hildebrand, Rune Lyngsoe, Sebastian J. Schultheiss, Edward J. Osborne, Vipin T. Sreedharan, André Kahles, Regina Bohnert, Géraldine Jean, Paul Derwent, Paul Kersey, Eric J. Belfield, Nicholas P. Harberd, Eric Kemen, Christopher Toomajian, Paula X. KoverRichard M. Clark, Gunnar Rätsch, Richard Mott

Research output: Contribution to journalArticlepeer-review

491 Scopus citations


Genetic differences between Arabidopsis thaliana accessions underlie the plants extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions. ©2011 Macmillan Publishers Limited. All rights reserved.
Original languageEnglish (US)
Pages (from-to)419-423
Number of pages5
Issue number7365
StatePublished - Aug 28 2011
Externally publishedYes

Bibliographical note

KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): KUK-I1-002-03
Acknowledgements: We thank C. Jiang and C. Brown for technical help; P. Dikjwel, K. Schneeberger and D. Weigel for providing sequence data in advance of publication; and J. Flint, M. Tsiantis, E. Feil, L. Hurst, A. Wachter and D. Weigel for comments on the manuscript. Funding was provided by the Biotechnology and Biological Sciences Research Council (BBSRC) BB/F022697/1 (to R. M. and P. X. K.), BB/D016029/2 (to P. X. K and R. M.), BB/F020759/1 (to N.P.H.) and BB/F019793/1 to P. K. (PI E. Birney), by core funding of the Max Planck Society (to G. R.), the German Research Foundation RA1894/1-1 and RA1894/2-1 (to G. R.), Volkswagen Stiftung (to O.S.), the National Science Foundation (NSF) 0929262 (to R. C. and C. T.) and 0820985 (to R. C., Principal Investigator L. Sieburth), and award no. KUK-I1-002-03 (to N.P.H), made by King Abdullah University of Science and Technology (KAUST). D. Buck, I. Ragoussis and colleagues in the Wellcome Trust Centre for Human Genetics Genomics Core performed most of the genomic sequencing, supported by the Wellcome Trust Core grant 090532/Z/09/Z. Contribution no. 11-388-J from the Kansas Agricultural Experiment Station.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.


Dive into the research topics of 'Multiple reference genomes and transcriptomes for Arabidopsis thaliana'. Together they form a unique fingerprint.

Cite this