Abstract
Colorectal cancer (CRC) ranks as the second leading cause of cancer deaths globally. In recent years, short-read single-cell RNA sequencing (scRNA-seq) has been instrumental in deciphering tumor heterogeneities. However, these studies only enable gene-level quantification but neglect alterations in transcript structures arising from alternative end processing or splicing. In this study, we integrated short- and long-read scRNA-seq of CRC samples to build an isoform-resolution CRC transcriptomic atlas. We identified 394 dysregulated transcript structures in tumor epithelial cells, including 299 resulting from various combinations of splicing events. Second, we characterized genes and isoforms associated with epithelial lineages and subpopulations exhibiting distinct prognoses. Among 31,935 isoforms with novel junctions, 330 were supported by The Cancer Genome Atlas RNA-seq and mass spectrometry data. Finally, we built an algorithm that integrated novel peptides derived from open reading frames of recurrent tumor-specific transcripts with mass spectrometry data and identified recurring neoepitopes that may aid the development of cancer vaccines.
Original language | English (US) |
---|---|
Article number | 100641 |
Journal | Cell Genomics |
Volume | 4 |
Issue number | 9 |
DOIs | |
State | Published - Sep 11 2024 |
Bibliographical note
Publisher Copyright:© 2024 The Author(s)
Keywords
- colorectal cancer
- long-read RNA-seq
- neoantigen
- scRNA-seq
ASJC Scopus subject areas
- Biochemistry, Genetics and Molecular Biology (miscellaneous)
- Genetics