scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences

Sumeer Ahmad Khan, Robert Lehmann, Xabier Martinez-De-Morentin, Alberto Maillo, Vincenzo Lagani, Narsis A. Kiani, David Gomez-Cabrero, Jesper Tegner*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Recent progress in Single-Cell Genomics has produced different library protocols and techniques for molecular profiling. We formulate a unifying, data-driven, integrative, and predictive methodology for different libraries, samples, and paired-unpaired data modalities. Our design of scAEGAN includes an autoencoder (AE) network integrated with adversarial learning by a cycleGAN (cGAN) network. The AE learns a low-dimensional embedding of each condition, whereas the cGAN learns a non-linear mapping between the AE representations. We evaluate scAEGAN using simulated data and real scRNA-seq datasets, different library preparations (Fluidigm C1, CelSeq, CelSeq2, SmartSeq), and several data modalities as paired scRNA-seq and scATAC-seq. The scAEGAN outperforms Seurat3 in library integration, is more robust against data sparsity, and beats Seurat 4 in integrating paired data from the same cell. Furthermore, in predicting one data modality from another, scAEGAN outperforms Babel. We conclude that scAEGAN surpasses current state-of-the-art methods and unifies integration and prediction challenges.

Original languageEnglish (US)
Article numbere0281315
JournalPloS one
Volume18
Issue number2 February
DOIs
StatePublished - Feb 2023

Bibliographical note

Funding Information:
This work was supported by the King Abdullah University of Science and Technology. The funders had no role in study design, data collection and analysis decision to publish, or preparation of the manuscript.

Publisher Copyright:
© 2023 Khan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

ASJC Scopus subject areas

  • General

Fingerprint

Dive into the research topics of 'scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences'. Together they form a unique fingerprint.

Cite this