JOURNAL ARTICLE

Learning from multi-omics data of cancer

Zhang, Xiaoyu

Year: 2022
Journal: Spiral (Imperial College London)
Publisher: Imperial College London

Abstract

Analysing multiple types of omics data is a cornerstone of modern biomedical research. With significant advances in high-throughput experimental technologies, an enormous amount of omics data with extremely high dimensionality is generated every day at an unprecedented speed, leaving a vast store of data waiting to be mined. However, traditional bioinformatics methods struggle to cope with this dimensionality and volume. The rapid development of machine learning, especially deep learning, has revolutionised fields such as natural language processing and computer vision over the past decade. Deep learning has shown great success in decoding high-dimensional data such as images, which makes it a promising technology to apply to the analysis of multi-omics data. In this thesis, we propose OmiSuite, a comprehensive toolbox for deep learning-based multi-omics data analysis, to decode high-dimensional multi-omics data and unveil the correlation between the phenotype profile and different types of omics profiles. OmiSuite comprises four components: OmiVAE, OmiEmbed, XOmiVAE, and OmiTrans. OmiVAE is one of the first endeavours to decode high-dimensional multi-omics data using variational autoencoders for pan-cancer classification. OmiEmbed is a unified multi-task deep learning framework for multi-omics data, supporting multi-omics integration, dimensionality reduction, omics embedding, tumour type classification, phenotypic feature reconstruction, survival prediction, and multi-task learning across these tasks. XOmiVAE is the explainable extension of OmiVAE, which provides the contribution score of each molecular feature and each latent dimension for each phenotype prediction. OmiTrans is the first generative adversarial network-based omics-to-omics translation framework, which opens up a new and promising research topic.
Together, these four components of OmiSuite form a unified ecosystem for high-dimensional multi-omics analysis that covers almost every aspect of the field, has benefited follow-up studies, and has led to an upsurge of research in this area.

Keywords:
Toolbox, Deep learning, Dimensionality reduction, Feature selection, Curse of dimensionality, Feature (linguistics), Data type, Dimension (graph theory)

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.12
Refs: 0
Citation Normalized Percentile: 0.45

Topics

Bioinformatics and Genomic Networks
Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology
Single-cell and spatial transcriptomics
Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology
Machine Learning in Bioinformatics
Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology