JOURNAL ARTICLE

FAIR data and metadata – The X-omics FAIR Data Cube and its added value for multi-omics researchers

Abstract

The FAIR (Findable, Accessible, Interoperable and Reusable) principles were proposed [1] to guide researchers in describing and sharing their data so as to increase data reuse and research reproducibility. Creating FAIR data can be challenging for multi-omics researchers because of a lack of tooling and a diverse landscape of (meta)data standards that differ across omics types. Linked data structures and graph representations allow semantic queries and open up new possibilities for data analysis. However, large multi-omics data sets cannot easily be converted to such structures. In the Netherlands X-omics Initiative, we develop a FAIR Data Cube (FDCube) [2], a set of tools and services that help researchers in different stages of the Research Data Life Cycle, including creating and describing new data, and finding, understanding and reusing existing FAIR multi-omics data. To facilitate the creation of FAIR multi-omics data and metadata, we collaborate with different initiatives such as the FAIR Genomes project [3]. We adopt and develop metadata schemas for different omics data types, and make use of the Investigation-Study-Assay (ISA) metadata framework [4] to capture experimental metadata. Example workflows to create such metadata are publicly shared [5]. Researchers can find and query multi-omics studies via a FAIR Data Point (FDP) instance [6], which links to public or access-protected data repositories. A set of accompanying tools allows general study metadata to be imported into the FDP and semantic queries to be run on additional metadata on samples, phenotypes, or molecular features represented in an RDF-based knowledge graph. To allow analysis of access-protected data, we further implement a vantage6-based architecture that allows bioinformaticians to send containerised computing requests to access-controlled omics data storage and receive aggregated results.
A prototype FDCube implementation is being developed in collaboration with the Trusted World of Corona (TWOC) [7], in which we use public COVID-19 multi-omics data sources to demonstrate the strength and added value of the FDCube and its FAIR-based methodologies. We invite researchers to discuss their own experiences with us, how the FDCube can facilitate their research, and how X-omics tools can further support them.
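
The access-protected analysis pattern mentioned in the abstract (a computing request travels to the data, and only aggregated results travel back) can be illustrated with a minimal sketch. It is written in plain Python and does not use the vantage6 API; the names `PartialResult`, `node_task`, `aggregate`, `node_a` and `node_b`, as well as the numeric values, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class PartialResult:
    """Aggregate statistics a data node is willing to release (hypothetical)."""
    n: int        # number of local observations
    total: float  # sum of local observations


def node_task(local_values: List[float]) -> PartialResult:
    """Runs next to the protected data and returns only a count and a sum."""
    return PartialResult(n=len(local_values), total=sum(local_values))


def aggregate(partials: List[PartialResult]) -> float:
    """Runs on the researcher's side and combines partial results into a global mean."""
    n = sum(p.n for p in partials)
    if n == 0:
        raise ValueError("no observations reported by any node")
    return sum(p.total for p in partials) / n


# Made-up measurements held by two access-controlled stores (assumption).
node_data: Dict[str, List[float]] = {
    "node_a": [1.0, 2.0, 3.0],
    "node_b": [4.0, 5.0],
}

# Each node computes its partial result locally; raw values never leave the node.
partials = [node_task(values) for values in node_data.values()]
global_mean = aggregate(partials)
print(global_mean)
```

In this sketch the researcher only ever sees the per-node counts and sums, which is the property the abstract refers to as receiving aggregated results. A real deployment would additionally need authentication, containerisation and disclosure checks, none of which are modelled here.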

Keywords:
Metadata; Interoperability; Workflow; Data element; Open data; Metadata repository; Reuse; Linked data; Set (abstract data type); Data discovery

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.44

Topics

Research Data Management Practices
Physical Sciences →  Computer Science →  Information Systems
Scientific Computing and Data Management
Social Sciences →  Decision Sciences →  Information Systems and Management
Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

JOURNAL ARTICLE

Integrating FAIR Experimental Metadata for Multi-omics Data Analysis

Gajendra Doniparthi, Timo Mühlhaus, Stefan Deßloch

Journal: Datenbank-Spektrum, Year: 2024, Vol: 24 (2), Pages: 107-115