JOURNAL ARTICLE

3D graph contrastive learning for molecular property prediction

Kisung Moon, Hyeon-Jin Im, Sunyoung Kwon

Year: 2023; Journal: Bioinformatics; Vol: 39 (6); Publisher: Oxford University Press

Abstract

Motivation: Self-supervised learning (SSL) learns data representations by exploiting supervision inherent in the data itself. It has drawn attention in drug discovery, where annotated data are scarce because experiments are time-consuming and expensive. SSL on large amounts of unlabeled data has shown excellent performance for molecular property prediction, but several issues remain. (i) Existing SSL models are large-scale, which limits their use where computing resources are insufficient. (ii) Most current models use no 3D structural information for molecular representation learning, or use it only partially, even though the activity of a drug is closely related to the structure of the drug molecule. (iii) Previous models that apply contrastive learning to molecules use augmentations that permute atoms and bonds, so molecules with different characteristics can end up as positive samples of one another. To address these problems, we propose a novel small-scale contrastive learning framework, 3D Graph Contrastive Learning (3DGCL), for molecular property prediction.

Results: 3DGCL learns molecular representations that reflect the molecule's structure through a pretraining process that does not change the semantics of the drug. Using only 1128 samples as pretraining data and 0.5 million model parameters, we achieved state-of-the-art or comparable performance on six benchmark datasets. Extensive experiments demonstrate that 3D structural information based on chemical knowledge is essential to molecular representation learning for property prediction.

Availability and implementation: Data and code are available at https://github.com/moonkisung/3DGCL.
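The contrastive idea in the abstract can be illustrated with a small sketch. The abstract does not state which loss 3DGCL uses, so the code below assumes a generic NT-Xent (normalized temperature-scaled cross-entropy) objective, a common choice in contrastive learning. The function names, the temperature value, and the toy 2D embeddings are all hypothetical and are not taken from the paper or its repository. Each molecule contributes two views that form a positive pair; every other embedding in the batch acts as a negative.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nt_xent_loss(view_a, view_b, temperature=0.5):
    """Generic NT-Xent loss (an assumption, not the paper's stated loss).

    view_a[i] and view_b[i] are embeddings of two views of molecule i.
    """
    n = len(view_a)
    embeddings = view_a + view_b  # 2n embeddings in one batch
    total = 0.0
    for i in range(2 * n):
        j = (i + n) % (2 * n)  # index of the other view of the same molecule
        # Similarities to every other embedding, including the positive one.
        logits = [
            cosine(embeddings[i], embeddings[k]) / temperature
            for k in range(2 * n)
            if k != i
        ]
        positive = cosine(embeddings[i], embeddings[j]) / temperature
        # Cross-entropy with the positive pair as the correct "class".
        total += math.log(sum(math.exp(x) for x in logits)) - positive
    return total / (2 * n)


# Toy embeddings (hypothetical values) for two molecules.
anchors = [[1.0, 0.0], [0.0, 1.0]]
# Views that stay close to their own molecule: semantics preserved.
consistent_views = [[0.9, 0.1], [0.1, 0.9]]
# Views that resemble the other molecule: a faulty positive pair.
inconsistent_views = [[0.1, 0.9], [0.9, 0.1]]

loss_consistent = nt_xent_loss(anchors, consistent_views)
loss_inconsistent = nt_xent_loss(anchors, inconsistent_views)
print(loss_consistent < loss_inconsistent)
```

In this toy setting the loss is lower when each positive pair really represents the same molecule. That is the reason an augmentation that changes a molecule's characteristics, as described in issue (iii), gives the model a misleading training signal.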

Keywords:
Computer science; Property (philosophy); Representation (politics); Graph; Machine learning; Artificial intelligence; Training set; Feature learning; Molecular graph; Natural language processing; Theoretical computer science

Metrics

Cited By: 23
FWCI (Field Weighted Citation Impact): 7.11
Refs: 37
Citation Normalized Percentile: 0.96

Topics

Computational Drug Discovery Methods
Physical Sciences → Computer Science → Computational Theory and Mathematics
Machine Learning in Materials Science
Physical Sciences → Materials Science → Materials Chemistry
Advanced Graph Neural Networks
Physical Sciences → Computer Science → Artificial Intelligence
