JOURNAL ARTICLE

Multiresolution graph transformers and wavelet positional encoding for learning long-range and hierarchical structures

Nhat Khang NgoTruong Son HyRisi Kondor

Year: 2023 Journal:   The Journal of Chemical Physics Vol: 159 (3)   Publisher: American Institute of Physics

Abstract

Contemporary graph learning algorithms are not well-suited for large molecules since they do not consider the hierarchical interactions among the atoms, which are essential to determining the molecular properties of macromolecules. In this work, we propose Multiresolution Graph Transformers (MGT), the first graph transformer architecture that can learn to represent large molecules at multiple scales. MGT can learn to produce representations for the atoms and group them into meaningful functional groups or repeating units. We also introduce Wavelet Positional Encoding (WavePE), a new positional encoding method that can guarantee localization in both spectral and spatial domains. Our proposed model achieves competitive results on three macromolecule datasets consisting of polymers, peptides, and protein-ligand complexes, along with one drug-like molecule dataset. Significantly, our model outperforms other state-of-the-art methods and achieves chemical accuracy in estimating molecular properties (e.g., highest occupied molecular orbital, lowest unoccupied molecular orbital, and their gap) calculated by Density Functional Theory in the polymers dataset. Furthermore, the visualizations, including clustering results on macromolecules and low-dimensional spaces of their representations, demonstrate the capability of our methodology in learning to represent long-range and hierarchical structures. Our PyTorch implementation is publicly available at https://github.com/HySonLab/Multires-Graph-Transformer.

Keywords:
Computer science Transformer Macromolecule Cluster analysis Graph Hierarchical clustering Artificial intelligence Theoretical computer science Pattern recognition (psychology) Biological system Algorithm Chemistry Physics Biology

Metrics

13
Cited By
1.74
FWCI (Field Weighted Citation Impact)
66
Refs
0.78
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning in Materials Science
Physical Sciences →  Materials Science →  Materials Chemistry
Computational Drug Discovery Methods
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Protein Structure and Dynamics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
© 2026 ScienceGate Book Chapters — All rights reserved.