JOURNAL ARTICLE

Masked Autoencoders are Articulatory Learners

Abstract

Articulatory recordings track the positions and motion of different articulators along the vocal tract and are widely used to study speech production and to develop speech technologies such as articulatory based speech synthesizers and speech inversion systems. The University of Wisconsin X-Ray microbeam (XRMB) dataset is one of various datasets that provide articulatory recordings synced with audio recordings. The XRMB articulatory recordings employ pellets placed on a number of articulators which can be tracked by the microbeam. However, a significant portion of the articulatory recordings are mistracked, and have been so far unsuable. In this work, we present a deep learning based approach using Masked Autoencoders to accurately reconstruct the mistracked articulatory recordings for 41 out of 47 speakers of the XRMB dataset. Our model is able to reconstruct articulatory trajectories that closely match ground truth, even when three out of eight articulators are mistracked, and retrieve 3.28 out of 3.4 hours of previously unusable recordings.

Keywords:
Vocal tract Computer science Speech recognition Speech production Artificial intelligence Speech processing Ground truth

Metrics

5
Cited By
1.34
FWCI (Field Weighted Citation Impact)
6
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Adversarial Masked Autoencoders Are Robust Vision Learners

Yuchong YaoNandakishor DesaiMarimuthu Palaniswami

Journal:   IEEE Transactions on Artificial Intelligence Year: 2024 Vol: 6 (4)Pages: 805-815
JOURNAL ARTICLE

Masked Autoencoders as Single Object Tracking Learners

Chunjuan BoXin ChenJunxing Zhang

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2024 Vol: 80 (1)Pages: 1105-1122
© 2026 ScienceGate Book Chapters — All rights reserved.