Autoregressive Articulatory WaveNet Flow for Speaker-Independent Acoustic-to-Articulatory Inversion

Narjes Bozorg; Michael T. Johnson; Mohammad Soleymanpour

doi:10.1109/sped53181.2021.9587350

ScienceGate Book Chapters

JOURNAL ARTICLE

Autoregressive Articulatory WaveNet Flow for Speaker-Independent Acoustic-to-Articulatory Inversion

Narjes Bozorg Michael T. Johnson Mohammad Soleymanpour

Year: 2021 Vol: abs 1609 3499 Pages: 156-161

DOI: 10.1109/sped53181.2021.9587350

Get Full-Text PDF Get Analytical Report

Abstract

In this paper we introduce a new speaker independent method for Acoustic-to-Articulatory Inversion. The proposed architecture, Speaker Independent-Articulatory WaveNet (SI-AWN), models the relationship between acoustic and articulatory features by conditioning the articulatory trajectories on acoustic features and then utilizes the structure for unseen target speakers. We evaluate the proposed SI-AWN on the Electro Magnetic Articulography corpus of Mandarin Accented English (EMA-MAE), using the pool of acoustic-articulatory information from 35 reference speakers and testing on target speakers that include male, female, native and non-native speakers. The results suggest that SI-AWN improves the performance of the acoustic-to-articulatory inversion process compared to the baseline Maximum Likelihood Regression-Parallel Reference Speaker Weighting (MLLR-PRSW) method by 21 percent. To the best of our knowledge, this is the first application of a WaveNet-like synthesis approach to the problem of Speaker Independent Acoustic-to-Articulatory Inversion, and results are comparable to or better than the best currently published systems.

Keywords:

Speech recognition Computer science Inversion (geology) Autoregressive model Weighting Mandarin Chinese Acoustics Mathematics Linguistics Statistics Geology

Metrics

Cited By

0.14

FWCI (Field Weighted Citation Impact)

Refs

0.56

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Autoregressive Articulatory WaveNet Flow for Speaker-Independent Acoustic-to-Articulatory Inversion

Abstract

Metrics

Citation History

Topics

Related Documents

Acoustic-to-Articulatory Inversion with Deep Autoregressive Articulatory-WaveNet

Speaker-Independent Acoustic-to-Articulatory Speech Inversion

Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion

Reference speaker selection for kinematic-independent acoustic-to-articulatory-inversion

An Investigation on Speaker Specific Articulatory Synthesis with Speaker Independent Articulatory Inversion