JOURNAL ARTICLE

Transformer-Based Turkish Automatic Speech Recognition

D. Emre Taşar, Kutan Koruyan, Cihan Çılgın

Year: 2024 Journal: Acta Infologica Vol: 0 (0) Pages: 0-0 Publisher: Istanbul University

Abstract

Businesses increasingly use Automatic Speech Recognition (ASR) technology to improve efficiency and productivity across many business functions. The spread of online meetings in remote working and learning environments after the COVID-19 pandemic has further increased the use of speech recognition systems and underlined their importance. While languages such as English, Spanish, and French have large amounts of labeled data, very little labeled data exists for Turkish, which directly reduces the accuracy of Turkish ASR systems. This study therefore exploits unlabeled audio data by learning general speech representations through self-supervised, end-to-end modeling. A transformer-based machine learning model, with performance improved through transfer learning, is used to convert speech recordings to text. The model adopted in the study is the Wav2Vec 2.0 architecture, which masks the audio inputs and learns by solving a contrastive task over the masked segments. The XLSR-Wav2Vec 2.0 model was pre-trained on speech data in 53 languages and fine-tuned on the Mozilla Common Voice Turkish data set. According to the empirical results of the study, a word error rate (WER) of 0.23 was achieved on the test set of the same data set.
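
The reported figure can be read as roughly 23 word-level errors per 100 reference words. The following is a minimal sketch of the standard word error rate computation (word-level edit distance divided by the number of reference words), assuming plain whitespace tokenization and no text normalization. The study's exact scoring setup is not given here, and the function name and example sentences are illustrative only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Illustrative example: one of four reference words is dropped.
example_wer = word_error_rate("bugün hava çok güzel", "bugün hava güzel")
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.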

Keywords:
Turkish, Speech recognition, Transformer, Computer science, Natural language processing, Engineering, Electrical engineering, Linguistics, Voltage

Metrics

Cited By: 2
FWCI (Field Weighted Citation Impact): 1.28
Refs: 0
Citation Normalized Percentile: 0.75

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Web Service-Based Turkish Automatic Speech Recognition Platform

Saadin Oyucu, Hüseyin Polat, Hayri Sever

Journal: 2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA) Year: 2020 Vol: 6 Pages: 1-5

JOURNAL ARTICLE

Deep Learning Based Automatic Speech Recognition for Turkish

Burak Tombaloğlu, Hamit Erdem

Journal: Sakarya University Journal of Science Year: 2020 Vol: 24 (4) Pages: 725-739

JOURNAL ARTICLE

Automatic speech recognition with efficient transformer

Shuhan Luo

Year: 2023 Vol: 1412 Pages: 186-186