Recovering punctuation marks for automatic speech recognition

Fernando Batista; Diamantino Caseiro; Nuno Mamede; Isabel Trancoso

doi:10.21437/interspeech.2007-581

JOURNAL ARTICLE

Year: 2007 Pages: 2153-2156

Abstract

This paper presents results on recovering punctuation in speech transcriptions of a Portuguese broadcast news corpus. The approach is based on maximum entropy models and uses word, part-of-speech, time, and speaker information. The contribution of each feature type is analyzed individually. Separate results are given for each focus condition, making it possible to analyze the differences in performance between planned and spontaneous speech.

Index Terms: rich transcription, punctuation recovery, sentence boundary detection, maximum entropy.
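As a rough illustration of the kind of model the abstract describes, the sketch below trains a small maximum entropy classifier (equivalently, multinomial logistic regression) that predicts the punctuation mark following each word. It is not the authors' implementation: the feature templates, the pause thresholds, the label set, the training settings, and the English toy tokens are all assumptions made for this example, and pause duration stands in for the "time" information mentioned in the abstract.

```python
import math
from collections import defaultdict

# Hypothetical label set: the punctuation mark (if any) that follows a word.
LABELS = ["none", "comma", "period"]

def features(tokens, i):
    """Binary features for the position after tokens[i].
    Each token is (word, pos_tag, pause_after_seconds, speaker_id)."""
    word, pos, pause, speaker = tokens[i]
    feats = ["bias", "w=" + word, "pos=" + pos]
    if i + 1 < len(tokens):
        next_word, next_pos, _, next_speaker = tokens[i + 1]
        feats += ["w+1=" + next_word, "pos+1=" + next_pos]
        if next_speaker != speaker:
            feats.append("speaker_change")
    else:
        feats.append("end_of_segment")
    # Assumed thresholds for bucketing the pause that follows the word.
    if pause >= 0.5:
        feats.append("pause=long")
    elif pause >= 0.15:
        feats.append("pause=short")
    else:
        feats.append("pause=none")
    return feats

def predict_proba(weights, feats):
    """Softmax over the summed weights of the active (feature, label) pairs."""
    scores = [sum(weights.get((f, y), 0.0) for f in feats) for y in LABELS]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def predict(weights, feats):
    probs = predict_proba(weights, feats)
    return LABELS[probs.index(max(probs))]

def train(examples, epochs=200, lr=0.2, l2=0.01):
    """Stochastic gradient ascent on the L2-regularized conditional log-likelihood."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for feats, gold_label in examples:
            probs = predict_proba(weights, feats)
            for y, p in zip(LABELS, probs):
                grad = (1.0 if y == gold_label else 0.0) - p
                for f in feats:
                    weights[(f, y)] += lr * (grad - l2 * weights[(f, y)])
    return weights

# Invented toy data for illustration only; not taken from the paper's corpus.
tokens = [
    ("good", "ADJ", 0.05, "A"), ("evening", "N", 0.60, "A"),
    ("today", "ADV", 0.20, "A"), ("the", "DET", 0.00, "A"),
    ("government", "N", 0.02, "A"), ("met", "V", 0.70, "A"),
    ("thank", "V", 0.00, "B"), ("you", "PRON", 0.80, "B"),
]
gold = ["none", "period", "comma", "none", "none", "period", "none", "period"]

examples = [(features(tokens, i), gold[i]) for i in range(len(tokens))]
weights = train(examples)
for (word, _, _, _), (feats, _) in zip(tokens, examples):
    print(word, predict(weights, feats))
```

Dropping one feature group from `features` (for example the pause buckets) and retraining is one simple way to probe the contribution of each feature type, in the spirit of the per-feature analysis the abstract mentions.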

Keywords:

Punctuation; Computer science; Speech recognition; Principle of maximum entropy; Sentence; Natural language processing; Focus (optics); Transcription (linguistics); Artificial intelligence; Feature (linguistics); Part of speech; Entropy (arrow of time); Portuguese; Linguistics

Metrics

FWCI (Field Weighted Citation Impact): 3.88

Citation Normalized Percentile: 0.93

Topics

Speech and dialogue systems (Physical Sciences → Computer Science → Artificial Intelligence)

Speech Recognition and Synthesis (Physical Sciences → Computer Science → Artificial Intelligence)

Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)

Related Documents

Recovering capitalization and punctuation marks for automatic speech recognition: Case study for Portuguese broadcast news

Speech recognition with automatic punctuation

PUNCTUATION MARKS IN SPEECH: CONSTRUCTIONALIZATIONS

Transformer-Based Punctuation Restoration for Automatic Speech Recognition Systems

Automatic punctuation generation for speech