JOURNAL ARTICLE

Audio-Visual Speaker Diarization in the Framework of Multi-User Human-Robot Interaction

Abstract

International audience

Keywords:
Speaker diarisation Computer science Speech recognition Robustness (evolution) Artificial intelligence Speaker recognition Task (project management) Robot Task analysis Humanoid robot Voice activity detection Speech processing

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
20
Refs
0.04
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Hearing Loss and Rehabilitation
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

JOURNAL ARTICLE

1P1-K06 Audio-Visual Speaker Detection in Human-Robot Interaction

Thatsaphan SuwannathatJun‐ichi ImaiMasahide Kaneko

Journal:   The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) Year: 2007 Vol: 2007 (0)Pages: _1P1-K06_1
JOURNAL ARTICLE

End-to-End Audio-Visual Neural Speaker Diarization

Maokui HeJun DuChin‐Hui Lee

Journal:   Interspeech 2022 Year: 2022
JOURNAL ARTICLE

DyViSE: Dynamic Vision-Guided Speaker Embedding for Audio-Visual Speaker Diarization

Abudukelimu WuerkaixiKunda YanYou ZhangZhiyao DuanChangshui Zhang

Journal:   2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP) Year: 2022 Pages: 1-6
JOURNAL ARTICLE

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion

Israel D. GebruSilèye BaXiaofei LiRadu Horaud

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2017 Vol: 40 (5)Pages: 1086-1099
© 2026 ScienceGate Book Chapters — All rights reserved.