Real-time sociometrics from audio-visual features for two-person dialogs

Yasir Tahir; Debsubhra Chakraborty; Tomasz Maszczyk; Shoko Dauwels; Justin Dauwels; Nadia Magnenat‐Thalmann; Daniël Thalmann

doi:10.1109/icdsp.2015.7251991

ScienceGate Book Chapters

JOURNAL ARTICLE

Real-time sociometrics from audio-visual features for two-person dialogs

Yasir Tahir Debsubhra Chakraborty Tomasz Maszczyk Shoko Dauwels Justin Dauwels Nadia Magnenat‐Thalmann Daniël Thalmann

Year: 2015 Vol: 3 Pages: 823-827

DOI: 10.1109/icdsp.2015.7251991

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes a real time sociometric system to analyze social behavior from audio-visual recordings of two-person face-to-face conversations in English. The novelty of the proposed system lies in this automatic inference of ten social indicators in real time. The system comprises of a Microsoft kinect device that captures RGB and depth data to compute visual cues and microphones to capture speech cues from an on-going conversation. With these non-verbal cues as features, machine learning algorithms are implemented in the system to extract multiple indicators of social behavior including empathy, confusion and politeness. The system is trained and tested on two carefully annotated corpora that consist of two person dialogs. Based on leave-one-out cross-validation test, the accuracy range of developed algorithms to infer social behaviors is 50% - 86% for audio corpus, and 62% - 92% for audio-visual corpus.

Keywords:

Computer science Conversation Speech recognition Artificial intelligence Natural language processing Novelty Phrase Inference Politeness Face (sociological concept) Human–computer interaction Psychology Communication

Metrics

Cited By

1.26

FWCI (Field Weighted Citation Impact)

Refs

0.89

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Video Analysis and Summarization

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Real-time sociometrics from audio-visual features for two-person dialogs

Abstract

Metrics

Citation History

Topics

Related Documents

Real-Time Comprehensive Sociometrics for Two-Person Dialogs

Real Time Audio-Visual Person Tracking

Real-Time Emotion Recognition from Audio-Visual Data

Real-Time Emotion Recognition from Audio-Visual Data

Audio-visual person verification