Clustering Driven Multi-Hop Graph Attention Network for Speaker Diarization

Qingqing Mao; Qirong Mao

doi:10.1109/ichci58871.2023.10277871

ScienceGate Book Chapters

JOURNAL ARTICLE

Clustering Driven Multi-Hop Graph Attention Network for Speaker Diarization

Qingqing Mao Qirong Mao

Year: 2023 Pages: 408-411

DOI: 10.1109/ichci58871.2023.10277871

Get Full-Text PDF Get Analytical Report

Abstract

Recently, the segmented utterance-level modeling approach based on Graph Attention Network (GAT) has been proved to be effective in Clustering-based Speaker Diarization (CSD). However, these existing methods only rely on the message passing by a single neighbor per layer, ignoring the influence of sub-region and global information. In this paper, we propose clustering driven multi-hop Graph Attention Network (CD-MGAT) with the multi-hop neighbor module and the clustering-oriented prototype module, which effectively explores the sub-region and global information for each segmented utterance. Specifically, the developed modules can adaptively interact with each other by clustering-consistency loss, which ensures the consistency of learning between the prototype and speaker embedding. Extensive experiments demonstrate the effectiveness of our solution on the AMI datasets.

Keywords:

Cluster analysis Computer science Graph Consistency (knowledge bases) Embedding Utterance Speaker diarisation Data mining Artificial intelligence Theoretical computer science Speaker recognition

Metrics

Cited By

0.26

FWCI (Field Weighted Citation Impact)

Refs

0.57

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Clustering Driven Multi-Hop Graph Attention Network for Speaker Diarization

Abstract

Metrics

Citation History

Topics

Related Documents

Graph attention-based deep embedded clustering for speaker diarization

Triplet Network with Attention for Speaker Diarization

Markov clustering regularized multi-hop graph neural network

Multi-scale graph attention subspace clustering network

Supervised Hierarchical Clustering Using Graph Neural Networks for Speaker Diarization