José Ángel GonzálezLluís-F. HurtadoEncarna SegarraFernando GarcíaErnesto Julià Sanchís
In this paper, we present an approach to Spanish talk shows summarization. Our approach is based on the use of Siamese Neural Networks on the transcription of the show audios. Specifically, we propose to use Hierarchical Attention Networks to select the most relevant sentences for each speaker about a given topic in the show, in order to summarize his opinion about the topic. We train these networks in a siamese way to determine whether a summary is appropriate or not. Previous evaluation of this approach on summarization task of English newspapers achieved performances similar to other state-of-the-art systems. In the absence of enough transcribed or recognized speech data to train our system for talk show summarization in Spanish, we acquire a large corpus of document-summary pairs from Spanish newspapers and we use it to train our system. We choose this newspapers domain due to its high similarity with the topics addressed in talk shows. A preliminary evaluation of our summarization system on Spanish TV programs shows the adequacy of the proposal.
José Ángel GonzálezEncarna SegarraFernando GarcíaEmilio SanchísLlu rsquo ıs-F. Hurtado
José Ángel González-BarbaJulien DeloncaEmilio Sanchís ArnalFernando GarcíaEncarna Segarra
Jingxu LinSheng-hua ZhongAhmed Fares
José Ángel GonzálezEncarna SegarraFernando GarcíaEmilio SanchísLluís-F. Hurtado
Yufeng DiaoHongfei LinLiang YangXiaochao FanYonghe ChuDi WuDongyu ZhangKan Xu