Abstract

Question Answering (QA) has been an area of interest for researchers, in part motivated by the international QA evaluation forums, namely the Text REtrieval Conference (TREC), and more recently, the Cross Language Evaluation Forum (CLEF) through QA@CLEF, that since 2004 includes the Portuguese language. In these forums, a collection of written documents is provided, as well as a set of questions, which are to be answered by the participating systems. Each system is evaluated by its capacity to answer the questions, as a whole, and there are relatively few results published that focus on the performance of its different components and their influence on the overall system performance. That is the case of the Information Retrieval (IR) component, which is broadly used in QA systems. Our work concentrates on the different options of preprocessing Portuguese text before feeding it to the IR component, evaluating their impact on the IR performance in the specific context of QA, so that we can make a sustained choice of which options to choose. From this work we conclude the clear advantage of the basic preprocessing techniques: case folding and removal of punctuation marks. For the other techniques considered, stop word removal enhanced the performance of the IR system but that was not the case as far as Stemming and Lemmatization are concerned.

Keywords:
Question answering Computer science Information retrieval Natural language processing

Metrics

19
Cited By
1.16
FWCI (Field Weighted Citation Impact)
6
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Intelligent Tutoring Systems and Adaptive Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Document Retrieval System for Biomedical Question Answering

Harun BolatBaha Şen

Journal:   Applied Sciences Year: 2024 Vol: 14 (6)Pages: 2613-2613
JOURNAL ARTICLE

Passage retrieval vs. document retrieval for factoid question answering

Charles L. A. ClarkeEgidio L. Terra

Journal:   Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03 Year: 2003
BOOK-CHAPTER

Document Retrieval in the Context of Question Answering

Christof Monz

Lecture notes in computer science Year: 2003 Pages: 571-579
© 2026 ScienceGate Book Chapters — All rights reserved.