JOURNAL ARTICLE

Extracting Relations from Italian Wikipedia using Self-Training

Siciliani, LuciaCassotti, PierluigiBasile, PierpaoloDe Gemmis, MarcoLops, PasqualeSemeraro, Giovanni

Year: 2021 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

This dataset contains relations extracted from the Italian Wikipedia by the WikiOIE framework.
WikiOIE is based on UDPipe and the Universal Dependencies project for text processing.
It easily allows customizing the information extraction (IE) approach to automatically extract triples (subject, predicate, object).
This dataset contains relations extracted by a supervised approach based on self-training.
The extraction process is provided in JSON format. Version 2 of the dataset was extracted using an improved version of the learning algorithm. The files of version 2 are identified by the suffix "_reg" in the file name. More information and the Java code are available here: https://github.com/pippokill/WikiOIE Self-training approach: Lucia Siciliani, Pierluigi Cassotti, Pierpaolo Basile, Marco de Gemmis, Pasquale Lops, and Giovanni Semeraro 2021. Extracting Relations from Italian Wikipedia using Self-Training. In Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021). CEUR-WS. WikiOIE framework: Pierluigi Cassotti, Lucia Siciliani, Pierpaolo Basile, Marco de Gemmis, and Pasquale Lops. 2021. Extracting relations from Italian Wikipedia using unsupervised information extraction. In Proceedings of the 11th Italian Information Retrieval Workshop 2021 (IIR 2021). CEUR-WS.

Keywords:
Suffix Information extraction JSON Code (set theory) Computational linguistics Relationship extraction

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.29
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Water Quality and Resources Studies
Physical Sciences →  Environmental Science →  Water Science and Technology
Hydrology and Sediment Transport Processes
Physical Sciences →  Environmental Science →  Ecology
Soil and Water Nutrient Dynamics
Physical Sciences →  Environmental Science →  Environmental Chemistry
© 2026 ScienceGate Book Chapters — All rights reserved.