Extracting References from German Legal Texts Using Named Entity Recognition1

Silvio Peikert; Celia Birle; Jamal Al Qundus; Le Duyen Sandra Vu; Adrian Paschke

doi:10.3233/faia220472

ScienceGate Book Chapters

BOOK-CHAPTER

Extracting References from German Legal Texts Using Named Entity Recognition1

Silvio Peikert Celia Birle Jamal Al Qundus Le Duyen Sandra Vu Adrian Paschke

Year: 2022 Frontiers in artificial intelligence and applications

DOI: 10.3233/faia220472

Get Full-Text PDF Get Analytical Report

Abstract

Information extraction tasks are particularly challenging in specific contexts such as the legal domain. In this paper, Named Entity Recognition is used to make legal texts more accessible to domain experts and laymen. This paper focuses on extracting law references and citations of court decisions, which occur in various syntactic formats. To investigate this task a reference data set is constructed from a large collection of German court decisions and different NER-techniques are compared. Pattern matching, probabilistic sequence labeling (CRF), Deep Learning (BiLSTM) and transfer learning using a pretrained language model (BERT) are applied to extract references to laws and court decisions. The results show that the BERT based approach achieves F1 scores around 0.98 for both tasks and outperforms methods from prior work, which achieve F1 scores of 0.89 (CRF for law references) respectively 0.82 (CRF for court decisions) on the same data set.

Keywords:

German Computer science Task (project management) Natural language processing Artificial intelligence Named-entity recognition Set (abstract data type) Transfer of learning Matching (statistics) Domain (mathematical analysis) Probabilistic logic Sequence labeling F1 score Sequence (biology) Information retrieval Linguistics Engineering Mathematics

Metrics

Cited By

6.00

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Artificial Intelligence in Law

Social Sciences → Social Sciences → Political Science and International Relations

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Extracting References from German Legal Texts Using Named Entity Recognition1

Abstract

Metrics

Citation History

Topics

Related Documents

Extracting Ecological Facts from Karakalpak Texts via Named Entity Recognition

Named-entity recognition in Turkish legal texts

Extracting Information from NOTAMs Using Named-Entity Recognition

Named Entity Recognition from Turkish texts

German BERT Model for Legal Named Entity Recognition