Sentence-Level Rhetorical Role Labeling in Judicial Decisions

Csányi, Gergely Márk; Üveges, István; Lakatos, Dorina; Ripszám, Dóra; Kozák, Kornélia; Nagy, Dániel; Vadász, János Pál

Published December 5, 2025

Journal article, Open Access

1. MONTANA Knowledge Management Ltd., H-1029 Budapest, Hungary
2. Department of Electric Power Engineering, Budapest University of Technology and Economics, H-1111 Budapest, Hungary
3. Political and Legal Text Mining & Artificial Intelligence Laboratory (poltextLAB), ELTE Centre for Social Sciences, H-1097 Budapest, Hungary
4. UNESCO Chair on Digital Platforms for Learning Societies, Institute of the Information Society, Ludovika University of Public Service, H-1083 Budapest, Hungary
5. Department of European Public and Private Law, Faculty of Public Governance and International Studies, Ludovika University of Public Service, H-1083 Budapest, Hungary

This paper presents an in-production Rhetorical Role Labeling (RRL) classifier developed for Hungarian judicial decisions. RRL is a sequential classification problem in Natural Language Processing that aims to assign a functional role (such as facts, arguments, or decision) to every segment or sentence in a legal document. The study was conducted on a human-annotated sentence-level RRL corpus and compares multiple neural architectures, including BiLSTM and attention-based networks, against a support vector machine baseline. It further investigates the impact of late chunking during vectorization, in contrast to classical approaches. Test results on the labeled dataset and annotator agreement statistics are reported, and performance is analyzed across architecture types and embedding strategies. Contrary to recent findings in retrieval tasks, late chunking does not show consistent improvements for sentence-level RRL, suggesting that contextualization through chunk embeddings may introduce noise rather than useful context in Hungarian legal judgments. The work also discusses the unique structure and labeling challenges of Hungarian cases compared to international datasets and provides empirical insights for future legal NLP research on non-English court decisions.
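
The contrast between late chunking and classical vectorization mentioned in the abstract can be illustrated with a minimal sketch. It assumes that late chunking means encoding the whole document in one pass and only afterwards pooling the token embeddings of each sentence, whereas a classical approach would encode each sentence in isolation. The function names, the mean-pooling choice, and the toy vectors are illustrative assumptions, not the authors' implementation.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def mean_pool(vectors: Sequence[Vector]) -> Vector:
    """Average a non-empty sequence of equal-length vectors."""
    if not vectors:
        raise ValueError("cannot pool an empty span")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def late_chunk(
    token_embeddings: Sequence[Vector],
    sentence_spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Pool document-level token embeddings into one vector per sentence.

    token_embeddings: assumed output of a single encoder pass over the
        full document, so each token vector already reflects document context.
    sentence_spans: half-open (start, end) token index ranges, one per sentence.
    """
    return [mean_pool(token_embeddings[start:end]) for start, end in sentence_spans]


# Toy stand-in for encoder output: three tokens, two dimensions (hypothetical values).
doc_tokens = [[1.0, 0.0], [3.0, 2.0], [0.0, 4.0]]
# Sentence 1 covers tokens 0-1, sentence 2 covers token 2.
spans = [(0, 2), (2, 3)]

sentence_vectors = late_chunk(doc_tokens, spans)
print(sentence_vectors)
```

Under these assumptions, each resulting sentence vector would then be passed to the sentence-level classifier; the abstract's finding is that this extra document context does not consistently help for RRL.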

Open Access

Licence Attribution (CC BY)

Publication Details

Journal article

Journal: Big Data and Cognitive Computing

Publisher: MDPI AG

ISSN: 2504-2289

Volume: 9

Pages: 315

Persistent Identifiers

DOI: 10.3390/bdcc9120315
