Sentence-Level Rhetorical Role Labeling in Judicial Decisions
Creators
- 1. MONTANA Knowledge Management Ltd., H-1029 Budapest, Hungary
- 2. Department of Electric Power Engineering, Budapest University of Technology and Economics, H-1111 Budapest, Hungary
- 3. Political and Legal Text Mining & Artificial Intelligence Laboratory (poltextLAB), ELTE Centre for Social Sciences, H-1097 Budapest, Hungary
- 4. UNESCO Chair on Digital Platforms for Learning Societies, Institute of the Information Society, Ludovika University of Public Service, H-1083 Budapest, Hungary
- 5. Department of European Public and Private Law, Faculty of Public Governance and International Studies, Ludovika University of Public Service, H-1083 Budapest, Hungary
Description
This paper presents an in-production Rhetorical Role Labeling (RRL) classifier developed for Hungarian judicial decisions. RRL is a sequential classification problem in Natural Language Processing, aiming to assign functional roles (such as facts, arguments, decision, etc.) to every segment or sentence in a legal document. The study was conducted on a human-annotated sentence-level RRL corpus and compares multiple neural architectures, including BiLSTM, attention-based networks, and a support vector machine as baseline. It further investigates the impact of late chunking during vectorization, in contrast to classical approaches. Results from tests on the labeled dataset and annotator agreement statistics are reported, and performance is analyzed across architecture types and embedding strategies. Contrary to recent findings in retrieval tasks, late chunking does not show consistent improvements for sentence-level RRL, suggesting that contextualization through chunk embeddings may introduce noise rather than useful context in Hungarian legal judgments. The work also discusses the unique structure and labeling challenges of Hungarian cases compared to international datasets and provides empirical insights for future legal NLP research in non-English court decisions.
Open Access
Licence Attribution (CC BY)
Publisher Website
Access full text
Publication Details
Journal article
Persistent Identifiers
DOI
10.3390/bdcc9120315
Read more
References
Orosz, T., V\u00e1gi, R., Cs\u00e1nyi, G.M., Nagy, D., \u00dcveges, I., Vad\u00e...
Read more
Chen, J., Xiao, S., Zhang, P., Luo, K., Lian, D., and Liu, Z. (2024, January 11\...
Read more
Wang, H., He, T., Zou, Z., Shen, S., and Li, Y. (2019, January 22\u201326). Usin...
Read more
Bambroo, P., Adhikary, S., Bhattacharya, P., Chakraborty, A., Ghosh, S., and Gho...
Read more
Knott . The classification of coherence relations and their linguistic markers: ...
Read more
Showing first 5 of 24 references.