Rhetorical Role Identification for Portuguese Legal Documents

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet


In this paper, we present a new corpus for Rhetorical Role Identification in Portuguese legal documents. The corpus comprises petitions from 70 civil lawsuits filed in TJMS court and was manually labeled with rhetorical roles specifically tailored for petitions. Since petition documents are created without a standard structure, we had to deal with several issues to clean the extracted textual content. We assessed classic and deep learning machine learning methods on the proposed corpus. The best performing method obtained an F-score of 80.50. At the best of our knowledge, this is the first work to deal with rhetorical role identification for petitions, given that previous works focused only on judicial decisions. Additionally, it is also the first work to tackle this task for the Portuguese language. The proposed corpus, as well as the proposed rhetorical roles, can foster new research in the judicial area and also lead to new solutions to improve the flow of Brazilian court houses.

TitelIntelligent Systems : 10th Brazilian Conference, BRACIS 2021, Virtual Event, November 29 – December 3, 2021, Proceedings, Part II
HerausgeberAndré Britto, Karina Valdivia Delgado
Anzahl der Seiten15
VerlagSpringer Schweiz
ISBN (Print)978-3-030-91698-5
ISBN (elektronisch)978-3-030-91699-2
PublikationsstatusErschienen - 2021
Extern publiziertJa
VeranstaltungBrazilian Conference on Intelligent Systems - BRACIS 2021 - Virtual, Online
Dauer: 29.11.202103.12.2021
Konferenznummer: 10