Rhetorical Role Identification for Portuguese Legal Documents
Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review
Authors
In this paper, we present a new corpus for Rhetorical Role Identification in Portuguese legal documents. The corpus comprises petitions from 70 civil lawsuits filed in TJMS court and was manually labeled with rhetorical roles specifically tailored for petitions. Since petition documents are created without a standard structure, we had to deal with several issues to clean the extracted textual content. We assessed classic and deep learning machine learning methods on the proposed corpus. The best performing method obtained an F-score of 80.50. At the best of our knowledge, this is the first work to deal with rhetorical role identification for petitions, given that previous works focused only on judicial decisions. Additionally, it is also the first work to tackle this task for the Portuguese language. The proposed corpus, as well as the proposed rhetorical roles, can foster new research in the judicial area and also lead to new solutions to improve the flow of Brazilian court houses.
Original language | English |
---|---|
Title of host publication | Intelligent Systems : 10th Brazilian Conference, BRACIS 2021, Virtual Event, November 29 – December 3, 2021, Proceedings, Part II |
Editors | André Britto, Karina Valdivia Delgado |
Number of pages | 15 |
Place of Publication | Cham |
Publisher | Springer Schweiz |
Publication date | 2021 |
Pages | 557-571 |
ISBN (print) | 978-3-030-91698-5 |
ISBN (electronic) | 978-3-030-91699-2 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Event | Brazilian Conference on Intelligent Systems - BRACIS 2021 - Virtual, Online Duration: 29.11.2021 → 03.12.2021 Conference number: 10 https://c4ai.inova.usp.br/bracis2021/#:~:text=Organized%20by%20C4AI%2C%20the%2010th,29th%20to%20December%203rd%2C%202021. |
- Corpus, Legal sentence classification, Natural language processing, Rhetorical role identification
- Informatics
- Business informatics