Corpus annotation of functional discourse units for aspect‑based sentiment analysis
| dc.centro | Facultad de Filosofía y Letras | es_ES |
| dc.contributor.author | Moreno-Ortiz, Antonio Jesús | |
| dc.contributor.author | García Gámez, María | |
| dc.date.accessioned | 2025-09-09T12:02:54Z | |
| dc.date.available | 2025-09-09T12:02:54Z | |
| dc.date.created | 2025-09-09 | |
| dc.date.issued | 2025-07-08 | |
| dc.departamento | Filología Inglesa, Francesa y Alemana | es_ES |
| dc.description.abstract | Aspect-based sentiment analysis (ABSA) aims to identify the sentiment associated with specifc aspects or entities in a text. In order to facilitate the development and evaluation of ABSA systems, it is crucial to have annotated datasets that contain information about the aspects, entities, and the sentiments expressed towards them. However, the amount of information in existing datasets (for example those used in the SemEval shared tasks) is very limited. We innovate on existing corpora by introducing a multi-layered annotation schema that includes not only entities and aspects, but also lexical items and, crucially, functional discourse units (FDUs). These FDUs are text segments (typically sentences or clauses) that play a specifc role or function within the overall text, such as “description”, “evaluation”, or “advice”, a type of information which we believe can be of great help in ABSA. Our corpus focuses on user reviews of tourist attractions (specifcally monuments) in the region of Andalusia (Spain), but the same schema can be used to annotate reviews of other domains simply by adapting the aspects layer, which is domain-dependent. The annotation schema is described, and the validation process is carried out on a sample of 400 reviews from this domain. Results show a substantial level of agreement among the annotators, indicating that the schema is reliable and consistent. We go on to illustrate and discuss some difcult cases where annotation showed discrepancy among annotators. The annotation of FDUs in the corpus is a signifcant advancement for aspect-based sentiment analysis. | es_ES |
| dc.description.sponsorship | Funding for open access charge: Universidad de Málaga / CBUA | es_ES |
| dc.identifier.citation | Moreno-Ortiz, A., García-Gámez, M. Corpus Annotation of Functional Discourse Units for Aspect-Based Sentiment Analysis. Corpus Pragmatics (2025). https://doi.org/10.1007/s41701-025-00199-0 | es_ES |
| dc.identifier.doi | 10.1007/s41701-025-00199-0 | |
| dc.identifier.uri | https://hdl.handle.net/10630/39809 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Springer | es_ES |
| dc.relation.references | https://hdl.handle.net/10630/40750 | |
| dc.rights | Atribución 4.0 Internacional | * |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
| dc.subject | Lingüística aplicada | es_ES |
| dc.subject | Corpus lingüístico - Proceso de datos | es_ES |
| dc.subject.other | Aspect-based sentiment analysis | es_ES |
| dc.subject.other | Corpus annotation | es_ES |
| dc.subject.other | Annotation schema | es_ES |
| dc.subject.other | Functional discourse units | es_ES |
| dc.title | Corpus annotation of functional discourse units for aspect‑based sentiment analysis | es_ES |
| dc.type | journal article | es_ES |
| dc.type.hasVersion | VoR | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 3233c4af-5a32-40f2-9c82-103bc48c43cd | |
| relation.isAuthorOfPublication.latestForDiscovery | 3233c4af-5a32-40f2-9c82-103bc48c43cd |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- s41701-025-00199-0.pdf
- Size:
- 1.77 MB
- Format:
- Adobe Portable Document Format
- Description:

