Corpus annotation of functional discourse units for aspect‑based sentiment analysis

dc.centro: Facultad de Filosofía y Letras (es_ES)
dc.contributor.author: Moreno-Ortiz, Antonio Jesús
dc.contributor.author: García Gámez, María
dc.date.accessioned: 2025-09-09T12:02:54Z
dc.date.available: 2025-09-09T12:02:54Z
dc.date.created: 2025-09-09
dc.date.issued: 2025-07-08
dc.departamento: Filología Inglesa, Francesa y Alemana (es_ES)
dc.description.abstract: Aspect-based sentiment analysis (ABSA) aims to identify the sentiment associated with specific aspects or entities in a text. In order to facilitate the development and evaluation of ABSA systems, it is crucial to have annotated datasets that contain information about the aspects, entities, and the sentiments expressed towards them. However, the amount of information in existing datasets (for example those used in the SemEval shared tasks) is very limited. We innovate on existing corpora by introducing a multi-layered annotation schema that includes not only entities and aspects, but also lexical items and, crucially, functional discourse units (FDUs). These FDUs are text segments (typically sentences or clauses) that play a specific role or function within the overall text, such as “description”, “evaluation”, or “advice”, a type of information which we believe can be of great help in ABSA. Our corpus focuses on user reviews of tourist attractions (specifically monuments) in the region of Andalusia (Spain), but the same schema can be used to annotate reviews of other domains simply by adapting the aspects layer, which is domain-dependent. The annotation schema is described, and the validation process is carried out on a sample of 400 reviews from this domain. Results show a substantial level of agreement among the annotators, indicating that the schema is reliable and consistent. We go on to illustrate and discuss some difficult cases where annotation showed discrepancy among annotators. The annotation of FDUs in the corpus is a significant advancement for aspect-based sentiment analysis. (es_ES)
dc.description.sponsorship: Funding for open access charge: Universidad de Málaga / CBUA (es_ES)
dc.identifier.citation: Moreno-Ortiz, A., García-Gámez, M. Corpus Annotation of Functional Discourse Units for Aspect-Based Sentiment Analysis. Corpus Pragmatics (2025). https://doi.org/10.1007/s41701-025-00199-0 (es_ES)
dc.identifier.doi: 10.1007/s41701-025-00199-0
dc.identifier.uri: https://hdl.handle.net/10630/39809
dc.language.iso: eng (es_ES)
dc.publisher: Springer (es_ES)
dc.relation.references: https://hdl.handle.net/10630/40750
dc.rights: Attribution 4.0 International (*)
dc.rights.accessRights: open access (es_ES)
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ (*)
dc.subject: Applied linguistics (es_ES)
dc.subject: Linguistic corpora - Data processing (es_ES)
dc.subject.other: Aspect-based sentiment analysis (es_ES)
dc.subject.other: Corpus annotation (es_ES)
dc.subject.other: Annotation schema (es_ES)
dc.subject.other: Functional discourse units (es_ES)
dc.title: Corpus annotation of functional discourse units for aspect‑based sentiment analysis (es_ES)
dc.type: journal article (es_ES)
dc.type.hasVersion: VoR (es_ES)
dspace.entity.type: Publication
relation.isAuthorOfPublication: 3233c4af-5a32-40f2-9c82-103bc48c43cd
relation.isAuthorOfPublication.latestForDiscovery: 3233c4af-5a32-40f2-9c82-103bc48c43cd

Files

Original bundle

Name: s41701-025-00199-0.pdf
Size: 1.77 MB
Format: Adobe Portable Document Format