Corpus sense: A comprehensive tool for advanced text and discourse exploration

dc.centro: Facultad de Filosofía y Letras [es_ES]
dc.contributor.author: Moreno-Ortiz, Antonio Jesús
dc.date.accessioned: 2025-09-10T09:29:20Z
dc.date.available: 2025-09-10T09:29:20Z
dc.date.issued: 2025-08-13
dc.departamento: Filología Inglesa, Francesa y Alemana [es_ES]
dc.description.abstract: Corpus Sense is a web application with a focus on content and discourse analysis designed to facilitate the exploration, analysis and visualization of linguistic corpora that incorporates some advanced functionalities not available in existing software. The tool enables users to obtain useful insights with minimal effort by combining quantitative, qualitative and AI-powered features. It is designed for small to medium-sized corpora (currently up to 2.5 million tokens), permits online corpus sharing, and offers unique functionalities, such as NLP-based keyword extraction, named entity recognition, semantic search and advanced topic modelling with LLM-generated interpretable labels. The application’s interface is simple and intuitive, in an effort to make it accessible to a wide range of user profiles. This paper provides a comprehensive overview of the application’s development, architecture and applications in corpus linguistics and discourse analysis research. This description is complemented by a discussion of the integration of novel NLP-based and AI-assisted tools with traditional corpus analysis methods. [es_ES]
dc.description.sponsorship: Ministerio de Ciencia, Innovación y Universidades [es_ES]
dc.description.sponsorship: Funding for open access charge: Universidad de Málaga/CBUA [es_ES]
dc.identifier.citation: Antonio Moreno-Ortiz, Corpus sense: A comprehensive tool for advanced text and discourse exploration, Applied Corpus Linguistics, Volume 5, Issue 3, 2025, 100145, ISSN 2666-7991, https://doi.org/10.1016/j.acorp.2025.100145. (https://www.sciencedirect.com/science/article/pii/S2666799125000280) [es_ES]
dc.identifier.doi: 10.1016/j.acorp.2025.100145
dc.identifier.uri: https://hdl.handle.net/10630/39823
dc.language.iso: eng [es_ES]
dc.publisher: Elsevier [es_ES]
dc.relation.projectID: PID2020-115310RB-I00 [es_ES]
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.accessRights: open access [es_ES]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject: Linguistics - Technological innovations [es_ES]
dc.subject: Linguistic corpus - Data processing [es_ES]
dc.subject: Information retrieval [es_ES]
dc.subject.other: Corpus analysis software [es_ES]
dc.subject.other: Keyword extraction [es_ES]
dc.subject.other: Named entity recognition [es_ES]
dc.subject.other: Semantic search [es_ES]
dc.subject.other: Large language models [es_ES]
dc.title: Corpus sense: A comprehensive tool for advanced text and discourse exploration [es_ES]
dc.type: journal article [es_ES]
dc.type.hasVersion: VoR [es_ES]
dspace.entity.type: Publication
relation.isAuthorOfPublication: 3233c4af-5a32-40f2-9c82-103bc48c43cd
relation.isAuthorOfPublication.latestForDiscovery: 3233c4af-5a32-40f2-9c82-103bc48c43cd

Files

Original bundle

Name: 1-s2.0-S2666799125000280-main.pdf
Size: 7.2 MB
Format: Adobe Portable Document Format
Description: Main article

