Corpus sense: A comprehensive tool for advanced text and discourse exploration
| dc.centro | Facultad de Filosofía y Letras | es_ES |
| dc.centro | Facultad de Filosofía y Letras | |
| dc.contributor.author | Moreno-Ortiz, Antonio Jesús | |
| dc.date.accessioned | 2025-09-10T09:29:20Z | |
| dc.date.available | 2025-09-10T09:29:20Z | |
| dc.date.issued | 2025-08-13 | |
| dc.departamento | Filología Inglesa, Francesa y Alemana | es_ES |
| dc.description.abstract | Corpus Sense is a web application with a focus on content and discourse analysis designed to facilitate the exploration, analysis and visualization of linguistic corpora that incorporates some advanced functionalities not available in existing software. The tool enables users to obtain useful insights with minimal effort by combining quantitative, qualitative and AI-powered features. It is designed for small to medium-sized corpora (currently up to 2.5 million tokens), permits online corpus sharing, and offers unique functionalities, such as NLP-based keyword extraction, named entity recognition, semantic search and advanced topic modelling with LLM-generated interpretable labels. The application’s interface is simple and intuitive, in an effort to make it accessible to a wide range of user profiles. This paper provides a comprehensive overview of the application’s development, architecture and applications in corpus linguistics and discourse analysis research. This description is complemented by a discussion of the integration of novel NLP-based and AI-assisted tools with traditional corpus analysis methods. | es_ES |
| dc.description.sponsorship | Ministerio de Ciencia, Innovación y Universidades | es_ES |
| dc.description.sponsorship | Funding for open access charge: Universidad de Málaga/CBUA | es_ES |
| dc.identifier.citation | Antonio Moreno-Ortiz, Corpus sense: A comprehensive tool for advanced text and discourse exploration, Applied Corpus Linguistics, Volume 5, Issue 3, 2025, 100145, ISSN 2666-7991, https://doi.org/10.1016/j.acorp.2025.100145. (https://www.sciencedirect.com/science/article/pii/S2666799125000280) | es_ES |
| dc.identifier.doi | 10.1016/j.acorp.2025.100145 | |
| dc.identifier.uri | https://hdl.handle.net/10630/39823 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Elsevier | es_ES |
| dc.relation.projectID | PID2020-115310RB-I00 | es_ES |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | Lingüística - Innovaciones tecnológicas | es_ES |
| dc.subject | Corpus lingüístico - Proceso de datos | es_ES |
| dc.subject | Recuperación de la información | es_ES |
| dc.subject.other | Corpus analysis software | es_ES |
| dc.subject.other | Keyword extraction | es_ES |
| dc.subject.other | Named entity recognition | es_ES |
| dc.subject.other | Semantic search | es_ES |
| dc.subject.other | Large language models | es_ES |
| dc.title | Corpus sense: A comprehensive tool for advanced text and discourse exploration | es_ES |
| dc.type | journal article | es_ES |
| dc.type.hasVersion | VoR | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 3233c4af-5a32-40f2-9c82-103bc48c43cd | |
| relation.isAuthorOfPublication.latestForDiscovery | 3233c4af-5a32-40f2-9c82-103bc48c43cd |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 1-s2.0-S2666799125000280-main.pdf
- Size:
- 7.2 MB
- Format:
- Adobe Portable Document Format
- Description:
- Artículo principal
Description: Artículo principal

