The Málaga Corpus of Late Modern English Scientific Prose

Calle-Martín, Javier

The Málaga Corpus of Late Modern English Scientific Prose

dc.centro	Facultad de Filosofía y Letras	es_ES
dc.contributor.author	Calle-Martín, Javier
dc.date.accessioned	2023-05-15T10:39:04Z
dc.date.available	2023-05-15T10:39:04Z
dc.date.issued	2023-05-11
dc.departamento	Filología Inglesa, Francesa y Alemana
dc.description.abstract	The Málaga Corpus of Early English Scientific Prose is a collection of English vernacular medical writing, consisting of three diachronically divided components, i.e. The Málaga Corpus of Late Middle English Scientific Prose (1350-1500); The Málaga Corpus of Early Modern English Scientific Prose (1500-1700); and The Málaga Corpus of Late Modern English Scientific Prose (1700-1900). The three components have been purposely designed so as to contain evidence from the three text types of medical writing in English, that is, theoretical treatises, surgical treatises and recipe collections. In itself, the corpus stems from actual linguistic evidence of the period, both handwritten and printed, standing out as the ideal input for diachronic linguistic research at the levels of spelling, morpho-syntax and lexis. The present paper is particularly concerned with the third component of the corpus, The Málaga Corpus of Late Modern English Scientific Prose (1700-1900), which has been recently published and made available in the project’s webpage (https://latemodernmss.uma.es). In its current form, the corpus amounts to 2.5 million words, of which 1.5 million belong to the 18th century and the other million to the 19th century. The corpus is offered in three different formats, that is, the plain text version, the modernised version and the tagged version. The CQP-web version is also available for online use (https://latemodernmss.uma.es/cqpweb/). The present paper first describes the rationale of the corpus considering the typology of texts, their chronology, the text types and authorship. Second, the paper delves into the process of compilation, which is a sequential process consisting of a) modernisation by means of VaRD (Variant Detector) and b) automatic tagging by means of CLAWS (Constituent Likelihood Automatic Word-tagging System). The paper closes with a brief demonstration of the corpus potential using the CQP-web version.	es_ES
dc.description.sponsorship	Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech.	es_ES
dc.identifier.uri	https://hdl.handle.net/10630/26563
dc.language.iso	spa	es_ES
dc.relation.eventdate	10/05/2023	es_ES
dc.relation.eventplace	Oviedo, España	es_ES
dc.relation.eventtitle	14º Congreso Internacional de Lingüística de Corpus	es_ES
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.accessRights	open access	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Lingüística computacional	es_ES
dc.subject	Inglés técnico	es_ES
dc.subject.other	Corpus	es_ES
dc.subject.other	Inglés moderno tardío	es_ES
dc.subject.other	Inglés científico	es_ES
dc.subject.other	Etiquetado de corpus	es_ES
dc.title	The Málaga Corpus of Late Modern English Scientific Prose	es_ES
dc.type	conference output	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	f634bb56-f67c-4e8f-8928-51228f50ebcd
relation.isAuthorOfPublication.latestForDiscovery	f634bb56-f67c-4e8f-8928-51228f50ebcd

Files

Original bundle

Now showing 1 - 1 of 1

Name:: The Málaga Corpus of Late Modern English Scientific Prose_Abstract.pdf
Size:: 67.99 KB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Ponencias, Comunicaciones a congresos y Pósteres