The Málaga Corpus of Late Modern English Scientific Prose

dc.centroFacultad de Filosofía y Letrases_ES
dc.contributor.authorCalle-Martín, Javier
dc.date.accessioned2023-05-15T10:39:04Z
dc.date.available2023-05-15T10:39:04Z
dc.date.issued2023-05-11
dc.departamentoFilología Inglesa, Francesa y Alemana
dc.description.abstractThe Málaga Corpus of Early English Scientific Prose is a collection of English vernacular medical writing, consisting of three diachronically divided components, i.e. The Málaga Corpus of Late Middle English Scientific Prose (1350-1500); The Málaga Corpus of Early Modern English Scientific Prose (1500-1700); and The Málaga Corpus of Late Modern English Scientific Prose (1700-1900). The three components have been purposely designed so as to contain evidence from the three text types of medical writing in English, that is, theoretical treatises, surgical treatises and recipe collections. In itself, the corpus stems from actual linguistic evidence of the period, both handwritten and printed, standing out as the ideal input for diachronic linguistic research at the levels of spelling, morpho-syntax and lexis. The present paper is particularly concerned with the third component of the corpus, The Málaga Corpus of Late Modern English Scientific Prose (1700-1900), which has been recently published and made available in the project’s webpage (https://latemodernmss.uma.es). In its current form, the corpus amounts to 2.5 million words, of which 1.5 million belong to the 18th century and the other million to the 19th century. The corpus is offered in three different formats, that is, the plain text version, the modernised version and the tagged version. The CQP-web version is also available for online use (https://latemodernmss.uma.es/cqpweb/). The present paper first describes the rationale of the corpus considering the typology of texts, their chronology, the text types and authorship. Second, the paper delves into the process of compilation, which is a sequential process consisting of a) modernisation by means of VaRD (Variant Detector) and b) automatic tagging by means of CLAWS (Constituent Likelihood Automatic Word-tagging System). The paper closes with a brief demonstration of the corpus potential using the CQP-web version.es_ES
dc.description.sponsorshipUniversidad de Málaga. Campus de Excelencia Internacional Andalucía Tech.es_ES
dc.identifier.urihttps://hdl.handle.net/10630/26563
dc.language.isospaes_ES
dc.relation.eventdate10/05/2023es_ES
dc.relation.eventplaceOviedo, Españaes_ES
dc.relation.eventtitle14º Congreso Internacional de Lingüística de Corpuses_ES
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectLingüística computacionales_ES
dc.subjectInglés técnicoes_ES
dc.subject.otherCorpuses_ES
dc.subject.otherInglés moderno tardíoes_ES
dc.subject.otherInglés científicoes_ES
dc.subject.otherEtiquetado de corpuses_ES
dc.titleThe Málaga Corpus of Late Modern English Scientific Prosees_ES
dc.typeconference outputes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationf634bb56-f67c-4e8f-8928-51228f50ebcd
relation.isAuthorOfPublication.latestForDiscoveryf634bb56-f67c-4e8f-8928-51228f50ebcd

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
The Málaga Corpus of Late Modern English Scientific Prose_Abstract.pdf
Size:
67.99 KB
Format:
Adobe Portable Document Format
Description: