Datasets

Corpus de Sonetos del Siglo de Oro anotado con información métrica / Corpus of Spanish Golden-Age Sonnets with metrical information.

https://github.com/bncolorado/CorpusSonetosSigloDeOro

How to cite: Navarro-Colorado, Borja; Ribes Lafoz, María; and Sánchez, Noelia (2016) «Metrical annotation of a large corpus of Spanish sonnets: representation, scansion and evaluation», in 10th edition of the Language Resources and Evaluation Conference (LREC 2016), Portorož, Slovenia.

ELTeC: European Literary Text Collection.

A balanced selection of European novels from the period 1840 to 1920.

https://github.com/COST-ELTeC

How to cite:

– Lou Burnard, Christof Schöch, Carolin Odebrecht (2021): “In Search of Comity: TEI for Distant Reading”, in: Journal of the Text Encoding Initiative 14. DOI: https://doi.org/10.4000/jtei.3500.

– Burnard, Lou; Borja Navarro Colorado, Carolin Odebrecht and Martina Scholger (2022) «Collaborative creation of a multi-lingual literary corpus. Challenges and best practices for corpus design», in COST Action Distant Reading Closing Conference, Kraków/online, 21-22 June 2022.

ELTeC-SPA: Spanish novels for the European Literary Text Collection (ELTeC)

A balanced selection of Spanish novels from the period 1840 to 1920. Collection editor: Borja Navarro Colorado (University of Alicante).

https://github.com/COST-ELTeC/ELTeC-spa

How to cite:

– Burnard, Lou; Borja Navarro Colorado, Carolin Odebrecht and Martina Scholger (2022) «Collaborative creation of a multi-lingual literary corpus. Challenges and best practices for corpus design», in COST Action Distant Reading Closing Conference, Kraków/online, 21-22 June 2022.