PARALLELIZATION OF THE SENSEM CORPUS: SPANISH AND CATALAN

Authors

  • Gloria Vázquez García Universitat de Lleida
  • Ana Fernández Montraveta Universitat Autònoma de Barcelona

DOI:

https://doi.org/10.1344/AFEL2011.1.7

Keywords:

corpus annotations, corpus for Spanish Language, corpus for Catalan Language, parallel corpus, syntax, semantics

Abstract

This paper presents a parallel corpus for Spanish and Catalan, SenSem. The parallel corpus has been annotated at several linguistic levels (morphological, syntactical and semantic). The information covered at the different levels ranges from words to phrases, and sentences. One of the main values of the resource presented in this work is that it is the first corpus for Catalan that has been annotated with information regarding sentence semantics such as aspectuality and modality. The methodology followed in this project was to translate into Catalan the corpus we already had available from a previous project in Spanish. The link between both corpora has been established at the different informative units mentioned above. All the linguistic annotation has been inherited, correcting necessary aspects when it proved necessary

Downloads

Download data is not yet available.

Downloads

Published

2022-05-13

How to Cite

Vázquez García, G., & Fernández Montraveta, A. (2022). PARALLELIZATION OF THE SENSEM CORPUS: SPANISH AND CATALAN. Anuari De Filologia. Estudis De Lingüística, 1(1), 167–193. https://doi.org/10.1344/AFEL2011.1.7