Creator

Mark Davies

Abstract

Corpus del Español word, lemma, and part of speech data format.

Zip folder contains 20 .txt files of linguistic data from Colombia split into two categories: General (g) and Blogs (b).

Texts are separated by a line with ## and the textID.

File Format

.zip

File Size (MB)

1185

Creation Date

11-17-2016

Deposit Date

6-18-2024

License Restrictions

Corpora data is subject to access and use restrictions, including:

  • Data cannot be distributed outside Gonzaga
  • Access limited to restricted login or password
  • Data cannot be used to create software or products for sale or consumption
  • Data is for research and substantial portions (50,000 words or more) cannot be made available to undergraduates
  • Any publications or products based on the data should reference the source of the data (see Citation Information)
See the full limitations at Restrictions on use of the corpora.

Share

COinS