Corpus Resources

How to Cite the Corpus

Carvalho, Ana M. (2012-). Corpus del Español en el Sur de Arizona (CESA). University of Arizona. cesa.arizona.edu

Links to Other Sociolinguistic Corpora

Spanish

CANOLFAN: The Miami corpus consists of conversations by Spanish-English bilinguals
CESA: Corpus del Español en el sur de Arizona (Corpus of Spanish in Southern Arizona)
Corpus MIAMI
Corpus of Mexican Spanish in Salinas
COSER: Corpus oral y Sonora del Español Rural
ESLORA: Español de Galiza
PRESEEA: Proyecto para el estudio sociolingüistico del español de España y de América
Spanish in Texas
Text messages - McSweeney, M. (2016) The Bilingual Youth Texts Corpus.

 

Dutch

Brieven als Buit

 

English

ACE: Australian National Corpus
DECTE: Diachronic Electronic Corpus of Tyneside English
ICE: International Corpus of English
SBCSAE: The Santa Barbara Corpus of Spoken American English
SLAAP: The Sociolinguistic Archive and Analysis Project (American English)
PAC: The Phonology of Contemporary English
WSC: The Wellington Corpus of Spoken New Zealand English
CORAAL: The Corpus of Regional African American Language

 

French

CFPP: Le Corpus de Français Parlé Parisien
PFC: Phonologie du Français Contemporain (Phonology of Contemporary French)
Valibel -- Discours et variation (French spoken in Bruxelles)

 

German

The Kiel Corpus
KiDKo: KiezDeutsch-Korpus (German in in a multilingual neighborhood of Berlin)

 

Hebrew

CoSIH: The Corpus of Spoken Israeli Hebrew

 

Italian

LABLITA: Corpus of Spontaneous Spoken Italian

 

Lithuanian

Vilniečių interviu bazė "Kalba Vilnius" (database of sociolinguistic interviews "Vilnius Speaking")

 

Portuguese

Amostra da Fala Paulistana
Projeto São Paulo 2020
ALIP: Amostra Lingüística do Interior Paulista (Brazilian Portuguese)
Several Portuguese Corpura
Pará

 

Sign Language

Auslan: Australian Sign Language
BSLCP: British Sign Language Corpus Project

 

Swedish

The Swedish Spoken Language Corpus at Göteborg University

 

Multiple Languages

CHILDES: Child Language Data Exchange System (Spanish-English bilingual children)
CORAL: Integrated Reference Corpora for Spoken Romance Languages
HLVC: Heritage Language Variation and Change Project (Various heritage languages spoken in Toronto)
LDC: The Linguistic Data Consortium
The New England Corpus of Heritage and Second Language Speakers (Oral and written production of heritage and L2 speakers of Spanish and Portuguese in New England)
Talk Bank
Wikicorpus (Catalan, Spanish and English)

CESA Protocol