Links to other sociolinguistic corpora

 

Spanish:

CANOLFAN: The Miami corpus consists of conversations by Spanish-English bilinguals. http://bangortalk.org.uk/speakers.php?c=miami

CESA : Corpus del Español en el sur de Arizona (Corpus of Spanish in Southern Arizona)- https://cesa.arizona.edu

Corpus MIAMI: http://bangortalk.org.uk/speakers.php?c=miami&fbclid=IwAR1gYuXaXUhYw-u7_B0zS2kMwdMYyd-pQW_JGTkjrGxdoL11AgpcRsZFYI8

Corpus of Mexican Spanish in Salinas - http://itcdland.csumb.edu/~eabrown/

COSER: Corpus oral y Sonora del Español Rural - www.uam.es/coser

ESLORA: Español de Galiza ESLORA (usc.es)

PRESEEA: Proyecto para el estudio sociolingüistico del español de España y de América - http://preseea.linguas.net

Spanish in Texas - http://sites.la.utexas.edu/spanishtx/

Text messages - McSweeney, M. (2016) The Bilingual Youth Texts Corpus. Available online at
  https://byts.commons.gc.cuny.edu.

 

OTHER LANGUAGES:

 

Dutch

Brieven als Buit- http://brievenalsbuit.inl.nl/zeebrieven/page/about

 

English

ACE: Australian National Corpus  -https://www.ausnc.org.au/corpora/facet_listing?include_subfolders

DECTE: Diachronic Electronic Corpus of Tyneside English - http://research.ncl.ac.uk/decte/

ICE: International Corpus of English - http://ice-corpora.net/ice/index.htm

SBCSAE: The Santa Barbara Corpus of Spoken American English ­http://www.linguistics.ucsb.edu/research/santa-barbara-corpus

SLAAP: The Sociolinguistic Archive and Analysis Project (American English)  -http://slaap.lib.ncsu.edu

PAC: The Phonology of Contemporary English - http://www.projet-pac.net/index.php/protocol

WSC: The Wellington Corpus of Spoken New Zealand English - http://www.victoria.ac.nz/lals/resources/corpora-default/corpora-wsc

CORAAL: The Corpus of Regional African American Language https://oraal.uoregon.edu/coraal

 

French

CFPP: Le Corpus de Français Parlé Parisien - http://cfpp2000.univ-paris3.fr

PFC: Phonologie du Français Contemporain (Phonology of Contemporary French) - http://www.projet-pfc.net

Valibel -- Discours et variation (French spoken in Bruxelles) -http://www.uclouvain.be/81834.html

 

German

The Kiel Corpus - http://www.ipds.uni-kiel.de/forschung/kielcorpus.en.html

KiDKo: KiezDeutsch-Korpus (German in in a multilingual neighborhood of Berlin) - www.kiezdeutschkorpus.de

 

Hebrew

CoSIH: The Corpus of Spoken Israeli Hebrew - http://humanities.tau.ac.il/~cosih/english/

 

Italian

LABLITA: Corpus of Spontaneous Spoken Italian - http://lablita.dit.unifi.it/corpora/descriptions/lablita/

 

LITHUANIAN

Vilniečių interviu bazė "Kalba Vilnius" (database of sociolinguistic interviews "Vilnius Speaking") - http://kalbavilnius.sociolingvistika.lt

 

PORTUGUESE

Amostra da Fala Paulistana - http://projetosp2010.fflch.usp.br/node/3

Projeto São Paulo 2020 - http://projetosp2010.fflch.usp.br/corpus

ALIP: Amostra Lingüística do Interior Paulista (Brazilian Portuguese)  -http://www.iboruna.ibilce.unesp.br

Several Portuguese Corpura - http://comet.fflch.usp.br/corporaportugues 

Pará - http://geosociolin.com/

 

SIGN LANGUAGE

Auslan: Australian Sign Language - http://elar.soas.ac.uk/deposit/0001

BSLCP: British Sign Language Corpus Project - http://www.bslcorpusproject.org/data/

 

Swedish

The Swedish Spoken Language Corpus at Göteborg University - http://www.ling.gu.se/projekt/old_tal/SLcorpus.html

 

Multiple Languages:

CHILDES: Child Language Data Exchange System (Spanish-English bilingual children) - http://childes.psy.cmu.edu

CORAL: Integrated Reference Corpora for Spoken Romance Languages. https://benjamins.com/catalog/scl.15

HLVC: Heritage Language Variation and Change Project (Various heritage languages spoken in Toronto) - http://projects.chass.utoronto.ca/ngn/HLVC/1_4_corpus.php

LDC: The Linguistic Data Consortium - https://www.ldc.upenn.edu/about

The New England Corpus of Heritage and Second Language Speakers (Oral and written production of heritage and L2 speakers of Spanish and Portuguese in New England) - http://digitalhumanities.umass.edu/projects/new-england-corpus-heritage-and-second-language-speakers

Talk Bank - http://www.talkbank.org

Wikicorpus (Catalan, Spanish and English)-  http://www.cs.upc.edu/~nlp/wikicorpus/