Create dataset loader for TalkBankDB CHILDES #684

Open
SamuelCahyawijaya opened this issue May 27, 2024 · 0 comments
Dataloader name: talkbankdb_childes/talkbankdb_childes.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?talkbankdb_childes

Dataset: talkbankdb_childes
Description: The Child Language Data Exchange System (CHILDES) (https://childes.talkbank.org) is the child language component of the TalkBank system (https://www.talkbank.org). The data can be accessed through the TalkBankDB portal, through a Python API (see the Homepage link below), or through the package described here: https://link.springer.com/article/10.3758/s13428-018-1176-7. TalkBank is an interdisciplinary project that maintains an openly available database of recorded and transcribed spoken language interactions. It comprises a series of topic-specific databases for particular research areas, including classroom discourse, aphasia, conversation analysis, Supreme Court, bilingualism, second language learning, dementia, child language, and five other, more specific topic areas.
Subsets: Indonesian, Javanese, Manado Malay, Tagalog, Thai, Yau
Languages: ind, jav, xmm, tgl, tha, jau
Tasks: Automatic Speech Recognition
License: BSD 3-clause Clear license (bsd-3-clause-clear)
Homepage: https://github.com/TalkBank/TBDBpy
HF URL: -
Paper URL: https://direct.mit.edu/coli/article/26/4/657/1687
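
As a starting point for the loader, the subset and language fields above can be paired into a lookup table. The sketch below is only illustrative: the pairing assumes the two lists are given in the same order, and the `config_names` helper, the `sptext` schema label, and the `_source` / `_seacrowd_` naming pattern are assumptions, not something this issue specifies. It does not call TBDBpy or any other API.

```python
# Subset names and language codes copied from the issue, assumed to be
# listed in matching order.
SUBSET_NAMES = ["Indonesian", "Javanese", "Manado Malay", "Tagalog", "Thai", "Yau"]
LANGUAGE_CODES = ["ind", "jav", "xmm", "tgl", "tha", "jau"]

# Map each language code to its human-readable subset name.
SUBSETS = dict(zip(LANGUAGE_CODES, SUBSET_NAMES))

DATASET_NAME = "talkbankdb_childes"


def config_names(dataset, codes, schema="sptext"):
    """Build hypothetical config names: one source and one schema config per subset."""
    names = []
    for code in codes:
        names.append(f"{dataset}_{code}_source")
        names.append(f"{dataset}_{code}_seacrowd_{schema}")
    return names


if __name__ == "__main__":
    for name in config_names(DATASET_NAME, SUBSETS):
        print(name)
```

Keeping the codes and names in one mapping means that adding or dropping a subset only touches the two lists at the top.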