Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset #1

Open
raymondxyy opened this issue Dec 14, 2018 · 0 comments
Open

Dataset #1

raymondxyy opened this issue Dec 14, 2018 · 0 comments

Comments

@raymondxyy
Copy link
Owner

raymondxyy commented Dec 14, 2018

Dataset

Updated: 12/27/2018, 11:47 PM


Generic Dataset

An abstract Dataset class is defined for easy batch processing. The base class is a direct copy of the Dataset class in PyTorch, with a few additional requirements:

  • Audio file paths must be saved in a list all_files. No other information (especially memory-intensive ones such as audio waveforms) should be stored.
  • Audio waveforms are read from paths only on indexing the dataset.

Other Datasets

Apart from the generic Dataset class, some common speech-related datasets subclass Dataset:

  • Speech recognition dataset (ASRDataset) holds transcripts and a transcript-to-label map that transforms a string sequence to a integer sequence.
    • Currently supporting WSJ0 (ASRWSJ0), WSJ1 (ASRWSJ1)
  • Speech enhancement dataset (SEDataset) holds a sequence of degraded-and-clean-speech pair.
    • Currently supporting VCTK (SEVCTKNoRev, SEVCTK2chan) and RATS (SERATS_SAD)
  • Speech activity detection dataset (SADDataset) holds time-stamps of speech-active regions for each speech file.
    • Will be added later

Overview

See audlib.data.dataset for the abstract interfaces of Dataset and its subclasses. For implementations of specific datasets, see the Wall Street Journal (WSJ) module in audlib.data.wsj, or other dataset modules.

@raymondxyy raymondxyy changed the title Separate normal dataset from asr dataset ASRDataset subclasses Dataset Dec 15, 2018
@raymondxyy raymondxyy changed the title ASRDataset subclasses Dataset ASR Dataset Dec 15, 2018
@raymondxyy raymondxyy changed the title ASR Dataset Dataset Dec 28, 2018
This was referenced Dec 28, 2018
Closed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant