Skip to content

Commit

Permalink
Merge pull request #32 from SuperKogito/add_SynAct_dataset
Browse files Browse the repository at this point in the history
add SynAct dataset
  • Loading branch information
SuperKogito authored Mar 14, 2023
2 parents b1db9b4 + b5e556a commit 0660df2
Show file tree
Hide file tree
Showing 3 changed files with 17 additions and 1 deletion.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@ The table can be browsed, sorted and searched under https://superkogito.github.i
| Dataset | Year | Content | Emotions | Format | Size | Language | Paper | Access | License |
|:--------------------------------------------------------------------------------------------------------------------------------------------------|:----------------|:------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------|:------------------------|:------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------|:------------------------------------------------------------------------------------------------------------------------------------------|
| <sub>[MESD](https://data.mendeley.com/datasets/cy34mh68j9/5)</sub> | <sub>2022</sub> | <sub>864 audio files of single-word emotional utterances with Mexican cultural shaping.</sub> | <sub>6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness.</sub> | <sub>Audio</sub> | <sub>0,097 GB</sub> | <sub>Spanish (Mexican)</sub> | <sub>[The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning](https://pubmed.ncbi.nlm.nih.gov/34891601/)</sub> | <sub>Open</sub> | <sub>[CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)</sub> |
| <sub>[SyntAct](https://zenodo.org/record/6573016#.ZAjy_9LMJpj)</sub> | <sub>2022</sub> | <sub>SyntAct</sub> | <sub> a synthesized database of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public</sub> | <sub>Audio</sub> | <sub>941 MB</sub> | <sub>German</sub> | <sub>[SyntAct: A Synthesized Database of Basic Emotions](http://felix.syntheticspeech.de/publications/synthetic_database.pdf)</sub> | <sub>Open</sub> | <sub>[CC BY-SA 4.0](https://creativecommons.org/licenses/by/4.0)</sub> |
| <sub>[MLEnd](https://www.kaggle.com/datasets/jesusrequena/mlend-spoken-numerals)</sub> | <sub>2021</sub> | <sub>~32700 audio recordings files produced by 154 speakers. Each audio recording corresponds to one English numeral (from "zero" to "billion")</sub> | <sub>Intonations: neutral, bored, excited and question</sub> | <sub>Audio</sub> | <sub>2.27 GB</sub> | <sub>--</sub> | <sub>--</sub> | <sub>Open</sub> | <sub>Unknown</sub> |
| <sub>[ASVP-ESD](https://www.kaggle.com/datasets/dejolilandry/asvpesdspeech-nonspeech-emotional-utterances)</sub> | <sub>2021</sub> | <sub>~13285 audio files collected from movies, tv shows and youtube containing speech and non-speech.</sub> | <sub>12 different natural emotions (boredom, neutral, happiness, sadness, anger, fear, surprise, disgust, excitement, pleasure, pain, disappointment) with 2 levels of intensity.</sub> | <sub>Audio</sub> | <sub>2 GB</sub> | <sub>Chinese, English, French, Russian and others</sub> | <sub>--</sub> | <sub>Open</sub> | <sub>Unknown</sub> |
| <sub>[ESD](https://hltsingapore.github.io/ESD/)</sub> | <sub>2021</sub> | <sub>29 hours, 3500 sentences, by 10 native English speakers and 10 native Chinese speakers.</sub> | <sub>5 emotions: angry, happy, neutral, sad, and surprise.</sub> | <sub>Audio, Text</sub> | <sub>2.4 GB (zip)</sub> | <sub>Chinese, English</sub> | <sub>[Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset](https://arxiv.org/pdf/2010.14794.pdf)</sub> | <sub>Open</sub> | <sub>Academic License</sub> |
Expand Down
1 change: 1 addition & 0 deletions src/ser-datasets.csv
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
Dataset,Year,Content,Emotions,Format,Size,Language,Paper,Access,License
`MESD <https://data.mendeley.com/datasets/cy34mh68j9/5>`_,2022,864 audio files of single-word emotional utterances with Mexican cultural shaping.,"6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness.",Audio,"0,097 GB",Spanish (Mexican),`The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning <https://pubmed.ncbi.nlm.nih.gov/34891601/>`_,Open,`CC BY 4.0 <https://creativecommons.org/licenses/by/4.0/>`_
`SyntAct <https://zenodo.org/record/6573016#.ZAjy_9LMJpj>`_,2022,SyntAct, a synthesized database of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public,Audio,941 MB,German,`SyntAct: A Synthesized Database of Basic Emotions <http://felix.syntheticspeech.de/publications/synthetic_database.pdf>`_,Open,`CC BY-SA 4.0 <https://creativecommons.org/licenses/by/4.0>`_
`MLEnd <https://www.kaggle.com/datasets/jesusrequena/mlend-spoken-numerals>`_,2021,"~32700 audio recordings files produced by 154 speakers. Each audio recording corresponds to one English numeral (from ""zero"" to ""billion"")","Intonations: neutral, bored, excited and question",Audio,2.27 GB,--,--,Open,Unknown
`ASVP-ESD <https://www.kaggle.com/datasets/dejolilandry/asvpesdspeech-nonspeech-emotional-utterances>`_,2021,"~13285 audio files collected from movies, tv shows and youtube containing speech and non-speech.","12 different natural emotions (boredom, neutral, happiness, sadness, anger, fear, surprise, disgust, excitement, pleasure, pain, disappointment) with 2 levels of intensity.",Audio,2 GB,"Chinese, English, French, Russian and others",--,Open,Unknown
`ESD <https://hltsingapore.github.io/ESD/>`_,2021,"29 hours, 3500 sentences, by 10 native English speakers and 10 native Chinese speakers.","5 emotions: angry, happy, neutral, sad, and surprise.","Audio, Text",2.4 GB (zip),"Chinese, English",`Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset <https://arxiv.org/pdf/2010.14794.pdf>`_,Open,Academic License
Expand Down
16 changes: 15 additions & 1 deletion src/ser-datasets.json
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,20 @@
"Paper-link": "https://pubmed.ncbi.nlm.nih.gov/34891601/",
"License-link": "https://creativecommons.org/licenses/by/4.0/"
},
"SyntAct": {
"Year": 2022,
"Content": "SyntAct",
"Emotions": " a synthesized database of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public",
"Format": "Audio",
"Size": "941 MB",
"Language": "German",
"Paper": "SyntAct: A Synthesized Database of Basic Emotions",
"Access": "Open",
"License": "CC BY-SA 4.0",
"Dataset-link": "https://zenodo.org/record/6573016#.ZAjy_9LMJpj",
"Paper-link": "http://felix.syntheticspeech.de/publications/synthetic_database.pdf",
"License-link": "https://creativecommons.org/licenses/by/4.0"
},
"MLEnd": {
"Year": 2021,
"Content": "~32700 audio recordings files produced by 154 speakers. Each audio recording corresponds to one English numeral (from \"zero\" to \"billion\")",
Expand Down Expand Up @@ -587,4 +601,4 @@
"Paper-link": "http://kom.aau.dk/~tb/speech/Emotions/des.pdf",
"License-link": "--"
}
}
}

0 comments on commit 0660df2

Please sign in to comment.