diff --git a/README.md b/README.md index 724b131..55a1f03 100644 --- a/README.md +++ b/README.md @@ -1,8 +1,9 @@ -***Spoken Emotion Recognition Datasets:*** *A collection of datasets (count=39) for the purpose of emotion recognition/detection in speech. +***Spoken Emotion Recognition Datasets:*** *A collection of datasets (count=40) for the purpose of emotion recognition/detection in speech. The table is chronologically ordered and includes a description of the content of each dataset along with the emotions included.* | Dataset | Year | Content | Emotions | Format | Size | Language | Paper | Access | License | |---------------------------------------------------------------------------------------------------|-----------------|-------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------|---------------------|-------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------|----------------------------------------------------------------------------------------------| +| [MESD] | 2022 | 864 audio files of single-word emotional utterances with Mexican cultural shaping. | 6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness. | Audio | 86 MB | Spanish (Mexican) | [The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning] | Open | [CC BY 4.0] | |[ASVP-ESD] | 2021 | ~13285 audio files collected from movies, tv shows and youtube containing speech and non-speech. | 12 different natural emotions (boredom, neutral, happiness, sadness, anger, fear, surprise, disgust, excitement, pleasure, pain, disappointment) with 2 levels of intensity. | Audio | 2 GB | Chinese, English, French, Russian and others | -- | Open access | Unknown | | [ESD] | 2021 | 29 hours, 3500 sentences, by 10 native English speakers and 10 native Chinese speakers. | 5 emotions: angry, happy, neutral, sad, and surprise. | Audio, Text | 2.4 GB (zip) | English, Chinese | [Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset] | Open access | Available under an Academic License | | [MuSe-CAR] | 2021 | 40 hours, 6,000+ recordings of 25,000+ sentences by 70+ English speakers (see db link for details). | continuous emotion dimensions characterized using valence, arousal, and trustworthiness. | Audio, Video, Text | 15 GB | English | [The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements] | Restricted access | Available under an Academic License & Commercial License | @@ -71,6 +72,7 @@ The table is chronologically ordered and includes a description of the content o [//]: # (datasets) +[MESD]: https://data.mendeley.com/datasets/cy34mh68j9/5 [ASVP-ESD]: https://www.kaggle.com/datasets/dejolilandry/asvpesdspeech-nonspeech-emotional-utterances [ESD]: https://hltsingapore.github.io/ESD/ [MuSe-CAR]: https://zenodo.org/record/4134758 @@ -128,6 +130,7 @@ The table is chronologically ordered and includes a description of the content o [//]: # (papers) +[The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning]: https://pubmed.ncbi.nlm.nih.gov/34891601/ [Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset]: https://arxiv.org/pdf/2010.14794.pdf [The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements]: https://arxiv.org/pdf/2101.06053.pdf [The MSP-Conversation Corpus]: http://www.interspeech2020.org/index.php?m=content&c=index&a=show&catid=290&id=684 diff --git a/src/index.rst b/src/index.rst index d5ac7c5..bc8964e 100644 --- a/src/index.rst +++ b/src/index.rst @@ -45,6 +45,7 @@ However, we cannot guarantee that all listed links are up-to-date. Read more in .. datasets +.. _`MESD`: https://data.mendeley.com/datasets/cy34mh68j9/5 .. _`ASVP-ESD`: https://www.kaggle.com/datasets/dejolilandry/asvpesdspeech-nonspeech-emotional-utterances .. _`ESD`: https://hltsingapore.github.io/ESD/ .. _`MuSe-CAR`: https://zenodo.org/record/4134758 @@ -102,6 +103,7 @@ However, we cannot guarantee that all listed links are up-to-date. Read more in .. papers +.. _`The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning`: https://pubmed.ncbi.nlm.nih.gov/34891601/ .. _`Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset`: https://arxiv.org/pdf/2010.14794.pdf .. _`The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements`: https://arxiv.org/pdf/2101.06053.pdf .. _`The MSP-Conversation Corpus`: http://www.interspeech2020.org/index.php?m=content&c=index&a=show&catid=290&id=684 diff --git a/src/ser-datasets.csv b/src/ser-datasets.csv index 247480e..8393493 100644 --- a/src/ser-datasets.csv +++ b/src/ser-datasets.csv @@ -1,4 +1,5 @@ "Dataset","Year","Content","Emotions","Format","Size","Language","Paper","Access","License" +"`MESD`_","2022","864 audio files of single-word emotional utterances with Mexican cultural shaping.","6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness.","Audio","86 MB","Spanish (Mexican)","`The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning`_","Open","`CC BY 4.0`_" "`ASVP-ESD`_","2021","~13285 audio files collected from movies, tv shows and youtube containing speech and non-speech.","12 different natural emotions (boredom, neutral, happiness, sadness, anger, fear, surprise, disgust, excitement, pleasure, pain, disappointment) with 2 levels of intensity.","Audio","2 GB","Chinese, English, French, Russian and others","--","Open","Unknown" "`ESD`_","2021","29 hours, 3500 sentences, by 10 native English speakers and 10 native Chinese speakers.","5 emotions: angry, happy, neutral, sad, and surprise.","Audio, Text","2.4 GB (zip)","English, Chinese","`Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset`_","Open","Academic License" "`MuSe-CAR`_","2021","40 hours, 6,000+ recordings of 25,000+ sentences by 70+ English speakers (see db link for details).","continuous emotion dimensions characterized using valence, arousal, and trustworthiness.","Audio, Video, Text","15 GB","English","`The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements`_","Restricted","Academic License & Commercial License"