SA-using-SMOTE-and-Stacking-deep-learning-model

Sentiment Analysis of Imbalanced Dataset using SMOTE

The Sentiment Analysis of Imbalanced Datasets (in English, Arabic, and Moroccan Dialect) using SMOTE is novel method based on BERT embedding (for each language) and stacked deep learning algorithms, namely: LSTM, BiLSTM, GRU, and CNN that provide state-of-the-art results on the imbalanced dialect dataset in terms of accuracy.

Datasets

We used three datasets in English, Arabic, and Moroccan Dialect as shown in the following table:

Language	Dataset	positive	negative
English	Reviews	27411	8189
Arabic	Arabic_tweets	835	1253
Moroccan Dialect	dataset	14628	24932

Python comparability

This code is compatible with python 3.x. If python 3 is not default in your system, please using python3 and pip3 commands instead of python and pip commands.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Arabic_twitter.csv		Arabic_twitter.csv
README.md		README.md
SA Arabic using SMOTE.py		SA Arabic using SMOTE.py
SA Dialect using SMOTE.py		SA Dialect using SMOTE.py
SA English using SMOTE.py		SA English using SMOTE.py
Tweets.csv		Tweets.csv
dataset.csv		dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SA-using-SMOTE-and-Stacking-deep-learning-model

Datasets

Python comparability

About

Releases

Packages

Languages

nassera2014/SA-using-SMOTE

Folders and files

Latest commit

History

Repository files navigation

SA-using-SMOTE-and-Stacking-deep-learning-model

Datasets

Python comparability

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages