Politics and public opinion. You can read the full version of the report here.
Original dataset:Zhai, Yujia, 2020, "Weibo COVID dataset", https://doi.org/10.7910/DVN/DULFFJ, Harvard Dataverse, V1
Chinese emotional dataset used for sentiment analysis, imported in code folder as 情感词汇本体.xlsx.
1_datafile_stream_processing-jsonTocsv.ipynb: Convert .json datafile to pandas dataframe and store as .csv files.
2.1_data_stream_processing-sampling.ipynb: Randomly sample 1% of the original sample for stage 1 explorative pilot analysis.
2.2_sentiment_analysis_topic_modeling.Rmd: Text tokenisation, sentimentt analysis, topic modeling of the sampled data in R. Visualisation included.
3.1_data_stream_processing-sub topic.ipynb: Filter sub-datasets with key words from the original dataset.
3.2_subtopic_sentiment_analysis.Rmd: An automated function conducting text processing and sentiment analysis for the sub-topic datasets. Results exported for visualisation in Tableau.
Sample data:
word_freq.rds: Cleaned word frequency by post and by date, from sampled data.
![sentiment index](https://private-user-images.githubusercontent.com/55522464/373831718-68f53705-4301-41b8-bd35-11dc6660ccf1.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0MzcxMjYsIm5iZiI6MTczOTQzNjgyNiwicGF0aCI6Ii81NTUyMjQ2NC8zNzM4MzE3MTgtNjhmNTM3MDUtNDMwMS00MWI4LWJkMzUtMTFkYzY2NjBjY2YxLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDA4NTM0NlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWMzYTkwOGJkMjgxYzM4N2Y1ZmMzZTgwOTZkNzQyMGI3MGJmYzNmZWU0ZmZjYmViMDM4NTE5NTJmMmI4OGVkOGMmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.CtClNiZwYBWnaohmHY3mwFLwGd5_ppLcJSaiFdkNkf8)
![Sentiment trend](https://private-user-images.githubusercontent.com/55522464/373831721-2c427768-f39d-4f1d-9313-675718ec38be.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0MzcxMjYsIm5iZiI6MTczOTQzNjgyNiwicGF0aCI6Ii81NTUyMjQ2NC8zNzM4MzE3MjEtMmM0Mjc3NjgtZjM5ZC00ZjFkLTkzMTMtNjc1NzE4ZWMzOGJlLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDA4NTM0NlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTJmZGJmMTJmYjBiMzg4ZmJmNTMyMzlhY2Y5MWM0Y2QyOTFhYzY1NDIxNmNhMjA5ODNmOTUzZDUzMmI5ZGM3ODUmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.LrwjKlMMopoANkQs3dSMPuF975sc2Oon0B33O9PVVxg)
![Topic](https://private-user-images.githubusercontent.com/55522464/373831730-0e3c582b-8c4c-4a1a-8f95-0e0fb7fff54c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0MzcxMjYsIm5iZiI6MTczOTQzNjgyNiwicGF0aCI6Ii81NTUyMjQ2NC8zNzM4MzE3MzAtMGUzYzU4MmItOGM0Yy00YTFhLThmOTUtMGUwZmI3ZmZmNTRjLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDA4NTM0NlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWQyZTAyY2MzNTFiZjMwZDg4MzcyOTRmYjcxMDhmODhjZGE4NTc5M2EzYjkxNTNhZGFiODI0ODY0YmE1OTM0MGQmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.zNTeQbEJ7O76eXdJREFN7v_apI1fQ8e0p6DO4Kkluv8)