This repository contains the data I used to analyze the interests of data scientists, based on the questions they asked on Data Science Stack Exchange (DSSE).
2021_questions.csv
contains data on questions asked on DSSE in 2021.
all_questions.csv
contains data on all questions asked on DSSE between 2021 and 2022.
Python 3.7 or above
pandas - a Python library for data analysis. You can install it by running pip install pandas on the command line, or !pip install pandas in a Jupyter notebook.
matplotlib - a Python library for data visualization. Install it the same way, i.e. pip install matplotlib.
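
If you want to confirm that your environment meets the requirements listed above before opening the data, a small check along these lines may help. It is only a sketch and not part of the original analysis: the helper name check_requirements and the constants REQUIRED_PACKAGES and MIN_PYTHON are made up for illustration, and it uses only the Python standard library.

```python
import sys
from importlib.util import find_spec

# Assumption: these mirror the requirements listed in this README.
REQUIRED_PACKAGES = ("pandas", "matplotlib")
MIN_PYTHON = (3, 7)


def check_requirements(packages=REQUIRED_PACKAGES, min_python=MIN_PYTHON):
    """Report whether the Python version and required packages are available.

    Hypothetical helper for illustration; it only checks that each package
    can be found, it does not import it or check its version.
    """
    return {
        "python_ok": sys.version_info >= min_python,
        "missing": [name for name in packages if find_spec(name) is None],
    }


if __name__ == "__main__":
    report = check_requirements()
    if not report["python_ok"]:
        print("Python 3.7 or above is required.")
    for name in report["missing"]:
        print(f"{name} is not installed. Try: pip install {name}")
    if report["python_ok"] and not report["missing"]:
        print("All requirements are satisfied.")
```

Any package reported as missing can be installed with the pip commands given above.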