Slides, code, and data for my presentation at the CSS workshop on 2020/10/01.
Brief descriptions of the files:
scraping.ipynb
: code for scraping the GitHub issue pages
analysis_code0.ipynb
: code for the first part of the analyses (TF-IDF)
analysis_code1.Rmd
: code for the second part of the analyses (primacy effect)
quick_analysis_1001.ipynb
: code for the slide about what happened this week
scraped_data.pkl and scraped_data_1001
: data produced by running scraping.ipynb and quick_analysis_1001.ipynb, respectively
processed_data.pkl and processed_data.csv
: data files processed for the main analysis
Slides.pdf
: the slides for the presentation
The versions of the Python and R packages are listed in each Jupyter Notebook and R Markdown file. I have not tested the code with any versions other than those listed, so you may run into issues with other versions. Note that the Jupyter Notebooks may list different versions of the same package because I was running several virtual environments.
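To compare your own environment against the versions listed in the notebooks, a small check along these lines may help. This is only a sketch, assuming Python 3.8 or later (for `importlib.metadata`); the function name `installed_versions` and the package list in the usage lines are illustrative and not taken from the notebooks.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Return {package name: installed version, or None if not installed}."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # Package is not present in the active environment.
            found[name] = None
    return found


if __name__ == "__main__":
    # Hypothetical package list; replace it with the packages named in each notebook.
    for name, ver in installed_versions(["selenium", "pandas"]).items():
        print(f"{name}: {ver or 'not installed'}")
```

Running this inside each virtual environment makes it easier to see where the installed versions differ from the ones a notebook lists.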
You will need to install Chrome and download the ChromeDriver release that matches your OS and Chrome version for Selenium to work properly.
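Since a mismatched ChromeDriver can keep Selenium from starting, it may be worth comparing major versions before running scraping.ipynb. The sketch below is an illustration under assumptions, not part of the repository: it parses version strings of the kind printed by `chromedriver --version` and by Chrome's `--version` flag, and the function names and sample strings are made up for the example.

```python
import re


def major_version(version_output):
    """Extract the major version from text such as 'ChromeDriver 85.0.4183.87'."""
    match = re.search(r"(\d+)\.\d+\.\d+\.\d+", version_output)
    if match is None:
        raise ValueError(f"no version number found in {version_output!r}")
    return int(match.group(1))


def versions_match(chrome_output, driver_output):
    """Return True when Chrome and ChromeDriver share the same major version."""
    return major_version(chrome_output) == major_version(driver_output)


# Illustrative strings only; substitute the output from your own machine.
chrome = "Google Chrome 85.0.4183.121"
driver = "ChromeDriver 85.0.4183.87"
print(versions_match(chrome, driver))  # prints True
```

Comparing only the major version is a simplification chosen for this sketch; consult the ChromeDriver download page for the exact pairing it recommends for your Chrome build.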
The slides were created using Google Slides.