GitHub

Homepage

Is there such a thing as 'Oscar-bait'?

A look into English-speaking movies, award winner and nominees from 1934 to 2017 using data from IMDb, Kaggle and self-curation.

An exercise with a very fruitful and teaching preprocessing part. The multi-million row IMDb datasets, filtered and enhanced with webcrawling will probably inspire more analysis later on.

Read

Link (R Markdown)

Analysis on Taylor Swift - reputation

Using audio data from Spotify and lyrical sentiment analysis using tidyText to compare Taylor Swift's latest album to her previous work

First exercise without a ready-to-go dataset. First time working with APIs and pipes (%>%). Further practice with ggplot2.

Link (R Markdown)

Exploring Retail Sales

Exploratory data analysis on sales of Rossman stores in Germany

A learning practice in visualization using ggplot2, and exploratory data analysis.

Link (R Markdown)

House Pricing Prediction

Predicting sale prices for houses, given labeled data for previousy sold homes.

A regression problem solved using XGBoost & 10-fold cross validation.

Link (R Markdown)

McKinsey Hackathon - Stroke Prediction

Using labeled patient data to predict possibility of a stroke for individual patients.

My submission to McKinsey's 24-hour online hackathon. A probabilistic classification problem solved using conditional random forest. Earned an AUCROC score of 0.847, while the winning submission got 0.860.

Link (R Markdown)

Titanic Survivor Prediction

Using labeled passenger data to predict who survives the disaster

A "Hello World" project to data science. A classification problem with three seperate solutions using logistic regression, decision tree and random forest.

Link (R Markdown)

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
Retail_Prediction_RMD_files/figure-markdown_github		Retail_Prediction_RMD_files/figure-markdown_github
Taylor_Swift_files/figure-markdown_github		Taylor_Swift_files/figure-markdown_github
house_pricing_files/figure-markdown_github		house_pricing_files/figure-markdown_github
mckinsey_healthcare_files		mckinsey_healthcare_files
titanic_files/figure-markdown_github		titanic_files/figure-markdown_github
Retail_Prediction_RMD.md		Retail_Prediction_RMD.md
Taylor_Swift.md		Taylor_Swift.md
_config.yml		_config.yml
average_ABV_countries.html		average_ABV_countries.html
chart_1.png		chart_1.png
chart_2.png		chart_2.png
chart_3.png		chart_3.png
chart_4.png		chart_4.png
house_pricing.R		house_pricing.R
house_pricing.md		house_pricing.md
mckinsey_hc.R		mckinsey_hc.R
mckinsey_healthcare.md		mckinsey_healthcare.md
proper_imdb_notebook_3.html		proper_imdb_notebook_3.html
proper_imdb_notebook_3.md		proper_imdb_notebook_3.md
proptable.JPG		proptable.JPG
readme.md		readme.md
script_titanic.R		script_titanic.R
titanic.md		titanic.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Homepage

Is there such a thing as 'Oscar-bait'?

A look into English-speaking movies, award winner and nominees from 1934 to 2017 using data from IMDb, Kaggle and self-curation.

Read

Analysis on Taylor Swift - reputation

Using audio data from Spotify and lyrical sentiment analysis using tidyText to compare Taylor Swift's latest album to her previous work

Exploring Retail Sales

Exploratory data analysis on sales of Rossman stores in Germany

House Pricing Prediction

Predicting sale prices for houses, given labeled data for previousy sold homes.

McKinsey Hackathon - Stroke Prediction

Using labeled patient data to predict possibility of a stroke for individual patients.

Titanic Survivor Prediction

Using labeled passenger data to predict who survives the disaster

About

Releases

Packages

Languages

yildirimgoks/analyticsportfolio

Folders and files

Latest commit

History

Repository files navigation

Homepage

Is there such a thing as 'Oscar-bait'?

A look into English-speaking movies, award winner and nominees from 1934 to 2017 using data from IMDb, Kaggle and self-curation.

Read

Analysis on Taylor Swift - reputation

Using audio data from Spotify and lyrical sentiment analysis using tidyText to compare Taylor Swift's latest album to her previous work

Exploring Retail Sales

Exploratory data analysis on sales of Rossman stores in Germany

House Pricing Prediction

Predicting sale prices for houses, given labeled data for previousy sold homes.

McKinsey Hackathon - Stroke Prediction

Using labeled patient data to predict possibility of a stroke for individual patients.

Titanic Survivor Prediction

Using labeled passenger data to predict who survives the disaster

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages