Skip to content
This repository has been archived by the owner on Sep 24, 2020. It is now read-only.

Assignment 2

dgupt097 edited this page Oct 31, 2018 · 9 revisions

In this assignment, we have to do the mining of crime data using Machine Learning tools (R recommended). For this, we would utilize the City of Chicago’s crime data (one year prior to present) available at https://data.cityofchicago.org/Public-Safety/Crimes-One-year-prior-to-present/x2n5-8w5q/data

  1. Data Extraction

a) Export the data from https://data.cityofchicago.org/Public-Safety/Crimes-One-year-prior-to-present/x2n5-8w5q/data

b) Import the data to R and view it. (Provide Screenshot)

  1. Data Exploration and Preprocessing (Provide Screenshot for all)

a) str() – to view the structure of the dataset. For example, str(crime.data)

b) summary() – to identify the general summary of the data. For example, summary(crime.data)

c) clean the data (fix duplicate and missing values)

d) analyze the data for the one you want to provide visualization (mentioned in Question 3). For example, if you want to plot for the most committed crime, analyze Primary Description and Secondary description field

  1. Data Visualization: (Choose any 1) (Provide Screenshot of the plot you choose)

(i) What type of crime is committed in the most

(ii) Time of the day that has the most number of crime

(iii) Days of the week and Month of the year that has the highest number of crimes

For more information, please refer Sengupta, A., Kumar, M., & Upadhyay, S. (n.d.). Crime Analyses Using R 13.1 Introduction. Retrieved from https://data.cityof

Clone this wiki locally