Assignment 2
In this assignment, we mine crime data using machine learning tools (R recommended). We use the City of Chicago’s crime data (one year prior to present), available at https://data.cityofchicago.org/Public-Safety/Crimes-One-year-prior-to-present/x2n5-8w5q/data
- Data Extraction
a) Export the data from https://data.cityofchicago.org/Public-Safety/Crimes-One-year-prior-to-present/x2n5-8w5q/data
b) Import the data into R and view it. (Provide a screenshot)
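One way the import step might look in base R is sketched below. The file name and the column headers are assumptions about the portal's CSV export and are not given in the assignment, so adjust them to match your download. When the file is absent, the sketch falls back to a tiny made-up sample so that it still runs.

```r
# Assumed file name: the portal's CSV export may be named differently on your machine.
csv_path <- "Crimes_-_One_year_prior_to_present.csv"

# Fallback so the sketch runs without the real export: a tiny made-up sample.
# The column headers here are assumptions about the export, not taken from the assignment.
if (!file.exists(csv_path)) {
  csv_path <- tempfile(fileext = ".csv")
  writeLines(c(
    "CASE#,DATE OF OCCURRENCE,PRIMARY DESCRIPTION,SECONDARY DESCRIPTION",
    "JA100001,03/18/2023 10:30:00 PM,THEFT,RETAIL THEFT",
    "JA100002,03/19/2023 09:15:00 AM,BATTERY,SIMPLE"
  ), csv_path)
}

# read.csv() rewrites headers into valid R names, so "PRIMARY DESCRIPTION"
# becomes PRIMARY.DESCRIPTION.
crime.data <- read.csv(csv_path, stringsAsFactors = FALSE)

head(crime.data)    # first rows, printed in the console
dim(crime.data)     # number of rows and columns
# View(crime.data)  # spreadsheet-style viewer in RStudio (interactive sessions only)
```

Keeping `stringsAsFactors = FALSE` leaves text columns as plain character vectors, which tends to make later cleaning simpler.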
- Data Exploration and Preprocessing (Provide Screenshot for all)
a) str() – to view the structure of the dataset. For example, str(crime.data)
b) summary() – to identify the general summary of the data. For example, summary(crime.data)
c) Clean the data (remove duplicate rows and handle missing values).
d) Analyze the fields needed for the visualization you choose (mentioned in Question 3). For example, if you want to plot the most committed crime, analyze the Primary Description and Secondary Description fields.
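The exploration and cleaning steps above could be sketched roughly as follows. The data frame here is a made-up stand-in for the imported export, and the column names (`CASE.`, `PRIMARY.DESCRIPTION`, `SECONDARY.DESCRIPTION`) are assumptions about how `read.csv()` would name the fields. Dropping rows with a missing crime type is only one possible way to handle missing values.

```r
# Made-up stand-in for the imported data; column names are assumed.
crime.data <- data.frame(
  CASE. = c("JA100001", "JA100002", "JA100002", "JA100003", "JA100004"),
  PRIMARY.DESCRIPTION = c("THEFT", "BATTERY", "BATTERY", "", "THEFT"),
  SECONDARY.DESCRIPTION = c("RETAIL THEFT", "SIMPLE", "SIMPLE", "SIMPLE", NA),
  stringsAsFactors = FALSE
)

str(crime.data)      # a) structure: column types and first values
summary(crime.data)  # b) general summary of each column

# c) Cleaning. First treat blank strings as missing values.
crime.data[] <- lapply(crime.data, function(col) {
  if (is.character(col)) col[!is.na(col) & trimws(col) == ""] <- NA
  col
})

# Drop rows that are exact duplicates of an earlier row.
crime.data <- crime.data[!duplicated(crime.data), ]

# Count missing values per column before deciding how to handle them.
colSums(is.na(crime.data))

# One option: keep only rows where the field needed for the plot is present.
crime.clean <- crime.data[!is.na(crime.data$PRIMARY.DESCRIPTION), ]

# d) Analyze the field behind the chosen visualization: crimes per type.
crime.counts <- sort(table(crime.clean$PRIMARY.DESCRIPTION), decreasing = TRUE)
crime.counts
```

On the real export, `colSums(is.na(...))` is worth reading before deleting anything, since removing every row with any missing value may discard records that are still usable for the chosen plot.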
- Data Visualization: (Choose any one) (Provide a screenshot of the plot you choose)
(i) Which type of crime is committed the most
(ii) The time of day with the highest number of crimes
(iii) The day of the week and the month of the year with the highest number of crimes
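As an illustration of option (ii), the sketch below counts crimes per hour and draws a bar plot with base R. The column name `DATE.OF.OCCURRENCE` and the timestamp format (`MM/DD/YYYY HH:MM:SS AM/PM`) are assumptions about the export, and the four rows are made up. The same `table()` and `barplot()` pattern would apply to options (i) and (iii).

```r
# Made-up sample; the column name and timestamp format are assumptions.
crime.data <- data.frame(
  DATE.OF.OCCURRENCE = c("03/18/2023 10:30:00 PM", "03/18/2023 11:05:00 PM",
                         "03/19/2023 09:15:00 AM", "04/02/2023 10:45:00 PM"),
  stringsAsFactors = FALSE
)

# Parse the timestamps; %I and %p read a 12-hour clock with AM/PM.
# Only clock and calendar fields are extracted, so UTC is used as a neutral zone.
occurred <- as.POSIXct(crime.data$DATE.OF.OCCURRENCE,
                       format = "%m/%d/%Y %I:%M:%S %p", tz = "UTC")

# Counts per hour of day, per weekday, and per month.
hour.counts    <- table(format(occurred, "%H"))
weekday.counts <- table(weekdays(occurred))
month.counts   <- table(months(occurred))

# Bar plot for option (ii); pass weekday.counts or month.counts for option (iii).
barplot(hour.counts,
        main = "Crimes by hour of day",
        xlab = "Hour (00-23)",
        ylab = "Number of crimes")

# Hour with the most recorded crimes in this sample.
busiest.hour <- names(hour.counts)[which.max(hour.counts)]
busiest.hour
```

Weekday and month names returned by `weekdays()` and `months()` depend on the session locale, so labels may differ between machines.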
For more information, please refer to Sengupta, A., Kumar, M., & Upadhyay, S. (n.d.). Crime Analyses Using R 13.1 Introduction. Retrieved from https://data.cityof
Overview
Basic Concepts
- What is Data Mining?
- Data mining goals
- Data objects and statistical concepts
- Machine Learning techniques
- Applications
- Related Technologies
Machine Learning Algorithms
- Association rules
- Classification
- Prediction
- Clustering
Machine Learning tool tutorials
Assignment
Advanced Topics
Data warehouse and OLAP