-
Notifications
You must be signed in to change notification settings - Fork 0
/
Na1.txt
74 lines (51 loc) · 2.09 KB
/
Na1.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
Notes - 1
Title:- Data Science
Learning Path:
1. Statistics
a) Basic Statistics
b) Advanced Statistics
2. Analytics
a) Basic
b) Advanced
3. Machine Learning ( R and Python )
4. Visualization Tools ( Tableaue )
5. Databases
Introduction
{{{{{{{
Here We Start:
Data Science is a scientific method, process, systems to extract knowledge , insights from data.
This Data can be either structured or unstructured.
Data Science = Maths + Statistics
Data Science Specialization :
1. Speech Analysis
2. Text Analysis(Natural Language Processing)
3. Image Processing
4. Video Processing
5. Medicine Simulation
6. Material Simulation
Work flow of a Data Scientist:
1. Understand the Business Problem
( Understand the Business Problem by why's , or interviews, questionaires with client)
2. Data Acquisition
( Acquire the require data to solve the business problem, Data can be Webserver logs, DB Data, Online Repositories etc..)
3. Data Preparation
( Data cleaning- Removes inconsistent Data Type, Misspelled Attributes, Missing& Duplicate Values
Data Transformation- Modifies Data using defined Mapping Rule , Tools like Talend and Informatica to form complex transformation to understand the data better
)
4. Data Analysis
( Defines and Refines rge selection of feature variable that will be used in the model development.
5. Data Modeling
( Apply ML Algorithm like Decision Treee, apply on Data to identify best fit for Business )
Trains the Models with Training Data set and Test, Select Best ML Model.
ML Prefers python for Modelling the data ).
6. Visulization and Communication
( Create Powerfull reports and Dashboards using Tableaue, Power BI and Qlick View and communicate the customer
about how the business problem is going to be solved with the best fit ML Model )
7. Deploy and Test and Maintain ML Model
( Deploy and Test ML and Model and Maintains the Model )
}}}}}}
Upcoming Lessons- Note -2 :-
Learning Path mentioned above starts from here...
Statistics
Basic Statistics
Advanced Statistics..............