HarryQin99/Data-Linkage-And-Classification

Data_Linkage_And_Classification

My Data_Linkage and Classification Project

Main skills used in this project

  • Data Linkage on the stocks of Google and Amazon based on their name, description and price
  • Using libraries such as fuzzywuzzy and textdistance for the data linkage
  • Using blocking to make the linkage step more efficient and more accurate
  • Comparing the accuracy of three classification algorithms: decision tree, k-NN (n = 5) and k-NN (n = 10)
  • Feature engineering and selection
    • Interaction term pairs and Clustering label
    • Principal Component Analysis
    • Naively choosing the first four features
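
As a rough illustration of how blocking and fuzzy name matching fit together, here is a minimal sketch. It is not the project's code: the records are made up, the blocking key (first word of the name) is an assumed choice, and the standard library's `difflib.SequenceMatcher` stands in for the fuzzywuzzy and textdistance scorers so the example runs without extra installs. Only names are compared; description and price are left out.

```python
from collections import defaultdict
from difflib import SequenceMatcher

# Hypothetical records as (id, name); not taken from the project's data.
amazon = [
    ("a1", "Adobe Photoshop Elements 5.0"),
    ("a2", "Microsoft Office 2007 Home"),
    ("a3", "Norton Antivirus 2007"),
]
google = [
    ("g1", "adobe photoshop elements 5"),
    ("g2", "microsoft office home 2007"),
    ("g3", "quicken deluxe 2007"),
]


def block_key(name):
    """Blocking key: first word of the name, lowercased (an assumed choice)."""
    return name.lower().split()[0]


def similarity(a, b):
    """Stand-in for a fuzzy string ratio, in the range 0..1."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def link(left, right, threshold=0.7):
    """Return (left_id, right_id, score) for the best match in each block."""
    # Group right-hand records by block so only same-block pairs are scored.
    blocks = defaultdict(list)
    for rec in right:
        blocks[block_key(rec[1])].append(rec)

    matches = []
    for left_id, left_name in left:
        candidates = blocks.get(block_key(left_name), [])
        if not candidates:
            continue  # no record shares this block, so nothing to compare
        best = max(candidates, key=lambda rec: similarity(left_name, rec[1]))
        score = similarity(left_name, best[1])
        if score >= threshold:
            matches.append((left_id, best[0], round(score, 2)))
    return matches


matches = link(amazon, google)
for left_id, right_id, score in matches:
    print(left_id, right_id, score)
```

Blocking cuts the number of comparisons because records are only scored against others with the same key. It also removes some false positives, since records in different blocks can never be linked. The trade-off is that a poorly chosen key can separate true matches.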

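The k-NN comparison with n = 5 and n = 10 can be sketched in a few lines. This is only an illustration under stated assumptions: the data are synthetic two-class points, the split is a simple hold-out, and the classifier is a plain standard-library k-NN rather than whatever library the project used. The decision tree is omitted to keep the sketch short.

```python
import math
import random
from collections import Counter


def knn_predict(train, query, n_neighbors):
    """Majority vote among the n_neighbors training points closest to query."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:n_neighbors]
    votes = Counter(label for _, label in nearest)
    # On a tied vote, the label seen first among the nearest neighbours wins.
    return votes.most_common(1)[0][0]


def accuracy(train, test, n_neighbors):
    """Fraction of test points whose predicted label equals the true label."""
    correct = sum(knn_predict(train, x, n_neighbors) == y for x, y in test)
    return correct / len(test)


# Synthetic data (an assumption for illustration): two Gaussian clusters.
rng = random.Random(0)


def make_points(center, label, count):
    return [
        ((rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0)), label)
        for _ in range(count)
    ]


data = make_points((0.0, 0.0), "low", 60) + make_points((3.0, 3.0), "high", 60)
rng.shuffle(data)
train, test = data[:90], data[90:]  # simple hold-out split

for n in (5, 10):
    print(f"k-NN (n = {n}): accuracy = {accuracy(train, test, n):.2f}")
```

A larger n smooths the decision boundary, which can help on noisy data and hurt when classes are small or tightly interleaved, so comparing both values on held-out data is a reasonable check.
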