Machine Learning

My machine learning code, written in Python.

1. Environment Setup

(1) Install Python 3.5 on Windows 10.
(2) Install IPython 4.0.3.
(3) Install Anaconda, an installer that bundles the common machine learning packages.
(4) Run IPython Notebook and open "http://127.0.0.1:8888" in a browser.

> ipython notebook

2. ML libs/packages

2.1 numpy

2.2 matplotlib

2.3 scipy

2.4 pandas

2.5 seaborn

3. ML algorithms

Samples should be opened with IPython Notebook.

3.1 Supervised Learning

3.1.1 Classification

Linear Model

Linear Discriminant Analysis

Decision Tree

Random Forest

SVM

SVM with different kernels
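
As a rough illustration of comparing kernels, the sketch below fits scikit-learn's `SVC` with four built-in kernels on a toy dataset. The dataset (`make_moons`), the split, and the variable names are assumptions made for this example, not taken from the notebooks.

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Toy two-class dataset that is not linearly separable (illustrative choice).
X, y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# Fit one classifier per kernel and record its test accuracy.
kernel_scores = {}
for kernel in ("linear", "poly", "rbf", "sigmoid"):
    model = SVC(kernel=kernel, gamma="scale")
    model.fit(X_train, y_train)
    kernel_scores[kernel] = model.score(X_test, y_test)

for kernel, accuracy in kernel_scores.items():
    print(f"{kernel:>8}: {accuracy:.3f}")
```

On data like this the non-linear kernels usually do better than the linear one, but the ranking depends on the dataset and on hyperparameters such as `C` and `gamma`.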

Neural Network

Basic Three Layers Network

Gradient Boosting for classification

GradientBoostingClassifier

CNN (Deep Learning)

XGBoost

DBN

RNN

RNN for MNIST

DCNN

Deconvolution network

3.1.2 Regression

Linear Regression
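
A minimal sketch of ordinary least squares with NumPy, assuming a single input feature and synthetic data generated from y = 2x + 1 plus noise. The data and variable names are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: y = 2x + 1 with a little Gaussian noise.
x = rng.uniform(0.0, 10.0, size=100)
y = 2.0 * x + 1.0 + rng.normal(0.0, 0.1, size=100)

# Design matrix with a column of ones so the model can learn an intercept.
A = np.column_stack([x, np.ones_like(x)])

# Solve min ||A w - y||^2 for w = (slope, intercept).
(slope, intercept), *_ = np.linalg.lstsq(A, y, rcond=None)

print(f"slope={slope:.3f}, intercept={intercept:.3f}")
```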

3.2 Unsupervised Learning

3.2.1 HMM

HMM basic
HMM application
- https://www.cnblogs.com/wangzming/p/7512607.html
- https://zhuanlan.zhihu.com/p/20688517
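
For the HMM basics, one core computation is the forward algorithm, which gives the probability of an observation sequence. The sketch below is a plain NumPy version for a discrete HMM; the 2-state, 3-symbol model and all names are hypothetical.

```python
import itertools
import numpy as np

def forward_likelihood(start_prob, trans_prob, emit_prob, observations):
    """Return P(observations) under a discrete HMM (forward algorithm)."""
    # alpha[i] = P(observations so far, current hidden state = i)
    alpha = start_prob * emit_prob[:, observations[0]]
    for obs in observations[1:]:
        # Propagate one step through the transitions, then weight by emission.
        alpha = (alpha @ trans_prob) * emit_prob[:, obs]
    return float(alpha.sum())

# Hypothetical model: 2 hidden states, 3 observable symbols.
start_prob = np.array([0.6, 0.4])
trans_prob = np.array([[0.7, 0.3],
                       [0.4, 0.6]])
emit_prob = np.array([[0.5, 0.4, 0.1],
                      [0.1, 0.3, 0.6]])

likelihood = forward_likelihood(start_prob, trans_prob, emit_prob, [0, 1, 2])

# Sanity check: probabilities of all length-3 sequences should sum to 1.
total = sum(
    forward_likelihood(start_prob, trans_prob, emit_prob, list(seq))
    for seq in itertools.product(range(3), repeat=3)
)
print(likelihood, total)
```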

3.2.2 Cluster

k-Nearest Neighbor

DBSCAN

dbscan with precomputed metric

3.2.3 PCA

pca algorithm
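
One common way to write PCA is through the SVD of the centered data matrix. The sketch below takes that route; it may differ from the notebook's implementation, and the function name `pca` and the synthetic data are assumptions.

```python
import numpy as np

def pca(X, n_components):
    """Project X onto its top principal components using the SVD."""
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are principal directions, ordered by decreasing singular value.
    _, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
    components = Vt[:n_components]
    explained_variance = (S ** 2) / (X.shape[0] - 1)
    ratio = explained_variance[:n_components] / explained_variance.sum()
    return X_centered @ components.T, components, ratio

rng = np.random.default_rng(0)
# Hypothetical data: 200 samples in 3-D with very different spread per axis.
X = rng.normal(size=(200, 3)) * np.array([5.0, 1.0, 0.1])

projected, components, ratio = pca(X, n_components=2)
print(projected.shape, ratio)
```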

3.3 Model evaluation

3.4 Model selection

3.5 Tensorflow

3.5.1 TF Basic

3.5.2 tf.estimator

DNNClassifier set batchsize and epoch

3.5.3 TF models

3.6 Tensorboard

3.6.1 Tensorboard by Tensorflow

3.6.2 Tensorboard by Keras

3.7 keras

3.7.1 basic

3.7.2 models

3.7.3 complex

get middle layer output

3.8 theano

Install Theano on Windows

3.9 Incremental learning

Incremental learning by SGDClassifier partial_fit
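
A small sketch of the idea: `SGDClassifier.partial_fit` updates the model one mini-batch at a time, so the full dataset never has to be in memory at once. The synthetic two-blob data and the batch size of 100 are assumptions for illustration.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)

# Two well-separated Gaussian blobs as a stand-in for streaming data.
X = np.vstack([rng.normal(-2.0, 1.0, size=(500, 2)),
               rng.normal(2.0, 1.0, size=(500, 2))])
y = np.array([0] * 500 + [1] * 500)
order = rng.permutation(len(y))
X, y = X[order], y[order]

X_stream, y_stream = X[:800], y[:800]
X_test, y_test = X[800:], y[800:]

model = SGDClassifier(random_state=0)
classes = np.array([0, 1])  # all labels must be declared on the first call

# Feed the data in mini-batches, as if it arrived over time.
for start in range(0, len(y_stream), 100):
    batch = slice(start, start + 100)
    model.partial_fit(X_stream[batch], y_stream[batch], classes=classes)

accuracy = model.score(X_test, y_test)
print(f"accuracy after incremental training: {accuracy:.3f}")
```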

3.10 outlier detection

IsolationForest
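
A minimal sketch of outlier detection with scikit-learn's `IsolationForest`, assuming synthetic 2-D data with five hand-placed outliers. The `contamination` value is a guess at the outlier fraction, not a recommended setting.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)

# 200 inliers around the origin plus 5 points placed far away.
inliers = rng.normal(0.0, 1.0, size=(200, 2))
outliers = np.array([[8.0, 8.0], [-8.0, 8.0], [8.0, -8.0],
                     [-8.0, -8.0], [10.0, 0.0]])
X = np.vstack([inliers, outliers])

model = IsolationForest(n_estimators=100, contamination=0.05, random_state=0)
labels = model.fit_predict(X)  # +1 for inliers, -1 for outliers

print("flagged as outliers:", int((labels == -1).sum()))
```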

3.11 sklearn

3.12 jupyter

3.13 mxnet

3.13.1 NDArray

3.13.2 Basic

4. Feature Engineering

4.1 Working With Text Data

Extract three types of text features: bag of words, TF, and TF-IDF
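
The three feature types can be sketched with scikit-learn as below. The two toy documents are made up, and using `TfidfTransformer(use_idf=False, norm="l1")` for plain term frequency is one possible choice rather than necessarily the one used in the notebook.

```python
from sklearn.feature_extraction.text import (
    CountVectorizer, TfidfTransformer, TfidfVectorizer)

docs = ["the cat sat", "the cat sat on the mat"]

# 1) Bag of words: raw term counts per document.
count_vectorizer = CountVectorizer()
bow = count_vectorizer.fit_transform(docs)

# 2) TF: counts normalised so each document's row sums to 1 (IDF disabled).
tf = TfidfTransformer(use_idf=False, norm="l1").fit_transform(bow)

# 3) TF-IDF: term frequency reweighted by inverse document frequency.
tfidf = TfidfVectorizer().fit_transform(docs)

vocab = count_vectorizer.vocabulary_  # maps each term to its column index
print(sorted(vocab))
print(bow.toarray())
print(tf.toarray())
print(tfidf.toarray())
```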

4.2 String Hash

4.3 Normalization
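
Two common normalization schemes, sketched with NumPy on a made-up feature matrix: min-max scaling to [0, 1] and z-score standardization. Both are applied per column.

```python
import numpy as np

# Hypothetical feature matrix: two columns on very different scales.
X = np.array([[1.0, 200.0],
              [2.0, 400.0],
              [3.0, 600.0]])

# Min-max scaling: each column is mapped to the range [0, 1].
x_min, x_max = X.min(axis=0), X.max(axis=0)
X_minmax = (X - x_min) / (x_max - x_min)

# Z-score standardization: each column gets zero mean and unit variance.
X_zscore = (X - X.mean(axis=0)) / X.std(axis=0)

print(X_minmax)
print(X_zscore)
```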

4.4 Feature selection

4.5 Imbalanced data processing

RandomOverSampler for Imbalance Data
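
The idea behind random over-sampling is to duplicate randomly chosen minority-class samples until every class is as large as the biggest one. The sketch below shows that idea in plain NumPy; it is not the imbalanced-learn `RandomOverSampler` API, and `random_oversample` is a hypothetical helper.

```python
import numpy as np

def random_oversample(X, y, seed=0):
    """Resample with replacement so every class matches the largest class."""
    rng = np.random.default_rng(seed)
    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()
    keep = []
    for label, count in zip(classes, counts):
        idx = np.flatnonzero(y == label)
        # Draw the missing number of samples from this class, with replacement.
        extra = rng.choice(idx, size=target - count, replace=True)
        keep.append(np.concatenate([idx, extra]))
    keep = np.concatenate(keep)
    return X[keep], y[keep]

# Toy imbalanced dataset: 8 samples of class 0, 2 samples of class 1.
X = np.arange(20, dtype=float).reshape(10, 2)
y = np.array([0] * 8 + [1] * 2)

X_res, y_res = random_oversample(X, y)
print(np.bincount(y_res))
```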

4.6 missing values

5. Image process

5.1 OpenCV

5.1.1 OpenCV Python

Installation

Basic

Preprocess

Projects

defect detection

5.1.2 OpenCV CPP

opencv 2.4.9 & windows-7

Init opencv cpp project

5.1.3 Features & Matcher

5.1.4 Geometric Transformations

5.2 Useful features

Image smoothing, shifting, rotation, and zoom with scipy.ndimage
Image enhancement for OCR
Keep gray-image pixel values within [0, 255] when zooming, shifting, or rotating by setting order=1 (commit 220ac520a0d008e74165fe3aace42b93844aedde)
template match
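
The `order=1` note can be illustrated as follows. With linear interpolation every output pixel is a weighted average of neighbouring input pixels (or the zero fill value), so values stay within the input range, whereas higher-order splines can overshoot. The random test image is an assumption made for this sketch.

```python
import numpy as np
from scipy import ndimage

rng = np.random.default_rng(0)
# Hypothetical gray image with values in [0, 255].
image = rng.integers(0, 256, size=(64, 64)).astype(np.float64)

# order=1 selects linear interpolation for all three transforms.
rotated = ndimage.rotate(image, angle=30, reshape=False, order=1)
shifted = ndimage.shift(image, shift=(2.5, -1.5), order=1)
zoomed = ndimage.zoom(image, zoom=1.5, order=1)

# Cubic interpolation (the default, order=3) for comparison.
rotated_cubic = ndimage.rotate(image, angle=30, reshape=False, order=3)

results = {"rotate": rotated, "shift": shifted, "zoom": zoomed,
           "rotate (order=3)": rotated_cubic}
for name, result in results.items():
    print(f"{name:>17}: min={result.min():.2f}, max={result.max():.2f}")
```

If the cubic result falls outside [0, 255], clipping with `np.clip` is another option, at the cost of flattening the overshooting pixels.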

5.3 OCR

5.4 3D graph process

stl file parse

5.5 face_recognition

6. Distributed ML

6.1 Spark

6.1.4 Spark Cluster

6.1.5 MLlib

6.1.6 Spark on AWS EMR

Steps to create and run Spark code on EMR

6.2 Hadoop

6.2.1 Environment Setup

6.2.2 Run Hadoop's bundled example in standalone mode

hadoop example

6.2.3 HDFS

Basic HDFS operations on a single-node cluster

6.2.4 mrjob

Word count as a basic mrjob MapReduce example
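
To show the map, shuffle, and reduce phases behind a word count, here is a pure-Python sketch that runs in a single process. It does not use mrjob, and the function names are hypothetical; with mrjob the mapper and reducer would be methods on a job class instead.

```python
from collections import defaultdict

def mapper(line):
    """Map phase: emit (word, 1) for every word in a line."""
    for word in line.lower().split():
        yield word, 1

def reducer(word, counts):
    """Reduce phase: sum all counts emitted for one word."""
    return word, sum(counts)

def word_count(lines):
    # Shuffle phase: group the mapper output by key.
    grouped = defaultdict(list)
    for line in lines:
        for word, one in mapper(line):
            grouped[word].append(one)
    return dict(reducer(word, ones) for word, ones in grouped.items())

counts = word_count(["hello world", "hello map reduce"])
print(counts)
```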

7. NLP

7.1 nltk

7.2 word2vec

7.3 Others

7.4 keyword & abstract extraction

from Chinese text

7.5 gensim

7.6 AllenNLP

7.7 Spacy

7.8 gensim

7.9 keras-bert

Get text embeddings by pretrained BERT model

7.10 wordcloud

plot wordcloud basic

7.11 wordnet

wordnet basic and environment setup

7.12 NER

BiLSTM-CRF-NER

7.13 LDA

LDA of sklearn

8. Audio

8.1 pyAudioAnalysis

8.2 signal data augmentation

add gaussian noise
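
A minimal sketch of this augmentation, assuming the noise level is specified as a target signal-to-noise ratio in dB. The helper `add_gaussian_noise`, the 440 Hz test tone, and the 16 kHz sample rate are assumptions for illustration.

```python
import numpy as np

def add_gaussian_noise(signal, snr_db, seed=0):
    """Return signal plus white Gaussian noise at roughly the given SNR."""
    rng = np.random.default_rng(seed)
    signal_power = np.mean(signal ** 2)
    # SNR(dB) = 10 * log10(signal_power / noise_power)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise

# One second of a 440 Hz sine wave sampled at 16 kHz.
t = np.linspace(0.0, 1.0, 16000, endpoint=False)
clean = np.sin(2.0 * np.pi * 440.0 * t)

noisy = add_gaussian_noise(clean, snr_db=20.0)

# Measure the SNR actually achieved by this noise draw.
measured_snr_db = 10.0 * np.log10(
    np.mean(clean ** 2) / np.mean((noisy - clean) ** 2))
print(f"measured SNR: {measured_snr_db:.2f} dB")
```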

9. GPU

10. Video

11. recommendation system

11.1 surprise

12. other machine learning related algorithm

13. Small project/features

14. related tools

14.1 conda

15. front-end AI

15.1 JS access camera

js open camera and take photo

15.2 face-api.js

run face-api.js examples

16. D3.js

17. LOFO

get feature importance by LOFO and FastLOFO
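
The leave-one-feature-out idea is to drop each feature in turn, re-evaluate the model with cross-validation, and treat the drop in score as that feature's importance. The sketch below implements this with scikit-learn only; it is not the LOFO library's API, and `lofo_importance` and the synthetic data are hypothetical.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def lofo_importance(model, X, y, cv=5):
    """Importance of feature j = baseline CV score - CV score without j."""
    baseline = cross_val_score(model, X, y, cv=cv).mean()
    importances = []
    for j in range(X.shape[1]):
        X_without = np.delete(X, j, axis=1)
        score = cross_val_score(model, X_without, y, cv=cv).mean()
        importances.append(baseline - score)
    return np.array(importances)

rng = np.random.default_rng(0)
# Synthetic data: only feature 0 carries signal, features 1 and 2 are noise.
X = rng.normal(size=(300, 3))
y = (X[:, 0] > 0).astype(int)

importances = lofo_importance(LogisticRegression(), X, y)
print(importances)
```

An importance near zero, or below zero, suggests the model does as well or better without that feature.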

18. vaderSentiment

sentiment analysis by the lib

19. offline deployment

Build an sklearn model and .py files into an ELF executable

20. keras-nlp

keras-nlp dataset and classifier basic demo
