Octave-based machine-learning routines
```shell
# ensure Octave is installed
# get the code
$ git clone https://github.com/partharamanujam/octave-ml.git
# include .../octave-ml/octavelib in the Octave path
# now check the examples folder
```
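To make the library available, its directory can be added to the Octave search path from within Octave. A minimal sketch (the clone location shown is an assumption; adjust it to wherever you cloned the repository):

```octave
% add the library to the Octave search path
% (the clone location below is an assumption -- use your actual path)
addpath('~/octave-ml/octavelib');
savepath;  % optional: persist the path across Octave sessions
```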
- Normal Equation - Linear Regression
- Gradient Descent - Linear Regression
- Gradient Descent - Logistic Regression
- Support Vector Machines - Classification
- Neural Networks - Classification: ToDo
- K-Means - Clustering
- Anomaly Detection - (Multivariate) Gaussian Distribution
- Recommender Systems - Collaborative filtering (Low Rank Matrix Factorization)
- Feature Scaling/Normalization
- Principal Component Analysis (Dimensionality Reduction)
- Octave 3.6.4 or above - https://www.gnu.org/software/octave/download.html
- Octave-Forge packages - http://octave.sourceforge.net
  - specfun
  - image
- LIBSVM for Octave - http://www.csie.ntu.edu.tw/~cjlin/libsvm
A collection of commonly used machine-learning and support routines implemented in Octave, intended as a starting point for more advanced work.
See examples folder for usage.
Features are inputs from the training-set to be used for machine-learning. This is usually represented by the matrix-variable "X".
Outputs refer to the actual/known results corresponding to the input-features from the training-set. This is usually represented by the vector-variable "y".
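As a minimal illustration with made-up numbers (not data from this repository), a training set of three examples with two features each, plus the usual bias column of ones, could look like:

```octave
% three training examples, two features each (values are made up)
X = [1 2104 3;    % bias term, size, number of rooms
     1 1416 2;
     1 1534 3];
y = [400; 232; 315];  % known outputs for each training example
```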
Bias refers to error from erroneous assumptions in the learning algorithm, and Variance refers to error from sensitivity to small fluctuations in the training set. For more details, refer to the Bias-Variance Tradeoff.
Theta refers to the hypothesis of coefficients/parameters that map/fit the input-features to the output-results. This is usually represented by the vector-variable "theta" (or Theta).
Lambda is the regularization parameter used to control over-fitting of the parameters. This is usually represented by the scalar-variable "lambda".
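As a sketch of how lambda enters the picture, the standard regularized linear-regression cost can be written as below. This is the textbook formula, not necessarily a routine from octavelib, and the function name is illustrative:

```octave
% regularized linear-regression cost;
% theta(1), the bias term, is conventionally not regularized
function J = reg_cost(X, y, theta, lambda)
  m = length(y);                        % number of training examples
  err = X * theta - y;                  % prediction errors
  reg = lambda * sum(theta(2:end).^2);  % penalty, excluding the bias term
  J = (err' * err + reg) / (2 * m);
end
```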
Feature scaling/normalization is the process of modifying the input-features to allow for better fitting. This is usually done using a combination of mean (represented by parameter mu), and standard-deviation (represented by parameter sigma). Note that the bias-term is usually not scaled/normalized.
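A minimal sketch of mean/standard-deviation scaling (the function name here is illustrative, not necessarily the one used in octavelib; the bias column should be excluded before calling it):

```octave
% scale each feature column to zero mean and unit variance
function [Xnorm, mu, sigma] = normalize_features(X)
  mu = mean(X);               % per-column means
  sigma = std(X);             % per-column standard deviations
  sigma(sigma == 0) = 1;      % avoid division by zero for constant columns
  Xnorm = (X - mu) ./ sigma;  % automatic broadcasting (Octave 3.6+)
end
```

The same mu and sigma computed from the training set must be reused when scaling any new inputs at prediction time.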
Estimated-value refers to the predicted value for a given set of input-features using previously computed theta from the training-set. This is usually represented by the variable "p".
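For linear regression, for example, the estimated value is just a matrix product of the inputs with the fitted theta. A sketch with made-up numbers (the new inputs are assumed to already include the bias column and to be normalized with the training-set mu and sigma):

```octave
theta = [1; 2];      % illustrative previously-computed coefficients
Xnew  = [1 3; 1 5];  % two new examples, bias term included
p = Xnew * theta;    % estimated values: [7; 11]
```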
This section provides a list of various machine-learning and support routines available. For more detailed information, please look at the embedded documentation using 'help' for the specific routine.
- This code: MIT
- Porter-Stemmer: BSD
- fmincg: © Copyright 1999, 2000 & 2001, Carl Edward Rasmussen
- Examples data: Courtesy ML-007 class