Vasco

Small machine learning library in C++ (a work very much in progress).

Some background to the theory at http://blog.jamild.com/post/77023959349/from-the-hessian-to-widrow-hoff-calculus-and-machine

Usage (Stochastic and Batch Gradient Descent)

tl;dr: SupervisedData is your training example, the Learner subclasses are the model.

First initialize a vector of type SupervisedData*. This will be your training set.

For each training example, initialize a SupervisedData instance dynamically with the desired number of features (this number should be consistent among one set):
SupervisedData *newData = new SupervisedData(num_features);

Then set the features by passing an array of type float to the method setFeatures:
newData->setFeatures(num_features, features);

Finally, set the supervised training value -- the value that the model "should" result in. For a sigmoid hypothesis or any classification algorithm where there is a binary activation function, this value should be either 1 or 0.
newData->setSupervisedValue(target);

Alternatively, you can call a constructor that does all this in one step:
SupervisedData *newData = new SupervisedData(num_features, features, target);

Now, create your hypothesis function. This is operating under the assumption that the output is a linear combination of the parameters and the features (i.e. sum of all the x-sub-i and theta-sub-i). For example, a sigmoid activation function would be like so: float hyp(float y) { float h = 1 + exp(-1*y); h = 1 / h; return h; }

Now, initialize the learner instance; either StochasticLearner or BatchDescentLearner -- Learner is an abstract data class. Include the number of parameters/features, the vector of SupervisedData s, and the desired learning rate.
StochasticLearner learner(num_features, training_set, learning_rate)

Then, simply call the update method of Learner (perhaps on another thread -- it can be very slow for large data sets or a large number of parameters). This should set the _parameterValues variable.

To get a hypothesis for unknown/untrained data, just initialize a Data instance in the same way as SupervisedData, except without the last argument:
Data *newData = new Data(num_features, features);

and call getHypothesisForData from the learner instance:
learner.getHypothesisForData(newData);

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
.gitignore		.gitignore
BatchDescentLearner.cpp		BatchDescentLearner.cpp
BatchDescentLearner.h		BatchDescentLearner.h
Cluster.cpp		Cluster.cpp
Cluster.h		Cluster.h
Data.cpp		Data.cpp
Data.h		Data.h
INSTALL		INSTALL
Learner.cpp		Learner.cpp
Learner.h		Learner.h
Makefile		Makefile
NeuralNetwork.cpp		NeuralNetwork.cpp
NeuralNetwork.h		NeuralNetwork.h
Neuron.cpp		Neuron.cpp
Neuron.h		Neuron.h
Node.cpp		Node.cpp
Node.h		Node.h
README.md		README.md
StochasticLearner.cpp		StochasticLearner.cpp
StochasticLearner.h		StochasticLearner.h
SupervisedData.cpp		SupervisedData.cpp
SupervisedData.h		SupervisedData.h
iris.txt		iris.txt
test-kmeans-1.cpp		test-kmeans-1.cpp
test-kmeans-2.cpp		test-kmeans-2.cpp
test-neuron-1.cpp		test-neuron-1.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vasco

Usage (Stochastic and Batch Gradient Descent)

About

Releases

Packages

Languages

Jamil/Vasco

Folders and files

Latest commit

History

Repository files navigation

Vasco

Usage (Stochastic and Batch Gradient Descent)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages