Distilling-Deep-Networks

The goal of this project is to create a Python decorator that can be plugged into a model (a PyTorch model that follows a specific configuration for defining its layers). The decorated class inherits from the model and adds the attributes and methods needed to keep track of the training statistics and to build an interactive dashboard for exploring them.
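As a rough illustration of the decorator idea, the sketch below wraps a model class in a subclass that records per-parameter statistics. The names `track_stats`, `history`, `record_stats` and `TinyNet` are hypothetical and chosen only for this example; they are not taken from the project's code, and the statistics recorded here (mean and standard deviation) are just one possible choice.

```python
import torch
import torch.nn as nn


def track_stats(model_cls):
    """Class decorator (hypothetical): subclass model_cls and add stat tracking."""

    class Tracked(model_cls):
        def __init__(self, *args, **kwargs):
            super().__init__(*args, **kwargs)
            # Maps "<parameter name>/<statistic>" to a list of recorded values
            self.history = {}

        def _log(self, key, value):
            self.history.setdefault(key, []).append(value)

        def record_stats(self):
            """Record mean/std of each parameter and of its gradient, if present."""
            for name, p in self.named_parameters():
                self._log(f"{name}/weight_mean", p.detach().mean().item())
                self._log(f"{name}/weight_std", p.detach().std().item())
                if p.grad is not None:
                    self._log(f"{name}/grad_mean", p.grad.mean().item())
                    self._log(f"{name}/grad_std", p.grad.std().item())

    # Keep the original class name so printouts stay readable
    Tracked.__name__ = model_cls.__name__
    Tracked.__qualname__ = model_cls.__qualname__
    return Tracked


@track_stats
class TinyNet(nn.Module):
    """Small illustrative model; layer sizes are arbitrary."""

    def __init__(self, n_in=4, n_hidden=8, n_out=2):
        super().__init__()
        self.fc1 = nn.Linear(n_in, n_hidden)
        self.fc2 = nn.Linear(n_hidden, n_out)

    def forward(self, x):
        return self.fc2(torch.relu(self.fc1(x)))


torch.manual_seed(0)
model = TinyNet()
loss = model(torch.randn(16, 4)).pow(2).mean()
loss.backward()
model.record_stats()  # call once per training step to build up the history
```

A dashboard could then read `model.history` and plot each list against the training step.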

Before building the dashboard, it is essential to decide which statistics we are interested in:

Distribution of weights
Distribution of the gradients
Evolution of the ratio of weights and update steps
Implement and try different optimizers to compare the statistics in different regimes
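The statistics listed above could be collected along these lines. This is a minimal sketch on random data, not the project's implementation: the model, the learning rate and the number of bins are arbitrary, and the ratio is assumed here to mean the norm of the update divided by the norm of the weights before the update.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.randn(32, 4), torch.randn(32, 2)  # random stand-in data

# One list of update/weight ratios per parameter tensor
ratios = {name: [] for name, _ in model.named_parameters()}

for step in range(5):
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model(x), y)
    loss.backward()

    # Snapshot the weights so the applied update can be measured afterwards
    before = {name: p.detach().clone() for name, p in model.named_parameters()}
    optimizer.step()

    for name, p in model.named_parameters():
        update = p.detach() - before[name]
        ratio = update.norm() / (before[name].norm() + 1e-12)
        ratios[name].append(ratio.item())

# Distributions of the first layer's weights and gradients as 20-bin histograms
w = model[0].weight.detach().flatten()
g = model[0].weight.grad.detach().flatten()
weight_hist = torch.histc(w, bins=20)
grad_hist = torch.histc(g, bins=20)
```

Swapping `torch.optim.SGD` for another optimizer such as `torch.optim.Adam` leaves the rest of the loop unchanged, which makes it straightforward to compare the same statistics across optimizers.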

Analyze how different combinations of network architecture, activation function, learning_rate and batch_size change the performance at test-time inference, and how the activations and the gradients evolve, to see how they flow during the forward and backward passes respectively.
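One way to observe the flow of activations and gradients is with PyTorch hooks. The sketch below is an assumption about how this could be done, not the project's code: `attach_hooks` is a hypothetical helper that stores each leaf module's output during the forward pass and the gradient with respect to that output during the backward pass.

```python
import torch
import torch.nn as nn


def attach_hooks(model, activations, gradients):
    """Register hooks on every leaf module; return handles for later removal."""
    handles = []
    for name, module in model.named_modules():
        if len(list(module.children())) > 0:
            continue  # skip containers such as nn.Sequential

        def forward_hook(mod, inputs, output, name=name):
            activations[name] = output.detach()
            if output.requires_grad:
                # Tensor hook: called with d(loss)/d(output) during backward
                def save_grad(grad, name=name):
                    gradients[name] = grad.detach()

                output.register_hook(save_grad)

        handles.append(module.register_forward_hook(forward_hook))
    return handles


torch.manual_seed(0)
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
activations, gradients = {}, {}
handles = attach_hooks(model, activations, gradients)

loss = model(torch.randn(16, 4)).pow(2).mean()  # forward pass fills activations
loss.backward()                                 # backward pass fills gradients

for h in handles:
    h.remove()  # detach the hooks once the statistics are collected
```

After one step, `activations` and `gradients` are keyed by module name (`"0"`, `"1"`, `"2"` for this `nn.Sequential`), so their distributions can be compared layer by layer.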

The examples shown are for a simple CNN with one hidden layer. (Note: one hidden layer means two layers of parameters, W1 and W2.)
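To make the note about W1 and W2 concrete, here is a minimal sketch of such a network. The class name, the input size (1x28x28), the number of channels and the number of classes are all assumptions for illustration; the project's actual model may differ.

```python
import torch
import torch.nn as nn


class OneHiddenLayerCNN(nn.Module):
    """Illustrative model: one convolutional hidden layer plus a linear output layer."""

    def __init__(self, in_channels=1, hidden_channels=8, image_size=28, n_classes=10):
        super().__init__()
        # W1: weights of the hidden (convolutional) layer
        self.hidden = nn.Conv2d(in_channels, hidden_channels, kernel_size=3, padding=1)
        # W2: weights of the output layer
        self.out = nn.Linear(hidden_channels * image_size * image_size, n_classes)

    def forward(self, x):
        h = torch.relu(self.hidden(x))
        return self.out(h.flatten(start_dim=1))


model = OneHiddenLayerCNN()
# Collect the weight tensors only (biases excluded): exactly two, W1 and W2
weight_shapes = {
    name: tuple(p.shape)
    for name, p in model.named_parameters()
    if name.endswith("weight")
}
```

Here `weight_shapes` has two entries, `hidden.weight` and `out.weight`, which play the roles of W1 and W2.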

Example of analyzing the weights:

Example of analyzing the gradients:

Example of analyzing the activations:


PabloRR100/Distilling-Deep-Networks
