You'll really see a pattern in what I'm reading lately...
The typical way to get a sparse model is to train a dense model and drop connections (possibly over multiple iterations). But this approach has two main limitations:

1. The maximum size of the sparse model is limited by the largest dense model that can be trained.
2. A large amount of computation is spent on parameters that are zero-valued, or that will be zero during inference.
In this new approach, called "RigL", a sparse neural network is randomly initialized, and at regularly spaced intervals a fraction of connections is removed based on their magnitudes and new ones are activated using gradient information.
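The drop/grow step above can be sketched roughly as follows. This is an illustrative NumPy sketch, not the authors' implementation: the function name `rigl_update`, the `(weights, mask, grads)` interface, and the `drop_fraction` parameter are my own assumptions about how to structure one update on a single weight matrix.

```python
import numpy as np

def rigl_update(weights, mask, grads, drop_fraction=0.3):
    """One RigL-style drop/grow step (illustrative sketch).

    `mask` is a boolean array marking active connections; `grads` is the
    dense gradient for ALL connections (active and inactive), which RigL
    uses to decide where to grow new ones.
    """
    n_drop = int(drop_fraction * mask.sum())
    new_mask = mask.copy()

    # Drop: deactivate the active connections with the smallest magnitude.
    magnitude = np.where(mask, np.abs(weights), np.inf)
    drop_idx = np.argsort(magnitude, axis=None)[:n_drop]
    new_mask.flat[drop_idx] = False

    # Grow: activate the inactive connections with the largest gradient
    # magnitude; newly grown connections start at zero weight.
    grad_mag = np.where(new_mask, -np.inf, np.abs(grads))
    grow_idx = np.argsort(grad_mag, axis=None)[-n_drop:]
    new_mask.flat[grow_idx] = True

    # Keep weights only for connections that stayed active throughout;
    # dropped and newly grown connections are zero.
    new_weights = np.where(new_mask & mask, weights, 0.0)
    return new_weights, new_mask
```

Because the same number of connections is dropped and grown, the overall sparsity level stays fixed for the whole run, which is what lets RigL train a network that was never dense in the first place.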
Stumbled on this here: https://www.reddit.com/r/MachineLearning/comments/j0xr24/d_machine_learning_wayr_what_are_you_reading_week/g71offh/?utm_source=reddit&utm_medium=web2x&context=3
Link to the paper: https://arxiv.org/pdf/1911.11134.pdf