Sparse Quantized Neural Network torch module.
Global variables:
- MAX_BITWIDTH_BACKWARD_COMPATIBLE
Sparse Quantized Neural Network.
This class implements an MLP that is compatible with FHE constraints. The weights and activations are quantized to low bit-width, and pruning is used to ensure accumulators do not surpass a user-provided accumulator bit-width. The number of classes, the number of layers, and the breadth of the network are specified by the user.
__init__(
input_dim,
n_layers,
n_outputs,
n_hidden_neurons_multiplier=4,
n_w_bits=3,
n_a_bits=3,
n_accum_bits=8,
n_prune_neurons_percentage=0.0,
activation_function=torch.nn.ReLU,
quant_narrow=False,
quant_signed=True
)
Sparse Quantized Neural Network constructor.
Args:
- `input_dim`: Number of dimensions of the input data
- `n_layers`: Number of linear layers for this network
- `n_outputs`: Number of output classes or regression targets
- `n_w_bits`: Number of weight bits
- `n_a_bits`: Number of activation and input bits
- `n_accum_bits`: Maximal allowed bit-width of intermediate accumulators
- `n_hidden_neurons_multiplier`: The number of neurons in the hidden layers will be the number of dimensions of the input multiplied by `n_hidden_neurons_multiplier`. Note that pruning is used to adjust the accumulator size in an attempt to keep the maximum accumulator bit-width to `n_accum_bits`, meaning that not all hidden-layer neurons will be active. The default value for `n_hidden_neurons_multiplier` is chosen for inputs with a small number of dimensions. Reducing this value decreases the FHE inference time considerably but also decreases the robustness and accuracy of model training.
- `n_prune_neurons_percentage`: Percentage of hidden-layer neurons to prune. This should mostly be used through the dedicated `.prune()` mechanism. It can be used when `n_hidden_neurons_multiplier` is set high (3-4), once good accuracy is obtained, to speed up the model in FHE.
- `activation_function`: A torch class used to construct the activation functions in the network (e.g. `torch.nn.ReLU`, `torch.nn.SELU`, `torch.nn.Sigmoid`)
- `quant_narrow`: Whether this network should use narrow-range quantized integer values
- `quant_signed`: Whether to use signed quantized integer values
Raises:
- `ValueError`: if the parameters have invalid values or the computed accumulator bit-width is zero
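To make the parameter list concrete, here is a minimal construction sketch. The class name `SparseQuantNeuralNetwork` and the import path are assumptions (they are not stated in this section); only the keyword arguments are taken from the documentation above.

```python
import torch.nn

# Assumed class name and import path; the keyword arguments below are the
# documented constructor parameters.
from concrete.ml.sklearn.qnn_module import SparseQuantNeuralNetwork

model = SparseQuantNeuralNetwork(
    input_dim=20,                   # 20 input features
    n_layers=3,                     # three linear layers
    n_outputs=2,                    # two output classes
    n_hidden_neurons_multiplier=4,  # hidden width = 4 * input_dim (before pruning)
    n_w_bits=3,                     # 3-bit weights
    n_a_bits=3,                     # 3-bit activations and inputs
    n_accum_bits=8,                 # cap intermediate accumulators at 8 bits
    activation_function=torch.nn.ReLU,
)
```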
enable_pruning() → None
Enable pruning in the network. Pruning must later be made permanent (see make_pruning_permanent) to recover the pruned weights as regular network weights.
Raises:
- `ValueError`: if the quantization parameters are invalid
forward(x: Tensor) → Tensor
Forward pass.
Args:
- `x` (torch.Tensor): network input
Returns:
- `x` (torch.Tensor): network prediction
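As a usage sketch, the forward pass is the standard torch module call. The batch size and feature count below are arbitrary and assume the `model` built in the constructor sketch above (`input_dim=20`, `n_outputs=2`).

```python
import torch

# `model` is the instance from the constructor sketch above (input_dim=20).
x = torch.randn(8, 20)   # a batch of 8 samples with 20 features
y = model(x)             # equivalent to model.forward(x)
print(y.shape)           # torch.Size([8, 2]) since n_outputs=2
```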
make_pruning_permanent() → None
Make the learned pruning permanent in the network.
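For context, here is a hedged sketch of how pruning is typically sequenced around training: pruning is enabled before training and made permanent afterwards. The optimizer, loss, and `data_loader` below are illustrative assumptions, not part of this API.

```python
import torch

model.enable_pruning()                       # attach pruning masks before training

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = torch.nn.CrossEntropyLoss()

for x_batch, y_batch in data_loader:         # `data_loader` is assumed to exist
    optimizer.zero_grad()
    loss = loss_fn(model(x_batch), y_batch)  # forward pass on the pruned, quantized MLP
    loss.backward()
    optimizer.step()

model.make_pruning_permanent()               # fold the pruning masks into the weights
```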
max_active_neurons() → int
Compute the maximum number of active (non-zero weight) neurons.
The computation is done using the quantization parameters passed to the constructor. Warning: with the current (asymmetric) quantization algorithm, the value returned by this function is not guaranteed to ensure FHE compatibility: for some weight distributions, weights that are 0 (i.e., pruned weights) are not quantized to 0, so the total number of active quantized neurons will not be equal to max_active_neurons.
Returns:
- `n` (int): maximum number of active neurons
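To illustrate the kind of bound involved, here is a hedged sketch of how an accumulator constraint translates into a maximum number of active connections per neuron: with unsigned `n_w_bits`-bit weights and `n_a_bits`-bit activations, each product contributes at most `(2**n_w_bits - 1) * (2**n_a_bits - 1)` to the accumulator. The exact formula used by the library may differ; this is only a worked estimate.

```python
import math

def max_active_neurons_estimate(n_w_bits=3, n_a_bits=3, n_accum_bits=8):
    # Largest number of (weight * activation) terms whose worst-case sum
    # still fits in an n_accum_bits unsigned accumulator.
    max_product = (2 ** n_w_bits - 1) * (2 ** n_a_bits - 1)
    return int(math.floor((2 ** n_accum_bits - 1) / max_product))

print(max_active_neurons_estimate())  # 5 for the default 3/3/8 bit-widths
```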