tfmot.quantization.keras.QuantizeConfig

ABC interface for Keras layers to express how they should be quantized.

This is an experimental API not subject to backward compatibility.

QuantizeConfig encapsulates all the information needed by the quantization code to quantize a layer. It specifies what parts of a layer should be quantized and how they should be quantized.

It can be used to precisely control the quantization behavior of a layer. The framework provides default behavior for each layer, but this can be used to override it.

Create a QuantizeConfig for a Dense layer:

import tensorflow_model_optimization as tfmot

QuantizeConfig = tfmot.quantization.keras.QuantizeConfig
LastValueQuantizer = tfmot.quantization.keras.quantizers.LastValueQuantizer
MovingAverageQuantizer = tfmot.quantization.keras.quantizers.MovingAverageQuantizer

class MyDenseQuantizeConfig(QuantizeConfig):

  def get_weights_and_quantizers(self, layer):
    return [(layer.kernel, LastValueQuantizer(
        num_bits=8, symmetric=True, narrow_range=False, per_axis=False))]

  def get_activations_and_quantizers(self, layer):
    return [(layer.activation, MovingAverageQuantizer(
        num_bits=8, symmetric=False, narrow_range=False, per_axis=False))]

  def set_quantize_weights(self, layer, quantize_weights):
    layer.kernel = quantize_weights[0]

  def set_quantize_activations(self, layer, quantize_activations):
    layer.activation = quantize_activations[0]

  def get_output_quantizers(self, layer):
    # Returning an empty list means the layer's output is not quantized.
    return []

  def get_config(self):
    return {}

For a full example, see https://www.tensorflow.org/model_optimization/guide/quantization/training_comprehensive_guide.md

Methods

get_activations_and_quantizers

Return activations to be quantized along with their quantizers.

This function tells the quantize code which activations within a layer should be quantized, and how. The activations are the activation attributes in a layer, and the quantizers are Quantizer instances.

This method is invoked by the quantization code when quantizing a layer.

Example for a Dense layer:

def get_activations_and_quantizers(self, layer):
  return [(layer.activation, MovingAverageQuantizer())]

Args
layer: The layer being quantized.

Returns
List of 2-tuples. Each tuple is a keras activation and an associated quantizer.

get_config

Returns the config used to serialize QuantizeConfig.
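A config that takes constructor arguments should return them from get_config so the object can be rebuilt during deserialization (conventionally via cls(**config)). A plain-Python sketch of that round trip, using a hypothetical num_bits parameter:

```python
class NBitQuantizeConfig:
  """Hypothetical config parameterized by bit width (plain-Python stand-in)."""

  def __init__(self, num_bits=8):
    self.num_bits = num_bits

  def get_config(self):
    # Must return everything __init__ needs to rebuild this object.
    return {'num_bits': self.num_bits}

original = NBitQuantizeConfig(num_bits=4)
rebuilt = NBitQuantizeConfig(**original.get_config())
```

A config with no constructor arguments, like the Dense example above, can simply return an empty dict.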

get_output_quantizers

Returns the quantizers used to quantize the outputs from a layer.

For certain layers, we may want to quantize the output tensors returned by the layer's call function. This method makes that possible.

This function should return a list of quantizers. In most cases, a layer outputs only a single tensor, so the list should contain only one quantizer. Return an empty list if no quantization is desired on the results of the layer.

Args
layer: The layer being quantized.

Returns
List of Quantizers to be used to quantize the resulting tensors from a layer.

get_weights_and_quantizers

Return weights to be quantized along with their quantizers.

This function tells the quantize code which weights within a layer should be quantized, and how. The weights are the TF variables in a layer and the quantizers are Quantizer instances.

This method is invoked by the quantization code when quantizing a layer.

Example for a Dense layer:

def get_weights_and_quantizers(self, layer):
  return [(layer.kernel, LastValueQuantizer())]

Args
layer: The layer being quantized.

Returns
List of 2-tuples. Each tuple is a weight tensor and an associated quantizer.

set_quantize_activations

Replace the activations in the layer with quantized activations.

This method is invoked by the quantization code to replace the activations within a layer with their quantized versions, and is responsible for ensuring the replacement is applied to the correct attributes.

Example for a Dense layer:

def set_quantize_activations(self, layer, quantize_activations):
  layer.activation = quantize_activations[0]

Args
layer: The layer being quantized.
quantize_activations: List of quantized activations to replace the original activations in the layer.

Returns
None

set_quantize_weights

Replace the weights in the layer with quantized weights.

This method is invoked by the quantization code to replace the weights within a layer with their quantized versions, and is responsible for ensuring the replacement is applied to the correct attributes.

Example for a Dense layer:

def set_quantize_weights(self, layer, quantize_weights):
  layer.kernel = quantize_weights[0]

Args
layer: The layer being quantized.
quantize_weights: List of quantized weight tensors.

Returns
None
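The list passed to set_quantize_weights is ordered the same way as the tuples returned by get_weights_and_quantizers, so the two methods must agree on indexing. A plain-Python stand-in illustrating that contract (FakeLayer and the string quantizers are stand-ins for illustration; no TensorFlow required):

```python
class FakeLayer:
  """Stand-in for a Keras layer with two weight attributes."""

  def __init__(self):
    self.kernel = 'float_kernel'
    self.bias = 'float_bias'

class TwoWeightConfig:
  def get_weights_and_quantizers(self, layer):
    # The order of this list fixes the order of quantize_weights below.
    return [(layer.kernel, 'kernel_quantizer'),
            (layer.bias, 'bias_quantizer')]

  def set_quantize_weights(self, layer, quantize_weights):
    # Indices must match the list returned above.
    layer.kernel = quantize_weights[0]
    layer.bias = quantize_weights[1]

layer = FakeLayer()
config = TwoWeightConfig()
config.set_quantize_weights(layer, ['quant_kernel', 'quant_bias'])
```

The same index-pairing contract holds between get_activations_and_quantizers and set_quantize_activations.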