tfmot.quantization.keras.QuantizeConfig

ABC interface for Keras layers to express how they should be quantized.

This is an experimental API not subject to backward compatibility.

QuantizeConfig encapsulates all the information needed by the quantization code to quantize a layer. It specifies what parts of a layer should be quantized and how they should be quantized.

It can be used to precisely control the quantization behavior of a layer. The framework provides default behavior for each layer, and a custom QuantizeConfig can be used to override it.

Create a QuantizeConfig for a Dense layer:

import tensorflow_model_optimization as tfmot

QuantizeConfig = tfmot.quantization.keras.QuantizeConfig
LastValueQuantizer = tfmot.quantization.keras.quantizers.LastValueQuantizer
MovingAverageQuantizer = tfmot.quantization.keras.quantizers.MovingAverageQuantizer

class MyDenseQuantizeConfig(QuantizeConfig):

  def get_weights_and_quantizers(self, layer):
    # Quantize the kernel to 8 bits, tracking the last observed min/max.
    return [(layer.kernel, LastValueQuantizer(
        num_bits=8, symmetric=True, narrow_range=False, per_axis=False))]

  def get_activations_and_quantizers(self, layer):
    # Quantize the activation to 8 bits using a moving average of min/max.
    return [(layer.activation, MovingAverageQuantizer(
        num_bits=8, symmetric=False, narrow_range=False, per_axis=False))]

  def set_quantize_weights(self, layer, quantize_weights):
    layer.kernel = quantize_weights[0]

  def set_quantize_activations(self, layer, quantize_activations):
    layer.activation = quantize_activations[0]

  def get_output_quantizers(self, layer):
    # Returning an empty list means the layer's outputs are not quantized.
    return []

  def get_config(self):
    return {}
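
A config like this does not quantize anything on its own; it is typically attached to a layer with quantize_annotate_layer and applied with quantize_apply. A minimal sketch, where the model architecture and layer sizes are illustrative:

import tensorflow as tf
import tensorflow_model_optimization as tfmot

quantize_annotate_layer = tfmot.quantization.keras.quantize_annotate_layer
quantize_annotate_model = tfmot.quantization.keras.quantize_annotate_model
quantize_apply = tfmot.quantization.keras.quantize_apply
quantize_scope = tfmot.quantization.keras.quantize_scope

# Annotate the Dense layer with the custom config; unannotated layers use
# the framework defaults.
model = quantize_annotate_model(tf.keras.Sequential([
    quantize_annotate_layer(
        tf.keras.layers.Dense(10, input_shape=(20,)),
        quantize_config=MyDenseQuantizeConfig()),
]))

# The custom config must be visible while the model is transformed.
with quantize_scope({'MyDenseQuantizeConfig': MyDenseQuantizeConfig}):
  quantized_model = quantize_apply(model)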

For a full example, see https://www.tensorflow.org/model_optimization/guide/quantization/training_comprehensive_guide.md

Methods

from_config

Instantiates a QuantizeConfig from its config.

Args
config: Output of get_config().

Returns
A QuantizeConfig instance.
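
For the MyDenseQuantizeConfig defined above, whose get_config returns an empty dict, a serialization round trip is simply a sketch like:

config = MyDenseQuantizeConfig().get_config()        # {} for this config
restored = MyDenseQuantizeConfig.from_config(config)  # equivalent instance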

get_activations_and_quantizers

Return activations to be quantized along with their quantizers.

This function tells the quantization code which activations within a layer should be quantized, and how. The activations are the activation attributes of the layer, and the quantizers are Quantizer instances.

This method is invoked by the quantization code when quantizing a layer.

Example for a Dense layer:

def get_activations_and_quantizers(self, layer):
  return [(layer.activation, MovingAverageQuantizer(
      num_bits=8, symmetric=False, narrow_range=False, per_axis=False))]

Args
layer: The layer being quantized.

Returns
List of 2-tuples. Each tuple is a Keras activation and its associated quantizer.

get_config

Returns the config used to serialize QuantizeConfig.
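
If the config takes constructor arguments, they should be returned here so that from_config can rebuild an equivalent instance. A hypothetical sketch, where num_bits is an illustrative constructor argument rather than part of the base class:

def get_config(self):
  # 'num_bits' is a hypothetical argument stored by this config's __init__.
  return {'num_bits': self.num_bits}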

get_output_quantizers

Returns the quantizers used to quantize the outputs of a layer.

For certain layers, we may want to quantize the output tensors returned by the layer's call function.

This function should return a list of quantizers. In most cases a layer outputs only a single tensor, so the list should contain a single quantizer. Return an empty list if no quantization is desired on the layer's outputs.

Args
layer: The layer being quantized.

Returns
List of Quantizers used to quantize the output tensors of the layer.
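
Example for a layer with a single output tensor that should be quantized; the 8-bit moving-average settings below mirror common activation quantization and are illustrative:

def get_output_quantizers(self, layer):
  # One quantizer per output tensor; most layers produce a single output.
  return [MovingAverageQuantizer(
      num_bits=8, symmetric=False, narrow_range=False, per_axis=False)]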

get_weights_and_quantizers

Return weights to be quantized along with their quantizers.

This function tells the quantization code which weights within a layer should be quantized, and how. The weights are the TF variables of the layer, and the quantizers are Quantizer instances.

This method is invoked by the quantization code when quantizing a layer.

Example for a Dense layer:

def get_weights_and_quantizers(self, layer):
  return [(layer.kernel, LastValueQuantizer(
      num_bits=8, symmetric=True, narrow_range=False, per_axis=False))]

Args
layer: The layer being quantized.

Returns
List of 2-tuples. Each tuple is a weight tensor and its associated quantizer.

set_quantize_activations

Replace the activations in the layer with quantized activations.

This method is invoked by the quantization code to replace the activations within a layer with quantized activations. It is responsible for ensuring that the activations within a layer are properly replaced.

Example for a Dense layer:

def set_quantize_activations(self, layer, quantize_activations):
  layer.activation = quantize_activations[0]

Args
layer: The layer being quantized.
quantize_activations: List of quantized activations to replace the original activations in the layer.

Returns
None

set_quantize_weights

Replace the weights in the layer with quantized weights.

This method is invoked by the quantization code to replace the weights within a layer with quantized weights. It is responsible for ensuring that the weights within a layer are properly replaced.

Example for a Dense layer:

def set_quantize_weights(self, layer, quantize_weights):
  layer.kernel = quantize_weights[0]

Args
layer: The layer being quantized.
quantize_weights: List of quantized weight tensors.

Returns
None
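
When get_weights_and_quantizers returns more than one weight, quantize_weights follows the same order. A sketch that also replaces the bias (whether quantizing the bias is worthwhile depends on the target backend):

def set_quantize_weights(self, layer, quantize_weights):
  # Order matches the list returned by get_weights_and_quantizers,
  # e.g. [(layer.kernel, ...), (layer.bias, ...)].
  layer.kernel = quantize_weights[0]
  layer.bias = quantize_weights[1]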