Updated: Aug 7th, 2020

Quantization

  • Post training quantization for dynamic-range kernels -- Launched
  • Post training quantization for (8b) fixed-point kernels -- Launched
  • Quantization aware training for (8b) fixed-point kernels and experimentation for <8b -- Launched
  • [WIP] Post training quantization for (8b) fixed-point RNNs
  • Quantization aware training for (8b) fixed-point RNNs
  • [WIP] Quality and performance improvements to post training dynamic-range quantization

Pruning / Sparsity

  • During-training magnitude-based weight pruning -- Launched
  • Sparse model execution support in TensorFlow Lite -- WIP

Weight clustering

  • During-training weight clustering -- Launched

Cascading compression techniques

  • [WIP] Additional support for combining different compression techniques. Today, users can only combine one during-training technique with post-training quantization. The proposal is coming soon.

Compression

  • [WIP] Tensor compression API