Updated: Mar 27th, 2020

Quantization

  • Post training quantization for dynamic-range kernels -- Launched
  • Post training quantization for (8b) fixed-point kernels -- Launched
  • Quantization aware training for (8b) fixed-point kernels and experimentation for <8b -- Launched
  • Post training quantization for (8b) fixed-point RNNs
  • Quantization aware training for (8b) fixed-point RNNs
  • Quality and performance improvements to post training dynamic-range quantization

Pruning / Sparsity

  • During-training magnitude-based weight pruning -- Launched
  • Sparse model execution support in TensorFlow Lite -- WIP
  • Weight clustering API

Compression

  • Tensor compression API