Attention: TensorFlow Lite is now part of Google AI Edge. The latest documentation is now at ai.google.dev/edge/lite. Learn more

tflite_model_maker.audio_classifier.YamNetSpec

View source on GitHub

Model good at detecting environmental sounds, using YAMNet embedding.

tflite_model_maker.audio_classifier.YamNetSpec(
    model_dir: None = None,
    strategy: None = None,
    yamnet_model_handle='https://tfhub.dev/google/yamnet/1',
    frame_length=EXPECTED_WAVEFORM_LENGTH,
    frame_step=(EXPECTED_WAVEFORM_LENGTH // 2),
    keep_yamnet_and_custom_heads=True
)

Used in the notebooks

Used in the tutorials
Transfer Learning for the Audio Domain with TensorFlow Lite Model Maker

Args
`model_dir`	The location to save the model checkpoint files.
`strategy`	An instance of TF distribute strategy. If none, it will use the default strategy (either SingleDeviceStrategy or the current scoped strategy.
`yamnet_model_handle`	Path of the TFHub model for retrining.
`frame_length`	The number of samples in each audio frame. If the audio file is shorter than `frame_length`, then the audio file will be ignored.
`frame_step`	The number of samples between two audio frames. This value should be smaller than `frame_length`, otherwise some samples will be ignored.
`keep_yamnet_and_custom_heads`	Boolean, decides if the final TFLite model contains both YAMNet and custom trained classification heads. When set to False, only the trained custom head will be preserved.

Attributes
`target_sample_rate`

Attributes

target_sample_rate

Methods

`create_model`

View source

create_model(
    num_classes, train_whole_model=False
)

`create_serving_model`

View source

create_serving_model(
    training_model
)

Create a model for serving.

`export_tflite`

View source

export_tflite(
    model,
    tflite_filepath,
    with_metadata=True,
    export_metadata_json_file=True,
    index_to_label=None,
    quantization_config=None
)

Converts the retrained model to tflite format and saves it.

This method overrides the default CustomModel._export_tflite method, and include the spectrom extraction in the model.

The exported model has input shape (1, number of wav samples)

Args
`model`	An instance of the keras classification model to be exported.
`tflite_filepath`	File path to save tflite model.
`with_metadata`	Whether the output tflite model contains metadata.
`export_metadata_json_file`	Whether to export metadata in json file. If True, export the metadata in the same directory as tflite model. Used only if `with_metadata` is True.
`index_to_label`	A list that map from index to label class name.
`quantization_config`	Configuration for post-training quantization.

`get_default_quantization_config`

View source

get_default_quantization_config()

Gets the default quantization configuration.

`preprocess_ds`

View source

preprocess_ds(
    ds, is_training=False, cache_fn=None
)

Returns a preprocessed dataset.

`run_classifier`

View source

run_classifier(
    model, epochs, train_ds, validation_ds, **kwargs
)

Class Variables
EMBEDDING_SIZE	`1024`
EXPECTED_WAVEFORM_LENGTH	`15600`

tflite_model_maker.audio_classifier.YamNetSpec

Used in the notebooks

Args

Attributes

Methods

create_model

create_serving_model

export_tflite

get_default_quantization_config

preprocess_ds

run_classifier

Class Variables

`create_model`

`create_serving_model`

`export_tflite`

`get_default_quantization_config`

`preprocess_ds`

`run_classifier`