Conozca lo último en aprendizaje automático, IA generativa y más en el Simposio WiML 2023.

Google uses AI technology to translate content into your preferred language. AI translations can contain errors.

CLI de TFDS

TFDS CLI es una herramienta de línea de comandos que proporciona varios comandos para trabajar fácilmente con conjuntos de datos de TensorFlow.

Ver en TensorFlow.org

Ejecutar en Google Colab

Ver fuente en GitHub

Descargar libreta

Deshabilitar registros TF en la importación

%%capture
%env TF_CPP_MIN_LOG_LEVEL=1  # Disable logs on TF import

Instalación

La herramienta CLI se instala con tensorflow-datasets (o tfds-nightly ).

pip install -q tfds-nightly
tfds --version

Para la lista de todos los comandos CLI:

tfds --help

usage: tfds [-h] [--helpfull] [--version] {build,new} ...

Tensorflow Datasets CLI tool

optional arguments:
  -h, --help   show this help message and exit
  --helpfull   show full help message and exit
  --version    show program's version number and exit

command:
  {build,new}
    build      Commands for downloading and preparing datasets.
    new        Creates a new dataset directory from the template.

`tfds new` : Implementación de un nuevo conjunto de datos

Este comando lo ayudará a comenzar a escribir su nuevo conjunto de datos de Python creando un <dataset_name>/ que contiene los archivos de implementación predeterminados.

Uso:

tfds new my_dataset

2022-02-07 04:04:10.397902: E tensorflow/stream_executor/cuda/cuda_driver.cc:271] failed call to cuInit: CUDA_ERROR_NO_DEVICE: no CUDA-capable device is detected
Dataset generated at /tmpfs/src/temp/docs/my_dataset
You can start searching `TODO(my_dataset)` to complete the implementation.
Please check https://www.tensorflow.org/datasets/add_dataset for additional details.

Creará:

ls -1 my_dataset/

__init__.py
checksums.tsv
dummy_data/
my_dataset.py
my_dataset_test.py

Consulte nuestra guía de conjunto de datos de escritura para obtener más información.

Opciones Disponibles:

tfds new --help

usage: tfds new [-h] [--helpfull] [--dir DIR] dataset_name

positional arguments:
  dataset_name  Name of the dataset to be created (in snake_case)

optional arguments:
  -h, --help    show this help message and exit
  --helpfull    show full help message and exit
  --dir DIR     Path where the dataset directory will be created. Defaults to
                current directory.

`tfds build` : descarga y prepara un conjunto de datos

Utilice tfds build <my_dataset> para generar un nuevo conjunto de datos. <my_dataset> puede ser:

Una ruta a dataset/ carpeta o archivo dataset.py (vacío para el directorio actual):
- tfds build datasets/my_dataset/
- cd datasets/my_dataset/ && tfds build
- cd datasets/my_dataset/ && tfds build my_dataset
- cd datasets/my_dataset/ && tfds build my_dataset.py
Un conjunto de datos registrado:
- tfds build mnist
- tfds build my_dataset --imports my_project.datasets