Attention: This MediaPipe Solutions Preview is an early release. Learn more.

mediapipe_model_maker.image_classifier.Dataset

Stay organized with collections Save and categorize content based on your preferences.

Dataset library for image classifier.

Inherits From: ClassificationDataset, Dataset

tf_dataset A tf.data.Dataset object that contains a potentially large set of elements, where each element is a pair of (input_data, target). The input_data means the raw input data, like an image, a text etc., while the target means the ground truth of the raw input data, e.g. the classification label of the image etc.
size The size of the dataset. tf.data.Dataset donesn't support a function to get the length directly since it's lazy-loaded and may be infinite.

label_names

num_classes

size Returns the size of the dataset.

Note that this function may return None becuase the exact size of the dataset isn't a necessary parameter to create an instance of this class, and tf.data.Dataset donesn't support a function to get the length directly since it's lazy-loaded and may be infinite. In most cases, however, when an instance of this class is created by helper functions like 'from_folder', the size of the dataset will be preprocessed, and this function can return an int representing the size of the dataset.

Methods

from_folder

View source

Loads images and labels from the given directory.

Assume the image data of the same label are in the same subdirectory.

Args
dirname Name of the directory containing the data files.
shuffle boolean, if true, random shuffle data.

Returns
Dataset containing images and labels and other related info.

Raises
ValueError if the input data directory is empty.

gen_tf_dataset

View source

Generates a batched tf.data.Dataset for training/evaluation.

Args
batch_size An integer, the returned dataset will be batched by this size.
is_training A boolean, when True, the returned dataset will be optionally shuffled and repeated as an endless dataset.
shuffle A boolean, when True, the returned dataset will be shuffled to create randomness during model training.
preprocess A function taking three arguments in order, feature, label and boolean is_training.
drop_remainder boolean, whether the finaly batch drops remainder.

Returns
A TF dataset ready to be consumed by Keras model.

split

View source

Splits dataset into two sub-datasets with the given fraction.

Primarily used for splitting the data set into training and testing sets.

Args
fraction float, demonstrates the fraction of the first returned subdataset in the original data.

Returns
The splitted two sub datasets.

__len__

View source

Returns the number of element of the dataset.