Logging Datasets other than files (for example: tensorflow_dataset object)

stevencocke · March 29, 2023, 3:49pm

Hello,

I see many examples in the documentation for logging actual files as datasets/artifacts, but how do I log datasets that aren’t files? For example, I am using tensorflow_datasets to download my dataset directly into train and validation splits and would like to log these directly. Is there an easy way to do this or can they only live in a table object?

Thank you

mohammadbakir · April 6, 2023, 9:45pm

Hi @stevencocke, the wandb.log() function accepts a variety of data types, including NumPy arrays, Python dictionaries, and other data structures. Are you looking to do something similar to this?

import tensorflow_datasets as tfds
import wandb

# Initialize WandB
wandb.init(project="tf-data-test")

# Load MNIST dataset
ds_train, ds_test = tfds.load('mnist', split=['train[:20]', 'test[:20]'], shuffle_files=True)

# Create a WandB Table
table = wandb.Table(columns=["image", "label"])

# Log examples to WandB & add data to table
for example in ds_train:
    image = example['image'].numpy()
    label = example['label'].numpy()
    wandb.log({"image": wandb.Image(image, caption=f"Label: {label}")})
    table.add_data(wandb.Image(image), label)

# Log the table to WandB
wandb.log({"mnist": table})
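If the goal is a versioned dataset artifact rather than a table, one possible approach is to materialize the split to a file first and log that file. The sketch below is only an illustration under several assumptions: the helper names `examples_to_npz` and `log_tfds_split_as_artifact`, the file name `data.npz`, and the artifact name `mnist-train-sample` are all hypothetical, every example is assumed to have the same shape, and the split is assumed to fit in memory.

```python
import os
import tempfile

import numpy as np


def examples_to_npz(examples, path):
    """Stack an iterable of {"image", "label"} examples into one .npz file.

    Assumes every image has the same shape and the split fits in memory.
    Returns the number of examples written.
    """
    images, labels = [], []
    for example in examples:
        images.append(np.asarray(example["image"]))
        labels.append(np.asarray(example["label"]))
    np.savez_compressed(path, images=np.stack(images), labels=np.stack(labels))
    return len(images)


def log_tfds_split_as_artifact(dataset_name="mnist", split="train[:20]",
                               project="tf-data-test",
                               artifact_name="mnist-train-sample"):
    """Hypothetical helper: save a tfds split to disk and log it as an artifact."""
    # Imported here so examples_to_npz stays usable without these packages.
    import tensorflow_datasets as tfds
    import wandb

    run = wandb.init(project=project, job_type="dataset-upload")
    ds = tfds.load(dataset_name, split=split)

    with tempfile.TemporaryDirectory() as tmp_dir:
        path = os.path.join(tmp_dir, "data.npz")
        # tfds.as_numpy yields plain dicts of NumPy arrays.
        examples_to_npz(tfds.as_numpy(ds), path)

        artifact = wandb.Artifact(name=artifact_name, type="dataset")
        artifact.add_file(path)
        run.log_artifact(artifact)
        # Finish inside the block so the upload completes before
        # the temporary directory is removed.
        run.finish()
```

Calling `log_tfds_split_as_artifact("mnist", "train[:20]")` would then create one artifact version for that split. A later run could fetch it with `run.use_artifact(...)` and read the arrays back with `np.load`.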

mohammadbakir · April 13, 2023, 6:10pm

Hi @stevencocke, since we have not heard back from you we are going to close this request. If you would like to re-open the conversation, please let us know!

system · June 12, 2023, 6:10pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.
