Quickstart

Loading an image

import torchio as tio

# From a file (lazy, no data read yet)
image = tio.ScalarImage("t1.nii.gz")
print(image.shape)    # reads header only: (1, 256, 256, 176)
print(image.spacing)  # (1.0, 1.0, 1.0)

# Data is loaded on first access
tensor = image.data   # shape: (1, 256, 256, 176), dtype: float32

# From a URL or cloud path
image = tio.ScalarImage("https://example.com/t1.nii.gz")
image = tio.ScalarImage("s3://bucket/t1.nii.gz")

# From a file-like object
import io
buf = io.BytesIO(open("t1.nii.gz", "rb").read())
image = tio.ScalarImage(buf, suffix=".nii.gz")

Creating from a tensor

import torch

tensor = torch.randn(1, 128, 128, 128)
image = tio.ScalarImage(tensor)

Creating from SimpleITK or NiBabel

import SimpleITK as sitk
import nibabel as nib

# From a SimpleITK Image (preserves spacing, origin, direction)
sitk_image = sitk.ReadImage("t1.nii.gz")
image = tio.ScalarImage(sitk_image)

# From a NiBabel Nifti1Image (lazy, data not loaded yet)
nifti = nib.load("t1.nii.gz")
image = tio.ScalarImage(nifti)

Creating from bytes

If you have raw image bytes (e.g., from an HTTP response or a database), pass them directly:

response = requests.get("https://example.com/brain.nii.gz")
image = tio.ScalarImage(response.content)

# Or with a BytesIO buffer
import io
buf = io.BytesIO(some_bytes)
image = tio.ScalarImage(buf, suffix=".nii.gz")

Creating from a Zarr store

For large-scale datasets stored as .nii.zarr, you can pass a zarr.abc.store.Store directly. Instantiation is O(1): the store is only accessed when metadata or data is needed:

import zarr

store = zarr.storage.FsspecStore("s3://bucket/brain.nii.zarr", mode="r")
image = tio.ScalarImage(store)              # instant, no I/O
print(image.shape)                          # triggers header read
image.data                                  # triggers full load

# Select a specific pyramid level
image = tio.ScalarImage(store, reader_kwargs={"level": 1})

Attaching metadata

Pass any extra keyword arguments to attach metadata to an image:

image = tio.ScalarImage("t1.nii.gz", protocol="MPRAGE", te=3.5)
image.protocol       # "MPRAGE" (attribute access)
image["te"]          # 3.5     (dict-style access)
image.metadata       # {"protocol": "MPRAGE", "te": 3.5}

Slicing

Slicing follows the (C, I, J, K) layout and keeps things lazy. Only the requested region is read from disk:

image = tio.ScalarImage("big_volume.nii.gz")
patch = image[:, 100:200, 100:200, 50:100]  # no full load
patch.data.mean()  # reads only this region

The affine origin is updated automatically so the patch stays in the correct world coordinates.

Grouping data into a Subject

A Subject (also available as Study) holds images, spatial annotations, and metadata:

import torch

subject = tio.Subject(
    t1=tio.ScalarImage("t1.nii.gz"),
    seg=tio.LabelMap("seg.nii.gz"),
    landmarks=tio.Points(torch.tensor([[64.0, 64.0, 32.0]])),
    tumors=tio.BoundingBoxes(
        torch.tensor([[10, 20, 30, 50, 60, 70]]),
        format=tio.BoundingBoxFormat.IJKIJK,
    ),
    age=45,
)

subject.t1           # Image access
subject.landmarks    # Points access
subject.tumors       # BoundingBoxes access
subject["seg"]       # dict-style access
subject.age          # metadata access (returns 45)

Saving

# Any format SimpleITK supports
image.save("output.nii.gz")
image.save("output.nrrd")

# NIfTI-Zarr (chunked, lazy-loadable)
image.save("output.nii.zarr")

Batching with a DataLoader

from torch.utils.data import Dataset

class BrainDataset(Dataset):
    def __init__(self, paths):
        self.subjects = [
            tio.Subject(
                t1=tio.ScalarImage(p / "t1.nii.gz"),
                seg=tio.LabelMap(p / "seg.nii.gz"),
            )
            for p in paths
        ]

    def __len__(self):
        return len(self.subjects)

    def __getitem__(self, idx):
        return self.subjects[idx]

loader = tio.SubjectsLoader(BrainDataset(paths), batch_size=4)
batch = next(iter(loader))
batch.t1.data.shape  # (4, 1, 256, 256, 176)

See the DataLoader how-to guide for more details.

Applying transforms

Transforms accept Subjects, Images, Tensors, NumPy arrays, SimpleITK Images, NiBabel images, or MONAI-style dicts, and return the same type:

# Single deterministic transform
flipped = tio.Flip(axes=(0,))(subject)

# Random augmentation pipeline
augment = tio.Compose([
    tio.Flip(axes=(0, 1, 2), p=0.5),
    tio.Noise(std=(0.01, 0.1)),   # random std each call
])
augmented = augment(subject)

# Custom distribution for parameters
from torch.distributions import LogNormal
noisy = tio.Noise(std=LogNormal(loc=-2, scale=0.5))(subject)

# Works directly on tensors too
noisy_tensor = tio.Noise(std=0.05)(tensor)

# Works with MONAI-style dicts
data = {"image": tensor, "label": label_tensor}
augmented = tio.Noise(std=0.1)(data)  # returns dict

# Works on batches from SubjectsLoader (same params, vectorised)
batch = next(iter(loader))  # SubjectsBatch
augmented_batch = augment(batch)

See the transform design concepts for the full picture.

Where to go next

Tutorials

Step-by-step walkthroughs for common workflows.

→ Tutorials
How-to guides

Recipes for specific tasks: custom readers, NIfTI-Zarr, etc.

→ How-to guides
Concepts

Understand the design: lazy loading, affines, backends.

→ Concepts
API Reference

Complete reference for all classes and functions.

→ Reference