Skip to content

pymmcore-plus/ome-writers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

109 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

ome-writers

License PyPI Python Version CI codecov codspeed

OME-TIFF and OME-ZARR writer APIs designed for microscopy acquisition.

Documentation

Detailed documentation is available at https://pymmcore-plus.github.io/ome-writers/.

Purpose

ome-writers provides a unified interface for writing microscopy image data to OME-compliant formats (OME-TIFF and OME-Zarr) using various different backends. It is designed for streaming acquisition: receiving 2D camera frames one at a time and writing them to multi-dimensional arrays with proper metadata.

The core problem ome-writers solves:

Map a stream of 2D frames (arriving in acquisition order) to storage locations in multi-dimensional arrays, while generating OME-compliant metadata for both TIFF and Zarr formats.

We prioritize:

  • βœ… Correctness: Strict adherence to both OME-TIFF and OME-Zarr specifications.
  • πŸ’― Completeness: We want all the metadata to go to its proper place.
  • πŸš€ Performance: Very minimal, native-backed hot-path logic when appending frames.
  • πŸ€Έβ€β™‚οΈ Flexibility: Pick from 5+ array backends, suiting your dependency preferences.
  • πŸ“– Usability: Relatively small, well organized API, with extensive documentation.
  • πŸ’ͺ Stability: Minimal dependencies, exhaustive testing and validation.

Installation

You can install ome-writers via pip. You must also select select at least one backend extra:

pip install ome-writers[<backend>]

...where <backend> is a comma-separated list of one or more of the following:

  • tensorstore β€” Uses tensorstore, supports OME-Zarr v0.5.
  • acquire-zarr β€” Uses acquire-zarr, supports OME-Zarr v0.5.
  • zarr-python β€” Uses zarr-python, supports OME-Zarr v0.5.
  • zarrs-python β€” Uses zarrs-python, supports OME-Zarr v0.5.
  • tifffile β€” Uses tifffile, supports OME-TIFF.
  • all β€” install all backends.

Note

All zarr-backends use yaozarrs to generate OME-Zarr metadata and create zarr hierarchies (only array-writing is handled by the selected backend).

(Developers using uv sync will end up with all backends installed by default.)

Basic Usage

Note

More complete usage examples are available in the usage documentation

from ome_writers import AcquisitionSettings, Dimension, create_stream

settings = AcquisitionSettings(
    root_path="example_5d_image.ome.zarr",
    dimensions=[
        Dimension(name="t", count=10, chunk_size=1, type="time"),
        Dimension(name="c", count=2, chunk_size=1, type="channel"),
        Dimension(name="z", count=5, chunk_size=1, type="space", scale=5),
        Dimension(name="y", count=256, chunk_size=64, type="space", scale=0.1),
        Dimension(name="x", count=256, chunk_size=64, type="space", scale=0.1),
    ],
    dtype="uint16",
    overwrite=True,
)

with create_stream(settings) as stream:
    for frame in ...:
        stream.append(frame)

High-Level Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  AcquisitionSettings  │─────▢│   FrameRouter   │─────▢│  ArrayBackend         β”‚
β”‚                       β”‚      β”‚                 β”‚      β”‚                       β”‚
β”‚  Declarative model    β”‚      β”‚  __next__() ->  β”‚      β”‚  write(pos,idx,frame) β”‚
β”‚  of acquisition order β”‚      β”‚    (pos, idx)   β”‚      β”‚  finalize()           β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

AcquisitionSettings (schema.py)

The schema is the declarative description of what to create. In addition to other storage details such as data types, chunking, compression, and other metadata, it must fully describe the dimensionality of the data and the exact order in which frames will arrive.

Explicit non-goal: ome-writers does not attempt to handle non-deterministic acquisition patterns (e.g., event-driven acquisitions where data shape is unknown ahead of time). However, we do support an unbounded first dimension (e.g., time or whatever). For this case, we recommend a flat 3D structure (e.g., FYX with unbounded F) where F is "any frame", storing metadata for mapping frames to logical dimensions externally.

It answers:

  • What dimensions exist? (T, C, Z, Y, X, positions, plates, etc.)
  • What is the acquisition order? (how will frames arrive)
  • What is the storage order? (how should axes be arranged on disk)
  • Data types, chunking, compression, sharding, etc.

The schema separates acquisition order (the order dimensions appear in the dimensions list) from storage order (controlled by the storage_order field). This allows data to arrive in one order (e.g., TZCYX) but be stored in another (e.g., TCZYX for NGFF compliance).

FrameRouter

The router is the stateful iterator that maps frame numbers to storage locations. It:

  1. Reads the schema to understand both acquisition and storage order
  2. Maintains iteration state (which frame are we on?)
  3. Computes the permutation from acquisition order to storage order
  4. Yields (position_key, storage_index) tuples for each frame

The router is the only component that knows about both orderings. It iterates in acquisition order (because that's how frames arrive) and emits storage-order indices (because that's what backends need).

ArrayBackend

Backends are format-specific writers that handle the actual I/O. They:

  1. Create arrays/files based on the schema
  2. Write frames to specified locations
  3. Generate format-appropriate metadata
  4. Handle finalization (flushing, closing)

Supported backends:

  • tensorstore β€” OME-Zarr v0.5 via yaozarrs
  • zarr-python β€” OME-Zarr v0.5 via yaozarrs
  • acquire-zarr β€” OME-Zarr v0.5 via yaozarrs
  • tifffile β€” OME-TIFF

Backends receive indices in storage order and don't need to know about acquisition order.

Design Principles

  1. Schema is declarative β€” describes the target structure, not how to build it
  2. Router handles the mapping — single place for acquisition→storage order logic
  3. Backends are simple adapters β€” receive storage-order indices, write bytes
  4. Position is a meta-dimension β€” appears in iteration but becomes separate arrays/files, not an array axis

Why this layer of abstraction?

The separation of schema, router, and backend allows us to leave the performance-critical tasks to C++ libraries (like tensorstore, acquire-zarr), while keeping "fiddly" metadata logic and frame routing in Python (where it's easier to maintain).

The API of this library is heavily inspired by the acquire-zarr API (declare deterministic experiment with schema, append frames with single append() calls). But we also:

  • want to support both zarr and tiff formats (OME-TIFF)
  • want to support other zarr array libraries, such as tensorstore.
  • want to take advantage of Python for metadata management (e.g. ome-types for OME-XML generation and yaozarrs for OME-Zarr metadata)

Supported Use Cases

  • Single 5D image (TCZYX or any permutation) β€” the common case
  • Multi-position acquisition β€” separate arrays/files per stage position
  • Well plates β€” hierarchical plate/well/field structure with explicit acquisition order
  • Unbounded first dimension β€” e.g., streaming time-lapse with unknown total frames

Currently Unsupported Edge Cases

  • Jagged arrays: E.g.
    • one channel does Z-stacks while another does single planes. In other words, the outer array is regular, but some inner frames are missing/skipped.
    • different positions have different shapes (nT, nZ, etc), such as is possible when using subsequences in useq-schema. (maybe this is just the responsibility of the user to create multiple streams).
  • Multi-camera setups, particularly with different image shapes or data types. (here too... the caller could just call append() in the right order for each buffer)
  • What happens if you want to skip a frame at runtime, maybe append(None)?

Contributing

We welcome contributions to ome-writers! See our contributing guide for details.

About

Unified API for streaming camera frames to either OME-TIFF or OME-Zarr format

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Contributors 8

Languages