OME-TIFF and OME-ZARR writer APIs designed for microscopy acquisition.
Detailed documentation is available at https://pymmcore-plus.github.io/ome-writers/.
ome-writers provides a unified interface for writing microscopy image data to
OME-compliant formats (OME-TIFF and OME-Zarr) using various different backends.
It is designed for streaming acquisition: receiving 2D camera frames one at
a time and writing them to multi-dimensional arrays with proper metadata.
The core problem ome-writers solves:
Map a stream of 2D frames (arriving in acquisition order) to storage locations in multi-dimensional arrays, while generating OME-compliant metadata for both TIFF and Zarr formats.
We prioritize:
- β Correctness: Strict adherence to both OME-TIFF and OME-Zarr specifications.
- π― Completeness: We want all the metadata to go to its proper place.
- π Performance: Very minimal, native-backed hot-path logic when appending frames.
- π€ΈββοΈ Flexibility: Pick from 5+ array backends, suiting your dependency preferences.
- π Usability: Relatively small, well organized API, with extensive documentation.
- πͺ Stability: Minimal dependencies, exhaustive testing and validation.
You can install ome-writers via pip. You must also select select at least
one backend extra:
pip install ome-writers[<backend>]...where <backend> is a comma-separated list of one or more of the following:
tensorstoreβ Uses tensorstore, supports OME-Zarr v0.5.acquire-zarrβ Uses acquire-zarr, supports OME-Zarr v0.5.zarr-pythonβ Uses zarr-python, supports OME-Zarr v0.5.zarrs-pythonβ Uses zarrs-python, supports OME-Zarr v0.5.tifffileβ Uses tifffile, supports OME-TIFF.allβ install all backends.
Note
All zarr-backends use yaozarrs to generate OME-Zarr metadata and create zarr hierarchies (only array-writing is handled by the selected backend).
(Developers using uv sync will end up with all backends installed by default.)
Note
More complete usage examples are available in the usage documentation
from ome_writers import AcquisitionSettings, Dimension, create_stream
settings = AcquisitionSettings(
root_path="example_5d_image.ome.zarr",
dimensions=[
Dimension(name="t", count=10, chunk_size=1, type="time"),
Dimension(name="c", count=2, chunk_size=1, type="channel"),
Dimension(name="z", count=5, chunk_size=1, type="space", scale=5),
Dimension(name="y", count=256, chunk_size=64, type="space", scale=0.1),
Dimension(name="x", count=256, chunk_size=64, type="space", scale=0.1),
],
dtype="uint16",
overwrite=True,
)
with create_stream(settings) as stream:
for frame in ...:
stream.append(frame)βββββββββββββββββββββββββ βββββββββββββββββββ βββββββββββββββββββββββββ
β AcquisitionSettings βββββββΆβ FrameRouter βββββββΆβ ArrayBackend β
β β β β β β
β Declarative model β β __next__() -> β β write(pos,idx,frame) β
β of acquisition order β β (pos, idx) β β finalize() β
βββββββββββββββββββββββββ βββββββββββββββββββ βββββββββββββββββββββββββ
The schema is the declarative description of what to create. In addition to other storage details such as data types, chunking, compression, and other metadata, it must fully describe the dimensionality of the data and the exact order in which frames will arrive.
Explicit non-goal:
ome-writersdoes not attempt to handle non-deterministic acquisition patterns (e.g., event-driven acquisitions where data shape is unknown ahead of time). However, we do support an unbounded first dimension (e.g., time or whatever). For this case, we recommend a flat 3D structure (e.g., FYX with unbounded F) where F is "any frame", storing metadata for mapping frames to logical dimensions externally.
It answers:
- What dimensions exist? (T, C, Z, Y, X, positions, plates, etc.)
- What is the acquisition order? (how will frames arrive)
- What is the storage order? (how should axes be arranged on disk)
- Data types, chunking, compression, sharding, etc.
The schema separates acquisition order (the order dimensions appear in the
dimensions list) from storage order (controlled by the storage_order
field). This allows data to arrive in one order (e.g., TZCYX) but be stored in
another (e.g., TCZYX for NGFF compliance).
The router is the stateful iterator that maps frame numbers to storage locations. It:
- Reads the schema to understand both acquisition and storage order
- Maintains iteration state (which frame are we on?)
- Computes the permutation from acquisition order to storage order
- Yields
(position_key, storage_index)tuples for each frame
The router is the only component that knows about both orderings. It iterates in acquisition order (because that's how frames arrive) and emits storage-order indices (because that's what backends need).
Backends are format-specific writers that handle the actual I/O. They:
- Create arrays/files based on the schema
- Write frames to specified locations
- Generate format-appropriate metadata
- Handle finalization (flushing, closing)
Supported backends:
- tensorstore β OME-Zarr v0.5 via yaozarrs
- zarr-python β OME-Zarr v0.5 via yaozarrs
- acquire-zarr β OME-Zarr v0.5 via yaozarrs
- tifffile β OME-TIFF
Backends receive indices in storage order and don't need to know about acquisition order.
- Schema is declarative β describes the target structure, not how to build it
- Router handles the mapping β single place for acquisitionβstorage order logic
- Backends are simple adapters β receive storage-order indices, write bytes
- Position is a meta-dimension β appears in iteration but becomes separate arrays/files, not an array axis
The separation of schema, router, and backend allows us to leave the performance-critical tasks to C++ libraries (like tensorstore, acquire-zarr), while keeping "fiddly" metadata logic and frame routing in Python (where it's easier to maintain).
The API of this library is heavily inspired by the acquire-zarr API
(declare deterministic experiment with schema, append frames with single append() calls).
But we also:
- want to support both zarr and tiff formats (OME-TIFF)
- want to support other zarr array libraries, such as tensorstore.
- want to take advantage of Python for metadata management (e.g.
ome-typesfor OME-XML generation andyaozarrsfor OME-Zarr metadata)
- Single 5D image (TCZYX or any permutation) β the common case
- Multi-position acquisition β separate arrays/files per stage position
- Well plates β hierarchical plate/well/field structure with explicit acquisition order
- Unbounded first dimension β e.g., streaming time-lapse with unknown total frames
- Jagged arrays: E.g.
- one channel does Z-stacks while another does single planes. In other words, the outer array is regular, but some inner frames are missing/skipped.
- different positions have different shapes (nT, nZ, etc), such as is possible when using subsequences in useq-schema. (maybe this is just the responsibility of the user to create multiple streams).
- Multi-camera setups, particularly with different image shapes or data types.
(here too... the caller could just call
append()in the right order for each buffer) - What happens if you want to skip a frame at runtime, maybe
append(None)?
We welcome contributions to ome-writers! See our contributing
guide for details.