Specifying the data format

The README.md contains some high-level information about what is stored in an SPZ file. But right now, this is not detailed enough for people to write an encoder or decoder based on that description. It is not a _specification_, in that manner. I'll try to summarize a few points of which I think that they require further clarification.

### Coordinate systems

There is an issue about this at https://github.com/nianticlabs/spz/issues/14 . Some details about the coordinate system have been added in a [recent PR](https://github.com/nianticlabs/spz/pull/25), by inserting [this section](https://github.com/nianticlabs/spz/commit/7e2ece7cbfd427c3b43901ea0452eb800081a6f9#diff-b335630551682c19a781afebcf4d07bf978fb1f8ac04c6bf87428ed5106870f5R14). I'm not sure whether the issue should be closed, or whether there are further details that should be added. 

<sub>Beyond SPZ itself:</sub> 
Apparently, there is a certain fragmentation when it comes to coordinate system conventions for splats. A [recent comment in another issue](https://github.com/mkkellogg/GaussianSplats3D/issues/47#issuecomment-2891611215) lists a few conventions that can be found in the wild. 

The [`KHR_spz_gaussian_splats_compression` PR](https://github.com/KhronosGroup/glTF/blob/068b74f3bc8f0a1bb13e2265409caddb76d31d12/extensions/2.0/Khronos/KHR_spz_gaussian_splats_compression/README.md#conformance) currently says

> When compressing or decompressing the SPZ data to be stored within the glTF, you must specify a Left-Up-Front (LUF) coordinate system in the SPZ `PackOptions` or `UnpackOptions` within the SPZ library. 

Assuming that SPZ data **always** is in 'RUB' convention, it is not clear whether any coordinate system conversion is required when SPZ is stored within glTF. Specifically: I have some SPZ with RUB convention. Can I use that SPZ **directly** as the `bufferView/buffer` data in this extension, or do I have to perform a coordinate system conversion? (Ideally, that should not be necessary to do such conversions _at runtime_, but maybe there are strong reasons to do such a conversion even in the context of a last-mile delivery format)

In any case, the `spz-loader` npm package seems to [call the `rotate180DegAboutX` function](https://github.com/drumath2237/spz-loader/blob/4032056c1d2628ae9cf89dc0c2233cc15381e8ee/packages/core/lib/spz-wasm/main.cpp#L23) by default. The [`rotate180DegAboutX` function](https://github.com/nianticlabs/spz/blob/1986118bb48e4541594b479a96da15df0c00e5af/src/cc/splat-types.h#L164) is just one (fixed) coordinate system conversion, of which it is not clear where and why it should be applied or not. And more importantly: It doesn't look like it's possible to define any other coordinate system conversion in that library! (One could bring this up as an issue in that repository)

### Alpha/Opacity values

The ["Alphas" section](https://github.com/nianticlabs/spz/tree/9ba83ffedac9016bb76452598cb0dc676ad7e238?tab=readme-ov-file#alphas) currently says

> Alphas are represented as 8-bit unsigned integers.

This does not seem to be the case. From my understanding, the values that are stored in the "alphas" are values in [0, 255], but they are **not** classical alpha values where this range is mapped to [0,1]. Instead, they are undergoing the sigmoid/inverseSigmoid translation. This needs to be pointed out _explicitly_. 

(Whether something should be called "alpha" when it's not really an "alpha" value is another question...)

Related: https://github.com/nianticlabs/spz/issues/22


### Colors

Similar to the above: The ["Colors" section](https://github.com/nianticlabs/spz/tree/9ba83ffedac9016bb76452598cb0dc676ad7e238?tab=readme-ov-file#colors) currently says

> Colors are stored as `(r, g, b)` values, where each color component is represented as an unsigned 8-bit integer.

These are values in [0,255]. But they are not (r, g, b) color components, and not mapped to color values in [0,1]. They seem to be encodings of the spherical harmonics coefficients for dimension 0. If this was the case, then the values in [0,255] should be mapped to the range [-c,c], with `c = 1.0 /(Math.sqrt(1.0 / Math.PI) * 0.5)` (roughly 0.5/0.283 ~= 1.764).  But apparently SPZ uses some [colorScale](https://github.com/nianticlabs/spz/blob/1986118bb48e4541594b479a96da15df0c00e5af/src/cc/load-spz.cc#L45), which (I think) causes these values to be mapped to a range [-3.3, 3.3]. **Iff** that "colorScale" has a mathematical justification, then this should be made more explicit. (If it does not have a mathematical justification, this may be more tricky...)



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specifying the data format #42

Coordinate systems

Alpha/Opacity values

Colors

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Specifying the data format #42

Description

Coordinate systems

Alpha/Opacity values

Colors

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions