Description
Background:
At Radiant Earth, we've been using the label extension for both raster and vector label assets in our catalogs/datasets. This means that any item representing a label asset from an ML dataset uses the label extension. These labels are most commonly GeoJSON or GeoTIFF, with occasional exceptions such as NumPy array files. However, reading the specification more closely today, I noticed a contradiction.
Issue
In the first section of the specification, there is the verbiage:
These labels can take several forms, though all are expected to be contained with a GeoJSON FeatureCollection
- emphasis on expected to be contained with a GeoJSON FeatureCollection
Yet the label:type field stores an enum of either raster or vector.
If the first statement holds, and label assets must be contained in a GeoJSON FeatureCollection, then raster images with classification labels fall outside the scope of this extension.
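To make the conflict concrete, here is a minimal sketch of an item that sets label:type to raster and points at a GeoTIFF label asset. The item id, description, href, and the helper non_geojson_assets are hypothetical and only for illustration; the media type strings are the standard ones for GeoJSON and GeoTIFF. Read strictly, the FeatureCollection wording would reject this item even though raster is an allowed label:type value.

```python
# Media type for GeoJSON, which is what the "FeatureCollection" wording implies.
GEOJSON_MEDIA_TYPE = "application/geo+json"

# Hypothetical item: id, description, and href are made up for illustration.
raster_label_item = {
    "type": "Feature",
    "id": "example-raster-labels",
    "properties": {
        "label:type": "raster",
        "label:description": "Land cover classes stored as pixel values",
    },
    "assets": {
        "labels": {
            "href": "./labels.tif",
            "type": "image/tiff; application=geotiff",
        }
    },
}


def non_geojson_assets(item):
    """Return the keys of assets whose media type is not GeoJSON."""
    return [
        key
        for key, asset in item.get("assets", {}).items()
        if asset.get("type") != GEOJSON_MEDIA_TYPE
    ]


# The item declares an allowed label:type, yet its only label asset
# cannot be "contained with a GeoJSON FeatureCollection".
offending = non_geojson_assets(raster_label_item)
print(raster_label_item["properties"]["label:type"], offending)
```

Swapping the asset for a GeoJSON file with type application/geo+json makes the helper return an empty list, which is the only case the FeatureCollection wording appears to cover.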
Proposal/request
- Revise the specification to remove the contradiction: one place states that labels are expected to be GeoJSON only, while another states that they can be GeoJSON (vector) or GeoTIFF (raster). Also, the vector value for label:type could imply other vector file formats, e.g. shapefile, not just GeoJSON.
- Add an explanation of how to handle raster label data. Should these items combine the raster and classification extensions? Should the label extension still be applied to an item with raster label assets?