As someone who deals with geospatial processes like this daily, I have 2 notes. ...

kbgg · 2024-12-15T16:17:37 1734279457

Totally agree. When I worked at Radiant Earth the most frequent question we got was "I have this data I want to share, how do I create a good STAC for it?"

It was a totally valid question, but one that's practically impossible to answer. As a result, there's just so much variation between STACs, and you never really know what you're going to get.

sid_tf · 2024-12-15T17:21:09 1734283269

Hi, Sid here, one of the authors of the blog. STAC being open and extensible makes it a double-edged sword, yes. It's quite refreshing to hear this from someone who was at Radiant, because it shows we still haven't found a great way of sharing data.

What do you think of attempts like Source Cooperative?

kbgg · 2024-12-15T17:55:57 1734285357

There's clearly a need for Source Cooperative, given the overwhelmingly positive feedback we received during the beta. However, Source Cooperative is entirely dependent on Microsoft and Amazon subsidizing all of the S3 / Azure Blob Storage costs. They could pull the rug out at any moment, like we've seen with Planetary Computer, and Source Cooperative would no longer be sustainable.

Disclaimer: I built Source Cooperative and left Radiant 2 months ago.

sid_tf · 2024-12-15T17:04:53 1734282293

Agree on both fronts! STAC is pretty complex. My aim here was to make raw data access easy and fast, not to solve STAC. I believe stac-geoparquet is basically the attempt to fix that (it makes STAC columnar and hence faster to query at scale).

And yes, using Parquet adds the overhead of needing some form of catalog. But I believe we are very close to having Iceberg with native geo types be that catalog. At the same time, it opens another can of worms (Databricks and other catalogs, etc.).

The silver lining is that Parquet (GeoParquet) brings geo data closer to regular data.

whinvik · 2024-12-15T19:16:40 1734290200

Not sure I understand. The blog mentions adding columns to the GeoParquet, so it's either an extension or a new standard.

Not going to discourage what you are doing, but just from reading the blog, my immediate instinct is not to check out what you build.

sid_tf · 2024-12-16T06:22:07 1734330127

Sure! Glad you shared your honest opinion. But I just want to reiterate that the blog, and the library that will be released after it, are not meant to create a new standard. Throughout the blog it's clearly stated that this is "a new approach", not a "new standard", a "new format", or even a "better standard".