Friday, November 01, 2024

[new] databases/arrow 18.0.0

hi,

following thrift, here's the port for the c++ part of arrow:
https://github.com/apache/arrow/blob/main/cpp/README.md
it provides the parquet library for https://parquet.apache.org/.

some open questions:
- i've put the port in databases because for me its sort-of a database
format: "The universal columnar format and multi-language toolbox for
fast data interchange and in-memory analytics"

but it can go into devel or textproc, i'm not settled on it. devel is
already a bit crowded...

- the toplevel in https://github.com/apache/arrow/ has zero build goo,
so from the same distfile one has to build by subdir (eg setting
WRKDIST=${WRKDIR}/${DISTNAME}/cpp), hence the pkgname being arrow-cpp
since i'm only interested in the c++ part.

should i name the port databases/arrow-cpp ? databases/arrow/cpp in
preparation for potential other ports for various bindings ?

i'm still struggling to make it build fine the json part that depends on
rapidjson so for now this is commented out. will keep working on it, but
right now i have enough for testing the geoparquet support in gdal.

feedback welcome ! oks too ofc :)

Landry

No comments:

Post a Comment