hi,
following thrift, here's the port for the c++ part of arrow:
https://github.com/apache/arrow/blob/main/cpp/README.md
it provides the parquet library for https://parquet.apache.org/.
some open questions:
- i've put the port in databases because for me it's sort of a database
format: "The universal columnar format and multi-language toolbox for
fast data interchange and in-memory analytics"
but it could also go into devel or textproc, i'm not settled on it. devel
is already a bit crowded...
- the toplevel in https://github.com/apache/arrow/ has zero build goo,
so from the same distfile one has to build by subdir (e.g. setting
WRKDIST=${WRKDIR}/${DISTNAME}/cpp), hence the pkgname being arrow-cpp
since i'm only interested in the c++ part.
should i name the port databases/arrow-cpp? or databases/arrow/cpp, in
preparation for potential other ports for the various bindings?
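for reference, the subdir build described above could be sketched roughly
like this in a port Makefile. this is a hypothetical fragment, not the
actual port: the version placeholder, the DISTNAME pattern and the
CONFIGURE_ARGS are assumptions, only the WRKDIST setting and the arrow-cpp
pkgname come from the text above.

```makefile
# hypothetical sketch, not the real port Makefile
COMMENT =	C++ libraries for Apache Arrow and Parquet

# placeholder version, fill in the release actually being ported
V =		X.Y.Z
# assumed distfile naming, adjust to the real tarball
DISTNAME =	apache-arrow-${V}
PKGNAME =	arrow-cpp-${V}

CATEGORIES =	databases

MODULES =	devel/cmake

# the toplevel has no build files, so build from the cpp/ subdir only
WRKDIST =	${WRKDIR}/${DISTNAME}/cpp

# assumed options: enable parquet, leave the rapidjson-dependent
# json reader off until it builds
CONFIGURE_ARGS =	-DARROW_PARQUET=ON \
			-DARROW_JSON=OFF

.include <bsd.port.mk>
```

ARROW_PARQUET and ARROW_JSON are cmake options from the arrow c++ build;
whether they are the right set for this port is still an open question.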
i'm still struggling to get the json part, which depends on rapidjson, to
build, so for now it is commented out. will keep working on it, but
right now i have enough for testing the geoparquet support in gdal.
feedback welcome! oks too ofc :)
Landry