Raúl Cumplido
raulcumplido.bsky.social
Raúl Cumplido
@raulcumplido.bsky.social
Software and stuff. Working on Apache Arrow
Reposted by Raúl Cumplido
PyArrow 21 was a great release, especially for @hf.co users: PyArrow now seamlessly handles hf:// URIs and does content-defined chunking to reduce transfer and storage costs on HF. Check out this blog post: huggingface.co/blog/parquet... #apachearrow #apacheparquet
Parquet Content-Defined Chunking
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
August 5, 2025 at 3:49 PM
Reposted by Raúl Cumplido
Check out "Scaling the r-spatial ecosystem" by Dewey Dunnington 🌍📦
An exploration of how R’s spatial tools can be used for big(ger) data.

Video: youtu.be/tjNEoIYr_ag?...
Slides: dewey.dunnington.ca/slides/rspat...

#RStats #rspatial #GIS #SpatialData
August 6, 2025 at 2:03 PM
Reposted by Raúl Cumplido
Lots of improvements, medium and small, in this new release of the Apache Arrow monorepo (C++, Python, R, Ruby...). It also includes performance improvements in the Parquet reader that we contributed at @quantstack.bsky.social, and we hope to contribute more of them in the future.
July 22, 2025 at 8:31 AM
Reposted by Raúl Cumplido
Apache Arrow Summit 25 is happening! Join us in person on October 2nd, in Paris (hosted by @pydataparis.bsky.social ). The Call For Proposal is open, submit your talks before July 26th:
sessionize.com/arrow-summit...
Arrow Summit 2025: Call for Speakers
sessionize.com
July 7, 2025 at 7:37 AM
Reposted by Raúl Cumplido
The pycrdt Python library found its new home in the y-crdt organization.
Originally created for the Jupyter project, it is a general Yjs-compatible CRDT implementation that can be used in any project, independently of Jupyter.
github.com/y-crdt/pycrdt
GitHub - y-crdt/pycrdt: CRDTs based on Yrs.
CRDTs based on Yrs. Contribute to y-crdt/pycrdt development by creating an account on GitHub.
github.com
April 19, 2025 at 8:59 AM
Reposted by Raúl Cumplido
Martin Renou @martinrenou.bsky.social is presenting #JupyterCAD at the PyData Paris meetup.

Nice demo of in-browser collaborative editing of CAD models.
April 8, 2025 at 5:39 PM
Reposted by Raúl Cumplido
We’re back !
Thanks to Artefact for hosting us.
April 8, 2025 at 5:14 PM
Reposted by Raúl Cumplido
Finally a new #Shapely feature release! 🎉
Shapely 2.1.0 highlights include initial support for geometries with M or ZM values, functionality for coverage validation and simplification, and much more.

For a full overview, see shapely.readthedocs.io/en/latest/re...

#python #geopython #geospatial
April 3, 2025 at 11:55 AM
Reposted by Raúl Cumplido
What an achievement:

* All official Debian bookworm live images rebuild reproducibly

lists.reproducible-builds.org/pipermail/rb...
Irregular status update about reproducible Debian live ISO images
lists.reproducible-builds.org
March 27, 2025 at 3:44 AM
Reposted by Raúl Cumplido
Reposted by Raúl Cumplido
Check out what is new on the Apache Arrow ADBC 17 libraries release: arrow.apache.org/blog/2025/03...
Apache Arrow ADBC 17 (Libraries) Release
The Apache Arrow team is pleased to announce the version 17 release of the Apache Arrow ADBC libraries. This release includes 18 resolved issues from 13 distinct contributors. This is a release of the...
arrow.apache.org
March 7, 2025 at 11:12 AM
Reposted by Raúl Cumplido
If you're looking for an up-to-date tour of @arrow.apache.org and its latest developments, watch @jorisvandenbossche.bsky.social's excellent talk at @pydataparis.bsky.social 2024.

www.youtube.com/watch?v=3ehl...
Joris Van den Bossche - The expanding Apache Arrow universe | PyData Paris 2024
YouTube video by PyData
www.youtube.com
March 4, 2025 at 11:05 AM
Reposted by Raúl Cumplido
Data wants to be free: comparing and explaining how Arrow's data serialization can be better than what's in protocols like PostgreSQL's

arrow.apache.org/blog/2025/02...

#apachearrow #arrow
Data Wants to Be Free: Fast Data Exchange with Apache Arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics. It specifies a standardized language-independent column-oriented memory form...
arrow.apache.org
February 28, 2025 at 6:14 AM
Reposted by Raúl Cumplido
We're excited to introduce R-Lite, a #WebAssembly distribution of #R for the browser, allowing you to use the R kernel in JupyterLite!

Read the article by Isabel Paredes, who spearheaded this project at QuantStack, on the @jupyter.org blog:

blog.jupyter.org/r-in-the-bro...
R in the Browser: Announcing Our WebAssembly Distribution
R is now available in emscripten-forge, enabling the Xeus-R kernel in JupyterLite
blog.jupyter.org
February 28, 2025 at 10:33 AM
Reposted by Raúl Cumplido
Excited to present at #GeoPython2025 in beautiful Basel, Switzerland! This one's on a brand-new 'geography' extension for @duckdb.org and a few examples of how the Geography and S2Cell/Center/Union types can help scale workflows for truly global extents. Slides @ dewey.dunnington.ca/slides/geopy...!
February 25, 2025 at 10:27 AM
Reposted by Raúl Cumplido
We have a new Java Release. The version 18.2.0, see details on the blog post here: arrow.apache.org/blog/2025/02...
Apache Arrow Java 18.2.0 Release
The Apache Arrow team is pleased to announce the v18.2.0 release of Apache Arrow Java. This is the first release since Arrow Java landed in its own repository. Changelog What’s Changed GH-466: Export ...
arrow.apache.org
February 21, 2025 at 9:23 AM
Reposted by Raúl Cumplido
Hello world! We start our Blue Sky account with a new Apache Arrow patch release announcement, see more details about some fixes on our 19.0.1 release: arrow.apache.org/blog/2025/02...
Apache Arrow 19.0.1 Release
The Apache Arrow team is pleased to announce the 19.0.1 release. This release primarily addresses a bug in the recent Arrow 19.0.0 release which prevents Arrow C++ and libraries binding it (e.g., Pyth...
arrow.apache.org
February 20, 2025 at 8:51 AM
Reposted by Raúl Cumplido
JupyterCAD 3.0 is here! 🎉
We are excited to announce JupyterCAD 3.0, bringing major improvements to the web-based collaborative CAD editor for JupyterLab:
blog.jupyter.org/announcing-j...

✅ Color Customization
🐍 Embedded Python Console
🎯 Improved UX
🖱️ Mouse-based 3D Controls
🤝 Suggestions Support
Announcing JupyterCAD 3.0
The latest iteration of the web-based collaborative CAD editor
blog.jupyter.org
February 17, 2025 at 9:56 AM