Andrew Lamb
@andrewlamb1111.bsky.social
Apache {DataFusion PMC}, Database Internals
They have been growing and I think we had challenges of what to do with ones that looked like AI dumps. We aren't really into rejecting things outright, but just letting them site there isn't great either. I am pleased now that the rationale / guidance is written down
October 28, 2025 at 5:51 PM
They have been growing and I think we had challenges of what to do with ones that looked like AI dumps. We aren't really into rejecting things outright, but just letting them site there isn't great either. I am pleased now that the rationale / guidance is written down
One of the factors in open source software is that you don't get to choose who uses your work or what they do with it. I believe on the balance it is a net positive for everyone involved, but that is certainly a value judgement
October 24, 2025 at 9:57 AM
One of the factors in open source software is that you don't get to choose who uses your work or what they do with it. I believe on the balance it is a net positive for everyone involved, but that is certainly a value judgement
I 100% agree that including a WASM based encoder/decoder will be a barrier to implementation for any file format (including Parquet).
My broader point was that there is no technical reason it could not be added to Parquet, not that it necessarily could or should be added
My broader point was that there is no technical reason it could not be added to Parquet, not that it necessarily could or should be added
October 7, 2025 at 1:34 PM
I 100% agree that including a WASM based encoder/decoder will be a barrier to implementation for any file format (including Parquet).
My broader point was that there is no technical reason it could not be added to Parquet, not that it necessarily could or should be added
My broader point was that there is no technical reason it could not be added to Parquet, not that it necessarily could or should be added
The only thing is getting consensus -- there is no technical blocker
October 2, 2025 at 11:48 AM
The only thing is getting consensus -- there is no technical blocker
"It is not 100% clear to me how a new file format (or three) will drive additional ecosystem adoption :thinking:"
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
October 1, 2025 at 7:21 PM
"It is not 100% clear to me how a new file format (or three) will drive additional ecosystem adoption :thinking:"
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...