@kszucs.bsky.social (Apache Arrow PMC member) announces how to drastically speed up Parquet files uploads and downloads via deduplication.
Best part: the feature enabling this is open source !
huggingface.co/blog/parquet...
@kszucs.bsky.social (Apache Arrow PMC member) announces how to drastically speed up Parquet files uploads and downloads via deduplication.
Best part: the feature enabling this is open source !
huggingface.co/blog/parquet...
$ pip install \
-i pypi.anaconda.org/scientific-p... \
"pyarrow>=21.0.0.dev0"
it's changing the way I view data versioning👇
$ pip install \
-i pypi.anaconda.org/scientific-p... \
"pyarrow>=21.0.0.dev0"
it's changing the way I view data versioning👇
Github:
github.com/OpenDriveLab...
HuggingFace:
huggingface.co/agibot-world
Github:
github.com/OpenDriveLab...
HuggingFace:
huggingface.co/agibot-world
Also, this is the best paper heading I’ve seen in quite some time. The 'en tête' looks fantastic.
(⚡Llama 3.3) Chat with the paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/euclid-mult...
🤗 Dataset: huggingface.co/datasets/eu...
Also, this is the best paper heading I’ve seen in quite some time. The 'en tête' looks fantastic.
(⚡Llama 3.3) Chat with the paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/euclid-mult...
🤗 Dataset: huggingface.co/datasets/eu...
How? By combining step-wise reward models with tree search algorithms :)
We're open sourcing the full recipe and sharing a detailed blog post 👇
How? By combining step-wise reward models with tree search algorithms :)
We're open sourcing the full recipe and sharing a detailed blog post 👇
Hugging Face's integration of an "AI Query" overlay in their SQL console exemplifies this. Users input natural language, AI suggests SQL queries—streamlining data exploration seamlessly. Probably the best showcase of this pattern in a freely accessible product.
Hugging Face's integration of an "AI Query" overlay in their SQL console exemplifies this. Users input natural language, AI suggests SQL queries—streamlining data exploration seamlessly. Probably the best showcase of this pattern in a freely accessible product.
Contains *newly-collected* data, prioritizing *regional knowledge*.
Setting the stage for truly global AI evaluation.
Ready to see how your model measures up?
#AI #Multilingual #LLM #NLProc
Contains *newly-collected* data, prioritizing *regional knowledge*.
Setting the stage for truly global AI evaluation.
Ready to see how your model measures up?
#AI #Multilingual #LLM #NLProc
- Building custom feeds using ML
- Creating dashboards for data exploration
- Developing custom models for Bluesky
To gather @bsky.app resources on @huggingface.bsky.social. I've established a community org 🤗 huggingface.co/bluesky-comm...
- Building custom feeds using ML
- Creating dashboards for data exploration
- Developing custom models for Bluesky
To gather @bsky.app resources on @huggingface.bsky.social. I've established a community org 🤗 huggingface.co/bluesky-comm...
📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗
huggingface.co/datasets/blu...
📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗
huggingface.co/datasets/blu...