Daniel van Strien
@danielvanstrien.bsky.social
Machine Learning Librarian at @hf.co
Small models work great for GLAM but there aren't enough examples!
With @wjbmattingly.bsky.social I'm launching small-models-for-glam on @hf.co to create/curate models that run on modest hardware and address GLAM use cases.
Follow the org to keep up-to-date!
huggingface.co/small-models...
With @wjbmattingly.bsky.social I'm launching small-models-for-glam on @hf.co to create/curate models that run on modest hardware and address GLAM use cases.
Follow the org to keep up-to-date!
huggingface.co/small-models...
October 16, 2025 at 1:22 PM
Small models work great for GLAM but there aren't enough examples!
With @wjbmattingly.bsky.social I'm launching small-models-for-glam on @hf.co to create/curate models that run on modest hardware and address GLAM use cases.
Follow the org to keep up-to-date!
huggingface.co/small-models...
With @wjbmattingly.bsky.social I'm launching small-models-for-glam on @hf.co to create/curate models that run on modest hardware and address GLAM use cases.
Follow the org to keep up-to-date!
huggingface.co/small-models...
465 people. 122 languages. 58,185 annotations!
FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages.
Huge thanks to all who contributed!
huggingface.co/blog/davanst...
FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages.
Huge thanks to all who contributed!
huggingface.co/blog/davanst...
July 8, 2025 at 12:07 PM
465 people. 122 languages. 58,185 annotations!
FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages.
Huge thanks to all who contributed!
huggingface.co/blog/davanst...
FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages.
Huge thanks to all who contributed!
huggingface.co/blog/davanst...
Inspired by @hf.co's official MCP server, I built my own to expose my semantic search API for the HF ecosystem!
Features AI-powered search, parameter analysis via safetensors, and tools to find similar models/datasets.
Try: "Find non maths reasoning datasets from 2025"!
Features AI-powered search, parameter analysis via safetensors, and tools to find similar models/datasets.
Try: "Find non maths reasoning datasets from 2025"!
June 9, 2025 at 11:09 AM
Inspired by @hf.co's official MCP server, I built my own to expose my semantic search API for the HF ecosystem!
Features AI-powered search, parameter analysis via safetensors, and tools to find similar models/datasets.
Try: "Find non maths reasoning datasets from 2025"!
Features AI-powered search, parameter analysis via safetensors, and tools to find similar models/datasets.
Try: "Find non maths reasoning datasets from 2025"!