Intrigued by the warning it created: "make the warning page of what happens if you insert a crab into the Universal Crabbifier"
"show me a security camera still..."
(all zero shot, first try)
Intrigued by the warning it created: "make the warning page of what happens if you insert a crab into the Universal Crabbifier"
"show me a security camera still..."
(all zero shot, first try)
Nari Lab's Dia does some of the best expressive AI voice I have seen and it is open weights & created by two undergrads with no funding
Nari Lab's Dia does some of the best expressive AI voice I have seen and it is open weights & created by two undergrads with no funding
🥇 The first autoregressive video model with top-tier quality output
🔓 100% open-source & tech report
📊 Exceptional performance on major benchmarks
🥇 The first autoregressive video model with top-tier quality output
🔓 100% open-source & tech report
📊 Exceptional performance on major benchmarks
TREC, which is a community of researchers in information retrieval and natural language processing convened by the NIST, found that an independent human judge correlates better with GPT-4o than a human judge.
TREC, which is a community of researchers in information retrieval and natural language processing convened by the NIST, found that an independent human judge correlates better with GPT-4o than a human judge.
Project: lllyasviel.github.io/frame_pack_g...
Image-to-5-Seconds (30fps, 150 frames)
Project: lllyasviel.github.io/frame_pack_g...
Image-to-5-Seconds (30fps, 150 frames)
BitNet achieves performance on par with leading full-precision LLMs — and it’s blazingly fast⚡️⚡️uses much lower memory🎉
Everything is open-sourced, per them.
BitNet achieves performance on par with leading full-precision LLMs — and it’s blazingly fast⚡️⚡️uses much lower memory🎉
Everything is open-sourced, per them.
This blog post examines the various flavors of “Deep Research” from a technical implementation perspective.
leehanchung.github.io/blogs/2025/0...
This blog post examines the various flavors of “Deep Research” from a technical implementation perspective.
leehanchung.github.io/blogs/2025/0...
blog.google/technology/d...
blog.google/technology/d...
PapersChat provides an agentic AI interface for querying papers, retrieving insights from ArXiv & PubMed, and structuring responses efficiently.
github.com/AstraBert/Pa...
PapersChat provides an agentic AI interface for querying papers, retrieving insights from ArXiv & PubMed, and structuring responses efficiently.
github.com/AstraBert/Pa...
- Stat for LLM: How statistical methods can improve LLM uncertainty quantification, interpretability, trustworthiness & more.
- Stat for LLM: How statistical methods can improve LLM uncertainty quantification, interpretability, trustworthiness & more.
This is what it came up with on its own.
This is what it came up with on its own.
They find that they learn to share highly abstract grammatical concept representations, even across unrelated languages!
They find that they learn to share highly abstract grammatical concept representations, even across unrelated languages!
Original release: 8 models, 540K downloads. Just the beginning...
The community turned those open-weight models into +550 NEW models on @huggingface. Total downloads? 2.5M—nearly 5X the originals.
Original release: 8 models, 540K downloads. Just the beginning...
The community turned those open-weight models into +550 NEW models on @huggingface. Total downloads? 2.5M—nearly 5X the originals.
sakana.ai/transformer-...
Adaptation is a remarkable natural phenomenon, like how the octopus blends into its environment, or how the brain rewires itself after injury.
🧵 1/N
sakana.ai/transformer-...
Adaptation is a remarkable natural phenomenon, like how the octopus blends into its environment, or how the brain rewires itself after injury.
🧵 1/N
sakana.ai/transformer-...
Adaptation is a remarkable natural phenomenon, like how the octopus blends into its environment, or how the brain rewires itself after injury.
🧵 1/N
- Includes a "Deep Thinking" mode, surpassing O1-preview and O1 models on the AIME benchmark.
- Outperforms deepseek-v3, gpt4o, and llama3.1-405B on popular benchmarks.
team.doubao.com/en/special/d...
- Includes a "Deep Thinking" mode, surpassing O1-preview and O1 models on the AIME benchmark.
- Outperforms deepseek-v3, gpt4o, and llama3.1-405B on popular benchmarks.
team.doubao.com/en/special/d...