Lightnews — Scholar-powered news

Reposted

alphaXiv

@alphaxiv.org

We used DeepSeek-V3 to classify every AI paper on arXiv by topic (agents, VLMs, etc) 🚀

Now you can instantly filter to see what's trending in each area 🚨

February 14, 2025 at 12:27 AM

Reposted

Amine Ouazad

@amineouazad.bsky.social

The peer review process should be opened in economics. One way to do this is to embrace @alphaxiv.org www.alphaxiv.org/explore/papers

December 31, 2024 at 5:29 PM

Reposted

alphaXiv

@alphaxiv.org

🚀The return of BERT, LLMs that beat physicians?!, and the dawn of agents to simulate users. Presenting Santa's nice list for AI! 🎅

- ModernBERT
- Superhuman performance of LLMs against Physicians (discussion with author @adamrodmanmd.bsky.social )
- LMagent: A Large-scale Multimodal Agents Society

December 21, 2024 at 7:48 PM

Reposted

alphaXiv

@alphaxiv.org

Thank you so much @jieyusz.bsky.social and @mameister4.bsky.social! We're really glad that alphaXiv can help authors through community feedback and open discussion :)

Jay Patel @infotainment.bsky.social · Dec 17

@alphaxiv.org : Some strong praise for you in a recent paper's Acknowledgement section. I stumbled across it today:

www.cell.com/neuron/fullt...

December 18, 2024 at 12:12 AM

Reposted

alphaXiv

@alphaxiv.org

🚀 Trending AI papers on alphaXiv this week

- Star Attention (discussion with author @shantanuacharya.bsky.social)

- One Diffusion to Generate Them All (discussion with author Duong Le from The Allen Institute for AI)

December 2, 2024 at 6:39 PM

Reposted

alphaXiv

@alphaxiv.org

LLaVA-o1 is the first visual language model capable of systematic reasoning similar to GPT-o1 🚀

But how does it perform on multimodal math reasoning questions? 🔎

New numbers from LlaVA-o1 on the MathVision Dataset from author Guowei Xu

LLaVA-o1 (11B): 23.7%
Qwen2-VL-72B: 25.9%
Qwen2-VL-7B: 16.3%

November 28, 2024 at 1:55 AM

Reposted

alphaXiv

@alphaxiv.org

Finding good data mixtures for LLM training can be tricky - Aioli provides a unified framework to construct pre-training data mixtures. Talk to the authors @MayeeChen @michahu.bsky.social @nicholaslourie.bsky.social @kchonyc Christopher Ré @HazyResearch directly here!

November 18, 2024 at 11:42 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news