Alexander
banner
alejocisme57.bsky.social
Alexander
@alejocisme57.bsky.social
Reposted by Alexander
DeepSeek introduced CodeI/O, a method that helps AI learn reasoning patterns hidden in code.

Models train to predict inputs/outputs of given code, all while explaining its reasoning with Chain-of-Thought (CoT) in natural language.

This improves models' general reasoning skills

Here's how:
February 17, 2025 at 11:34 AM
Reposted by Alexander
SwiftKV promises to make open source AI up to 75% cheaper, enhancing its efficiency

We spoke to Yuxiong He and Samyam Rajbhandari, Snowflake’s AI research leads, who explained:
- How SwiftKV works
- Additional methods, limitations
- Why they decided to open source it
www.youtube.com/watch?v=9x1k...
SwiftKV – HOW TO MAKE OPEN SOURCE AI UP TO 75% CHEAPER
YouTube video by Turing Post
www.youtube.com
January 18, 2025 at 12:03 AM
Reposted by Alexander
10 recent advancements in math reasoning:

▪️ AceMath from NVIDIA
▪️ Qwen2.5-Math-PRM and PROCESSBENCH evaluation
▪️ rStar-Math from @msftresearch.bsky.social
▪️ BoostStep
▪️ URSA
▪️ U-MATH
▪️ SVE-Math
...

It's a very interesting shift in AI! Check this out for more info: huggingface.co/posts/Ksenia...
January 19, 2025 at 11:11 PM
Reposted by Alexander
January 15, 2025 at 3:32 PM
Reposted by Alexander
New work from Alibaba_Qwen🔥

Qwen2.5-Math-PRM 7B & 72B 🔢 Process Reward Models for enhanced process supervision in the mathematical reasoning of LLMs.

Paper:
huggingface.co/papers/2501....
Model:
huggingface.co/Qwen/Qwen2.5...
huggingface.co/Qwen/Qwen2.5...
Paper page - The Lessons of Developing Process Reward Models in Mathematical Reasoning
Join the discussion on this paper page
huggingface.co
January 16, 2025 at 10:26 PM
Reposted by Alexander
AIs beat humans in basically all creativity tests like the Torrence Test, but it isn't clear what exactly that means compared to humans.
January 18, 2025 at 1:10 PM
Reposted by Alexander
Yet more evidence that people can’t accurately detect well-prompted AI writing (and AI can’t accurately detect well-prompted AI writing, either). arxiv.org/pdf/2407.08853
January 18, 2025 at 8:34 PM
Reposted by Alexander
AIs in the world can create complex systems with unexpected & risky feedback loops.

Each output affects the world, affecting inputs, affecting outputs... Even simple loops can drive optimization behavior, making systems turn extreme. No training needed, just interaction. arxiv.org/pdf/2402.06627
January 19, 2025 at 5:16 PM