Oleksii Kuchaiev
banner
kuchaev.bsky.social
Oleksii Kuchaiev
@kuchaev.bsky.social
AI model alignment @ NVIDIA
I’ll be in Singapore attending ICLR2025. Looking forward to chatting in person about model post-training, alignment and reasoning! ✈️🇸🇬
April 21, 2025 at 10:45 PM
New base models from NVIDIA - Nemotron-H: mamba-transformer hybrids are now on @hf.co hub huggingface.co/collections/...
Nemotron-H - a nvidia Collection
Mamba-Transformer hybrid models
huggingface.co
April 14, 2025 at 6:46 PM
New paper from our team. An inference-time scaling approach which can boost non-math benchmarks such as Arena-Hard of existing models. We get Arena-Hard of 92.7 for 70B model. As of 5 Mar 2025, surpassing o1-preview-2024-09- 12 (90.4) and DS-R1 (92.3). arxiv.org/pdf/2503.04378
March 7, 2025 at 6:42 PM
My favorite AI conference, GTC, is coming back to San Jose, California on March 17-21! Join us and thousands of other developers and innovators. This link gives you 25% off your conference pass www.nvidia.com/gtc/?ncid=GT...
GTC AI Conference 2025
Experience In Person and Online.
www.nvidia.com
March 4, 2025 at 8:50 PM
Our team put together a unified mathematical framework to analyze popular model alignment algorithms. “Reward-aware Preference Optimization: A Unified Mathematical Framework
for Model Alignment” arxiv.org/pdf/2502.00203.
February 4, 2025 at 5:25 PM