Peter Gray
peteryugray.bsky.social
AI / ML comms person (formerly Meta, Linden Lab). Guitar in Butterfly Knives. Vespa enthusiast.
tl;dr: some parameters are much more important than others, and in some cases removing just one can turn an LLM's output into nonsense
August 21, 2025 at 6:13 PM
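A hypothetical toy (the weights, shapes, and values here are illustrative, not from the paper) showing how a single outsized "super weight" in a linear layer can dominate the output, so ablating it perturbs the result far more than ablating an ordinary weight:

```python
import numpy as np

# All values are made up for illustration: ordinary small weights
# plus one large-magnitude "super weight".
W = np.full((4, 4), 0.01)   # ordinary small weights
W[2, 2] = 5.0               # a single outsized weight
x = np.ones(4)

baseline = W @ x

# Ablate (zero out) the super weight vs. an ordinary weight.
ablate_super = W.copy(); ablate_super[2, 2] = 0.0
ablate_plain = W.copy(); ablate_plain[0, 0] = 0.0

d_super = np.linalg.norm(ablate_super @ x - baseline)  # 5.0
d_plain = np.linalg.norm(ablate_plain @ x - baseline)  # 0.01
print(d_super / d_plain)  # 500.0
```

In a real LLM the effect compounds across layers, which is why zeroing one such parameter can collapse generation quality entirely.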
The inference code, model checkpoints, and an iOS/macOS demo app based on MLX are available here: github.com/apple/ml-fas...
GitHub - apple/ml-fastvlm: This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
July 23, 2025 at 6:35 PM
How fast is it? Here's the demo app running FastVLM 0.5B model on iPhone 16 Pro. Time to first token is shown on the screen, highlighting near real-time performance.
July 23, 2025 at 6:35 PM
And for a comprehensive overview of Apple research at the conference - including the complete schedule of orals, posters, workshops, booth programming and more - see this post: machinelearning.apple.com/updates/appl...
July 11, 2025 at 5:12 PM
Accepted as a Spotlight at @iclr-conf.bsky.social, the work shares a new method for fine-grained control over #genAI output - without the computational overhead, complexity, and volume of data needed by #RLHF or fine-tuning, and with more reliable results than prompt engineering.
April 10, 2025 at 5:28 PM
Congratulations!
January 16, 2025 at 6:26 PM
Devs can now benefit from faster inference for their production LLMs on NVIDIA GPUs - benchmarking shows 2.7x acceleration in token generation. 5/5
December 18, 2024 at 10:15 PM
To make this advancement production-ready for NVIDIA GPUs, the team collaborated with NVIDIA to integrate ReDrafter into the NVIDIA TensorRT-LLM framework: developer.nvidia.com/blog/nvidia-... 4/5
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference | NVIDIA Technical Blog
Recurrent drafting (referred to as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM) inference now available with NVIDIA TensorRT-LLM...
December 18, 2024 at 10:15 PM
Earlier this year, Apple Machine Learning researchers published & open sourced ReDrafter, a novel approach to speculative decoding: machinelearning.apple.com/research/rec... 2/5
December 18, 2024 at 10:15 PM
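The core draft-and-verify loop behind speculative decoding can be sketched minimally as below. This is a generic greedy-decoding illustration, not Apple's ReDrafter implementation; the `target_next` and `draft_next` functions are stand-ins for a large target model and a cheap draft model:

```python
# Minimal greedy speculative-decoding sketch (illustrative only; not
# Apple's ReDrafter). A cheap draft model proposes k tokens; the target
# model verifies them and accepts the longest matching prefix, so each
# step emits at least one token while calling the target model less often.

def target_next(ctx):
    # stand-in "large model": deterministic toy next-token rule
    return (sum(ctx) * 31 + 7) % 100

def draft_next(ctx):
    # stand-in "draft model": agrees with the target most of the time
    t = target_next(ctx)
    return t if t % 5 else (t + 1) % 100   # diverges on multiples of 5

def speculative_step(ctx, k=4):
    # 1) draft k tokens autoregressively with the cheap model
    draft, c = [], list(ctx)
    for _ in range(k):
        tok = draft_next(c)
        draft.append(tok)
        c.append(tok)
    # 2) verify: accept the longest prefix the target model agrees with,
    #    then emit one corrected token from the target itself
    accepted, c = [], list(ctx)
    for tok in draft:
        if target_next(c) == tok:
            accepted.append(tok)
            c.append(tok)
        else:
            break
    accepted.append(target_next(c))   # always gain at least one token
    return accepted

print(speculative_step([1, 2, 3]))
```

By construction the accepted tokens exactly match what greedy decoding with the target model alone would produce; the speedup comes from verifying the drafted tokens in a single batched target pass rather than one pass per token.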