Taneem
@taneem-ibrahim.bsky.social
Tinkering with vLLM @RedHat
FP8-quantized version of Llama 4 Maverick can be downloaded from HuggingFace: huggingface.co/collections/...
Llama 4 - a meta-llama Collection
Llama 4 release
April 5, 2025 at 8:22 PM
The official release by Meta includes an FP8-quantized version of Llama 4 Maverick 128E, produced with Red Hat’s LLM Compressor library. Quantization lets the 128-expert model fit on a single node of 8 NVIDIA H100 GPUs, delivering higher performance at lower cost.
April 5, 2025 at 8:20 PM