Lightnews — Scholar-powered news

Hugo Larcher

@hlarcher.bsky.social

1.5K followers 120 following 2 posts

ML Infra engineer @huggingface. HPC and ML infra.

Posts Replies Media Videos

Hugo Larcher

@hlarcher.bsky.social

This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned 🤩!

January 16, 2025 at 9:39 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news