Lightnews — Scholar-powered news

Appu Shaji

@appughar.bsky.social

480 followers 260 following 29 posts

CEO and Founder at Mobius Labs.

Here are for discussions in various facets of AI, such as multimodality, quantisation, efficiency and more. A few of our recent work appears at https://blog.mobiuslabs.com/

Posts Replies Media Videos

Appu Shaji

@appughar.bsky.social

Re-distilling a distilled model ( Qwen-Deepseek R1 1.5B ) . Getting few percentage point increase in benchmarks.

Mobius Labs @mobius-labs.bsky.social · Jan 24

Our re-distilled Deepseek R1 (1.5B) outperforms the original distilled model! Get it at huggingface.co/mobiuslabsgm.... We’re distilling more models and look forward to releasing them soon!

January 24, 2025 at 5:36 PM

Appu Shaji

@appughar.bsky.social

Super thrilled to release a new version of gemlite, delivering up to 7–8x faster prefill and 3–6x faster batch decoding speed 🚀🚀🚀🚀🚀 compared to PyTorch's tinygemm.

Mobius Labs @mobius-labs.bsky.social · Dec 5

Releasing a new version of Gemlite github.com/mobiusml/gem... significantly improved performance on datacenter GPUS (A100/H100) delivering up to 7–8x faster prefill and 3–6x faster batch decoding compared to PyTorch's tinygemm.

GitHub - mobiusml/gemlite: Fast low-bit matmul kernels in Triton

Fast low-bit matmul kernels in Triton. Contribute to mobiusml/gemlite development by creating an account on GitHub.

github.com

December 5, 2024 at 2:45 PM

Appu Shaji

@appughar.bsky.social

"Many years later, as he faced the firing squad, Colonel Aureliano Buendía was to remember that distant afternoon when his father took him to discover ice."

Are there other examples of such tense melding in literature?

p.s..: Anticipatory apologies for not using the Spanish version.

December 3, 2024 at 11:31 AM

Appu Shaji

@appughar.bsky.social

This! In general, the goal of any review system should be to verify, reproduce, and push the boundaries of our collective scientific knowledge. Compared to openly reproducible code and evaluations, the merits of rushed and often opinionated reviews are frequently inferior. ( note: very ML specific )

Jeremy Howard @howard.fm · Nov 25

Personally, I assess the quality of a work by studying and running its code, checking its analysis, considering its claims in the light of the data provided, and considering its impact based on my own understanding of the field.

I doubt a random reviewer would do a better job in their limited time.

November 25, 2024 at 1:44 PM

Reposted by Appu Shaji

Mobius Labs

@mobius-labs.bsky.social

Really happy to contribute to the batched version of faster-whisper that is 4x faster and more accurate 🚀🚀🚀

github.com/SYSTRAN/fast...

Release faster-whisper 1.1.0 · SYSTRAN/faster-whisper

New Features New batched inference that is 4x faster and accurate, Refer to README on usage instructions. Support for the new large-v3-turbo model. VAD filter is now 3x faster on CPU. Feature Extr...

github.com

November 25, 2024 at 11:32 AM

Appu Shaji

@appughar.bsky.social

Hello, everyone! I love the AI community on X, though not so much the constant squabbling and bickering. I'm here with a faint hope to find more of the former and less of the latter.

November 19, 2024 at 8:30 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news