Segmond
@segmond.bsky.social
I once was here.
It's not blazing fast for me, and I never did try to optimize for speed, but I get 1780 t/s prompt eval on 3126 tokens and 21.5 t/s generating 2873 tokens across 2 3090s, using llama.cpp. 976 MB fp16 K cache and the same for V, with 32k context. I like that it generates long context without yapping.
April 24, 2025 at 7:27 PM
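A minimal sketch of a comparable setup via the llama-cpp-python bindings, matching the numbers in the post (32k context, fp16 K/V cache, full GPU offload). The model path and the even tensor split across the two 3090s are assumptions, not details from the post:

```python
# Sketch only: mirrors the post's llama.cpp configuration.
from llama_cpp import Llama

llm = Llama(
    model_path="model.gguf",   # hypothetical: any GGUF model file
    n_ctx=32768,               # 32k context window, as in the post
    n_gpu_layers=-1,           # offload every layer to the GPUs
    tensor_split=[0.5, 0.5],   # assumed even split across the two 3090s
    verbose=True,              # llama.cpp prints prompt-eval and gen t/s
)

out = llm("Summarize the following document:\n...", max_tokens=512)
print(out["choices"][0]["text"])
```

The K/V cache defaults to fp16 here, so no extra flag is needed to match the ~976 MB fp16 K and V caches mentioned above.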
Aside from yesterday's 2nd part, I'm not reading the puzzles. I'm just asking the LLM to solve them, and it solves them faster than I can read them, roughly a minute each. With 4090s they would probably be solved in 15-20 seconds.
December 5, 2024 at 5:57 AM
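A hedged sketch of timing that workflow, assuming the `llm` instance from the previous snippet and a hypothetical puzzle.txt holding the problem statement:

```python
# Sketch only: wall-clock timing of an LLM attempt at a puzzle, assuming
# the `llm` object defined above and a hypothetical puzzle.txt.
import time

with open("puzzle.txt") as f:
    puzzle = f.read()

start = time.perf_counter()
out = llm(
    f"Write a Python program that solves this puzzle:\n\n{puzzle}",
    max_tokens=2048,
)
elapsed = time.perf_counter() - start

print(out["choices"][0]["text"])
print(f"Solved in {elapsed:.1f}s")  # roughly a minute on 2 3090s per the post
```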