Zack Angelo
zackangelo.bsky.social
building ai inference @ mixlayer
just realized bsky doesn't support gifs lol
December 15, 2024 at 2:40 PM
functions can even compose; here's the model using the output of one as the input to another
December 13, 2024 at 8:24 PM
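A minimal sketch of the kind of composition the post above describes, using two hypothetical tools (not Mixlayer's actual API): the model calls one function, then feeds its result into a second call.

```python
# Hypothetical tools for illustration only; a real setup would dispatch these
# from the model's emitted tool calls rather than call them directly.

def get_weather(city: str) -> dict:
    # pretend weather lookup returning structured data
    return {"city": city, "temp_f": 41, "conditions": "overcast"}

def summarize(weather: dict) -> str:
    # turns the structured result into a one-line summary
    return f"{weather['city']}: {weather['temp_f']}F and {weather['conditions']}"

# Composed chain as the model might emit it:
#   call 1 -> get_weather("Chicago")
#   call 2 -> summarize(<output of call 1>)
result = summarize(get_weather("Chicago"))
print(result)  # Chicago: 41F and overcast
```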
weird that the instruction-tuned Llama 3 8B models are downloaded less than the originals?
December 4, 2024 at 3:53 PM
I doubt they'd switch to a lower-precision model, but I wouldn't be surprised if they start using a quantized or fp8 KV cache. That's much easier to swap out dynamically in response to load than the model weights.
November 23, 2024 at 5:43 PM
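Back-of-envelope sketch of why an fp8 KV cache is attractive under load, assuming Llama-3-8B-ish dimensions (32 layers, 8 KV heads with GQA, head dim 128; these numbers are assumptions for illustration):

```python
# KV cache size = 2 (K and V) * layers * kv_heads * head_dim * seq_len * batch * bytes/elem
def kv_cache_bytes(seq_len, batch, layers=32, kv_heads=8, head_dim=128, bytes_per_elem=2):
    return 2 * layers * kv_heads * head_dim * seq_len * batch * bytes_per_elem

fp16 = kv_cache_bytes(seq_len=8192, batch=8, bytes_per_elem=2)
fp8 = kv_cache_bytes(seq_len=8192, batch=8, bytes_per_elem=1)
print(f"fp16: {fp16 / 2**30:.1f} GiB, fp8: {fp8 / 2**30:.1f} GiB")
# fp16: 8.0 GiB, fp8: 4.0 GiB
```

Halving the per-token cache footprint roughly doubles the concurrent sequences a server can hold, which is why flipping the cache dtype in response to load is an easier lever than reloading quantized weights.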