Lightnews — Scholar-powered news

nick

@bukovec.dev

19 followers 36 following 11 posts

mariners and machine learning
ee ms student @ stanford


nick

@bukovec.dev

big "model distillation" wants you to believe that you can

January 29, 2025 at 7:15 PM

nick

@bukovec.dev

bring back the hackintosh

January 29, 2025 at 5:52 AM

nick

@bukovec.dev

Here's an article about using QLoRA on Llama 2 and Mistral using a 3090. Although the tricky thing with R1 is that it's MoE, so I think you'll have to load all 671B params into memory for training. It might be easier to fine-tune one of the Llama-distilled versions.

medium.com/@geronimo7/f...

Finetuning Llama 2 and Mistral

A beginner’s guide to finetuning LLMs with QLoRA

medium.com

January 29, 2025 at 5:36 AM
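
To put rough numbers on the memory point in the post above, here is a back-of-the-envelope sketch. It assumes R1's publicly reported total of 671B parameters, a 24 GB RTX 3090, 4-bit quantized weights as in QLoRA, and an 8B distilled model as the comparison. It counts weight storage only and ignores activations, adapter weights, optimizer state, and quantization overhead, so real usage would be higher. The function names are made up for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for the model weights alone, in GB (1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


def fits_on_gpu(num_params: float, bits_per_param: float, vram_gb: float) -> bool:
    """True if the weights alone fit in the given VRAM (a lower bound only)."""
    return weight_memory_gb(num_params, bits_per_param) <= vram_gb


RTX_3090_VRAM_GB = 24    # VRAM of a single RTX 3090
R1_TOTAL_PARAMS = 671e9  # assumed: all experts must be resident, not just the active ones
DISTILL_8B_PARAMS = 8e9  # assumed: approximate size of an 8B Llama-distilled model

for name, params in [("R1 (full MoE)", R1_TOTAL_PARAMS), ("8B distill", DISTILL_8B_PARAMS)]:
    gb = weight_memory_gb(params, bits_per_param=4)
    ok = fits_on_gpu(params, 4, RTX_3090_VRAM_GB)
    print(f"{name}: ~{gb:.1f} GB of 4-bit weights, fits on one 3090: {ok}")
```

Under these assumptions the full MoE needs hundreds of GB for weights even at 4 bits, while the 8B distill leaves most of the card free for adapters and activations.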

nick

@bukovec.dev

since i’m just starting to mess around with MoE, the use of dynamic biases is really interesting to me. a super cool intuition!

December 29, 2024 at 11:19 PM
