Kaizen
banner
kaizenteki.bsky.social
Kaizen
@kaizenteki.bsky.social
🏳️‍🌈 🫂🏳️‍⚧️
Tech / science / books / LGBT / music stuff I dig.
LoRA makes "intruder dimensions" in singular vectors that differ from the pre-trained model while full fine-tune remains spectrally ~ to the base model. The takeaway LoRA excels at instruction fine-tuning with smaller datasets, while full fine-tuning shines in continued pre-training scenarios.
December 5, 2025 at 4:22 AM
LoRA Without Regret
How LoRA matches full training performance more broadly than expected.
thinkingmachines.ai
December 5, 2025 at 4:15 AM
November 18, 2025 at 4:10 AM
Kimi K2 - ~5 million
November 8, 2025 at 5:03 PM