SmolKaiju
@smolkaiju.bsky.social
Tinkering with hobby electronics, Python programming, LLMs, 3D printing, and all things novel.
Brazilian jiu-jitsu brown belt & MMA enthusiast.
Advocate for creativity, curiosity, and universal human dignity.
Idk how many people I’ve tried to explain this to, but the whole reason we haven’t had a WW3 is because of the close economic ties the entire world now has.

These ties make it mutually beneficial to not start another world war. Without these ties the chances of major wars grow much greater.
April 17, 2025 at 3:01 PM
Feel like I’m playing Russian roulette with Aliexpress these days.

Will my order cost me $10, $35, or $100 by the time it gets here? Who knows!

Find out next week on “These Tariffs are going to slow consumer spending and ruin our economy.”
April 17, 2025 at 2:57 PM
Mark my words, there will be an attempt to regulate open source AI out of existence in the next 4 years.

They’ll push it as a national security risk. They’ll say open source is too dangerous to share with foreign companies.

They may or may not be successful, but there will be an attempt.
January 29, 2025 at 7:43 PM
Reposted by SmolKaiju
Open Thoughts project

They are building the best reasoning datasets out in the open.

Building off their work with Stratos, today they are releasing OpenThoughts-114k and OpenThinker-7B.

Repo: github.com/open-thought...
January 29, 2025 at 6:49 AM
The federal government under the New Deal helped lift many into the middle class.

And the Republicans want to destroy those jobs & the middle class.

We all benefit from a larger middle class. The only ones who don’t are the billionaires trying to privatize our gov.
January 29, 2025 at 4:31 PM
In 2025, up is down and down is up lol
January 27, 2025 at 10:38 PM
When your best argument against Deepseek is “They have to follow Chinese laws!”

You just straight up don’t have an argument.

Some people just really hate open source I guess.
January 27, 2025 at 5:32 PM
She’s so tired of hearing about this stuff lol
January 27, 2025 at 3:43 PM
I’m glad to see Deepseek hit #1 in the App Store.

Wasn’t sure we’d ever see an open model get so much recognition.

What sucks is they now have message limits. I hope their eventual pro-tier is less than $20.
January 27, 2025 at 3:24 PM
Found my old iPod Touch today.

Man these things were amazing! I’d honestly buy another one if they launched it today.

Do I need one? No, but I love the size.
January 25, 2025 at 8:18 PM
Reposted by SmolKaiju
huggingface is doing a fully open source replication of R1 github.com/huggingface/...
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
January 25, 2025 at 2:31 PM
It’s funny how you’ll go months getting ghosted by recruiters and rarely any hits. Feeling like shit.

Then a month later you’re getting 3 interviews a week. Feeling on top of the world.

No changes to my resume. Guessing it’s because it’s January, but still interesting.
January 24, 2025 at 7:32 PM
Reposted by SmolKaiju
Introducing the smollest VLMs yet! 🤏
SmolVLM (256M & 500M) runs on <1GB GPU memory.
Fine-tune it on your laptop and run it on your toaster. 🚀
Even the 256M model outperforms our Idefics 80B (Aug '23).
How small can we go? 👀
January 23, 2025 at 1:33 PM
I’m trying to keep politics to a minimum, but when they continue to intersect with some of my hobbies, I find it hard to ignore.

Sama is a weasel.

Yeah, he’s doing it to appease, but the new admin wants to take away his rights and he’s cheering it along.

www.aclu.org/trump-on-lgbtq-rights
January 23, 2025 at 3:10 PM
Anyone else have a Skanner as a kid?

Well, some awesome dev combined Gen AI and a barcode battle game. It’s honestly simple but very fun.

apps.apple.com/us/app/warco...
January 23, 2025 at 3:08 AM
Quick cheap vegan meal

Mushrooms, tofu, potatoes, onions, mung bean sprouts, & broccoli. Slice them how you like and put them in a pan.

Season with olive oil, saffron, salt, turmeric & whatever else (I like a dash of MSG).

Bake 15m then broil 15m.

Around $10-15 to make!
January 22, 2025 at 3:15 AM
Reposted by SmolKaiju
To be clear, the recipe to replicate o1 style models is not new techniques, but applying them in a new way.
This shouldn't be surprising.
January 21, 2025 at 3:46 PM
Went on Threads for the first time in months.

You can’t call someone a liar or coward. Even when they lie and act cowardly.

What a terrible platform.
January 21, 2025 at 4:10 PM
R1 is finally on Hugging Chat!
🎉 🙌
January 21, 2025 at 4:05 PM
Reposted by SmolKaiju
1/5

So, how did DeepSeek develop DeepSeek R1?

They used both DeepSeek-V3-Base and a simple prompt:

1. They asked the same question multiple times to DeepSeek-V3-Base as a group.
2. They then graded the answers, assigning an accuracy score and a format score (e.g., <think></think>).
January 21, 2025 at 7:49 AM
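The two steps in the repost above (sample a group of answers, then grade each one for accuracy and format) can be sketched roughly like this. It is only an illustration, not DeepSeek's actual code: the function names, the exact-match accuracy check, the equal weighting of the two scores, and the mean/std normalization across the group are all assumptions, since the post does not say how the scores are combined or used.

```python
import re
import statistics

# Assumption: a well-formatted answer is a <think>...</think> block
# followed by a non-empty final answer.
THINK_PATTERN = re.compile(r"^<think>.*?</think>.+", re.DOTALL)


def format_score(answer: str) -> float:
    """Return 1.0 if the answer follows the <think></think> layout, else 0.0."""
    return 1.0 if THINK_PATTERN.match(answer.strip()) else 0.0


def accuracy_score(answer: str, reference: str) -> float:
    """Return 1.0 if the text after </think> equals the reference (hypothetical check)."""
    final = answer.split("</think>")[-1].strip()
    return 1.0 if final == reference.strip() else 0.0


def grade_group(answers: list[str], reference: str) -> list[float]:
    """Grade every answer in the group; equal weighting is an assumption."""
    return [accuracy_score(a, reference) + format_score(a) for a in answers]


def group_relative_advantages(rewards: list[float]) -> list[float]:
    """Score each answer relative to its own group (assumed mean/std normalization)."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    if std == 0:
        # Every answer scored the same, so none stands out.
        return [0.0 for _ in rewards]
    return [(r - mean) / std for r in rewards]


if __name__ == "__main__":
    # Hand-written stand-ins for answers sampled from a base model.
    group = ["<think>2+2</think>4", "4", "<think>hmm</think>5"]
    rewards = grade_group(group, reference="4")
    print(rewards)
    print(group_relative_advantages(rewards))
```

Answers that beat the group average get a positive value and the rest get a negative one, which is the kind of signal a training step could then use to favor the better answers.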
I love how Deepseek just came out and started dropping the best open models like it’s nothing.

V3 & R1 are all I really need rn.

OpenAI is probably a bit worried right about now.

Even Altman is on Twitter trying to quell the hype he and his employees created the past couple weeks.
January 21, 2025 at 2:25 PM
RedNote responds to Musk going mask off.

I was surprised they allowed this topic at all. Interesting to see their reactions.
January 21, 2025 at 7:06 AM
Reposted by SmolKaiju
Kimi k1.5 --- an o1-level multi-modal model

- SOTA short-CoT performance, outperforming GPT-4o and Claude Sonnet 3.5 on 📐AIME, 📐MATH-500, 💻 LiveCodeBench by a large margin (up to +550%)
- Long-CoT performance matches o1 across multiple modalities (👀MathVista, 📐AIME, 💻Codeforces, etc)

Demo: kimi.ai
January 20, 2025 at 4:49 PM
Reposted by SmolKaiju
Historian of fascism here. It was a Nazi salute and a very belligerent one too.
January 20, 2025 at 10:16 PM
Anyone that thinks Musk’s salute shouldn’t be posted because it’s triggering should go check out r/jewish

They’re obviously disgusted and scared, but they aren’t trying to stick their heads in the sand.

On here people want to cover it up. That’s a big mistake imo. No one believes until they see.
January 21, 2025 at 12:04 AM