Mariusz Kurman
mkurman.bsky.social
AI Tech Lead @ Kruk SA | CEO @ MedIT Solutions | MD | Medcases.io app creator
Here is my experimental Llama 3.2 3B with o1-like thinking. It uses Thoughts only when needed, so don't be surprised when it skips them.

Enjoy!

Give some likes to make me feel better 😂

huggingface.co/mkurman/llam...
mkurman/llama-3.2-MEDIT-3B-o1 · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
January 4, 2025 at 1:17 PM
storm.genie.stanford.edu - A great tool from Stanford for creating articles. For me, a stronger Gemini with Deep Thinking. Definitely worth trying!
January 1, 2025 at 5:34 PM
DeepSeek's MTP (multi-token prediction) is something you should definitely look at.
December 28, 2024 at 2:35 PM
Predicting the next token as a learning objective is insufficient for optimal LLM training.
December 28, 2024 at 1:35 AM
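To make the contrast with plain next-token prediction concrete, here is a minimal sketch of a generic multi-token objective. It assumes a set of hypothetical prediction heads, where head k is asked to predict the token k steps further ahead, and it simply averages their cross-entropy losses. The function names and the head layout are illustrative assumptions; DeepSeek's actual MTP modules are structured differently, so treat this as the general idea only.

```python
import math


def cross_entropy(logits, target):
    """Negative log-probability of `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def multi_token_loss(head_logits, tokens, position):
    """Average cross-entropy over several future tokens (illustrative only).

    head_logits[k] holds the logits of hypothetical head k, which predicts
    the token at position + 1 + k. With a single head this reduces to the
    usual next-token loss.
    """
    losses = []
    for k, logits in enumerate(head_logits):
        target_index = position + 1 + k
        if target_index >= len(tokens):
            break  # no ground-truth token that far ahead
        losses.append(cross_entropy(logits, tokens[target_index]))
    if not losses:
        raise ValueError("no future tokens available at this position")
    return sum(losses) / len(losses)
```

Passing a single head gives the standard next-token loss, so the extra heads act as additional training signal on top of it.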
HDIC - How Do I Contribute?

A new technique we are working on seems to have a huge impact on language models' generative capabilities by letting each layer estimate its own contribution to the final prediction.
December 4, 2024 at 6:48 PM
RIP JetBrains subscription ☠️ After six years, it became too heavy to use as a daily IDE. I'm now on team VS Code.
December 4, 2024 at 10:04 AM
What research tools would you recommend for searching and analyzing scientific papers?
December 3, 2024 at 2:18 PM
We built a new small language model, SmolLM2-MedIT-Upscale-2B, based on SmolLM2-1.7B-Instruct from Hugging Face. The premise was simple: increasing the size of the vectors in the attention layers would positively impact the model's capabilities.

What did we prove? 1/4
meditsolutions/SmolLM2-MedIT-Upscale-2B · Hugging Face
December 2, 2024 at 2:54 PM
This video clip is an excellent demonstration of exceptional uses of AI.

youtu.be/MMryYio0v6k?...
QUEBONAFIDE - FUTURAMA 3
YouTube video by PÓŁNOC / POŁUDNIE
youtu.be
November 30, 2024 at 11:28 PM
It looks like we can scale up any model not only in depth but also in width. Insane.
#llm
November 29, 2024 at 11:26 AM