Shachar Mirkin
banner
shacharmirkin.bsky.social
Shachar Mirkin
@shacharmirkin.bsky.social
Language and Natural Language Processing, mostly
#NLP #nlproc

https://shacharmirkin.github.io/
The impact of model size and dedicated hardware on inference energy consumption

MIAI Days Grenoble
June 20, 2025 at 8:27 AM
A nice thing they did at the conference last night was to prepare takeaway boxes from dinner and offer them to anyone who was leaving ("anti gaspillage")

MIAI Days Grenoble
June 20, 2025 at 7:36 AM
Using VR and AI to prepare for natural and subsequent technological crises.
Population is modeled as multiple agents.
They use this system to run different scenrios and train local autorithies and emergency services.
June 19, 2025 at 2:49 PM
Just talked with a researcher who studies AI regulation, esp. how Europe compares to the rest of the world.
He said that Japan recently published their AI Act, and that there’s no penalty for violations, just naming and shaming.
Apparently, that’s enough for Japan
June 19, 2025 at 7:32 AM
Attending MIAI Days 2025 @ Grenoble.
Will share a bit over here
June 19, 2025 at 7:31 AM
🌍UniversalNER, projet collaboratif NER multilingue, couvre déjà près de 20 langues… mais pas encore le français ! 😱
🇫🇷 Du coup, on cherche des annotateurs pour le français.

Rejoignez-nous sur Discord : discord.com/channels/125...
Discord - Group Chat That’s All Fun & Games
Discord is great for playing games and chilling with friends, or even building a worldwide community. Customize your own space to talk, play, and hang out.
discord.com
June 2, 2025 at 8:32 AM
I got paid to be reviewer 2 🥹
May 19, 2025 at 8:39 AM
🤷
May 13, 2025 at 4:06 PM
Many people (me included) are seeing Invalid Notebook messages on GitHub lately, so I wrote down all the solutions (workarounds) that worked for me in this gist
gist.github.com/shacharmirki...
April 18, 2025 at 12:17 PM
My son, Ofir Mirkin, is presenting today a poster about programmable inflatable panels (that's CS for physics) 🤩
March 27, 2025 at 3:57 PM
Why are LLMs always getting the number of fingers wrong?
March 12, 2025 at 3:40 PM
I love it when I read something and then later don't remember if it was in English or in French
February 19, 2025 at 8:27 AM
February 11, 2025 at 5:19 PM
February 11, 2025 at 5:18 PM
A math/physics problem: ⛷️

You ski down a slope and want to to gain speed. There's a little "hill" coming up, and you need to decide whether to continue on your steady slope or ski up the hill, to accelerate more on the way down the steeper slope >>
February 11, 2025 at 8:32 AM
If the LLM wrote a function, would you let the LLM write the unit tests as well or write them yourself?
February 9, 2025 at 8:37 AM
we now have AI-generated-text and AI-generated-text-detection, and then we have AI-generated-text-humanization and humanized-AI-generated-text-detection and then
January 12, 2025 at 11:27 AM
Has anyone managed to run Genesis on a Mac?
Genesis
Genesis is a comprehensive physics simulation platform designed for general purpose Robotics, Embodied AI, & Physical AI applications. It is simultaneously multiple things: A universal physics engine…
buff.ly
January 8, 2025 at 5:48 PM
On the occasion of the beginning of a new year, here's my technology forecast for the next 50 years*

*First published in 2018, I'm not changing a thing
January 2, 2025 at 10:57 AM
In a few years AI will replace us all, and the only remaining engineers will still be trying to solve writing a mixed left-to-right & right-to-left messages
December 23, 2024 at 10:20 AM
Just when I was hesitating whether it was still worth investing my time in that BERT project :)
December 20, 2024 at 12:22 PM
I created a little Colab notebook showing how to using multiple virtual environments in a single notebook.
If there's a better way to do it, I'll be happy to learn
using multiple virtual environments in Google Colab
using multiple virtual environments in Google Colab - multiple_venvs_in_colab.ipynb
buff.ly
December 19, 2024 at 10:29 AM
Reposted by Shachar Mirkin
Next week we're launching a collaborative annotation effort to build a big multilingual dataset, so you can have high-quality data in your language.

We are really close to getting leads for 100 languages! Can you help us cover the remaining 200?
December 3, 2024 at 12:45 PM
A little tip to newcomers from someone with a longer experience here (practically 1 week):
put an image and a bio before following starter packs, so people you follow know to follow you back
November 22, 2024 at 4:49 PM