Ian Arawjo
ianarawjo.bsky.social
Ian Arawjo
@ianarawjo.bsky.social
Asst Prof at Université de Montréal, Associate Member of Mila-Quebec AI Institute. PhD from Cornell InfoSci. Creator of ChainForge. Programming and culture, LLM evaluation tooling.
The 🕯️ Candle Test: Can LLMs be tricked into changing their answer to a simple riddle? Turns out most top-of-the-line models—from Claude 3.7 to Llama 3.1—fail at this task, with the sole exception of GPT-4o: chainforge.ai/play/?f=65d5...
April 5, 2025 at 3:01 PM
A prophecy for programming in the future ---programming has always, and always will be, in a process of change. It is those who embrace the future monstrosity, as Derrida called it, that lead the space. From the end of the paper, "To Write Code," CHI 2020:
February 28, 2025 at 12:10 AM
NSF has made an official statement regarding the freeze (new.nsf.gov/executive-or...). All grant work associated with DEI (including “accessibility”) “principles and frameworks” is supposed to be “ceased”:
January 29, 2025 at 5:45 PM
December 23, 2024 at 9:05 PM
o3 is like a Christmas dream. Most of my daily work involves filling grid patterns with colors, so when this new model drops, it will be game-changing. 🙏
December 22, 2024 at 7:38 PM
New ChainForge release ✨out now: Generate a table from a prompt, extend rows, add a column like magic! Great for getting test data for getting started.
December 19, 2024 at 8:50 PM
We added a "add a column" feature: Extend an existing table by adding a new column, and flash-fill. Here, adding synopses of Miyazaki films to an existing table:
December 19, 2024 at 3:01 PM
Generating a table of test inputs...
December 17, 2024 at 4:23 PM
Well, that's a wrap. Finished my class notes for the Empirical Methods in HCI course this fall, and it came in at a whopping 152 (!!) pages of content. Still more refining to do before a public release, but happy to share the materials with instructors at this point.
December 4, 2024 at 12:19 AM
Went to a digital humanities conference and attendee had an interesting question... What LLMs will reply when asked about generalizations of ethnic groups? Here's a test ---GPT3.5 is happy to generalize for North America, not so much in Europe---and Claude Haiku always replies:
December 2, 2024 at 6:13 PM