Suzana Ilić
suzanailic.bsky.social
Suzana Ilić
@suzanailic.bsky.social
Reposted by Suzana Ilić
ColBench: Technical framework for multi-turn LLM reasoning evaluation

- Reliable simulation with LLMs as human collaborators
- Functional verifiers measuring similarity to reference artefacts
- Supports both backend programming and visual frontend design

huggingface.co/datasets/fac...
facebook/collaborative_agent_bench · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
March 24, 2025 at 9:24 AM
Reposted by Suzana Ilić
Time for my #CreativeAI meetup to return 🤖🥳

After many years, join us on 5th March at Newspeak House with @terencebroad.bsky.social from UAL CCI, the AI research artist @cheesetalk.bsky.social and Thu Nguyen-Phuoc from Meta

Sign up here: bit.ly/4jTb7rD
Creative AI meetup: The Return · Luma
This event will host talks artists and researchers presenting AI technologies and their creative applications. Event schedule: 18:30 Arrival 19:00 Terence…
bit.ly
February 13, 2025 at 5:36 PM
I‘m still not sure about bluesky- is this working for folks? Who should I follow?
February 11, 2025 at 2:02 PM
Very interesting work!
Today, we are publishing the first-ever International AI Safety Report, backed by 30 countries and the OECD, UN, and EU.

It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵

Full Report: assets.publishing.service.gov.uk/media/679a0c...

1/21
January 29, 2025 at 2:24 PM
Reposted by Suzana Ilić
🎉 50,000+ annotations reached! The FineWeb2-C community is helping build better language models on annotation at a time.

📊 Current stats:
- 115 languages represented
- 419 amazing contributors
- 24 languages with complete datasets

But we're not done yet! 🧵
January 16, 2025 at 5:32 PM
Microsoft Responsible AI Mixer in NYC happening now!
January 23, 2025 at 11:44 PM
On the Origin of Deep Learning
arxiv.org/pdf/1702.07800
January 22, 2025 at 1:20 PM
Wait..
January 22, 2025 at 1:38 AM
Happy New Year, friends!
January 1, 2025 at 1:26 PM
woohooo! 🙌
December 19, 2024 at 12:45 PM
Friday read: The o1 System Card cdn.openai.com/o1-system-ca...
cdn.openai.com
December 6, 2024 at 2:35 PM
Reposted by Suzana Ilić
The Lichess database of games, puzzles, and engine evaluations is now on @hf.co - https://huggingface.co/Lichess. Billions of chess data points to download, query, and stream and we're excited to see what you'll build with it! ♟️ 🤗
December 6, 2024 at 9:46 AM