Lightnews — Scholar-powered news

Reposted by Suzana Ilić

Daniel van Strien

@danielvanstrien.bsky.social

ColBench: Technical framework for multi-turn LLM reasoning evaluation

- Reliable simulation with LLMs as human collaborators
- Functional verifiers measuring similarity to reference artefacts
- Supports both backend programming and visual frontend design

huggingface.co/datasets/fac...

facebook/collaborative_agent_bench · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

March 24, 2025 at 9:24 AM

Reposted by Suzana Ilić

Luba Elliott

@elluba.bsky.social

Time for my #CreativeAI meetup to return 🤖🥳

After many years, join us on 5th March at Newspeak House with @terencebroad.bsky.social from UAL CCI, the AI research artist @cheesetalk.bsky.social and Thu Nguyen-Phuoc from Meta

Sign up here: bit.ly/4jTb7rD

Creative AI meetup: The Return · Luma

This event will host talks by artists and researchers presenting AI technologies and their creative applications. Event schedule: 18:30 Arrival 19:00 Terence…

bit.ly

February 13, 2025 at 5:36 PM

Suzana Ilić

@suzanailic.bsky.social

I’m still not sure about Bluesky. Is this working for folks? Who should I follow?

February 11, 2025 at 2:02 PM

Suzana Ilić

@suzanailic.bsky.social

Very interesting work!

Yoshua Bengio @yoshuabengio.bsky.social · Jan 29

Today, we are publishing the first-ever International AI Safety Report, backed by 30 countries and the OECD, UN, and EU.

It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵

Full Report: assets.publishing.service.gov.uk/media/679a0c...

1/21

January 29, 2025 at 2:24 PM

Reposted by Suzana Ilić

Daniel van Strien

@danielvanstrien.bsky.social

🎉 50,000+ annotations reached! The FineWeb2-C community is helping build better language models one annotation at a time.

📊 Current stats:
- 115 languages represented
- 419 amazing contributors
- 24 languages with complete datasets

But we're not done yet! 🧵

Screenshot of this text: Total annotations submitted: 50,035 Languages with annotations: 115 Total contributors: 419

January 16, 2025 at 5:32 PM

Suzana Ilić

@suzanailic.bsky.social

Microsoft Responsible AI Mixer in NYC happening now!

January 23, 2025 at 11:44 PM

Suzana Ilić

@suzanailic.bsky.social

On the Origin of Deep Learning
arxiv.org/pdf/1702.07800

January 22, 2025 at 1:20 PM

Suzana Ilić

@suzanailic.bsky.social

Wait..

January 22, 2025 at 1:38 AM

Suzana Ilić

@suzanailic.bsky.social

Happy New Year, friends!

January 1, 2025 at 1:26 PM

Suzana Ilić

@suzanailic.bsky.social

woohooo! 🙌

December 19, 2024 at 12:45 PM

Suzana Ilić

@suzanailic.bsky.social

Friday read: The o1 System Card cdn.openai.com/o1-system-ca...

cdn.openai.com

December 6, 2024 at 2:35 PM

Reposted by Suzana Ilić

Lichess

@lichess.org

The Lichess database of games, puzzles, and engine evaluations is now on @hf.co - https://huggingface.co/Lichess. Billions of chess data points to download, query, and stream and we're excited to see what you'll build with it! ♟️ 🤗

December 6, 2024 at 9:46 AM
