Lightnews — Scholar-powered news

Reposted

Lorenz Meyer

@thereallorenzmeyer.bsky.social

Dear media, I can no longer stand to see the headline "No agreement between the USA and Denmark." When an armed man storms a bank, you don't run the headline "Robber and teller fail to reach consensus on cash handover," do you? Stop framing imperial aggression as normal diplomacy.

January 15, 2026 at 10:44 AM

boydgraber.bsky.social

@boydgraber.bsky.social

Today's the deadline to apply for an AI-specific teaching track position at UMD:

umd.wd1.myworkdayjobs.com/UMCP/job/Uni...

Please join us!

August 22, 2025 at 3:47 PM

boydgraber.bsky.social

@boydgraber.bsky.social

My students and I are presenting three papers on Monday at #ACL2025 and this thread will recap them (including their videos).

July 28, 2025 at 8:35 AM

boydgraber.bsky.social

@boydgraber.bsky.social

The precursor to this paper "The Incoherence of Coherence" had our most-watched paper video ever, so I thought we had to surpass it somehow ... so we decided to do a song parody (of Roxanne, obviously):

youtu.be/87OBxEM8a9E

July 18, 2025 at 6:37 PM

boydgraber.bsky.social

@boydgraber.bsky.social

We had our first human–computer cooperative AI tournament at the UMD. Key takeaways: 1) computers are getting better at trivia 2) they still suck at calibration 3) our teaming mechanic kept the games competitive and mostly fun (at least that’s what the players said).

Human-Computer AI Collaborative Tournament Gameplay

June 17, 2025 at 3:35 PM

boydgraber.bsky.social

@boydgraber.bsky.social

Today is the deadline to sign up for our Human-Computer trivia competition held on June 14, 2025 in College Park, MD. $150 prize for the team that can answer the most questions with the help of an AI.

June 10, 2025 at 4:23 PM

boydgraber.bsky.social

@boydgraber.bsky.social

Do you like trivia? Can you spot when AI is feeding you BS? Or can you make AIs turn themselves inside out? Then on June 14 at College Park (or June 21 online), we have a competition for you.

June 5, 2025 at 4:17 PM

Reposted

Alexander Doria

@dorialexander.bsky.social

New Pleias paper: "What the HellaSwag?"
HellaSwag is currently one of the most widely used LLM benchmarks in the world. We introduce a new critical method to assess the validity of standard LLM evals and show it does not accurately measure common sense reasoning. arxiv.org/abs/2504.07825

April 14, 2025 at 3:44 PM

Reposted

Nishant Balepur

@nbalepur.bsky.social

This was a really fun paper to put together with Rachel and @boydgraber.bsky.social allowing me to vent many of my frustrations working with MCQA over the past year 😪🫡

Please check out the paper, we would love to hear your feedback! 📄👇

February 24, 2025 at 9:04 PM

Reposted

William Jurayj

@williamjurayj.bsky.social

🚨 You are only evaluating a slice of your test-time scaling model's performance! 🚨

📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!

📝: arxiv.org/abs/2502.13962

February 20, 2025 at 3:14 PM

boydgraber.bsky.social

@boydgraber.bsky.social

Is anyone in my network connected to Align to Innovate? Or know somebody who is?

alignbio.org

Align to Innovate

Reproducible. Scalable. Sharable. Improving research science with programmable experiments.


February 5, 2025 at 2:04 AM

Reposted

Andrew Middleton

@mapcenter.com

Hi. I'm Andrew. I own New England's oldest map store because last year I moved across the country after an old guy retired and gave it to me Willy Wonka-style. Visit my store in Rhode Island. www.mapcenter.com

A nerd in a teal sweater holding a stack of books about railroads in front of a wall of framed maps.

December 17, 2024 at 11:18 PM

boydgraber.bsky.social

@boydgraber.bsky.social

In about half an hour, I'll be doing my annual Q&A session on grad admissions:

youtube.com/live/jVjTbPH...


December 6, 2024 at 1:26 PM

Reposted

Birds Are Dinosaurs

@birdsaredinosaurs.bsky.social

At its heart, Star Trek is a utopian fantasy about a society so advanced that they are capable of holding productive meetings that last no longer than three minutes

A TNG scene. We're in the Enterprise conference room, where Picard is holding a meeting with Data, Troi, and Dr Crusher. Barclay is also there, for unknown reasons. Maybe he wandered into the meeting and just sat down, and everyone was too polite to mention it. That happened to me once. I was visiting an office location I didn't normally go to and I wasn't quite sure which conference room I was supposed to be in. I walked into one and sat down and it took me five minutes to realize I was in the wrong meeting. From the perspective of the people in that room, halfway through that meeting, a stranger walked in, sat next to the boss, took notes for five minutes, then walked out without saying a word.

December 3, 2024 at 4:58 PM

boydgraber.bsky.social

@boydgraber.bsky.social

I just made my way to Bluesky, so I thought it might be a good opportunity to shamelessly remind people to vote in the ACL board elections (where I'm running for an at-large post on a platform of improving virtual conferences).

Check your e-mail for "Reminder: ACL 2024 Elections - Please Vote".

November 26, 2024 at 8:14 PM
