Lightnews — Scholar-powered news

Reposted by Joe Stacey

Lisa Alazraki

@lisaalaz.bsky.social

We have released #AgentCoMa, an agentic reasoning benchmark where each task requires a mix of commonsense and math to be solved 🧐

LLM agents performing real-world tasks should be able to combine these different types of reasoning, but are they fit for the job? 🤔

🧵⬇️

August 28, 2025 at 2:01 PM

Joe Stacey

@joestacey.bsky.social

Here’s my review of the US after a few days here. Did I miss anything? 🤔

The good:

- Americans are the most charming, friendly and hospitable people
- it’s super fun how the country is split into states that all have different laws and stuff, with different vibes state to state

July 17, 2025 at 11:24 AM

Joe Stacey

@joestacey.bsky.social

Any chance Keir Starmer can reshuffle himself in as foreign secretary, and shuffle in another prime minister who actually has some vague idea about what they want to achieve? 🙏🤦‍♂️

July 2, 2025 at 5:34 PM

Joe Stacey

@joestacey.bsky.social

Finally the heatwave has ended, and the UK is once again a bearable place to be 😍😍

If you have any UK-based collaborations, their productivity is about to increase like 10 fold

July 2, 2025 at 11:52 AM

Joe Stacey

@joestacey.bsky.social

We have a fun new #NLProc paper on arXiv about improving the robustness of fine-tuned NLI models!

Have a look :)
arxiv.org/abs/2505.20209

May 27, 2025 at 3:50 PM

Joe Stacey

@joestacey.bsky.social

Should I use an LLM to help refine my paper writing for the ARR deadline? 🤔🤔

It will improve the paper for sure, but probably also making the tone a whole lot more annoying

May 18, 2025 at 9:05 AM

Reposted by Joe Stacey

Juan Diego Rodriguez

@juand-r.bsky.social

If you're at #NAACL2025 and want to hear about similarity effects for property inheritance in LMs, please stop by!

I will be presenting this work on Wednesday at the 11-12:30 poster session on Interpretability & analysis for language models (Hall 3).

aclanthology.org/2025.naacl-l...

April 28, 2025 at 8:07 PM

Reposted by Joe Stacey

Imperial NLP

@imperial-nlp.bsky.social

Excited to share our ICLR and NAACL papers! Please come and say hi, we're super friendly :)

April 22, 2025 at 6:42 PM

Joe Stacey

@joestacey.bsky.social

Wow, the old ITV Agatha Christie’s Poirot is brilliant. Some tv for 1989…

Gonna go binge watch the 13 seasons now 😍

April 5, 2025 at 7:59 PM

Joe Stacey

@joestacey.bsky.social

I feel like the length of the ARR author rebuttals keep growing every cycle

Is this a good thing for authors or reviewers that the responses can be so long? I feel like it’s a bit sub-optimal for both at the moment

April 4, 2025 at 8:40 AM

Reposted by Joe Stacey

Nishant Balepur

@nbalepur.bsky.social

Had a great time presenting my research on building more helpful QA systems @imperialcollegeldn.bsky.social! Thank you @joestacey.bsky.social for letting me invite myself 🫶

And loved visiting London+Edinburgh this week, hope to be back soon! 🙏

March 21, 2025 at 12:07 PM

Joe Stacey

@joestacey.bsky.social

Was fantastic to have you here at Imperial! Thanks for your excellent talk, and looking forward to following what you do next 🙂

Nishant Balepur @nbalepur.bsky.social · Mar 21

Had a great time presenting my research on building more helpful QA systems @imperialcollegeldn.bsky.social! Thank you @joestacey.bsky.social for letting me invite myself 🫶

And loved visiting London+Edinburgh this week, hope to be back soon! 🙏

March 21, 2025 at 7:12 PM

Reposted by Joe Stacey

Lisa Alazraki

@lisaalaz.bsky.social

Do LLMs need rationales for learning from mistakes? 🤔
When LLMs learn from previous incorrect answers, they typically observe corrective feedback in the form of rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance!

🧵

February 13, 2025 at 3:38 PM

Reposted by Joe Stacey

Marek Rei

@marekrei.bsky.social

Today was the launch event of the @genaihub.bsky.social. We announced the development of Nightingale AI, a foundation world model for health. It was great to be on the panel for GenAI in Healthcare, among such amazing experts.
www.genai.ac.uk

March 4, 2025 at 6:37 PM

Joe Stacey

@joestacey.bsky.social

Thanks so much to everyone who has helped make this switch to BlueSky work. Honestly, making this switch was a pretty massive achievement, so thanks everyone for contributing ❤️❤️

February 28, 2025 at 10:24 PM

Joe Stacey

@joestacey.bsky.social

This paper is really cool. They decompose NLI (and defeasible NLI) hypotheses into atoms, and then use these atoms to measure the logical consistency of LLMs.

E.g. for an entailment NLI example, each hypothesis atom should also be entailed by the premise.

Very nice idea 👏👏

February 18, 2025 at 4:14 PM

Joe Stacey

@joestacey.bsky.social

I’m a week into my trip from Cairo to Riyadh, and wow what a place Egypt is! Honestly its been one of the funnest places I’ve travelled, and for sure I need to come back again

Crossed into Aqaba (Jordan) yesterday, so now onto Saudi 🙂

February 9, 2025 at 2:08 PM

Joe Stacey

@joestacey.bsky.social

I’m going away to do a bit of travelling, going overland from Cairo to Riyadh 😍 I love travelling in the Middle East so it should be interesting

I’ve got that feeling of nervous excitement I always get before a trip 😬😁

January 29, 2025 at 10:14 AM

Joe Stacey

@joestacey.bsky.social

Insanely jealous to everyone who has papers at #NAACL in Albuquerque! Albuquerque just sounds so exotic, and is such a cool place for a conference.

No offence to Vienna, but Albuquerque sounds way more fun 😉

January 23, 2025 at 7:23 AM

Joe Stacey

@joestacey.bsky.social

Feeling gooooood after submitting my #ARR reviews early 😍 Time to enjoy the weekend! 🕺

January 17, 2025 at 5:20 PM

Joe Stacey

@joestacey.bsky.social

I was super excited to read the ModernBERT paper! Love this interest in creating a better encoder model.

"ModernBERT-base is the first encoder to beat DeBERTaV3-base since its release in 2021" 🤯- arxiv.org/pdf/2412.13663

Pretty amazing how successful DeBERTa has been!

January 14, 2025 at 2:14 PM

Joe Stacey

@joestacey.bsky.social

Excited to start my #ARR #NLP reviews!

I'll try my best and see if I can get 100% of my reviews to be 'great' this round.

If you didn't see it already, ARR publishes how many of your reviews are considered to be 'great': stats.aclrollingreview.org

Join me for the challenge :)

ARR Dashboard

stats.aclrollingreview.org

January 7, 2025 at 2:55 PM

Joe Stacey

@joestacey.bsky.social

At some point in life I realised I actually really love travelling by train. Kind of a strange hobby, but wow it is fun 😍

Here are my top ten train journeys so far.

January 2, 2025 at 11:32 AM

Joe Stacey

@joestacey.bsky.social

Imperial are hiring computing lecturers (including for AI/ML/NLP)!

Here's a little thread about why you should consider applying :)

Marek Rei @marekrei.bsky.social · Dec 24

We are hiring 6 lecturers at @imperialcollegeldn.bsky.social to work on AI, ML, graphics, vision, quantum and software engineering. This includes researchers working on LLMs, NLP, generative models and text applications. Deadline 6 Jan. @imperial-nlp.bsky.social www.imperial.ac.uk/jobs/search-...

Description

Please note that job descriptions are not exhaustive, and you may be asked to take on additional duties that align with the key responsibilities ment...

www.imperial.ac.uk

December 28, 2024 at 4:12 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news