Joe Stacey
joestacey.bsky.social
Joe Stacey
@joestacey.bsky.social
NLP PhD student at Imperial College London and Apple AI/ML Scholar.
Pinned
We have a fun new #NLProc paper on arXiv about improving the robustness of fine-tuned NLI models!

Have a look :)
arxiv.org/abs/2505.20209
Reposted by Joe Stacey
We have released #AgentCoMa, an agentic reasoning benchmark where each task requires a mix of commonsense and math to be solved 🧐

LLM agents performing real-world tasks should be able to combine these different types of reasoning, but are they fit for the job? 🤔

🧵⬇️
August 28, 2025 at 2:01 PM
Here’s my review of the US after a few days here. Did I miss anything? 🤔

The good:

- Americans are the most charming, friendly and hospitable people
- it’s super fun how the country is split into states that all have different laws and stuff, with different vibes state to state
July 17, 2025 at 11:24 AM
Any chance Keir Starmer can reshuffle himself in as foreign secretary, and shuffle in another prime minister who actually has some vague idea about what they want to achieve? 🙏🤦‍♂️
July 2, 2025 at 5:34 PM
Finally the heatwave has ended, and the UK is once again a bearable place to be 😍😍

If you have any UK-based collaborations, their productivity is about to increase like 10 fold
July 2, 2025 at 11:52 AM
We have a fun new #NLProc paper on arXiv about improving the robustness of fine-tuned NLI models!

Have a look :)
arxiv.org/abs/2505.20209
May 27, 2025 at 3:50 PM
Should I use an LLM to help refine my paper writing for the ARR deadline? 🤔🤔

It will improve the paper for sure, but probably also making the tone a whole lot more annoying
May 18, 2025 at 9:05 AM
Reposted by Joe Stacey
If you're at #NAACL2025 and want to hear about similarity effects for property inheritance in LMs, please stop by!

I will be presenting this work on Wednesday at the 11-12:30 poster session on Interpretability & analysis for language models (Hall 3).

aclanthology.org/2025.naacl-l...
April 28, 2025 at 8:07 PM
Reposted by Joe Stacey
Excited to share our ICLR and NAACL papers! Please come and say hi, we're super friendly :)
April 22, 2025 at 6:42 PM
Wow, the old ITV Agatha Christie’s Poirot is brilliant. Some tv for 1989…

Gonna go binge watch the 13 seasons now 😍
April 5, 2025 at 7:59 PM
I feel like the length of the ARR author rebuttals keep growing every cycle

Is this a good thing for authors or reviewers that the responses can be so long? I feel like it’s a bit sub-optimal for both at the moment
April 4, 2025 at 8:40 AM
Reposted by Joe Stacey
Had a great time presenting my research on building more helpful QA systems @imperialcollegeldn.bsky.social! Thank you @joestacey.bsky.social for letting me invite myself 🫶

And loved visiting London+Edinburgh this week, hope to be back soon! 🙏
March 21, 2025 at 12:07 PM
Was fantastic to have you here at Imperial! Thanks for your excellent talk, and looking forward to following what you do next 🙂
Had a great time presenting my research on building more helpful QA systems @imperialcollegeldn.bsky.social! Thank you @joestacey.bsky.social for letting me invite myself 🫶

And loved visiting London+Edinburgh this week, hope to be back soon! 🙏
March 21, 2025 at 7:12 PM
Reposted by Joe Stacey
Do LLMs need rationales for learning from mistakes? 🤔
When LLMs learn from previous incorrect answers, they typically observe corrective feedback in the form of rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance!

🧵
February 13, 2025 at 3:38 PM
Reposted by Joe Stacey
Today was the launch event of the @genaihub.bsky.social. We announced the development of Nightingale AI, a foundation world model for health. It was great to be on the panel for GenAI in Healthcare, among such amazing experts.
www.genai.ac.uk
March 4, 2025 at 6:37 PM
Thanks so much to everyone who has helped make this switch to BlueSky work. Honestly, making this switch was a pretty massive achievement, so thanks everyone for contributing ❤️❤️
February 28, 2025 at 10:24 PM
This paper is really cool. They decompose NLI (and defeasible NLI) hypotheses into atoms, and then use these atoms to measure the logical consistency of LLMs.

E.g. for an entailment NLI example, each hypothesis atom should also be entailed by the premise.

Very nice idea 👏👏
February 18, 2025 at 4:14 PM
I’m a week into my trip from Cairo to Riyadh, and wow what a place Egypt is! Honestly its been one of the funnest places I’ve travelled, and for sure I need to come back again

Crossed into Aqaba (Jordan) yesterday, so now onto Saudi 🙂
February 9, 2025 at 2:08 PM
I’m going away to do a bit of travelling, going overland from Cairo to Riyadh 😍 I love travelling in the Middle East so it should be interesting

I’ve got that feeling of nervous excitement I always get before a trip 😬😁
January 29, 2025 at 10:14 AM
Insanely jealous to everyone who has papers at #NAACL in Albuquerque! Albuquerque just sounds so exotic, and is such a cool place for a conference.

No offence to Vienna, but Albuquerque sounds way more fun 😉
January 23, 2025 at 7:23 AM
Feeling gooooood after submitting my #ARR reviews early 😍 Time to enjoy the weekend! 🕺
January 17, 2025 at 5:20 PM
I was super excited to read the ModernBERT paper! Love this interest in creating a better encoder model.

"ModernBERT-base is the first encoder to beat DeBERTaV3-base since its release in 2021" 🤯- arxiv.org/pdf/2412.13663

Pretty amazing how successful DeBERTa has been!
January 14, 2025 at 2:14 PM
Excited to start my #ARR #NLP reviews!

I'll try my best and see if I can get 100% of my reviews to be 'great' this round.

If you didn't see it already, ARR publishes how many of your reviews are considered to be 'great': stats.aclrollingreview.org

Join me for the challenge :)
ARR Dashboard
stats.aclrollingreview.org
January 7, 2025 at 2:55 PM
At some point in life I realised I actually really love travelling by train. Kind of a strange hobby, but wow it is fun 😍

Here are my top ten train journeys so far.
January 2, 2025 at 11:32 AM
Imperial are hiring computing lecturers (including for AI/ML/NLP)!

Here's a little thread about why you should consider applying :)
We are hiring 6 lecturers at @imperialcollegeldn.bsky.social to work on AI, ML, graphics, vision, quantum and software engineering. This includes researchers working on LLMs, NLP, generative models and text applications. Deadline 6 Jan. @imperial-nlp.bsky.social www.imperial.ac.uk/jobs/search-...
Description
Please note that job descriptions are not exhaustive, and you may be asked to take on additional duties that align with the key responsibilities ment...
www.imperial.ac.uk
December 28, 2024 at 4:12 PM