ehudreiter.bsky.social
Aberdeen CS is hiring a new lecturer for its "Joint Institute" with South China Normal University. Basically you would be based and do research in Aberdeen, but would be expected to go to China a few times a year and teach at SCNU.

Closing 28 Nov

www.abdn.ac.uk/jobs/vacanci...
Lecturer in Computing Science, Natural & Computing Sciences (NCS253A) | The University of Aberdeen
University of Aberdeen Research Jobs
www.abdn.ac.uk
November 12, 2025 at 9:14 AM
I'm disturbed by reports about chatbots encouraging children to kill themselves, such as www.bbc.co.uk/news/article... . Shame that the AI Safety community in general, and the @AISecurityInst in particular, seem to have little interest in this; very disappointing...
Mothers say AI chatbots encouraged their sons to kill themselves
In her first UK interview Megan Garcia speaks to Laura Kuenssberg about the death of her teenage son.
www.bbc.co.uk
November 10, 2025 at 8:51 AM
New blog: Understanding what users want from NLG

When building an NLG system, it really helps to understand what users want; this came up several times at the recent INLG conference. I discuss some of our work in this space, and give a few suggestions.

ehudreiter.com/2025/11/06/u...
Understanding what users want from NLG
ehudreiter.com
November 6, 2025 at 7:26 AM
I'm trying to understand OpenAI's HealthBench. "HealthBench: Evaluating Large Language Models Towards Improved Human Health" doesn't say much about the benchmark (eg, very few examples). Are there other papers? I don't care how well model X performs; I want to judge if I can trust the benchmark
November 5, 2025 at 2:27 PM
Just back from INLG. Nice event as always, but I am concerned that it is losing its uniqueness. Maybe for 2026 I'll suggest some special tracks which are interesting to INLG community but not ARR types (eg, user requirements/eval, non-LLM techniques).
November 5, 2025 at 9:15 AM
New blog: Most common uses of AI in Healthcare

Data on usage of AI in healthcare suggests that most common uses in 2025 are probably (A) giving personalised health information to patients and (B) helping clinicians write documents.

ehudreiter.com/2025/10/21/m...
Most common uses of AI in Healthcare
I review some data on usage of AI in healthcare, and conclude that the most common uses in 2025 are probably (A) giving personalised health information to patients and (B) helping clinicians write …
ehudreiter.com
October 21, 2025 at 6:21 AM
One of my main goals for 2025-26 is to help my 6 senior PhD students submit their PhDs before I retire. Glad to say that Nicolay Babakov has now done so, with viva scheduled for Dec. Other five students seem to be on track, which is encouraging.
October 15, 2025 at 9:13 AM
Somewhat frustrated yesterday to once again read ACL paper which did all sorts of complex things (including the usual results tables showing best approach) on garbage data. With minimal ack of this in limitations. Most fundamental rule of CS is Garbage In, Garbage Out
October 9, 2025 at 8:46 AM
New blog: Good diagrams for research papers

I've seen a number of diagrams recently which are too complicated and difficult to understand. I explain some of the problems I see and give advice.

ehudreiter.com/2025/10/08/g...
Good diagrams for research papers
ehudreiter.com
October 8, 2025 at 8:27 AM
Really interesting paper on real-world evaluation in IR. I should learn more about eval in IR, it's not something I've ever properly looked at
dl.acm.org/doi/10.1145/...
What Matters in a Measure? A Perspective from Large-Scale Search Evaluation | Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
dl.acm.org
September 30, 2025 at 8:27 AM
Several people have asked me recently if I will still be able to contribute to research projects after I retire in summer 2026. Absolutely! I will have emeritus status, and am very happy to remain involved in research projects at Aberdeen and elsewhere.
September 26, 2025 at 10:21 AM
Aberdeen CS is hiring! We are especially interested in hiring new faculty in NLP. Closing date is 8 Oct. For more info, see below (or contact me)

www.abdn.ac.uk/jobs/vacanci...
Lecturer in Computing Science, Natural & Computing Sciences (NCS249A) | The University of Aberdeen
Browse and apply for current job openings at the University of Aberdeen across various schools, departments and roles, including admin and academic.
www.abdn.ac.uk
September 24, 2025 at 8:56 AM
New blog: Reflections on blogging

I am often asked about my experience blogging, sometimes by people who are considering writing their own blog. In this “meta” blog, I summarise my thoughts and experiences about my blog.

ehudreiter.com/2025/09/23/r...
Reflections on blogging
ehudreiter.com
September 23, 2025 at 7:53 AM
Aberdeen CS will probably be looking for a new lecturer in NLP. Formal advert is not out yet, but feel free to contact me informally if interested.
September 18, 2025 at 9:06 AM
Reposted
The registration page for #INLG2025 is now live! Join us in Vietnam on Oct 29 - Nov 2 for the best conference on #NaturalLanguageGeneration

2025.inlgmeeting.org/registration...

Curious to see what will be presented? Check out this list of accepted papers! 2025.inlgmeeting.org/accepted-pap...
September 16, 2025 at 12:15 PM
New blog: Defining hallucination is not straightforward

Many researchers assume that hallucination is a binary feature; either something is a hallucination or it is not. This is too simplistic. I describe some of the issues I have seen below.

ehudreiter.com/2025/09/10/d...
Defining hallucination is not straightforward
Most academic work assumes that hallucination is a binary feature: either something is a hallucination or it is not a hallucination. But this is too simplistic. In real-world contexts we see many s…
ehudreiter.com
September 11, 2025 at 6:58 AM
At ACL, I engaged with 50 papers (went to oral, talked to poster person). Decided (sometimes after looking at the paper) that 3 of these were robust, interesting, and relevant to me; 2 of these 3 won awards. Hmm, maybe in future I should focus on 40 award papers, ignore the other 3000?
September 4, 2025 at 8:44 AM
Excited by recent positive evaluations of NLG apps developed by my students to encourage safer driving in UK and Nigeria. We see stat sig reductions in unsafe driving incidents in both countries.

ehudreiter.com/2025/09/03/e...
Encouraging safer driving with NLG apps
I am very excited by recent positive evaluations of NLG apps developed by my students to encourage safer driving in UK and Nigeria. We see statistically significant reductions in unsafe driving inc…
ehudreiter.com
September 3, 2025 at 5:46 AM
Last week I had to deal with two cases of papers containing hallucinated references. This is not acceptable! Shows complete disdain for understanding prev work, and suggests rest of paper may be fabricated.

Ok to use LLM to suggest related work, but read (or at least skim) them!
September 1, 2025 at 7:58 AM
Watched recording of ACL panel on generalisability (recommended to me). I share concerns about "LLM popcorn", but my biggest concern about NLP is lack of research diversity. Everyone does LLM, few people do impact or qual eval, little interest in genuine collab with other fields
August 22, 2025 at 8:23 AM
New blog: I hate pay-to-publish

The academic world has changed since I got my PhD in 1990. One of the worst changes is that researchers now often pay thousands of pounds to publish their work. Unfair to researchers with limited funding, and bad for science.

ehudreiter.com/2025/08/19/i...
I hate pay-to-publish
The academic world has changed in many ways since I got my PhD in 1990. One of the worst changes is that researchers in 2025 usually need to pay thousands of pounds to publish their work. This is u…
ehudreiter.com
August 19, 2025 at 8:32 AM
Very interesting meta-analysis of human-AI collab. Shows more effective in content creation (eg report writing) than in decision making, which does not surprise me

When combinations of humans and AI are useful: A systematic review and meta-analysis

www.nature.com/articles/s41...
When combinations of humans and AI are useful: A systematic review and meta-analysis - Nature Human Behaviour
Vaccaro et al. present a systematic review and meta-analysis of the performance of human–AI combinations, finding that on average, human–AI combinations performed significantly worse than the best of ...
www.nature.com
August 13, 2025 at 9:07 AM
Reposted
Excited to announce the first-ever Workshop for Young Researchers in Natural Language Generation (YNLG), supported by @siggen.bsky.social, taking place on October 29, 2025 in Hanoi, Vietnam, co-located with INLG 2025.
Call for Submissions is out now!

ynlg-workshop.github.io
August 12, 2025 at 7:05 AM
New blog: More on evaluating impact

I got great feedback from recent paper and talk on eval impact, and summarise some of the suggested papers (including more examples of impact eval) and insightful comments (eg, about eval “ecosystem”) I received.

ehudreiter.com/2025/08/05/m...
More on evaluating impact
I recently published a paper and gave a talk about evaluating real-world impact. I got some great feedback from this, and summarise some of the suggested papers (including more examples of impact e…
ehudreiter.com
August 5, 2025 at 6:40 AM
I'll be at ACL next week (Tue-Thur, not Sun/Mon). Look forward to meeting old friends and new people who want to connect! I'll also be giving an invited talk on impact evaluation at the GEM workshop on Thur 31 July
July 25, 2025 at 2:18 PM