Lightnews — Scholar-powered news

Reposted by Bastian Bunzeck

Samuel

@samuel.fm

google: we have invented agi but have hidden it in such an obscure website no one will ever find it
anthropic: through our interpretability research, we discovered claude imagines himself wearing a bow tie at all times
openai: we added slot machines

December 29, 2025 at 7:49 PM

Reposted by Bastian Bunzeck

Deb Raji

@rajiinio.bsky.social

A fascinating recent development is that the ML research community -- as the earliest adopters of "AI for research" -- are at the frontlines of dealing with all the problems that come with that (ie. reduced trust in results & reviewers, increased submission load etc).

Every other field is next! 😭

Thomas Dietterich @tdietterich.bsky.social · Sep 14

We need new rules for publishing AI-generated research. The teams developing automated AI scientists have customarily submitted their papers to standard refereed venues (journals and conferences) and to arXiv. Often, acceptance has been treated as the dependent variable. 1/

December 27, 2025 at 7:46 PM

Bastian Bunzeck

@bbunzeck.bsky.social

Look what Santa has slipped unter my virtual Christmas tree🎄🤩

languagemit.bsky.social @languagemit.bsky.social · 6d

New book! I have written a book, called Syntax: A cognitive approach, published by MIT Press.

This is open access; MIT Press will post a link soon, but until then, the book is available on my website:
tedlab.mit.edu/tedlab_websi...

tedlab.mit.edu

December 24, 2025 at 10:35 PM

Reposted by Bastian Bunzeck

Raoul Schubert

@raoulschubert.bsky.social

Very happy to announce that my alma mater Bielefeld University (Germany) now offers an international linguistics master's program, 100% taught in English!

Here's a link with more information: linguistlist.org/issues/36/38...

LINGUIST List 36.3881 FYI: International Master’s Program Linguistics, Bielefeld University

The LINGUIST List, International Linguistics Community Online.

linguistlist.org

December 20, 2025 at 6:08 AM

Reposted by Bastian Bunzeck

Negar Foroutan

@negarforoutan.bsky.social

1/ 🌍 How does mixing data from hundreds of languages affect LLM training?
In our new paper "Revisiting Multilingual Data Mixtures in Language Model Pretraining" we revisit core assumptions about multilinguality using 1.1B-3B models trained on up to 400 languages.
🧵👇

December 15, 2025 at 6:18 PM

Reposted by Bastian Bunzeck

Per Engzell

@pengzell.bsky.social

All research is exploratory if you’re confused enough

December 19, 2025 at 8:01 AM

Reposted by Bastian Bunzeck

ELLIS

@ellis.eu

🏹 Job alert: Two fully funded PhD positions in Natural Language Processing at University of Leipzig

📍 Leipzig 🇩🇪
📅 Apply by Jan 15th
🔗 https://ellis.eu/research/jobs/2025-12-16-two-fully-funded-phd-positions-in-natural-language-processing

Two fully funded PhD positions in Natural Language Processing at University of Leipzig | European Laboratory for Learning and Intelligent Systems

ellis.eu

December 18, 2025 at 7:05 AM

Reposted by Bastian Bunzeck

Ai2

@ai2.bsky.social

Introducing Bolmo, a new family of byte-level language models built by "byteifying" our open Olmo 3—and to our knowledge, the first fully open byte-level LM to match or surpass SOTA subword models across a wide range of tasks. 🧵

December 15, 2025 at 5:19 PM

Reposted by Bastian Bunzeck

Scott Ashworth

@soashworth.bsky.social

And they want to take TikTok away from kids.

studying for my ML final and realizing that i'm actually the model, finals are unseen data, straight hw memorization is overfitting, and understanding the theory is
generalization

December 14, 2025 at 2:25 AM

Reposted by Bastian Bunzeck

Leonie Weissweiler

@weissweiler.bsky.social

🧑‍🔬I’m recruiting PhD students in Natural Language Processing @unileipzig.bsky.social Computer Science, together with @scadsai.bsky.social!

Topics include, but aren’t limited to:

🔎Linguistic Interpretability
🌍Multilingual Evaluation
📖Computational Typology

Please share!

#NLProc #NLP

December 11, 2025 at 1:36 PM

Reposted by Bastian Bunzeck

Leonie Weissweiler

@weissweiler.bsky.social

🥳Life Update!

I’m thrilled to share that I’ll be starting as assistant professor for Natural Language Processing @unileipzig.bsky.social in April! I’m deeply grateful to everyone who supported me on this journey.

I will be recruiting PhD students with @scadsai.bsky.social, stay tuned for details!

December 10, 2025 at 1:10 PM

Reposted by Bastian Bunzeck

naitian

@naitian.org

A couple years (!) in the making: we’re releasing a new corpus of embodied, collaborative problem solving dialogues. We paid 36 people to play Portal 2’s co-op mode and collected their speech + game recordings.

Paper: arxiv.org/abs/2512.03381
Website: berkeley-nlp.github.io/portal-dialo...

1/n

A figure demonstrating the different aspects of the corpus described in the tweet. There is a main isomorphic 3D view of a level in the Portal 2 co-op game, with some portals, lasers, and the blue and orange players. Inset, there are first-person captures of the blue and orange player views. There is also a box containing the transcribed dialogue with timestamps and labels for the discursive acts. Finally, there is a box containing a task and a list of subtasks. Some subtasks are already crossed out, with the time that they have been completed. The last subtask ("Player 2 places portal 4 on wall 4") is marked incomplete.

The dialogue is as follows:

Blue: Can you put your other portal up here? (tagged as directive)
Orange: Where? (tagged as request for clarification)
Blue: On uh, on this wall. (tagged as directive)
Blue: So that it uh points at the circle. (tagged as directive)
Orange: Okay. (tagged as commit)

The full list of subtasks is:

Task: Redirect lasers
Subtask: Player 1 places portal 1 on wall 1. (completed)
Subtask: Player 1 polaces portal 2 on wall 2 or 3. (completed)
Subtask: Player 2 places portal 3 opposite of portal 2. (completed)
Subtask: Player 2 places portal 4 on wall 4. (incomplete)

December 5, 2025 at 6:54 PM

Reposted by Bastian Bunzeck

Cambridge University Press - Linguistics

@cambup-linguistics.cambridge.org

New Cambridge Element, Creative Construction Grammar, by Thomas Hoffmann and Mark Turner, out now! Read Open Access at
https://cup.org/4pkd1np
#languageandlinguistics #LangSky

December 6, 2025 at 9:00 AM

Reposted by Bastian Bunzeck

T Hoffmann

@linguistur.bsky.social

Finally out & open access: Hoffmann & Turner on Creative Construction Grammar. How do we communicate complex meanings? How do we combine words into sentences?

#creativity #language #linguistics #Construction Grammar

Find out at

www.cambridge.org/core/element...

Creative Construction Grammar

Cambridge Core - Cognitive Linguistics - Creative Construction Grammar

www.cambridge.org

December 5, 2025 at 9:26 AM

Reposted by Bastian Bunzeck

Barthe Bloom

@barthebloom.bsky.social

Check out this exiting issue with papers from various flavors of Construction Grammar describing English constructions! www.degruyterbrill.com/journal/key/...

Special Issue: Describing English Constructions; Issue Editors: Barthe Bloom and Thomas Herbst

Volume 73, issue 3 of the journal Zeitschrift für Anglistik und Amerikanistik was published in 2025.

www.degruyterbrill.com

December 4, 2025 at 7:39 AM

Reposted by Bastian Bunzeck

Sung Kim

@sungkim.bsky.social

Transformers v5 Release Candidate

"This is the first major release in five years where 800 commits have been pushed to main since the latest minor release. This release introduces several refactors that significantly simplify our APIs and internals, and comes with a large number of bug fixes."

December 2, 2025 at 12:06 AM

Reposted by Bastian Bunzeck

Francesca Padovani

@frap98.bsky.social

Last week I had the pleasure of hosting a fantastic friend and researcher, @mdhk.net , who came to visit us in Groningen for a couple of days from Amsterdam! 🎉

December 1, 2025 at 3:33 PM

Reposted by Bastian Bunzeck

Institut für Finnougristik, LMU München

@finnougristiklmu.bsky.social

We are hiring a doctoral candidate to work in our project that will create a diachronic corpus of Northern Mansi. It is a 65% position (E13 TV-L) for 2 years 9 months starting in spring 2026. Please spread the word! www.finnougristik.uni-muenchen.de/forschung/fo...

PhD position in a project on Northern Mansi - Lehrstuhl für Finnougristik - LMU München

www.finnougristik.uni-muenchen.de

December 1, 2025 at 9:02 AM

Reposted by Bastian Bunzeck

Juan Diego Rodriguez

@juand-r.bsky.social

I’m excited to present SimpleStories at EurIPS!

Also if anyone at #EurIPS is interested in chatting about LLM data efficiency, interpretability, model inconsistency or other topics feel free to DM me.

Dataset and models: lnkd.in/e_VGWqhP
Code: lnkd.in/eEidmv74
Paper: lnkd.in/eH6jS9uY

December 1, 2025 at 3:41 AM

Reposted by Bastian Bunzeck

Judith Tonhauser

@judithtonhauser.bsky.social

Postdoc position in Stuttgart, Germany (TV-L 13, 100%) for 18 months, on authority presuppositions in AI systems with Dr. Agnieszka Faleńska and me. For more information and application info, see here: safety.https://www.ims.uni-stuttgart.de/documents/team/falensaa/aphic_postoc.pdf

www.ims.uni-stuttgart.de

November 24, 2025 at 8:02 AM

Reposted by Bastian Bunzeck

Michael Pleyer

@symbolicstorage.bsky.social

🚨NEW PUBLICATION ALERT!🚨
The 'Design Features' of Language Revisited (w/ @mperlman.bsky.social @glupyan.bsky.social Koen de Reus & @limorraviv.bsky.social)
Feature Review out now in #OpenAccess in @cp-trendscognsci.bsky.social! #language #linguistics
Paper: doi.org/10.1016/j.ti...

November 25, 2025 at 7:49 PM

Reposted by Bastian Bunzeck

Nivi Mani

@nivimani.bsky.social

We are advertising **11 new PhD positions** in the second cohort of our RTG on Curiosity (details on all 11 positions here: www.uni-goettingen.de/de/open+posi...). One of these positions is in my group looking at the role of curiosity in early word learning (www.uni-goettingen.de/en/644546.ht...)

Open Positions - Georg-August-Universität Göttingen

Webseiten der Georg-August-Universität Göttingen

www.uni-goettingen.de

November 25, 2025 at 1:32 PM

Reposted by Bastian Bunzeck

Paul Vicol

@paulvicol.bsky.social

🚀 Introducing TMLR Beyond PDF!

🎬 This is a new, HTML-based submission format for TMLR, that supports interactive figures and videos, along with the usual LaTeX and images.

🎉 Thanks to TMLR Editors in Chief: Hugo Larochelle, @gautamkamath.com, Naila Murray, Nihar B. Shah, and Laurent Charlin!

November 25, 2025 at 4:12 PM

Reposted by Bastian Bunzeck

Elen Le Foll 🇫🇷 🇬🇧 🇩🇪

@elenlefoll.fediscience.org.ap.brid.gy

We began day 2 of our Large Language Models (LLM) for linguistics research workshop @UniKoeln with a fascinating keynote by Charlotte Pouw on "Interpreting models for speech generation and understanding using methods from #psycholinguistics". Charlotte shared […]

[Original post on fediscience.org]

Charlotte presenting a slide with plots entitled The Role of Data Exposure

November 25, 2025 at 8:54 AM

Reposted by Bastian Bunzeck

Petra Wagner

@petrasusannewagner.bsky.social

Tomorrow, we will show within the science festival #geniale how AI voice modification can help explaining the subtle differences between voice qualities, expressing personality, age, mood, gender, health, and much more! #trr318, #bielefeld #tts #xAI wissenswerkstadt.de/veranstaltun...

Sag was! | Wissenswerkstadt Bielefeld

Mit Hilfe von KI Stimmeigenschaften erklärbarer machen

wissenswerkstadt.de

November 21, 2025 at 1:15 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news