Lightnews — Scholar-powered news

Reposted by Najoung Kim

Kanishka Misra

@kanishka.bsky.social

“All bears have a property”, “Some bears have a property”, “Bears have a property” are different in terms of how the property is generalized to a specific bear – a great example of how language constrains thought!

This holds for kids, adults, and according to our new work, (V)LMs! 🧵

Title page of our paper: "Bears, all bears, and some bears. Language Constraints on Language Models' Inductive Inferences"

January 27, 2026 at 4:16 PM

Najoung Kim

@najoung.bsky.social

happy to have finally made a pilgrimage to ucsd, a special place for some of us :)

Kanishka Misra @kanishka.bsky.social · Dec 8

@najoung.bsky.social and I did a side quest during NeurIPS and gave a joint talk at UCSD on our thoughts about the role of NNs/LLMs in Human CogSci!

Thanks @alexwarstadt.bsky.social and Rachel Dudley for hosting us!

Catch our slides here: docs.google.com/presentation...

UCSD Talk 25-12-5 - Kim and Misra (Public)

Whence Insights? Delineating Human and Machine CogSci, and hypothesis generation as a potential bridge Najoung Kim and Kanishka Misra UCSD Linguistics, 25/12/5

docs.google.com

December 8, 2025 at 8:19 PM

Najoung Kim

@najoung.bsky.social

My lab at BU is recruiting PhD students and possibly a postdoc this year!

We study humans & machines, centered around topics like meaning, generalization, evaluation methods and design, and the nature of computation and representation that underlie language and cognition.

🫴🫴

November 19, 2025 at 5:20 PM

Reposted by Najoung Kim

Najoung Kim

@najoung.bsky.social

Here are the slides: docs.google.com/presentation...

For context the intended audience is Lang Dev researchers who are attending a session on "what can LLMs tell us about human language".

If you have any thoughts I'd love to hear them!

SLD Plenary - Najoung Kim [external share]

Whence insights? The value of delineating human and machine CogSci Najoung Kim (Boston University) Society for Language Development Annual Symposium November 6, 2025 1

docs.google.com

November 18, 2025 at 9:31 PM

Najoung Kim

@najoung.bsky.social

for something to be considered creative, it has to be semantically/pragmatically felicitous!!!

Arkadiy Saakyan @asaakyan.bsky.social · Nov 4

N-gram novelty is widely used as a measure of creativity and generalization. But if LLMs produce highly n-gram novel expressions that don’t make sense or sound awkward, should they still be called creative? In a new paper, we investigate how n-gram novelty relates to creativity.

N-gram novelty is widely used to evaluate language models' ability to generate text outside of their training data. More recently, it has also been adopted as a metric for measuring textual creativity. However, theoretical work on creativity suggests that this approach may be inadequate, as it does not account for creativity's dual nature: novelty (how original the text is) and appropriateness (how sensical and pragmatic it is). We investigate the relationship between this notion of creativity and n-gram novelty through 7542 expert writer annotations (n=26) of novelty, pragmaticality, and sensicality via close reading of human and AI-generated text. We find that while n-gram novelty is positively associated with expert writer-judged creativity, ~91% of top-quartile expressions by n-gram novelty are not judged as creative, cautioning against relying on n-gram novelty alone. Furthermore, unlike human-written text, higher n-gram novelty in open-source LLMs correlates with lower pragmaticality. In an exploratory study with frontier close-source models, we additionally confirm that they are less likely to produce creative expressions than humans. Using our dataset, we test whether zero-shot, few-shot, and finetuned models are able to identify creative expressions (a positive aspect of writing) and non-pragmatic ones (a negative aspect). Overall, frontier LLMs exhibit performance much higher than random but leave room for improvement, especially struggling to identify non-pragmatic expressions. We further find that LLM-as-a-Judge novelty scores from the best-performing model were predictive of expert writer preferences.

November 10, 2025 at 1:22 PM

Najoung Kim

@najoung.bsky.social

honored to have given a plenary address at the Society for Language Development annual symposium titled "Whence insights? The value of delineating human and machine CogSci". It's a synthesis of a few years of thoughts, recently concretized with @aditya-yedetore.bsky.social and @kanishka.bsky.social

screenshot of a slide titled "Whence insights? The value of delineating human and machine CogSci"

November 9, 2025 at 1:27 PM

Najoung Kim

@najoung.bsky.social

👾 Full-time research assistant position (1 year) with @sebschu.bsky.social and me! 👾

We're looking for someone to join the research agent evaluation team, starting Fall 2025. Application link to be available soon, but feel free to send us your CV and/or come talk to us at #ACL2025. 🧵

July 25, 2025 at 5:08 PM

Reposted by Najoung Kim

Lizzie Coppock

@lizzieloo.bsky.social

I have a sabbatical coming up and I'm going to Nepal! Why Nepal? You can read about it at my first blog post about this trip.

TL;DR: linguistic diversity, writing systems (and pretty scripts), classifiers, and a school for Newar kids in Kathmandu.

sites.bu.edu/lislab/2025/...

Field trip to Nepal! | Linguistic Semantics Lab (LiSLab)

sites.bu.edu

July 24, 2025 at 10:05 PM

Najoung Kim

@najoung.bsky.social

ever since VLMs were a thing i've been interested in how the additional visual modality changes language in meaningful ways. after negative findings after negative findings, excited to report this result! proud of our junior authors for digging into this 🐸

yuluqin.bsky.social @yuluqin.bsky.social · Jul 22

Does vision training change how language is represented and used in meaningful ways?🤔The answer is a nuanced yes! Comparing VLM-LM minimal pairs, we find that while the taxonomic organization of the lexicon is similar, VLMs are better at _deploying_ this knowledge. [1/9]

July 22, 2025 at 4:01 PM

Najoung Kim

@najoung.bsky.social

green carded finally 💚💚

July 8, 2025 at 2:11 PM

Najoung Kim

@najoung.bsky.social

Seeing an experiment and thinking "but have they tried X? what if we do Y?" is a key part of research and a start to new discoveries. RExBench tests if coding agents can implement new extensions.

It complements recent evals (eg PaperBench from OpenAI
) on replication! See 👇 for details

Sebastian Schuster @sebschu.bsky.social · Jul 2

Can coding agents autonomously implement AI research extensions?

We introduce RExBench, a benchmark that tests if a coding agent can implement a novel experiment based on existing research and code.

Finding: Most agents we tested had a low success rate, but there is promise!

Screenshot of the RExBench preprint title page.

July 2, 2025 at 3:47 PM

Reposted by Najoung Kim

Koyena Pal

@koyena.bsky.social

🚨 Registration is live! 🚨

The New England Mechanistic Interpretability (NEMI) Workshop is happening Aug 22nd 2025 at Northeastern University!

A chance for the mech interp community to nerd out on how models really work 🧠🤖

🌐 Info: nemiconf.github.io/summer25/
📝 Register: forms.gle/v4kJCweE3UUH...

June 30, 2025 at 10:55 PM

Najoung Kim

@najoung.bsky.social

i'll be in copenhagen for a few days, lmk if you want to get coffee! will be around most of Thurs and early Sat. alternatively you can also come see me give a talk (at a museum apparently) on Fri:

cphnlp.github.io

Copenhagen NLP Symposium 2025

symposium website

cphnlp.github.io

June 18, 2025 at 4:26 PM

Reposted by Najoung Kim

Anna Rogers

@annarogers.bsky.social

📢 The Copenhagen NLP Symposium on June 20th!

- Invited talks by @loubnabnl.hf.co (HF) @mziizm.bsky.social (Cohere) @najoung.bsky.social (BU) @kylelo.bsky.social (AI2) Yohei Oseki (UTokyo)
- Exciting posters by other participants

Register to attend and/or present your poster at cphnlp.github.io /1

Copenhagen NLP Symposium 2025

symposium website

cphnlp.github.io

May 26, 2025 at 1:08 PM

Reposted by Najoung Kim

Matt Goldrick

@mattgoldrick.bsky.social

Very sorry to learn of the passing of LouAnn Gerken, who had such an impact on our understanding of the acquisition of speech sounds and sound patterns obits.mlive.com/us/obituarie...

LouAnn Gerken Obituary (1959 - 2025) - Tucson, AZ - Arizona Daily Star

View LouAnn A. Gerken's obituary, send flowers and sign the guestbook.

obits.mlive.com

May 29, 2025 at 3:55 PM

Najoung Kim

@najoung.bsky.social

hello NAACL friends I'm giving a keynote today at RepL4NLP at 1:30PM local time, come say hi! I'll mostly be musing about things with light research discussions

Screenshot of a slide that says "what does it take to convince ourselves that a system is exhibiting compositionality?" with a side comment "mostly AI, but humans too!!" for the word system

May 4, 2025 at 1:48 PM

Najoung Kim

@najoung.bsky.social

in liminal state, as correctly described by a colleague

March 29, 2025 at 1:55 AM

Najoung Kim

@najoung.bsky.social

so very excited that naomi is joining!!! a huge win for cds 💖

Naomi Saphra @nsaphra.bsky.social · Mar 27

Life update: I'm starting as faculty at Boston University
@bucds.bsky.social in 2026! BU has SCHEMES for LM interpretability & analysis, I couldn't be more pumped to join a burgeoning supergroup w/ @najoung.bsky.social @amuuueller.bsky.social. Looking for my first students, so apply and reach out!

CDS building which looks like a jenga tower

March 27, 2025 at 11:15 AM

Najoung Kim

@najoung.bsky.social

"Gaming Linguists" good bigram

Korean to English Gaming Linguists Urgently Required

February 4, 2025 at 7:42 PM

Najoung Kim

@najoung.bsky.social

Repost appreciated! 🙏

ACL 2025 Ling theory & Cognitive modeling track is looking for emergency reviewers. The emergency review period is between 3/18-26, and these reviewers will be excluded from the ARR cycle. If you're interested, please sign up here! docs.google.com/forms/d/1fH7...

ACL 2025 Ling theory & Cognitive modeling track emergency reviewer volunteer form

The Linguistic Theories, Cognitive Modeling, and Psycholinguistics track at ACL 2025 is looking for emergency reviewers. The emergency reviews will take place between 18th to 26th of March, 2025. Thes...

docs.google.com

December 18, 2024 at 3:37 PM

Najoung Kim

@najoung.bsky.social

have many tasks but immobilized by car

December 1, 2024 at 5:41 PM

Reposted by Najoung Kim

Ben Lipkin

@benlipkin.bsky.social

Lots of folks talking about scaling LLM inference over this last year

Internally, I’ve been developing and using a library that makes this extremely easy, and I decided to open-source it
Meet the decoding library: github.com/benlipkin/de...

1/7