naitian
@naitian.org
NLP / CSS PhD at Berkeley I School. I develop computational methods to study culture as a social language.
Reposted by naitian
We were excited to host @naitian.org at today’s lab seminar for a talk on variation, semiotics, fashion, and style. A refreshing perspective at the intersection of sociolinguistics and NLP!

#NLProc
February 13, 2026 at 4:55 PM
ew why is claude doing that
February 5, 2026 at 9:21 PM
Reposted by naitian
Best gas masks
“How did these people go out and get gas masks?” AG Bondi asked.
www.theverge.com
February 3, 2026 at 2:55 PM
(And prob need to refine those thoughts some more anyway)
January 31, 2026 at 12:58 AM
And I’d go a step further and say that I believe “position papers” are a useful / unique avenue for interdisciplinary work to get picked up (and that, e.g., HCI papers that aren’t position papers are maybe definitionally more disciplinary), but I don’t really wanna get into the weeds of that on bsky
January 31, 2026 at 12:58 AM
Ya I was being kinda flippant in the original post — but I think the existence claim is true, that there are interdisciplinary theoretical papers that are branded as position papers.
bsky.app/profile/nait...
(Ok maybe not “just”, but certainly that is one of its uses, and lines up w/ the distribution of subfields here) (I can’t help but hedge on the spooky scary internet)
January 31, 2026 at 12:58 AM
If by position paper you mean papers that are “just, like, your opinion, man”, I think that’s not what most (self-declared) position papers are. If you mean papers that are extending a theoretical argument without any math, that’s kind of my point.
January 30, 2026 at 11:55 PM
(Ok maybe not “just”, but certainly that is one of its uses, and lines up w/ the distribution of subfields here) (I can’t help but hedge on the spooky scary internet)
January 29, 2026 at 4:57 PM
“Position paper” is just a label to make some kinds of interdisciplinary theoretical work fit into the CS publishing schema.
In addition, when considering different subfields, we find striking differences in how we classify 'position' papers, leading to huge differences in how this policy affects different CS subfields.
January 29, 2026 at 4:42 PM
Reposted by naitian
Gary Larson: In my cartoon I invented Cow Tools as a cautionary tale

Cows: At long last, we have created the Cow Tools from classic newspaper comic Cow Tools
January 19, 2026 at 5:04 PM
I just got back from Michigan and—
January 7, 2026 at 6:36 PM
Showing this to everyone that I meet at a conference that thinks I’m an extrovert
A farewell card for #chr2025 people, safe travels everyone✨
December 13, 2025 at 6:08 PM
Reposted by naitian
Excited to get this work out in the world at #chr2025 (with Sabrina Baur, Mackenzie Cramer, Anna Ho and Tom McEnaney) -- asking: how much do contemporary songs tell stories, and how has that changed over the past half century?

anthology.ach.org/volumes/vol0...
Measuring the Stories in Contemporary Songs
anthology.ach.org
December 12, 2025 at 1:09 PM
Reposted by naitian
Keeping this at hand in case I need to point to it and tap
December 12, 2025 at 4:39 PM
Reposted by naitian
New grant program announcement: You've heard me talk this morning about those 23 new DH awards from @schmidtsciences.bsky.social HAVI program? Well, we just announced our new RFP for 2026! Teams can be global too! Please consider applying. RFP here: www.schmidtsciences.org/opportunity/...
2026 Humanities and Artificial Intelligence Virtual Institute (HAVI) RFP - Schmidt Sciences
Overview: Schmidt Sciences is requesting proposals to the Humanities and AI Virtual Institute (HAVI), aimed at fostering research in the digital humanities with a particular focus ...
www.schmidtsciences.org
December 11, 2025 at 6:49 PM
Reposted by naitian
Lavinia Dunagan and @dallascard.bsky.social find implicit references to bible verses using a combination of neural embeddings and text similarity—neither is enough on its own #CHR2025
December 11, 2025 at 2:54 PM
Reposted by naitian
Awesome, the Edinburgh HCRC map task corpus 30 years later without the insane "different maps" complication.

Of course, not really fully embodied, but at least spatial, and super well controlled. #AI

Here's the older one: groups.inf.ed.ac.uk/maptask/
via @zaqdelinguist.bsky.social
December 6, 2025 at 8:38 AM
Reposted by naitian
This looks really interesting. Seems primarily targeted at cog-sci and/or linguistics research, with Portal 2 just the example problem-solving domain, but probably also of interest to game-studies and game-AI people.
A couple years (!) in the making: we’re releasing a new corpus of embodied, collaborative problem solving dialogues. We paid 36 people to play Portal 2’s co-op mode and collected their speech + game recordings.

Paper: arxiv.org/abs/2512.03381
Website: berkeley-nlp.github.io/portal-dialo...

1/n
December 6, 2025 at 4:04 AM
some of them are football games and some of them are… football games, but different
December 5, 2025 at 8:36 PM
Yes! All credit to @teaywright.bsky.social!
December 5, 2025 at 8:35 PM
We hope this corpus will be useful for linguists, cognitive scientists, or anyone else who wants to study language use in complex + goal-oriented environments! You can explore our data here:

Data explorer: berkeley-nlp.github.io/portal-dialo...
YouTube: www.youtube.com/channel/UCQw...

4/4
Portal Dialogue Corpus - Data Explorer
berkeley-nlp.github.io
December 5, 2025 at 6:54 PM
We also provide dialogue act annotations and timestamps of when players completed subtasks for each level. We can use these annotations to show that, e.g., directives are more common before the completion of hard tasks, while confirmations are more common after. 3/n
December 5, 2025 at 6:54 PM
Many linguistic phenomena like spatial reference or convention formation are underrepresented in existing datasets, because they’re not embodied in complex or goal-oriented environments. In this clip, players combine both language and movement to successfully establish left vs right. 2/n
December 5, 2025 at 6:54 PM
A couple years (!) in the making: we’re releasing a new corpus of embodied, collaborative problem solving dialogues. We paid 36 people to play Portal 2’s co-op mode and collected their speech + game recordings.

Paper: arxiv.org/abs/2512.03381
Website: berkeley-nlp.github.io/portal-dialo...

1/n
December 5, 2025 at 6:54 PM
Oh how I yearn for happier days
Liverpool 2 - 0 Madrid
Lions 23 - 20 Bears
Michigan 13 - 10 Ohio State
Liverpool 2 - 0 Man City

What a week I'm floating
November 29, 2025 at 9:01 PM