naitian
@naitian.org
NLP / CSS PhD at Berkeley I School. I develop computational methods to study culture as a social language.
September 20, 2025 at 4:29 PM
Speaking of #ic2s2, @amirgo.bsky.social's keynote talks about how cultural differences are not necessarily oppositional, and how we should consider other kinds of relations, an idea that's also in sociocultural linguistics, and which we mention in the paper!
July 23, 2025 at 3:23 PM
I'm thrilled to be doing an oral presentation on "Culture is not Trivia" at #ACL2025 next Wednesday 7/30, as well as participating in the human-centered NLP panel afterwards!

(thanks also @lauraknelson.bsky.social for the shoutout in her #ic2s2 keynote today!)

aclanthology.org/2025.acl-lon...
July 23, 2025 at 2:35 PM
July 2, 2025 at 4:57 AM
And I think lost poetry provides a counterexample to this claim:
May 19, 2025 at 5:25 PM
go bears!
March 10, 2025 at 2:20 AM
There's been a lot of work on "culture" in NLP, but not much agreement on what it is.

A position paper by me, @dbamman.bsky.social, and @ibleaman.bsky.social on cultural NLP: what we want, what we have, and how sociocultural linguistics can clarify things.

Website: naitian.org/culture-not-...

1/n
February 18, 2025 at 8:45 PM
February 16, 2025 at 5:09 PM
I've been obsessed with Sotheby's auction videos bc there's such an internally coherent set of language and gesture that the auctioneers use

and because there is one pose they all do that reminds me of pointer dogs
December 16, 2024 at 10:00 PM
I love reading conversation analysis papers bc sometimes it just feels like watching TV
December 12, 2024 at 6:56 PM
Emotional range reflects discursive patterns: phrases with low range tend to be grounded in restrictive interactional settings, while those with high range are more open-ended. Performance on screen draws on the same semiotic toolbox that the audience interacts with day-to-day! 5/n
November 19, 2024 at 4:56 PM
We use this parallel dataset of text and audio to answer questions about narrative trajectory and emotionality. We find that emotionality in general has decreased over the years, even after controlling for the dialogue. This means the effect isn’t just due to a shift in writing over the years. 3/n
November 19, 2024 at 4:56 PM
Taking advantage of our large corpus of contemporary American film, we create a pipeline of speech emotion recognition and transcription models to match the words being spoken with information about how they were delivered. 2/n
November 19, 2024 at 4:56 PM
🎬 Coming soon to a theater near you!🍿

Film is a semiotically rich medium: meaning is conveyed through the music, visuals, language, and more. A new paper from me and @dbamman.bsky.social explores what it means to computationally study performance in film.

Website: naitian.org/once-more-wi...

1/n
November 19, 2024 at 4:56 PM
election anxiety is grabbing the raw data to calculate my own stats
November 6, 2024 at 4:44 AM
Check out my library card collection
July 9, 2024 at 12:57 AM
I'm already 10x-ing my writing with AI.
April 21, 2024 at 10:03 PM
Doing all this required a huge engineering effort in which we developed a pipeline to process over 27.9M images posted to Reddit between 2011 and 2021. We make the semantic clusters publicly available, as well as URLs to fetch all the images.
November 16, 2023 at 1:59 AM
We use these clusters to surface subreddits that use visually different templates to convey the same semantics. We find not only do subreddits prefer certain styles over others, but they choose ones that index localized cultural knowledge!
November 16, 2023 at 1:58 AM
Recalling distributional semantics, we fine-tune an LM to learn how fill text aligns with the template, giving semantic embeddings for templates that we can cluster. Visually diverse clusters emerge even for complex semantic functions!
November 16, 2023 at 1:58 AM
We take advantage of the unique compositional multimodality of memes to learn the semantics of meme templates without supervision by breaking them down into the base image template and the text fill that goes inside.
November 16, 2023 at 1:57 AM
Memes are pervasive in online speech. Do they have the socially meaningful variation we see in other aspects of language? YES!

Preprint from me, @davidjurgens.bsky.social and @dbamman.bsky.social on the semantic structure and visual diversity of 3.8M Reddit memes.

naitian.org/social-memeing
November 16, 2023 at 1:56 AM
is this... interdisciplinarity?
October 2, 2023 at 5:08 PM