brendan o’connor
@brenocon.bsky.social
natural language processing, social science, umass, western mass
http://brenocon.com he/him
http://brenocon.com he/him
Reposted by brendan o’connor
The massacre of the ethics/safety teams and the internal reorientation away from anything that hinted at broader purpose (with exception for the more profitable bits of natsec) is a story that has yet to be properly told.
there was an incredibly stark change at my job between 2023 and 2025. just absolutely day and night.
i dont think there’s ever been a point where corporate america has had a sincere sense of morality but it is probably a sign of the times that the culture at most major firms does not even pretend to encourage ethical behavior anymore. the transformation is really stark in tech.
November 10, 2025 at 6:39 PM
The massacre of the ethics/safety teams and the internal reorientation away from anything that hinted at broader purpose (with exception for the more profitable bits of natsec) is a story that has yet to be properly told.
Reposted by brendan o’connor
Watts & Strogatz (1998) except you’re living in the bad timeline that’s also the dumbest possible timeline. www.nytimes.com/2025/10/30/u...
October 31, 2025 at 7:15 PM
Watts & Strogatz (1998) except you’re living in the bad timeline that’s also the dumbest possible timeline. www.nytimes.com/2025/10/30/u...
Reposted by brendan o’connor
Your regular reminder that NSF is required by US law to support increasing the participation of historically less represented groups in science and technology fields
October 28, 2025 at 1:55 AM
Your regular reminder that NSF is required by US law to support increasing the participation of historically less represented groups in science and technology fields
and (i believe) tal is presenting this work at the umass linguistics colloquium, this friday! ILC S211 at 3:30pm
Another banger from @tallinzen.bsky.social .
Also fits with some of the criticisms of Centaur and my faculty-based approach generally; if you want LLMs to model human cognition, give them more architecture akin to human faculty psychology like long and short-term memory.
arxiv.org/abs/2510.05141
Also fits with some of the criticisms of Centaur and my faculty-based approach generally; if you want LLMs to model human cognition, give them more architecture akin to human faculty psychology like long and short-term memory.
arxiv.org/abs/2510.05141
To model human linguistic prediction, make LLMs less superhuman
When people listen to or read a sentence, they actively make predictions about upcoming words: words that are less predictable are generally read more slowly than predictable ones. The success of larg...
arxiv.org
October 15, 2025 at 9:19 PM
and (i believe) tal is presenting this work at the umass linguistics colloquium, this friday! ILC S211 at 3:30pm
Reposted by brendan o’connor
October 6, 2025 at 8:26 PM
Reposted by brendan o’connor
I have a new blog post about the so-called “tokenizer-free” approach to language modeling and why it’s not tokenizer-free at all. I also talk about why people hate tokenizers so much!
September 25, 2025 at 3:14 PM
I have a new blog post about the so-called “tokenizer-free” approach to language modeling and why it’s not tokenizer-free at all. I also talk about why people hate tokenizers so much!
Reposted by brendan o’connor
My #UMassAmherst colleagues Jen Lundquist and Kathy Forde ran a great workshop - "Reclaim the Narrative" at UMass last week on helping university staff and faculty tell stories about the importance of the work we do for our students and for society as a whole.
September 24, 2025 at 4:20 PM
My #UMassAmherst colleagues Jen Lundquist and Kathy Forde ran a great workshop - "Reclaim the Narrative" at UMass last week on helping university staff and faculty tell stories about the importance of the work we do for our students and for society as a whole.
great paper! we already found it useful to inform another ongoing project (in more of a health domain; many domains face similar issues)
Very excited that my paper with @katakeith.bsky.social is now out in @polanalysis.bsky.social. We investigate whether LLMs actually follow the instructions/definitions provided in codebooks, propose some diagnostics, and release a new evaluation dataset.
www.cambridge.org/core/journal...
www.cambridge.org/core/journal...
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts | Political Analysis | Cambridge Core
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts
www.cambridge.org
September 19, 2025 at 3:01 PM
great paper! we already found it useful to inform another ongoing project (in more of a health domain; many domains face similar issues)
Reposted by brendan o’connor
When I was placed on the Professor Watchlist in 2021, people sent death threats about my children. I had security officers monitor my 8yo at school.
Where is all the outrage for those of us who have been targeted for years? Where is the outrage for our families?
My own colleagues are silent.
Where is all the outrage for those of us who have been targeted for years? Where is the outrage for our families?
My own colleagues are silent.
September 15, 2025 at 8:25 PM
When I was placed on the Professor Watchlist in 2021, people sent death threats about my children. I had security officers monitor my 8yo at school.
Where is all the outrage for those of us who have been targeted for years? Where is the outrage for our families?
My own colleagues are silent.
Where is all the outrage for those of us who have been targeted for years? Where is the outrage for our families?
My own colleagues are silent.
Reposted by brendan o’connor
UMSI is running multiple searches this year, starting with the John Derby Evans Professor in Information, at the Assistant or Associate level!
This is open to anyone working at the intersection of tech and society, with a closing date of Nov 1, 2025. Please share!
www.si.umich.edu/people/facul...
This is open to anyone working at the intersection of tech and society, with a closing date of Nov 1, 2025. Please share!
www.si.umich.edu/people/facul...
John Derby Evans Professorship in Information (Assistant or Associate Professor) | umsi
The University of Michigan School of Information (UMSI) invites applications for a tenure-track faculty position focusing on technology and society.
www.si.umich.edu
September 15, 2025 at 8:08 PM
UMSI is running multiple searches this year, starting with the John Derby Evans Professor in Information, at the Assistant or Associate level!
This is open to anyone working at the intersection of tech and society, with a closing date of Nov 1, 2025. Please share!
www.si.umich.edu/people/facul...
This is open to anyone working at the intersection of tech and society, with a closing date of Nov 1, 2025. Please share!
www.si.umich.edu/people/facul...
Reposted by brendan o’connor
LLMs introduce a huge range of new capabilities for research, but also make it possible for researchers to "hack" their results in new ways by how they chose to use models for annotation
This is a useful pass at quantifying some of the risk, and some mitigation strategies arxiv.org/pdf/2509.08825
This is a useful pass at quantifying some of the risk, and some mitigation strategies arxiv.org/pdf/2509.08825
September 15, 2025 at 2:21 PM
LLMs introduce a huge range of new capabilities for research, but also make it possible for researchers to "hack" their results in new ways by how they chose to use models for annotation
This is a useful pass at quantifying some of the risk, and some mitigation strategies arxiv.org/pdf/2509.08825
This is a useful pass at quantifying some of the risk, and some mitigation strategies arxiv.org/pdf/2509.08825
this is really bad (CW suicide discussion. a lot of it, thanks to chatgpt)
I got the complaint in the horrific OpenAI self harm case the the NY Times reported today
This is way way worse even than the NYT article makes it out to be
OpenAI absolutely deserves to be run out of business
This is way way worse even than the NYT article makes it out to be
OpenAI absolutely deserves to be run out of business
August 26, 2025 at 5:19 PM
this is really bad (CW suicide discussion. a lot of it, thanks to chatgpt)
Reposted by brendan o’connor
Pleased to share the latest version of my paper with Arthur Spirling and @lexipalmer.bsky.social on replication using LMs
We show:
1. current applications of LMs in political science research *don't* meet basic standards of reproducibility...
We show:
1. current applications of LMs in political science research *don't* meet basic standards of reproducibility...
December 17, 2024 at 7:50 PM
Pleased to share the latest version of my paper with Arthur Spirling and @lexipalmer.bsky.social on replication using LMs
We show:
1. current applications of LMs in political science research *don't* meet basic standards of reproducibility...
We show:
1. current applications of LMs in political science research *don't* meet basic standards of reproducibility...
Reposted by brendan o’connor
GPT-5 lands first place on NoCha, our long-context book understanding benchmark.
That said, this is a tiny improvement (~1%) over o1-preview, which was released almost one year ago. Have long-context models hit a wall?
Accuracy of human readers is >97%... Long way to go!
That said, this is a tiny improvement (~1%) over o1-preview, which was released almost one year ago. Have long-context models hit a wall?
Accuracy of human readers is >97%... Long way to go!
August 8, 2025 at 2:13 AM
GPT-5 lands first place on NoCha, our long-context book understanding benchmark.
That said, this is a tiny improvement (~1%) over o1-preview, which was released almost one year ago. Have long-context models hit a wall?
Accuracy of human readers is >97%... Long way to go!
That said, this is a tiny improvement (~1%) over o1-preview, which was released almost one year ago. Have long-context models hit a wall?
Accuracy of human readers is >97%... Long way to go!
Reposted by brendan o’connor
Terence Tao (@teorth.bsky.social) has written a thread on Mastodon about the impact of the federal grant freeze to UCLA, particularly to his own field of Mathematics. UCLA's IPAM (Institute of Pure and Applied Mathematics) could shut down entirely
mathstodon.xyz/@tao/1149568...
mathstodon.xyz/@tao/1149568...
Terence Tao (@tao@mathstodon.xyz)
The current administration in the US has, through various funding agencies such as the NSF and NIH, has recently suspended virtually all federal grants to my home university, UCLA (including my own p...
mathstodon.xyz
August 2, 2025 at 6:27 PM
Terence Tao (@teorth.bsky.social) has written a thread on Mastodon about the impact of the federal grant freeze to UCLA, particularly to his own field of Mathematics. UCLA's IPAM (Institute of Pure and Applied Mathematics) could shut down entirely
mathstodon.xyz/@tao/1149568...
mathstodon.xyz/@tao/1149568...
Reposted by brendan o’connor
Check this out! Happy to talk to folks about Valley living, feel free to DM.
Mount Holyoke College in lovely western Massachusetts is hiring a tenure-track neuroscientist. Amazing students and fantastic faculty support - please share!
mtholyoke.wd5.myworkdayjobs.com/en-US/Extern...
mtholyoke.wd5.myworkdayjobs.com/en-US/Extern...
Assistant Professor of Neuroscience and Behavior
Job no: R-0000002388 Position Title: Assistant Professor of Neuroscience and Behavior Work Type: Faculty Full time In-Person Start Date: 07/01/2026 Job Description: The Department of Neuroscience and ...
mtholyoke.wd5.myworkdayjobs.com
July 28, 2025 at 3:20 PM
Check this out! Happy to talk to folks about Valley living, feel free to DM.
Reposted by brendan o’connor
You basically got it. He said "I would love to see the people who hire graduating PhD students say with a straight face right now that they would go ahead and hire students who had *only* done work on the small models that you were describing." (The full stream is available on underline)
July 28, 2025 at 4:07 PM
You basically got it. He said "I would love to see the people who hire graduating PhD students say with a straight face right now that they would go ahead and hire students who had *only* done work on the small models that you were describing." (The full stream is available on underline)
Reposted by brendan o’connor
The #ACL2025 #ACL2025NLP feed is up and running! It matches both hashtags and any posts from or mentions of @aclmeeting.bsky.social
Pin it to your home 📌 and enjoy!
bsky.app/profile/did:...
Pin it to your home 📌 and enjoy!
bsky.app/profile/did:...
July 17, 2025 at 11:15 AM
The #ACL2025 #ACL2025NLP feed is up and running! It matches both hashtags and any posts from or mentions of @aclmeeting.bsky.social
Pin it to your home 📌 and enjoy!
bsky.app/profile/did:...
Pin it to your home 📌 and enjoy!
bsky.app/profile/did:...
#acl2025 anyone get a good quote of phil resnik's last comment?
context: (some?all?) panelists & him agree the field needs more deep, careful research on smaller models to do better science. everyone is frustrated with impossibility of large-scale pretraining experiments
context: (some?all?) panelists & him agree the field needs more deep, careful research on smaller models to do better science. everyone is frustrated with impossibility of large-scale pretraining experiments
July 28, 2025 at 3:24 PM
#acl2025 anyone get a good quote of phil resnik's last comment?
context: (some?all?) panelists & him agree the field needs more deep, careful research on smaller models to do better science. everyone is frustrated with impossibility of large-scale pretraining experiments
context: (some?all?) panelists & him agree the field needs more deep, careful research on smaller models to do better science. everyone is frustrated with impossibility of large-scale pretraining experiments
Reposted by brendan o’connor
Excited to present two papers at #ACL2025!
🗓️30 July, 11 AM: 𝛿-Stance: A Large-Scale Real World Dataset of Stances in Legal Argumentation. w/ Douglas Rice and @brenocon.bsky.social
📍At Hall 4/5. 🧵👇
🗓️30 July, 11 AM: 𝛿-Stance: A Large-Scale Real World Dataset of Stances in Legal Argumentation. w/ Douglas Rice and @brenocon.bsky.social
📍At Hall 4/5. 🧵👇
July 28, 2025 at 10:57 AM
Excited to present two papers at #ACL2025!
🗓️30 July, 11 AM: 𝛿-Stance: A Large-Scale Real World Dataset of Stances in Legal Argumentation. w/ Douglas Rice and @brenocon.bsky.social
📍At Hall 4/5. 🧵👇
🗓️30 July, 11 AM: 𝛿-Stance: A Large-Scale Real World Dataset of Stances in Legal Argumentation. w/ Douglas Rice and @brenocon.bsky.social
📍At Hall 4/5. 🧵👇
Reposted by brendan o’connor
🗓️29 July, 4 PM: Automated main concept generation for narrative discourse assessment in aphasia. w/
@marisahudspeth.bsky.social, Polly Stokes, Jacquie Kurland, and @brenocon.bsky.social
📍Hall 4/5.
Come by to chat about argumentation, narrative texts, policy & law, and beyond! #ACL2025NLP
@marisahudspeth.bsky.social, Polly Stokes, Jacquie Kurland, and @brenocon.bsky.social
📍Hall 4/5.
Come by to chat about argumentation, narrative texts, policy & law, and beyond! #ACL2025NLP
July 28, 2025 at 10:57 AM
🗓️29 July, 4 PM: Automated main concept generation for narrative discourse assessment in aphasia. w/
@marisahudspeth.bsky.social, Polly Stokes, Jacquie Kurland, and @brenocon.bsky.social
📍Hall 4/5.
Come by to chat about argumentation, narrative texts, policy & law, and beyond! #ACL2025NLP
@marisahudspeth.bsky.social, Polly Stokes, Jacquie Kurland, and @brenocon.bsky.social
📍Hall 4/5.
Come by to chat about argumentation, narrative texts, policy & law, and beyond! #ACL2025NLP
Reposted by brendan o’connor
Highlighting this thread. Based on what I'm seeing at #ic2s2 this week, this line of work is hot (if a bit crowded), but I predict will only be more widely adopted by social scientists in the future.
What are your favorite recent papers on using LMs for annotation (especially in a loop with human annotators), synthetic data for task-specific prediction, active learning, and similar?
Looking for practical methods for settings where human annotations are costly.
A few examples in thread ↴
Looking for practical methods for settings where human annotations are costly.
A few examples in thread ↴
July 23, 2025 at 1:07 PM
Highlighting this thread. Based on what I'm seeing at #ic2s2 this week, this line of work is hot (if a bit crowded), but I predict will only be more widely adopted by social scientists in the future.
Reposted by brendan o’connor
🥳 🎉 ❤️ The ACL 2025 Proceedings are live on the ACL Anthology 🥰 !
We’re thrilled to pre-celebrate the incredible research 📚 ✨ that will be presented starting Monday next week in Vienna 🇦🇹 !
Start exploring 👉 aclanthology.org/events/acl-2...
#NLProc #ACL2025NLP #ACLAnthology
We’re thrilled to pre-celebrate the incredible research 📚 ✨ that will be presented starting Monday next week in Vienna 🇦🇹 !
Start exploring 👉 aclanthology.org/events/acl-2...
#NLProc #ACL2025NLP #ACLAnthology
Annual Meeting of the Association for Computational Linguistics (2025) - ACL Anthology
pdf bibProceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)Wanxiang Che | Joyce Nabende | Ekaterina Shutova | Mohammad Taher Pilehvar
aclanthology.org
July 22, 2025 at 8:00 PM
🥳 🎉 ❤️ The ACL 2025 Proceedings are live on the ACL Anthology 🥰 !
We’re thrilled to pre-celebrate the incredible research 📚 ✨ that will be presented starting Monday next week in Vienna 🇦🇹 !
Start exploring 👉 aclanthology.org/events/acl-2...
#NLProc #ACL2025NLP #ACLAnthology
We’re thrilled to pre-celebrate the incredible research 📚 ✨ that will be presented starting Monday next week in Vienna 🇦🇹 !
Start exploring 👉 aclanthology.org/events/acl-2...
#NLProc #ACL2025NLP #ACLAnthology
Unfortunately I'm missing #ic2s2 but our work on practical event extraction & analyzing international news coverage bias will be presented there as well! this work --
Excited to share our FAME method for news identification: Fingerprint-to-Article Matching for Events from a DB! We use it to study news coverage of disasters and conflicts (w @brenocon.bsky.social @ethanz.bsky.social). Check out our talk and poster at @icwsm.bsky.social!🧵👇
arxiv.org/abs/2506.12925
arxiv.org/abs/2506.12925
July 22, 2025 at 5:30 PM
Unfortunately I'm missing #ic2s2 but our work on practical event extraction & analyzing international news coverage bias will be presented there as well! this work --
Reposted by brendan o’connor
🥲 The cost to future society will be much more than the dollars they think they’re saving #mathsky www.scientificamerican.com/article/can-...
Math Is Quietly in Crisis over NSF Funding Cuts
A 72 percent reduction in federal funding is devastating to math research. The American Mathematical Society is offering $1 million in backstop grants—but it’s likely not enough.
www.scientificamerican.com
July 19, 2025 at 1:24 AM
🥲 The cost to future society will be much more than the dollars they think they’re saving #mathsky www.scientificamerican.com/article/can-...