Lingthusiasm - Lingthusiasm Episode 98: Helping computers decode...
Lingthusiasm Episode 98: Helping computers decode sentences - Interview with Emily M. Bender When a human learns a new word, we’re learning to attach that word to a set of concepts in the real world. When a computer “learns” a new word, it is creating some associations between that word and other words it has seen before, which can sometimes give it the appearance of understanding, but it doesn’t have that real-world grounding, which can sometimes lead to spectacular failures: hilariously implausible from a human perspective, just as plausible from the computer’s. In this episode, your host Lauren Gawne gets enthusiastic about how computers process language with Dr. Emily M. Bender, who is a linguistics professor at the University of Washington, USA, and cohost of the podcast Mystery AI Hype Theater 3000. We talk about Emily’s work trying to formulate a list of rules that a computer can use to generate grammatical sentences in a language, the differences between that and training a computer to generate sentences using the statistical likelihood of what comes next based on all the other sentences, and the further differences between both those things and how humans map language onto the real world. We also talk about paying attention to communities not just data, the labour practices behind large language models, and how Emily’s persistent questions led to the creation of the Bender Rule (always state the language you’re working on, even if it’s English). Click here for a link to this episode in your podcast player of choice or read the transcript here. Announcements: The 2024 Lingthusiasm Listener Survey is here! It’s a mix of questions about who you are as our listener, as well as some fun linguistics experiments for you to participate in. If you have taken the survey in previous years, there are new questions, so you can participate again this year. In this month’s bonus episode we get enthusiastic about three places where we can learn things about linguistics!! We talk about two linguistically interesting museums that Gretchen recently visited: the Estonian National Museum, as well as Mundolingua, a general linguistics museum in Paris. We also talk about Lauren’s dream linguistics travel destination: Martha’s Vineyard. Join us on Patreon now to get access to this and 90+ other bonus episodes. You’ll also get access to the Lingthusiasm Discord server where you can chat with other language nerds. Also, Patreon now has gift memberships! If you’d like to get a gift subscription to Lingthusiasm bonus episodes for someone you know, or if you want to suggest them as a gift for yourself, here’s how to gift a membership. Here are the links mentioned in the episode: Emily Bender Emily Bender on Bluesky and Twitter Mystery AI Hype Theater 3000 Mystery AI Hype Theater 3000: The Newsletter The AI Con by Emily M. Bender and Alex Hanna ‘Data Sovereignty and the Kaitiakitanga License’ on Te Hiku wordfreq by Robyn Speer on GitHub Lingthusiasm Episode ‘Making machines learn language - Interview with Janelle Shane’ Bonus with Janelle Shane: we do a dramatic reading of the funniest auto-generated Lingthusiasm episodes You can listen to this episode via Lingthusiasm.com, Soundcloud, RSS, Apple Podcasts/iTunes, Spotify, YouTube, or wherever you get your podcasts. You can also download an mp3 via the Soundcloud page for offline listening. To receive an email whenever a new episode drops, sign up for the Lingthusiasm mailing list. You can help keep Lingthusiasm ad-free, get access to bonus content, and more perks by supporting us on Patreon. Lingthusiasm is on Bluesky, Twitter, Instagram, Facebook, Mastodon, and Tumblr. Email us at contact [at] lingthusiasm [dot] com Gretchen is on Bluesky as @GretchenMcC and blogs at All Things Linguistic. Lauren is on Bluesky as @superlinguo and blogs at Superlinguo. Lingthusiasm is created by Gretchen McCulloch and Lauren Gawne. Our senior producer is Claire Gawne, our production editor is Sarah Dopierala, our production assistant is Martha Tsutsui Billins, our editorial assistant is Jon Kruk, and our technical editor is Leah Velleman. Our music is ‘Ancient City’ by The Triangles. This episode of Lingthusiasm is made available under a Creative Commons Attribution Non-Commercial Share Alike license (CC 4.0 BY-NC-SA).