Ben Litterer
@blitt.bsky.social
PhD student interested in computational approaches to language, politics, and media
Iowa | Michigan
Podcasts are a popular medium, but data for computational research is limited! We introduce the Structured Podcast Research Corpus (SPoRC - huggingface.co/datasets/bli...), a large, multimodal dataset of English podcasts 🧵
arxiv.org/abs/2411.07892
Mapping the Podcast Ecosystem with the Structured Podcast Research Corpus
Podcasts provide highly diverse content to a massive listener base through a unique on-demand modality. However, limited data has prevented large-scale computational analysis of the podcast ecosystem....
November 14, 2024 at 10:36 PM