#Quanteda
newspapers get new authors and new guidelines in article writing. And the changes (apart from the en-dash) are too small to really spot with human eyes.

But I do agree with Mario that the 2025 change is... unexpectedly large.

Ok breakfast, then let's see about the quanteda package.

3/3 FOR NOW
November 2, 2025 at 9:15 AM
Text Analytics in R with quanteda (Part 1)

"Required Packages
library(quanteda)
library(quanteda.textstats)
library(quanteda.textplots)
library(readr)
library(dplyr)
library(ggplot2)
library(stringr)
library(DT)
library(tidytext)
Understanding Text Analytics Fundamentals
Text analytic...

Co..."
October 14, 2025 at 4:37 PM
You don’t. You just need a DFM, you can do that easily outside of quanteda by just creating a contingency table of count by token. I thought it would be easier. It probably wasn’t necessary.
August 22, 2025 at 7:12 PM
why do you need all the quanteda stuff?
August 22, 2025 at 7:09 PM
Trying to apply a regex-based dictionary to a (fairly large) corpus in Quanteda, and my computer is chocking hard on it. Are there any tips for optimizing such tasks that other users may have?
#Quanteda #R
August 11, 2025 at 3:53 PM
Updates on CRAN: bplsr (1.0.4), CDMConnector (2.1.1), data.table (1.17.8), future.mirai (0.10.1), gstat (2.1-4), MBAnalysis (2.1.0), newsmap (0.9.2), performance (0.15.0), quanteda (4.3.1), torch (0.15.1), ttservice (0.5.3), wordmap (0.9.5)
July 10, 2025 at 1:41 PM
CRAN updates: CDMConnector newsmap quanteda ttservice wordmap #rstats
July 10, 2025 at 1:02 PM
Caught doing quanteda out in the open...
Der #Demokratie bei der Arbeit zugucken: Forscher*innen der #FUBerlin haben Reden aus allen 16 Landtagen analysiert & in der Datenbank "StateParl“ durchsuchbar gemacht.
Die teils überraschenden Ergebnisse ➡️
www.fu-berlin.de/campusleben/...

Foto: Nicolas Pannetier | Atelier Limo
July 9, 2025 at 4:04 PM
New conceptual review + tutorial on text embeddings out in #APA_Journals w/ @almogsi. Beginner-friendly, but experts will find spicy new takes as well. Tag a colleague who’s still counting words... #RStats #tidyverse #quanteda
1/
June 24, 2025 at 2:10 PM
Se abordarán estrategias cuantitativas, cualitativas y computacionales para la investigación social, incluyendo:
🔹 IA y modelos de lenguaje
🔹 RRSS (Gephi, UCINET)
🔹 Análisis textual con R (Quanteda)
🔹 Etnografía multisituada
🔹 Participación social y datos digitales
June 5, 2025 at 8:19 PM
I released the wordvector package v0.5.0. It is rapidly getting better and different from the original Word2vec package. Please read "Align word vectors of multiple Word2vec models" about the new function blog.koheiw.net?p=2299 #rstats #quanteda
Align word vectors of multiple Word2vec models
I have been developing a new R package called wordvector since last year. I started it as a fork of the Word2vec package but made several important changes to make it fully compatible with quanteda…
blog.koheiw.net
May 24, 2025 at 2:50 AM
Updates on CRAN: insight (1.3.0), morepls (0.2), NetCoupler (0.1.1), quanteda (4.3.0), riskRegression (2025.05.20), rmdwc (0.3.1), spatstat.random (3.4-1), SSDM (0.2.11)
May 20, 2025 at 1:38 PM
CRAN updates: GenomeAdmixR quanteda rmdwc #rstats
May 20, 2025 at 12:02 PM
In diesem Video erkläre ich die Grundlagen der Textanalyse in R. Ich zeige, wie man einen Corpus, Tokens und eine Dokumenten Feature Matrix in R mit quanteda erstellen kann und wie man damit arbeiten kann.
www.youtube.com/watch?v=TSNw...
Grundlagen der Textanalyse in R
YouTube video by Dr. Benjamin Schlegel
www.youtube.com
March 17, 2025 at 5:30 PM
Updates on CRAN: OmopSketch (0.2.0), pastboon (0.1.3), pim (2.0.4), qfa (4.0), quanteda (4.2.0), QuickJSR (1.5.1), RBesT (1.8-0), RcppGetconf (0.0.4), spatstat.explore (3.3-4), spData (2.3.4), statnet.common (4.11.0), tidyterra (0.6.2), webdav (0.1.3)
January 8, 2025 at 1:24 PM
CRAN updates: bdsvd ggiraph highs isopam laminr mixlm pim quanteda #rstats
January 8, 2025 at 12:02 PM
A few days ago, I received an email from a researcher asking if text analysis is becoming irrelevant because of AI... blog.koheiw.net?p=2254 #text-as-data #quanteda
December 13, 2024 at 12:16 AM
I already blogged today, so don't want to write more. But here is the code
November 24, 2024 at 3:04 PM
If you think the number of topics, k, is the only important parameter for topic models, you need to read this post and the research paper. blog.koheiw.net?p=2233 I created a new model to optimize the Dirichlet priors to analyze imbalanced corpus more accurately. #rstats #quanteda
A new topic model for analysis imbalanced corpus
I have been developing and testing a new topic model called model Distributed Asymmetric Allocation (DAA) because latent Dirichlet allocation (LDA) takes a long time to fit to a large corpus but do…
blog.koheiw.net
November 23, 2024 at 10:02 AM
Would love to be added to this starter pack
#quanteda #stopwords #spacyr
November 23, 2024 at 8:38 AM
Not sure if it's helpful but two R #packages which may be of use are quanteda:: (for quant text analysis) and tidytext:: is useful for text mining. If this helps at all? Otherwise I am clueless on the subject.
November 18, 2024 at 6:21 PM
Quanteda, seu lindo. Adorei estudar e trabalhar com você hoje.

Amanhã será dia de só escrever análise e aprimorar os achados que tive no exploratório enquanto montava o código.

quanteda: Quantitative Analysis of Textual Data
quanteda tutorials :: Tutorials for quanteda
Introduction to quantitative text analysis using quanteda
tutorials.quanteda.io
September 10, 2024 at 1:18 AM
Hoje estou me sentindo muito garoto de programa fazendo as adaptações do meu código no R utilizando a documentação do pacote que mudou.

Quanteda eu te odeio, mas te amo!
September 9, 2024 at 3:49 PM