Leland McInnes
lelandmcinnes.bsky.social
A Mathematician dabbling in Data Science, especially unsupervised learning and data exploration. UMAP, HDBSCAN, PyNNDescent, DataMapPlot. (He/Him)
I'll be giving a talk at SciPy this year about DataMapPlot for visualizing data maps. I would love to meet potential users and chat about where to go next.

cfp.scipy.org/scipy2025/ta...
June 23, 2025 at 11:41 PM
I also updated the arXiv data map example to make use of new features in datamapplot.
lmcinnes.github.io/datamapplot_...

You can tweak parameters and build your own version:
gist.github.com/lmcinnes/e11...
June 22, 2025 at 9:59 PM
Explore Wikipedia through a data map. Pages are grouped by semantic similarity into topic clusters.
Hover to see details, zoom to explore more fine-grained topics, click to go to a page. Search by page name to find interesting starting points for exploration.

lmcinnes.github.io/datamapplot_...
June 22, 2025 at 3:36 PM
A recent addition to umap-learn is the ability to have ParametricUMAP embeddings that can be smoothly updated with new incoming data, creating new clusters as needed and gently adjusting the rest of the embedding to make space for them.

umap-learn.readthedocs.io/en/latest/tr...
November 3, 2024 at 8:21 PM
I tried using the current topic modelling workflow to make a map of 5 million Hacker News stories:

lmcinnes.github.io/datamapplot_...
October 18, 2024 at 5:12 PM
Shift click and drag to select papers and get a word-cloud summary.

Click on an individual point to open the paper.
October 11, 2024 at 1:42 AM
Explore 2.4 million papers on arXiv:

lmcinnes.github.io/datamapplot_...
October 11, 2024 at 1:41 AM
A new release of DataMapPlot adds the ability to place labels on top of the map for a word-cloud style look. As usual, there are plenty of options to fine-tune and customize to your needs.

github.com/TutteInstitu...
datamapplot.readthedocs.io/en/latest/la...
May 6, 2024 at 1:53 PM