Kai Li
nalsi.bsky.social
Assistant Professor at School of Information Sciences of the University of Tennessee, Knoxville. #scientometrics and #metadata researcher. Former #librarian. #Cat person.

ORCID: https://orcid.org/0000-0002-7264-365X
Reposted by Kai Li
Hi DH friends, join us on Nov 10, 10-11 am CT, for “New Book History Research with Internet Data”, a hybrid panel sponsored by SHAR, to explore challenges and opportunities of using Internet data and digital methods for book history research. More info in the poster attached and comments :)
November 6, 2025 at 6:09 PM
Excited to join the “New Book History Research with Internet Data” panel at Indiana University Bloomington on Nov 10. We’ll discuss how online data (reviews, platforms, metrics) reshape book history & publishing studies.

🔗 events.iu.edu/siceiub/even...

#BookHistory #DigitalHumanities
events.iu.edu
November 4, 2025 at 3:41 AM
Reposted by Kai Li
I am beyond thrilled to share that DATA BY DESIGN: AN INTERACTIVE HISTORY OF DATA VISUALIZATION, 1789-1900 is now open for community review at 💚 📊 dataxdesign.io 📊 💙. It's the work of 15+ people across 5 institutions, 2 continents, 2 babies, and a global pandemic. A 🧵 but first:
May 21, 2024 at 2:32 PM
Reposted by Kai Li
Applications are open for the Doctoral Consortium at the 2025 Summit on Responsible Computing, AI, and Society rcais.github.io. Taking place October 27, 2025.

Applications received by September 15 will receive full consideration.
2025 Summit on Responsible Computing, AI, and Society
rcais.github.io
July 24, 2025 at 1:24 PM
Reposted by Kai Li
I'm glad to share the new data paper in the Journal of Open Humanities Data with my wonderful collaborator @nalsi.bsky.social and students!! 😇 Check out our work here: openhumanitiesdata.metajnl.com/articles/10....
GraphEidos: A Dataset of Visual Rhetoric in Digital Humanities | Journal of Open Humanities Data
openhumanitiesdata.metajnl.com
July 9, 2025 at 3:00 PM
Reposted by Kai Li
AI & Society faculty opportunity posted just this month with a start date of Fall 2025 and beyond: www.ubjobs.buffalo.edu/postings/57734 ‼️🐂

For "interdisciplinary scholars whose research agenda connects the study of AI with humanistic and/or social scientific line(s) of study"
Assistant, Associate or Full Professor, AI & Society
The Department of AI and Society (AIS) at the University at Buffalo (UB) invites candidates to apply for multiple positions as Assistant Professor, Associate Professor, or Full Professor. The new AIS ...
www.ubjobs.buffalo.edu
July 7, 2025 at 6:42 PM
And the paper is published here: www.nature.com/articles/s41....
June 11, 2025 at 1:52 PM
🚨 Excited to share our latest study (forthcoming in Humanities and Social Sciences Communications), in collaboration with Dr. Chaoqun Ni and Xiang Zheng:
Gender disparities in STEM research enterprise in China
👉 ssrn.com/abstract=523...
Gender disparities in STEM research enterprise in China
Gender diversity is essential to the creation of high-quality research and scientific advances. This study provides a large-scale analysis of gender disparities
ssrn.com
May 24, 2025 at 1:43 PM
Reposted by Kai Li
Really glad that this is out! Fantastic collaboration with @honglin-bao.bsky.social and @innovation.bsky.social!

Key insight: Knowledge diffusion is nearly impossible to constrain—where there's a will, there's a way. From the atomic bomb to ChatGPT, determined minds always find a path.
1/3
Where there’s a will there’s a way: ChatGPT is used more for science in countries where it is prohibited
Abstract. Regulating AI is a key societal challenge, but effective methods remain unclear. This study evaluates geographic restrictions on AI services, focusing on ChatGPT, which OpenAI blocks in seve...
direct.mit.edu
April 19, 2025 at 12:42 AM
🚀 New article out on @InformationMatters!
I had the pleasure of collaborating with Dr. Jaihyun Park from NTU Singapore to rethink data reuse in the age of #LLMs.
Check it out 👉
🔗 informationmatters.org/2025/04/reth...
#DataLifecycle #OpenScience #AI
Rethinking Reuse in Data Lifecycle in the Age of Large Language Models - Information Matters
In the world we are living in, a digital world, some data slips past our awareness, but very little data ever truly disappears. As we, information scientists, are concerned with reproducibility and re...
informationmatters.org
April 18, 2025 at 3:51 PM
🎉 Excited to share that Scientific Data is launching a new collection: "Data on Science and Innovation Processes, Outputs and Outcomes." This special issue welcomes data descriptors of datasets that shed light on how science and innovation work—from inputs to outcomes. www.nature.com/collections/...
About the Guest Editors | Data on Science and Innovation Processes, Outputs and Outcomes
This Scientific Data collection aims to gather data descriptors on high-quality, reusable datasets that illuminate the inputs, processes, and outputs of science and innovation.
www.nature.com
April 14, 2025 at 1:01 PM
Reposted by Kai Li
Just published on the blog, Assistant Professor Kai Li gives new perspectives on challenges and opportunities for large-scale analyses on data use and data citations. makedatacount.org/read-our-blo...
April 3, 2025 at 5:24 PM
Reposted by Kai Li
Modern-Day Oracles or Bullshit Machines?

Jevin West (@jevinwest.bsky.social) and I have spent the last eight months developing the course on large language models (LLMs) that we think every college freshman needs to take.

thebullshitmachines.com
INTRODUCTION
thebullshitmachines.com
February 4, 2025 at 4:12 PM
Reposted by Kai Li
The more I read and think about generative AI and what its creators want us to do with it, the more I'm reminded of this quote from Victor Papanek's 'Design for the Real World'
December 20, 2024 at 12:33 PM
Reposted by Kai Li
Every government needs effective science advisers; not every scientist is a born adviser, but schooling and practice can help to bridge the gap, as we argue in this week’s Nature editorial
@natureportfolio.bsky.social 🧪

www.nature.com/articles/d41...
Advising governments about science is essential but difficult. So train people to do it
A great scientist doesn’t necessarily make an effective science adviser — but schooling and practice can help to bridge the gap.
www.nature.com
December 6, 2024 at 8:41 AM
Reposted by Kai Li
Call for submissions: RESSH2025 Conference on research assessment reform in the humanities and social sciences. Held next May in Helsinki.

vastuullinentiede.fi/en/events/re...
RESSH2025 Conference
RESSH2025 conference of the international association ENRESSH (European Network for Research Evaluation in the SSH) is organized 19-21 May, 2025, in Helsinki, Finland. It brings together specialists o...
vastuullinentiede.fi
December 6, 2024 at 3:02 PM
Reposted by Kai Li
how do researchers use LMs in their work & why?

we surveyed 800 researchers across fields of study, race, gender, seniority asking their opinions on:

🐟 which research activities (eg coding, writing)
🐠 benefits vs risks
🦈 willingness to disclose

findings in @simonaliao.bsky.social's thread 🧵
December 2, 2024 at 8:55 PM
A poem about information organization and retrieval written by ChatGPT. I read it in the last session of my class. The students seemed to like it.
November 22, 2024 at 2:23 AM
Reposted by Kai Li
Science is political in so many ways. One of them is reflected in #citation politics. And we all do it, willingly or not.

This paper is a glimpse into how some strategies (e.g. decolonizing or diversifying citations) that aim to undermine existing hierarchies have limitations worth thinking about.
November 19, 2024 at 8:52 AM
Reposted by Kai Li
I will assign this in future classes. It nails one common argument I find unconvincing with regard to students, which is the idea of LLMs as interlocutors to help refine thinking. That only really works if you’re an expert who can spot the mistakes, and students are typically not in a position to do that
November 18, 2024 at 5:12 PM
A very personal thought: I really don't like how #StarterPack is designed in Bluesky. It creates a structure where the creators get all the power to manage the list and the credit for the list. So as useful as it is to share certain information, it is also a barrier to more equal communication.
November 17, 2024 at 3:05 PM
Reposted by Kai Li
Podcasts are a popular medium, but data for computational research is limited! We introduce the Structured Podcast Research Corpus (SPoRC - huggingface.co/datasets/bli...), a large, multimodal dataset of English podcasts 🧵

arxiv.org/abs/2411.07892
Mapping the Podcast Ecosystem with the Structured Podcast Research Corpus
Podcasts provide highly diverse content to a massive listener base through a unique on-demand modality. However, limited data has prevented large-scale computational analysis of the podcast ecosystem....
arxiv.org
November 14, 2024 at 10:36 PM
Doing research on books in the #REF2021 project. One weird finding is that political science has a very high share of authored books among all books (vs. edited books and book chapters). #bibliographicdata #scisci #scientometrics
November 11, 2024 at 2:47 AM
Reposted by Kai Li
Once again, millennials are the real greatest generation
November 8, 2024 at 12:57 PM
Reposted by Kai Li
We’re going to hear lots of stories about which people, policies and rhetoric are to blame for the Democrats’ defeat.

Some of those stories may even be true!

But an underrated factor is that 2024 was an absolutely horrendous year for incumbents around the world 👇 
November 7, 2024 at 11:33 AM