Mat Kelly
machawk1.bsky.social
Mat Kelly
@machawk1.bsky.social
Assistant Professor - Information Science
Drexel University College of Computing & Informatics (CCI)

WS-DL & ODU CS alum
Most things web archiving-related
Now presenting our #jcdl2025 paper exploring IPARO, a decentralized web-archival system on IPFS/IPNS that embeds version links into archived content. We evaluate linking policies and storage/lookup/replay tradeoffs. @ipfs.tech

Slides: matkelly.com/presentation...
Paper: matkelly.com/papers/2025_...
December 18, 2025 at 8:33 PM
Reposted by Mat Kelly
More good news for the new year! We'll be hosting an official re-launch event for the COPTR tool discovery wiki! Please come along if you can... https://www.dpconline.org/events/eventdetail/579/-/dpclinic-january-relaunching-coptr
#DPClinic January - Relaunching COPTR! - Digital Preservation Coalition
www.dpconline.org
December 18, 2025 at 9:02 AM
Reposted by Mat Kelly
The ACM Digital Library, where a LOT of computing-related research is published (I'd say at least 75% of my own publications), is now not only providing (without consent of the authors and without opt-in by readers) AI-generated summaries of papers, but they appear as the *default* over abstracts.
December 16, 2025 at 11:31 PM
TIL about github.com/midwork-find..., a DuckDB ext. to query web archive CDX APIs directly from SQL. i.e., query Wayback Machine & Common Crawl CDX APIs like they were database tables!

#webarchiving @archive.org @waybackmachine.bsky.social

h/t @atomotic.com via @inkdroid.org(merveilles.town/@ink)
github.com
December 17, 2025 at 9:23 PM
My PhD student Chris Rauch presented OntExtract @ #JCDL2025! Web-based NLP w/ PROV-O provenance—comparing manual vs LLM pipelines & track semantic change.

Demo: ontextract.ontorealm.net
Slides: matkelly.com/presentations/2025_jcdl_ontextract.pdf
Paper: matkelly.com/papers/2025_jcdl_ontextract.pdf
December 16, 2025 at 6:02 PM
The work described in this article was supported by IMLS 🫶

#webarchiving
Our JASIST article, "Problems with archiving and replaying current web advertisements", was recently published: onlinelibrary.wiley.com/share/author...
@wiley.com
Co-authors: Alex Poole, Hyung Wook Choi, Christopher Rauch, @machawk1.bsky.social, @phonedudemln.bsky.social, @weiglemc.bsky.social
(1/9)
onlinelibrary.wiley.com
December 10, 2025 at 9:20 PM
Reposted by Mat Kelly
We also published a blog post on Information Matters, which provides a summary of our process for creating a dataset of 279 archived web ads and the problems we identified while archiving and replaying these ads. (2/9)
#WebArchiveWednesday @webscidl.bsky.social
informationmatters.org/2025/12/prob...
Problems With Archiving and Replaying Web Advertisements - Information Matters
Advertisements are an integral part of our cultural heritage, and this extends to online web advertisements. Unlike print ads, web ads are dynamic and interactive, which makes them difficult to archiv...
informationmatters.org
December 10, 2025 at 9:17 PM
Reposted by Mat Kelly
The Institute of Museum and Library Services, the federal agency that provides funding for America's libraries, has announced it is reinstating all grants it had previously terminated, including those to libraries and library organizations across the U.S.

Read more: https://bit.ly/4496W4V
December 4, 2025 at 1:30 AM
Reposted by Mat Kelly
Trip report for the 2025 ACM Hypertext Conference #HT2025 @acmht.bsky.social

@webscidl.bsky.social PhD students Tarannum Zaki, Rochana Obadage, & Dominik Soos and alumni @machawk1.bsky.social & @ibnesayeed.bsky.social attended in September, 2025.

ws-dl.blogspot.com/2025/11/2025...
2025-11-12: 36th ACM Conference on Hypertext and Social Media (HT 2025) Trip Report
The Web Science and Digital Libraries Research Group at Old Dominion University.
ws-dl.blogspot.com
November 13, 2025 at 7:22 PM
Reposted by Mat Kelly
To study Twitter is to study archived Twitter. And if you're replaying archived pages, you need to be familiar with the different generations of UIs.

Tarannum Zaki of @webscidl.bsky.social explores and classifies the different UIs.

ws-dl.blogspot.com/2025/10/2025...
2025-10-26: Exploring the Different Generations of Twitter/X's Tweet UI
The Web Science and Digital Libraries Research Group at Old Dominion University.
ws-dl.blogspot.com
October 27, 2025 at 9:18 PM
Reposted by Mat Kelly
Really excited to share this new paper on "Data Visualizations as Propaganda", co-led by PhD students Priya Dhawka and Nina Lutz, which just won a Best Paper award at the CSCW conference: dl.acm.org/doi/10.1145/...

[Short thread]
Data Visualizations as Propaganda: Tracing Lineages, Provenance, and Political Framings in Online Anti-Immigrant Discourse | Proceedings of the ACM on Human-Computer Interaction
Along with other visual content, data visualizations are increasingly used within online discourse, including political communication. Though often considered to be ''objective'', data visualizations ...
dl.acm.org
October 27, 2025 at 9:10 PM
The proceedings & recordings of the 5 presentations at the Web Archiving and Digital Libraries (WADL) 2025 Workshop from ACM Hypertext in September 2025 are now live!

* Proceedings: ualberta.scholaris.ca/items/02da5c...
* Recordings: wadlworkshop.github.io/2025/content... (@ Internet Archive & YT)
Proceedings of the Web Archiving and Digital Libraries (WADL) Workshop 2025
ualberta.scholaris.ca
October 24, 2025 at 5:01 PM
Reposted by Mat Kelly
please reach out with any tenure-track faculty opportunities in information, media, communication, or interdisciplinary computer science departments 🫡

learn more about me here: yuxiwu.com
yuxi wu, phd
yuxiwu.com
October 17, 2025 at 4:12 PM
Our article "Liberation of LMS-siloed Instructional Data" was just released by the Code4Lib Journal!

The work documents our efforts in extracting IMLS-funded bootcamp content from Blackboard for future reuse in library carpentries & for reference by students.

📝 journal.code4lib.org/articles/18462
October 22, 2025 at 3:49 PM
Reposted by Mat Kelly
Folks in digital preservation looking for a chill place to hang out off here - I run a digipres mastodon instance, digipres.club. I try to keep it running well and low stress. All are welcome.
digipres.club
Hometown is adapted from Mastodon, a decentralized social network with no ads, no corporate surveillance, and ethical design.
digipres.club
October 3, 2025 at 5:56 PM
Reposted by Mat Kelly
Highly recommend joining digipres.club! Misty and the moderators that help her do a great job of running a chill and interesting place free from all the annoying ads and algorithms of the commercial social media options.
Folks in digital preservation looking for a chill place to hang out off here - I run a digipres mastodon instance, digipres.club. I try to keep it running well and low stress. All are welcome.
digipres.club
Hometown is adapted from Mastodon, a decentralized social network with no ads, no corporate surveillance, and ethical design.
digipres.club
October 3, 2025 at 7:49 PM
That's a wrap on ACM Hypertext 2025! #ACMHT2025 #HT2025 #HT25

Next year's Hypertext 2026 will be in London at School of Advanced Study at University of London and the British Library. #HT2026

September 8-11, 2026

The theme will be Hypertext as Method.
September 18, 2025 at 9:32 PM
At #ACMHT2025 day 4, Yavuz Selim Kartal (from @gesis.org) discusses supplementing social media posts with AI-generated summaries, which are perceived as most understandable form of enrichment as compared to metadata enrichment or pull quotes. @acmht.bsky.social #HT2025

📄 dl.acm.org/doi/10.1145/...
September 18, 2025 at 2:41 PM
At Day 4 of #ACMHT2025, Karlijn Dinnissen describes the role of fairness & diversity in user choices and perception of music playlists.

* What do end users think makes a fair MusicRec sys?
* How does that impact their choices?
* How should they be informed?

#HT2025

📄 dl.acm.org/doi/10.1145/...
September 18, 2025 at 2:15 PM
Ágnes Horvát highlights in her @acmht.bsky.social keynote that the frequency of certain words in PubMed abstracts has drastically increased due to authors massaging their texts with LLMs. #HT2025 #ACMHT2025
September 17, 2025 at 7:35 PM
At the #HT2025 Day 3 keynote, Ágnes Horvát asks, "How is science effectively promoted in online social spaces? Does self-publicity online lead to a tangible increase in citations?"

When we account for venue, popularity of authors, etc., there still seems to be a net positive.
September 17, 2025 at 7:11 PM
In his #nht25 prez @ #HT25, David Millard (@hoosfoos.bsky.social) ids the parallels between hypertext & those that will come as a result of gen AI. Who controls meaning? What are the boundaries of text? We typically think of AI as an assistant, the human being's still the wordsmith.
September 17, 2025 at 3:24 PM
@eastgate.bsky.social describes resurrecting Information Cities at #NHT25 as a narrative, a journey toward understanding, a central theme in open world games like Elder Scrolls or Cyberpunk 2077 #ACMHT2025 #HT2025 @acmht.bsky.social
September 17, 2025 at 2:37 PM
@drchargood.bsky.social kicks off Hypertext '25 (@acmht.bsky.social) Day 3 w/ the Narrative & Hypertext Workshop #NHT25, a venue for both those new to HT and established projects. Scheduled are 3 paper presentations, a panel, and a debate. #ACMHT2025

nht.ecs.soton.ac.uk/2025/program...
nht.ecs.soton.ac.uk
September 17, 2025 at 2:28 PM
Reposted by Mat Kelly
Our browser extension, ArchiveWeb.page has recently surpassed 20,000 users on the Chrome Web store:

chromewebstore.google.com/detail/webre...

That's 20k+ users creating their own high-fidelity, interactive web archives, right in their own browsers!

If you use ArchiveWeb.page, leave us a review!
ArchiveWeb.page
ArchiveWeb.page
September 17, 2025 at 12:24 AM