HTTP Archive 💾
httparchive.org
@httparchive.org
Public dataset that tracks how the web is built. Maintained by @patmeenan.com, @paulcalvano.bsky.social, @tunetheweb.com, @maxostapenko.com, and Nurullah Demir
Pinned
What do you think? Shall we do another Web Almanac this year? Diving deep into all the data we collect to see what’s changing in web trends?

Check out this post if interested in getting involved:
github.com/HTTPArchive/...

Or nominate your favorite experts that you’d love to see author a chapter.
Contribute to the 2025 Web Almanac · HTTPArchive almanac.httparchive.org · Discussion #4062
Dear all, We are excited to announce the Call for Contributions for the 2025 Web Almanac (6th Edition)! The Web Almanac is an annual report that provides an overview of the state of the web, based ...
github.com
We've taken the new version of the Core Web Vitals Tech Report out of Beta and now consider this the report to use:

httparchive.org/reports/tech...

Anyone still using the old Looker Studio-based report, it's time to check out the new goodness with much quicker response times and improved UX!
October 15, 2025 at 10:03 PM
Reposted by HTTP Archive 💾
New @httparchive.org analysis about sites adding AI Bots and Crawlers to their robots.txt files. While robots.txt doesn't "block" bots by itself, it's a clear demonstration of the preferences of site owners and the sentiment towards AI crawlers on today's web. paulcalvano.com/2025-08-21-a...
AI Bots and Robots.txt
There’s been a lot of discussion lately around AI crawlers and bots, which are used to train LLMs and/or fetch content on behalf of their users. In the past few weeks I’ve seen blog posts about the am...
paulcalvano.com
August 21, 2025 at 12:36 PM
We're pleased to welcome Nurullah Demir as the newest maintainer of the HTTP Archive project!

Nurullah has helped lead the Web Almanac over the last two years and we look forward to seeing what this year's edition comes up with!

Welcome to the team Nurullah!
June 30, 2025 at 7:46 PM
🚨 Calling all web experts! 🚨

The 2025 Web Almanac is still open for contributors!

Know someone perfect for it? Mention them here and help us reach the right folks. 🙌

📢 Please help us spread the word!

🔗 Learn more: github.com/HTTPArchive/...
May 19, 2025 at 4:52 AM
Reposted by HTTP Archive 💾
It's that time of year where the web almanac is seeking contributors for the next edition.
It takes a fair amount of work, but it's ridiculously rewarding,
& guaranteed warm fuzzy feelings when you see the chapter you contributed towards published.

I encourage you to give it a go. See:
Contribute to the 2025 Web Almanac · HTTPArchive almanac.httparchive.org · Discussion #4062
Dear all, We are excited to announce the Call for Contributions for the 2025 Web Almanac (6th Edition)! The Web Almanac is an annual report that provides an overview of the state of the web, based ...
github.com
May 15, 2025 at 6:34 PM
Reposted by HTTP Archive 💾
What do you think? Shall we do another Web Almanac this year? Diving deep into all the data we collect to see what’s changing in web trends?

Check out this post if interested in getting involved:
github.com/HTTPArchive/...

Or nominate your favorite experts that you’d love to see author a chapter.
Contribute to the 2025 Web Almanac · HTTPArchive almanac.httparchive.org · Discussion #4062
Dear all, We are excited to announce the Call for Contributions for the 2025 Web Almanac (6th Edition)! The Web Almanac is an annual report that provides an overview of the state of the web, based ...
github.com
April 27, 2025 at 8:13 PM
Reposted by HTTP Archive 💾
Ask me how excited I got seeing @johnmu.com cite my and @tamethebots.com's Page Weight chapter at Search Central Live
March 20, 2025 at 4:32 PM
Reposted by HTTP Archive 💾
Day 010 #100DaysOfPerf: In an apt complement and follow-up to the page weight post, working with media is rife with challenges. Media loading, media formats, media codecs... But it's what makes the web so valuable.
✨ MEDIA ✨ chapter from the @httparchive.org is another worth checking 🧵⬇️
March 16, 2025 at 4:32 AM
Reposted by HTTP Archive 💾
Though I shared a link from the @httparchive.org and the 2024 Web Almanac, sharing a chart from the actual archive: over 5 yrs, the page weight has grown almost 30%, and there has never been an indication of slowing down. The almanac chapter covers much of the data proving so. 🧵⬇️
March 16, 2025 at 1:30 AM
Reposted by HTTP Archive 💾
Thoughtful font loading can provide performance benefits. This @httparchive.org Web Almanac FONT chapter surprisingly touches on quite a bit, from formats, sizes and more. You can read it all here:
📚: almanac.httparchive.org/en/2024/fonts
Enjoy! #100DaysOfPerf
Fonts | 2024 | The Web Almanac by HTTP Archive
Fonts chapter of the 2024 Web Almanac covering where fonts are loaded from, font formats, font loading performance, variable fonts, and color fonts.
almanac.httparchive.org
March 11, 2025 at 8:04 PM
Reposted by HTTP Archive 💾
Day 007 #100DaysOfPerf: Since I touched on the @httparchive.org Web Almanac, decided to continue sharing more of their content. A note: The HTTP Archive really started as a way to see how the web was built, and its performance. So it will always have a hint of performance discussion. Today? FONTS 🧵⬇️
March 11, 2025 at 7:10 PM
Reposted by HTTP Archive 💾
🏁 The @httparchive.org Web Almanac's Performance chapter is a comprehensive report on the state of performance on the web, w/ focus on the Core Web Vitals. Shout outs @inesakrap.bsky.social + J. Zigisova for penning the near 8000 word piece.
#100DaysOfPerf
📚: almanac.httparchive.org/en/2024/perf...
Performance | 2024 | The Web Almanac by HTTP Archive
Performance chapter of the 2024 Web Almanac covering Core Web Vitals, with deep dives into the Largest Contentful Paint, Cumulative Layout Shift, and Interaction to Next Paint metrics and their diagno...
almanac.httparchive.org
March 10, 2025 at 6:49 PM
Reposted by HTTP Archive 💾
Day 006 #100DaysOfPerf: Let's keep it moving by highlighting a great piece from the @httparchive.org Web Almanac, and their #performance chapter. "No one ever complained about a fast website", is a classic quote over the years, and the Performance Chapter highlights web data and patterns 🧵⬇️
March 10, 2025 at 5:56 PM
We've just published the 19th and final chapter of the 2024 Web Almanac on JavaScript by Abdul Haddi Amjad and Nishu Goel.

almanac.httparchive.org/en/2024/java...
JavaScript | 2024 | The Web Almanac by HTTP Archive
JavaScript chapter of the 2024 Web Almanac covering the usage of JavaScript on the web, libraries and frameworks, compression, web components, and source maps.
almanac.httparchive.org
March 3, 2025 at 8:37 PM
Reposted by HTTP Archive 💾
I recently published my annual dive into the
@httparchive.org, focusing on page growth, #webperf and #ux:

www.speedcurve.com/blog/page-bl...

A common question is "How big SHOULD my pages be?" According to analysis by @infrequently.org, the ideal page should be <1.4 MB with <365 KB coming from JS.
February 5, 2025 at 7:12 PM
Reposted by HTTP Archive 💾
Just published the results of my annual dive into the @httparchive.org. Key findings:

😱 Med page has grown 8%
😱 90p page has grown 24%
😱 90p mobile page is 10MB
😱 Main culprits: JS & video

Dig in & learn what your page size targets should be and how to hit them: www.speedcurve.com/blog/page-bl...
SpeedCurve | Page bloat update: How does ever-increasing page size affect your business and your users?
The median web page has grown 8% in one year. How does this affect your Core Web Vitals, your search rank, your business and your users?
www.speedcurve.com
January 29, 2025 at 7:35 PM
Just about to start...
✨ The Web Almanac LIVE STREAM II ✨
featuring Chapters (authors):
🔸 SEO (Jamie, Mikael)
🔸 Privacy (Max O)
🔸 HTTP (Robin)
🔸 Cookies (Yana)
🔸 3rd Parties (Yash)
📆 Thursday, January 16th
⏰ 14h EST, everytimezone.com/s/6e8b3a3d
🔗 www.youtube.com/live/zCiMls2...
A 🔄 would be ✨🙏🏾✨.
January 16, 2025 at 6:56 PM
Reposted by HTTP Archive 💾
Great episode of search news from
@johnmu.com, and honoured to see a shout out for the
@httparchive.org web almanac SEO chapter!

youtu.be/tVSasQC6G_k?...
The Current State of SEO, Revamped Search Console Emails, and more! (January ‘25)
YouTube video by Google Search Central
youtu.be
January 15, 2025 at 10:27 AM
Reposted by HTTP Archive 💾
HTTP/1.1/2/3 are all present in unison on the web today, w/ a 21/70/9 split. Or is it? @programmingart.bsky.social will shed light on the @httparchive.org Web Almanac data, and the reality of the protocols' adoption. Join us to hear him share findings this Thursday
🔗 ⬇️
bsky.app/profile/henr...
January 13, 2025 at 5:11 PM
Reposted by HTTP Archive 💾
One of the @httparchive.org Web Almanac chapters we'll explore next week is SEO. Some interesting notes:
🔸 14% of robots.txt files return a 404 🤯 (1 in 7)
🔸 10%+ of pages have invalid elements in the <head>, which has important consequences.
🔹 Find out why + more 1 week today
🔗 See pinned tweet
January 9, 2025 at 5:49 PM
Reposted by HTTP Archive 💾
🕵🏾 PRIVACY. How many of you consider it OR even know what's happening? Some data:
🔸 94% of 📲 sites have 1 tracker
🔸 1/4 have 10
🔸 ~10% have anywhere from 55-85 🤯
Luckily, Thursday next week, we'll walk through the Privacy chapter from the @httparchive.org Web Almanac. (pinned post for links)
January 10, 2025 at 8:25 PM
Reposted by HTTP Archive 💾
📊 One of the @httparchive.org Web Almanac chapters we're discussing next week: COOKIES:
🔸 ~61% of sites use 3rd-party 🍪. Top 100k? ~73%-77%.
🔸 How are we being tracked? Yana Dimova will share the data w/ us!
📆 Thu Jan 16
⏰ 14h EST, everytimezone.com/s/6e8b3a3d
more info in pinned post.
January 11, 2025 at 7:35 PM
Reposted by HTTP Archive 💾
We're kicking off 2025 in style on Search With Candour with the inimitable @not-a-robot.com 🙌

Jamie joins me to dish all the details about the SEO chapter of @httparchive.org's The Web Almanac.

We discuss everything from robots.txt to core web vitals, canonical tags & much more!

Episode out now!
January 6, 2025 at 5:14 PM