#Jsoup
After this week I feel like I know more about #CSS that any human has a right to

Doing dynamic tagging of website templates via JSoup

Had no idea what could be done with CSS until I dug in on this

Bet I’ve only just scratched the surface however

#BuildInPublic
October 11, 2025 at 3:29 PM
I take a similar approach with HTML, only I use jsoup to go to Clojure data structures and do my analysis from there.
September 21, 2025 at 3:41 PM
📢 𝗕𝗿𝗶𝗱𝗴𝗶𝗻𝗴 𝘁𝗵𝗲 𝗚𝗮𝗽: 𝗛𝘁𝗺𝗹𝗨𝗻𝗶𝘁-𝗷𝘀𝗼𝘂𝗽 𝗜𝗻𝘁𝗲𝗴𝗿𝗮𝘁𝗶𝗼𝗻

Excited to showcase htmlunit-jsoup (github.com/HtmlUnit/htm...), a bridge library that opens up the entire #jsoup (jsoup.org) ecosystem to #HtmlUnit (www.htmlunit.org) developers.
July 20, 2025 at 2:07 PM
🚀 Introducing bx-jsoup for BoxLang!
https://cstu.io/59c59f
We're thrilled to announce the release of bx-jsoup - a powerful new module that brings enterprise-grade HTML fluent parsing and cleaning capabilities
July 15, 2025 at 8:40 PM
🚀 Domine o Webscraping com Kotlin e Jsoup! Aprenda a extrair dados de páginas HTML e explore os erros mais comuns de desenvolvedores em uma tabela interativa. Dê o próximo passo na sua jornada de programação e transforme dados em insights valiosos! 🌐✨ Vídeo Completo Aqui
June 14, 2025 at 12:02 PM
If you are looking for kotlin-first alternative of 𝗷𝘀𝗼𝘂𝗽(a popular java library for parsing HTML and XML), you can try using 𝗸𝘀𝗼𝘂𝗽.
Bonus: It's also a kotlin multiplatform library.

Github link: github.com/fleeksoft/ks...
#kotlin #kmp #ksoup #scraping #parser
GitHub - fleeksoft/ksoup: Ksoup is a Kotlin Multiplatform library for working with HTML and XML. It's a port of the renowned Java library Jsoup.
Ksoup is a Kotlin Multiplatform library for working with HTML and XML. It's a port of the renowned Java library Jsoup. - fleeksoft/ksoup
github.com
May 29, 2025 at 5:10 AM
摸鱼岛:一站式在线信息聚合与休闲平台

摸鱼岛介绍 摸鱼岛是一款开源在线平台,旨在为用户提供信息聚合与轻松互动服务。网站地址为 React、Antd、json-viewer 与 aj-captcha-react,确保界面简洁高效。后端采用 SpringBoot、MySQL、Redis、Netty、WebSocket 与 JSoup 爬虫,结合 Redis 信息压缩将响应速度从1秒优化至约100毫秒。 功能方面,摸鱼群聊支持与编程导航用户实时交流,直接发起技术提问或分享 B…
摸鱼岛:一站式在线信息聚合与休闲平台
摸鱼岛介绍 摸鱼岛是一款开源在线平台,旨在为用户提供信息聚合与轻松互动服务。网站地址为 React、Antd、json-viewer 与 aj-captcha-react,确保界面简洁高效。后端采用 SpringBoot、MySQL、Redis、Netty、WebSocket 与 JSoup 爬虫,结合 Redis 信息压缩将响应速度从1秒优化至约100毫秒。 功能方面,摸鱼群聊支持与编程导航用户实时交流,直接发起技术提问或分享 B 站学习视频。聚合信息源涵盖编程导航、知乎、微博与网易云热榜,用户可一站查看编程与娱乐两类热门内容。每日待办模块帮助用户记录和管理日常任务,提升效率与自律。内置小游戏五子棋与2048,可与 AI 或好友对战,增添休闲乐趣。工具箱提供 JSON 格式化功能,满足开发者日常数据处理需求。老板键功能分为普通跳转与隐藏展示两种模式,按 Ctrl + Shift + B 可快速启动隐藏页,再按 Ctrl + Shift + S 进入设置自定义页面。 摸鱼岛特色功能 摸鱼群聊 这个功能也是编程导航的🐟友提的意见,一开始是不打算走的,后面看见许多人都想要这个功能就加上了,编程导航 ➕ 摸鱼聊天,美滋滋,不会的技术直接提问、推荐的 B 站视频直接分享,大家一起学习摸鱼,共同进步。 聚合信息源 🌈 项目聚合了市面上比较常见的信息源:如编程导航、知乎、微博、网易云等信息源热榜,后端使用了爬虫➕直接请求接口获取数据等操作,来进行数据源的爬取,通过 Redis + 信息压缩,提高了数据💡的返回速度(PS:开始 1s =》优化后 100ms 整整十倍)。在这里你能看到编程相关的以及娱乐相关的,学习娱乐两不误。 每日待办💡 这个也是我个人的需求,嘿嘿,我平时也需要每天列一下待办,如果我能在自己网站写下待办岂不美哉于是有了一下界面。 小游戏 🎮 小游戏呢目前有五子棋和 2048 ,这些小游戏都是 AI 帮我写的 五子棋 五子棋支持与 AI 对战、与朋友在线对战(只需要创建房间,把房间号发给好友就可以啦,我跟朋友经常偷偷对战😊) 五子棋对战效果如下⬇️: 2048 工具箱🔧 JSON 格式化工具 由于身为一个程序猿,经常需要 JSON 格式化来格式数据,我在想能不能让 AI 给我写一个 JSON 格式化工具呢,于是有了一下界面⬇️: 摸鱼岛网站地址 GitHub: 官网:
www.ahhhhfs.com
April 18, 2025 at 10:53 AM
- better devex with configuration properties in places that didn't quite work before
- Docker Model Runner support
- enhanced model integrations
- JSoup HTML document reader
- even better MCP support (session based architecture!)
April 10, 2025 at 9:36 PM
🚀 Desvende os segredos do Webscraping! Aprenda a usar Kotlin e Jsoup para extrair dados de páginas HTML, enquanto explora os erros mais comuns de devs em uma tabela interativa. Transforme seus conhecimentos em código de forma prática e divertida! 🌐✨ Vídeo Completo Aqui
March 30, 2025 at 12:02 PM
刚写完一个链家 SpringBoot 航母级爬虫,房产数据玩家狂喜!

🔥 硬核功能:
✅ 多城市秒级抓取(JSoup 精准解析)
✅ 高颜值Web看板(Tailwind CSS 真·生产力)
✅ Excel/CSV双格式导出(Apache POI 稳如老狗)
✅ 中介看了沉默的房源详情页(图片+文字全收录)

👇 灵魂卖点:
💡 一套代码打通链家数据流水线
💡 租房党比价神器 / 地产人分析外挂
💡 拒绝996,数据导出直接扔给Excel

附开发心路:凌晨4点的JSoup报错,比咖啡更提神🙃

github.com/ctkqiang/Lia...
GitHub - ctkqiang/LianJiaScraper: 这是一个基于Spring Boot框架开发的链家房源数据爬虫系统。本项目致力于为用户提供一个便捷、高效的房源数据采集解决方案。通过自动化爬取链家网站的房源信息,系统能够实时获取各个城市的房源详情,包括房屋价格、位置、面积、户型等关键信息。
这是一个基于Spring Boot框架开发的链家房源数据爬虫系统。本项目致力于为用户提供一个便捷、高效的房源数据采集解决方案。通过自动化爬取链家网站的房源信息,系统能够实时获取各个城市的房源详情,包括房屋价格、位置、面积、户型等关键信息。 - ctkqiang/LianJiaScraper
github.com
February 9, 2025 at 6:49 PM
Question for my meta downloaders:

Trying to download certain info before deleting. I went with html, which, as it turns out, is just links to facebook pages, not an actual download of data. Has anyone done the JSOUP option? and how did that work for you?
January 29, 2025 at 3:14 PM
🚀 Aprenda Web Scraping com Kotlin e Jsoup! 🌐 Descubra como manipular HTML e desbravar a DOM para extrair dados valiosos. Ideal para quem quer aprimorar suas habilidades em programação e web! 🖥️ Não perca essa oportunidade! Vídeo Completo Aqui
January 22, 2025 at 12:02 PM
Try loading it into JSoup?
January 6, 2025 at 6:06 PM
🚀 Desvende os segredos do Webscraping! Aprenda a programar em Kotlin com Jsoup para extrair dados valiosos de páginas HTML. Descubra as cagadas mais comuns dos devs e como evitá-las! 🌐💻 Dê o primeiro passo na manipulação de dados e navegação na DOM! Vídeo Completo Aqui
December 3, 2024 at 12:02 AM
I've written a new tutorial article on scraping with the Jsoup lib

briacd.com/web-scraping...

#dev #springboot #scraping
Web Scraping with Jsoup in a Spring Boot Project
Practical guide to configure and use Jsoup in a Spring Boot project to fetch and manipulate HTML data.
briacd.com
November 23, 2024 at 2:08 PM
Want to build a Web Scraping API like a pro? Learn how to use Java, Spring Boot, and Jsoup to create powerful data-driven solutions!

Check out the step-by-step guide here: www.3idatascraping.com/how-to-build...

#WebScraping #Java #SpringBoot #Jsoup #TechTips #CodingCommunity #Developers
November 21, 2024 at 8:32 AM
#babashka v1.12.195 is out now!
Including Jsoup which makes bb compatible with the hickory library !
#clojure
November 12, 2024 at 9:36 AM
Merged jsoup into the #babashka master branch now. Test it out with:

bash <(curl raw.githubusercontent.com/babashka/bab...) --dev-build --dir /tmp

hickory should work from source

#clojure
November 11, 2024 at 10:05 AM
JSoupよりSeleniumの方が自動化向いてそうなのでこっち採用するか
October 9, 2024 at 8:28 AM
🚀 Aprenda Web Scraping com Kotlin e Jsoup! 🌐 Descubra como manipular páginas HTML e navegar pela DOM de forma simples e eficaz. Criamos uma tabela divertida com os erros mais comuns dos devs. Não fique de fora dessa! 💻✨ #Kotlin #WebScraping #Jsoup Vídeo Completo Aqui
October 5, 2024 at 12:02 AM
Pergunta, vc precisa fazer trigger de algum evento? ou é entrar e coletar?

Se for isso eu usaria o jsoup, caso teria que trigar eu usaria o selenium e obviamente com um n instancias ec2 pra ficar mudando o ip.
September 16, 2024 at 12:40 PM
...cause we just naively exposed jsoup, and now the extensions all depend on jsoup's interfaces and luaj's coercion quirks. but if we didn't do that, we'd have hundreds of lines of boilerplate to wrap the library. or we'd have a really shitty buitin library and force pain onto the extension devs
September 11, 2024 at 3:41 PM
さてCIマシンのメモリ増やしたので某アプリにもdependabot適用して回してみるとjsoupのアップデートでテスト失敗することが分かったので依存ライブラリの最新化作業がかなり効率よくできるようになった。
March 16, 2024 at 1:09 PM
Gonna have jsoup (jo soup) because smelled notsbands jsoup (jon soup) and then sad bug is having bsoup (bug soup) so its obviously Souping Season
December 17, 2023 at 1:08 AM