Mantisus
mantisus.bsky.social
Mantisus
@mantisus.bsky.social
Python Developer - https://github.com/Mantisus
🇺🇦
Web scraping expert.
Contributor crawlee-python
A while back I wrote a short article for the Crawlee blog about how you can scrape Bluesky - crawlee.dev/blog/scrape-...

If you were thinking about some rocket science technologies like AI agents, or creating 100500 accounts to access data. Then you're wrong.
May 16, 2025 at 9:32 PM
Perhaps someone will be interested in an article looking at how you can extract data from Crunchbase using Crawlee for Python

crawlee.dev/blog/scrape-...
How to scrape Crunchbase using Python in 2024 (Easy Guide) | Crawlee · Build reliable crawlers. Fast.
Learn how to scrape Crunchbase using Crawlee for Python
crawlee.dev
January 16, 2025 at 7:29 AM
The Crawlee for Python development team has released a blog post about the release of version 0.5.0 which I participated in the development of )

crawlee.dev/blog/crawlee...
Crawlee for Python v0.5 | Crawlee · Build reliable crawlers. Fast.
Announcing the Crawlee for Python v0.5 release.
crawlee.dev
January 16, 2025 at 7:28 AM
Excited to share: crawlee-python v0.5.0 has been released by Apify, including several of my PRs! 🎉

github.com/apify/crawle...

Speaking of production readiness - I'm already running 2 projects based on crawlee-python, so it's battle-tested and working well.
Release 0.5.0 · apify/crawlee-python
0.5.0 (2025-01-02) 🚀 Features Add possibility to use None as no proxy in tiered proxies (#760) (0fbd017) by @Pijukatel Add use_state context method (#682) (868b41e) by @Mantisus Add pre-navigation...
github.com
January 2, 2025 at 4:03 PM
Remember the good old days when a simple requests.get() was all you needed? Now we have to act more human than some humans do 😅

P.S. This post was written by a human. Probably.
December 28, 2024 at 6:02 AM