Hacker News

andrew_zhong today at 6:01 AM

Good point. The anti-bot patches here (via Patchright) are about preventing the browser from being detected as automated — things like CDP leak fixes so Cloudflare doesn't block you mid-session. It's not about bypassing access restrictions.

Our main use case is retail price monitoring — comparing publicly listed product prices across e-commerce sites, which is pretty standard in the industry. But fair point, we should make that clearer in the README.


Replies

plastic041 today at 7:23 AM

robots.txt is the most basic access restriction, and it doesn't even read it, while passing itself off as human [0]. It is about bypassing access restrictions.

[0]: https://github.com/lightfeed/extractor/blob/d11060269e65459e...

zendist today at 7:14 AM

Regardless, you should still respect robots.txt.
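
For illustration only, here is a minimal sketch of what checking robots.txt before a fetch could look like, using Python's standard-library `urllib.robotparser`. This is not code from the linked project; the robots.txt body, the `price-monitor-bot` user agent, and the `shop.example` URLs are all hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body; a real crawler would download it from
# the target site (e.g. https://shop.example/robots.txt) before crawling.
EXAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout/
Allow: /products/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    agent = "price-monitor-bot"  # hypothetical user agent string
    for url in (
        "https://shop.example/products/123",
        "https://shop.example/checkout/cart",
    ):
        verdict = "allowed" if is_allowed(EXAMPLE_ROBOTS_TXT, agent, url) else "disallowed"
        print(f"{url}: {verdict}")
```

A crawler would call something like `is_allowed` before each request and skip any URL that comes back disallowed.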

messe today at 6:36 AM

> It's not about bypassing access restrictions.

Yes. It is. You've just made an arbitrary choice not to define it as such.
