logoalt Hacker News

andrew_zhongyesterday at 3:26 PM0 repliesview on HN

I agree that When pages have similar structure, for one time extraction as it is (not reasoning from context), scraping with selectors is the way to go.

This library also supports HTML as input so running a browser is not required.