Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Scraping with BeautifulSoup works pretty well, but i've had a lot of problems with nested elements with no id since Xpath is not available on BeautifulSoup. Perhaps this would make it a little easier. I'll report back after trying it.


For the time being it doesn't use the BeautifulSoup parser so it may not work on very bad html, but I'll add an option an option to use it.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: