It seems with all these website scrapers, we have forgotten about the Internet Archive. http://archive.org/web/


As awesome as the Internet Archive's Wayback Machine is, it is still under central control. Worse, since they (reasonably) abide by robots.txt rules, the BBC could easily block access in the archive to pages they have removed on their end. If you care about something, you need to fully "own" it.




