@julien51 Killer demo that Ben, Emma, and Aaron put together at IWC. You sign in via Micropub, then it consumes h-feed and RSS/Atom feeds and lets you post replies via Micropub. Really incredible that they built a working first version in 4-5 hours. https://github.com/benwerd/indiereader
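For anyone curious what "post replies via Micropub" looks like on the wire, here is a rough sketch of a form-encoded Micropub reply request. It is not taken from the indiereader code: the function name, endpoint URL, token, and post URLs are all made up for illustration, and the request is only built, never sent.

```python
from urllib.parse import urlencode, parse_qs
from urllib.request import Request


def build_micropub_reply(endpoint, token, in_reply_to, content):
    """Build (but do not send) a form-encoded Micropub request for a reply.

    Hypothetical helper: the name and arguments are assumptions made for
    this sketch, not part of indiereader.
    """
    # A reply is an h-entry whose in-reply-to property points at the
    # post being answered.
    body = urlencode({
        "h": "entry",
        "content": content,
        "in-reply-to": in_reply_to,
    }).encode("utf-8")
    return Request(
        endpoint,
        data=body,
        method="POST",
        headers={
            # Access token obtained when the user signed in.
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
    )


# Placeholder values only; none of these come from the demo.
req = build_micropub_reply(
    "https://example.com/micropub",
    "hypothetical-token",
    "https://example.org/post/1",
    "Great demo!",
)
print(req.get_method(), req.full_url)
print(parse_qs(req.data.decode("utf-8")))
```

Passing `req` to `urllib.request.urlopen` would actually submit the reply to whatever Micropub endpoint the user's site advertises.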