Internet Archive

The Internet Archive is a multi-petabyte repository of all sorts of crazy things. The Wayback Machine is there, but there's a lot of other stuff there too.


 * website: https://archive.org

Petabox
There's a really outdated article on Wikipedia about the Petabox. It's what folks at the Internet Archive call the racks-and-racks of machines that serve as a giant distributed cluster, which (IIRC) holds over 50 petabytes of stuff that people upload to https://archive.org.

Wayback Machine

 * main article: Wayback Machine

The Wayback Machine runs on top of the Petabox. It's confusing. I never figured it out.

2019

 * main article: 2019

I worked at Internet Archive back in 2019. It was quite the experience. You can see my single blog post here:


 * https://blog.archive.org/author/robla
 * https://blog.archive.org/2019/06/29/two-thin-strands-of-glass/
 * This^ was after a fiber cut. I never did get around to telling the full story of that.

I talk about leaving the Internet Archive here:


 * https://robla.blog/2019/11/05/end-of-a-chapter/