IRL

Internet Archive celebrates 10 petabytes of data preserved

The Internet Archive, the online library that lets you search historical versions of Web pages, is throwing a party in honor of its 10 quadrillionth byte of data stored.

Photo of Curt Hopkins

Curt Hopkins

Article Lead Image

The Internet Archive, whose “Wayback Machine” is a searchable portal to the history of the Web, is hosting a party Thursday night to celebrate 10 petabytes of data archived since its founding in 1996.

Featured Video

“Lets bring millions of books, music, movies, software and web pages online to over 2 million people every day and celebrate the 10,000,000,000,000,000th byte being added to the Archive,” the IA wrote in a blog post.

The latest materials added to the archive include 350,000 TV news broadcasts, a collection of Balinese literature, a digital archive of the 2011 Japanese tsunami and “hundreds of newly digitized Home movies and other ephemeral films.”

The Wayback Machine alone contains archived versions of 150 billion Web pages.

Advertisement

A petabyte, for those not in the computationally alphanumerical know, is one quadrillion bytes, or pieces of information consisting of eight bits each. So the Internet Archive contains… well, a great deal of information. By way of comparison, Google processes about 2.5 terabytes per day and only stores three terabytes—1 trillion bits each—on any given day.

The “Ten Petabyte Party” kicks off at 6:00pm PT Thursday at the Internet Archive’s San Francisco headquarters, with a cocktail reception at which legendary computer scientist Don Knuth will be rocking his organ.

The celebration will also be live-streamed, for those not able to attend in person.

Photo via Twechie/Flickr

Advertisement
 
The Daily Dot