The Library of Congress Loves Every Kind of Tweet, but It Can’t, Can’t Search Optimize Every Tweet

Elsewhere on the internet

Susana Polo

Published: Jan 4, 2013 04:19 pm

Recommended Videos

Back in 2010, the U.S. Library of Congress announced that it would be archiving every public tweet made since 2006. They’re back again today, to say “Yeah, so. You guys tweet, like, a lot.”

In the nearly three years since announcing their initiative, the Library of Congress has actually managed to create its archive of every public tweet between 2006 and 2010, and has developed a complete system for taking in and archiving everything that comes out of twitter and saving it to their servers. That means that these days, they’re taking in about half a billion tweets per day. So what’s the problem, you ask?

Well, the Library of Congress, being a place where people do research, doesn’t feel like the archive is really up to their standards yet. Primarily because performing one search query on it can take about twenty four hours to complete. From their announcement:

The Library has assessed existing software and hardware solutions that divide and simultaneously search large data sets to reduce search time, so-called “distributed and parallel computing.” To achieve a significant reduction of search time, however, would require an extensive infrastructure of hundreds if not thousands of servers. This is costprohibitive and impractical for a public institution.

The Library’s ultimate goal is to create an archive that offers “free, indexed, and searchable access” to legislative researchers and scholars, and the fact is that the technology simply isn’t there yet. But the Library is working on it, with folks from Twitter and Gnip, the company that collects their tweets for them, and with researchers themselves.

But until they work things out, all that embarassing stuff you tweeted while watching “The End of Time” and hope no one ever finds again is safe.

(via Gizmodo.)

The Mary Sue is supported by our audience. When you purchase through links on our site, we may earn a small affiliate commission. Learn more

Jul 15, 2022

What Will Conventions Look Like in 2021?

Nov 13, 2020

Dear White People, I Need To Matter Beyond a Thank You

Nov 11, 2020

Have You Ever Seen a Ghost?

Oct 26, 2020

taylor swift,, voting, tennessee, blackburn, conservatives, vote.org

Taylor Swift Says She’ll Re-Record All Her Old Albums to Regain Ownership of Them

Aug 22, 2019

Author

Susana Polo

Susana Polo thought she'd get her Creative Writing degree from Oberlin, work a crap job, and fake it until she made it into comics. Instead she stumbled into a great job: founding and running this very website (she's Editor at Large now, very fancy). She's spoken at events like Geek Girl Con, New York Comic Con, and Comic Book City Con, wants to get a Batwoman tattoo and write a graphic novel, and one of her canine teeth is in backwards.