The Sum Of Google's Services Use 2 Billion Lines Of Code And 86 Terabytes Of Data
Being a massive company, Google has a ton of cool stats to share, and fortunately, most of those are well-known. Its revenue? That's public information. The same goes for the amount of employees it has and the number of data centers it has scattered around the globe, among many other things.
However, some information that hasn't been divulged in the past relates to the amount of storage the company's codebase requires, and the number of lines of codes it encapsulates. It's a good thing it's Friday, because your mind is about to be effectively blown.
Speaking at the @Scale conference earlier this week, Google's engineering manager Rachel Potvin treated attendees to some stats that most companies wouldn't be so willing to share. Across all of its services - a la Gmail, search, YouTube, et cetera - there are a total of 1 billion files in the overarching repository.
Think about that for a moment. That's 1,000,000,000 files that directly relate to the source of Google's services, not files that users have uploaded or things like that. As for actual code lines, that doubles to 2 billion. For a bit of a comparison, the Linux kernel includes about 15 million lines of code.
Other stats include 9 million unique source code files and 35 million commits (there are 45,000 each and every workday). Total storage that this all requires? An impressive 86TB, or 86,000GB to make it sound even cooler.
Potvin notes that this massive repository is shared across 10 datacenters worldwide so that it remains undeniably safe. A smart move, as it'd take something truly catastrophic to cause concern for the data integrity at that point. And something tells me that Google is a company that could easily afford this!