What is Google?
Via John Battelle, Rick Skrenta’s remarkable piece on what Google have actually built. They don’t just have the world’s best search engine, they have the world’s largest and most scalable platform for developing huge web-based applications.
Google has taken the last 10 years of systems software research out of university labs, and built their own proprietary, production quality system. What is this platform that Google is building? It’s a distributed computing platform that can manage web-scale datasets on 100,000 node server clusters. It includes a petabyte, distributed, fault tolerant filesystem, distributed RPC code, probably network shared memory and process migration. And a datacenter management system which lets a handful of ops engineers effectively run 100,000 servers. Any of these projects could be the sole focus of a startup.
[ ... ]
While competitors are targeting the individual applications Google has deployed, Google is building a massive, general purpose computing platform for web-scale programming.
Fascinating stuff.
Adam Bramwell - 5th April 2004 08:48 - #
Harry Fuecks - 5th April 2004 11:56 - #
Milan Negovan - 5th April 2004 14:40 - #
As I recall, Google use Python quite heavily, so it wouldn't surprise me to find out that at least some of it was implemented in that.
Jim Dabell - 5th April 2004 15:10 - #
Harry Fuecks - 5th April 2004 16:00 - #
This Slashdot post quotes a Cringly column that talks about Googles attitude towards faulty hardware.
Extremely cool way of thinking.
Micah - 5th April 2004 17:32 - #
A petabyte of fault tolerant data storage; fault tolerant in software accross thousands of servers.
If the servers were on more than one physical site, just what would it take to cause data loss? And how do they write and test what must need to be incredibly low-bug software?
SB - 6th April 2004 02:25 - #
saumendra - 12th April 2004 10:19 - #