The numbers game Google vs Cuil
Monday, August 4th, 2008If you pay attention to the tech industry, or more specifically the Search Business, you have seen a lot of hype surrounding both Google and Cuil. Google recently announced they had reached the 1 Trillion mark. Cuil just came in with a bang saying they have indexed 120 Billion web page stating they have indexed more web pages than anyone else. So who is the biggest? Is someone lying?
Neither one is. It’s plain and simple marking. Both are playing with numbers and slight of terminology so that the detail is easily missed.
It is quite possible that Cuil has spent more resources on their crawler to scan web pages. They simply have managed to put a lot more effort into the gathering of data. It is quite possible they have managed to index 120 billion pages. This is no small matter. In my own exploration of indexing web pages, I found that the index really is a gigantic set of data. For 7000 indexed web pages, my index has 12 million records. It’s not optimized in any way, but you get the idea. Put in that context you can see that number may not be representive of what is important to the person doing the searching.
Look closely at the Google statement though. It says “1 trillion URLS”. It does not say they have indexed 1 trillion web pages. Those are two different things. Again, back to my own exploration of search engines. For 7000 indexed web pages, I have 300,000 URLS. Again, what google has done is no small matter, but needs to be put into context. What good are those 293,000 URLS if you can’t find them in the search engine.
So who is right? Which is better? What’s the meaning of all this? Pretty much nothing. It’s all marketing.
My personal opinion is that Google does a much better job of finding relavent data. Period. I do an ego search on Google, I find a few book reviews I’ve done on Amazon. If do the same on Cuil, I get the Amazon results but they are burried under a pile of websites that have hijacked Amazons book reviews. The reason for this could be that Cuil’s index is bigger, or that Google is doing better ranking of the same data. I can’t tell.
I do like the context and suggesting that Cuil does. And it has a slick interface. I also like the simplicity of Google seach. I think the results that Cuil presents will begin to improve as they gain experience and figure out what is really important to the users. Google will then really have something to watch out for.