Invisible web - Deep Web - Understanding

The Invisible web refers to the part of the World Wide Web that’s not indexed by the search engines. Most of us assume that that search powerhouses like Google and Bing are index all pages. But there are some places wherever a spiders of these search engines cannot enter. Take library databases which require a positive identification for access. or perhaps pages that belong to personal networks of organizations. Dynamically generated web content in response to a question  are typically left un-indexed by spiders.Google and others also don't capture pages behind private networks or standalone pages that connect to nothing at all. These are all part of the Deep Web.

University researchers say the Web you know -- Facebook (FB), Wikipedia, news -- makes up less than 1% of the entire World Wide Web. When you surf the Web, you really are just floating at the surface. Dive below and there are tens of trillions of pages -- an unfathomable number -- that most people have never seen.

Here are some more interesting stats about it
  1. The Deep Web is is thought to be almost 500 times bigger than the World Wide Web
  2. The World Wide Web contains 19 terabytes of information whereas the Hidden Web has +7,500 terabytes
  3. The Invisible Web has almost 550 billion individual documents and the WWW has 1 billion
  4. There are more than 200, 000 Deep Web websites
  5. The Deep Web represents the largest growing category with new information on the Internet.
  6. There is also quality content on the Deep Web and it is 1,000 to 2,000 times greater than the WWW
  7. The Deep Web seems to be very relevant to the information, as it doesn’t follow by search engines’ rules
  8. Even if it’s called the Invisible Web, 95% of it is public, free information

Comments

Amit K Agarwal said…
Brilliant ! But how do we access the deep web ?
The are tool available for this. But since it contains lots of sensetive and inappropiate data as well. It it should be use with csre.

Popular posts from this blog

Databases on the FDA Website

IPEXL - New Patent Search Tool

Employee Retention – A critical issue, why..and How to solve?