- Interesting that this article calls wikipedia.org a high-quality web site. Do they mean for the URLs or the info content?
- What is a "hashing function"? How does it enable a crawler to recognize which URLs it's responsible for?
- Crawlers get spammed too, good to know!
- Does a human indexer oversee the indexer?
- When the article gives "the Onion" example as a search query, unless the searcher adds the word "satire" and/or "newspaper" or something else in that neighborhood, I don't see how the user could fault the search engine for not returning the desired result.
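
On the hashing question above, here is a rough sketch of how I understand the idea, not necessarily what the article's crawlers actually do. A hash function turns a string into a number the same way every time, so if each crawler machine has an index, any machine can work out who owns a URL without asking the others. The function name `crawler_for`, the choice of SHA-256, and hashing only the host name are my own assumptions for illustration.

```python
import hashlib
from urllib.parse import urlsplit


def crawler_for(url: str, num_crawlers: int) -> int:
    """Return the index of the crawler assumed to be responsible for this URL."""
    # Assumption: hash only the host name, so every page on one site
    # lands on the same crawler machine.
    host = urlsplit(url).hostname or ""
    digest = hashlib.sha256(host.encode("utf-8")).digest()
    # Turn the first 8 bytes of the hash into an integer, then map it
    # onto the range 0 .. num_crawlers - 1.
    return int.from_bytes(digest[:8], "big") % num_crawlers


# Two pages on the same host are assigned to the same crawler.
print(crawler_for("https://en.wikipedia.org/wiki/Web_crawler", 4))
print(crawler_for("https://en.wikipedia.org/wiki/Deep_web", 4))
```

If that reading is right, a crawler that finds a link just computes the number and either keeps the URL (if the number is its own) or hands it off to the machine that owns it.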
The Deep Web: Surfacing Hidden Value
- What's the difference between a search engine and a search directory?
- What causes there to be a deep web and a surface web? Is it a choice or a consequence of the content?