by Jonathan Shakes,
Marc Langheinrich,
and Professor Oren
Etzioni at
the University of Washington
Directed Search
Most of the time,
Ahoy! can find someone's homepage in the many
references provided by the Metacrawler.
Sometimes, though, a search engine like the Metacrawler
is not enough.
Ahoy! has a method of locating homepages not
indexed by any of the web's
various search engines:
it "guesses" the URL of the page.
To inform its guesses, Ahoy! extracts patterns that are common
between other homepages it has found at the same institution.
If Ahoy! cannot find the page using the search engines, and if
Ahoy! has previously learned something about the patterns in pages at
the target institution,
Ahoy! will create a series of guesses and check to see if
any of them are the desired page.
Here's an example of the output while Ahoy! is trying to guess an URL:
...
All crawler returned. No candidates found
Ahoy is trying to locate homepage by itself. The following
are Ahoy's Hypotheses for the institution you specified:
- http://www.engr.wisc.edu/~<L>/homepage.html
- Connecting...
- Trying www.engr.wisc.edu:80/~jsmith/homepage.html
- Trying www.engr.wisc.edu:80/~john/homepage.html
- Trying www.engr.wisc.edu:80/~johns/homepage.html
- Trying www.engr.wisc.edu:80/~jonsmi/homepage.html
- Trying www.engr.wisc.edu:80/~smith/homepage.html
- Trying www.engr.wisc.edu:80/~js/homepage.html
- Trying www.engr.wisc.edu:80/~johnsmith/homepage.html
Waiting for answers
- Found Page at http://www.engr.wisc.edu:80/~john/homepage.html: John's Homepage
...
[
About Ahoy! |
Help |
FAQ |
Bugs |
Register |
Search
]
ahoy@cs.washington.edu