Search Engines – How They Work

The term search engine is often loosely-applied to describe both spider- or crawler-based Search Engines and Internet Directories that are created with human intervention. Directories are not true "search engines," but they accomplish the same thing. However, the user really doesn't care how they achieve the results as long as they are the desired ones sought. Although both Search Engines and Directories set out to accomplish the same goal, they work differently.

Spiders, Crawlers and Other Creepy Things

True Search Engines, such as HotBot, use spidering or crawling technology to automatically create their listings. They crawl or spider the web, looking for what users search upon.

As you change your web pages and add new content, Search Engines will eventually find those changes. This can affect your ranking. Page titles and actual content become increasingly important.

The spider visits and reads the web page, then follows links to other pages within the site. Then the spider returns to the site according to Search Engines settings and looks for changes. The spidered information must then go to what is known as the "index" or "catalog."

Search Engine software (and their algorithms) is the final, crucial part. This is how the program(s) searches through content in the Index to find, match and rank users' search.

The following chart shows how different engines rank web pages and who uses what to include and rank those pages.


Spiders Web Pages Submitted or Not
  AltaVistaFast SearchGoogleNorthern Light

Does Not Follow Frame Links
  ExciteFast Search

Does Not Follow Image Maps
  ExciteFast SearchGoogle

Does Not Support Robots Meta Tags
  Excite

Link Popularity Helps Ranking
  All

Learns Page Change Frequency
  AltaVista

Allows Paid Inclusion or Pay-per-Click
  AltaVistaFast SearchGoogleGoToLookSmartMSNYahoo

Indexes Full Body Text
  All

Does Not Support Meta Tag Description
  GoogleNorthern Light

Does Not Support Meta Tag Keywords
  ExciteFast SearchGoogleNorthern Light

Does Not Support Alt Text
  ExciteFast SearchInktomiNorthern Light

Direct Hits Assist Rankings
  HotBot

Penalizes Site with Meta Refresh Tags as Spamming
  AltaVista

Penalizes Site with Invisible Text as Spamming
  AltaVistaGoogleExciteNorthern Light

Penalizes Site with Tiny Text as Spamming
  AltaVistaGoogle

NOTE • Inktomi results form portions of AOL Search, HotBot & MSN
• Excite covers portions of Excite-owned Magellan and WebCrawler
• FAST Search covers crawler data used by Lycos

Internet Directories

Internet Directories, such as Yahoo, depend on human intervention for its listings. You are usually asked to select the proper category for your site (can be difficult and time-consuming) and then submit a short description of your site. Sometimes it is left up to the editor of that particular category to describe your site.

These Directories often require payment in order to review your site within 1-2 weeks. Some require payment in order to be reviewed at all. Still others offer free reviews but with no time guarantee. It can take as long as 3 months to get listed in these Directories – if you get listed at all! Keep in mind that even if you pay to have your site reviewed, that does not guarantee that they will actually list your site.

Things that help improve your ranking on a true Search Engine are not necessarily the things that will improve your ranking on a Directory. Here is where good content, fast-loading, user- and browser-friendly pages, properly optimized come in. You must remember that a human being is reviewing your site – and thousands of others.

Major Players


(in alphabetical order)

AltaVista Definitely one of the major concerns when getting your site listed. It has comprehensive, wide-ranging coverage and search options that make it popular among users. In addition to spider-based matches, it allows you to search specifically for news, shopping, and multimedia. It also gets results from LookSmart.
AOL Search The external version gets its listings from the Open Directory, Inktomi and spider results as backup to the directory information. Members are allowed to search AOL content as well.
Ask Jeeves A human-powered search service that tries to answer exact questions from the user and direct them to the proper page(s).
Excite Uses spidering technology for it's web page index and gives access to directory results from LookSmart.
Fast Search/All The Web One of the largest indexes on the Internet. Also known as All The Web, this is a showcase for FAST Search's technologies.
Google Link popularity is the key to Google. Google uses its own massive index and also provides results from Yahoo and Netscape.
GoTo An increasingly popular pay-per-click directory. Web page owners can bid on their ranking for particular search terms.
HotBot With many advanced searching features, HotBot is popular with researchers. First page of results usually come from Direct Hit and Inktomi. It's directory information comes from the Open Directory project.
IWon Gives away daily, weekly and monthly prizes in an innovative marketing scheme. It's directory comes directly from Inktomi.
LookSmart A directory of web pages reviewed by humans. It also provides Directory results to MSN Search, Excite and many others.
Lycos Main listings come from the Open Directory project, and then secondary results come from the FAST Search engine and Direct Hit.
MSN Search This service is powered by LookSmart's directory of websites and gets its secondary results from Inktomi. Some sites are also pulled from RealNames and Direct Hit.
Netscape Search Netscape Search's web page matches come mainly from it's own index, the Open Directory and also gets some results from Google.
Northern Light Another researcher's favorite because of its large index and ability to gather documents by topic. It also has special collection documents from thousands of sources, including news wires, magazines and databases. While these documents can be searched, there is a charge of up to $4 to view them.
Open Directory Uses volunteers to review and edit its catalog. Netscape, Lycos and AOL Search all utilize the Open Directory's index.
Yahoo The oldest major website Directory on the Internet and it is the second most popular search, next to Google. It has the largest human-compiled index of the web with over 1 million sites listed. If a search fails to find a match within it's own listings, it supplements its results with those from Google. Google matches are also shown after all Yahoo matches.

Using Both Technologies

With the vast number of websites being added to the Internet daily, some Directories have combined spidering technology. They usually organize them in a way that presents the Directory listings first and then presents the spidered results.

We Don't Know it All but...

We'll do our homework and give you our best effort. CONTACT US to ascertain if we're right for you!

 


Web Design... Since Before the Turn of the Century!™

© 1997-2007 Graphic Studio • All Rights Reserved.

W3C Validation of CSS

WelcomeOur Ad Philosophy Website DesignWebsite HostingFree Website AnalysisSearch EnginesMeta TagsOptimization TipsPaid Search Engine InclusionsClient ListingCollateral ServicesRelated InterestsTestimonialsContact Us