Monday, August 24, 2009

A Search Engine Explanation

By Justin Harrison

With the widespread growth of the World Wide Web a specially designed tool to search through the information available was developed called the search engine. Using both algorithms and human editing the search engine will present results organized in a list consisting of web pages, information, links, and images. These results are viewed by the user after inputting a keyword or keyword phrase in to the search engines search field.

Web crawling, indexing and searching combine in that order to obtain the most accurate results. Mass amounts of information on millions of web sites are stored and then retrieved relevant to the user's request. A web crawler is also known as a spider, it analyzes every link and indexes all information for faster retrieval.

Mata tags and even words from the webpage are studied to classify the webpage and its content. All these data are stored for future usage.

The major search engines, such as Google amass all, or a miniature portion of the source page, or "cache", in addition to information the web page offers. The search engine, AltaVista stores every word from every page. Storing the cache helps the search engine filter more easily because web pages are updated constantly. Google's technique of indexing relieves the "linkrot" and allows users to be sure that the content they find in their search results will be up to date and utilizable. The cache can be helpful when obsolete information is removed. The cache allows users to find and recover information from archived sources.

Search engine users normally input a keyword or key phrase into the search field. The engine will search for their particular keyword and key phrase on the World Wide Web. The search engine index will provide an organized list of results with the best matched web pages. A short summary of each webpage describing the contents is provided along with the list.

All the search engines look to enhance their performance by ensuring that they deliver exactly what the user looks for. The problem is accentuated by the abundance of web pages containing the keyword or the keyword phrase. However, by using web crawlers and indexing, search engines manage to filter all the sites that are irrelevant to the search being made even if it has the keywords. They have created their own unique processes for examining different web pages and their contents.

Page rank is latest addition in the techniques used by search engines to sort out various web pages and their contents. Page rank decides the relevance of a particular page by studying the correlation between its meta tags, descriptions, keywords used and the content of that webpage. The search engines rank those sites high that have association with high ranked web pages. The page rank is essential for any web page or site as it determines its probability of featuring at the top of any particular search.

About the Author:

No comments: