INTRODUCTION

ACCESSING THE INTERNET

SEARCHING THE INTERNET

A CLOSER LOOK AT GOOGLE

COMMUNICATING VIA THE INTERNET

EVALUATING WEBSITES

ONLINE RESOURCES

ADDITIONAL RESOURCES

 

 

 

 

 

 

THE INTERNET:

A GUIDE TO
WORKING SMARTER

SEARCHING THE INTERNET

WORLD WIDE WEB SEARCH TOOLS
There are several search tools available for finding information on the World Wide Web. They include Subject Directories, Search Engines, Specialized Databases, Subject Guides, and Meta Search Engines. One of the keys to effective searching is determining what tool will best answer your question.

SUBJECT DIRECTORIES
Directories are made up of selected Web pages that are evaluated and organized into hierarchical categories. They may also include Specialized Databases and Subject Guides. Web directories are compiled either by humans or computers that follow preset selection criteria and link to pages of high quality. Subject directories are time consuming to compile and maintain and tend to be smaller databases. Because they are smaller databases, the scope of the results will be smaller, too. Web directories are most useful when browsing broad subject areas, though it can be difficult to find topics that are not easily categorized. The lack of controlled vocabulary within and among directories can make browsing difficult and time consuming. To ease this problem, most directories offer keyword searchable indexes. Some Subject Directories include a web search engine along with their subject directory. Yahoo is the oldest and most popular directory on the Web, listing over one million sites. One hundred fifty editors categorize the Web for Yahoo. Other Subject Directories include:

Internet Public Library Reference Center
http://ipl.sils.umich.edu/ref/
Looksmart
http://search.looksmart.com
Open Directory
http://dmoz.org

USING SUBJECT DIRECTORIES
To use a directory first identify the broadest subject meeting your information needs. Follow the links until you reach those closest to your desired subject. Bookmark relevant sites to review later as you move through the links.

EXERCISE 1
Go to Yahoo, http://www.yahoo.com, and browse through the category Health. Notice that the topic Health is broken down into categories like Alternative Medicine, First Aid, and Travel Health and Medicine. Click on Medicine and notice that this category covers topics from acupuncture to wound care. Click on the topic Pediatrics and scroll down to the category section and click on Sleep Medicine. Find the topic Sudden Infant Death Syndrome and click on that. Click on the link to the American SIDS Institute and browse the website.

SEARCH ENGINES
Internet Search Engines are giant indexes that enable keyword searching. Search engines use computer programs referred to as robots or spiders that "crawl across the Web" to find information that matches your keywords. Indexes are much larger than directories and are more current. Search engines vary in size, frequency of updating and search options. While they all rely on spiders to collect and index information, each individual search engine uses its own interface and criteria for matching searches to documents. There is currently no common standard that search engines follow. They each operate a little differently and will retrieve unique documents with limited overlap. This is one argument for using more than one search engine, especially when you are having difficulty locating the information you desire.

Some Search engines use link analysis as well as popularity rank results to determine the quality of a page. For example, Google uses an algorithm called Page Rank which calculates the quality of a page by the number of pages that link to it. In other words, it considers a link from one page to another as a vote for that page. The more links to that site, the more votes it receives and ultimately the more relevant it is according to Google. Google also takes into consideration the quality of the site casting the vote. If it is what Google determines to be a major site like, ABC, NBC, CBS, CNN, etc, those votes are weighed more heavily than less popular sites. This type of link analysis could be compared to the number of references that a review article sites.

Major Search Engines include:
All the web: http://www.alltheweb.com
Alta Vista: http://altavista.com
Google: http://www.google.com
Teoma: http://teoma.com

Search engines are most useful when you have a very specific subject to research. To keep up on the latest news regarding search engines go to Search Engine Watch (http://www.searchenginewatch.com), Search Engine Showdown (http://www.searchengineshowdown.com) or Search Engine Colossus (http://www.searchenginecolossus.com/).

USING WEB SEARCH ENGINES
To perform a keyword search, you will need to analyze your topic, choose a search engine and then formulate a search statement. Compose a list of synonymous terms and distinctive phrases. Use several different search engines and then compare the results, which can be very different. Use Phrase Searching, Boolean Searching, and Truncation where appropriate. Check the Help file for the search engine you choose to make sure that they support these types of searches. Check the Help file often for any changes or enhancements to the search engine you are using.

EXERCISE 2
Go to Google, http://www.google.com, and type in "sudden infant death syndrome" in the subject line and click on Search. Notice how many documents are retrieved. Take a look at some of the sites retrieved.

POWER SEARCHING TECHNIQUES
When using search engines to search the Internet, often thousands of documents are retrieved. To limit/focus a search and retrieve more relevant documents, power searching techniques are used. These include Phrase Searching, Boolean Searching, Nesting and Truncation. Check the Help file for techniques available for the search engine you are using.

PHRASE SEARCHING
When inputting a keyword search with more than one term in the search statement, consider phrase searching. Entering a phrase enclosed within quotation marks signals the search engine to search for that specific phrase. If you enter multiple words and omit the quotation marks, the search engine looks for references that include any of the words regardless of whether they are related.

EXERCISE 3
Go to Google, http://www.google.com, and type in "sudden infant death syndrome" (enclose the phrase in quotation marks) in the subject line and click on Search. Look for any differences in the search results and compare to your last search. What are the differences?

BOOLEAN SEARCHING
Boolean Searching refers to a form of logic applied to a search. The Boolean operators are AND, OR, NOT. Some search engines require these operators to be capitalized. It will be useful to always use capitals. These terms may vary from search engine to search engine, but the ideas are the same. Check the Help file if in doubt.

  • AND narrows a search. It intersects two sets and is used to reduce results.
  • OR broadens a search. It is the union of two sets and is used to increase results.
  • NOT narrows a search. It is the exclusion of one set and is used to reduce results.

EXERCISE 4
Go to Excite, http://www.excite.com/, and in the subject line type "sudden infant death syndrome" AND prevention. Browse through the documents retrieved and look for the SIDS Network home page. Click on this link and see if you can find information on SIDS and smoking.

NESTING
When you have a search statement that contains more than one type of Boolean operator, use parentheses to combine synonymous terms.

The search statement: Cancer AND (teen youth adolescent) signals the search engine to look for any of the terms within the parentheses and then combine the results with the term outside of the parentheses. (Note: some search engines do not require the OR operator between the search terms. Check the Help file if in doubt.)

EXERCISE 5
Go to Northern Light, http://www.northernlight.com/. In the subject line type (infant OR newborn) AND "sleep disorders". Click on the link to the most relevant site you find and browse the website.

TRUNCATION
Truncation allows you to search on the root of a larger word. Most commonly, an asterisk (*), but sometimes a question mark (?), is used to signal the search engine to look for the root of the word and all of the possible variations.

For example, Child retrieves references with the word child in it. Child* or Child? retrieves references with the word child in it as well as child's, childish, childishly, childishness, children, children's, childhood, childlike, etc.

EXERCISE 6
Go to Webcrawler, http://www.webcrawler.com, and in the subject line type in "sleep disorders" AND child*. Examine the results. Notice that Webcrawler lists directory search results and then also Web search results. Notice under Web results, there is an option to show link summaries or just the titles.

META SEARCH ENGINES
Search engines vary in size and often retrieve different results. Using a Meta Search engine is a quick way to determine which search engines will yield the best results. Simple searches work best in Meta Search engines. Meta Search Engines search several search engines at the same time but most only retrieve the top 10-50 results from each search engine. Power searching techniques may not be properly processed or available. One Meta Search engine, Ixquick, http://www.ixquick.com/, knows which search engines can support Boolean, Phrase, Truncation, Nesting, etc, searching. It will only send your search statement to search engines that can process it properly.

USING META SEARCH ENGINES
When using Meta Search Engines you enter the search statement one time and it is sent to several search engines simultaneously. The results are then blended together. They generally only catch about 10% of search results in the search engines they cover. Some examples of Meta Search Engines include Dogpile, Mamma, MetaFind and Ixquick.

EXERCISE 7
First go to Ixquick, http://www.ixquick.com/, and in the subject line, type in SIDS. Take a look at the results. Can you tell which search engines were searched? Next, go to Dogpile, http://www.dogpile.com/index.gsp, and perform the same search. Notice the differences between the two Meta Search Engines.

SPECIALIZED DATABASES
(THE INVISIBLE OR DEEP WEB)

The Invisible Web is made up of content that Search Engines either cannot or will not index. It consists of searchable databases, password-protected sites and documents that are hidden by firewalls. These types of websites are inaccessible to the spiders, robots, and WebCrawler programs that are used to compile indexes for search engines. They can index their location, but not the information they contain. These pages often lack links that point to a page that the programs can follow, or the page may be made up of data types that search engines are not programmed to index. Specialized Databases are similar to Subject Directories as they are often organized by subject categories and evaluated and organized by humans rather than computer programs. The documents found by using these specialized databases are referred to as the Deep or Invisible Web.

Subject Directories that provide access to the Invisible Web include:

Invisible Web: http://www.invisibleweb.com
Complete Planet: http://www.completeplanet.com
Librarians Index to the Internet: http://www.lii.org
The Invisible Web Directory: http://www.invisible-web.net
Infomine: http://infomine.ucr.edu
Academic Info: http://www.academicinfo.net

SPECIALIZED SEARCH ENGINES
Specialized Search Engines provide access to invisible web resources and high quality web sites. These search engines typically index fewer but more relevant sites. Subject specialists identify and evaluate relevant, high quality sites to be included in the databases.

Examples of Specialized Databases covering Health and Medical include:

Combined Health Information Database: http://chid.nih.gov/simple/simple.html
Health wise: Go Ask Alice http://www.goaskalice.columbia.edu
Mayo Clinic: http://www.mayoclinic.com

SPECIALIZED DIRECTORIES/SUBJECT GUIDES
Specialized Directories or Subject Guides are collections of general Web pages compiled by experts in their field. These directories are often referred to as distributed subject trees. They rely solely on volunteers who are knowledgeable in a given subject area to classify and index the Internet. The subject trees are not comprehensive in scope and provide no authority control.

Examples of Specialized Directories include:

AboutCom
http://www.about.com

WWW Virtual Library
http://www.vlib.org/overview.html

 

© Florida AHEC Network, 2003