Loading Time Function... 
 

Welcome | Seminars | Stream Info. | Course Info. | Departmental Info. | Research | Resources | Students | Contact Us

Student Support

A Guide to Using the Internet for Research

The Internet is a massive network of computers which can offer users access to vast quantities of information and easy global communication. There are resources available on the Internet for almost every field of interest. This guide serves to explain what Internet resources are useful to research and gives some pointers on how to find information relevant to your research project.

It is convenient to classify the Internet resources into e-mail based communication and the World Wide Web (WWW) and this guide has been structured as such.

An additional guide for students writing their theses has been prepared by Tom Moultrie. It gives additional information on referencing and some pointers on academic standards on the web.

The Internet Research FAQ is another guide to using the Internet for research.

Electronic Mail

E-mail programs allow one to send messages to individuals or to groups of people. Discussion groups and mailing lists have been created where one can send mail to, and receive mail from, groups of people with similar interests.

Mailing lists and discussion lists

Mailing lists and discussion lists (groups) are devoted to particular subjects or fields of interest. There are a huge number different lists available on the Internet and new lists are being created almost continuously. A list related to your field of interest can be an excellent tool for research. Lists allow you keep up-to-date with the most recent announcements, developments and thoughts in a particular subject.

These lists are run by programs called listservers which store e-mail addresses of people who are interested in a particular subject. If a single e-mail message is sent to a particular listserver's list, then that message will be duplicated and sent to all members of the list. Listservers are thus an efficient means of communicating, with and between, a large group of people with a common interest (rather than sending each one a separate message).

The difference between mailing lists and discussion lists is that mailing lists are structured as announcement-only services whereas a discussion list is a forum where all subscribers can participate by asking and answering questions. With mailing lists there is usually only one person or organisation that can post information. Mailings lists are thus mainly used for distributing abstracts of new papers (published and/or working papers), for calls for papers, conference announcements, job advertisements, etc. Discussions groups on the other hand offer open discussion and debate amongst all subscribers. They can provide a forum for asking questions that will hopefully be answered by experts in the field. Although there are important differences, some people use the terms mailing lists and discussion lists interchangeably.

Discussion lists can be moderated or unmoderated. All mail sent to a moderated list must be approved by the moderator (the person in charge of the list) before the message will actually be distributed to all the list members. In unmoderated lists, any subscriber can post a message to the list without it first being approved. In some cases unmoderated lists contain many off-topic posts and suffer from a high "noise-to-signal ratio". On the other hand they can contain a wide range of freer thought and discussion. Which is the case depends on each individual list, its culture and the personalities of the subscribers. Join some lists and find out...

Finding mailing and discussion lists

Sometimes it is difficult to find out what lists there are in your field of interest. Lists are often advertised or mentioned in Web pages relating to your subject or in Usenet newsgroups (discussed below). One of the best ways of finding lists on a particular subject is to make use of some of the many Lists of Lists that are available on the Internet. Most of the better Lists of Lists are searchable and you can thus easily search for your subject in the list descriptions. However, currently no one List of Lists offers a complete listing of all discussion groups and thus it is advisable to use several List of Lists to search for your interests.

Links to some Lists of Lists are given in the Virtual Investments Library. In addition you can also find the "Wouters guide to finding mailing lists" there.

Subscribing to lists

Once you have found suitable lists, you join them by "subscribing". This is accomplished by sending a simple mail message to the listserver (NOT TO THE LIST). There are 3 common listserver programs: listserv, majordomo and listproc. An example of how to subscribe to a list is given below:

Name of the list: BUS422W_INVESTMENT

 

Address of the list server: A majordomo@southernlife.com

 

To subscribe to this list you send a message to majordomo@southernlife.com. Leave the subject blank and enter: SUBSCRIBE BUS422W_INVESTMENT in the body. (Note: if you have any signatures or comments that are automatically added to the bottom of your messages you must turn them off when subscribing to lists. Alternatively you can use the END command after your subscribe line to prevent the listserver from reading the rest of the message and your signature).

The above process is similar for both listserv and listproc lists. For listservs, however, you must add your first name and last name after the list name on the subscribe line.

e.g. SUBSCRIBE RISKNET Joe Bloggs

Soon after subscribing you should receive confirmation of subscription from the listserver. This message will explain how to unsubscribe from the list, how to send messages to the list, how to access the archives of previous postings and provide other general list information.

To send mail to a list to which you have subscribed you use the list name in the address. In the above example this would be, bus422w_investment@southernlife.com.

Many lists keep archives of all messages that have been posted to the list. These are either available by mail from the listserver or are located at a Web site. Details on how to retrieve archives are normally given in the welcoming (information) message when you join the list. When archives are kept on the listserver (as opposed to the Web) they are normally accessed by using GET commands sent to the listserver. You can usually get instructions on how to retrieve files from the listserver (and on all other listserver commands) by sending a message containing HELP in the body to the listserver.

Finding e-mail addresses of individuals

Currently the easiest way to find out someone's e-mail address is to ask them! Searchable directories of e-mail addresses are available but at present these directories are far from comprehensive.

Four11 Directory Services is the largest "white page directory" of e-mail addresses. It contains mainly e-mail addresses of people in the US but it may help you find someone. There is also a South African directory of e-mail addresses but at the time of writing it contained very few addresses. In most of these directories you can search by country, organisation and occupation as well as by name.

In addition, if this fails, you may find a directory of e-mail addresses at the Web page of the person's employer. Universities and research institutions generally have detailed directories of employee's e-mail addresses.

Usenet groups

With e-mail or a listserver, a mail message gets sent to a specific person or group of people. Usenet, also works by means of e-mail messages, however, it operates like a bulletin board rather than a postal service. Messages get sent to a central bulletin board where they are seen by anybody who looks at the board (not only the person to whom it was sent or the people on a specific list). Answers to messages are also posted to the board. In a newsgroup you have to actively enter the forum to read the messages postedwhereas with a discussion list all the messages get sent directly to your e-mail address. If you don't read Usenet news for a few days you can miss out on messages entirely as old messages expire and are removed from the "bulletin board" after a few days to make space for new messages.

Each newsgroup deals with a specific topic. Topics are grouped together into major categories. Below are some of the major categories:

  • alt: controversial or fringe topics
  • bionet: research biology newsgroups
  • bit: bitnet (a network of educational sites) newsgroups
  • biz: business related newsgroups
  • comp: computer and related subjects newsgroups
  • misc: well supported discussions which do not fit elsewhere
  • news: news about Usenet itself
  • rec: hobbies, games and recreation newsgroups
  • sci: science other than research biology
  • soc: social (often ethnic) newsgroups

In addition there are country and even organisation specific categories of newsgroups, such as:

  • za: South African newsgroups
  • uct: UCT newsgroups

There are now search engines that allow you to search through all the Usenet groups and messages for topics of interest. DejaNews currently offers the largest searchable Usenet archive. You can search by subject, keywords, author, etc.

Reference.COM is a useful search engine that provides access to the archives of over 16,000 newsgroups and a large number of mailing lists. Furthermore, Reference.COM allows you to create "stored queries" that will automatically rescan the archives at a user-specified interval and will e-mail you the results. This stored query is an example of an "intelligent agent" that will seek out news and information that interests you and then give you details on how to access this information.

Reference.COM has an advanced search function where you can interactively refine your search using message header fields such as author, organisation, subject and newsgroup name. There are also ready-made query templates which pre-select the right newsgroups and lists for a particular topic. Just complete the rest of the form and submit your query.

The World Wide Web (WWW)

The World Wide Web (WWW) is the most versatile tool available on the Internet. It consists of hypertext pages that can contain text and graphics (and even sound and video). These pages can be linked to other pages by means of hypertext links.

Uniform resource locators (URLs)

The "address" of resources on the Internet are specified by a URL (Uniform Resource Locator). URLs identify both the type of resource and specific location of it on the Internet. An example of a URL is given below:

http://www.southernlife.com/actuary/uct/research.htm

The first part of the URL (before the colon) specifies the access method. The part of the URL after the colon is interpreted as the address. In this example the access method or protocol is "http". This identifies the Hypertext Transfer Protocol (HTTP) which is the communication protocol used for the World Wide Web. The part after the double slash in the URL is the address of the computer (www.southernlife.com). The last part of the URL (after the first single slash) is the name of the file at the site. In this example the file referred to is this research guide.

Note that there are other access methods or protocols (e.g. ftp and gopher) but most Web browsers support all the protocols and so you should be able to access almost all resources from within your Web browser.

Finding sites on the WWW

The Web is so big that it is often difficult to find resources in your field of interest. Good starting points are the search engines and the various Web Directories. Virtual Libraries can be also be very useful for finding resources of interest.

If you are looking for the Web site of a company or an educational institution you can often be quite successful with educated guesses. Most organisations will use a URL that contains their name in a logical way so as to make them easy to find. When guessing URLs try http://www.company.com or http://www.company.co.za for South African companies. Common endings for academic sites are .edu for American universities and .ac.za for South African educational sites. Another common ending for URLs is .org which is normally used for official bodies or non-profit organisations.


Yahoo!

Yahoo! is one of the most popular and well-known directories. You can browse through entire sections or you can search the whole directory for your desired subject.

When you search Yahoo! it provides a list of links matching your search criteria. It also lists the names of several other search engines at the bottom of the page. Clicking on a link to one of these other search engines will conduct the current search on the specified search engine. Currently there is no one search engine that is best. They each search different databases using different methods and so using a range of search engines is often the best way to find good links.

Note that Yahoo! searches are case-insensitive - i.e. Yahoo! doesn't differentiate between upper and lower case. Yahoo! supports both the and and or Boolean operators. You never actually type the Boolean operators in your search string - you just separate the words with spaces. By default Yahoo! conducts an and search for all the words in your search string. But you can specify on the search options page that Yahoo! rather conducts an or search.

Also on the search options page you can specify whether Yahoo! searches for your string as a substring (this is the default) or as complete (and distinct) words. If Yahoo! is searching for substrings, then the words must appear consecutively in the order specified in your search string. The final setting on the search options page is the number of matching documents to be displayed on each of the pages generated. The default is 25 but you can specify 10, 25, 50 or 100 matches per page.


Alta Vista

Alta Vista is a very powerful search engine that has indexed a massive number of Web sites. It supports some quite complex searching techniques including Boolean searches. This search engine's database is so large that you must be very specific when using it otherwise you will find far too many links! Alta Vista has a simple search and an advanced search facility. The advanced search is actually better and is often easier to use. Alta Vista can search both the WWW and Usenet newsgroups.

At the Alta Vista Web site there are several pages explaining how to use the search engine. Some of the most important aspects of the help are summarised below but this guide does not attempt to give complete details on using Alta Vista.

When searching for words or phrases Alta Vista ignores all punctuation. Thus U.S.A. and U S A are identical (both 3 word phrases to Alta Vista). These two phrases are not the same as USA (1 word). If you are searching for a specific phrase (i.e. the words must be found together) then "enclose the phrase in double quotes".

If you specify a word in a search string using lower case letters then Alta Vista conducts a case-insensitive search for that word. If, however, you use an upper case letter anywhere in a word then you force Alta Vista to find exact case matches on that entire word. Thus, internet will find Internet, INTERNET, internet, or even iNtErNeT. But if you use Internet, Alta Vista will only match Internet and none of the other capitalisation variations. It is recommended that you use all lower case letters unless you specifically know the case of the word you want to find.

You can use Alta Vista to search for the occurrence of any of a family several related words by using * notation. For example, invest* will find invest, investing, investment and investments. However, it will also find words like investigate. To use the * notation Alta Vista requires that the * appears only at the end of a word and that there are at least three characters before the *. Furthermore, Alta Vista will only match strings of up to five additional lower case letters (no capitals and no digits).

When Alta Vista displays the list of matching documents the search engine puts the links that it thinks are most useful at the top of the list. "Usefulness" is based on weighted scores for pages. The exact scoring technique is quite complicated and not published completely, but, for example, documents with titles containing the words of the search string are scored highly. Also documents containing many words from the search string (if an or search is used) are weighted higher than those which contain only a few of the search string words. And documents that contain the matching words multiple times are also scored highly.

Due to this weighted scoring, even if Alta Vista returns a few thousand matching documents you should still find what you are looking for in the first 30 or so links. If not, then you should refine your search string and try again.

Alta Vista also supports various constraints in the searching process. You can use the title constraint to find only matching words in the title of the document. For example, title:bond will only find pages that contain the word bond in the document title. Other useful constraints are link (finds pages that include a link to a certain site) and url (finds pages whose URL include the matched words). Note that if you wish to search for multiple words in a title you must either use quotes (e.g. title:"bond rates") or proceed each word with title:.

Simple Query: In this search Boolean operators like and,, or and not are ignored in search strings. Alta Vista will search for Web pages that contain any of the words in the search string. Those pages that contain all or most of the words in the search string are displayed near the top of the list of matching pages. Below are some example search strings for the simple search.

bond rates
This will find all documents containing the words bond or rates or both. Pages containing both words will appear towards the top of the list. Note that in this search the words do not necessarily appear next to or even close each other in the document.

bond +rates
This will force Alta Vista to return only matches that contain at least the word rates and here again pages with both words will be at the top of the listed. This search will not find any matches which contain bond but do not contain rates (while the first search would have found pages with bond only). [Note there is a space between the word bond and the +sign but there is no space between the + sign and the word rates.

bond +rates -gilt
This is similar to the second search but it will exclude all documents that contain the word gilt.

"bond rates"
This will find only those documents that contain the exact phrase bond rates. There can be no words separating the search phrase on the matching documents. This search would probably be most useful in finding sites which provide bond rates.

Advanced Query: In the advanced searches you must use at least one of the operators and, or, not and near to combine words and phrases in search strings. Parentheses are used to control nesting of operators. The operators can be specified in upper or lower case. The results of the search can be optionally ordered by specifying (in the Results Ranking Criteria field) a set of words or phrases which then receive higher weighting. Some examples of advanced searches are given below.

bond or rates
This finds documents contain at least one of these words (documents with both words are also returned and these are near the top of the list).

bond and rates
In this search both words must be present in the matching documents.

(bond or gilt) and rates
This will find documents containing bond rates or gilt rates (or both). Note that parentheses are used to control the precedence of the Boolean operators.

bond and not gilt
This will find all documents containing bond that do not contain gilt. Note that to use the not operator you must proceed it with the and operator.

bond near rates
This will find documents where the words bond and rates appear within ten words of each other. The near operator is particularly useful when searching for someone's name. A name may be in the form of Joe Soap, Joe A. Soap, Joe Andrew Soap or Soap, Joe and the near operator will find any of these orderings.

Virtual Libraries

Virtual Libraries can also be very useful resources for conducting research. On the Internet you will find both general reference libraries and subject-specific Virtual Libraries. The general libraries contain a broad range of subjects and information. There are links to some good general libraries in the Virtual Investments Library. Subject-specific libraries obviously have much more detail on the topic than general libraries. A subject-specific Virtual Library relevant to your research topic should be an excellent resource. However, currently there are many subjects that do not have any dedicated online libraries.

The Virtual Investments Library is an example of a subject-specific library. In this case the library is in fact specific to the needs of students studying an investment and asset management course at UCT. This Virtual Investments Library is structured as a list of links to investment related sites on the Web. It links mainly to on-line magazines and newspapers as well as financial information sites. Some of the other Virtual Libraries contain complete written works or at least abstracts of published material.

Electronic Journals and Magazines

Many printed magazines, journals and newspapers (and TV stations) are now publishing on-line versions. The on-line equivalents are not generally the complete publications. Some sites do publish complete on-line versions but require you to subscribe to the service. There are a few sites (e.g. Financial Mail) that do offer complete on-line versions that are available at no charge.

Due to the ease of publishing on the Internet there are now many electronic journals and magazines that have no printed equivalents. These are called e-journals and e-zines. Some e-journals have already established a name for themselves as acclaimed academic journals. Many e-journals and e-zines are distributed in mailing list form and can usually be found in the same way as you would find a mailing or discussion list (process outlined above). Alternatively you can use one of the lists of electronic journals and magazines. There are links to some of these lists in the Virtual Investments Library.

Referencing Material from the Internet

Referencing information obtained from the Internet is slightly different to referencing printed material and currently no official referencing standards exist. One of the problems with Internet material is that there is no guarantee that a Web source you used will still be available in the future at the same location or even anywhere. Past Usenet news messages and mailing list messages may be stored in an archive but not all lists or groups are archived (yet).

It has been suggested that references include the date that the information (e.g. Web page) was accessed. Most referencing styles proposed for Internet resources require the following information (some of which may not always be available): the author, the date published or written, the URL, the (most recent?) date accessed by the researcher, the name of the information supplier (on-line publisher) and the medium. In the absence of standard referencing styles, it is best that you record at least all this information for all Internet sources that you use. Below are some examples of how Internet references can be presented:

Individual works:
Author/editor. (Year). Title (edition), [Type of medium]. Producer (optional). Available protocol (e.g. HTTP): Site/Path/File [Access date].

Example:
Pritzker, T. J. (No date). An early Fragment from Central Nepal, [Online]. Available HTTP: http://www.ingress.com/~astanart/pritzker/pritzker.html [1995, June 8].

Write "No date" when the electronic publication date is not available.

Parts of works:
Author/editor. (Year). Title. In Source (edition), [Type of medium]. Producer (optional). Available: Supplier/Database identifier or number [Access date].

Discussion list messages:
Author. (Year, Month Day). Subject of message. Discussion List, [Type of medium]. Available E-mail: DISCUSSION_LIST@e-mail_address [Access date].

Example:
RRECOME. (1995, April 1). Top ten rules of film criticism. Discussion on All Forms of Cinema, [Online]. Available E-mail: CINEMA-L@america.edu [1995, April 1].

Author's login name, in uppercase, is given as the first element. If the author's full name is known (i.e. specified in the message) then this can be used. The general consensus is, however, not to give the author's full e-mail address.

For more details on Internet referencing see the following sites:

 


Tom Moultrie has prepared an additional guide for students writing their theses. It gives additional information on referencing and some pointers on academic standards on the web.

The Internet Research FAQ is another guide to using the Internet for research.


This research guide was prepared by Steven Levin - Last updated 16/07/96

Any suggestions or comments about this guide are gladly accepted. Did you find it useful? Do you know of any better Internet research guides? What should be added?


Student Support

Site Selector:     |  Browse Course Sites


Disclaimer | Contact Faculty Office | IT Helpdesk

Copyright © 2009 Faculty of Commerce -- University of Cape Town

Managed by Commerce I.T.


Do You Need UCT's Official Logo and Stationery Templates?
1686 page views since 2007/04/25   Last modified: 01/11/2007 11:13:42     Download Adobe Reader 9.1.0    Convert your files to PDF