Web-based Statistics:
Lecture 11
Thursday, May 25, 2000

WebStat:

URL: http://www.stat.sc.edu/webstat/

* WebStat is freely available data analysis software for use over the World Wide Web. It is written in the form of a Java applet and is designed to run on any of the three major platforms (Mac, PC, Unix). It is brought to us by Webster West (west@stat.sc.edu) and Todd Ogden (ogden@stat.sc.edu) from the Department of Statistics at the University of South Carolina.

* The package was created as an initial attempt to solve many of the problems that exist with the delivery and use of modern statistical software. Many times statisticians develop procedures in languages such as Splus, SAS, Minitab, etc., which are very specific to statisticians. Students and other potential users may not have access to these languages, and therefore may not be able to use the procedures. By using Java and the World Wide Web, WebStat should reach the broadest possible audience of any statistical software of its kind.

* Support for instructors and older versions are available from the main page. E-mail help is also available at: webstat@stat.sc.edu.

* Let's look at the basic layout of the software:

Hit the orange button!

* Getting data into the table:

In addition to manually entering data into the data table, there are three other options for importing data. They are:

+ Getting data from a URL:

If your data is saved as ASCII text and in a file in a World Wide Web accessible location, you may import the data into WebStat by providing the URL of the file. Simply go to the Data menu , select Get data, and then select from URL within the sub-menu. When the dialog window appears, type in the URL of the data set you wish to load. Also, use the check box to specify whether or not the first line of the file should be used as variable names. By default, it is assumed that the first line does contain variable names. If your data is properly formatted for WebStat then you can ignore this option completely. The image below shows the URL for the Iris data set. Once you have entered the necessary information, click Okay. After a few moments, the data will appear in the data table.

+ Pasting data

You may also paste data contained in a local file into WebStat. Go the Data menu and select Paste data. The method you use from then on depends on the browser you are using.

Netscape: A dialog window will appear as shown below. Select and copy your data from the file and then paste it into the text area provided. On a PC, once you have copied your data from the file, you may simply right click the mouse in the text area and then select Paste. Use the check box to specify whether or not the first line of the file should be used as variable names. By default, it is assumed that the first line does contain variable names.

Internet Explorer: Since Internet Explorer will not allow clipboard operations in unsigned applet windows, another technique must be used. Your data will be temporarily written to a file on the WebStat server and then imported as a URL into WebStat. First, a new browser window will appear as shown below. Select and copy your data from the file and then paste it into the text area provided. On a PC, once you have copied your data from the file, you may simply right click the mouse in the text area and then select Paste. You may also type your data in manually or enter a formatting header into the text field if you so choose. The image below shows the Iris data in the new browser window. When you are finished, click Submit.

+ Loading data from a file

This option will only work for users who have the HotJava browser. Since most people don't use this browser, it is not anticipated that this option will be very useful for most WebStat users (too bad for us, I guess!).

* The website also provides us with a number of sample data sets- see under 'sample data sets' on the main page, complete with information on them such as what all the variable names refer to and what they are measuring. We'll use some of these today in our exercises.

* Overview of our impressions of the site:

We thought highly of this site. It is extremely user-friendly, and everything is explained clearly and in detail. There are downfalls of the site, however. There is no search engine, and attempting to locate documentation for the site will send you around in circles. The statistics that are available are limited in some ways (but we'll look at that as we work through the exercises). However, everything that the software does is laid out in front of you so it is easy to determine whether the site would be useful for your needs.

* Exercises:

  1. Lets look at icecream consumption and its relation to price, income of consumer, and temperature. Remember, if you want some more information on the data set or variables, click on icecream under sample data sets on the main page. Now, it seems likely to us that icecream consumption increases with temperature. What would be a good statistical method to assess this? Lets go ahead and do it. Report the results (e.g. R-square or such).
  2. Here we'll go back and use Dr. Vukasinos data set, so we can get the feel of importing a data set. So, go to 'get data' under 'data' and import: http://www.math.usu.edu/~vukasino/teaching/spring2000/complab/student_data1.prn

    So, now that we have that in (and are well practiced at that form of data entry), we realize that Webstats only deals with numerical values, and since we want to look at whether students of Biology are fatties (overweight) compared to students of Health, we need to put the data set into wordpad, and replace:

    Biology with 1
    CompSci with 2
    Health with 3
    Other with 4
    Chemistry with 5

    Now, using an ANOVA, lets try to work out if Biology students are really fatties or not. It would also be of value to look at some summary statistics here, so go ahead and generate those. Can we answer our question now?

* Homework:

Using the data set on Labor force available as a sample data set in the site:

* More exercises for extra practice with Webstats:

Go to: http://www.stat.sc.edu/110/

Or click on the link under 'featured sites' on the main page.

* References on Webstat