Stat 6520: Nonparametric Density Estimation and Smoothing

Spring, 2004
TR 12:00 - 1:15
Geology 310

Please note: There will be no class Thursday, April 1.

Assignments:



Examples:

Instructor: Michael Minnotte
Office: Lund 201-C
Phone: 797-2844 (office)
E-mail: minnotte@math.usu.edu
Office Hours: TWR 10:00 - 11:00 or by appointment

Nonparametric density estimation and nonparametric regression are powerful tools for exploratory data analysis, and allow us to gain greater understanding of our data and the populations they represent without requiring perhaps unjustified assumptions as to the distributional forms of that data. This course will examine theoretical, applied, and implementational issues in a variety of methods. Key techniques will include histograms, frequency polygons, and estimates using splines, kernels, and wavelets. Implementation, efficiency, and applications in one, two, and high dimensions wil be discussed.

Prerequisites: A good foundation in probability through expectation and variance of continuous random variables, such as Math 5710 or equivalent, will make things much easier. You may take this concurrently with 5710, but come talk to me if this is necessary.

Assignments: There will be a brief homework assignment every week. Additionally, there will be several in-class labs and out-of-class projects during the semester. These will generally require some computer work and a write-up, and will be worth more than the weekly assignments. Finally, each student will also be expected to individually produce a written and oral report on a related topic of his or her choosing during the last two weeks of class. Assignments will be handed out in class and posted to the web site.

Grades: Each assignment will include a value in points. Most homeworks will be 10 points, and the labs, projects, and reports will generally be around 50 points. Your final grade will be determined by the sum of your points in all assignments.

Text: Simonoff, Jeffrey S., (1996), Smoothing Methods in Statistics, Springer-Verlag.
This text provides a good survey of the smoothing methods we will be discussing in this course, but be warned that a lot of material will come from additional sources, especially the book by Scott and the one by Wand and Jones listed below.

Software: We will use the R computer package. R is a Gnu-license (freeware) clone of the S-Plus package, and is available for free download (Windows and Unix) from the Comprehensive R Archive Network (below). I will spend some time in class going over the use of R, and you will have the opportunity to do some work in class to gain experience while I can help you.

R Sites
The Comprehensive R Archive Network
Windows R Setup Executable Download - click on rwXXXX.exe, where XXXX gives the version number
R Frequently Asked Questions (FAQ) List
R for Beginners (58 page pdf file)
An Introduction to R (100 page pdf file)
Data Analysis and Graphics Using R -- An Introduction (112 page pdf file)


Other Sources: Beyond the required text, additional material will be drawn from a number of sources. Some additional useful references are:

Disability Statement: If a student has a disability that will likely require some accomodation by the instructor, the student must contact the instructor and document the disability through the Disability Resource Center, preferably during the first week of the course. Any requests for special considerations relating to attendance, pedagogy, taking of examinations, etc. must be discussed with and approved by the instructor. In cooperation with the Disability Resource Center, course materials can be provided in alternative formats - large print, audio, diskette or Braille.

Late Adds: The last day to add this class is January 26. Attending this class beyond that date without being officially registered will not be approved by the Dean's Office.

Disclaimer: The instructor reserves the right to alter anything about this course, pretty much on whim (but he probably won't).


Return to Mike Minnotte's home page.
Last updated: January 4, 2004