DATE & TIME:  Monday, December 1, 2003 4:10 pm

LOCATION:   Snedecor 319
                                                
SPEAKER:   Adele Cutler, Department of Mathematics and Statistics,

                                           Utah State University, Logan, Utah
 

TITLE: Random Forests - a Look Inside the "Black Box"
 

ABSTRACT:

Random Forests are among the most accurate off-the-shelf classifiers
available, and they can also be used for cluster analysis and
regression.  They do not overfit, so they can handle problems in which
there are thousands of variables and relatively few observations. Examples
include microarray and mass spectrometry analysis.

However, statisticians want more than accurate prediction: we usually want
to understand what is going on in the problem, and answers to such
questions as:
    * how well separated are the groups?
    * are there unusual sub-groups?
    * which variables are related to the groups, and how?
  In this talk, I will summarize the random forests algorithm and
demonstrate some visualization software that allows us to answer these
questions (and more), using examples from microarray analysis, the genetics
of autism, and satellite imaging.

* - Joint work with Leo Breiman, Department of Statistics, University of
California, Berkeley

COFFEE: 3:45 p.m., 104 Snedecor Hall