IDEAS home Printed from https://ideas.repec.org/p/sce/scecf9/1031.html
   My bibliography  Save this paper

Statistical Evaluation of Genetic Programming

Author

Listed:
  • M. A. Kaboudan

    (Penn State Lehigh Valley)

Abstract

A recent advance in genetic computations is the heuristic prediction model (symbolic regression), which have received little statistical scrutiny. Diagnostic checks of genetically evolved models (GEMs) as a forecasting method are therefore essential. This requires assessing the statistical properties of errors produced by GEMs. Since the predicted models and their forecasts are produced artificially by a computer program, little controls the final model specification. However, it is of interest to understand the final specification and to know the statistical characteristics of its errors, particularly if artificially produced models furnish better forecasts than humanly conceived ones. This paper's main concern is the statistical analysis of errors from genetically evolved models. Genetic programming (GP) is one of two computational algorithms for evolving regression models, the other being evolutionary programming (EP). GP-QUICK computer code written in C ++ evolves the regression models for this study. GP-QUICK replicates an original GP program in LISP by Koza. Both are designed to evolve regression models randomly, finding one that replicates the series' data-generating process best. Prediction errors from GP evolved regression models are tested for whiteness (or autocorrelation) and for normality. Well-established diagnostic tools for linear time-series modeling apply also to nonlinear models. Only diagnostic methods using errors without having to replicate the models that produced them are selected and applied to series. This restriction is avoids reproducing the resulting genetically evolved equations. These equations are generated by a random selection mechanism almost impossible to replicate with GP unless the process is deterministic, and they are usually too complex for standard statistical software to reproduce and analyze. The diagnostic methods are selected for their simplicity and speed of execution without sacrificing reliability. This paper contains four other sections. One presents the diagnostic tools to determine the statistical properties of residuals produced by GEMs. Residuals from evolved models representing systems with known characteristics are used to evaluate the statistical performance of GEMs. Another furnishes six data-generating processes representing linear, linear-stochastic, nonlinear, nonlinear-stochastic, and pseudo-random systems for which models are evolved and residuals computed. The final contains those residuals' diagnostics. Diagnostic tools include the Kolmogorov-Smirnov test for whiteness developed by Durbin (1969) in addition to statistical testing of the null hypotheses that the fitted residuals' mean, skewness, and kurtosis are independently equal to zero. Conclusions and future research are given.

Suggested Citation

  • M. A. Kaboudan, 1999. "Statistical Evaluation of Genetic Programming," Computing in Economics and Finance 1999 1031, Society for Computational Economics.
  • Handle: RePEc:sce:scecf9:1031
    as

    Download full text from publisher

    To our knowledge, this item is not available for download. To find whether it is available, there are three options:
    1. Check below whether another version of this item is available online.
    2. Check on the provider's web page whether it is in fact available.
    3. Perform a search for a similarly titled item that would be available.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sce:scecf9:1031. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: https://edirc.repec.org/data/sceeeea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.