IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v53y2009i8p3082-3093.html
   My bibliography  Save this article

CART algorithm for spatial data: Application to environmental and ecological data

Author

Listed:
  • Bel, L.
  • Allard, D.
  • Laurent, J.M.
  • Cheddadi, R.
  • Bar-Hen, A.

Abstract

Most statistical learning techniques such as Classification And Regression Trees (CART) assume independent samples to compute classification rules. This assumption is very practical for estimating quantities involved in the algorithm and for assessing asymptotic properties of estimators. In many environmental or ecological applications, the data under study are a sample of some regionalized variables, which can be modeled as random fields with spatial dependence. When the sampling scheme is very irregular, a direct application of supervised classification algorithms leads to biased discriminant rules due, for example, to the possible oversampling of some areas. The CART algorithm is adapted to the case of spatially dependent samples, focusing on environmental and ecological applications. Two approaches are considered. The first one takes into account the irregularity of the sampling by weighting the data according to their spatial pattern using two existing methods based on Vorono tessellation and regular grid, and one original method based on kriging. The second one uses spatial estimates of the quantities involved in the construction of the discriminant rule at each step of the algorithm. These methods are tested on simulations and on a classical dataset to highlight their advantages and drawbacks. They are then applied on an ecological data set to explore the relationship between pollen data and presence/absence of tree species, which is an important question for climate reconstruction based on paleoecological data.

Suggested Citation

  • Bel, L. & Allard, D. & Laurent, J.M. & Cheddadi, R. & Bar-Hen, A., 2009. "CART algorithm for spatial data: Application to environmental and ecological data," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 3082-3093, June.
  • Handle: RePEc:eee:csdana:v:53:y:2009:i:8:p:3082-3093
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(08)00443-X
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hennig, Christian & Hausdorf, Bernhard, 2004. "Distance-based parametric bootstrap tests for clustering of species ranges," Computational Statistics & Data Analysis, Elsevier, vol. 45(4), pages 875-895, May.
    2. Gey, Servane & Poggi, Jean-Michel, 2006. "Boosting and instability for regression trees," Computational Statistics & Data Analysis, Elsevier, vol. 50(2), pages 533-550, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Avner Bar-Hen & Servane Gey & Jean-Michel Poggi, 2021. "Spatial CART classification trees," Computational Statistics, Springer, vol. 36(4), pages 2591-2613, December.
    2. LeSage, James & Banerjee, Sudipto & Fischer, Manfred M. & Congdon, Peter, 2009. "Spatial statistics: Methods, models & computation," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2781-2785, June.
    3. Avner Bar-Hen & Servane Gey & Jean-Michel Poggi, 2015. "Influence Measures for CART Classification Trees," Journal of Classification, Springer;The Classification Society, vol. 32(1), pages 21-45, April.
    4. Hossein Haroonabadi, 2014. "Islanding Detection in Micro-grids using Sum of Voltage and Current Wavelet Coefficients Energy before the Main Circuit Breaker Side," Asian Engineering Review, Asian Online Journal Publishing Group, vol. 1(1), pages 1-12.
    5. Sophie Dabo-Niang & Camille Ternynck & Anne-Françoise Yao, 2016. "Nonparametric prediction of spatial multivariate data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(2), pages 428-458, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rokach, Lior, 2009. "Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4046-4072, October.
    2. Tsao, C. Andy & Chang, Yuan-chin Ivan, 2007. "A stochastic approximation view of boosting," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 325-334, September.
    3. Croux, Christophe & Joossens, Kristel & Lemmens, Aurelie, 2007. "Trimmed bagging," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 362-368, September.
    4. Antonio D’Ambrosio & Massimo Aria & Roberta Siciliano, 2012. "Accurate Tree-based Missing Data Imputation and Data Fusion within the Statistical Learning Paradigm," Journal of Classification, Springer;The Classification Society, vol. 29(2), pages 227-258, July.
    5. Hennig, Christian, 2007. "Cluster-wise assessment of cluster stability," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 258-271, September.
    6. Avner Bar-Hen & Servane Gey & Jean-Michel Poggi, 2015. "Influence Measures for CART Classification Trees," Journal of Classification, Springer;The Classification Society, vol. 32(1), pages 21-45, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:53:y:2009:i:8:p:3082-3093. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.