IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v61y2020i4d10.1007_s00362-018-0997-x.html
   My bibliography  Save this article

Dynamic recursive tree-based partitioning for malignant melanoma identification in skin lesion dermoscopic images

Author

Listed:
  • Massimo Aria

    (University of Naples Federico II)

  • Antonio D’Ambrosio

    (University of Naples Federico II)

  • Carmela Iorio

    (University of Naples Federico II)

  • Roberta Siciliano

    (University of Naples Federico II)

  • Valentina Cozza

    (Parthenope University of Naples)

Abstract

In this paper, multivalued data or multiple values variables are defined. They are typical when there is some intrinsic uncertainty in data production, as the result of imprecise measuring instruments, such as in image recognition, in human judgments and so on. So far, contributions in symbolic data analysis literature provide data preprocessing criteria allowing for the use of standard methods such as factorial analysis, clustering, discriminant analysis, tree-based methods. As an alternative, this paper introduces a methodology for supervised classification, the so-called Dynamic CLASSification TREE (D-CLASS TREE), dealing simultaneously with both standard and multivalued data as well. For that, an innovative partitioning criterion with a tree-growing algorithm will be defined. Main result is a dynamic tree structure characterized by the simultaneous presence of binary and ternary partitions. A real world case study will be considered to show the advantages of the proposed methodology and main issues of the interpretation of the final results. A comparative study with other approaches dealing with the same types of data will be also shown. The comparison highlights that, even if the results are quite similar in terms of error rates, the proposed D-CLASS tree returns a more interpretable tree-based structure.

Suggested Citation

  • Massimo Aria & Antonio D’Ambrosio & Carmela Iorio & Roberta Siciliano & Valentina Cozza, 2020. "Dynamic recursive tree-based partitioning for malignant melanoma identification in skin lesion dermoscopic images," Statistical Papers, Springer, vol. 61(4), pages 1645-1661, August.
  • Handle: RePEc:spr:stpapr:v:61:y:2020:i:4:d:10.1007_s00362-018-0997-x
    DOI: 10.1007/s00362-018-0997-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-018-0997-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-018-0997-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Antonio D’Ambrosio & Massimo Aria & Roberta Siciliano, 2012. "Accurate Tree-based Missing Data Imputation and Data Fusion within the Statistical Learning Paradigm," Journal of Classification, Springer;The Classification Society, vol. 29(2), pages 227-258, July.
    2. Billard L. & Diday E., 2003. "From the Statistics of Data to the Statistics of Knowledge: Symbolic Data Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 470-487, January.
    3. Gil, Maria Angeles & Montenegro, Manuel & Gonzalez-Rodriguez, Gil & Colubi, Ana & Rosa Casals, Maria, 2006. "Bootstrap approach to the multi-sample test of means with imprecise data," Computational Statistics & Data Analysis, Elsevier, vol. 51(1), pages 148-162, November.
    4. Cappelli, Carmela & Mola, Francesco & Siciliano, Roberta, 2002. "A statistical approach to growing a reliable honest tree," Computational Statistics & Data Analysis, Elsevier, vol. 38(3), pages 285-299, January.
    5. Tatjana Lange & Karl Mosler & Pavlo Mozharovskyi, 2014. "Fast nonparametric classification based on data depth," Statistical Papers, Springer, vol. 55(1), pages 49-69, February.
    6. Riccardo Borgoni & Ann Berrington, 2013. "Evaluating a sequential tree-based procedure for multivariate imputation of complex missing data structures," Quality & Quantity: International Journal of Methodology, Springer, vol. 47(4), pages 1991-2008, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roberta Siciliano & Antonio D’Ambrosio & Massimo Aria & Sonia Amodio, 2017. "Analysis of Web Visit Histories, Part II: Predicting Navigation by Nested STUMP Regression Trees," Journal of Classification, Springer;The Classification Society, vol. 34(3), pages 473-493, October.
    2. Xiaohui Liu & Shihua Luo & Yijun Zuo, 2020. "Some results on the computing of Tukey’s halfspace median," Statistical Papers, Springer, vol. 61(1), pages 303-316, February.
    3. Miguel de Carvalho & Gabriel Martos, 2022. "Modeling interval trendlines: Symbolic singular spectrum analysis for interval time series," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(1), pages 167-180, January.
    4. Ivan Miguel Pires & Faisal Hussain & Nuno M. Garcia & Eftim Zdravevski, 2020. "Improving Human Activity Monitoring by Imputation of Missing Sensory Data: Experimental Study," Future Internet, MDPI, vol. 12(9), pages 1-18, September.
    5. Stanislav Nagy, 2021. "Halfspace depth does not characterize probability distributions," Statistical Papers, Springer, vol. 62(3), pages 1135-1139, June.
    6. Gil, Maria Angeles & Gonzalez-Rodriguez, Gil & Colubi, Ana & Montenegro, Manuel, 2007. "Testing linear independence in linear models with interval-valued data," Computational Statistics & Data Analysis, Elsevier, vol. 51(6), pages 3002-3015, March.
    7. Drago, Carlo, 2015. "Exploring the Community Structure of Complex Networks," MPRA Paper 81024, University Library of Munich, Germany.
    8. Philip Hans Franses & Max Welz, 2022. "Evaluating heterogeneous forecasts for vintages of macroeconomic variables," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(4), pages 829-839, July.
    9. Dias, Sónia & Brito, Paula & Amaral, Paula, 2021. "Discriminant analysis of distributional data via fractional programming," European Journal of Operational Research, Elsevier, vol. 294(1), pages 206-218.
    10. A. Pedro Duarte Silva & Peter Filzmoser & Paula Brito, 2018. "Outlier detection in interval data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 785-822, September.
    11. J. Le-Rademacher & L. Billard, 2013. "Principal component histograms from interval-valued observations," Computational Statistics, Springer, vol. 28(5), pages 2117-2138, October.
    12. Vencalek, Ondrej & Pokotylo, Oleksii, 2018. "Depth-weighted Bayes classification," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 1-12.
    13. Dyckerhoff, Rainer & Mozharovskyi, Pavlo, 2016. "Exact computation of the halfspace depth," Computational Statistics & Data Analysis, Elsevier, vol. 98(C), pages 19-30.
    14. Sun, Yuying & Han, Ai & Hong, Yongmiao & Wang, Shouyang, 2018. "Threshold autoregressive models for interval-valued time series data," Journal of Econometrics, Elsevier, vol. 206(2), pages 414-446.
    15. Lima Neto, Eufrásio de A. & de Carvalho, Francisco de A.T., 2010. "Constrained linear regression models for symbolic interval-valued variables," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 333-347, February.
    16. Antonio Balzanella & Antonio Irpino, 2020. "Spatial prediction and spatial dependence monitoring on georeferenced data streams," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(1), pages 101-128, March.
    17. Liu, Xiaohui & Rahman, Jafer & Luo, Shihua, 2019. "Generalized and robustified empirical depths for multivariate data," Statistics & Probability Letters, Elsevier, vol. 146(C), pages 70-79.
    18. Paolo Giordani, 2015. "Lasso-constrained regression analysis for interval-valued data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 9(1), pages 5-19, March.
    19. Hao, Peng & Guo, Junpeng, 2017. "Constrained center and range joint model for interval-valued symbolic data regression," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 106-138.
    20. Fei Liu & L. Billard, 2022. "Partition of Interval-Valued Observations Using Regression," Journal of Classification, Springer;The Classification Society, vol. 39(1), pages 55-77, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:61:y:2020:i:4:d:10.1007_s00362-018-0997-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.