IDEAS home Printed from https://ideas.repec.org/a/igg/jhisi0/v16y2021i4p1-25.html
   My bibliography  Save this article

A Data Representation Model for Personalized Medicine

Author

Listed:
  • Hafid Kadi

    (Department of Computer Science, University of Mustapha Stambouli, Mascara, Algeria & Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC, Caen, France)

  • Mohammed Rebbah

    (Department of Computer Science, University of Mustapha Stambouli, Mascara, Algeria)

  • Boudjelal Meftah

    (Department of Computer Science, University of Mustapha Stambouli, Mascara, Algeria)

  • Olivier Lézoray

    (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC, Caen, France)

Abstract

Personalized medicine exploits the patient data, for example, genetic compositions, and key biomarkers. During the data mining process, the key challenges are the information loss, the data types heterogeneity and the time series representation. In this paper, a novel data representation model for personalized medicine is proposed in light of these challenges. The proposed model will account for the structured, temporal and non-temporal data and their types, namely, numeric, nominal, date, and Boolean. After the "Date and Boolean" data transformation, the nominal data are treated by dispersion while several clustering techniques are deployed to control the numeric data distribution. Ultimately, the transformation process results in three homogeneous representations with these representations having only two dimensions to ease the exploration of the represented dataset. Compared to the Symbolic Aggregate Approximation technique, the proposed model preserves the time-series information, conserves as much data as possible and offers multiple simple representations to be explored.

Suggested Citation

  • Hafid Kadi & Mohammed Rebbah & Boudjelal Meftah & Olivier Lézoray, 2021. "A Data Representation Model for Personalized Medicine," International Journal of Healthcare Information Systems and Informatics (IJHISI), IGI Global, vol. 16(4), pages 1-25, October.
  • Handle: RePEc:igg:jhisi0:v:16:y:2021:i:4:p:1-25
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJHISI.295822
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. J. A. Hartigan & M. A. Wong, 1979. "A K‐Means Clustering Algorithm," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 28(1), pages 100-108, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carlos Carrasco-Farré, 2022. "The fingerprints of misinformation: how deceptive content differs from reliable sources in terms of cognitive effort and appeal to emotions," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-18, December.
    2. Felix Mbuga & Cristina Tortora, 2021. "Spectral Clustering of Mixed-Type Data," Stats, MDPI, vol. 5(1), pages 1-11, December.
    3. Zhang, Weibin & Zha, Huazhu & Zhang, Shuai & Ma, Lei, 2023. "Road section traffic flow prediction method based on the traffic factor state network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 618(C).
    4. Michal Bernardelli & Zbigniew Korzeb & Pawel Niedziolka, 2021. "The banking sector as the absorber of the COVID-19 crisis’ economic consequences: perception of WSE investors," Oeconomia Copernicana, Institute of Economic Research, vol. 12(2), pages 335-374, June.
    5. Jelle R Dalenberg & Luca Nanetti & Remco J Renken & René A de Wijk & Gert J ter Horst, 2014. "Dealing with Consumer Differences in Liking during Repeated Exposure to Food; Typical Dynamics in Rating Behavior," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-11, March.
    6. Custodio João, Igor & Lucas, André & Schaumburg, Julia & Schwaab, Bernd, 2023. "Dynamic clustering of multivariate panel data," Journal of Econometrics, Elsevier, vol. 237(2).
    7. Carlos Fernández-Hernández & Carmelo J. León & Jorge E. Araña & Flora Díaz-Pére, 2016. "Market segmentation, activities and environmental behaviour in rural tourism," Tourism Economics, , vol. 22(5), pages 1033-1054, October.
    8. Zhang, Tonglin & Lin, Ge, 2021. "Generalized k-means in GLMs with applications to the outbreak of COVID-19 in the United States," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    9. Andreas Lackner & Michael Müller & Magdalena Gamperl & Delyana Stoeva & Olivia Langmann & Henrieta Papuchova & Elisabeth Roitinger & Gerhard Dürnberger & Richard Imre & Karl Mechtler & Paulina A. Lato, 2023. "The Fgf/Erf/NCoR1/2 repressive axis controls trophoblast cell fate," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    10. Utkarsh J. Dang & Michael P.B. Gallaugher & Ryan P. Browne & Paul D. McNicholas, 2023. "Model-Based Clustering and Classification Using Mixtures of Multivariate Skewed Power Exponential Distributions," Journal of Classification, Springer;The Classification Society, vol. 40(1), pages 145-167, April.
    11. Beibei Yu & Zhonghui Wang & Haowei Mu & Li Sun & Fengning Hu, 2019. "Identification of Urban Functional Regions Based on Floating Car Track Data and POI Data," Sustainability, MDPI, vol. 11(23), pages 1-18, November.
    12. Liguo Fei & Jun Xia & Yuqiang Feng & Luning Liu, 2019. "A novel method to determine basic probability assignment in Dempster–Shafer theory and its application in multi-sensor information fusion," International Journal of Distributed Sensor Networks, , vol. 15(7), pages 15501477198, July.
    13. Bernd Scherer & Diogo Judice & Stephan Kessler, 2010. "Price reversals in global equity markets," Journal of Asset Management, Palgrave Macmillan, vol. 11(5), pages 332-345, December.
    14. Ugofilippo Basellini & Carlo Giovanni Camarda, 2020. "Modelling COVID-19 mortality at the regional level in Italy," Working Papers axq0sudakgkzhr-blecv, French Institute for Demographic Studies.
    15. Andrew Webb, 1997. "Radial basis functions for exploratory data analysis: An iterative majorisation approach for Minkowski distances based on multidimensional scaling," Journal of Classification, Springer;The Classification Society, vol. 14(2), pages 249-267, September.
    16. Jianzhong Ma & Christopher I Amos, 2012. "Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-12, July.
    17. Annah Vimbai Bengesai & Evelyn Derera, 2021. "The Association Between Women Empowerment and Emotional Violence in Zimbabwe: A Cluster Analysis Approach," SAGE Open, , vol. 11(2), pages 21582440211, June.
    18. Urmeneta, Jon & Izquierdo, Juan & Leturiondo, Urko, 2023. "A methodology for performance assessment at system level—Identification of operating regimes and anomaly detection in wind turbines," Renewable Energy, Elsevier, vol. 205(C), pages 281-292.
    19. Zhang, Weibin & Feng, Xinyu & Goerlandt, Floris & Liu, Qing, 2020. "Towards a Convolutional Neural Network model for classifying regional ship collision risk levels for waterway risk analysis," Reliability Engineering and System Safety, Elsevier, vol. 204(C).
    20. Niels Waller & Heather Kaiser & Janine Illian & Mike Manry, 1998. "A comparison of the classification capabilities of the 1-dimensional kohonen neural network with two pratitioning and three hierarchical cluster analysis algorithms," Psychometrika, Springer;The Psychometric Society, vol. 63(1), pages 5-22, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jhisi0:v:16:y:2021:i:4:p:1-25. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.