IDEAS home Printed from https://ideas.repec.org/a/vrs/eaiada/v22y2018i2p9-19n1.html
   My bibliography  Save this article

Visualization of Categorical Data Using Extracat Package in R

Author

Listed:
  • Brzezińska Justyna

    (University of Economics in Katowice, Katowice, Poland)

Abstract

Visualization in research process plays a crucial role. There are several advanced plots for visualizing categorical data, such as mosaic, association, double-decker, sieve or fourfold plot that are based on the graphical presentation of residuals in a contingency table. In this paper we present new methods for visualizing categorical data such as rmb, fluctile and scpcp plot available in extracat package in R. This package provides a well-structured representation of categorical data and allows for a detailed presentation of the relationship between categories in terms of proportions. We describe rmb, fluctile and cpcp. Those plots are based on the concept of multiple bar charts, a fluctuation diagram from a multidimensional table and parallel coordinates respectively. Such plots are mostly used for a visualization of a contingency table or a data frame; they can also be used for exploratory analysis and allows for a graphical presentation even for a high number of variables [Pilhöfer, Unwin 2013]. All the calculations and plots are obtained using R software.

Suggested Citation

  • Brzezińska Justyna, 2018. "Visualization of Categorical Data Using Extracat Package in R," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 9-19, June.
  • Handle: RePEc:vrs:eaiada:v:22:y:2018:i:2:p:9-19:n:1
    DOI: 10.15611/eada.2018.2.01
    as

    Download full text from publisher

    File URL: https://doi.org/10.15611/eada.2018.2.01
    Download Restriction: no

    File URL: https://libkey.io/10.15611/eada.2018.2.01?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Meyer, David & Zeileis, Achim & Hornik, Kurt, 2006. "The Strucplot Framework: Visualizing Multi-way Contingency Tables with vcd," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 17(i03).
    2. Pilhöfer, Alexander & Unwin, Antony, 2013. "New Approaches in Visualization of Categorical Data: R Package extracat," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i07).
    3. Unwin, Antony & Volinsky, Chris & Winkler, Sylvia, 2003. "Parallel coordinates for exploratory modelling analysis," Computational Statistics & Data Analysis, Elsevier, vol. 43(4), pages 553-564, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pilhöfer, Alexander & Unwin, Antony, 2013. "New Approaches in Visualization of Categorical Data: R Package extracat," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i07).
    2. Sewell, Daniel K., 2018. "Visualizing data through curvilinear representations of matrices," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 255-270.
    3. Manuel Eugster & Friedrich Leisch, 2011. "Exploratory analysis of benchmark experiments an interactive approach," Computational Statistics, Springer, vol. 26(4), pages 699-710, December.
    4. Yee, Thomas W., 2010. "The VGAM Package for Categorical Data Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i10).
    5. Matthias Templ & Andreas Alfons & Peter Filzmoser, 2012. "Exploring incomplete data using visualization techniques," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 6(1), pages 29-47, April.
    6. repec:jss:jstsof:28:i08 is not listed on IDEAS
    7. Cook, Dianne & Hofmann, Heike, 2011. "R Graphics (2nd Edition)," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 43(b03).
    8. repec:jss:jstsof:32:i10 is not listed on IDEAS
    9. Justyna Brzezińska, 2012. "Independence analysis of nominal data with the use of log-linear models in R," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 13(2), pages 311-320, June.
    10. Heiberger, Richard & Robbins, Naomi, 2014. "Design of Diverging Stacked Bar Charts for Likert Scales and Other Applications," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 57(i05).
    11. Hothorn, Torsten & Hornik, Kurt & van de Wiel, Mark A. & Zeileis, Achim, 2008. "Implementing a Class of Permutation Tests: The coin Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i08).
    12. C. Hurley & R. Oldford, 2011. "Eulerian tour algorithms for data visualization and the PairViz package," Computational Statistics, Springer, vol. 26(4), pages 613-633, December.
    13. Andreas Alfons & Stefan Kraft & Matthias Templ & Peter Filzmoser, 2011. "Simulation of close-to-reality population data for household surveys with application to EU-SILC," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 20(3), pages 383-407, August.
    14. Pavlo Maksimov & Johannes Zerweck & Jitender P Dubey & Nikola Pantchev & Caroline F Frey & Aline Maksimov & Ulf Reimer & Mike Schutkowski & Morteza Hosseininejad & Mario Ziller & Franz J Conraths & Ge, 2013. "Serotyping of Toxoplasma gondii in Cats (Felis domesticus) Reveals Predominance of Type II Infections in Germany," PLOS ONE, Public Library of Science, vol. 8(11), pages 1-1, November.
    15. Monira Hamid & Christopher Thron & Sallam Fageeri, 2021. "Demographics of Sudanese University Students in Relation to Regional Conflict and Underdevelopment," Social Sciences, MDPI, vol. 10(3), pages 1-33, March.
    16. Klusáček, Petr & Navrátil, Josef & Martinát, Stanislav & Krejčí, Tomáš & Golubchikov, Oleg & Pícha, Kamil & Škrabal, Jaroslav & Osman, Robert, 2021. "Planning for the future of derelict farm premises: From abandonment to regeneration?," Land Use Policy, Elsevier, vol. 102(C).
    17. Claudio Conversano & Domenico Vistocco, 2010. "Analysis of mutual funds' management styles: a modeling, ranking and visualizing approach," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(11), pages 1825-1845.
    18. Zeileis, Achim & Hornik, Kurt & Murrell, Paul, 2009. "Escaping RGBland: Selecting colors for statistical graphics," Computational Statistics & Data Analysis, Elsevier, vol. 53(9), pages 3259-3270, July.

    More about this item

    Keywords

    categorical data; cpcp plot; rmb plot; fluctile plot; R software;
    All these keywords.

    JEL classification:

    • C30 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - General
    • C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models
    • C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:eaiada:v:22:y:2018:i:2:p:9-19:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.