IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v053i07.html
   My bibliography  Save this article

New Approaches in Visualization of Categorical Data: R Package extracat

Author

Listed:
  • Pilhöfer, Alexander
  • Unwin, Antony

Abstract

The R package extracat provides two new graphical methods for displaying categorical data extending the concepts of multiple barcharts and parallel coordinates plots. The first method called rmb plot uses a crossover of mosaicplots and multiple barcharts to display the frequencies of a data table split up into conditional relative frequencies of one target variable and the absolute frequencies of the corresponding combinations of the remaining explanatory variables. It provides a well-structured representation of the data which is easy to interpret and allows precise comparisons. The graphic can additionally be used as a generalization of spineplots or with barcharts for the conditional relative frequencies. Several options, including ceiling censored zooming, residual shadings and a choice of color palettes, are provided. An interactive version based on the R package iWidgets is also presented. The second graphic cpcp uses the interactive parallel coordinates plots in the iplots package to visualize categorical data. Sequences of points are used to represent each of the variable categories, while ordering algorithms are applied to represent a hierarchical structure in the data and keep the arrangement clear. This interactive graphic is well-suited for exploratory analysis and allows a visual interpretation even for a higher number of variables and a mixture of categorical and numeric scales.

Suggested Citation

  • Pilhöfer, Alexander & Unwin, Antony, 2013. "New Approaches in Visualization of Categorical Data: R Package extracat," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i07).
  • Handle: RePEc:jss:jstsof:v:053:i07
    DOI: http://hdl.handle.net/10.18637/jss.v053.i07
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v053i07/v53i07.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v053i07/extracat_1.6-3.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v053i07/v53i07.R
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v053.i07?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fox, John, 2003. "Effect Displays in R for Generalised Linear Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 8(i15).
    2. Zeileis, Achim & Hornik, Kurt & Murrell, Paul, 2009. "Escaping RGBland: Selecting colors for statistical graphics," Computational Statistics & Data Analysis, Elsevier, vol. 53(9), pages 3259-3270, July.
    3. Meyer, David & Zeileis, Achim & Hornik, Kurt, 2006. "The Strucplot Framework: Visualizing Multi-way Contingency Tables with vcd," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 17(i03).
    4. Fox, John & Hong, Jangman, 2009. "Effect Displays in R for Multinomial and Proportional-Odds Logit Models: Extensions to the effects Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i01).
    5. Unwin, Antony & Volinsky, Chris & Winkler, Sylvia, 2003. "Parallel coordinates for exploratory modelling analysis," Computational Statistics & Data Analysis, Elsevier, vol. 43(4), pages 553-564, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Brzezińska Justyna, 2018. "Visualization of Categorical Data Using Extracat Package in R," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 9-19, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    2. repec:jss:jstsof:32:i01 is not listed on IDEAS
    3. Matthias Templ & Andreas Alfons & Peter Filzmoser, 2012. "Exploring incomplete data using visualization techniques," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 6(1), pages 29-47, April.
    4. Jamie C. Moore & Gabriele B. Durrant & Peter W. F. Smith, 2021. "Do coefficients of variation of response propensities approximate non‐response biases during survey data collection?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 301-323, January.
    5. Brzezińska Justyna, 2018. "Visualization of Categorical Data Using Extracat Package in R," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 9-19, June.
    6. Fox, John & Hong, Jangman, 2009. "Effect Displays in R for Multinomial and Proportional-Odds Logit Models: Extensions to the effects Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i01).
    7. Andrea S. Grunst & Melissa L. Grunst, 2015. "Context-dependent relationships between multiple sexual pigments and paternal effort," Behavioral Ecology, International Society for Behavioral Ecology, vol. 26(4), pages 1170-1179.
    8. Sewell, Daniel K., 2018. "Visualizing data through curvilinear representations of matrices," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 255-270.
    9. Manuel Eugster & Friedrich Leisch, 2011. "Exploratory analysis of benchmark experiments an interactive approach," Computational Statistics, Springer, vol. 26(4), pages 699-710, December.
    10. Yee, Thomas W., 2010. "The VGAM Package for Categorical Data Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i10).
    11. Jimena Sobrino-Piazza & Simon Foster & Natalia Estévez-Lamorte & Meichun Mohler-Kuo, 2021. "Parental Monitoring, Individual Dispositions, and Alcohol Use Disorder: A Longitudinal Study with Young Swiss Men," IJERPH, MDPI, vol. 18(18), pages 1-10, September.
    12. Giovanni Cassani & Robert Grimm & Walter Daelemans & Steven Gillis, 2018. "Lexical category acquisition is facilitated by uncertainty in distributional co-occurrences," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-36, December.
    13. Stanislav Katina & Liberty Vittert & Adrian W. Bowman, 2021. "Functional data analysis and visualisation of three‐dimensional surface shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 691-713, June.
    14. Ulrich Matter & Alois Stutzer, 2019. "Does Public Attention Reduce The Influence Of Moneyed Interests? Policy Positions On Sopa/Pipa Before And After The Internet Blackout," Economic Inquiry, Western Economic Association International, vol. 57(4), pages 1879-1895, October.
    15. Erdogan, Murside Rabia & Camgoz, Selin Metin & Karan, Mehmet Baha & Berument, M. Hakan, 2022. "The switching behavior of large-scale electricity consumers in The Turkish electricity retail market," Energy Policy, Elsevier, vol. 160(C).
    16. Giuliano Guerra & Roberto Patuelli & Rico Maggi, 2012. "Ethnic concentration, cultural identity and immigrant self-employment in Switzerland," Chapters, in: Peter Nijkamp & Jacques Poot & Mediha Sahin (ed.), Migration Impact Assessment, chapter 4, pages 147-171, Edward Elgar Publishing.
    17. Ekbrand, Hans & Halleröd, Björn, 2018. "The more gender equity, the less child poverty? A multilevel analysis of malnutrition and health deprivation in 49 low- and middle-income countries," World Development, Elsevier, vol. 108(C), pages 221-230.
    18. Xu, JieLan, 2020. "Generational trends of gendered mobility: How do they interact with geographical contexts?," Journal of Transport Geography, Elsevier, vol. 82(C).
    19. Leonardo Salvatore Alaimo & Mariantonietta Fiore & Antonino Galati, 2020. "How the Covid-19 Pandemic Is Changing Online Food Shopping Human Behaviour in Italy," Sustainability, MDPI, vol. 12(22), pages 1-18, November.
    20. Lenth, Russell V., 2016. "Least-Squares Means: The R Package lsmeans," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 69(i01).
    21. Jonas Schöley, 2021. "The centered ternary balance scheme: A technique to visualize surfaces of unbalanced three-part compositions," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 44(19), pages 443-458.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:053:i07. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.