IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v025i03.html
   My bibliography  Save this article

Getting Things in Order: An Introduction to the R Package seriation

Author

Listed:
  • Hahsler, Michael
  • Hornik, Kurt
  • Buchta, Christian

Abstract

Seriation, i.e., finding a suitable linear order for a set of objects given data and a loss or merit function, is a basic problem in data analysis. Caused by the problem's combinatorial nature, it is hard to solve for all but very small sets. Nevertheless, both exact solution methods and heuristics are available. In this paper we present the package seriation which provides an infrastructure for seriation with R. The infrastructure comprises data structures to represent linear orders as permutation vectors, a wide array of seriation methods using a consistent interface, a method to calculate the value of various loss and merit functions, and several visualization techniques which build on seriation. To illustrate how easily the package can be applied for a variety of applications, a comprehensive collection of examples is presented.

Suggested Citation

  • Hahsler, Michael & Hornik, Kurt & Buchta, Christian, 2008. "Getting Things in Order: An Introduction to the R Package seriation," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 25(i03).
  • Handle: RePEc:jss:jstsof:v:025:i03
    DOI: http://hdl.handle.net/10.18637/jss.v025.i03
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v025i03/v25i03.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v025i03/seriation_0.1-3.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v025i03/v25i03.R.zip
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v025.i03?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Niermann, Stefan, 2005. "Optimizing the Ordering of Tables With Evolutionary Computation," The American Statistician, American Statistical Association, vol. 59, pages 41-46, February.
    2. Dray, Stéphane & Dufour, Anne-Béatrice, 2007. "The ade4 Package: Implementing the Duality Diagram for Ecologists," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 22(i04).
    3. Hahsler, Michael & Hornik, Kurt, 2007. "TSPInfrastructure for the Traveling Salesperson Problem," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i02).
    4. Nathan Gale & William Halperin & C. Costanzo, 1984. "Unclassed matrix shading and optimal ordering in hierarchical cluster analysis," Journal of Classification, Springer;The Classification Society, vol. 1(1), pages 75-92, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hofmarcher, Paul & Crespo Cuaresma, Jesus & Grün, Bettina & Humer, Stefan & Moser, Mathias, 2018. "Bivariate jointness measures in Bayesian Model Averaging: Solving the conundrum," Journal of Macroeconomics, Elsevier, vol. 57(C), pages 150-165.
    2. Shailendra Pratap & Prashant K. Srivastava & Ashish Routray & Tanvir Islam & Rajesh Kumar Mall, 2020. "Appraisal of hydro-meteorological factors during extreme precipitation event: case study of Kedarnath cloudburst, Uttarakhand, India," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 100(2), pages 635-654, January.
    3. Wu, Han-Ming & Tien, Yin-Jing & Chen, Chun-houh, 2010. "GAP: A graphical environment for matrix visualization and cluster analysis," Computational Statistics & Data Analysis, Elsevier, vol. 54(3), pages 767-778, March.
    4. Laurent, Monique & Seminaroti, Matteo, 2016. "Similarity-First Search : A New Algorithm With Application to Robinsonian Matrix Recognition," Other publications TiSEM 8468be57-ed46-400c-9c0e-7, Tilburg University, School of Economics and Management.
    5. Maciej Jagódka & Małgorzata Snarska, 2021. "The State of Human Capital and Innovativeness of Polish Voivodships in 2004–2018," Sustainability, MDPI, vol. 13(22), pages 1-20, November.
    6. Troxler, David & Zabel, Astrid, 2021. "Clearing forests to make way for a sustainable economy transition in Switzerland," Forest Policy and Economics, Elsevier, vol. 129(C).
    7. Nyi-Nyi Htun & Diego Rojo & Jeroen Ooge & Robin De Croon & Aikaterini Kasimati & Katrien Verbert, 2022. "Developing Visual-Assisted Decision Support Systems across Diverse Agricultural Use Cases," Agriculture, MDPI, vol. 12(7), pages 1-30, July.
    8. Telcs, András & Kosztyán, Zsolt Tibor & Banász, Zsuzsanna & Csányi, Vivien Valéria, 2019. "Felsőoktatási ligák, parciális rangsorok képzése biklaszterezési eljárásokkal [How to rate higher education systems partial rankings using bi-clustering methods]," Közgazdasági Szemle (Economic Review - monthly of the Hungarian Academy of Sciences), Közgazdasági Szemle Alapítvány (Economic Review Foundation), vol. 0(9), pages 905-931.
    9. Crespo Cuaresma, Jesus & Grün, Bettina & Hofmarcher, Paul & Humer, Stefan & Moser, Mathias, 2015. "A Comprehensive Approach to Posterior Jointness Analysis in Bayesian Model Averaging Applications," Department of Economics Working Paper Series 193, WU Vienna University of Economics and Business.
    10. Piccarreta, Raffaella & Bonetti, Marco, 2019. "Assessing and comparing models for sequence data by microsimulation (with Supplementary Material)," SocArXiv 3mcfp, Center for Open Science.
    11. Hahsler, Michael, 2017. "An experimental comparison of seriation methods for one-mode two-way data," European Journal of Operational Research, Elsevier, vol. 257(1), pages 133-143.
    12. Eric C. Chi & Genevera I. Allen & Richard G. Baraniuk, 2017. "Convex biclustering," Biometrics, The International Biometric Society, vol. 73(1), pages 10-19, March.
    13. Kamini Yadav & Hatim M. E. Geli, 2021. "Prediction of Crop Yield for New Mexico Based on Climate and Remote Sensing Data for the 1920–2019 Period," Land, MDPI, vol. 10(12), pages 1-27, December.
    14. Nametala, Ciniro Aparecido Leite & Faria, Wandry Rodrigues & Lage, Guilherme Guimarães & Pereira, Benvindo Rodrigues, 2023. "Analysis of hourly price granularity implementation in the Brazilian deregulated electricity contracting environment," Utilities Policy, Elsevier, vol. 81(C).
    15. Aliyev, Denis A. & Zirbel, Craig L., 2023. "Seriation using tree-penalized path length," European Journal of Operational Research, Elsevier, vol. 305(2), pages 617-629.
    16. Tseng, George C., 2010. "Quantile map: Simultaneous visualization of patterns in many distributions with application to tandem mass spectrometry," Computational Statistics & Data Analysis, Elsevier, vol. 54(4), pages 1124-1137, April.
    17. Martin Junge & Rainer Reisenzein, 2015. "Maximum Likelihood Difference Scaling versus Ordinal Difference Scaling of emotion intensity: a comparison," Quality & Quantity: International Journal of Methodology, Springer, vol. 49(5), pages 2169-2185, September.
    18. Piccarreta, Raffaella & Struffolino, Emanuela, 2019. "An Integrated Heuristic for Validation in Sequence Analysis," SocArXiv v7mj8, Center for Open Science.
    19. Nicholas J. Croucher & Joseph J. Campo & Timothy Q. Le & Jozelyn V. Pablo & Christopher Hung & Andy A. Teng & Claudia Turner & François Nosten & Stephen D. Bentley & Xiaowu Liang & Paul Turner & David, 2024. "Genomic and panproteomic analysis of the development of infant immune responses to antigenically-diverse pneumococci," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    20. Keefe Murphy & T. Brendan Murphy & Raffaella Piccarreta & I. Claire Gormley, 2021. "Clustering longitudinal life‐course sequences using mixtures of exponential‐distance models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1414-1451, October.
    21. Amon, Julian & Hornik, Kurt, 2022. "Is it all bafflegab? – Linguistic and meta characteristics of research articles in prestigious economics journals," Journal of Informetrics, Elsevier, vol. 16(2).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:jss:jstsof:25:i03 is not listed on IDEAS
    2. Wenzel Kröber & Martin Böhnke & Erik Welk & Christian Wirth & Helge Bruelheide, 2012. "Leaf Trait-Environment Relationships in a Subtropical Broadleaved Forest in South-East China," PLOS ONE, Public Library of Science, vol. 7(4), pages 1-11, April.
    3. Pengfei Song & Wen Qin & YanGan Huang & Lei Wang & Zhenyuan Cai & Tongzuo Zhang, 2020. "Grazing Management Influences Gut Microbial Diversity of Livestock in the Same Area," Sustainability, MDPI, vol. 12(10), pages 1-12, May.
    4. la Grange, Anthony & le Roux, Niël & Gardner-Lubbe, Sugnet, 2009. "BiplotGUI: Interactive Biplots in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 30(i12).
    5. Jonas Eberle & Renier Myburgh & Dirk Ahrens, 2014. "The Evolution of Morphospace in Phytophagous Scarab Chafers: No Competition - No Divergence?," PLOS ONE, Public Library of Science, vol. 9(5), pages 1-16, May.
    6. Liesbeth François & Katrien Wijnrocx & Frédéric G Colinet & Nicolas Gengler & Bettine Hulsegge & Jack J Windig & Nadine Buys & Steven Janssens, 2017. "Genomics of a revived breed: Case study of the Belgian campine cattle," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-14, April.
    7. Wittek, Peter, 2013. "Two-way incremental seriation in the temporal domain with three-dimensional visualization: Making sense of evolving high-dimensional datasets," Computational Statistics & Data Analysis, Elsevier, vol. 66(C), pages 193-201.
    8. Aliyev, Denis A. & Zirbel, Craig L., 2023. "Seriation using tree-penalized path length," European Journal of Operational Research, Elsevier, vol. 305(2), pages 617-629.
    9. Hammond, Jim & Rosenblum, Nathaniel & Breseman, Dana & Gorman, Léo & Manners, Rhys & van Wijk, Mark T. & Sibomana, Milindi & Remans, Roseline & Vanlauwe, Bernard & Schut, Marc, 2020. "Towards actionable farm typologies: Scaling adoption of agricultural inputs in Rwanda," Agricultural Systems, Elsevier, vol. 183(C).
    10. Calenge, Clément, 2007. "Exploring Habitat Selection by Wildlife with adehabitat," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 22(i06).
    11. Sara Rachik & Urania Christaki & Luen Luen Li & Savvas Genitsaris & Elsa Breton & Sébastien Monchy, 2018. "Diversity and potential activity patterns of planktonic eukaryotic microbes in a mesoeutrophic coastal area (eastern English Channel)," PLOS ONE, Public Library of Science, vol. 13(5), pages 1-26, May.
    12. Serra W. Buchanan & Megan Baskerville & Maren Oelbermann & Andrew M. Gordon & Naresh V. Thevathasan & Marney E. Isaac, 2020. "Plant Diversity and Agroecosystem Function in Riparian Agroforests: Providing Ecosystem Services and Land-Use Transition," Sustainability, MDPI, vol. 12(2), pages 1-12, January.
    13. Catharine Prussing & Kevin J Emerson & Sara A Bickersmith & Maria Anice Mureb Sallum & Jan E Conn, 2019. "Minimal genetic differentiation of the malaria vector Nyssorhynchus darlingi associated with forest cover level in Amazonian Brazil," PLOS ONE, Public Library of Science, vol. 14(11), pages 1-16, November.
    14. Anna Favati & Josefina Zidar & Hanne Thorpe & Per Jensen & Hanne Løvlie, 2016. "The ontogeny of personality traits in the red junglefowl, Gallus gallus," Behavioral Ecology, International Society for Behavioral Ecology, vol. 27(2), pages 484-493.
    15. repec:jss:jstsof:22:i01 is not listed on IDEAS
    16. Luca Freschi & Roger Vargas & Ashaque Husain & S. M. Mostofa Kamal & Alena Skrahina & Sabira Tahseen & Nazir Ismail & Anna Barbova & Stefan Niemann & Daniela Maria Cirillo & Anna S. Dean & Matteo Zign, 2021. "Population structure, biogeography and transmissibility of Mycobacterium tuberculosis," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    17. Doppstadt, C. & Koberstein, A. & Vigo, D., 2016. "The Hybrid Electric Vehicle – Traveling Salesman Problem," European Journal of Operational Research, Elsevier, vol. 253(3), pages 825-842.
    18. Sólymos, Péter, 2009. "Processing Ecological Data in R with the mefa Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 29(i08).
    19. Alessandro Bellino & Daniela Baldantoni & Vittoria Milano & Lucia Santorufo & Jérôme Cortet & Giulia Maisto, 2021. "Spatial Patterns and Scales of Collembola Taxonomic and Functional Diversity in Urban Parks," Sustainability, MDPI, vol. 13(23), pages 1-11, November.
    20. Keith Hunley & Kiela Gwin & Brendan Liberman, 2016. "A Reassessment of the Impact of European Contact on the Structure of Native American Genetic Diversity," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-17, August.
    21. Vasilios Liordos & Jukka Jokimäki & Marja-Liisa Kaisanlahti-Jokimäki & Evangelos Valsamidis & Vasileios J. Kontsiotis, 2021. "Niche Analysis and Conservation of Bird Species Using Urban Core Areas," Sustainability, MDPI, vol. 13(11), pages 1-15, June.
    22. María Concepción Vega-Hernández & Carmen Patino-Alonso, 2021. "Comparing COSTATIS and Generalized Procrustes Analysis with Multi-Way Public Education Expenditure Data," Mathematics, MDPI, vol. 9(15), pages 1-13, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:025:i03. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.