IDEAS home Printed from https://ideas.repec.org/a/wsi/ijitdm/v16y2017i06ns0219622017500286.html
   My bibliography  Save this article

Thinking and Modeling for Big Data from the Perspective of the I Ching

Author

Listed:
  • Chuang Lin

    (Department of Computer Science, Shenzhen University, Shenzhen, P. R. China†Department of Computer Science, Tsinghua University, Beijing, P. R. China)

  • Guoliang Li

    (#x2020;Department of Computer Science, Tsinghua University, Beijing, P. R. China)

  • Zhiguang Shan

    (State Information Center of China, Beijing, P. R. China)

  • Yong Shi

    (University of Chinese Academy of Sciences, Beijing, P. R. China)

Abstract

Data is growing faster than ever before and is changing our daily life. However it is rather challenging to manage the big data [F. H. Cate, The big debate, Science 346 (2014) 810, J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh and A. H. Byers, Big Data: The Next Frontier for Innovation, Competition, and Productivity (Mckinsey global Institute, 2011), S. Lohr, The Age of Big Data (New York Times, 2012), p. 11, L. Einav and J. Levin, Economics in the age of big data, Science 345 (2014) 715, M. J. Khoury and J. P. A. Ioannidis, Big data meets public health, Science 346 (2014) 1054–1055, V. Marx, Biology: The big challenges of big data, Nature 498(7453) (2013) 255–260.]. In this paper, we propose the big data thinking and modeling techniques from the perspective of the I Ching, which is a very famous imaginal thinking theory in China with 3,000 years history. The I Ching has been proven to be very useful and practical in many domains, e.g., 36 stratagems.Firstly, inspired from the three components of the I Ching, image, number and principle, we propose a new three-cycle big data thinking way, from data to phenomenon, from phenomenon to correlation, and from correlation to knowledge, which is a generalization of the fourth paradigm (from causality to correlation) proposed by Jim Gray.Secondly, inspired from the three entities of the I Ching, heaven, earth and human, we propose a new big data modeling method. We use the tree entities to represent the big data. We map the 4V of big data (volume, variety, velocity, veracity) to four opposition and uniform relations in the I Ching, and generate the eight diagrams. By capturing the relationships between eight diagrams, we generate the 64 hexagrams, and use 64 hexagrams to model big data. We also provide the principle rules to understand the knowledge generated by the model.Thirdly, we discuss how to utilize our model to describe big-data management tools, including, MapReduce, Spark, Storm. We also provide a new model for handling distributed data streams.We do think that we provide a new practical way of thinking and modeling for big data. We also believe that this will open up many new research directions on big data.

Suggested Citation

  • Chuang Lin & Guoliang Li & Zhiguang Shan & Yong Shi, 2017. "Thinking and Modeling for Big Data from the Perspective of the I Ching," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(06), pages 1451-1463, November.
  • Handle: RePEc:wsi:ijitdm:v:16:y:2017:i:06:n:s0219622017500286
    DOI: 10.1142/S0219622017500286
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219622017500286
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219622017500286?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Vivien Marx, 2013. "The big challenges of big data," Nature, Nature, vol. 498(7453), pages 255-260, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lin Zhu & Xiantao Liu & Sha He & Jun Shi & Ming Pang, 2015. "Keywords co-occurrence mapping knowledge domain research base on the theory of Big Data in oil and gas industry," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(1), pages 249-260, October.
    2. Zhang, Yi & Huang, Ying & Porter, Alan L. & Zhang, Guangquan & Lu, Jie, 2019. "Discovering and forecasting interactions in big data research: A learning-enhanced bibliometric study," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 795-807.
    3. Stefano Bianchini & Moritz Müller & Pierre Pelletier, 2022. "Artificial intelligence in science: An emerging general method of invention," Post-Print hal-03958025, HAL.
    4. Jun Feng & Zhenting Li & Shizhen Zhang & Chun Bao & Jingxian Fang & Yun Yin & Bolei Chen & Lei Pan & Bing Wang & Yu Zheng, 2023. "A Microimage-Processing-Based Technique for Detecting Qualitative and Quantitative Characteristics of Plant Cells," Agriculture, MDPI, vol. 13(9), pages 1-16, September.
    5. Tang, Ming & Liao, Huchang, 2021. "From conventional group decision making to large-scale group decision making: What are the challenges and how to meet them in big data era? A state-of-the-art survey," Omega, Elsevier, vol. 100(C).
    6. Janssen, Marijn & van der Voort, Haiko & Wahyudi, Agung, 2017. "Factors influencing big data decision-making quality," Journal of Business Research, Elsevier, vol. 70(C), pages 338-345.
    7. Haitham Nobanee & Mehroz Nida Dilshad & Mona Al Dhanhani & Maitha Al Neyadi & Sultan Al Qubaisi & Saeed Al Shamsi, 2021. "Big Data Applications the Banking Sector: A Bibliometric Analysis Approach," SAGE Open, , vol. 11(4), pages 21582440211, December.
    8. Reza Farrahi Moghaddam & Fereydoun Farrahi Moghaddam & Mohamed Cheriet, 2014. "A Multi-Entity Input Output (MEIO) Approach to Sustainability - Water-Energy-GHG (WEG) Footprint Statements in Use Cases from Auto and Telco Industries," Papers 1404.6227, arXiv.org, revised Apr 2014.
    9. Yoshiyuki Ogata & Kazuto Mannen & Yasuto Kotani & Naohiro Kimura & Nozomu Sakurai & Daisuke Shibata & Hideyuki Suzuki, 2018. "ConfeitoGUI: A toolkit for size-sensitive community detection from a correlation network," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-18, October.
    10. S. Vijayakumar Bharathi, 2017. "Prioritizing and Ranking the Big Data Information Security Risk Spectrum," Global Journal of Flexible Systems Management, Springer;Global Institute of Flexible Systems Management, vol. 18(3), pages 183-201, September.
    11. Jonathan E Butner & Ascher K Munion & Brian R W Baucom & Alexander Wong, 2019. "Ghost hunting in the nonlinear dynamic machine," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-21, December.
    12. Subhroshekhar Ghosh & Soumendu Sundar Mukherjee, 2022. "Learning with latent group sparsity via heat flow dynamics on networks," Papers 2201.08326, arXiv.org.
    13. J. Lars Kirkby & Dang H. Nguyen & Duy Nguyen & Nhu N. Nguyen, 2022. "Inversion-free subsampling Newton’s method for large sample logistic regression," Statistical Papers, Springer, vol. 63(3), pages 943-963, June.
    14. Zbysław Dobrowolski, 2021. "Internet of Things and Other E-Solutions in Supply Chain Management May Generate Threats in the Energy Sector—The Quest for Preventive Measures," Energies, MDPI, vol. 14(17), pages 1-11, August.
    15. Dawen Xia & Xiaonan Lu & Huaqing Li & Wendong Wang & Yantao Li & Zili Zhang, 2018. "A MapReduce-Based Parallel Frequent Pattern Growth Algorithm for Spatiotemporal Association Analysis of Mobile Trajectory Big Data," Complexity, Hindawi, vol. 2018, pages 1-16, January.
    16. Matteo Fontana & Massimo Tavoni & Simone Vantini, 2019. "Functional Data Analysis of high-frequency load curves reveals drivers of residential electricity consumption," PLOS ONE, Public Library of Science, vol. 14(6), pages 1-16, June.
    17. Lu Jiang & Xinyu Kang & Shan Huang & Bo Yang, 2022. "A refinement strategy for identification of scientific software from bioinformatics publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 3293-3316, June.
    18. Gamermann, Daniel & Antunes, Felipe Leite, 2018. "Statistical analysis of Brazilian electoral campaigns via Benford’s law," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 171-188.
    19. Alberto Fernández & Sara Río & Abdullah Bawakid & Francisco Herrera, 2017. "Fuzzy rule based classification systems for big data with MapReduce: granularity analysis," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(4), pages 711-730, December.
    20. Akshay Mani & Resmi Ravindran & Soujanya Mannepalli & Daniel Vang & Paul A Luciw & Michael Hogarth & Imran H Khan & Viswanathan V Krishnan, 2015. "Data Mining Strategies to Improve Multiplex Microbead Immunoassay Tolerance in a Mouse Model of Infectious Diseases," PLOS ONE, Public Library of Science, vol. 10(1), pages 1-19, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijitdm:v:16:y:2017:i:06:n:s0219622017500286. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijitdm/ijitdm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.