IDEAS home Printed from https://ideas.repec.org/a/eee/stapro/v136y2018icp15-19.html
   My bibliography  Save this article

Data learning from big data

Author

Listed:
  • Torrecilla, José L.
  • Romo, Juan

Abstract

Technology is generating a huge and growing availability of observations of diverse nature. This big data is placing data learning as a central scientific discipline. It includes collection, storage, preprocessing, visualization and, essentially, statistical analysis of enormous batches of data. In this paper, we discuss the role of statistics regarding some of the issues raised by big data in this new paradigm and also propose the name of data learning to describe all the activities that allow to obtain relevant knowledge from this new source of information.

Suggested Citation

  • Torrecilla, José L. & Romo, Juan, 2018. "Data learning from big data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 15-19.
  • Handle: RePEc:eee:stapro:v:136:y:2018:i:c:p:15-19
    DOI: 10.1016/j.spl.2018.02.038
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016771521830083X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.spl.2018.02.038?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Clifford Lynch, 2008. "How do your data grow?," Nature, Nature, vol. 455(7209), pages 28-29, September.
    2. Secchi, Piercesare, 2018. "On the role of statistics in the era of big data: A call for a debate," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 10-14.
    3. James, Gareth M., 2018. "Statistics within business in the era of big data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 155-159.
    4. Gad Abraham & Michael Inouye, 2014. "Fast Principal Component Analysis of Large-Scale Genome-Wide Data," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-5, April.
    5. Dunson, David B., 2018. "Statistics in the big data era: Failures of the machine," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 4-9.
    6. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sim Jia Jin & Abdul Halim Abdullah & Mahani Mokhtar & Umar Haiyat Abdul Kohar, 2022. "The Potential of Big Data Application in Mathematics Education in Malaysia," Sustainability, MDPI, vol. 14(21), pages 1-23, October.
    2. Russell Tatenda Munodawafa & Satirenjit Kaur Johl, 2019. "Big Data Analytics Capabilities and Eco-Innovation: A Study of Energy Companies," Sustainability, MDPI, vol. 11(15), pages 1-21, August.
    3. Pedro Galeano & Daniel Peña, 2019. "Data science, big data and statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 289-329, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Reid, Nancy, 2018. "Statistical science in the world of big data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 42-45.
    2. Claudio Vitari & Elisabetta Raguseo, 2016. "Big data value and financial performance: an empirical investigation [Digital data, dynamic capability and financial performance: an empirical investigation in the era of Big Data]," Post-Print halshs-01923271, HAL.
    3. Lili Liu & Atlas Khan & Elena Sanchez-Rodriguez & Francesca Zanoni & Yifu Li & Nicholas Steers & Olivia Balderes & Junying Zhang & Priya Krithivasan & Robert A. LeDesma & Clara Fischman & Scott J. Heb, 2022. "Genetic regulation of serum IgA levels and susceptibility to common immune, infectious, kidney, and cardio-metabolic traits," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    4. Chakraborty, Tanujit & Chakraborty, Ashis Kumar & Murthy, C.A., 2019. "A nonparametric ensemble binary classifier and its statistical properties," Statistics & Probability Letters, Elsevier, vol. 149(C), pages 16-23.
    5. Lu, Xuefei & Borgonovo, Emanuele, 2023. "Global sensitivity analysis in epidemiological modeling," European Journal of Operational Research, Elsevier, vol. 304(1), pages 9-24.
    6. Nadeem Shafique Butt & Ahmad Azam Malik & Muhammad Qaiser Shahbaz, 2021. "Bibliometric Analysis of Statistics Journals Indexed in Web of Science Under Emerging Source Citation Index," SAGE Open, , vol. 11(1), pages 21582440209, January.
    7. Rita Yi Man Li & Herru Ching Yu Li, 2018. "Have Housing Prices Gone with the Smelly Wind? Big Data Analysis on Landfill in Hong Kong," Sustainability, MDPI, vol. 10(2), pages 1-19, January.
    8. Blazquez, Desamparados & Domenech, Josep, 2018. "Big Data sources and methods for social and economic analyses," Technological Forecasting and Social Change, Elsevier, vol. 130(C), pages 99-113.
    9. Claudio Vitari & Elisabetta Raguseo, 2019. "Big data analytics business value and firm performance: Linking with environmental context," Post-Print hal-02293765, HAL.
    10. Muhammad Haseeb & Hafezali Iqbal Hussain & Beata Ślusarczyk & Kittisak Jermsittiparsert, 2019. "Industry 4.0: A Solution towards Technology Challenges of Sustainable Business Performance," Social Sciences, MDPI, vol. 8(5), pages 1-24, May.
    11. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    12. Emanuele Aliverti & Kristian Lum & James E. Johndrow & David B. Dunson, 2021. "Removing the influence of group variables in high‐dimensional predictive modelling," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 791-811, July.
    13. Bouzebda, Salim & Chaouch, Mohamed, 2022. "Uniform limit theorems for a class of conditional Z-estimators when covariates are functions," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    14. Olhede, Sofia C. & Wolfe, Patrick J., 2018. "The future of statistics and data science," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 46-50.
    15. Yu, Yan & Ibarra, Julio E. & Kumar, Kuldeep & Chergarova, Vasilka, 2021. "Coevolution of cyberinfrastructure development and scientific progress," Technovation, Elsevier, vol. 100(C).
    16. Yasset Perez-Riverol & Max Kuhn & Juan Antonio Vizcaíno & Marc-Phillip Hitz & Enrique Audain, 2017. "Accurate and fast feature selection workflow for high-dimensional omics data," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-14, December.
    17. Maddalena Favaretto & David Shaw & Eva De Clercq & Tim Joda & Bernice Simone Elger, 2020. "Big Data and Digitalization in Dentistry: A Systematic Review of the Ethical Issues," IJERPH, MDPI, vol. 17(7), pages 1-15, April.
    18. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    19. Faraway, Julian J. & Augustin, Nicole H., 2018. "When small data beats big data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 142-145.
    20. Sultana DIDI & Ahoud AL HARBY & Salim BOUZEBDA, 2022. "Wavelet Density and Regression Estimators for Functional Stationary and Ergodic Data: Discrete Time," Mathematics, MDPI, vol. 10(19), pages 1-33, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:stapro:v:136:y:2018:i:c:p:15-19. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.