Data learning from big data

Jos\'e L. Torrecilla; Juan Romo

arXiv:1806.03971·stat.OT·June 12, 2018

Data learning from big data

Jos\'e L. Torrecilla, Juan Romo

PDF

TL;DR

This paper discusses the emergence of data learning as a discipline focused on extracting knowledge from big data through statistical analysis, emphasizing its role in managing and understanding large, diverse datasets.

Contribution

It introduces the concept of data learning as a comprehensive framework for handling big data, highlighting the importance of statistics in this new paradigm.

Findings

01

Data learning encompasses collection, storage, preprocessing, visualization, and analysis.

02

Statistics play a central role in extracting knowledge from big data.

03

The paper proposes the term 'data learning' for this integrated approach.

Abstract

Technology is generating a huge and growing availability of observa tions of diverse nature. This big data is placing data learning as a central scientific discipline. It includes collection, storage, preprocessing, visualization and, essentially, statistical analysis of enormous batches of data. In this paper, we discuss the role of statistics regarding some of the issues raised by big data in this new paradigm and also propose the name of data learning to describe all the activities that allow to obtain relevant knowledge from this new source of information.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.