Big Data Analytics 简明教程

Discuss Big Data Analytics

过去十年中,人们必须处理的数据量已经爆炸式增长,与此同时,数据存储的价格也系统性下降。私营公司和研究机构捕获了海量有关用户交互、业务、社交媒体以及移动电话和汽车等设备传感器的字节数据。这个时代的挑战是要理解这片数据之海。

The volume of data that one has to deal has exploded to unimaginable levels in the past decade, and at the same time, the price of data storage has systematically reduced. Private companies and research institutions capture terabytes of data about their users’ interactions, business, social media, and also sensors from devices such as mobile phones and automobiles. The challenge of this era is to make sense of this sea of data.

这就是 big data analytics 的用武之地。大数据分析很大程度上涉及从不同来源收集数据,对其进行整理,使其可供分析师使用,并最终交付对组织业务有用的数据产品。将从不同来源检索的大量非结构化原始数据转换为对组织有用的数据产品是构成大数据分析核心的过程。在本教程中,我们将讨论大数据分析的最基本概念和方法。

This is where big data analytics comes into picture. Big Data Analytics largely involves collecting data from different sources, munge it in a way that it becomes available to be consumed by analysts and finally deliver data products useful to the organization business. The process of converting large amounts of unstructured raw data, retrieved from different sources to a data product useful for organizations forms the core of Big Data Analytics. In this tutorial, we will discuss the most fundamental concepts and methods of Big Data Analytics.