|
1 | 1 | <properties
|
2 |
| - pageTitle="Introduction to Hadoop in HDInsight: Big data analysis in the cloud | Azure" |
| 2 | + pageTitle="Introduction to Hadoop in the cloud: Big data analysis | Azure" |
3 | 3 | description="An introduction to the Hadoop components on HDInsight. Learn how HDInsight uses Hadoop clusters in the cloud to manage, analyze, and report on big data."
|
4 | 4 | services="hdinsight"
|
5 | 5 | documentationCenter=""
|
|
19 | 19 |
|
20 | 20 | # Introduction to Hadoop in HDInsight: Big-data processing and analysis in the cloud
|
21 | 21 |
|
22 |
| -Get an introduction to the Hadoop ecosystem in Azure HDInsight - components, common terminology, and scenarios. Also, find out about tutorials and resources for using Hadoop in HDInsight. |
| 22 | +Get an introduction to the Hadoop ecosystem in Azure HDInsight - components, common terminology, and solutions. Also, find out about documentation, tutorials, and resources for using Hadoop in HDInsight. |
| 23 | + |
| 24 | +## What is Hadoop in HDInsight? |
23 | 25 |
|
24 | 26 | Azure HDInsight deploys and provisions Apache Hadoop clusters in the cloud, providing a software framework designed to manage, analyze, and report on big data. The Hadoop core provides reliable data storage with the Hadoop Distributed File System (HDFS), and a simple MapReduce programming model to process and analyze, in parallel, the data stored in this distributed system.
|
25 | 27 |
|
26 | 28 |
|
27 |
| -### What is big data? |
| 29 | +## What is big data? |
28 | 30 | Big data refers to data being collected in ever-escalating volumes, at increasingly high velocities, and for a widening variety of unstructured formats and variable semantic contexts.
|
29 | 31 |
|
30 | 32 | Big data describes any large body of digital information, from the text in a Twitter feed, to the sensor information from industrial equipment, to information about customer browsing and purchases on an online catalog. Big data can be historical (meaning stored data) or real-time (meaning streamed directly from the source).
|
31 | 33 |
|
32 | 34 | For big data to provide actionable intelligence or insight, not only must the right questions be asked and data be relevant to the issues be collected, the data must be accessible, cleaned, analyzed, and then presented in a useful way. That's where Hadoop in HDInsight can help.
|
33 | 35 |
|
34 |
| -## In this article |
35 |
| - |
36 |
| -This article provides an overview of Hadoop on HDInsight, including: |
37 |
| - |
38 |
| -* **[Overview of the Hadoop ecosystem on HDInsight](#overview)** - HDInsight is the Hadoop solution on Azure and provides implementations of Storm, HBase, Pig, Hive, Sqoop, Oozie, Ambari, and so on. HDInsight also integrates with business intelligence (BI) tools such as Excel, SQL Server Analysis Services, and SQL Server Reporting Services. |
39 |
| - |
40 |
| -* **[Advantages of Hadoop in the cloud](#advantage)** - Reasons you should consider the HDInsight cloud implementation of Hadoop. |
41 |
| - |
42 |
| -* **[HDInsight solutions for big-data analysis](#solutions)** - Some practical ways you can use HDInsight to answer questions for your organization, from analyzing Twitter sentiment to analyzing HVAC system effectiveness. |
43 |
| - |
44 |
| -* **[Resources for learning more about big-data analysis, Hadoop, and HDInsight](#resources)** - Links to additional information. |
45 | 36 |
|
46 | 37 | ## <a name="overview"></a>Overview of the Hadoop ecosystem on HDInsight
|
47 | 38 |
|
48 |
| -Apache Hadoop is the rapidly expanding technology stack that is the go-to solution for big-data analysis. HDInsight is the framework for the Microsoft Azure cloud implementation of Hadoop. |
| 39 | +Apache Hadoop is the rapidly expanding technology stack that is the go-to solution for big-data analysis. HDInsight is framework for the Microsoft Azure cloud implementation of Hadoop. It includes implementations of Storm, HBase, Pig, Hive, Sqoop, Oozie, Ambari, and so on. HDInsight also integrates with business intelligence (BI) tools such as Excel, SQL Server Analysis Services, and SQL Server Reporting Services. |
| 40 | + |
49 | 41 |
|
50 | 42 | * Azure HDInsight deploys and provisions Hadoop clusters in the cloud, by using either **Linux** or **Windows** as the underlying OS.
|
51 | 43 |
|
|
0 commit comments