O'Reilly - Learning Hadoop 2
Released November 2015 | ISBN: 9781785888113
An introduction to storing, structuring, and analyzing data at scale with Hadoop

About This Video
- Explore Hadoop and its ecosystem of core components, and set up an instance
- Import, organize, and query data with HDFS, Flume, Sqoop, and Hive
- Learn Pig, a simplified scripting language for Hadoop, to manipulate your data

In Detail
Hadoop emerged in response to the vast volumes of data collected by organizations, offering a strong solution to store, process, and analyze what has become commonly known as Big Data. It comprises a comprehensive stack of components designed to enable these tasks on a distributed scale, across multiple servers and thousands of machines. Learning Hadoop 2 introduces you to the powerful system synonymous with Big Data, demonstrating how to create an instance and leverage the Hadoop ecosystem's many components to store, process, manage, and query massive data sets with confidence.

We open this course with an overview of the Hadoop component ecosystem, including HDFS, Sqoop, Flume, YARN, MapReduce, Pig, and Hive, before installing and configuring our Hadoop environment. We then take a look at Hue, the graphical user interface for Hadoop.

Next, we will discover HDFS, Hadoop's file system used to store data, and learn how to import and export data, both manually and automatically. Afterward, we turn our attention toward running computations using MapReduce, and get to grips with Hadoop's scripting language, Pig. Lastly, we will siphon data from HDFS into Hive, and demonstrate how Hive can be used to structure and query data sets.
- Chapter 1 : The Hadoop Ecosystem
- The Course Overview 00:01:52
- Overview of HDFS and YARN 00:07:25
- Overview of Sqoop and Flume 00:03:18
- Overview of MapReduce 00:03:39
- Overview of Pig 00:03:04
- Overview of Hive 00:06:34
- Chapter 2 : Installing and Configuring Hadoop
- Downloading and Installing Hadoop 00:02:53
- Exploring Hue 00:05:24
- Chapter 3 : Data Import and Export
- Manual Import 00:04:33
- Importing from Databases Using Sqoop 00:06:27
- Using Flume to Import Streaming Data 00:05:08
- Chapter 4 : Using MapReduce and Pig
- Coding "Word Count" in MapReduce 00:05:55
- Coding "Word Count" in Pig 00:02:30
- Performing Common ETL Functions in Pig 00:08:48
- Using User-defined Functions in Pig 00:05:58
- Chapter 5 : Using Hive
- Importing Data from HDFS into Hive 00:04:57
- Importing Data Directly from a Database 00:02:23
- Performing Basic Queries in Hive 00:06:59
- Putting It All Together 00:02:16