Oreilly - Getting Started with Hadoop 2.x
by A K M Zahiduzzaman | Released April 2017 | ISBN: 9781787122550
To build strong foundation by exploring Hadoop ecosystem with real-world examples. About This VideoGet a better understanding of how to set up a HDFS cluster between local storage and the Hadoop filesystemRun your own Hadoop clusters on your own machine or in the cloudImplement the best practices for Hadoop developmentIn DetailHadoop emerged in response to the proliferation of masses and masses of data collected by organizations, offering a strong solution to store, process, and analyze what has commonly become known as Big Data. It comprises a comprehensive stack of components designed to enable these tasks on a distributed scale, across multiple servers and thousands of machines.This course introduces you to the powerful system synonymous with Big Data, demonstrating how to create an instance and leverage Hadoop ecosystem's many components to store, process, manage, and query massive data sets with confidence.The video course opens with an introduction to the world of Hadoop, where we discuss Nodes, Data Sets, and operations such as map and reduce. The second section deals HDFS, Hadoop's file-system used to store data. Further on, you'll discover the differences between jobs and tasks, and get to know about the Hadoop UI. After this, we turn our attention to storing data in HDFS and Data Transformations. Lastly, we will learn how to implement an algorithm in Hadoop map-reduce way and analyze the overall performance. Show and hide more
- Chapter 1 : Intro to the Hadoop World
- The Course Overview 00:03:44
- Installing Hadoop in Local 00:22:51
- Bring Process to Data 00:04:43
- NameNode Versus DataNode 00:04:15
- Map and Reduce Operations 00:08:00
- Order of Execution and Parallel Thinking 00:04:39
- Chapter 2 : File System Overdrive with HDFS
- Formatting a HDFS 00:06:38
- Formatting a HDFS 00:04:34
- Some Helpful Commands to Communicate with the HDFS 00:03:35
- HDFS Protocol and Using It in Applications 00:11:12
- Chapter 3 : Let's Run Some Hadoop Jobs
- Hadoop Jobs Versus Tasks 00:04:47
- The Hadoop UI for Task Progress 00:04:06
- Running a Couple of Example Jobs 00:10:09
- Analyze the Work Flow/Data Flow/Process Flow 00:07:26
- Chapter 4 : It's Show Time
- Introduction to the Movie Dataset 00:04:05
- Data Transformation and Storing to HDFS 00:17:55
- Devise a Simple Algorithm for Recommendation 00:04:07
- Implement the Algorithm in Hadoop Map-Reduce Way and Analyze Performance 00:10:39
Show and hide more