Lynda - Learning Hadoop
Hadoop is indispensible when it comes to processing big data—as necessary to understanding your information as servers are to storing it. This course is your introduction to Hadoop, its file system (HDFS), its processing engine (MapReduce), and its many libraries and programming tools. Developer and big-data consultant Lynn Langit shows how to set up a Hadoop development environment, run and optimize MapReduce jobs, code basic queries with Hive and Pig, and build workflows to schedule jobs. Plus, get a sneak peek at some up-and-coming libraries like Impala and the lightning-fast Spark.
Table of Contents
Introduction1. Why Move Away from Relational Databases?2. What Is Hadoop?3. Understanding Hadoop Core Components4. Setting Up the Hadoop Development Environment5. Understanding MapReduce 1.06. Tuning MapReduce7. Understanding MapReduce 2.0/YARN8. Understanding Hive9. Understanding Pig10. Understanding Workflows and Connectors11. Other Hadoop Libraries12. Understanding Spark13. Visualizing Hadoop Output with ToolsConclusion
TO MAC USERS: If RAR password doesn't work, use this archive program:
RAR Expander 0.8.5 Beta 4 and extract password protected files without error.
TO WIN USERS: If RAR password doesn't work, use this archive program:
Latest Winrar and extract password protected files without error.