O'Reilly - Architectural Considerations for Hadoop Applications
Released March 2015 | ISBN: 9781491923313
Implementing solutions with Apache Hadoop requires understanding not just Hadoop, but a broad range of related projects in the Hadoop ecosystem such as Hive, Pig, Oozie, Sqoop, and Flume. The good news is that there's an abundance of materials – books, web sites, conferences, etc. – for gaining a deep understanding of Hadoop and these related projects. The bad news is that there's still a scarcity of information on how to integrate these components to implement complete solutions. In this video we'll walk through an end-to-end case study of a clickstream analytics engine to provide a concrete example of how to architect and implement a complete solution with Hadoop.
- Introduction to Clickstream Case Study 00:11:19
- Requirements 00:08:04
- Data Modeling 00:14:55
- Data Ingest 00:16:16
- Data Processing Engines - Part 1 00:16:23
- Data Processing Engines - Part 2 00:10:59
- Data Processing Patterns 00:09:32
- Orchestration 00:14:34
- Putting It All Together 00:03:08
- Demo 00:21:47
- Q&A 00:24:35