O'Reilly - Introduction to Alluxio
by Calvin Jia | Released June 2016 | ISBN: 9781771376006
Alluxio is the solution of choice for big companies that need to manage data at multi-petabyte scale. In this course, PMC member Calvin Jia offers a full tour of Alluxio for any data scientist, developer, or system administrator looking to improve the performance of their workloads, develop applications with Alluxio, or deploy and manage Alluxio clusters.

He offers a high-level view (why Alluxio was developed, the problems it solves, who uses it, etc.) as well as a hands-on practicum. You'll set up your own deployment (locally and in a cluster) using a compute framework on top of Alluxio, connecting it to multiple persistent data stores while preserving one namespace. Take this course and you'll come away knowing the benefits Alluxio brings to big data stacks.

- Understand the features and benefits of Alluxio and master the basics of how to use it
- Discover why companies like Intel, Baidu, and Alibaba use Alluxio for their big data needs
- Learn how the storage unification layer bridges computation frameworks and storage systems
- Gain practical experience deploying Alluxio in local and cluster modes
- Learn how to use Alluxio tools like the command line and the web UI
- Explore the Alluxio open source ecosystem and learn who the players are

Calvin Jia is a software engineer at Alluxio, Inc. who co-led the "Unified Namespace and Tiered Storage in Alluxio" session at Strata+Hadoop World 2016 in San Jose. He holds a Bachelor of Science (BS) in Electrical Engineering and Computer Science from the University of California, Berkeley.
- Introduction
  - About Alluxio And The Course 00:03:38
  - About The Author 00:01:24
- Using Alluxio Locally
  - Downloading Alluxio 00:03:03
  - Starting The System Locally 00:05:09
  - Interacting Via The Shell 00:02:45
  - Browsing The Web UI 00:03:53
- Examples With Alluxio
  - Setting Up Alluxio With Spark And S3 00:06:15
  - Running Spark On Alluxio With S3 00:05:29
  - Using Alluxio With Unified Namespace 00:06:05
- Deploying Alluxio On A Cluster
  - Deploying Alluxio In AWS 00:07:49
- Conclusion
  - Contributing To The Project And Conclusion 00:03:52