->
Lynda - Apache PySpark by Example - 802868
Lynda - Apache PySpark by Example
Want to get up and running with Apache Spark as soon as possible? If you're well versed in Python, the Spark Python API (PySpark) is your ticket to accessing the power of this hugely popular big data platform. This practical, hands-on course helps you get comfortable with PySpark, explaining what it has to offer and how it can enhance your data science work. To begin, instructor Jonathan Fernandes digs into the Spark ecosystem, detailing its advantages over other data science platforms, APIs, and tool sets. Next, he looks at the DataFrame API and how it's the platform's answer to many big data challenges. Finally, he goes over Resilient Distributed Datasets (RDDs), the building blocks of Spark.


Table of Contents

  • Introduction
  • 1. Introduction to Apache Spark
  • 2. Technical Setup
  • 3. Working with the DataFrame API
  • 4. Functions
  • 5. Resilient Distributed Datasets (RDDs)
  • Conclusion
  • Lynda - Apache PySpark by Example


     TO MAC USERS: If RAR password doesn't work, use this archive program: 

    RAR Expander 0.8.5 Beta 4  and extract password protected files without error.


     TO WIN USERS: If RAR password doesn't work, use this archive program: 

    Latest Winrar  and extract password protected files without error.


     Coktum   |  

    Information
    Members of Guests cannot leave comments.




    rss