->
Oreilly - Analyzing Data Using Spark 2.0 DataFrames With Python - 9781491986844
Oreilly - Analyzing Data Using Spark 2.0 DataFrames With Python
by Jose Marcial Portilla | Released May 2017 | ISBN: 9781491986837


Apache Spark 2.0 has become the gold standard for processing large datasets. This course, designed for learners with basic Python programming experience, takes you on an introductory journey into the world of big data analysis using Spark 2.0, Python, and the Spark DataFrame API.Beginning with an overview of Spark 2.0 and Python, and then moving into a detailed examination of DataFrames, you'll learn about using SQL with DataFrames, DataFrame dates and timestamps, DataFrame aggregate operations, and about DataFrames and missing data. The course includes a hands-on data analysis exercise using real stock data. Learners should have Python and Spark installed on their computers before starting the class. Gain a core understanding of Spark 2.0 and Spark DataFrames Learn how to use Python with Spark DataFrames Gain big data experience analyzing stock data with Python and Spark DataFramesJose Marcial Portilla is Head of Data Science at SF Bay area based Pierian Data, where he creates and delivers data science and Python training courses for Fortune 500 clients such as Credit Suisse, General Electric, and The New York Times. Jose holds degrees in Mechanical Engineering from Santa Clara University. Show and hide more Publisher resources Download Example Code
  1. Introduction
    • Welcome To The Course 00:00:45
    • About The Author 00:00:28
    • Course Curriculum Overview 00:00:33
  2. Overview Of Spark 2.0
    • What Is Spark 00:01:08
    • Why Spark 2.0 DataFrames 00:00:29
  3. DataFrame Basics
    • Jupyter Notebook Overview 00:02:57
    • Python Review Part One 00:08:11
    • Python Review Part Two 00:08:09
    • Creating A DataFrame 00:02:29
    • Data Input 00:03:01
    • Data Output 00:03:00
    • Getting DataFrame Information 00:02:17
    • Selecting Columns And Rows 00:03:00
    • Creating and Renaming Columns 00:03:57
    • Using SQL With DataFrames 00:02:27
    • Filtering The Data 00:04:52
  4. Spark DataFrame Dates And Timestamps
    • Introduction To Date And Timestamps 00:00:15
    • Working With Dates 00:04:22
    • Working With Timestamps 00:04:03
  5. Spark DataFrame Aggregate Operations
    • Introduction To Aggregate And GroupBy Concepts 00:00:24
    • Spark GroupBy Method 00:03:08
    • Spark Built In Aggregate Methods 00:03:25
    • Sorting And Ordering 00:01:34
  6. Spark DataFrame Working With Missing Data
    • Introduction To Missing Data 00:00:21
    • Dropping Data 00:04:25
    • Filling Missing Data 00:02:46
  7. Spark DataFrame Exercises
    • Introduction To Exercises 00:00:34
    • Exercise Solutions 00:04:05
  8. Thank You
    • What Is Next And Where To Go From Here 00:00:24
  9. Show and hide more

    Oreilly - Analyzing Data Using Spark 2.0 DataFrames With Python


 TO MAC USERS: If RAR password doesn't work, use this archive program: 

RAR Expander 0.8.5 Beta 4  and extract password protected files without error.


 TO WIN USERS: If RAR password doesn't work, use this archive program: 

Latest Winrar  and extract password protected files without error.


 Coktum   |  

Information
Members of Guests cannot leave comments.




rss