Lynda - Data Science on Google Cloud Platform: Building Data Pipelines - 699344
Lynda - Data Science on Google Cloud Platform: Building Data Pipelines
Cloud computing brings unlimited scalability and elasticity to data science applications. Expertise in the major platforms, such as Google Cloud Platform (GCP), is essential to the IT professional. This course—one of a series by veteran cloud engineering specialist and data scientists Kumaran Ponnambalam—shows how to use the latest technologies in GCP to build a big data pipeline that ingests, transports, and transforms data entirely in the cloud. Learn how to set up data processing jobs using Apache Beam and Cloud Dataflow. Discover how to leverage Cloud Pub/Sub for stream ingestion and real-time messaging. Finally, find out how to process the stream events in Cloud Dataflow. The course uses an end-to-end use case that shows how to apply the knowledge and best practices from the course in a practical data science workflow.


Table of Contents

  • Introduction
  • 1. GCP Data Pipeline Products
  • 2. Apache Beam
  • 3. Setting Up Dataflow
  • 4. Data Processing with Beam and Dataflow
  • 5. Cloud Pub/Sub
  • 6. Streaming with Dataflow
  • Conclusion
  • Lynda - Data Science on Google Cloud Platform: Building Data Pipelines


    Information
    Members of Guests cannot leave comments.




    rss