O'Reilly - Hands-On Amazon Redshift for Data Warehousing
by Colibri Digital | Released January 2019 | ISBN: 9781838558888
Build scalable, serverless data warehouses with machine learning and massively parallel processing in the cloud with Amazon Redshift

About This Video
- Kick traditional data warehouse technologies into touch with a combination of cloud hosting and cutting-edge optimization algorithms
- Go from data warehouse fundamentals to a fully functioning peta-scale data warehouse in just 3 hours, and learn everything you need to build your own cloud data warehouse
- Learn the do's and don'ts of data warehousing with this simple hands-on guide to building data warehouses on AWS

In Detail
Amazon Redshift is a low-cost cloud data platform that can scale from gigabytes to petabytes on a high-performance, column-oriented SQL engine. Amazon Redshift brings the power of scale-out architecture to the world of traditional data warehousing.

In this course, you will explore this low-cost, cloud-based storage, which can be scaled up or down to meet your true size and performance needs. You will learn to configure a sample data warehouse. Next, you will explore Redshift's internal workings and architecture, and learn what makes it so fast. You will get hands-on experience connecting, querying, and building BI and data viz products and learn how to secure, maintain, and administer your new platform.

By the end of this course, you will be able to scale from gigabytes to petabytes on this high-performance, column-oriented SQL engine.
- Chapter 1 : Data Warehousing for the Internet Age
- The Course Overview 00:02:51
- Do We Still Need a Data Warehouse? 00:06:20
- Data Technologies Compared: Relational, Data Warehouse, NoSQL, and Big Data 00:05:04
- Providing Business Intelligence on Internet-Scale Data 00:05:38
- Cloud-Native Data Warehousing 00:03:34
- Chapter 2 : Getting Started with Redshift
- Launching a Redshift Data Warehouse on AWS 00:05:45
- Launching a Redshift Data Warehouse Using CloudFormation 00:08:08
- Redshift Technology Deep Dive: Columnar Filesystem 00:05:15
- Redshift Technology Deep Dive: Massively Parallel Processing 00:04:20
- Chapter 3 : Creating a Redshift Data Warehouse from Disparate Datasets
- Sourcing Appropriate Data Sets 00:05:02
- Ingesting Various Sizes of Data Set into Redshift 00:09:15
- Connecting to and Querying the Data Warehouse 00:05:22
- Redshift Technology Deep Dive: Query Caching 00:04:47
- Chapter 4 : Optimizing Redshift for Scale
- Ingesting Enormous Volumes of Data by Copying Directly from S3 00:07:16
- Optimizing Redshift Data Types for Query Performance at Scale 00:05:02
- Evenly Distributing Data Across Your Cluster to Improve Filters and Joins 00:06:43
- Chapter 5 : Connecting Redshift with Disconnected Data Using Redshift Spectrum
- Exploratory Analytics for Disconnected Data 00:06:43
- Loading a Disconnected Dataset 00:07:56
- Glue Data Catalog - Creating a Schema for the External Dataset 00:06:55
- Chapter 6 : Visualizing Your Results with Amazon QuickSight
- The BI Use Case for Data Warehousing 00:04:59
- Introducing Amazon QuickSight 00:06:06
- What Is SPICE and How Can It Be Used to Accelerate Analysis? 00:04:51
- Loading Data into SPICE 00:05:55