->
Lynda - SQL for Exploratory Data Analysis Essential Training - 672259
Lynda - SQL for Exploratory Data Analysis Essential Training
Learn how to use SQL to understand the characteristics of data sets destined for data science and machine learning. The course begins with an introduction to exploratory data analysis and how it differs from hypothesis-driven statistical analysis. Instructor Dan Sullivan explains how SQL queries and statistical calculations, and visualization tools like Excel and R, can help you verify data quality and avoid incorrect assumptions. Next, find out how to perform data-quality checks, reveal and recover missing values, and check business logic. Discover how to use box plots to understand non-normal distribution of data and use histograms to understand the frequency of data values in particular attributes. Dan also explains how to use the chi square test to understand dependencies and measure correlations between attributes. The course concludes with a collection of tips and best practices for exploratory data analysis.


Table of Contents

  • Introduction
  • 1. Introduction to Exploratory Data Analysis
  • 2. Data Quality Checks
  • 3. Calculating Quartiles
  • 4. Histograms
  • 5. Checking Correlation between Attributes
  • Conclusion
  • Lynda - SQL for Exploratory Data Analysis Essential Training


     TO MAC USERS: If RAR password doesn't work, use this archive program: 

    RAR Expander 0.8.5 Beta 4  and extract password protected files without error.


     TO WIN USERS: If RAR password doesn't work, use this archive program: 

    Latest Winrar  and extract password protected files without error.


     Coktum   |  

    Information
    Members of Guests cannot leave comments.




    rss