O'Reilly - Data Science and Machine Learning with Python - Hands On!
by Frank Kane | Released September 2016 | ISBN: 9781787127081
Perform data mining and machine learning efficiently using Python and Spark.

About This Video
- Take your first steps in the world of data science by understanding the tools and techniques of data analysis
- Train efficient machine learning models in Python using supervised and unsupervised learning methods
- Learn how to use Apache Spark for processing Big Data efficiently

In Detail
The job of a data scientist is one of the most lucrative jobs out there today. It involves analyzing large amounts of data and gathering actionable business insights from it using a variety of tools. This course will help you take your first steps in the world of data science and empower you to conduct data analysis and perform efficient machine learning using Python. Gain value from your data using various data mining and data analysis techniques in Python, and develop efficient predictive models to predict future results. You will also learn how to perform large-scale machine learning on Big Data using Apache Spark. You don't have to be an expert Python coder to get the most out of this course; basic programming knowledge of Python is sufficient.

Downloading the example code for this course: You can download the example code files for all Packt video courses you have purchased from your account at http://www.PacktPub.com. If you purchased this course elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.
- Chapter 1 : Getting Started
- Introduction 00:02:45
- Getting What You Need 00:02:37
- Installing Enthought Canopy 00:06:20
- Python Basics – Part 1 00:15:58
- Python Basics – Part 2 00:09:41
- Running Python Scripts 00:03:55
- Introducing the Pandas Library 00:10:15
- Chapter 2 : Statistics and Probability Refresher, and Python Practice
- Types of Data 00:06:59
- Mean, Median, and Mode 00:05:26
- Using Mean, Median, and Mode in Python 00:08:30
- Variation and Standard Deviation 00:11:12
- Probability Density Function and Probability Mass Function 00:03:28
- Common Data Distributions 00:07:45
- Percentiles and Moments 00:12:33
- A Crash Course in matplotlib 00:13:46
- Covariance and Correlation 00:11:31
- [Exercise] Conditional Probability 00:10:16
- Exercise Solution – Conditional Probability of Purchase by Age 00:02:19
- Bayes' Theorem 00:05:23
- Chapter 3 : Predictive Models
- Linear Regression 00:11:01
- Polynomial Regression 00:08:05
- [Activity] Multivariate Regression and Predicting Car Prices 00:09:53
- Multi-Level Models 00:04:37
- Chapter 4 : Machine Learning with Python
- Supervised versus Unsupervised Learning and Train/Test 00:08:57
- Using Train/Test to Prevent Overfitting of a Polynomial Regression 00:05:48
- Bayesian Methods – Concepts 00:04:00
- Implementing a Spam Classifier with Naive Bayes 00:08:06
- K-Means Clustering 00:07:24
- Clustering People Based on Income and Age 00:05:14
- Measuring Entropy 00:03:10
- Decision Trees – Concepts 00:08:43
- Decision Trees – Predicting Hiring Decisions 00:09:47
- Ensemble Learning 00:05:59
- Support Vector Machines (SVM) Overview 00:04:28
- Using SVM to Cluster People by using scikit-learn 00:05:36
- Chapter 5 : Recommender Systems
- User-Based Collaborative Filtering 00:07:57
- Item-Based Collaborative Filtering 00:08:16
- Finding Movie Similarities 00:09:08
- Improving the Results of Movie Similarities 00:08:00
- Making Movie Recommendations to People 00:10:22
- Improve the Recommender's Results 00:05:30
- Chapter 6 : More Data Mining and Machine Learning Techniques
- K-Nearest Neighbors – Concepts 00:03:45
- Using KNN to predict a rating for a movie 00:12:29
- Dimensionality Reduction and Principal Component Analysis 00:05:44
- A PCA Example with the Iris Dataset 00:09:05
- Data Warehousing Overview – ETL and ELT 00:09:05
- Reinforcement Learning 00:12:44
- Chapter 7 : Dealing with Real-World Data
- Bias/Variance Trade-off 00:06:16
- K-Fold Cross-Validation to Avoid Overfitting 00:10:55
- Data Cleaning and Normalization 00:07:10
- Cleaning Web Log Data 00:10:56
- Normalizing Numerical Data 00:03:23
- Detecting Outliers 00:07:00
- Chapter 8 : Apache Spark – Machine Learning on Big Data
- Installing Spark – Part 1 00:07:03
- Installing Spark – Part 2 00:13:29
- Spark Introduction 00:09:11
- Spark and the Resilient Distributed Dataset (RDD) 00:11:42
- Introducing MLlib 00:05:09
- Decision Trees in Spark 00:16:01
- K-Means Clustering in Spark 00:11:07
- TF/IDF 00:06:44
- Searching Wikipedia with Spark 00:08:12
- Using the Spark 2.0 DataFrame API for MLlib 00:07:57
- Chapter 9 : Experimental Design
- A/B Testing Concepts 00:08:23
- T-Tests and P-Values 00:06:00
- Hands On with T-Tests 00:06:04
- Determining How Long to Run an Experiment 00:03:25
- A/B Test Gotchas 00:09:27
- Chapter 10 : Deep Learning and Neural Networks
- Deep Learning Pre-Requisites 00:10:51
- The History of Artificial Neural Networks 00:11:15
- [Activity] Deep Learning in the TensorFlow Playground 00:12:00
- Deep Learning Details 00:09:30
- Introducing TensorFlow 00:12:40
- [Activity] Using TensorFlow, Part 1 00:09:50
- [Activity] Using TensorFlow, Part 2 00:13:28
- [Activity] Introducing Keras 00:13:50
- [Activity] Using Keras to Predict Political Affiliations 00:12:24
- Convolutional Neural Networks (CNNs) 00:11:28
- [Activity] Using CNNs for handwriting recognition 00:08:12
- Recurrent Neural Networks (RNNs) 00:11:03
- [Activity] Using an RNN for sentiment analysis 00:10:02
- The Ethics of Deep Learning 00:11:02
- Learning More about Deep Learning 00:01:45
- Chapter 11 : Final Project
- Your final project assignment 00:06:26
- Final Project Review 00:08:59
- Chapter 12 : You Made It!
- More to Explore 00:02:59
- Bonus Video: Discounts on my Spark and MapReduce courses! 00:01:06