O'Reilly - Strata Data Conference 2019 - San Francisco, California - 9781492050520
by O'Reilly Media, Inc. | Released March 2019 | ISBN: 9781492050520


Thousands of data scientists, analysts, engineers, developers, and executives converged at the Strata Data Conference San Francisco in March 2019 to absorb the insights and wisdom of the data world's best minds. The conference featured more than 300 speakers, 10 keynotes, 10 tutorials, and 150+ technical sessions. This video compilation captures the best from the conference, offering more than 100 hours of material to review at your own pace. Highlights include:

    • The Strata Business Summit: speakers, executive briefings, and tech sessions laser-focused on a central theme: How do the world's leading companies build their successful data strategies? Learn about recommendation engines, AI-based personalization solutions, data governance, ML-based customer insight harvesting, and more from data wizards like Zachery Anderson (Electronic Arts), Eric Bradlow (The Wharton School), David Talby (Pacific AI), Paco Nathan (derwen.ai), Jonathan Francis (Starbucks), and JoLynn Lavin (General Mills).
    • The Strata Data Ethics Summit: Is your AI really making good decisions, or have you built a deceptive black box that reinforces ugly stereotypes? Alistair Croll (Strata Chair), Tim O'Reilly (O'Reilly Media), and Susan Etlinger (Altimeter Group) lead an eight-hour deep dive into the thorny issues of data and algorithms, with help from Jana Eggers (Nara Logics), Rumman Chowdhury (Accenture), Kathy Baxter (Salesforce), Carole Piovesan (McCarthy Tétrault), and more.
    • Hours of tutorials from the world's top data engineers, such as Francesca Lazzeri (Microsoft) and Holden Karau (Google) on training and deploying models with Kubeflow across different cloud vendors; Dean Wampler (Lightbend) on performing machine learning using Kafka-based streaming pipelines; and Jason Dai (Intel) on the Analytics Zoo, an analytics/AI platform that seamlessly unites Spark, TensorFlow, Keras, and BigDL programs into an integrated pipeline.
    • Sessions devoted to Data Science, Machine Learning & AI, including Sharad Goel (Stanford University) on the challenges of "fair machine learning", which aims to ensure that decisions guided by algorithms are equitable; Kelley Rivoire (Stripe) on scaling machine learning using the Railyard API; Vinod Vaikuntanathan (MIT) on performing machine learning on encrypted data; and Jeremy Howard (platform.ai) on recent advances in deep learning that allow non-engineers to train neural networks from scratch without needing code or pre-existing labels.
    • Sessions focused on Data Engineering & Architecture, including Karthik Ramasamy (Streamlio) on reducing stream processing complexity using Apache Pulsar Functions; Rachel Warren's (Salesforce Einstein) review of Spark tuning; and Tobias Knaup (Mesosphere) on the critical learning that must take place before and after you've trained and modeled your deep learning models.
    • Multiple sessions on Streaming and IoT, Business Analytics, UX and Visualization, plus keynotes from AI/cryptography expert Shafi Goldwasser (UC Berkeley) and from "Likewar: The Weaponization of Social Media" co-author Peter Singer.
    • The Future of the Firm: a six-session mini-conference within Strata that highlights how leading-edge tech companies adapt to the workforce, business, and economic trends shaping the future of business. Led by Josh Bersin (Founder, Bersin by Deloitte) and executives from Capital One, Bloomberg Beta, Genentech, Publicis Sapient, and more.
  1. Keynotes
    • The journey to the data-driven enterprise from the edge to AI - Amy O'Connor (Cloudera) 00:14:24
    • Scoring your business in the AI Matrix (sponsored by Dataiku) - Jed Dougherty (Dataiku) 00:05:26
    • Sustaining machine learning in the enterprise - Ben Lorica (O'Reilly Media) 00:12:51
    • Cyberconflict: A new era of war, sabotage, and fear - David Sanger (The New York Times) 00:28:04
    • Streamlining your Data Assets: A Strategy for the Journey to AI (sponsored by IBM) - Dinesh Nirmal (IBM) 00:05:02
    • AI and cryptography: Challenges and opportunities - Shafi Goldwasser (UC Berkeley | MIT | Weizmann Institute of Science | Duality) 00:22:19
    • Hacking the vote: The neuropolitical universe - Elizabeth Svoboda (What Makes a Hero?) 00:08:16
    • Data warehousing is not a use case (sponsored by Google Cloud) - Jordan Tigani (Google) 00:09:18
    • Chatting with machines: Strange things 60 billion bot logs say about human nature - Lauren Kunze (Pandorabots) 00:13:26
    • The enterprise data cloud - Mike Olson (Cloudera) 00:09:45
    • Forecasting uncertainty at Airbnb - Theresa Johnson (Airbnb) 00:10:22
    • The cyberthreat-scape: Key trends in cybersecurity - Peter Singer (New America) 00:20:05
    • Strata Data Awards: Winners Announced 00:02:38
  2. Data Science, Machine Learning & AI
    • Ludwig, a code-free deep learning toolbox - Piero Molino (Uber AI) 00:36:08
    • Natural language understanding in task-oriented conversational AI - Sonal Gupta (Facebook) 00:41:13
    • Applying deep learning at Google for recommendations - Ron Bodkin (Google) 00:48:29
    • The measure and mismeasure of fairness in machine learning - Sharad Goel (Stanford University) 00:51:39
    • New models for generating training data for AI - Roger Chen (Computable) 00:39:11
    • From an archived data field to GO-JEK’s world-class product feature for customer experience - Divya Choudhary (GO-JEK) 00:36:15
    • Talking to the machines: Monitoring production machine learning systems - Ting-Fang Yen (DataVisor) 00:27:30
    • Online evaluation of machine learning models - Ted Dunning (MapR) 00:40:31
    • Artificial intelligence on human behavior: New insights into customer segmentation - Melinda Han Williams (Dstillery) 00:34:41
    • Machine learning prediction of blood alcohol content: A digital signature of behavior - Kirstin Aschbacher (UCSF Cardiology) 00:47:16
    • NLP from scratch: Solving the cold start problem for natural language processing - Michael Johnson (Lockheed Martin), Norris Heintzelman (Lockheed Martin) 00:43:17
    • Building high-performance text classifiers on a limited labeling budget - Mario Inchiosa (Microsoft), Robert Horton (Microsoft), Ali Zaidi (Microsoft) 00:44:23
    • Masquerading Malicious DNS Traffic - David Rodriguez (Cisco Systems) 00:37:59
    • On a Deep Journey Towards Five Nines - Aashish Sheshadri (PayPal Inc) 00:40:39
    • The magic behind your Lyft ride prices - a case study of Machine Learning and Streaming - Rakesh Kumar (Lyft Inc), Thomas Weise (Lyft) 00:44:32
    • Efficient Multi-armed Bandit with Thompson Sampling for applications with Delayed feedback - Shradha Agrawal (Adobe Systems Inc) 00:42:18
    • The future of machine learning is decentralized - Alex Ingerman (Google) 00:40:01
    • Federated learning - Mike Lee Williams (Cloudera Fast Forward Labs) 00:52:12
    • Anomaly detection using deep learning to measure quality of Large Datasets - Sridhar Alla (Comcast), Syed Nasar (Cloudera) 00:40:59
    • Our New Publishing Platform Will Make You A Better Writer: Using AI To Assist The Newsroom - Boris Yakubchik (Forbes) 00:38:29
    • Applied Machine Learning In Finance - Chakri Cherukuri (Bloomberg LP) 00:34:11
    • Nutrition Data Science - Noah Gift (UC Davis), Michelle Davenport (Quantitative Nutrition) 00:38:42
    • Deploying Data Science for National Economic Statistics - Jeff Chen (US Bureau of Economic Analysis) 00:39:55
    • Cloud-Native Machine Learning: Emerging Trends and the Road Ahead - Tristan Zajonc (Cloudera), Tim Chen (Cloudera) 00:40:16
    • Machine Learning for Preventive Maintenance of Mining Haul Trucks - Alex Gorbachev (Pythian), Paul Spiegelhalter (The Pythian Group) 01:07:28
    • Testing ad content with survey experiments - Patrick Miller (Civis Analytics) 00:29:49
    • Using Deep Learning to automatically rank millions of hotel images - Christopher Lennan (idealo.de) 00:35:22
    • Personalizing the guest-booking experience at Airbnb - Kapil Gupta (Airbnb) 00:40:32
    • Real Time Analytics on Deep Learning: when Tensorflow meets Presto at Uber - Zhenxiao Luo (Uber) 00:42:55
    • Interpretable and Resilient AI for Financial Services - Jari Koister (FICO) 00:47:32
  3. Sponsored
    • Managing globally distributed data for deep learning using TensorFlow on YARN (sponsored by WANdisco) - Jagane Sundar (WANdisco) 00:29:53
    • High-performance data lakes for AI workloads using object storage (sponsored by MinIO) - Scott Mcclellan (PRGX) 00:50:39
    • Augmented OLAP for big data from on-premises to multicloud (sponsored by Kyligence) - Yang Li (Kyligence) 00:39:12
    • How to compete in the AI arms race (sponsored by Oracle Cloud Infrastructure) - Ian Swanson (Oracle) 00:22:36
    • The death of coding: How AI redefines our relationship with computers (sponsored by IBM) - Sam Lightstone (IBM) 00:46:45
    • From data to discovery: The power of choice and control (sponsored by SAS) - Sarah Gates (SAS) 00:28:33
    • Intelligent design patterns for cloud-based analytics and BI (sponsored by Arcadia Data) - Priyank Patel (Arcadia Data) 00:32:59
    • Transforming AI, ML, and BI on big data at Verizon (sponsored by Kyvos Insights) - Syed Latheef (Verizon) 00:37:05
    • The new frontier: Marsh’s data voyage into the public cloud (sponsored by Impetus) - Stephen Dantu (Marsh) 00:48:19
    • Uncovering the next generation of data architecture for insights at the speed of thought (sponsored by Actian) - Raghu Chakravarthi (Actian) 00:37:39
    • Walmart's journey from business intelligence to artificial intelligence (sponsored by Walmart Labs) - Prakhar Mehrotra (Walmart Labs) 00:41:55
    • Rethinking big data analytics with Google Cloud (sponsored by Google Cloud) - Jordan Tigani (Google) 00:35:54
    • Break through the limits of your current database (sponsored by MemSQL) - Franck Leveneur (WAG Walking) 00:32:54
    • Solving the enterprise data dilemma (sponsored by erwin, Inc.) - Adam Famularo (erwin, Inc.) 00:39:03
    • Strategies for leveraging legacy data for real time, cloud, and cluster (sponsored by Syncsort) - Ashwin Ramachandran (Syncsort) 00:36:34
    • Go serverless with Elasticsearch: Eliminate scaling and performance bottlenecks for faster data search (sponsored by Vizion.ai) - Geoff Tudor (Vizion.ai) 00:32:10
  4. Data Engineering & Architecture
    • Cloud programming simplified: A Berkeley view on serverless computing - Eric Jonas (UC Berkeley) 00:37:39
    • MLflow: An open platform to simplify the machine learning lifecycle - Corey Zumar (Databricks) 00:40:24
    • Automation of root cause analysis for big data stack applications - Alkis Simitsis (Micro Focus), Shivnath Babu (Unravel Data Systems | Duke University) 00:36:32
    • How Intuit reduced time to reliable insights for data pipelines - Sandeep U (Intuit) 00:45:09
    • ROCKSET: The design and implementation of a data system for low-latency queries for search and analytics - Igor Canadi (Rockset), Dhruba Borthakur (Rockset) 00:43:54
    • How Netflix measures app performance on 250 million unique devices across 190 countries - Vivek Pasari (Netflix) 00:50:24
    • Adaptive ETL to optimize query performance at Lyft - James Taylor (Lyft) 00:35:37
    • Accelerating analytical antelopes: Integrating Apache Kudu's RPC into Apache Impala - Lars Volker (Cloudera), Michael Ho (Cloudera) 00:41:18
    • When SQL users run wild: Resource management features and techniques to tame Apache Impala - Tim Armstrong (Cloudera) 00:44:40
    • Cloud-native data pipelines with Apache Kafka - Gwen Shapira (Confluent) 00:42:57
    • Serverless workflows for orchestrating hybrid cluster-based and serverless processing - Rustem Feyzkhanov (Instrumental) 00:23:23
    • ML and AI at scale at PayPal - Subhadra Tatavarti (PayPal), Chen Kovacs (PayPal) 00:39:15
    • Taking graph applications to production - Denise Gosnell (DataStax) 00:41:29
    • Bullet: Querying streaming data in transit with sketches - Akshai Sarma (Oath), Nathan Speidel (Yahoo) 00:40:11
    • Clusters in Kubernetes on a cluster: Building a multitenant environment for the field - Paul Curtis (MapR Technologies) 00:41:04
    • Managing Uber's Data Workflows at Scale - Alex Kira (Uber) 00:34:22
    • Presto: Tuning Performance of SQL-on-Anything Analytics - Kamil Bajda-Pawlikowski (Starburst), Martin Traverso (Facebook) 00:40:43
    • Real-time monitoring of Twitter network infrastructure with Heron - Julien Delange (Twitter), Neng Lu (Twitter) 00:35:31
    • Persistent Storage for Machine Learning in KubeFlow - Skyler Thomas (MapR), Terry He (MapR Technologies) 00:42:40
    • Live-Aggregators: A Scalable, Cost Effective and Reliable Way of Aggregating Billions of Messages in Realtime - Osman Sarood (Mist Systems), Chunky Gupta (Mist Systems) 00:39:32
    • From flat files to deconstructed database: The evolution and future of the big data ecosystem - Julien Le Dem (WeWork) 00:43:49
    • Building Rakuten Analytics: A Story of Evolutions - Juan Paulo Gutierrez (Rakuten) 00:36:16
    • Transforming behavioural analytics at Atlassian - Rohan Dhupelia (Atlassian), Jimmy Li (Atlassian) 00:37:09
    • Taming large-state to join datasets for Personalization - Sonali Sharma (Netflix), Shriya Arora (Netflix) 00:38:27
    • Spark Adaptive Execution: Unleash the Power of Spark SQL - Haifeng Chen (Intel) 00:34:45
    • New Directions in Record Linkage - Yves Thibaudeau (U.S. Census Bureau) 00:37:26
    • Netflix - The journey towards a self-service data platform - Kurt Brown (Netflix) 00:37:17
    • How to Protect Big Data in a Containerized Environment - Thomas Phelan (BlueData) 00:39:55
    • Data Science in Deutsche Telekom - Predicting global travel patterns and network demand - Václav Surovec (Deutsche Telekom IT), Gabor Kotalik (Deutsche Telekom AG) 00:33:25
    • Optimizing Computing Clusters Resource Utilization with In-Memory Distributed File System - Shouwei Chen (Rutgers University), Yue Li (MemVerge) 00:42:11
    • Put Kafka in jail with Strimzi - Sean Glover (Lightbend) 00:39:01
    • Disrupting Data Discovery - Mark Grover (Lyft), Tao Feng (Lyft) 00:42:06
    • Scaling Apache Spark on Kubernetes at Lyft - Li Gao (Lyft Inc.), Bill Graham (Lyft Inc.) 00:32:33
    • Faster ML over Joins of Tables - Arun Kumar (University of California, San Diego) 00:42:36
    • Scanner: Efficient Video Analysis at Scale - Alex Poms (Stanford University), Will Crichton (Stanford University) 00:40:20
    • Automating DevOps for Machine Learning - Diego Oppenheimer (Algorithmia) 00:40:10
    • Database migrations don't have to be painful, but the road will be bumpy - Adrian Lungu (Adobe), Serban Teodorescu (Adobe) 00:31:07
    • Cost Effective Presto on AWS with Spot Nodes - Shubham Tagra (Qubole) 00:39:15
    • Cruise Control: Effortless Management of Kafka Clusters - Adem Efe Gencer (LinkedIn) 00:41:21
    • Enabling Insights and Analytics with Data Streaming Architectures and Pipelines using Kafka and Hadoop - Mohammad Quraishi (Cigna) 00:40:17
    • Real Time Analytics at Uber: bring SQL into everything - Zhenxiao Luo (Uber) 00:40:22
    • Data processing at the speed of 100 Gbps using Apache Crail - Patrick Stuedi (IBM Research) 00:29:32
  5. Strata Data Ethics Summit
    • Getting real about ethical technology - Susan Etlinger (Altimeter Group) 00:15:39
    • The human side of data and technology - Bradley Voytek (UC San Diego) 00:14:23
    • AI's terrible twos: When AI does what we taught it - Jana Eggers (Nara Logics) 00:17:11
    • Say what? The ethical challenges of designing for humanlike interaction - Jonathan Foster (Microsoft) 00:13:18
    • Is your AI making good decisions? - Brian Rieger (Labelbox) 00:12:40
    • Panel: Causes - Bradley Voytek (UC San Diego), Jana Eggers (Nara Logics), Jonathan Foster (Microsoft), Brian Rieger (Labelbox) 00:35:32
    • On the Accountability of Black Boxes: How we can control what we can’t exactly measure. - Yiannis Kanellopoulos (Code4Thought) 00:17:57
    • The future of data ethics - Alistair Croll (Solve For Interesting), Susan Etlinger (Altimeter Group), Tim O'Reilly (O'Reilly Media) 00:03:27
  6. Future of the Firm
    • The future of the firm: Starting now - Josh Bersin (Bersin by Deloitte) 00:34:38
    • A human-centered approach to AI and machine learning - Cathryn Posey (Capital One) 00:17:46
    • Automating yourself out of a job? The problem with knowledge work - James Cham (Bloomberg Beta) 00:15:04
    • The brave new world of computational propaganda - Renee DiResta (New Knowledge) 00:43:27
    • The conscience of a company - Tim O'Reilly (O'Reilly Media), Janet Haven (Data & Society), Catherine Bracy (TechEquity Collaborative) 00:40:04
  7. Law and Ethics
    • Owning ethics: Doing ethics inside a tech company - Jake Metcalf (Ethical Resolve), Emanuel Moss (Data & Society) 00:41:54
    • Community and regional data sharing policy frameworks: Frontier stories - Mei Fung (Customer Think) 00:35:23
  8. Strata Business Summit
    • Executive Briefing: From the edge to AI—Taking control of your data for fun and profit - Mike Olson (Cloudera) 00:33:06
    • Recommendation engines and mobile gaming - Bysshe Easton (KIXEYE), Thomas Dobbs (KIXEYE) 00:40:36
    • Scaling visualization for big data and analytics in the cloud - Jaipaul Agonus (FINRA), Daniel Monteiro (FINRA) 00:43:04
    • Shortcuts that short-circuit talent pipelines: Data-driven optimization of hiring - Maryam Jahanshahi (TapRecruit) 00:43:17
    • The ethics of analytics - Bill Franks (International Institute For Analytics) 00:58:44
    • Yay, we are going to deploy an AI/ML powered app. But wait! Where do I deploy? - Swatee Singh (American Express) 00:35:27
    • The collision between AI and underground infrastructure - Greg Quist (SmartCover Systems) 00:32:48
    • Understanding the data universe with a data catalog - John Haddad (Informatica) 00:43:06
    • What the reproducibility problem means for your business - Stuart Buck (Laura and John Arnold Foundation) 00:40:53
    • Apache Superset: An open source data visualization platform - Maxime Beauchemin (Lyft) 00:38:35
    • An alternative approach to adding data science to an organization: Use Jupyter and start with the domain experts - Dave Stuart (Department of Defense) 00:41:22
  9. Business Analytics and Visualization
    • How Walgreens transformed supply chain management with Kyvos, Tableau, and big data - Neerav Jain (Walgreens), Anne Cruz (Walgreens) 00:37:10
    • Understanding the world food economy with satellite images and AI - Alex Kudriashova (Astro Digital) 00:20:54
    • When Self-Service BI meets Geospatial Analysis - Kyungtaak Noh (SK Telecom) 00:27:39
    • The Paradise Papers and West Africa Leaks: Behind the scenes with the ICIJ - Pierre Romera (International Consortium of Investigative Journalists (ICIJ)) 00:39:03
  10. Case studies
    • Voice of the Customer; a Case Study in how Machine Learning can Automate Consumer Insights - JoLynn Lavin (General Mills, Inc) 00:23:14
    • Informing the Art of Business with Data and Science - Craig Rowley (Columbia Sportswear) 00:28:36
    • Sharing Cancer Genomic Data from Clinical Sequencing Using Blockchain - Benjamin Glicksberg (UCSF) 00:25:15
    • Leveraging fashion data to make shopping recommendations - Rhonda Textor (True Fit) 00:31:56
  11. Executive Briefing and best practices
    • Executive Briefing: Why machine-learned models crash and burn in production and what to do about it - David Talby (Pacific AI) 00:37:20
    • Executive Briefing: What it takes to use machine learning in fast data pipelines - Dean Wampler (Lightbend) 00:32:04
    • From Data Driven to Data Competitive - June Andrews (GE) 00:29:24
    • How to extract stories from your data and tell them visually? It Can Be Done. We Will Show You How. - Ambal Balakrishnan (IBM) 00:34:45
    • Organic Intelligence: Telling a story about the Human Experience with Math - Robin Way (Corios) 00:40:21
  12. Streaming and IoT
    • Critical Turbine Maintenance: Monitoring & Diagnosing Planes and Power Plants in Real Time - June Andrews (GE), John Rutherford (GE) 00:40:24
    • Flink SQL in Action - Fabian Hueske (Ververica) 00:42:52
    • Serverless for data and AI - Avner Braverman (Binaris) 00:40:04
    • Apache Druid auto scale-out/in for streaming data ingestion on Kubernetes - Jinchul Kim (SK Telecom) 00:43:17
  13. Culture and organization
    • Data Science University: Transforming a Fortune 5 workforce - Marc Paradis (UnitedHealth Group) 00:41:09
    • Scaling data infrastructure in the fashion world; or, “What is this? Business intelligence for ants?” - Francesco Mucio (Zalando) 00:33:39
    • Creating a data engineering culture at USAA - Jesse Anderson (Big Data Institute), Thomas Goolsby (USAA) 00:45:53
  14. Visualization and UX
    • Bringing data to life: Combining machine learning and art to tell a data story - Nancy Rausch (SAS Institute) 00:24:07
  15. Jupyter
    • From Jupyter to production: Accelerating solutions to business problems in production - Manu Mukerji (8x8) 00:36:44
    • Talking with Jupyter - M Pacer (Netflix) 00:38:30
    • Where does Jupyter fit into building end-to-end ML products? - Omoju Miller (GitHub) 00:30:42
    • Scaling Jupyter with Jupyter Enterprise Gateway - Alan Chin (IBM), Luciano Resende (IBM) 00:41:45
    • Jupyter Book: Online interactive books with the Jupyter Notebook - Chris Holdgraf (Berkeley Institute for Data Science) 00:44:18
  16. Tutorials
    • Hands-on Machine Learning with Kafka-based Streaming Pipelines - Boris Lublinsky (Lightbend), Dean Wampler (Lightbend) - Part 1 00:46:47
    • Hands-on Machine Learning with Kafka-based Streaming Pipelines - Boris Lublinsky (Lightbend), Dean Wampler (Lightbend) - Part 2 00:45:55
    • Hands-on Machine Learning with Kafka-based Streaming Pipelines - Boris Lublinsky (Lightbend), Dean Wampler (Lightbend) - Part 3 00:37:59
    • Hands-on Machine Learning with Kafka-based Streaming Pipelines - Boris Lublinsky (Lightbend), Dean Wampler (Lightbend) - Part 4 00:47:41
    • Introduction to Flink via Flink SQL - Fabian Hueske (Ververica) - Part 1 00:39:24
    • Introduction to Flink via Flink SQL - Fabian Hueske (Ververica) - Part 2 00:37:12
    • Introduction to Flink via Flink SQL - Fabian Hueske (Ververica) - Part 3 00:47:16
    • Managing data science in the enterprise - Joshua Poduska (Domino Data Lab), Kimberly Shenk (NakedPoppy), Mac Steele (Domino Data Lab) - Part 1 00:53:05
    • Managing data science in the enterprise - Joshua Poduska (Domino Data Lab), Kimberly Shenk (NakedPoppy), Mac Steele (Domino Data Lab) - Part 2 00:46:12
    • Managing data science in the enterprise - Joshua Poduska (Domino Data Lab), Kimberly Shenk (NakedPoppy), Mac Steele (Domino Data Lab) - Part 3 00:19:18
    • Streamlining a Machine Learning Project Team - Sourav Dey (Manifold), Alex Ng (Manifold) - Part 1 00:26:51
    • Streamlining a Machine Learning Project Team - Sourav Dey (Manifold), Alex Ng (Manifold) - Part 2 00:39:05
    • Streamlining a Machine Learning Project Team - Sourav Dey (Manifold), Alex Ng (Manifold) - Part 3 00:58:40
    • Running multidisciplinary big data workloads in the cloud - Jason Wang (Cloudera), Tony Wu (Cloudera), Vinithra Varadharajan (Cloudera) - Part 1 00:54:19
    • Running multidisciplinary big data workloads in the cloud - Jason Wang (Cloudera), Tony Wu (Cloudera), Vinithra Varadharajan (Cloudera) - Part 2 00:44:42
    • Running multidisciplinary big data workloads in the cloud - Jason Wang (Cloudera), Tony Wu (Cloudera), Vinithra Varadharajan (Cloudera) - Part 3 00:33:18
    • Foundations for Successful Data Projects - Jonathan Seidman (Cloudera), Ted Malaska (Capital One) - Part 1 00:39:05
    • Foundations for Successful Data Projects - Jonathan Seidman (Cloudera), Ted Malaska (Capital One) - Part 2 00:40:40
    • Foundations for Successful Data Projects - Jonathan Seidman (Cloudera), Ted Malaska (Capital One) - Part 3 00:43:50
    • Foundations for Successful Data Projects - Jonathan Seidman (Cloudera), Ted Malaska (Capital One) - Part 4 00:44:04
    • Architecting a data platform for enterprise use - Mark Madsen (Think Big Analytics), Todd Walter (Teradata) - Part 1 00:39:38
    • Architecting a data platform for enterprise use - Mark Madsen (Think Big Analytics), Todd Walter (Teradata) - Part 2 00:57:41
    • Architecting a data platform for enterprise use - Mark Madsen (Think Big Analytics), Todd Walter (Teradata) - Part 3 00:53:25
    • Architecting a data platform for enterprise use - Mark Madsen (Think Big Analytics), Todd Walter (Teradata) - Part 4 00:43:14
    • Natural language understanding at scale with Spark NLP - David Talby (Pacific AI), Alex Thomas (Indeed), Claudiu Branzan (G2 Web Services) - Part 1 00:31:37
    • Natural language understanding at scale with Spark NLP - David Talby (Pacific AI), Alex Thomas (Indeed), Claudiu Branzan (G2 Web Services) - Part 2 00:51:35
    • Natural language understanding at scale with Spark NLP - David Talby (Pacific AI), Alex Thomas (Indeed), Claudiu Branzan (G2 Web Services) - Part 3 00:52:34
    • Natural language understanding at scale with Spark NLP - David Talby (Pacific AI), Alex Thomas (Indeed), Claudiu Branzan (G2 Web Services) - Part 4 00:33:35
    • Recurrent Neural Networks without a PhD workshop - Martin Gorner (Google) - Part 1 00:51:58
    • Recurrent Neural Networks without a PhD workshop - Martin Gorner (Google) - Part 2 00:49:05
    • Recurrent Neural Networks without a PhD workshop - Martin Gorner (Google) - Part 3 00:55:27
    • Recurrent Neural Networks without a PhD workshop - Martin Gorner (Google) - Part 4 00:51:52
    • The Hitchhiker's Guide to Deep Learning Based Recommenders in Production - Abhishek Kumar (Publicis.Sapient), Pramod Singh (Sapient Razorfish) - Part 1 00:48:38
    • The Hitchhiker's Guide to Deep Learning Based Recommenders in Production - Abhishek Kumar (Publicis.Sapient), Pramod Singh (Sapient Razorfish) - Part 2 00:58:47
    • The Hitchhiker's Guide to Deep Learning Based Recommenders in Production - Abhishek Kumar (Publicis.Sapient), Pramod Singh (Sapient Razorfish) - Part 3 00:35:46
    • The Hitchhiker's Guide to Deep Learning Based Recommenders in Production - Abhishek Kumar (Publicis.Sapient), Pramod Singh (Sapient Razorfish) - Part 4 00:54:18
    • Practical Techniques for Interpretable Machine Learning - Patrick Hall (H2O.ai | George Washington University) - Part 1 00:51:47
    • Practical Techniques for Interpretable Machine Learning - Patrick Hall (H2O.ai | George Washington University) - Part 2 00:38:49
    • Practical Techniques for Interpretable Machine Learning - Patrick Hall (H2O.ai | George Washington University) - Part 3 00:48:41
    • Practical Techniques for Interpretable Machine Learning - Patrick Hall (H2O.ai | George Washington University) - Part 4 00:37:24
    • Learning Presto: SQL-on-Anything - Matt Fuller (Starburst) - Part 1 00:22:56
    • Learning Presto: SQL-on-Anything - Matt Fuller (Starburst) - Part 2 00:42:01
    • Learning Presto: SQL-on-Anything - Matt Fuller (Starburst) - Part 3 00:58:30
    • Learning Presto: SQL-on-Anything - Matt Fuller (Starburst) - Part 4 00:54:46