This course focuses on one of SQL Server 2019’s most impactful features—Big Data Clusters. You will learn about data virtualization and data lakes for this complete artificial intelligence (AI) and machine learning (ML) platform within the SQL Server database engine.
You will be shown how to use Big Data Clusters to combine large volumes of streaming data for analysis along with data stored in a traditional database. For instance, you can stream large volumes of data from Apache Spark in real-time while executing Transact-SQL queries to bring in relevant additional data from your corporate, SQL Server database.
This course provides everything necessary to get started working with Big Data Clusters in SQL Server 2019. You will learn about the architectural foundations that are made up from Kubernetes, Spark, HDFS, and SQL Server on Linux. You will be shown how to configure and deploy Big Data Clusters. You will be ready to use and unveil the full potential of SQL Server 2019: combining different types of data spread across widely disparate sources into a single view that is useful for business intelligence and machine learning analysis.
- What a Big Data Cluster is
- How to deploy BDC
- How to analyze large volumes of data directly from SQL Server
- How to analyze large volumes of data via Apache Spark
- How to manage data stored in HDFS from SQL Server as if it were relational data
- How to implement advanced analytics solutions through machine learning
- How to expose different data sources as a single logical source using data virtualization
This course is intended for data engineers, data scientists, data architects, and database administrators who want to employ data virtualization and big data analytics in their environments.
Course Video Content: 7 Hours 6 Minutes
Test Questions: 75
Module 1: What are Big Data Clusters?
1.2 Linux, PolyBase, and Active Directory
1.3 ScenariosModule 2: Big Data Cluster Architecture
2.4 Hadoop and Spark
2.6 EndpointsModule 3: Deployment of Big Data Clusters
3.2 Install Prerequisites
3.3 Deploy Kubernetes
3.4 Deploy BDC
3.5 Monitor and Verify DeploymentModule 4: Loading and Querying Data in Big Data Clusters
4.2 HDFS with Curl
4.3 Loading Data with T-SQL
4.4 Virtualizing Data
4.5 Restoring a DatabaseModule 5: Working with Spark in Big Data Clusters
5.2 What is Spark
5.3 Submitting Spark Jobs
5.4 Running Spark Jobs via Notebooks
5.5 Transforming CSV
5.7 Spark to SQL ETLModule 6: Machine Learning on Big Data Clusters
6.2 Machine Learning Services
6.3 Using MLeap
6.4 Using Python
6.5 Using RModule 7: Create and Consume Big Data Cluster Apps
7.2 Deploying, Running, Consuming, and Monitoring an App
7.3 Python Example - Deploy with azdata and Monitoring
7.4 R Example - Deploy with VS Code and Consume with Postman
7.5 MLeap Example - Create a yaml file
7.6 SSIS Example - Implement scheduled execution of a DB backupModule 8: Maintenance of Big Data Clusters
8.3 Managing and Automation
8.4 Course Wrap Up
LEARN365 Courses Include 12 Months Unlimited Online Access to:
Expert Instructor-Led Training: Learn 365 uses only the industry's finest instructors in the IT industry. They have a minimum of 15 years real-world experience and are subject matter experts in their fields. Unlike a live class, you can fast-forward, repeat or rewind all your lectures. This creates a personal learning experience and gives you all the benefit of hands-on training with the flexibility of doing it around your schedule 24/7.
Visual Demonstrations & Multimedia Presentations: Our courseware includes instructor-led demonstrations and visual presentations that allow students to develop their skills based on real world scenarios explained by the instructor. Learn 365 always focuses on real world scenarios and skill-set development.
Quizzes & Exam Simulators: Learn 365's custom practice exams prepare you for your exams differently and more effectively than the traditional exam preps on the market. You will have practice quizzes after each module to ensure you are confident on the topic you have completed before proceeding. This will allow you to gauge your effectiveness before moving to the next module in your course. Learn 365 courses also include practice exams designed to replicate and mirror the environment in the testing center. These exams are on average 100 questions to ensure you are 100% prepared before taking your certification exam.
Social Learning & Networking: Learn 365 has designed a world class Learning Management System (LMS). This system allows you to interact and collaborate with other students and Learn 365 employees, form study groups, engage in discussions in our NOW@ Forums, rate and like different courses and stay up to date with all the latest industry knowledge through our forums, student contributions and announcement features.
Flash Cards & Educational Games: IT online learning knows that education is not a one size fits all approach. Students learn in different ways through different tools. That is why we provide Flash Cards and Education Games throughout our courses. This will allow you to train in ways that keep you engaged and focused. Each course will have dozens of Flash Cards so you can sharpen your skill-sets throughout your training as well as educational games designed to make sure your retention level of the materials is extremely high.
Navigation and Controls: Learn 365's self-paced training programs are designed in a modular fashion to allow you the flexibility to work with expert level instruction anytime 24/7. All courses are arranged in defined sections with navigation controls allowing you to control the pace of your training. This allows students to learn at their own pace around their schedule.
Certificate of Completion: Upon completion of your training course, you will receive a Certificate of completion displaying your full name, course completed as well as the date of completion. You can print this out or save it digitally to showcase your accomplishment.
Need to train your Team? Contact Us for Discounts on Multiple Subscription Purchases.