SQL Server Big Data Clusters, 2nd Edition
- Author: Benjamin Weissman, Enrico Van De Laar
- ISBN: 148425984X
- Year: 2020
- Pages: 277
- Language: English
- File size: 11.4 MB
- File format: PDF, ePub
- Category: SQL
Use this manual to one of SQL Server 2019’s most impactful features–Big Data Clusters. You will learn about information virtualization and data lakes for this complete artificial intelligence (AI) and machine learning (ML) system within the SQL Server database engine. You will understand the way to use Big Data Clusters to combine huge volumes of streaming data for evaluation together with data stored in a conventional database. By way of example, you can flow substantial volumes of information from Apache Spark in real time whilst executing Transact-SQL inquiries to bring in applicable additional data from your company, SQL Server database.
Filled with clear examples and use cases, this book provides everything necessary to get started working with Big Data Clusters in SQL Server 2019. Then you are shown the way to install and configure Big Data Clusters in on-premises environments or at the cloud. Next, you are educated about querying. You will learn to compose queries in Transact-SQL–taking advantage of skills you’ve honed for years–and with these questions you will have the ability to examine and analyze information from a wide variety of sources such as Apache Spark.
Through the theoretical base provided in this publication and easy-to-follow example scripts and laptops, you’ll be ready to utilize and unveil the full potential of SQL Server 2019: combining different kinds of data spread across widely disparate sources into a single view that’s useful for business intelligence and machine learning analysis.
What You Will Learn
- Install, manage, and troubleshoot Big Data Clusters in cloud or on-premise environments
- Analyze Huge volumes of data directly from SQL Server and/or Apache Spark
- Manage data stored in HDFS from SQL Server as Though It were relational data
- Implement advanced analytics options via machine learning and AI
- Expose Unique data sources as one logical source using data virtualization
Who This Book Is For
Data engineers, information scientists, information architects, and database administrators who Wish to use data virtualization and big data analytics in their surroundings