Data Lake Analytics on Microsoft Azure (häftad)
Häftad (Paperback / softback)
Antal sidor
1st ed.
Khattar, Pankaj
134 Illustrations, black and white; XVII, 222 p. 134 illus.
254 x 178 x 13 mm
427 g
Antal komponenter
1 Paperback / softback
Data Lake Analytics on Microsoft Azure (häftad)

Data Lake Analytics on Microsoft Azure

A Practitioner's Guide to Big Data Engineering

Häftad Engelska, 2020-10-09
  • Skickas inom 5-8 vardagar.
  • Gratis frakt inom Sverige över 159 kr för privatpersoner.
Finns även som
Visa alla 1 format & utgåvor
Get a 360-degree view of how the journey of data analytics solutions has evolved from monolithic data stores and enterprise data warehouses to data lakes and modern data warehouses. You will This book includes comprehensive coverage of how: To architect data lake analytics solutions by choosing suitable technologies available on Microsoft Azure The advent of microservices applications covering ecommerce or modern solutions built on IoT and how real-time streaming data has completely disrupted this ecosystem These data analytics solutions have been transformed from solely understanding the trends from historical data to building predictions by infusing machine learning technologies into the solutions Data platform professionals who have been working on relational data stores, non-relational data stores, and big data technologies will find the content in this book useful. The book also can help you start your journey into the data engineer world as it provides an overview of advanced data analytics and touches on data science concepts and various artificial intelligence and machine learning technologies available on Microsoft Azure. What Will You Learn You will understand the: Concepts of data lake analytics, the modern data warehouse, and advanced data analytics Architecture patterns of the modern data warehouse and advanced data analytics solutions Phases-such as Data Ingestion, Store, Prep and Train, and Model and Serve-of data analytics solutions and technology choices available on Azure under each phase In-depth coverage of real-time and batch mode data analytics solutions architecture Various managed services available on Azure such as Synapse analytics, event hubs, Stream analytics, CosmosDB, and managed Hadoop services such as Databricks and HDInsight Who This Book Is For Data platform professionals, database architects, engineers, and solution architects
Visa hela texten

Passar bra ihop

  1. Data Lake Analytics on Microsoft Azure
  2. +
  3. Building Microservices Applications on Microsoft Azure

De som köpt den här boken har ofta också köpt Building Microservices Applications on Microsof... av Harsh Chawla, Hemant Kathuria (häftad).

Köp båda 2 för 818 kr


Har du läst boken? Sätt ditt betyg »

Fler böcker av författarna

  • Building Microservices Applications on Microsoft Azure

    Harsh Chawla, Hemant Kathuria

    Implement microservices starting with their architecture and moving on to their deployment, manageability, security, and monitoring. This book focuses on the key scenarios where microservices architecture is preferred over a monolithic architectur...

Övrig information

Harsh Chawla has been working on data platform technologies for last 14 years. He has been in various roles in the Microsoft world for last 12 years, going from CSS to services to technology strategy. He currently works as an Azure specialist with data and AI technologies and helps large IT enterprises build modern data warehouses, advanced analytics, and AI solutions on Microsoft Azure. He has been a community speaker and blogger on data platform technologies. Pankaj Khattar is a seasoned Software Architect with over 14 years of experience in design and development of Big Data, Machine Learning and AI based products. He currently works with Microsoft on the Azure platform as a Sr. Cloud Solution Architect for Data & AI technologies. He also possesses extensive industry experience in the field of building scalable multi-tier distributed applications and client/server based development. You can connect with him on LinkedIn at


Chapter 1: Introduction and The Need of Data LakeChapter Goal: The chapter introduces the readers to the concept & need of a data lake in this big data environment.The chapter also covers how to create a data lake & architecture patterns to be followed for data lake analytics. No of pages 15 Sub -Topics 1. Relational and non-relation data stores 2. Base for data: relational and non-relational databases 3. Warehouses of data: data warehouses 4. Markets for data: data marts 5. Introduction to data lake 6. Need to create a data lake Chapter 2: Data Just Got Bigger Chapter Goal: Today, enterprises have mix of relational and non-relational stores. However, when it comes to analyzing all this data - there must be a neutral platform which can understand these types of data. This introduces us to modern world concepts of distributed data storage & processing. It also talks about data sciences & machine learning concepts & how they are revolutionizing the data analysis world. No of pages : 20 Sub - Topics: 1. Massively parallel processing, distributed data and spark the Hadoop 2. Distributed systems vs massively parallel processing systems (MPP) 3. Respective use cases for distributed and MPP systems 4. Science for data 5. Learning of machines 6. Overview of data analytics and advanced data analytics Chapter 3: Emergence of Cloud Lakes Chapter Goal: The chapter enlighten the users with multiple cloud-based technologies available which are scalable, agile and performance in terms of computation, storage & analytics options. It goes into details about the suggested architecture on Microsoft Azure to solve Modern data warehouse, analytics use cases. No of pages: 20 Sub - Topics: 1. Data travels to Cloud with added benefits 2. Overview of phases of data analytics architecture 3. Available products under each phase on Microsoft Azure Chapter 4: Phases in Managing Data Analytics Pipeline Chapter Goal: This chapter covers in-depth context of this book. After we understand the background, this chapter will provide understanding of what are the phases of building entire data analytics pipeline. All the phases discussed in this book are critical to understand and any analytics solution will adhere to this common principle some way or the other. In each phase, there are different solutions to cater respective issues. It covers the data life cycle from upstream to downstream applications. No of pages: 20 Sub - Topics: 1. Real time and batch mode data processing 2. Phases in data Management * Ingest * Store * Analytics * Visualization 3. Cloud data lake architecture patterns Chapter 5: Data Ingestion in the Lake Chapter Goal: The chapter talks about the limitations about the traditional storage & how the big data technologies has emerged as the champion in solving the limitations & changing the concepts of Extract, Transform & Load (ETL) to Extract, Load & Transform(ELT). No of pages: 20 Sub - Topics: 1. Traditional limitations, can big data help? 2. ETL now becomes ELT 3. Tools in cloud for data ingestion * Azure Data Factory on Microsoft Azure * SQL server integration services on-premise 4. Overview of partner solutions for ETL/ELT - Informatica PowerEdge Chapter 6: Data Storage & Farming Chapter Goal: The chapter shares with readers that how once the data is available in storage layers, how it can be grown & real time data storage & analysis needs can be catered, it also talks about batch & real time data processing & storage. No of pages: 20 Sub - Topics: 1. Grow the data 2. Role of Azure data lake store, Blob, relational and non-relational stores 3.