Shiyan Xu – författare
Visar alla böcker från författaren Shiyan Xu. Handla med fri frakt och snabb leverans.
3 produkter
3 produkter
E-bok
Engelska, 2025708 kr
Läs direkt efter köp
Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using your query engine of choice.Authors Shiyan Xu, Prashant Wason, Bhavani Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You'll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.This book helps you:Understand the need for transactional data lakehouses and the challenges associated with building themExplore data ecosystem support provided by Apache Hudi for popular data sources and query enginesPerform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applicationsApply different storage techniques and considerations such as indexing and clustering to maximize your lakehouse performanceBuild end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics
E-bok
PDF, Engelska, 2025708 kr
Läs direkt efter köp
Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using your query engine of choice.Authors Shiyan Xu, Prashant Wason, Bhavani Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You'll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.This book helps you:Understand the need for transactional data lakehouses and the challenges associated with building themExplore data ecosystem support provided by Apache Hudi for popular data sources and query enginesPerform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applicationsApply different storage techniques and considerations such as indexing and clustering to maximize your lakehouse performanceBuild end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics
Häftad, Engelska, 2025
533 kr
Skickas inom 5-8 vardagar
Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using their query engine of choice.Authors Shiyan Xu, Prashant Wason, Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You'll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.This book helps you:Understand the need for transactional data lakehouses and the challenges associated with building themGet up to speed with Apache Hudi and learn how it makes building data lakehouses easyExplore data ecosystem support provided by Apache Hudi for popular data sources and query enginesPerform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applicationsImplement data engineering techniques to operate and manage Apache Hudi tablesApply different storage techniques and considerations, such as indexing and clustering to maximize your lakehouse performanceBuild end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics