Sridhar Alla – författare
786 kr
Läs direkt efter köp
793 kr
Läs direkt efter köp
Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade.
Applied Data Science Using PySpark is divided unto six sections which walk you through the book. In section 1, you start with the basics of PySpark focusing on data manipulation. We make you comfortable with the language and then build upon it to introduce you to the mathematical functions available off the shelf. In section 2, you will dive into the art of variable selection where we demonstrate various selection techniques available in PySpark. In section 3, we take you on a journey through machine learning algorithms, implementations, and fine-tuning techniques. We will also talk about different validation metrics and how to use them for picking the best models. Sections 4 and 5 go through machine learning pipelines and various methods available to operationalize the model and serve it through Docker/an API. In the final section, you will cover reusable objects for easy experimentation and learn some tricks that can help you optimize your programs and machine learning pipelines.
By the end of this book, you will have seen the flexibility and advantages of PySpark in data science applications. This book is recommended to those who want to unleash the power of parallel computing by simultaneously working with big datasets.
What You Will Learn
Build an end-to-end predictive modelImplement multiple variable selection techniquesOperationalize modelsMaster multiple algorithms and implementationsWho This Book is For
Data scientists and machine learning and deep learning engineers who want to learn and use PySpark for real-time analysis of streamingdata.616 kr
Skickas inom 10-15 vardagar
865 kr
Läs direkt efter köp
What You Will Learn
Perform basic data analysis and construct models in scikit-learn and PySparkTrain, test, and validate your models (hyperparameter tuning)Know what MLOps is and what an ideal MLOps setup looks likeEasily integrate MLFlow into your existing or future projectsDeploy your models and perform predictions with them on the cloudWho This Book Is ForData scientists and machine learning engineers who want to learn MLOps and know how to operationalize their models919 kr
Skickas inom 5-8 vardagar
413 kr
Läs direkt efter köp
Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3
About This Book
Learn Hadoop 3 to build effective big data analytics solutions on-premise and on cloudIntegrate Hadoop with other big data tools such as R, Python, Apache Spark, and Apache FlinkExploit big data using Hadoop 3 with real-world examplesWho This Book Is For
Big Data Analytics with Hadoop 3 is for you if you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3''s powerful features, or you''re new to big data analytics. A basic understanding of the Java programming language is required.
What You Will Learn
Explore the new features of Hadoop 3 along with HDFS, YARN, and MapReduceGet well-versed with the analytical capabilities of Hadoop ecosystem using practical examplesIntegrate Hadoop with R and Python for more efficient big data processingLearn to use Hadoop with Apache Spark and Apache Flink for real-time data analyticsSet up a Hadoop cluster on AWS cloudPerform big data analytics on AWS using Elastic Map ReduceIn Detail
Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. Big Data Analytics with Hadoop 3 shows you how to do just that, by providing insights into the software as well as its benefits with the help of practical examples.
Once you have taken a tour of Hadoop 3''s latest features, you will get an overview of HDFS, MapReduce, and YARN, and how they enable faster, more efficient big data processing. You will then move on to learning how to integrate Hadoop with the open source tools, such as Python and R, to analyze and visualize data and perform statistical computing on big data. As you get acquainted with all this, you will explore how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing. In addition to this, you will understand how to use Hadoop to build analytics solutions on the cloud and an end-to-end pipeline to perform big data analysis using practical use cases.
By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform big data analytics and get insight effortlessly.
Style and approach
Filled with practical examples and use cases, this book will not only help you get up and running with Hadoop, but will also take you farther down the road to deal with Big Data Analytics
565 kr
Skickas inom 5-8 vardagar
613 kr
Skickas inom 5-8 vardagar
550 kr
Skickas inom 3-6 vardagar
786 kr
Läs direkt efter köp
This beginner-oriented book will help you understand and perform anomaly detection by learning cutting-edge machine learning and deep learning techniques. This updated second edition focuses on supervised, semi-supervised, and unsupervised approaches to anomaly detection. Over the course of the book, you will learn how to use Keras and PyTorch in practical applications. It also introduces new chapters on GANs and transformers to reflect the latest trends in deep learning.
Beginning Anomaly Detection Using Python-Based Deep Learning begins with an introduction to anomaly detection, its importance, and its applications. It then covers core data science and machine learning modeling concepts before delving into traditional machine learning algorithms such as OC-SVM and Isolation Forest for anomaly detection using scikit-learn. Following this, the authors explain the essentials of machine learning and deep learning, and how to implement multilayer perceptrons for supervised anomaly detection in both Keras and PyTorch. From here, the focus shifts to the applications of deep learning models for anomaly detection, including various types of autoencoders, recurrent neural networks (via LSTM), temporal convolutional networks, and transformers, with the latter three architectures applied to time-series anomaly detection. This edition has a new chapter on GANs (Generative Adversarial Networks), as well as new material covering transformer architecture in the context of time-series anomaly detection.
After completing this book, you will have a thorough understanding of anomaly detection as well as an assortment of methods to approach it in various contexts, including time-series data. Additionally, you will have gained an introduction to scikit-learn, GANs, transformers, Keras, and PyTorch, empowering you to create your own machine learning- or deep learning-based anomaly detectors.
What You Will Learn
Understand what anomaly detection is, why it it is important, and how it is appliedGrasp the core concepts of machine learning.Master traditional machine learning approaches to anomaly detection using scikit-kearn.Understand deep learning in Python using Keras and PyTorchProcess data through pandas and evaluate your model''s performance using metrics like F1-score, precision, and recallApply deep learning to supervised, semi-supervised, and unsupervised anomaly detection tasks for tabular datasets and time series applicationsWho This Book Is ForData scientists and machine learning engineers of all levels of experience interested in learning the basics of deep learning applications in anomaly detection.