Data Engineering for Machine Learning Pipelines: From Python Libraries to ML Pipelines and Cloud Platforms - Paperback

Brand: Books by splitShops
SKU: 9798868806018
Price: 70.18 USD
Availability: In stock

Product Description

by Pavan Kumar Narayanan (Author)

This book covers modern data engineering functions and important Python libraries, to help you develop state-of-the-art ML pipelines and integration code.

The book begins with data analytics and transformation, covering the Pandas library, its capabilities, and its nuances. It then explores newer libraries such as Polars and cuDF, with insights into GPU-based computing and current data manipulation techniques. It discusses the importance of data validation in engineering processes and introduces tools such as Great Expectations and Pandera for ensuring data quality and reliability. The book then turns to API design and development with FastAPI, covering authentication, authorization, and real-world applications so that you can build efficient and secure APIs. Also explored is concurrency in data engineering, examining Dask's capabilities from basic setup to building advanced machine learning pipelines. The book covers the development and delivery of data engineering pipelines on leading cloud platforms such as AWS, Google Cloud, and Microsoft Azure. The concluding chapters concentrate on real-time and streaming data engineering pipelines, with an emphasis on Apache Kafka, and on workflow orchestration, introducing tools such as Airflow and Prefect to manage and automate complex data workflows.

What sets this book apart is its blend of theoretical knowledge and practical application, a structured path from basic to advanced concepts, and insights into using state-of-the-art tools. With this book, you gain access to cutting-edge techniques and insights that are reshaping the industry. This book is not just an educational tool. It is a career catalyst, and an investment in your future as a data engineering expert, poised to meet the challenges of today's data-driven world.

What You Will Learn

Elevate your data wrangling jobs by utilizing the power of both CPU and GPU computing, and learn to process data at high speed using Pandas 2.0, Polars, and cuDF
Design data validation pipelines, construct efficient data service APIs, develop real-time streaming pipelines, and master workflow orchestration to streamline your engineering projects
Leverage concurrent programming to develop machine learning pipelines and get hands-on experience in development and deployment of machine learning pipelines across AWS, GCP, and Azure

Who This Book Is For

Data analysts, data engineers, data scientists, machine learning engineers, and MLOps specialists

Number of Pages: 636

Dimensions: 1.33 x 10 x 7 inches

Illustrated: Yes

Publication Date: September 28, 2024