A practical guide to building efficient and scalable data solutions
With step-by-step instructions and examples, this book teaches you the skills needed to build and deploy complex, efficient, and scalable big data pipelines on Kubernetes.
This book covers the following exciting features:
- Install and use Docker to run containers and build lean images
- Gain a deep understanding of Kubernetes architecture and its components
- Deploy and manage Kubernetes clusters on different cloud platforms
- Implement and manage data pipelines using Apache Spark and Apache Airflow
- Deploy and configure Apache Kafka for real-time data ingestion and processing
- Build and orchestrate a complete big data pipeline using open-source tools
- Deploy Generative AI applications on a Kubernetes-based architecture
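To give a flavor of the deployment work covered above, here is a minimal sketch of a Kubernetes Pod manifest. It is illustrative only and not taken from the book: the names, labels, and image are placeholders (busybox is used simply as a small public image).

```yaml
# Illustrative Pod manifest (placeholder names; not from the book)
apiVersion: v1
kind: Pod
metadata:
  name: hello-pipeline
  labels:
    app: hello-pipeline
spec:
  containers:
    - name: hello
      image: busybox:1.36  # small public image used as a stand-in
      command: ["sh", "-c", "echo 'pipeline step complete' && sleep 3600"]
```

Applied with `kubectl apply -f pod.yaml`, this schedules a single container on the cluster — the same primitive that richer objects such as Deployments and Jobs, and the Spark, Airflow, and Kafka deployments in the chapters below, build upon.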
Part 1: Docker and Kubernetes
- ✅ Getting Started with Containers
- 📖 Kubernetes Architecture
- ✅ Getting Hands-On with Kubernetes
Part 2: Big Data Stack
- 📖 The Modern Data Stack
- ✅ Big Data Processing with Apache Spark
- ✅ Building Pipelines with Apache Airflow
- ✅ Apache Kafka for Real-Time Events and Data Ingestion
Part 3: Connecting It All Together
- ✅ Deploying the Big Data Stack on Kubernetes
- ✅ Data Consumption Layer
- ✅ Building a Big Data Pipeline on Kubernetes
- ⏸️ Generative AI on Kubernetes
- 📖 Where to Go from Here
