Streaming Systems: The What, Where, When, and How of Large-Scale Data Processing
by Tyler Akidau, Slava Chernyak, and Reuven LaxA comprehensive guide to stream processing, exploring concepts and architectures vital for real-time data engineering.
Designing Data-Intensive Applications
by Martin KleppmannAn essential read on data architecture, this book covers the principles of building scalable and fault-tolerant systems.
The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling
by Ralph Kimball and Margy RossA classic in data warehousing, this book provides foundational knowledge on data modeling relevant to real-time processing.
Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale
by Neha Narkhede, Gwen Shapira, and Todd PalinoLearn about Kafka, a key technology for real-time data pipelines, and its role in distributed systems.
Architecting the Cloud: Design Decisions for Cloud Computing Service Models (SaaS, PaaS, and IaaS)
by Michael J. KavisExplores cloud architecture principles, crucial for implementing scalable data pipelines in a cloud environment.
Building Microservices: Designing Fine-Grained Systems
by Sam NewmanA guide to microservices architecture, providing insights on building scalable and resilient applications.
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
by Benjamin Bengfort, Jenny Kim, and Daniel T. O'ConnorThis book covers techniques for real-time data analysis, enhancing your ability to handle streaming data effectively.
Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking
by Foster Provost and Tom FawcettOffers insights into data-driven decision-making, essential for understanding the business implications of data pipelines.