The Hadoop ecosystem consists of many components, which makes it hard to learn and understand. This book helps data engineers and architects understand the internals of big data technologies, from the basics of HDFS and MapReduce through to Kafka and Spark. There are currently two volumes: Volume 1 mainly covers batch processing, and Volume 2 mainly covers stream processing.



Contents

Batch processing vs Stream processing
Scala
Chapter 1. Spark
Chapter 2. Spark SQL
Chapter 3. Spark Streaming
Chapter 4. Spark Structured Streaming
Chapter 5. Kafka
Data Lake Architecture

Title
Exploring Hadoop Ecosystem (Volume 2)
Subtitle
Stream Processing
Author
EAN
9781667184500
Format
E-Book (epub)
Publisher
Digital copy protection
None
File size
14.07 MB