Data Engineering with Scala and Spark
Data Engineering with Scala and Spark
Build streaming and batch pipelines that process massive amounts of data using Scala
Bhattacharjee, Rupam; Tome, Eric; Radford, David
Packt Publishing Limited
01/2024
300
Mole
Inglês
9781804612583
15 a 20 dias
Descrição não disponível.
Table of Contents
Scala Essentials for Data Engineers
Environment Setup
An Introduction to Apache Spark and Its APIs - DataFrame, Dataset, and Spark SQL
Working with Databases
Object Stores and Data Lakes
Understanding Data Transformation
Data Profiling and Data Quality
Test-Driven Development, Code Health, and Maintainability
CI/CD with GitHub
Data Pipeline Orchestration
Performance Tuning
Building Batch Pipelines Using Spark and Scala
Building Streaming Pipelines Using Spark and Scala
Scala Essentials for Data Engineers
Environment Setup
An Introduction to Apache Spark and Its APIs - DataFrame, Dataset, and Spark SQL
Working with Databases
Object Stores and Data Lakes
Understanding Data Transformation
Data Profiling and Data Quality
Test-Driven Development, Code Health, and Maintainability
CI/CD with GitHub
Data Pipeline Orchestration
Performance Tuning
Building Batch Pipelines Using Spark and Scala
Building Streaming Pipelines Using Spark and Scala
Este título pertence ao(s) assunto(s) indicados(s). Para ver outros títulos clique no assunto desejado.
Data Engineering; Data Processing; ETL; ELT; Data Transformation; Data Ingestion; Data Quality; Spark; Scala; Apache Spark; Software Engineering; Stream Processing; Batch Processing; Real-time analytical processing; Data Analysis; CI/CD
Table of Contents
Scala Essentials for Data Engineers
Environment Setup
An Introduction to Apache Spark and Its APIs - DataFrame, Dataset, and Spark SQL
Working with Databases
Object Stores and Data Lakes
Understanding Data Transformation
Data Profiling and Data Quality
Test-Driven Development, Code Health, and Maintainability
CI/CD with GitHub
Data Pipeline Orchestration
Performance Tuning
Building Batch Pipelines Using Spark and Scala
Building Streaming Pipelines Using Spark and Scala
Scala Essentials for Data Engineers
Environment Setup
An Introduction to Apache Spark and Its APIs - DataFrame, Dataset, and Spark SQL
Working with Databases
Object Stores and Data Lakes
Understanding Data Transformation
Data Profiling and Data Quality
Test-Driven Development, Code Health, and Maintainability
CI/CD with GitHub
Data Pipeline Orchestration
Performance Tuning
Building Batch Pipelines Using Spark and Scala
Building Streaming Pipelines Using Spark and Scala
Este título pertence ao(s) assunto(s) indicados(s). Para ver outros títulos clique no assunto desejado.