Data Engineering with Python
Data Engineering with Python
Work with massive datasets to design data models and automate data pipelines using Python
Crickard, Paul
Packt Publishing Limited
10/2020
356
Mole
Inglês
9781839214189
15 a 20 dias
What is Data Engineering?
Building Our Data Engineering Infrastructure
Reading and Writing Files
Working with Databases
Cleaning, Transforming, and Enriching Data
Building a 311 Data Pipeline
Features of a Production Pipeline
Version Control Using the NiFi Registry
Monitoring and Logging Pipelines
Deploying your Pipelines
Building a Production Data Pipeline
Building a Kafka Cluster
Streaming Data with Apache Kafka
Data Processing with Apache Spark
Real-Time Edge Data with MiNiFi, Kafka, and Spark
Appendix
What is Data Engineering?
Building Our Data Engineering Infrastructure
Reading and Writing Files
Working with Databases
Cleaning, Transforming, and Enriching Data
Building a 311 Data Pipeline
Features of a Production Pipeline
Version Control Using the NiFi Registry
Monitoring and Logging Pipelines
Deploying your Pipelines
Building a Production Data Pipeline
Building a Kafka Cluster
Streaming Data with Apache Kafka
Data Processing with Apache Spark
Real-Time Edge Data with MiNiFi, Kafka, and Spark
Appendix