Unified Batch and Stream Processing with Apache Beam Training Course

Apache Beam is an open-source, unified programming model for defining and executing parallel data processing pipelines. Its power lies in its ability to run both batch and streaming pipelines, with execution carried out by one of Beam's supported distributed processing back-ends: Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow. Apache Beam is useful for ETL (Extract, Transform, and Load) tasks such as moving data between different storage media and data sources, transforming data into a more desirable format, and loading data into a new system.

In this instructor-led, live training (onsite or remote), participants will learn how to use the Apache Beam SDKs in a Java or Python application to define a data processing pipeline that decomposes a large data set into smaller chunks for independent, parallel processing.
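
The idea of decomposing a data set into chunks that are processed independently and then combined can be sketched in plain Python. This is only a standard-library illustration of the concept, not Apache Beam SDK code; the function names `split_into_chunks`, `count_words`, and `parallel_word_count`, the word-count task, and the sample data are all assumptions made for the example. In a Beam pipeline, splitting the data and scheduling the work is handled by the chosen runner rather than written by hand.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(items, chunk_size):
    """Split a list into consecutive chunks of at most chunk_size items."""
    return [items[i:i + chunk_size] for i in range(0, len(items), chunk_size)]


def count_words(lines):
    """Count words in one chunk; needs no data from any other chunk."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def parallel_word_count(lines, chunk_size=2, workers=4):
    """Process each chunk independently, then merge the partial results."""
    chunks = split_into_chunks(lines, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = list(pool.map(count_words, chunks))
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


if __name__ == "__main__":
    # Hypothetical sample input, used only to exercise the sketch.
    data = ["beam runs batch", "beam runs streams", "batch and streams"]
    print(parallel_word_count(data))
```

Because each chunk is counted without reference to the others, the per-chunk step could in principle run on separate workers, and only the final merge needs to see all partial results.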
