Workshop: Apache Kafka and KSQL in Action: Let’s Build a Streaming Data Pipeline

Location: Cyril Magnin II

Duration: 9:00am - 4:00pm

Day of week: Monday

Level: Beginner

Key Takeaways

  • Brief introduction to Apache Kafka and streaming concepts

  • Integrating systems with Kafka using Kafka Connect

  • Exploring data with KSQL

  • Using KSQL to filter, join, and aggregate data

Prerequisites

Participants must bring their own laptop and complete the prerequisite steps detailed here: https://github.com/confluentinc/examples/blob/5.1.0-post/ksql-workshop/pre-requisites.adoc
For compatibility reasons, we strongly recommend using Mac or Linux for the workshop.
Knowledge of SQL is assumed. Participants will benefit from existing knowledge of Kafka concepts (topics, streaming, etc.).

Have you ever thought that you needed to be a programmer to do stream processing and build streaming data pipelines? Think again! Apache Kafka is a distributed, scalable, and fault-tolerant streaming platform, providing low-latency pub-sub messaging coupled with native storage and stream processing capabilities. Integrating Kafka with RDBMS, NoSQL, and object stores is simple with Kafka Connect, which is part of Apache Kafka. KSQL is the open-source SQL streaming engine for Apache Kafka, and makes it possible to build stream processing applications at scale, written using a familiar SQL interface. 

In this workshop you will learn the architectural reasoning for Apache Kafka and the benefits of real-time integration, and then build a streaming data pipeline using nothing but your bare hands, Kafka Connect, and KSQL. 

Gasp as we filter events in real time! Be amazed at how we can enrich streams of data with data from RDBMS! Be astonished at the power of streaming aggregates for anomaly detection!
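To give a feel for the three operations mentioned above (filter, enrich, aggregate), here is a minimal conceptual sketch in plain Python. It is not KSQL and not the workshop's material: the event fields, the customer table, and the function names are all hypothetical, and it runs over a small in-memory list, whereas KSQL applies the same ideas continuously to unbounded Kafka topics.

```python
from collections import defaultdict

# Hypothetical rating events, standing in for messages on a Kafka topic.
events = [
    {"user_id": 1, "stars": 1, "ts": 0},
    {"user_id": 2, "stars": 5, "ts": 10},
    {"user_id": 1, "stars": 2, "ts": 20},
    {"user_id": 1, "stars": 1, "ts": 70},
]

# Hypothetical reference data, standing in for a table sourced from an RDBMS.
customers = {1: "alice", 2: "bob"}


def low_ratings(stream, max_stars=2):
    """Filter: keep only events with a poor rating."""
    return (e for e in stream if e["stars"] <= max_stars)


def enrich(stream, table):
    """Join: add the customer name from the lookup table to each event."""
    for e in stream:
        yield {**e, "name": table.get(e["user_id"], "unknown")}


def count_per_window(stream, window_size=60):
    """Aggregate: count events per (name, tumbling-window start time)."""
    counts = defaultdict(int)
    for e in stream:
        window_start = e["ts"] // window_size * window_size
        counts[(e["name"], window_start)] += 1
    return dict(counts)


# Chain the three stages, as a pipeline would.
result = count_per_window(enrich(low_ratings(events), customers))
print(result)
```

A sudden jump in the count for one customer within a single window is the kind of signal a streaming aggregate can surface for anomaly detection.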

Speaker: Tim Berglund

Senior Director of Developer Experience @Confluent

Tim is a teacher, author, and technology leader with Confluent, where he serves as the Senior Director of Developer Experience. He can frequently be found speaking at conferences in the United States and all over the world. He is the co-presenter of various O’Reilly training videos on topics ranging from Git to Distributed Systems, and is the author of Gradle Beyond the Basics. He tweets as @tlberglund, blogs very occasionally at http://timberglund.com, and lives in Littleton, CO, USA with the wife of his youth and their youngest child, the other two having mostly grown up.

2019 Tracks

  • Grokking Timeseries & Sequential Data

    Techniques, practices, and approaches around time series and sequential data. Expect topics including image recognition, NLP/NLU, preprocessing, and crunching of related algorithms.

  • Deep Learning in Practice

    Deep learning use cases around edge computing, deep learning for search, explainability, fairness, and perception.