The group covers not just Hadoop but other big data and cloud technologies, such as HBase, Hypertable, and Pig. Looks like they meet at various places around town, at various and sundry times. As the dictionary described a pookah in Harvey: it appears now and then, here and there, to this one and that one. Subscribe to the meetup to stay informed.
Atlanta Hadoop Users' Group
Atlanta Hadoop Users Group News
Streaming with Kafka and Introduction to KSQL
Big Data ATL
This double-header meetup event is presented in conjunction with the Apache Kafka® ATL group!
Mitch Seymour, Senior Data Systems Engineer, Mailchimp.
Mitch will be doing a two-part series on KSQL. This session will cover the basics; part 2, at the next meetup in August/September, will be a deeper dive into some more advanced topics.
6:00 - 7:00 Social / Networking
7:00 - 7:45 Talk1: Everything you Wanted to Know about Apache Kafka but you Were too Afraid to Ask! (Ricardo Ferreira, Confluent)
7:50 - 8:50 Talk2: KSQL 101 / Intro to KSQL (Mitch Seymour, Mailchimp)
8:50 - 9:00 Q&A, Wrap-up
Abstracts / Speaker Bios:
Everything you Wanted to Know about Apache Kafka but you Were too Afraid to Ask! (Ricardo Ferreira, Confluent)
Streaming platforms have emerged as a popular, new trend, but what exactly is a streaming platform? Part messaging system, part Hadoop made fast, part fast ETL and scalable data integration, with Apache Kafka at the core, streaming platforms offer an entirely new perspective on managing the flow of data. This talk will explain what a streaming platform such as Apache Kafka is and some of the use cases and design patterns around its use. Moreover, this talk will also present and answer a set of random -- but recurring -- questions from the community about Apache Kafka.
KSQL 101 / Intro to KSQL (Mitch Seymour, Mailchimp)
KSQL is the streaming SQL engine that enables real-time data processing against Apache Kafka®. It provides an easy-to-use, yet powerful interactive SQL interface for stream processing on Kafka, without the need to write code in a programming language such as Java or Python. KSQL is scalable, elastic, fault-tolerant, and it supports a wide range of streaming operations, including data filtering, transformations, aggregations, joins, windowing, and sessionization.
This talk will cover the basics of KSQL, a streaming overview, architecture and uses of the technology.
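For readers new to the streaming operations named in the abstract, here is a minimal sketch in plain Python of three of them: filtering, windowed aggregation, and sessionization. It is not KSQL and does not use Kafka. The event data, the function names `tumbling_counts` and `sessionize`, and the 60-second window and gap are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical click events: (timestamp in seconds, user, page).
EVENTS = [
    (0, "alice", "/home"),
    (12, "bob", "/pricing"),
    (47, "alice", "/pricing"),
    (61, "alice", "/home"),
    (75, "bob", "/pricing"),
    (130, "alice", "/pricing"),
]


def tumbling_counts(events, window_seconds, page):
    """Keep only events for `page`, then count them per user in fixed,
    non-overlapping (tumbling) windows keyed by window start time."""
    counts = defaultdict(int)
    for timestamp, user, event_page in events:
        if event_page != page:  # filtering step
            continue
        window_start = (timestamp // window_seconds) * window_seconds
        counts[(window_start, user)] += 1  # aggregation step
    return dict(counts)


def sessionize(events, gap_seconds):
    """Group each user's events into sessions: a new session starts when the
    gap since that user's previous event exceeds `gap_seconds`.
    Each session is a (start, end, event_count) tuple."""
    sessions = defaultdict(list)
    for timestamp, user, _page in sorted(events):
        user_sessions = sessions[user]
        if user_sessions and timestamp - user_sessions[-1][1] <= gap_seconds:
            start, _end, count = user_sessions[-1]
            user_sessions[-1] = (start, timestamp, count + 1)
        else:
            user_sessions.append((timestamp, timestamp, 1))
    return dict(sessions)


pricing_counts = tumbling_counts(EVENTS, 60, "/pricing")
user_sessions = sessionize(EVENTS, 60)
```

A streaming engine would compute results like these continuously as events arrive, rather than over a finished list as this sketch does.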
Mitch Seymour is a Senior Data Systems Engineer at Mailchimp working on the company’s data pipeline, which handles billions of events per day across 2 production clusters. Using Kafka Streams and KSQL, his team builds stream processing applications to support data science and business intelligence initiatives across the company.
Ricardo is a Developer Advocate at Confluent, the company founded by the creators of Apache Kafka. He has more than 21 years of experience in software engineering, where he has specialized in different types of distributed systems such as integration, SOA, NoSQL, messaging, API management, and cloud computing. Prior to Confluent, he worked for other vendors such as Oracle, Red Hat and IONA Technologies, as well as several consulting firms.
When not working, like any good Brazilian, he enjoys cooking churrasco (Brazilian barbecue) with his family and friends, where he gets the chance to talk about anything that is not IT related. He currently lives in Apex, North Carolina, with his wife, son and two dogs.
Atlanta, GA 30309 - USA
Thursday, July 25 at 6:00 PM