Create a streaming dataset for Apache Kafka with Docker

Create a streaming dataset for Apache Kafka with Docker

Mar 20, 2023

Experiencing Apache Kafka without a streaming dataset is impossible, and finding streaming datasets ready to be used with Kafka is quite difficult.

This video showcases how you can start creating fake streaming data in minutes using Docker.

Check out how to create an Aiven authentication token https://www.youtube.com/watch

Follow the set of instructions to create a fake streaming dataset in our docs https://docs.aiven.io/docs/products/kafka/howto/fake-sample-data

Check out the fake data repository on GitHub https://github.com/aiven/fake-data-producer-for-apache-kafka-docker

Find out how you can customize the fake data generators with Python https://github.com/aiven/python-fake-data-producer-for-apache-kafka

Francesco Tisiot on Twitter: https://twitter.com/FTisiot
Francesco Tisiot on LinkedIn: https://www.linkedin.com/in/francescotisiot/

CHAPTERS

00:25 The fake data producer for Apache Kafka on Docker GitHub repository

1:05 The fake data producer parameters

3:46 Starting the fake data producer

5:30 Checking the data in Apache Kafka

7:14 Recap of the problem and solution

ABOUT AIVEN
Aiven’s cloud data platform helps your business reach its highest potential by making your data work for you.

It provides fully managed open source data infrastructure on all major clouds, helping developers focus on what they do best: innovate and create without worrying about the limitations of technology.

We like to think that Aiven is not only a cloud data platform but also an
extension of your team. We are dedicated to helping you to succeed by removing barriers and finding the right solutions – with the help of the best data technology there is.

CONNECT WITH US
Website: http://aiven.io​​​
LinkedIn: https://linkedin.com/company/aiven​​​
GitHub: https://github.com/aiven​​​
Twitter: https://twitter.com/aiven_io​​​