In this lab, you'll build a real-time data processing pipeline using Amazon Kinesis services. You will configure Kinesis Data Streams to ingest streaming data, process it with AWS Lambda functions, and store the results in Amazon S3. Additionally, you'll set up AWS Glue to catalog the data and enable quick access with Amazon Athena for data analytics. This lab will give you hands-on experience with critical AWS services that are foundational for real-time analytics solutions, demonstrating how to efficiently integrate streaming and batch processes.
A streaming video platform needs to process user activity data in real-time to provide insights into viewer engagement and content popularity. As the data engineer, you'll implement a processing pipeline that can handle hundreds of data streams simultaneously and offer near real-time analytics.