Skip to content

About us

Welcome to our Special Interest Group on Streaming & Real-Time Data 🚀
This group is a space for practitioners, architects, and enthusiasts passionate about modern streaming and real-time data systems. Our objective is to share knowledge, exchange ideas, and foster thought leadership across the evolving streaming ecosystem.

Topics we explore include:
• Event Streaming Technologies (Apache Kafka and beyond)
• Event-Driven Architecture
• Stream Processing
• Streaming Databases
• Real-Time Analytics
• Data Mesh
• …and more

Whether you’re building large-scale platforms or just getting started, this community is designed to spark meaningful discussions, learning, and collaboration. Join us and be part of the conversation shaping the future of streaming data.

Previous meetup recordings - https://www.youtube.com/@Platformatory/playlists

Upcoming events

1

See all
  • Bangalore Streams In-Person meetup - March 2026

    Bangalore Streams In-Person meetup - March 2026

    Smarsh Inc, Second Floor, Salarpuria Cambridge, 9, Cambridge Rd, Halasuru, Udani Layout, Bengaluru, Karnataka 560008, Bengaluru, IN

    Hello Bengaluruđź‘‹
    Join us for exciting discussions in the streaming world with opportunities to network with peers and leaders in the industry.

    đź“… When: March 7 2026 9:30am - 02:00pm
    📌Where: Smarsh Inc, Bengaluru (4th floor)
    🗺Directions: https://maps.app.goo.gl/Rrq1d7M9teNei7o56 (Smarsh Inc, Salarpuria Cambridge, 9, Cambridge Rd, Halasuru, Udani Layout, Bengaluru, Karnataka 560008 · Bengaluru)

    Thanks to RisingWave for sponsoring the F&B for the meetup!

    đź•’ Schedule :

    10:00 am - 10:20 am: Welcome & registrations
    10:30 am - 11:15 am: `Real-Time Analytics at Scale using RisingWave and StarRocks` by Sri Charan Sirpa, Tech Lead at KaptureCX
    11:20 pm - 12:00 pm: `Scaling Backend Systems at PhysicsWallah: From Pipelines to ClickHouse and Kafka` by V Santhosh Kumar, SDE-2 at PhysicsWallah
    12:00 pm - 12:15 pm: Networking break
    12:15 pm - 1:00 pm: `Architecting enterprise database synchronisation using CDC and Kafka Connect` by Balaji K, Lead Platform Engineer at Platformatory Labs
    1:00 pm - 2:00 pm: Lunch & Networking

    🎙️Talks:
    Real-Time Analytics at Scale using RisingWave and StarRocks
    Speaker: Sri Charan Sirpa, Tech Lead – Data Platform Team at KaptureCX
    About the talk: Building real-time analytics is no longer just about fast dashboards—it’s about processing streaming data continuously and serving low-latency analytical queries at scale.
    In this talk, I’ll walk through how we design real-time analytics using RisingWave for stream processing and StarRocks as the serving OLAP database. I’ll explain how streaming data flows from event sources into RisingWave, how materialized views are created and maintained in real time, and how StarRocks enables fast analytical queries across multiple business use cases such as Ticketing, VoiceBots, and QA systems.
    The session focuses on practical architecture choices, data modeling strategies, and performance considerations when running real-time analytics in production.

    Scaling Backend Systems at PhysicsWallah: From Pipelines to ClickHouse and Kafka
    Speaker: V Santhosh Kumar, SDE-2 at PhysicsWallah
    About the talk: In the session, I’ll walk through how our video engagement system at PhysicsWallah evolved from a simple, state-based model to a high-throughput, near real-time event pipeline handling hundreds of millions of events per day.
    We’ll explore why our early assumption — that lecture completion was a sufficient proxy for engagement — worked at small scale but broke down as we grew. As student behaviors diversified, our binary progress model flattened meaningful learning patterns into identical signals, exposing deeper limitations in how we captured and processed data.
    I’ll cover the architectural transition from state tracking to event-driven design, the challenges we faced while scaling Kafka consumers, the impact of silent data loss and rebalances, and why we eventually rethought our execution model to prioritize predictability under sustained load.
    Beyond the technical journey, this talk will focus on key engineering lessons around signal integrity, system design under pressure, and why preserving behavioral truth becomes a product-critical responsibility at scale.

    Architecting enterprise database synchronisation using CDC and Kafka Connect
    Speaker: Balaji K, Lead Platform Engineer at Platformatory Labs
    About the talk: In this session, I will present a real-time data synchronization use case implemented to replicate data between two Oracle databases. The pipeline uses the Oracle CDC Source Connector to capture database changes from the source Oracle database and publish them to Kafka topics, and the JDBC Sink Connector to write the data into the destination Oracle database. While the overall architecture works well for standard relational columns, handling LOB data types such as NCLOB introduced additional complexity. The Oracle CDC connector publishes LOB data in byte format within Kafka topics, which requires careful handling of character encoding and data reconstruction before writing it back to the destination database.
    In this talk, I will briefly walk through the architecture, highlight the challenges encountered while processing LOB columns, and explain the approach used to reconstruct the data using custom Kafka Connect transformations.

    • Photo of the user
    • Photo of the user
    • Photo of the user
    141 attendees

Group links

Members

4,172
See all