• About TC
  • Affiliate Disclaimer
  • Privacy Policy
  • TOS
  • Contact
Monday, May 12, 2025
Techcratic
Click For A Secret Deal
  • TC
  • AI
    Artificial Intelligence

    3 Excellent Practical Generative AI Courses

    Artificial Intelligence

    Building End-to-End Data Pipelines with Dask

    Artificial Intelligence

    Automate document translation and standardization with Amazon Bedrock and Amazon Translate

    Artificial Intelligence

    InterVision accelerates AI development using AWS LLM League and Amazon SageMaker AI

    Artificial Intelligence

    FireDucks: An Accelerated Fully Compatible Pandas Library

    Artificial Intelligence

    Breaking Out of Beginner: Python Patterns for Intermediate Data Scientists

    Artificial Intelligence

    Building a Personal Knowledge Management Tool with Reor

    Artificial Intelligence

    Build a location-aware agent using Amazon Bedrock Agents and Foursquare APIs

    Artificial Intelligence

    How to Fully Automate Text Data Cleaning with Python in 5 Steps

  • Crypto
    Metaplanet Acquires 1,241 Bitcoin, Holdings Reach 6,796 BTC Amid Strong Yield Performance

    Metaplanet Acquires 1,241 Bitcoin, Holdings Reach 6,796 BTC Amid Strong Yield Performance

    Bitcoin Bull Cycle is Over: CryptoQuant CEO

    Funds Safe After Oracle Incident Triggers Emergency Vote

    Coinbase Exposes 10,000+ Pages of Hidden Crypto Oversight Files—Transparency Just Got Real

    Coinbase Exposes 10,000+ Pages of Hidden Crypto Oversight Files—Transparency Just Got Real

    Ethereum chart pattern supports ‘moon shot’ rally to new price highs if confirmed — Trader

    Ethereum chart pattern supports ‘moon shot’ rally to new price highs if confirmed — Trader

    Six-Figure Bitcoin Rally Triggers Rare Price Inversion in South Korea

    Six-Figure Bitcoin Rally Triggers Rare Price Inversion in South Korea

    Geneva Gamble: Markets Await Outcome of Secretive US-China Talks

    Geneva Gamble: Markets Await Outcome of Secretive US-China Talks

    Bitcoin Hits $100K, A Royal BTC Drain, and More — Week in Review

    Bitcoin Hits $100K, A Royal BTC Drain, and More — Week in Review

    Bitcoin Price Watch: Bullish Trend Remains Intact Amid Volume Divergence

    Bitcoin Price Watch: Bullish Trend Remains Intact Amid Volume Divergence

    ASEAN, China, Japan, and South Korea Stand Against Trump’s Protectionism

    ASEAN, China, Japan, and South Korea Stand Against Trump’s Protectionism

  • Cybersecurity
    Cybersecurity

    Google Pays $1.375 Billion to Texas Over Unauthorized Tracking and Biometric Data Collection

    Cybersecurity

    Deploying AI Agents? Learn to Secure Them Before Hackers Strike Your Business

    Cybersecurity

    Google Rolls Out On-Device AI Protections to Detect Scams in Chrome and Android

    Cybersecurity

    Chinese Hackers Exploit SAP RCE Flaw CVE-2025-31324, Deploy Golang-Based SuperShell

    Cybersecurity

    SonicWall Patches 3 Flaws in SMA 100 Devices Allowing Attackers to Run Code as Root

    Cybersecurity

    Qilin Ransomware Ranked Highest in April 2025 with Over 45 Data Leak Disclosures

    Cybersecurity

    MirrorFace Targets Japan and Taiwan with ROAMINGMOUSE and Upgraded ANEL Malware

    Cybersecurity

    Cisco Patches CVE-2025-20188 (10.0 CVSS) in IOS XE That Enables Root Exploits via JWT

    Cybersecurity

    OttoKit WordPress Plugin with 100K+ Installs Hit by Exploits Targeting Multiple Flaws

  • Deals
    SIHOO M18 Ergonomic Office Chair for Big and Tall People Adjustable Headrest with 2D…

    SIHOO M18 Ergonomic Office Chair for Big and Tall People Adjustable Headrest with 2D…

    ACEMAGIC AMD Laptop,AMD Ryzen 7 5825U(8C/16T), Radeon RX Vega 8 Graphics,15.6“FHD…

    ACEMAGIC AMD Laptop,AMD Ryzen 7 5825U(8C/16T), Radeon RX Vega 8 Graphics,15.6“FHD…

    NCAA Football 09 (Renewed)

    NCAA Football 09 (Renewed)

    Mortal Kombat Armageddon

    Mortal Kombat Armageddon

    Atari Flashback Classics Collection Vol.1 (PS4)

    Atari Flashback Classics Collection Vol.1 (PS4)

    StarTech.com 4-Port USB C Hub – USB 3.2 Gen 2 (10Gbps) – 3x USB-A & 1x USB-C – Powered -…

    StarTech.com 4-Port USB C Hub – USB 3.2 Gen 2 (10Gbps) – 3x USB-A & 1x USB-C – Powered -…

    TP-Link Powered USB Hub 3.0 with 7 USB 3.0 Data Ports and 2 Smart Charging USB Ports….

    TP-Link Powered USB Hub 3.0 with 7 USB 3.0 Data Ports and 2 Smart Charging USB Ports….

    Portable Charger with Built in Cables, Portable Charger with Cords Wires Slim 10000mAh…

    Portable Charger with Built in Cables, Portable Charger with Cords Wires Slim 10000mAh…

    PNY GeForce 8800GTS 320 MB DDR3 PCIe Graphics Card

    PNY GeForce 8800GTS 320 MB DDR3 PCIe Graphics Card

  • Gaming
    Zelda Breath of the Wild – Best Shields by Durability + Parry Power

    Zelda Breath of the Wild – Best Shields by Durability + Parry Power

    There’s another Risk of Rain 2 expansion on the way

    There’s another Risk of Rain 2 expansion on the way

    GTA 6: Trailer 2 Final Prediction (I’m Done)

    GTA 6: Trailer 2 Final Prediction (I’m Done)

    OFFICIAL GTA 6 TRAILER SHOWCASE! ROCKSTAR GAMES GTA 6 TRAILER IS HERE!!

    OFFICIAL GTA 6 TRAILER SHOWCASE! ROCKSTAR GAMES GTA 6 TRAILER IS HERE!!

    GOD OF WAR RAGNAROK –Part 10 — Gameplay Walkthrough  [4K 60FPS] – No Commentary

    GOD OF WAR RAGNAROK –Part 10 — Gameplay Walkthrough [4K 60FPS] – No Commentary

    Killing Floor 3’s new release date is late July

    Killing Floor 3’s new release date is late July

    Find a path to the surface God Of War Ragnarok

    Find a path to the surface God Of War Ragnarok

    PS5 Shocks Gaming Industry Outperforms Series X! Forspoken Flops? | New Spiderman 2 PS5 Info…

    PS5 Shocks Gaming Industry Outperforms Series X! Forspoken Flops? | New Spiderman 2 PS5 Info…

    A unique horror strategy game about drilling for oil released on Steam last week

    A unique horror strategy game about drilling for oil released on Steam last week

  • Tesla
    4PCS Track Hooks for Tesla Cybertruck Bed Rail Track Accessories, (L-Track Hooks)

    4PCS Track Hooks for Tesla Cybertruck Bed Rail Track Accessories, (L-Track Hooks)

    TOZO 2025 Upgraded Wireless Charger 15W Max Fast Wireless Charging Pad Compatible with…

    TOZO 2025 Upgraded Wireless Charger 15W Max Fast Wireless Charging Pad Compatible with…

    Nycetek Car Phone Mount for Tesla, Strongest Magnetic [No Glue Required], 2025 Upgrade…

    Nycetek Car Phone Mount for Tesla, Strongest Magnetic [No Glue Required], 2025 Upgrade…

    Tesla employees ask Elon Musk to resign, confirm massive demand problem, get fired for it

    Tesla employees ask Elon Musk to resign, confirm massive demand problem, get fired for it

    Nilight Anti-Theft Shackle Hitch Receiver 2Inch 45000 LBs Breaking Strength 3/4″…

    Nilight Anti-Theft Shackle Hitch Receiver 2Inch 45000 LBs Breaking Strength 3/4″…

    JUSTTOP Car Steering Wheel Desk for Laptop,Multipurpose Travel Car Accessories, Car Seat…

    JUSTTOP Car Steering Wheel Desk for Laptop,Multipurpose Travel Car Accessories, Car Seat…

    Car Seat Covers Front Seat Covers Universal Fit for Cars, SUV, Sedans, Anti-Slip…

    Car Seat Covers Front Seat Covers Universal Fit for Cars, SUV, Sedans, Anti-Slip…

    Roof Sun Shade Umbrella for Tesla Cybertruck 2024, Heat Insulation 4- Layer UV Block…

    Roof Sun Shade Umbrella for Tesla Cybertruck 2024, Heat Insulation 4- Layer UV Block…

    Truck Hooks for Tesla Cybertruck 2024, Designed specifically for Cybertruck MOLLE…

    Truck Hooks for Tesla Cybertruck 2024, Designed specifically for Cybertruck MOLLE…

  • UFO
    Legendary British Alien Sighting | History's Greatest Mysteries (S6)

    Legendary British Alien Sighting | History's Greatest Mysteries (S6)

    Ancient Aliens: TOP 4 ALIEN GLYPHS DECIPHERED

    Ancient Aliens: TOP 4 ALIEN GLYPHS DECIPHERED

    RiffTrax: Plan 9 From Outer Space – from the stars of Mystery Science Theater 3000!

    RiffTrax: Plan 9 From Outer Space – from the stars of Mystery Science Theater 3000!

    ET Contactee Rebecca Videnoff Psychic Medium, Akashic Records Reader, Reiki Healer & Channeler

    ET Contactee Rebecca Videnoff Psychic Medium, Akashic Records Reader, Reiki Healer & Channeler

    Men’s Casual Jogger Shorts Elastic Waist Cargo Shorts Relaxed Fit SweatShorts Running Workout Sport Twill Shorts

    Men’s Casual Jogger Shorts Elastic Waist Cargo Shorts Relaxed Fit SweatShorts Running Workout Sport Twill Shorts

    Intergalactic PS5 – 10 New Details You May Not Know

    Intergalactic PS5 – 10 New Details You May Not Know

    EXPLORATION – Inspirational NASA Space Film

    EXPLORATION – Inspirational NASA Space Film

    BARCO One Velocity Scrub Top for Men – V-Neck Medical Top, Eco-Friendly Fabric, 4-Way Stretch Men’s Scrub Top

    BARCO One Velocity Scrub Top for Men – V-Neck Medical Top, Eco-Friendly Fabric, 4-Way Stretch Men’s Scrub Top

    SCARY ALIEN INVASION CONSPIRACY THEORY || NOT ALONE || Ep.4

    SCARY ALIEN INVASION CONSPIRACY THEORY || NOT ALONE || Ep.4

No Result
View All Result
  • TC
  • AI
    Artificial Intelligence

    3 Excellent Practical Generative AI Courses

    Artificial Intelligence

    Building End-to-End Data Pipelines with Dask

    Artificial Intelligence

    Automate document translation and standardization with Amazon Bedrock and Amazon Translate

    Artificial Intelligence

    InterVision accelerates AI development using AWS LLM League and Amazon SageMaker AI

    Artificial Intelligence

    FireDucks: An Accelerated Fully Compatible Pandas Library

    Artificial Intelligence

    Breaking Out of Beginner: Python Patterns for Intermediate Data Scientists

    Artificial Intelligence

    Building a Personal Knowledge Management Tool with Reor

    Artificial Intelligence

    Build a location-aware agent using Amazon Bedrock Agents and Foursquare APIs

    Artificial Intelligence

    How to Fully Automate Text Data Cleaning with Python in 5 Steps

  • Crypto
    Metaplanet Acquires 1,241 Bitcoin, Holdings Reach 6,796 BTC Amid Strong Yield Performance

    Metaplanet Acquires 1,241 Bitcoin, Holdings Reach 6,796 BTC Amid Strong Yield Performance

    Bitcoin Bull Cycle is Over: CryptoQuant CEO

    Funds Safe After Oracle Incident Triggers Emergency Vote

    Coinbase Exposes 10,000+ Pages of Hidden Crypto Oversight Files—Transparency Just Got Real

    Coinbase Exposes 10,000+ Pages of Hidden Crypto Oversight Files—Transparency Just Got Real

    Ethereum chart pattern supports ‘moon shot’ rally to new price highs if confirmed — Trader

    Ethereum chart pattern supports ‘moon shot’ rally to new price highs if confirmed — Trader

    Six-Figure Bitcoin Rally Triggers Rare Price Inversion in South Korea

    Six-Figure Bitcoin Rally Triggers Rare Price Inversion in South Korea

    Geneva Gamble: Markets Await Outcome of Secretive US-China Talks

    Geneva Gamble: Markets Await Outcome of Secretive US-China Talks

    Bitcoin Hits $100K, A Royal BTC Drain, and More — Week in Review

    Bitcoin Hits $100K, A Royal BTC Drain, and More — Week in Review

    Bitcoin Price Watch: Bullish Trend Remains Intact Amid Volume Divergence

    Bitcoin Price Watch: Bullish Trend Remains Intact Amid Volume Divergence

    ASEAN, China, Japan, and South Korea Stand Against Trump’s Protectionism

    ASEAN, China, Japan, and South Korea Stand Against Trump’s Protectionism

  • Cybersecurity
    Cybersecurity

    Google Pays $1.375 Billion to Texas Over Unauthorized Tracking and Biometric Data Collection

    Cybersecurity

    Deploying AI Agents? Learn to Secure Them Before Hackers Strike Your Business

    Cybersecurity

    Google Rolls Out On-Device AI Protections to Detect Scams in Chrome and Android

    Cybersecurity

    Chinese Hackers Exploit SAP RCE Flaw CVE-2025-31324, Deploy Golang-Based SuperShell

    Cybersecurity

    SonicWall Patches 3 Flaws in SMA 100 Devices Allowing Attackers to Run Code as Root

    Cybersecurity

    Qilin Ransomware Ranked Highest in April 2025 with Over 45 Data Leak Disclosures

    Cybersecurity

    MirrorFace Targets Japan and Taiwan with ROAMINGMOUSE and Upgraded ANEL Malware

    Cybersecurity

    Cisco Patches CVE-2025-20188 (10.0 CVSS) in IOS XE That Enables Root Exploits via JWT

    Cybersecurity

    OttoKit WordPress Plugin with 100K+ Installs Hit by Exploits Targeting Multiple Flaws

  • Deals
    SIHOO M18 Ergonomic Office Chair for Big and Tall People Adjustable Headrest with 2D…

    SIHOO M18 Ergonomic Office Chair for Big and Tall People Adjustable Headrest with 2D…

    ACEMAGIC AMD Laptop,AMD Ryzen 7 5825U(8C/16T), Radeon RX Vega 8 Graphics,15.6“FHD…

    ACEMAGIC AMD Laptop,AMD Ryzen 7 5825U(8C/16T), Radeon RX Vega 8 Graphics,15.6“FHD…

    NCAA Football 09 (Renewed)

    NCAA Football 09 (Renewed)

    Mortal Kombat Armageddon

    Mortal Kombat Armageddon

    Atari Flashback Classics Collection Vol.1 (PS4)

    Atari Flashback Classics Collection Vol.1 (PS4)

    StarTech.com 4-Port USB C Hub – USB 3.2 Gen 2 (10Gbps) – 3x USB-A & 1x USB-C – Powered -…

    StarTech.com 4-Port USB C Hub – USB 3.2 Gen 2 (10Gbps) – 3x USB-A & 1x USB-C – Powered -…

    TP-Link Powered USB Hub 3.0 with 7 USB 3.0 Data Ports and 2 Smart Charging USB Ports….

    TP-Link Powered USB Hub 3.0 with 7 USB 3.0 Data Ports and 2 Smart Charging USB Ports….

    Portable Charger with Built in Cables, Portable Charger with Cords Wires Slim 10000mAh…

    Portable Charger with Built in Cables, Portable Charger with Cords Wires Slim 10000mAh…

    PNY GeForce 8800GTS 320 MB DDR3 PCIe Graphics Card

    PNY GeForce 8800GTS 320 MB DDR3 PCIe Graphics Card

  • Gaming
    Zelda Breath of the Wild – Best Shields by Durability + Parry Power

    Zelda Breath of the Wild – Best Shields by Durability + Parry Power

    There’s another Risk of Rain 2 expansion on the way

    There’s another Risk of Rain 2 expansion on the way

    GTA 6: Trailer 2 Final Prediction (I’m Done)

    GTA 6: Trailer 2 Final Prediction (I’m Done)

    OFFICIAL GTA 6 TRAILER SHOWCASE! ROCKSTAR GAMES GTA 6 TRAILER IS HERE!!

    OFFICIAL GTA 6 TRAILER SHOWCASE! ROCKSTAR GAMES GTA 6 TRAILER IS HERE!!

    GOD OF WAR RAGNAROK –Part 10 — Gameplay Walkthrough  [4K 60FPS] – No Commentary

    GOD OF WAR RAGNAROK –Part 10 — Gameplay Walkthrough [4K 60FPS] – No Commentary

    Killing Floor 3’s new release date is late July

    Killing Floor 3’s new release date is late July

    Find a path to the surface God Of War Ragnarok

    Find a path to the surface God Of War Ragnarok

    PS5 Shocks Gaming Industry Outperforms Series X! Forspoken Flops? | New Spiderman 2 PS5 Info…

    PS5 Shocks Gaming Industry Outperforms Series X! Forspoken Flops? | New Spiderman 2 PS5 Info…

    A unique horror strategy game about drilling for oil released on Steam last week

    A unique horror strategy game about drilling for oil released on Steam last week

  • Tesla
    4PCS Track Hooks for Tesla Cybertruck Bed Rail Track Accessories, (L-Track Hooks)

    4PCS Track Hooks for Tesla Cybertruck Bed Rail Track Accessories, (L-Track Hooks)

    TOZO 2025 Upgraded Wireless Charger 15W Max Fast Wireless Charging Pad Compatible with…

    TOZO 2025 Upgraded Wireless Charger 15W Max Fast Wireless Charging Pad Compatible with…

    Nycetek Car Phone Mount for Tesla, Strongest Magnetic [No Glue Required], 2025 Upgrade…

    Nycetek Car Phone Mount for Tesla, Strongest Magnetic [No Glue Required], 2025 Upgrade…

    Tesla employees ask Elon Musk to resign, confirm massive demand problem, get fired for it

    Tesla employees ask Elon Musk to resign, confirm massive demand problem, get fired for it

    Nilight Anti-Theft Shackle Hitch Receiver 2Inch 45000 LBs Breaking Strength 3/4″…

    Nilight Anti-Theft Shackle Hitch Receiver 2Inch 45000 LBs Breaking Strength 3/4″…

    JUSTTOP Car Steering Wheel Desk for Laptop,Multipurpose Travel Car Accessories, Car Seat…

    JUSTTOP Car Steering Wheel Desk for Laptop,Multipurpose Travel Car Accessories, Car Seat…

    Car Seat Covers Front Seat Covers Universal Fit for Cars, SUV, Sedans, Anti-Slip…

    Car Seat Covers Front Seat Covers Universal Fit for Cars, SUV, Sedans, Anti-Slip…

    Roof Sun Shade Umbrella for Tesla Cybertruck 2024, Heat Insulation 4- Layer UV Block…

    Roof Sun Shade Umbrella for Tesla Cybertruck 2024, Heat Insulation 4- Layer UV Block…

    Truck Hooks for Tesla Cybertruck 2024, Designed specifically for Cybertruck MOLLE…

    Truck Hooks for Tesla Cybertruck 2024, Designed specifically for Cybertruck MOLLE…

  • UFO
    Legendary British Alien Sighting | History's Greatest Mysteries (S6)

    Legendary British Alien Sighting | History's Greatest Mysteries (S6)

    Ancient Aliens: TOP 4 ALIEN GLYPHS DECIPHERED

    Ancient Aliens: TOP 4 ALIEN GLYPHS DECIPHERED

    RiffTrax: Plan 9 From Outer Space – from the stars of Mystery Science Theater 3000!

    RiffTrax: Plan 9 From Outer Space – from the stars of Mystery Science Theater 3000!

    ET Contactee Rebecca Videnoff Psychic Medium, Akashic Records Reader, Reiki Healer & Channeler

    ET Contactee Rebecca Videnoff Psychic Medium, Akashic Records Reader, Reiki Healer & Channeler

    Men’s Casual Jogger Shorts Elastic Waist Cargo Shorts Relaxed Fit SweatShorts Running Workout Sport Twill Shorts

    Men’s Casual Jogger Shorts Elastic Waist Cargo Shorts Relaxed Fit SweatShorts Running Workout Sport Twill Shorts

    Intergalactic PS5 – 10 New Details You May Not Know

    Intergalactic PS5 – 10 New Details You May Not Know

    EXPLORATION – Inspirational NASA Space Film

    EXPLORATION – Inspirational NASA Space Film

    BARCO One Velocity Scrub Top for Men – V-Neck Medical Top, Eco-Friendly Fabric, 4-Way Stretch Men’s Scrub Top

    BARCO One Velocity Scrub Top for Men – V-Neck Medical Top, Eco-Friendly Fabric, 4-Way Stretch Men’s Scrub Top

    SCARY ALIEN INVASION CONSPIRACY THEORY || NOT ALONE || Ep.4

    SCARY ALIEN INVASION CONSPIRACY THEORY || NOT ALONE || Ep.4

No Result
View All Result
Techcratic
No Result
View All Result

glassflow/clickhouse-etl: Real-time deduplication and temporal joins for streaming data

Hacker News by Hacker News
May 11, 2025
in Hacker News
Reading Time: 20 mins read
127 3
A A
0
Home Hacker News
Share on FacebookShare on XShare on LinkedIn

2025-05-11 09:33:00
github.com


GlassFlow Logo

Join our weekly office hours every Wednesday 15:00-18:00 CET

Join Next Office Hour

Slack
Email Support
Twitter

GlassFlow for ClickHouse Streaming ETL is a real-time stream processor designed to simplify data pipeline creation and management between Kafka and ClickHouse. It provides a powerful, user-friendly interface for building and managing real-time data pipelines with built-in support for deduplication and temporal joins.

Built specifically for data engineers, GlassFlow handles late-arriving events, ensures exactly-once correctness, and scales with high-throughput data. It delivers accurate, low-latency results from streaming data without compromising simplicity or performance. The tool’s intuitive web interface makes it easy to configure and monitor pipelines, while its robust architecture ensures reliable data processing.

  • Streaming Deduplication:

    • Real-time deduplication of Kafka streams before ingestion into ClickHouse
    • Configurable time windows up to 7 days for deduplication
    • Simple configuration of deduplication keys and time windows
    • One-click setup for deduplicated data pipelines
    • Prevents duplicate data from reaching ClickHouse
  • Temporal Stream Joins:

    • Join two Kafka streams in real-time
    • Configurable time windows up to 7 days for stream joins
    • Configure join keys and time windows through the UI
    • Simplified join setup process
    • Produce joined streams ready for ClickHouse ingestion
  • Built-in Kafka Connector:

    • Powered by NATS-Kafka Bridge
    • Automatic data extraction from Kafka topics
    • Seamless integration with Kafka clusters
    • No manual data pulling required
    • Supports multiple Kafka topics and partitions
    • Native support for JSON data types
  • Optimized ClickHouse Sink:

    • Native ClickHouse connection for maximum performance
    • Configurable batch sizes for efficient data ingestion
    • Adjustable wait times for optimal throughput
    • Built-in retry mechanisms
    • Automatic schema detection and management
    • Full support for JSON data types in ClickHouse
  • User-Friendly Interface: Web-based UI for pipeline configuration and management

  • Local Development: Includes demo setup with local Kafka and ClickHouse instances

  • Docker Support: Easy deployment using Docker and docker-compose

  • Self-Hosted: Open-source solution that can be self-hosted in your infrastructure

GlassFlow ClickHouse ETL Introduction
  • Docker and Docker Compose
  • Git
  1. Clone the repository:
git clone https://github.com/glassflow/clickhouse-etl.git
cd clickhouse-etl
  1. Start the services using Docker Compose:
  1. Access the web interface at http://localhost:8080 to configure your pipeline:
  • Connection Setup:
    • Configure Kafka source connection parameters
    • Configure ClickHouse sink connection parameters
  • Pipeline Configuration:
    • Select Kafka input streams
    • Define deduplication key and time window
    • For joins: specify join keys and time window for each stream
    • Configure ClickHouse table settings
  • Pipeline Management:
    • Start/stop pipelines
    • Check pipeline logs
  1. Access the web interface at http://localhost:8080
  2. Configure your source (Kafka) and sink (ClickHouse) connections
  3. Define your pipeline transformations:
    • Set up deduplication rules
    • Configure temporal joins between streams
    • Define data transformations
  4. Start the pipeline and monitor its progress

The project includes a comprehensive demo setup in the demos folder that provides a complete local testing environment. This setup includes:

  • Local Kafka instance with Zookeeper
  • Local ClickHouse instance
  • Sample data generation
  • Example pipeline configurations
  • Step-by-step instructions for running the demo

For detailed instructions on setting up and running the local testing environment, see the Demo README file.

The project consists of several key components:

  • GlassFlow API: Core ETL engine written in Go
  • Web UI: User interface for pipeline management
  • NATS: Message broker for internal communication
  • Kafka Bridge: Connector for Kafka integration using the NATS-Kafka Bridge

For information about third-party components and their licenses, please see our NOTICE file.

The pipeline configuration is defined in a JSON file that specifies the source, sink, and any transformations. While the web interface automatically generates this configuration, understanding its structure can be helpful for advanced users.

The pipeline configuration is a JSON object that defines how data flows from Kafka topics to ClickHouse tables. It consists of three main components:

  1. Source Configuration (Kafka)
  2. Sink Configuration (ClickHouse)
  3. Join Configuration (Optional)

Field Type Required Description
pipeline_id string Yes Unique identifier for the pipeline. Must be non-empty.
source object Yes Configuration for the Kafka source. See Source Configuration.
sink object Yes Configuration for the ClickHouse sink. See Sink Configuration.
join object No Configuration for joining multiple Kafka topics. See Join Configuration.

The source configuration defines how to connect to and consume from Kafka topics.

Field Type Required Description
type string Yes “kafka” is the only supported source
provider string No Kafka provider, e.g. “aiven”
topics array Yes List of Kafka topics to consume from. See Topic Configuration.
connection_params object Yes Kafka connection parameters. See Connection Parameters.

Field Type Required Description
brokers array Yes List of Kafka broker addresses (e.g., [“localhost:9092”]).
protocol string Yes Security protocol for Kafka connection (e.g., “SASL_SSL”).
mechanism string Yes Authentication mechanism (e.g., “SCRAM-SHA-256”).
username string Yes Username for Kafka authentication.
password string Yes Password for Kafka authentication.
root_ca string No Cert. file for Kafka authentication.

Each topic in the topics array has the following configuration:

Field Type Required Description
name string Yes Name of the Kafka topic.
consumer_group_initial_offset string Yes Initial offset for the consumer group (“earliest” or “newest”).
schema object Yes Event schema definition. See Schema Configuration.
deduplication object Yes Deduplication settings. See Deduplication Configuration.

Field Type Required Description
type string Yes Schema type (Currently only “json” is supported).
fields array Yes List of field definitions. See Field Configuration.

Field Type Required Description
name string Yes Field name.
type string Yes Field type (e.g., “String”, “Integer”).

Deduplication Configuration

Field Type Required Description
enabled boolean Yes Whether deduplication is enabled.
id_field string Yes Field name used for message deduplication.
id_field_type string Yes Type of the ID field (e.g., “string”).
time_window string Yes Time window for deduplication (e.g., “1h” for one hour).

The sink configuration defines how to connect to and write to ClickHouse.

Field Type Required Description
type string Yes Must be “clickhouse”.
host string Yes ClickHouse server hostname.
port integer Yes ClickHouse server port.
database string Yes ClickHouse database name.
username string Yes ClickHouse username.
password string Yes ClickHouse password.
table string Yes Target table name.
secure boolean No Whether to use secure connection. Defaults to false.
max_batch_size integer No Maximum number of records to batch before writing. Defaults to 1000.
max_delay_time string No Maximum delay time before the messages are flushed into the sink. Defaults to “10m”.
table_mapping array Yes List of field to column mappings. See Table Mapping Configuration.

Table Mapping Configuration

Each mapping in the table_mapping array has the following configuration:

Field Type Required Description
source_id string Yes Name of the source topic.
field_name string Yes Source field name.
column_name string Yes Target column name.
column_type string Yes Target column type.

The join configuration defines how to join data from multiple Kafka topics.

Field Type Required Description
enabled boolean Yes Whether joining is enabled.
type string Yes Join type (e.g., “temporal”).
sources array Yes List of sources to join. See Join Source Configuration.

Join Source Configuration

Each source in the sources array has the following configuration:

Field Type Required Description
source_id string Yes Name of the Kafka topic to join.
join_key string Yes Field name used for joining records.
time_window string Yes Time window for joining records (e.g., “1h” for one hour).
orientation string Yes Join orientation (“left” or “right”).

{
  "pipeline_id": "kafka-to-clickhouse-pipeline",
  "source": {
    "type": "kafka",
    "provider": "aiven",
    "connection_params": {
      "brokers": [
        "kafka-broker-0:9092",
        "kafka-broker-1:9092"
      ],
      "protocol": "SASL_SSL",
      "mechanism": "SCRAM-SHA-256",
      "username": "",
      "password": "",
      "root_ca": ""
    },
    "topics": [
      {
        "consumer_group_initial_offset": "earliest",
        "name": "user_logins",
        "schema": {
          "type": "json",
          "fields": [
            {
              "name": "session_id",
              "type": "string"
            },
            {
              "name": "user_id",
              "type": "string"
            },
            {
              "name": "timestamp",
              "type": "datetime"
            }
          ]
        },
        "deduplication": {
          "enabled": true,
          "id_field": "session_id",
          "id_field_type": "string",
          "time_window": "12h"
        }
      },
      {
        "consumer_group_initial_offset": "earliest",
        "name": "orders",
        "schema": {
          "type": "json",
          "fields": [
            {
              "name": "user_id",
              "type": "string"
            },
            {
              "name": "order_id",
              "type": "string"
            },
            {
              "name": "timestamp",
              "type": "datetime"
            }
          ]
        },
        "deduplication": {
          "enabled": true,
          "id_field": "order_id",
          "id_field_type": "string",
          "time_window": "12h"
        }
      }
    ]
  },
  "join": {
    "enabled": false,
    "type": "temporal",
    "sources": [
      {
        "source_id": "user_logins",
        "join_key": "user_id",
        "time_window": "1h",
        "orientation": "left"
      },
      {
        "source_id": "orders",
        "join_key": "user_id",
        "time_window": "1h",
        "orientation": "right"
      }
    ]
  },
  "sink": {
    "type": "clickhouse",
    "provider": "aiven",
    "host": "",
    "port": "12753",
    "database": "default",
    "username": "",
    "password": "",
    "secure": true,
    "max_batch_size": 1,
    "max_delay_time": "10m",
    "table": "user_orders",
    "table_mapping": [
      {
        "source_id": "user_logins",
        "field_name": "session_id",
        "column_name": "session_id",
        "column_type": "UUID"
      },
      {
        "source_id": "user_logins",
        "field_name": "user_id",
        "column_name": "user_id",
        "column_type": "UUID"
      },
      {
        "source_id": "orders",
        "field_name": "order_id",
        "column_name": "order_id",
        "column_type": "UUID"
      },
      {
        "source_id": "user_logins",
        "field_name": "timestamp",
        "column_name": "login_at",
        "column_type": "DataTime"
      },
      {
        "source_id": "orders",
        "field_name": "timestamp",
        "column_name": "order_placed_at",
        "column_type": "DateTime"
      }
    ]
  }
}

💡 Note: The web interface automatically generates this configuration based on user input, so manual editing is not required.

We welcome contributions! Please see our Contributing Guidelines for details.

This project is licensed under the Apache License 2.0.



Source Link


Keep your files stored safely and securely with the SanDisk 2TB Extreme Portable SSD. With over 69,505 ratings and an impressive 4.6 out of 5 stars, this product has been purchased over 8K+ times in the past month. At only $129.99, this Amazon’s Choice product is a must-have for secure file storage.

Help keep private content private with the included password protection featuring 256-bit AES hardware encryption. Order now for just $129.99 on Amazon!


Start your free Amazon Prime trial
today and unlock unlimited streaming and more!

Help Power Techcratic’s Future – Scan To Support

If Techcratic’s content and insights have helped you, consider giving back by supporting the platform with crypto. Every contribution makes a difference, whether it’s for high-quality content, server maintenance, or future updates. Techcratic is constantly evolving, and your support helps drive that progress.

As a solo operator who wears all the hats, creating content, managing the tech, and running the site, your support allows me to stay focused on delivering valuable resources. Your support keeps everything running smoothly and enables me to continue creating the content you love. I’m deeply grateful for your support, it truly means the world to me! Thank you!

BITCOIN

Bitcoin Logo

Bitcoin QR Code

bc1qlszw7elx2qahjwvaryh0tkgg8y68enw30gpvge

Scan the QR code with your crypto wallet app

DOGECOIN

Dogecoin Logo

Dogecoin QR Code

D64GwvvYQxFXYyan3oQCrmWfidf6T3JpBA

Scan the QR code with your crypto wallet app

ETHEREUM

Ethereum Logo

Ethereum QR Code

0xe9BC980DF3d985730dA827996B43E4A62CCBAA7a

Scan the QR code with your crypto wallet app

Please read the Privacy and Security Disclaimer on how Techcratic handles your support.

Disclaimer: As an Amazon Associate, Techcratic may earn from qualifying purchases.

Tags: Hacker News
Share161Tweet101Share28
Previous Post

THE CALLISTO PROTOCOL (Game Review)

Next Post

Puppy Adventures ~ Fairy Lake ~ Museum ~ Sauble Beach ~ Southampton Ontario

Hacker News

Hacker News

Stay updated with Hacker News, where technology meets entrepreneurial spirit. Get the latest on tech trends, startup news, and discussions from the tech community. Read the latest updates here at Techcratic.

Related Posts

The Glitch Gallery
Hacker News

The Glitch Gallery

May 12, 2025
1.3k
Researchers unveil unprecedented satellite that will have to be turned off when it floats over the US: ‘This was a pity’
Hacker News

Researchers unveil unprecedented satellite that will have to be turned off when it floats over the US: ‘This was a pity’

May 11, 2025
1.3k
Klarna changes its AI tune and again recruits humans for customer service
Hacker News

Klarna changes its AI tune and again recruits humans for customer service

May 11, 2025
1.3k
Google Gemini update breaks content filters • The Register
Hacker News

Google Gemini update breaks content filters • The Register

May 11, 2025
1.3k
» Why the Apple II Didn’t Support Lowercase Letters
Hacker News

» Why the Apple II Didn’t Support Lowercase Letters

May 11, 2025
1.3k
Efeckc17/simple-todo-c: A lightweight, native Windows Todo application built with pure C and the Win32 API.
Hacker News

Efeckc17/simple-todo-c: A lightweight, native Windows Todo application built with pure C and the Win32 API.

May 11, 2025
1.3k
Load More
Next Post
Puppy Adventures ~ Fairy Lake ~ Museum ~ Sauble Beach ~ Southampton Ontario

Puppy Adventures ~ Fairy Lake ~ Museum ~ Sauble Beach ~ Southampton Ontario

Everything You Need To Know About The Private Business Jet

Everything You Need To Know About The Private Business Jet

Who Makes RCA TVs And Where Are They Built?

Who Makes RCA TVs And Where Are They Built?

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Tech Resources

  • 30 Second Tech ™
  • AI
  • App Zone ™
  • Apple
  • Ars Technica
  • CNET
  • ComputerWorld
  • Crypto News
  • Cybersecurity
  • Endgadget
  • Fossbytes
  • Gaming
  • GeekWire
  • Gizmodo
  • Google News
  • Hacker News
  • Harvard Tech
  • I Like Cats ™
  • I Like Dogs ™
  • LifeHacker
  • MacRumors
  • Macworld
  • Mashable
  • Microsoft
  • MIT Tech
  • PC World
  • Photofocus
  • Physics
  • Random Tech
  • Retro Rewind ™
  • Robot Report
  • SiliconANGLE
  • SlashGear
  • Smartphone
  • StackSocial
  • Tech Art
  • Tech Careers
  • Tech Deals
  • Techcratic ™
  • TechCrunch
  • Techdirt
  • TechRepublic
  • Techs Got To Eat ™
  • TechSpot
  • Tesla
  • The Verge
  • TNW
  • Trusted Reviews
  • UFO
  • VentureBeat
  • Visual Capitalist
  • Weird Stuff
  • Wired
  • ZDNet

Tech News

  • 30 Second Tech ™
  • AI
  • AnandTech
  • Apple Insider
  • Ars Technica
  • CNET
  • ComputerWorld
  • Crypto News
  • Cybersecurity
  • Endgadget
  • ExtremeTech
  • Fossbytes
  • Gaming
  • GeekWire
  • Gizmodo

Tech News

  • Harvard Tech
  • MacRumors
  • Macworld
  • Mashable
  • Microsoft
  • MIT Tech
  • Physics
  • PC World
  • Random Tech
  • Retro Rewind ™
  • SiliconANGLE
  • SlashGear
  • Smartphone
  • StackSocial
  • Tech Careers

Tech News​

  • Tech Art
  • TechCrunch
  • Techdirt
  • TechRepublic
  • Techs Got To Eat ™
  • TechSpot
  • Tesla
  • The Verge
  • TNW
  • Trusted Reviews
  • UFO
  • VentureBeat
  • Visual Capitalist
  • Weird Stuff
  • Wired
  • ZDNet

Site Links

  • About Techcratic
  • Affiliate Disclaimer
  • Affiliate Link Policy
  • Contact Techcratic
  • Dealors Discount Store
  • Privacy and Security Disclaimer
  • Privacy Policy
  • RSS Feed
  • Site Map
  • Support Techcratic
  • Techcratic
  • Tech Deals
  • TOS
  • 𝕏
Click For A Secret Deal

Techcratic – Your All In One Tech Hub © 2020 – 2025
All Rights Reserved
∞

No Result
View All Result
  • Home
  • Apple
  • Gaming
  • Microsoft
  • AnandTech