VPL Connectors

This document describes how to connect Varpulis to external systems for both event ingestion (sources) and output routing (sinks).

Overview

Connector	Input	Output	Status	Feature Flag
MQTT	Yes	Yes	Production	`mqtt`
NATS	Yes	Yes	Production	`nats`
HTTP	No	Yes	Output only (webhooks)	default
Kafka	Yes	Yes	Available	`kafka`
Console	No	Yes	Debug	default
PostgreSQL CDC	Yes	No	Available	`cdc`

Feature Flags

Connectors are compiled via Cargo feature flags:

bash

# Build with MQTT only
cargo build --release --features mqtt

# Build with all connectors
cargo build --release --features all-connectors

# Docker build with Kafka support
docker build -f deploy/docker/Dockerfile \
  --build-arg FEATURES="mqtt,kafka" \
  -t varpulis/varpulis:latest .

Available features: mqtt, kafka, nats, postgres, mysql, sqlite, database, redis, persistence, cdc, encryption, all-connectors.

Connector Declaration Syntax

Connectors are declared at the top of a VPL file using connector Name = type (params):

varpulis

connector MqttSensors = mqtt (
    host: "localhost",
    port: 1883,
    client_id: "varpulis-app"
)

connector KafkaOutput = kafka (
    brokers: ["kafka:9092"],
    group_id: "varpulis-consumer"
)

connector AlertWebhook = http (
    url: "https://hooks.example.com/alerts"
)

Source Binding with `.from()`

Bind a stream to ingest events from a connector:

varpulis

stream Temperatures = TemperatureReading
    .from(MqttSensors, topic: "sensors/temperature/#")

Sink Routing with `.to()`

Route a stream's output to a connector:

varpulis

stream AlertsToKafka = AllAlerts
    .to(KafkaOutput)

stream CriticalToWebhook = CriticalAlerts
    .to(AlertWebhook)

MQTT Connector

MQTT is the recommended connector for IoT and production deployments. It provides reliable message delivery with QoS support.

Declaration

varpulis

connector MqttSensors = mqtt (
    host: "localhost",
    port: 1883,
    client_id: "varpulis-app"
)

Parameters

Parameter	Type	Required	Default	Description
`host`	string	Yes	-	MQTT broker hostname or IP address
`port`	int	Yes	1883	MQTT broker port
`client_id`	string	Yes	-	Unique identifier for this client

Topic Wildcards

Topics are specified in .from() and support MQTT wildcards:

# - Multi-level wildcard (matches any number of levels)
+ - Single-level wildcard (matches exactly one level)

Examples:

sensors/#            # Matches sensors/temp, sensors/humidity, sensors/zone1/temp
sensors/+            # Matches sensors/temp, sensors/humidity (but not sensors/zone1/temp)
sensors/+/temp       # Matches sensors/zone1/temp, sensors/zone2/temp

Event Format

Events received from MQTT must be JSON with an event_type field (or type for short):

json

{
  "type": "TemperatureReading",
  "sensor_id": "sensor-1",
  "zone": "lobby",
  "value": 23.5,
  "timestamp": 1706400000
}

Output Format

Stream .emit() results are published as JSON:

json

{
  "event_type": "HighTempAlert",
  "data": {
    "alert_type": "HIGH_TEMPERATURE",
    "zone": "lobby",
    "temperature": 45.2
  },
  "timestamp": "2026-02-04T10:30:00Z"
}

Complete Example

varpulis

# Connector declarations
connector MqttSensors = mqtt (
    host: "localhost",
    port: 1883,
    client_id: "fraud-detector-prod"
)

connector KafkaAlerts = kafka (
    brokers: ["kafka:9092"],
    group_id: "fraud-alerts"
)

# Event definitions
event Login:
    user_id: str
    ip_address: str
    device: str

event Transaction:
    user_id: str
    amount: float
    status: str
    merchant: str

# Ingest from MQTT
stream Logins = Login
    .from(MqttSensors, topic: "transactions/login")

stream Transactions = Transaction
    .from(MqttSensors, topic: "transactions/payment")

# Pattern: Login followed by failed transaction within 10 minutes
stream SuspiciousActivity = Login as login
    -> Transaction where user_id == login.user_id and status == "failed" as tx
    .within(10m)
    .emit(
        alert_type: "LOGIN_THEN_FAILED_TX",
        user_id: login.user_id,
        login_ip: login.ip_address,
        failed_amount: tx.amount,
        merchant: tx.merchant,
        severity: if tx.amount > 1000 then "high" else "medium"
    )

# Route alerts to Kafka
stream AlertsOut = SuspiciousActivity
    .to(KafkaAlerts)

Running with MQTT

bash

# Basic execution (requires --features mqtt)
varpulis run --file fraud_detection.vpl

# With verbose logging
RUST_LOG=info varpulis run --file fraud_detection.vpl

Deprecated: `config mqtt` Block

Deprecated: The config mqtt { } block syntax is deprecated. Use the connector declaration + .from() syntax instead. The legacy syntax still works but will be removed in a future version.

varpulis

# DEPRECATED - do not use
config mqtt {
    broker: "localhost",
    port: 1883,
    client_id: "my-app",
    input_topic: "events/#",
    output_topic: "alerts"
}

# USE THIS INSTEAD
connector MqttBroker = mqtt (
    host: "localhost",
    port: 1883,
    client_id: "my-app"
)

stream Events = MyEvent
    .from(MqttBroker, topic: "events/#")

Kafka Connector

Kafka provides high-throughput, durable event streaming. Requires the kafka feature flag.

Declaration

varpulis

connector KafkaBroker = kafka (
    brokers: ["broker1:9092", "broker2:9092"],
    group_id: "varpulis-consumer"
)

Parameters

Parameter	Type	Required	Default	Description
`brokers`	array	Yes	-	List of Kafka broker addresses
`group_id`	string	Yes	-	Consumer group ID
`batch_size`	int	No	65536	Maximum size (bytes) of a Kafka producer batch
`linger_ms`	int	No	5	Time (ms) to wait for additional messages before sending a batch
`compression_type`	string	No	`lz4`	Compression codec: `none`, `gzip`, `snappy`, `lz4`, `zstd`
`message_timeout_ms`	int	No	30000	Timeout (ms) for message delivery acknowledgment
`exactly_once`	bool	No	false	Enable transactional (exactly-once) delivery semantics
`transactional_id`	string	No	-	Explicit transactional ID (implies exactly-once)

Batching and Throughput

By default, Varpulis sends Kafka events concurrently: all events in a batch are enqueued into librdkafka's internal buffer, then delivery acknowledgments are awaited together. This lets librdkafka's internal batcher combine messages according to batch_size and linger_ms, yielding 10x+ throughput compared to per-event delivery.

Tune these parameters for your workload:

varpulis

connector HighThroughputKafka = kafka (
    brokers: "broker1:9092,broker2:9092",
    batch_size: 131072,
    linger_ms: 10,
    compression_type: "lz4"
)

Note: These parameter names use VPL underscore convention. They map to rdkafka's batch.size, linger.ms, compression.type, and message.timeout.ms respectively. You can also use the dot-notation names directly.

Usage

varpulis

# Ingest from Kafka
stream Events = SensorReading
    .from(KafkaBroker, topic: "sensor-events")

# Output to Kafka
stream AlertsOut = ProcessedAlerts
    .to(KafkaBroker)

Building with Kafka

bash

# Requires rdkafka (librdkafka)
cargo build --release --features mqtt,kafka

Security

Kafka supports multiple authentication and encryption methods. Security credentials should not be placed directly in VPL files. Instead, use an external credentials file with named security profiles.

Security Parameters

Parameter	Type	Description
`security_protocol`	string	Protocol: `PLAINTEXT`, `SSL`, `SASL_SSL`, `SASL_PLAINTEXT`
`sasl_mechanism`	string	SASL mechanism: `PLAIN`, `SCRAM-SHA-256`, `SCRAM-SHA-512`, `OAUTHBEARER`
`sasl_username`	string	SASL username
`sasl_password`	string	SASL password (use credentials file, not inline VPL)
`ssl_ca_location`	string	Path to CA certificate (PEM)
`ssl_certificate_location`	string	Path to client certificate (PEM)
`ssl_key_location`	string	Path to client private key (PEM)

These parameters are defined in a credentials profile, not in the VPL connector declaration.

Profile Usage

Define security credentials in ~/.varpulis/credentials.yaml:

yaml

profiles:
  production:
    connector_type: kafka
    properties:
      security_protocol: SASL_SSL
      sasl_mechanism: SCRAM-SHA-512
      sasl_username: varpulis-app
      sasl_password: "ENC[AES256-GCM,base64...]"
      ssl_ca_location: /etc/varpulis/certs/ca.pem

Then reference the profile in VPL:

varpulis

connector Kafka = kafka (
    brokers: "kafka-1:9093,kafka-2:9093",
    group_id: "varpulis-prod",
    profile: "production"
)

For full details on credentials file format, master key setup, encryption, mTLS, SCRAM walkthroughs, and security best practices, see the Connector Security Guide.

NATS Connector

NATS provides lightweight, high-performance messaging. It uses a single multiplexed connection for both subscriptions and publishing. Requires the nats feature flag.

Declaration

varpulis

connector NatsMarket = nats (
    servers: "nats://localhost:4222",
    queue_group: "varpulis"
)

Parameters

Parameter	Type	Required	Default	Description
`servers`	string	Yes	-	NATS server URL(s), e.g. `nats://host:4222`
`queue_group`	string	No	-	Queue group for load-balanced consumption

Subject Wildcards

NATS subjects use . as a separator with two wildcard tokens:

* — Matches a single token: trades.* matches trades.AAPL but not trades.us.AAPL
> — Matches one or more tokens (must be last): trades.> matches trades.AAPL and trades.us.AAPL

Examples:

sensors.*              # Matches sensors.temp, sensors.humidity (NOT sensors.zone1.temp)
sensors.>              # Matches sensors.temp, sensors.zone1.temp, sensors.zone1.zone2.temp
market.trades.*        # Matches market.trades.AAPL, market.trades.GOOG
market.>               # Matches market.trades.AAPL, market.quotes.AAPL, etc.

Event Format

Events received from NATS must be JSON. Two formats are supported:

Flat format (recommended):

json

{"type": "Trade", "symbol": "AAPL", "price": 150.25, "volume": 1000}

Nested format (with data envelope):

json

{"event_type": "Trade", "data": {"symbol": "AAPL", "price": 150.25, "volume": 1000}}

The type or event_type field determines which VPL stream processes the event. If neither is present, the last .-delimited segment of the NATS subject is used as the event type.

Queue Groups (Load Balancing)

When multiple Varpulis instances share the same queue_group, each NATS message is delivered to exactly one instance:

varpulis

connector NatsShared = nats (
    servers: "nats://localhost:4222",
    queue_group: "varpulis-workers"
)

Without queue_group, every instance receives every message (fan-out).

Usage

varpulis

# Ingest from NATS
stream Trades = Trade
    .from(NatsMarket, topic: "trades.>")

# Output to NATS
stream Alerts = HighValueTrades
    .to(NatsMarket, topic: "alerts.high-value")

Complete Example

varpulis

connector NatsMarket = nats (
    servers: "nats://localhost:4222",
    queue_group: "market-processor"
)

connector NatsAlerts = nats (
    servers: "nats://localhost:4222"
)

event Trade:
    symbol: str
    price: float
    volume: int
    exchange: str

# Ingest trades from all exchanges
stream Trades = Trade
    .from(NatsMarket, topic: "market.trades.>")

# Detect large trades and publish alerts
stream LargeTrades = Trade
    .from(NatsMarket, topic: "market.trades.>")
    .where(volume > 10000)
    .emit(
        alert_type: "LARGE_TRADE",
        symbol: symbol,
        price: price,
        volume: volume,
        exchange: exchange
    )
    .to(NatsAlerts, topic: "alerts.large-trades")

# Detect rapid trade sequence: 3 trades for the same symbol within 10s
stream RapidTrading = Trade as t1
    -> Trade where symbol == t1.symbol as t2
    -> Trade where symbol == t1.symbol as t3
    .within(10s)
    .emit(
        alert_type: "RAPID_TRADING",
        symbol: t1.symbol,
        trade_count: 3,
        price_change: t3.price - t1.price
    )
    .to(NatsAlerts, topic: "alerts.rapid-trading")

Running with NATS

bash

# Start nats-server
docker run -d -p 4222:4222 nats:latest

# Run the pipeline
varpulis run --file market_pipeline.vpl

# Publish test events
nats pub market.trades.NYSE '{"type":"Trade","symbol":"AAPL","price":150.25,"volume":15000,"exchange":"NYSE"}'

Building with NATS

bash

# Build with NATS support
cargo build --release --features nats

# Build with multiple connectors
cargo build --release --features mqtt,nats,kafka

NATS Cluster Transport

NATS is also used as the transport layer for Varpulis cluster communication (coordinator-worker messaging). This is a separate feature from the data connector. See NATS Transport Architecture for details.

HTTP Connector

The HTTP connector sends events to webhooks and REST APIs (output only).

Declaration

varpulis

connector AlertWebhook = http (
    url: "https://webhook.example.com/alerts"
)

Usage

varpulis

stream CriticalAlerts = AllAlerts
    .where(severity == "critical")
    .to(AlertWebhook)

HTTP Source (Server Mode)

For HTTP input, use Varpulis in server mode with the REST API:

bash

# Start the server
varpulis server --port 9000 --api-key "your-key" --metrics

# Inject events via HTTP POST
curl -X POST http://localhost:9000/api/v1/pipelines/<id>/events \
  -H "X-API-Key: your-key" \
  -H "Content-Type: application/json" \
  -d '{"event_type": "Login", "fields": {"user_id": "user123"}}'

PostgreSQL CDC Connector

PostgreSQL Change Data Capture (CDC) streams database changes (INSERT, UPDATE, DELETE) as Varpulis events using PostgreSQL logical replication. Requires the cdc feature flag.

Prerequisites

PostgreSQL 10+ with wal_level = logical in postgresql.conf
A publication for the tables you want to track:

sql

-- Create a publication for specific tables
CREATE PUBLICATION my_pub FOR TABLE orders, payments;

-- Or for all tables
CREATE PUBLICATION my_pub FOR ALL TABLES;

Declaration

varpulis

connector pg = postgres_cdc(
    host: "localhost",
    dbname: "myapp",
    publication: "my_pub",
    slot_name: "varpulis_slot"
)

Parameters

Parameter	Type	Required	Default	Description
`host`	string	Yes	-	PostgreSQL hostname
`port`	int	No	5432	PostgreSQL port
`dbname`	string	Yes	-	Database name
`user`	string	No	`postgres`	Database user (must have replication privilege)
`password`	string	No	-	Database password
`slot_name`	string	No	`varpulis_slot`	Logical replication slot name
`publication`	string	No	`varpulis_pub`	Publication name
`tables`	array	No	all	Specific tables to track

Event Format

Each database change becomes a Varpulis event:

Event type: {table}.{INSERT|UPDATE|DELETE} (e.g., orders.INSERT)
_table field: Table name
_op field: Operation (INSERT, UPDATE, DELETE)
Column fields: Each column value as an event field
UPDATE events: Include both old and new column values

Usage

varpulis

connector pg = postgres_cdc(
    host: "localhost",
    dbname: "myapp",
    publication: "my_pub"
)

# Stream all order inserts
stream NewOrders = pg.from(orders)
    .where(_op == "INSERT")
    .select(order_id, amount, customer_id)

# Detect rapid price changes
stream PriceChanges = pg.from(products)
    .where(_op == "UPDATE")
    .select(product_id, old_price, new_price)
    .where(abs(new_price - old_price) / old_price > 0.1)
    .emit(alert: "Large price change", product_id: product_id)

Building with CDC

bash

cargo build --release --features cdc

Console Connector

For debugging, stream output is printed to stdout when no .to() connector is specified:

varpulis

stream DebugOutput = SomeStream
    .where(value > 100)
    .emit(debug_info: "High value detected", value: value)

VPL Connectors ​

Overview ​

Feature Flags ​

Connector Declaration Syntax ​

Source Binding with .from() ​

Sink Routing with .to() ​

MQTT Connector ​

Declaration ​

Parameters ​

Topic Wildcards ​

Event Format ​

Output Format ​

Complete Example ​

Running with MQTT ​

Deprecated: config mqtt Block ​

Kafka Connector ​

Declaration ​

Parameters ​

Batching and Throughput ​

Usage ​

Building with Kafka ​

Security ​

Security Parameters ​

Profile Usage ​

NATS Connector ​

Declaration ​

Parameters ​

Subject Wildcards ​

Event Format ​

Queue Groups (Load Balancing) ​

Usage ​

Complete Example ​

Running with NATS ​

Building with NATS ​

NATS Cluster Transport ​

HTTP Connector ​

Declaration ​

Usage ​

HTTP Source (Server Mode) ​

PostgreSQL CDC Connector ​

Prerequisites ​

Declaration ​

Parameters ​

Event Format ​

Usage ​

Building with CDC ​

Console Connector ​

See Also ​

VPL Connectors

Overview

Feature Flags

Connector Declaration Syntax

Source Binding with `.from()`

Sink Routing with `.to()`

MQTT Connector

Declaration

Parameters

Topic Wildcards

Event Format

Output Format

Complete Example

Running with MQTT

Deprecated: `config mqtt` Block

Kafka Connector

Declaration

Parameters

Batching and Throughput

Usage

Building with Kafka

Security

Security Parameters

Profile Usage

NATS Connector

Declaration

Parameters

Subject Wildcards

Event Format

Queue Groups (Load Balancing)

Usage

Complete Example

Running with NATS

Building with NATS

NATS Cluster Transport

HTTP Connector

Declaration

Usage

HTTP Source (Server Mode)

PostgreSQL CDC Connector

Prerequisites

Declaration

Parameters

Event Format

Usage

Building with CDC

Console Connector

See Also