Cardinal Lakerunner

Cardinal Lakerunner

Turn your S3-compatible object store into a blazing-fast observability backend

What is Lakerunner?

Lakerunner is an event-driven ingestion engine that transforms S3-compatible object storage into a real-time observability backend. It automatically monitors for structured telemetry data, converts it into optimized Apache Parquet format, and handles indexing, aggregation, and compaction operations in real timeβ€”all without vendor lock-in.

⚑Event-Driven Architecture

Triggers ingestion automatically when new objects are created in S3 buckets via S3 Object Notifications.

πŸ”“No Vendor Lock-In

Works with any S3-compatible object storage. Your data stays in your infrastructure.

Key Features

πŸš€

Automatic S3 Ingestion

Triggered by object creation events for real-time data processing.

πŸ“Š

OpenTelemetry Native

Native support for OpenTelemetry log and metric proto files.

πŸ”Œ

Multiple Data Sources

CSV, JSON.gz, DataDog, FluentBit, and more.

⏱️

Quick Setup

Deploy locally in under 5 minutes with automated installation.

πŸ“ˆ

Grafana Integration

Built-in Grafana plugin for immediate data visualization and querying.

☸️

Cloud Native

Docker and Kubernetes support with Helm charts.

Supported Input Formats

πŸ“

OpenTelemetry Protocol Files

Native support for OTLP logs and metrics

πŸ“„

CSV Data

Structured data from CSV files

πŸ—œοΈ

JSON.gz Compressed Files

Compressed JSON data ingestion

πŸ”—

DataDog & FluentBit

Integration with popular observability tools

Getting Started

Deploy Lakerunner locally in under 5 minutes with our automated installation script

Requirements

  • βœ“Docker
  • βœ“Kubernetes (via kind or minikube)
  • βœ“kubectl
  • βœ“Helm 3.14+
# Quick install with automated script
$ curl -sSL https://github.com/cardinalhq/lakerunner-cli | bash

Once running: Access Grafana at localhost:3000 to explore your ingested telemetry through the bundled Cardinal datasource plugin.