WhatPowerstheBrain.
DuckDB execution engine. Rust concurrency layer. Apache Iceberg storage. 13-step query pipeline. Here's how it all fits together.
Integrations
Works With Your Agent Framework
First-class support for the tools your agents already use.
QUERY PIPELINE
13 Steps. Every Query. Every Time.
Every SQL statement — from agent or human — passes through the same hardened pipeline. Security, governance, and billing are never optional.
THE ENGINE ROOM
Built on Giants. Engineered for Production.
We didn't reinvent the wheel. We built the car. HatiData combines the raw vectorized speed of DuckDB with the concurrency, security, and state management required for the modern cloud.
The Kernel: Vectorized Execution
We leverage DuckDB’s columnar, vectorized execution engine to process analytical queries at the speed of memory.
- Zero-Copy: Data is processed in Apache Arrow format.
- SIMD Optimized: Parallel instruction execution for aggregations.
- MIT Licensed: Open standards. No proprietary lock-in.
The Brain: Concurrency & Safety
DuckDB is single-process. HatiData makes it cloud-native. Our custom Rust proxy handles connection pooling, async scheduling, and fault tolerance.
- Stateless: Nodes spin up/down in milliseconds.
- Safe: Memory-safe concurrency prevents crashes under load.
- Smart: Automatic query routing based on data locality.
The State: Governance & Storage
We decouple compute from storage completely. Your data lives in object storage; we just borrow it.
- IAM Native: We use cloud-native IAM (AWS, GCP, Azure), not static keys.
- ACID Compliance: Full transactional integrity via Apache Iceberg.
- Infinite Scale: Storage scales independently of compute.
Why HatiData?
DuckDB is an incredible engine. But production workloads need more than an engine — they need a platform.
| Feature | Raw DuckDB (DIY) | HatiData (Enterprise) |
|---|---|---|
| Execution Engine | Vectorized (Fast) | Vectorized (Fast) |
| Concurrency | Single-Process / Locked | Multi-Node Auto-Scaling |
| Storage Layer | Local Files / Manual S3 | Managed Iceberg Catalog Sync |
| Security | None (File Permissions) | RBAC, IAM, & SSO Integration |
| Caching | OS Page Cache | Intelligent NVMe Tiering |
| Billing | N/A | Per-Second (No Minimums) |
The “No Lock-In” Guarantee
We use the open-source DuckDB engine and the standard Apache Iceberg format — you are never locked into a proprietary ecosystem. You can read your HatiData tables with Spark, Trino, or a local Python script — anytime, anywhere.
SECURITY
Enterprise Security, From Day One
Not Phase 3. Not 'coming soon'. Day One.
CMEK
Your encryption keys. Your KMS. We never see them.
PrivateLink
Zero public internet traversal. Private connectivity only.
Immutable Audit
Every query logged. S3 Object Lock. 7-year retention.
RBAC + Masking
6 roles. Column-level masking. Row-level security.
DEPLOYMENT
One Command to Promote
Start local. Push to cloud or VPC with a single command. Data, schemas, and agent memories migrate seamlessly.
See it in action.
Install locally in 30 seconds. The full pipeline runs on your laptop.