Cloud-native open source data platform

Dataround is a cloud-native, open-source platform that combines lakehouse and real-time data warehouse capabilities, providing high performance, stability, and elasticity for analytics, streaming, and machine learning.

Dataround Link

Dataround Link is an open-source data integration tool for synchronizing data across multiple heterogeneous sources. It integrates and synchronizes structured, semi-structured, and unstructured data.

  • Rich connectors for databases, message queues, and files
  • Batch and streaming ingestion with CDC support
  • Operational monitoring, alerts, and retry policies
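As an illustration of what a retry policy for a synchronization job can look like, here is a minimal Python sketch of retries with exponential backoff. The names `backoff_delays` and `run_with_retry`, and every parameter, are hypothetical and chosen for this example; they are not Dataround Link's API or configuration.

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


def backoff_delays(max_attempts: int, base_delay: float, factor: float = 2.0) -> List[float]:
    """Seconds to wait before each retry; max_attempts allows max_attempts - 1 retries."""
    return [base_delay * factor ** i for i in range(max_attempts - 1)]


def run_with_retry(
    task: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,  # injectable so tests need not wait
) -> T:
    """Run task, retrying on any exception with exponential backoff."""
    delays = backoff_delays(max_attempts, base_delay)
    for attempt in range(max_attempts):
        try:
            return task()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # attempts exhausted: surface the last error to the caller
            sleep(delays[attempt])
    raise ValueError("max_attempts must be at least 1")
```

A real policy would usually retry only errors known to be transient (timeouts, dropped connections) and raise an alert once the attempts are exhausted.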

Dataround Platform

Dataround Platform focuses on operational excellence for enterprise deployments.

  • Integrates the Apache Spark compute engine and the Apache Iceberg open table format
  • Built-in Apache Doris real-time data warehouse with sub-second analytics
  • Elastic, resilient cloud-native architecture for Kubernetes and major cloud providers
  • Enterprise-grade high availability and performance, with Apache Ranger access control

Key Features

Powerful capabilities for modern data platforms

Real-time Analytics

The Doris real-time data warehouse provides sub-second query performance, enabling instant insights and decisions.

Apache Flink, a stream processing framework, handles real-time data processing and analysis.

The Apache Iceberg open table format enables near-real-time data ingestion and analysis.

Cloud Native

Built on Kubernetes with auto-scaling, high availability, and seamless integration with major cloud providers.

Supports external storage services such as S3 and HDFS, providing flexible and scalable persistent storage.

Multi-source Integration

Seamless integration and synchronization of structured, semi-structured, and unstructured data.

Provides CDC support for real-time data synchronization.
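To make the idea of CDC-based synchronization concrete, the following Python sketch replays a stream of change events against an in-memory replica. The event shape (`op`, `before`, `after`) and the function `apply_change` are assumptions made for this example; they are not Dataround Link's event format or API.

```python
from typing import Any, Dict

Row = Dict[str, Any]


def apply_change(target: Dict[Any, Row], event: Dict[str, Any], key: str = "id") -> None:
    """Apply one change event to an in-memory replica keyed by primary key."""
    op = event["op"]
    if op in ("insert", "update"):
        row = event["after"]
        target[row[key]] = dict(row)  # upsert, so replaying an event is harmless
    elif op == "delete":
        target.pop(event["before"][key], None)  # ignore rows that are already gone
    else:
        raise ValueError(f"unknown operation: {op!r}")


# Hypothetical change stream captured from a source table.
events = [
    {"op": "insert", "after": {"id": 1, "name": "Ada"}},
    {"op": "insert", "after": {"id": 2, "name": "Bob"}},
    {"op": "update", "before": {"id": 1, "name": "Ada"}, "after": {"id": 1, "name": "Ada L."}},
    {"op": "delete", "before": {"id": 2, "name": "Bob"}},
]

replica: Dict[Any, Row] = {}
for event in events:
    apply_change(replica, event)
```

Treating inserts and updates as upserts keeps the apply step idempotent, which matters when a connector redelivers events after a restart.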

Synchronizes data directly from the original data files, reducing serialization and deserialization overhead.

Enterprise Security

Integrates Apache Ranger for fine-grained access control, audit trails, and compliance guarantees.

Supports data masking, encryption, and policy-based access control.
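The combination of data masking and policy-based access control can be sketched in a few lines of Python. This is a conceptual illustration only: the `POLICY` table, the role names, and the functions `mask` and `apply_policy` are hypothetical, and none of this is Apache Ranger's or Dataround's API.

```python
from typing import Any, Dict, Set

# Hypothetical policy: role -> columns that role may read unmasked.
POLICY: Dict[str, Set[str]] = {
    "analyst": {"order_id", "amount"},
    "support": {"order_id", "amount", "email"},
}


def mask(value: Any) -> str:
    """Show only the last four characters; values of four or fewer are fully masked."""
    text = str(value)
    visible = text[-4:] if len(text) > 4 else ""
    return "*" * (len(text) - len(visible)) + visible


def apply_policy(row: Dict[str, Any], role: str) -> Dict[str, Any]:
    """Return a copy of row with every column the role may not read masked."""
    allowed = POLICY.get(role, set())  # unknown roles get every column masked
    return {col: (val if col in allowed else mask(val)) for col, val in row.items()}


row = {"order_id": 1001, "amount": 59.9, "email": "alice@example.com"}
analyst_view = apply_policy(row, "analyst")  # email is masked
support_view = apply_policy(row, "support")  # email is visible
```

Defaulting unknown roles to a fully masked view is a deliberate deny-by-default choice in this sketch.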