Found 5,222 repositories(showing 30)
seaweedfs
SeaweedFS is a distributed storage system for object storage (S3), file systems, and Iceberg tables, designed to handle billions of files with O(1) disk access and effortless horizontal scaling.
apache
Apache Doris is an easy-to-use, high performance and unified analytics database.
trinodb
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
StarRocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
apache
Apache Iceberg
Eventual-Inc
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
cocopon
:antarctica: Bluish color scheme for Vim and Neovim
timeplus-io
⚡ Fastest SQL ETL pipeline in a single C++ binary, built for stream processing, observability, analytics and AI/ML
Mooncake-Labs
Real-time analytics on Postgres tables
apache
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
tansu-io
Apache Kafka® compatible broker with S3, PostgreSQL, SQLite, Apache Iceberg and Delta Lake
aws-samples
Provide JSON file template that demonstrate how to create customize Well-Architected reviews using Custom lenses.
BemiHQ
Open-source Snowflake & Fivetran alternative, with Postgres compatibility.
Snowflake-Labs
pg_lake: Postgres with Iceberg and data lake access
projectnessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
datazip-inc
OLake - Fastest Databases, Kafka & S3 Replication to Apache Iceberg or Plain Parquet. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supported sources : Postgres, MongoDB, MySQL, Oracle, MSSql, DB2, Kafka, S3.
apache
Apache Iceberg
supabase
S3 compatible object storage service that stores metadata in Postgres
lakekeeper
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
apache
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
Mrkuhuo
【2026最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。
apache
PyIceberg
ClickHouse
ClickBench: a Benchmark For Analytical Databases
paradedb
DuckDB-powered data lake analytics from Postgres
Netflix
Iceberg is a table format for large, slow-moving tabular data
zsvoboda
New Generation Opensource Data Stack Demo
nimtable
The observability platform for Iceberg lakehouses.
apache
Apache Iceberg - Go
duckdb
No description available
Open Control Plane for Tables in Data Lakehouse