Found 173,617 repositories(showing 30)
pathwaycom
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
pathwaycom
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
DataTalksClub
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
apache
Apache Kafka - A distributed event streaming platform
yudaocode
一个涵盖六个专栏:Spring Boot 2.X、Spring Cloud、Spring Cloud Alibaba、Dubbo、分布式消息队列、分布式事务的仓库。希望胖友小手一抖,右上角来个 Star,感恩 1024
heibaiying
大数据入门指南 :star:
influxdata
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
zhisheng17
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
aalansehaiyang
【大厂面试专栏】一份Java程序员需要的技术指南,这里有面试题、系统架构、职场锦囊、主流中间件等,让你成为更牛的自己!
sogou
C++ Parallel Computing and Asynchronous Networking Framework
debezium
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
IBM
Sarama is a Go library for Apache Kafka.
provectus
Open-Source Web UI for Apache Kafka Management
redpanda-data
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
yahoo
CMAK is a tool for managing Apache Kafka clusters
wangzhiwubigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
AutoMQ
AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.
ThreeDotsLabs
Building event-driven applications the easy way in Go.
risingwavelabs
Event streaming platform for agents, apps, and analytics. Continuously ingest, transform, and serve event data in real time, at scale.
redpanda-data
Fancy stream processing made operationally mundane
segmentio
Kafka library in Go
HariSekhon
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
Graylog2
Free and open log management
didi
一站式云原生实时流数据平台,通过0侵入、插件化构建企业级Kafka服务,极大降低操作、存储和管理实时流数据门槛
CoderLeixiaoshuai
『Java八股文』Java面试套路,Java进阶学习,打破内卷拿大厂Offer,升职加薪!
dotnetcore
Distributed transaction solution in micro-service base on eventually consistency, also an eventbus with Outbox pattern
wurstmeister
Dockerfile for Apache Kafka
robinhood
Python Stream Processing
apache
Flink CDC is a streaming data integration tool
MaterializeInc
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL