Found 51,607 repositories(showing 30)
CodePhiliaX
AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
Tencent
🏆 Real-Time no-code, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and Frontend(Client) can customize response JSONs 🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构
heibaiying
大数据入门指南 :star:
prestodb
The official home of the Presto distributed SQL query engine for big data
trinodb
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
wangzhiwubigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
aden-hive
Outcome driven agent development framework and runtime harness
tobymao
Python SQL Parser and Transpiler
delta-io
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
HariSekhon
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
apache
Apache Hive
WeiYe-Jing
DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。
isar
Lightweight and blazing fast key-value database written in pure Dart.
alibaba
:honeybee: BeeHive is a solution for iOS Application module programs, it absorbed the Spring Framework API service concept to avoid coupling between modules.
TheHive-Project
TheHive is a Collaborative Case Management Platform, now distributed as a commercial version
liyupi
🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~
apache
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
WeBankFinTech
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
MoRan1607
大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
CodeRayZhang
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
geekyouth
深圳地铁大数据客流分析系统🚇🚄🌟
learning-at-home
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
apache
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
cazala
CoinHive cryptocurrency miner for node.js
Qihoo360
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
apache
Apache Drill is a distributed MPP query layer for self describing data
collabH
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
dropbox
Python interface to Hive and Presto. 🐝
water8394
:dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结