Found 2,821 repositories(showing 30)
jamesdensmore
No description available
datopian
Data Pipes for CSV
Smars-Bin-Hu
A cloud-native data pipeline and visualization project analyzing Formula 1 racing data using Azure, Databricks, Delta Lake, Tableau, and Python for insightful EDA and interactive dashboards.
Stability-AI
Iterable datapipelines for pytorch training.
weiji14
The 🌏 data science library you've been waiting for~
LuQQiu
Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
bovine
Network TCP port forwarding
jspsych
Send data from your behavioral experiments to the OSF. Born-open data as a service.
guimou
Various demos of data pipelines
epoch8
No description available
shazam
Domain-specific language to help build and maintain AWS Data Pipelines
kromozome2003
Building Json data pipeline within Snowflake using Streams and Tasks
josephmachado
Example repo to create end to end tests for data pipeline.
FalconSoft
dataPipe is a data processing and data analytics library for JavaScript. Inspired by LINQ (C#) and Pandas (Python)
hieuimba
Spark-based pipeline to extract and parse monthly games from the Lichess database.
JuliaAPlavin
The most convenient piping syntax for generic data manipulation in Julia.
NorthConcepts
DataPipeline Examples
rasbt
Code for the DataPipes article
openclimatefix
OCF's DataPipe based dataloader for training and inference
meraki-analytics
No description available
swordstick
Tool for sync data from Mysql to Mysql with Rename table and filter columns
weiji14
The ecosystem of geospatial machine learning tools in the Pangeo world.
rebremer
Data pipeline project using Data Factory, Databricks and Cosmosdb Graph, deployed using Azure DevOps, secured using firewalls and Azure AD
yjmade
A data processing framework
saadhaxxan
This project consist of Datapipeline that collects data from Google Admob, Facebook Ads, Google Analytics and Google Ads into a single csv file.
beyondhj
DataPipeline 是一款批流一体数据融合平台。
litwellchi
data pipeline code of large video generation model
patrickfleith
Simple guides to create your dream LLM dataset
No description available