Search Results

Found 14 repositories(showing 14)

-RL-Execution-Optimizer

SamSon1402

❤️45

No description available

HTML

Updated 3 weeks ago

RL_MIG_scheduler

artecs-group

❤️35

Tools for training, analysis and execution of an optimized task scheduling RL agent on GPUs with Multi-Instance GPU (MIG).

MIT

Python

Updated 5 months ago

multi-instance-gpureinforcement-learningscheduling

Market-Microstructure-Optimized-Trade-Execution-using-RL

grantbelford

❤️35

Based on GEN/ETH market microstructure LOB data, trades, objectives & constraints – recommend suitable algos which maximize pnl/minimize inventory losses within a fixed time horizon. The Reinforcement Learning (RL) method is chosen to help achieve optimized trade execution relative to given constraints.

Updated 1 year ago

Stockfish-DQN-Agent

arashsajjadi

🧡60

♟️ Optimized Chess RL Trainer using DQN vs Stockfish. Built with PyTorch and python-chess, it learns using per-move rewards from Stockfish evaluations. Implements Prioritized Experience Replay (PER) and parallel CPU/GPU execution for faster training. The agent dynamically adjusts Stockfish skill level based on performance.

MIT

Python

Updated 3 weeks ago

artificial-intelligencechess-enginedeep-learning+7

Multiclass_Classification_HarmfulBrainActivity

aryanmaingi

❤️35

ResNet50-based deep learning model for multiclass classification of harmful brain activity using raw EEG (Parquet, 200 Hz) and regional spectrogram power (LL, RL, LP, RP). Trained with Stratified Group K-Fold for patient-wise generalization. Uses zero-imputation for stable tensor input. Optimized for Kaggle execution without changing preprocessing.

Jupyter Notebook

Updated 3 months ago

SQL-Query-Execution-RL-Optimizer

KS-KARTHIK-05

❤️40

No description available

MIT

Jupyter Notebook

Updated 1 month ago

mcp-agent-optimizer

shawnli

❤️35

Optimization framework for large-scale MCP service integration with hierarchical routing, RL-based tool selection, and parallel execution

Python

Updated 5 months ago

RL-Enhanced-Rule-Based-Trading-System

YM1587

❤️45

A human-defined rule-based trading strategy (directional anchor) , reinforcement learning (RL) agent that optimizes execution decisions with strong risk management, monitoring, and validation layers

Python

Updated 1 month ago

Predicto-AI

Shafiyullah

❤️40

A full-stack Auto-ML platform integrating XGBoost, Prophet, and Deep RL. Features a professional Streamlit dashboard, CLI, and optimized execution pipelines for tabular data analysis

MIT

Python

Updated 3 months ago

automldata-sciencemachine-learning+5

dba-agent-openenv

arsalannxs

🧡55

A real-world OpenEnv RL environment where an AI agent acts as a Database Administrator (DBA) to optimize slow SQL queries by managing indexes and reducing execution cost.

Python

Updated 1 week ago

micro-coder-rl-sandbox

jeffjaehoyang

🧡65

An experimental, small-scale RL framework designed to profile and optimize the execution latency of verifiable reward functions. This project implements a RL training loop for a 1.5B parameter coding model, specifically targeting the "Evaluation Bottleneck" where standard containerization (Docker) throttling limits GPU utilization.

Updated 3 days ago

RCRacer

daleyadrichem

🧡50

**RCRacer** is a modular, deterministic racing simulation framework for controller development, RL training, and evolutionary optimization. It features a strict layered architecture separating simulation, agents, execution, and GUI—ensuring reproducibility, scalability, and clean experimentation.

MIT

Python

Updated 1 month ago

openenv

720822103143-debug

💛70

Built a real-world task scheduling environment using OpenEnv standards. Implemented a Q-learning agent to optimize CPU job execution based on priority and duration. Deployed an interactive RL dashboard using Gradio on Hugging Face Spaces for live simulation and evaluation.

MIT

Python

Updated 1 day ago

SmartFlow

AdityaSreevatsaK

❤️35

SmartFlow enhances bike-sharing efficiency by combining deep reinforcement learning with agentic AI. The RL model optimizes bike distribution, while agentic AI coordinates real-time actions, like alerting truck drivers. This scalable approach ensures smart decisions and timely execution for urban transport.

MIT

HTML

Updated 7 months ago

agentic-aijupyter-notebookkeras+9

All 14 repositories loaded

GitHub Explorer

Search Results

-RL-Execution-Optimizer

RL_MIG_scheduler

Market-Microstructure-Optimized-Trade-Execution-using-RL

Stockfish-DQN-Agent

Multiclass_Classification_HarmfulBrainActivity

SQL-Query-Execution-RL-Optimizer

mcp-agent-optimizer

RL-Enhanced-Rule-Based-Trading-System

Predicto-AI

dba-agent-openenv

micro-coder-rl-sandbox

RCRacer

openenv

SmartFlow

-RL-Execution-Optimizer

RL_MIG_scheduler

Market-Microstructure-Optimized-Trade-Execution-using-RL

Stockfish-DQN-Agent

Multiclass_Classification_HarmfulBrainActivity

SQL-Query-Execution-RL-Optimizer

mcp-agent-optimizer

RL-Enhanced-Rule-Based-Trading-System

Predicto-AI

dba-agent-openenv

micro-coder-rl-sandbox

RCRacer

openenv

SmartFlow