Search Results

Found 79 repositories(showing 30)

Logic-RL

Unakar

💛74

Reproduce R1 Zero on Logic Puzzle

2.4k

164

Apache-2.0

Python

Updated 1 day ago

Logic-RL-Lite

DolbyUUU

❤️45

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

Python

Updated 2 months ago

deepseekdeepseek-r1fine-tuning+6

COLA

jiangnan7

❤️40

MLIR-based HLS and RL-driven logic synthesis co-optimization.

BSD-3-Clause

Updated 4 months ago

reinforcement_learning_in_logic_synthesis

lkuresevic

❤️35

(WIP) Training an RL model to produce synthesis recipes for logic optimization.

Python

Updated 1 year ago

graph-neural-networkslogic-synthesisreinforcement-learning

This repository contains Dongming Shen's code and documentation for the research projects conducted at the AIDyS Lab, USC. The project focuses on integrating Reinforcement Learning (RL) to solve partially observable Markov decision processes (POMDP) under finite linear temporal logic (LTL) constraints.

C++

Updated 4 months ago

optimal-controlpartially-observable-markov-decision-processpythonpackage+2

tinier-ppo-tutorial

anshulsawant

❤️45

This project provides a hands-on tutorial for understanding and implementing the Proximal Policy Optimization (PPO) algorithm to fine-tune Large Language Models (LLMs) using Reinforcement Learning (RL). It is inspired by the logic found in the TinyZero repository but significantly simplified for pedagogical purposes.

Python

Updated 2 months ago

waifu-llm-vrm

waifuai

🧡50

Integrates LLM conversational AI characters into Godot game engine projects. ✨ Manages character personality, state, and interaction logic using a generative LLM. 🧠 Connects to a running Godot instance for seamless communication via sockets or godot-rl. 🎮 Includes an optional class for controlling VRM model animations and expressions directly.

MIT-0

Python

Updated 1 month ago

rl-logic

dotty-cps-async

❤️45

monadic infrastructure for reinforcement learning.

Apache-2.0

Scala

Updated 1 month ago

Chinese-Logic-RL

Trae1ounG

❤️35

Exploring R1 on Logic Puzzle in Chinese

Python

Updated 11 months ago

chinesellmllm-reasoning+1

NavBot-X-Isaac

saibot007

🧡65

NavBot-X Isaac Lab is a premium robotics simulation project focused on building an autonomous inspection rover in Isaac Sim / Isaac Lab. It brings together cinematic environment design, checkpoint-based task logic, omnidirectional rover control concepts, and gesture-control integration, with future expansion toward RL and Physical AI

Python

Updated 16 hours ago

gesture-recognitionisaac-labisaac-sim+5

aea_spot_micro

andretosi

🧡50

This project is an open-source version of a robot dog, complete with an advanced training environment and custom controller logic. The project aims to train the robot using RL algorithms and plans to support multiple backends for simulation, from PyBullet and MuJoCo to Gazebo, Unity, or Unreal Engine using ROS2 for communication.

MIT

Jupyter Notebook

Updated 1 month ago

LogicRL

Nyrus-Y

❤️30

No description available

MIT

Python

Updated 3 years ago

rl_logic_synthesis

phyzhenli

❤️35

No description available

Python

Updated 1 month ago

Slime-RLVE

jbarnes850

🧡50

Verifiable math/logic environments for slime RL training

Apache-2.0

Python

Updated 1 month ago

floating-wind-turbine-implementation-with-new-IDHP-PI-blade-pitch-controller

mohsenab999

❤️45

Ongoing OPEN FAST project: Implementing an innovative Reinforcement Learning (RL) controller (IDHP-IPC) for load mitigation on an ITI Barge floating wind turbine. External logic is scripted in Python and linked via a C bridge to the DISCON library.

Fortran

Updated 1 month ago

rlsquare_logic

jonberliner

❤️20

rlsquare algorithm!

Python

Updated 11 years ago

Logic-RL-trl

Dutch-voyage

❤️25

No description available

Python

Updated 1 year ago

ModalLogicRegularisedRL

k191105

❤️45

No description available

Python

Updated 3 weeks ago

Gym_RL

sy-shi

❤️35

A document to describe the logic of RL with Gym.

Updated 1 year ago

tensor-logic-guide

jacobarrio

❤️45

Interactive guide to understanding Tensor Logic - designed for RL engineers

HTML

Updated 1 month ago

mario-kart

umd-xlab

❤️35

Robust safety verification of RL agents using Signal Temporal Logic (STL)

Python

Updated 2 weeks ago

Tautology_Logic

Nozidoali

❤️35

Pre-trained RL agent that synthesize tautology or near tautology logic

Python

Updated 2 years ago

PPO-LTL

richardzhangatuoe

🧡50

Safe RL framework integrating Linear Temporal Logic (LTL) constraints into PPO via a logic-to-cost mechanism.

MIT

HTML

Updated 1 month ago

Math-RL

AHartNtkn

❤️40

An attempt to create an RL environment so that RL algorithms can do math an logic, treated as a sort of game.

MIT

Jupyter Notebook

Updated 6 years ago

Artificial-Intelligence

rupeshsjce

❤️35

Python

Updated 4 years ago

fast-rl-grpo

sagar0x0

❤️45

Optimizing RL training with GRPO. This repo implements the RL pipeline logic and optimizing it with different technique including custom kernels, reducing overhead mainly in grpo step. Features detailed Nsight profiling and benchmarking

Python

Updated 2 months ago

LineFollow-Robocon-IsaacSim

rahulpanchall7

❤️40

Virtual robotics racing simulation using Isaac Sim and ROS2. Two Robotnik Summit robots race using PID and RL-based control. Customize logic and experiment with robot behavior!

MIT

Python

Updated 4 months ago

toy_rl_reasoning_post_training

Johnny95420

❤️35

This project uses Reinforcement Learning (RL) and LoRA to boost Qwen2-0.5B's reasoning, based on ProRL. The model surpasses instruction-tuning on math benchmarks and learns general problem-solving logic.

Python

Updated 8 months ago

Reinforcement-Learning-Driven-Autonomous-Driving

thundivalappil

❤️45

Autonomous driving is a sequential decision-making problem under uncertainty. Instead of relying solely on rule-based logic, Reinforcement Learning (RL) learns driving behavior through interaction: the agent observes the environment state, takes actions, and improves a policy by maximizing long-term reward.

Python

Updated 2 months ago

Snack-RL

Horese07

❤️35

A small reinforcement learning project that trains an agent to play Snake. The repository contains game logic, an RL agent and model code, a replay buffer implementation, utilities, and a script to play the game as a human. A training progress image is included to show example training results.

Python

Updated 5 months ago

GitHub Explorer

Search Results

Logic-RL

Logic-RL-Lite

COLA

reinforcement_learning_in_logic_synthesis

POMDP-RL

tinier-ppo-tutorial

waifu-llm-vrm

rl-logic

Chinese-Logic-RL

NavBot-X-Isaac

aea_spot_micro

LogicRL

rl_logic_synthesis

Slime-RLVE

floating-wind-turbine-implementation-with-new-IDHP-PI-blade-pitch-controller

rlsquare_logic

Logic-RL-trl

ModalLogicRegularisedRL

Gym_RL

tensor-logic-guide

mario-kart

Tautology_Logic

PPO-LTL

Math-RL

Artificial-Intelligence

fast-rl-grpo

LineFollow-Robocon-IsaacSim

toy_rl_reasoning_post_training

Reinforcement-Learning-Driven-Autonomous-Driving

Snack-RL

Logic-RL

Logic-RL-Lite

COLA

reinforcement_learning_in_logic_synthesis

POMDP-RL

tinier-ppo-tutorial

waifu-llm-vrm

rl-logic

Chinese-Logic-RL

NavBot-X-Isaac

aea_spot_micro

LogicRL

rl_logic_synthesis

Slime-RLVE

floating-wind-turbine-implementation-with-new-IDHP-PI-blade-pitch-controller

rlsquare_logic

Logic-RL-trl

ModalLogicRegularisedRL

Gym_RL

tensor-logic-guide

mario-kart

Tautology_Logic

PPO-LTL

Math-RL

Artificial-Intelligence

fast-rl-grpo

LineFollow-Robocon-IsaacSim

toy_rl_reasoning_post_training

Reinforcement-Learning-Driven-Autonomous-Driving

Snack-RL