A framework for multi-agent reinforcement learning research and the implementation of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
Updated Nov 4, 2024 - Python
A deep reinforcement learning system for optimizing bridge maintenance decisions across municipal infrastructure fleets, implementing cross-subsidy budget sharing and cooperative multi-agent learning.
Adversarial Co-Evolution of RL and LLM Agents: A framework for training high-performance PPO agents against Large Language Models in Gin Rummy, utilizing curriculum learning and knowledge distillation.
Going through the Hugging Face Deep Reinforcement Learning course.
Deterministic hex-grid soccer environment with two adversarial agents. Implements Q-Learning, Minimax-Q (via LP), and Belief-Q with online belief updates; trains in SE2G/SE6G to reduce state space and evaluates behaviors in the full environment with comprehensive visualizations.
Project 3 of Udacity's Deep Reinforcement Learning Nanodegree Program
An engineering-focused multi-agent reinforcement learning system for Texas Hold'em using PettingZoo AEC and a custom PyTorch PPO self-play setup.
Hexapawn game engine: a proper 3x3 board with pawn movement. Strategic RL agents: Minimax with alpha-beta pruning (configurable depth 1-7) and Q-Learning with temporal-difference updates, experience replay for efficient learning, epsilon-greedy exploration with decay, and a multi-level decision hierarchy (immediate threats to strategic planning).
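The Q-Learning side of a project like this can be sketched in a few lines. This is a minimal illustration of a temporal-difference update with epsilon-greedy exploration, not code from the repository; all names and hyperparameters are illustrative.

```python
# Tabular Q-Learning sketch: TD update + epsilon-greedy action choice.
# Hyperparameters and names are illustrative assumptions.
import random
from collections import defaultdict

ALPHA, GAMMA = 0.1, 0.95                     # learning rate, discount factor
q_table = defaultdict(float)                 # (state, action) -> estimated value

def choose_action(state, actions, eps):
    """Epsilon-greedy: explore with probability eps, otherwise exploit."""
    if random.random() < eps:
        return random.choice(actions)
    return max(actions, key=lambda a: q_table[(state, a)])

def td_update(state, action, reward, next_state, next_actions):
    """Temporal-difference update: Q <- Q + alpha * (target - Q)."""
    best_next = max((q_table[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + GAMMA * best_next
    q_table[(state, action)] += ALPHA * (target - q_table[(state, action)])
```

Epsilon decay is typically applied per episode (e.g. `eps = max(eps_min, eps * 0.995)`), so early play explores broadly and late play exploits the learned table.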
A Crazy Arcade clone + Reinforcement Learning (DQN, PPO)
Key features: 1. Flexible game configuration: adjustable grid size (3x3 up to 10x10) and a customizable win condition (e.g., 5-in-a-row on a 7x7 board). 2. Two competing RL agents: Agent 1 (Blue X) vs. Agent 2 (Red O), each with independent Q-Learning parameters; watch them evolve different strategies over time.
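A configurable k-in-a-row win condition like the one described can be checked by scanning each cell in four directions. This is a generic sketch under an assumed 2D-list board encoding, not the repository's actual implementation.

```python
# Sketch of a configurable k-in-a-row win check on an n-by-n grid.
# The board encoding (2D list of player marks) is an assumption.
def has_k_in_a_row(board, player, k):
    """Check rows, columns, and both diagonals for k marks in a line."""
    n = len(board)
    dirs = [(0, 1), (1, 0), (1, 1), (1, -1)]     # right, down, two diagonals
    for r in range(n):
        for c in range(n):
            for dr, dc in dirs:
                cells = [(r + i * dr, c + i * dc) for i in range(k)]
                if all(0 <= x < n and 0 <= y < n and board[x][y] == player
                       for x, y in cells):
                    return True
    return False
```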
Dark Zero Point Genesis: PPO latent world models under thermodynamic scarcity. 256 agents, 128-D latent manifolds, zero supervision. Agents use PPO-clipped surrogate objectives; survival = Predictive Error Coding (PEC) x energy efficiency across a 50/15 seasonal cycle.
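The PPO-clipped surrogate objective referenced here has a standard closed form. Below is a framework-free sketch (per-sample log-probabilities and advantages as plain lists); the clip range of 0.2 is a common default, not a value from this project.

```python
# Sketch of the PPO clipped surrogate loss: the policy ratio is clipped
# to [1 - eps, 1 + eps] and the pessimistic (min) term is kept.
import math

def ppo_clip_loss(logp_new, logp_old, advantages, clip_eps=0.2):
    """Negative mean of min(ratio * A, clip(ratio) * A) over the batch."""
    terms = []
    for ln, lo, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(ln - lo)                         # pi_new / pi_old
        clipped = min(max(ratio, 1 - clip_eps), 1 + clip_eps)
        terms.append(min(ratio * adv, clipped * adv))     # pessimistic bound
    return -sum(terms) / len(terms)                       # negate for descent
```

With identical old and new policies the ratio is 1 everywhere, so the loss reduces to the negative mean advantage; large policy shifts are cut off by the clip, which is what keeps PPO updates stable.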
Research-grade Reinforcement Learning framework for single-agent and multi-agent warehouse navigation using Deep Q-Networks (DQN), PyTorch, replay buffer, target networks, logging, and full test suite. Built for PhD-level RL and autonomous systems research.
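The replay buffer at the core of a DQN setup like this is small enough to sketch in full. Capacity and field names below are illustrative assumptions, not taken from the repository.

```python
# Minimal uniform replay buffer sketch for DQN-style training:
# old transitions are evicted first, and minibatches are sampled
# uniformly to decorrelate consecutive gradient updates.
import random
from collections import deque

class ReplayBuffer:
    def __init__(self, capacity=10000):
        self.buffer = deque(maxlen=capacity)   # oldest entries drop out

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        """Uniform random minibatch of stored transitions."""
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)
```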
Multi-Equipment CBM system using QR-DQN with advanced probability distribution analysis. Coordinated maintenance decision-making for 4 industrial equipment units with realistic anomaly rates (1.9-2.2%), comprehensive risk analysis (VaR/CVaR), and 51-quantile distribution visualization.
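Given a QR-DQN quantile estimate of the return distribution, VaR and CVaR can be read directly off the quantiles. The tail-averaging scheme below is a standard convention and an assumption about this project, not its actual code.

```python
# Sketch of VaR/CVaR from a list of return quantiles (e.g. the
# 51-quantile output of QR-DQN). alpha is the tail probability.
def var_cvar(quantiles, alpha=0.05):
    """VaR: the alpha-level quantile; CVaR: mean of the tail below it."""
    q = sorted(quantiles)
    cut = max(1, int(alpha * len(q)))      # number of tail quantiles
    tail = q[:cut]
    return q[cut - 1], sum(tail) / len(tail)
```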
Classic Nim rules: three customizable piles, take any number from one pile per turn, and the last player to take loses. Q-Learning agents: two independent agents that learn optimal strategy through self-play.
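The misère-Nim rules described (take any number from one pile; whoever takes the last object loses) fit in a few functions. This is a generic sketch of the environment side, with illustrative names.

```python
# Misère Nim sketch: piles is a tuple of pile sizes.
def legal_moves(piles):
    """All (pile_index, take_count) moves from the current position."""
    return [(i, k) for i, n in enumerate(piles) for k in range(1, n + 1)]

def apply_move(piles, move):
    """Remove `take_count` objects from one pile."""
    i, k = move
    nxt = list(piles)
    nxt[i] -= k
    return tuple(nxt)

def loser_moved_last(piles):
    """Misère rule: when all piles are empty, the player who just moved loses."""
    return all(n == 0 for n in piles)
```

Two tabular Q-learners can then alternate moves on this environment, each receiving +1/-1 terminal rewards with opposite signs, which is the usual self-play setup for Nim.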
A specialized Reinforcement Learning (RL) project focused on multi-task mastery across 10 distinct gaming environments. General-Gamer-AI-Lite implements a lightweight multi-task agent designed to learn shared representations and transfer knowledge between varied game mechanics, from classic arcade challenges to strategic grid worlds.
Coordinated multi-agent systems that learn to solve complex collaborative and competitive tasks.
Multi-Equipment CBM (Condition-Based Maintenance) optimization using Deep Q-Learning with cost leveling and scenario comparison. Advanced RL system with QR-DQN, N-step learning, and parallel environments for HVAC equipment predictive maintenance.
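The N-step learning mentioned here replaces the one-step TD target with a discounted sum of the next n rewards plus a bootstrapped tail value. A minimal sketch of that return computation (names are illustrative):

```python
# N-step return sketch: G = r_0 + g*r_1 + ... + g^(n-1)*r_{n-1} + g^n * V(s_n).
def n_step_return(rewards, gamma, bootstrap=0.0):
    """Fold rewards back-to-front, discounting at each step."""
    g = bootstrap
    for r in reversed(rewards):
        g = r + gamma * g
    return g
```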
A comprehensive Qwirkle RL agent application using Monte Carlo Tree Search (MCTS) combined with Q-Learning, approaches well suited to tile-placement games with high branching factors.
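At the heart of MCTS is the selection rule that trades off exploitation and exploration at each tree node. A sketch of the standard UCB1/UCT score (the constant `c` is a tunable assumption, not a value from this project):

```python
# UCB1 selection score used in MCTS: average value plus an
# exploration bonus that shrinks as a child is visited more.
import math

def uct_score(child_value, child_visits, parent_visits, c=1.4):
    """Unvisited children get priority; otherwise value/visits + bonus."""
    if child_visits == 0:
        return float("inf")
    exploit = child_value / child_visits
    explore = c * math.sqrt(math.log(parent_visits) / child_visits)
    return exploit + explore
```

Combining this with Q-Learning typically means seeding leaf evaluations (or priors) from the learned Q-values instead of random rollouts, which helps in high-branching games like Qwirkle.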
Pure RL agents: Q-Learning agents that learn through self-play, playing against each other to improve without human supervision. Symmetry optimization: logic that treats a board mirrored left-to-right as the same position, roughly halving the learning time.
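The left-right symmetry trick amounts to mapping every board to a canonical key so that a position and its mirror share one Q-table entry. A minimal sketch, assuming a flat row-major board encoding (not the repository's actual code):

```python
# Canonicalize a board under left-right mirror symmetry so that
# mirrored positions index the same Q-table entry.
def mirror(board, width=3):
    """Reverse each row of a flat row-major board."""
    rows = [board[i:i + width] for i in range(0, len(board), width)]
    return tuple(cell for row in rows for cell in reversed(row))

def canonical(board, width=3):
    """Canonical key: the lexicographic min of the board and its mirror."""
    b = tuple(board)
    return min(b, mirror(b, width))
```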