Texas A&M and Simon Fraser Universities Open-Source RL Toolkit for Card Games

A team of researchers from Texas A&M University and Canada’s Simon Fraser University have open-sourced a toolkit called “RLCard” for applying RL research to card games.

by Synced

2019-11-12

In July the poker-playing bot Pluribus beat top professionals in a six-player no-limit Texas Hold’Em poker game. Pluribus taught itself from scratch using a form of reinforcement learning (RL) to become the first AI program to defeat elite humans in a poker game with more than two players.

Compared to perfect information games such as Chess or Go, poker presents a number of unique challenges with its concealed cards, bluffing and other human strategies. Now a team of researchers from Texas A&M University and Canada’s Simon Fraser University have open-sourced a toolkit called “RLCard” for applying RL research to card games.

While RL has already produced a number of breakthroughs in goal-oriented tasks and has high potential, it is not without drawbacks. Instability in multi-agent applications, for example, has slowed RL development in domains with numerous agents, large state and action spaces, and sparse rewards. Multi-player card games, which share these characteristics, are therefore emerging as a good test environment for improving RL.

The RLCard toolkit supports card game environments including Blackjack, Leduc Hold’em, Dou Dizhu, Mahjong and UNO, with the aim of bridging reinforcement learning and imperfect information games. Because not every RL researcher has a game-theory background, the team designed the interfaces to be easy to use and the environments to be configurable. Researchers can adjust factors such as state representation, action abstraction, reward design, and even the game rules.

The research team evaluated RLCard by running state-of-the-art RL algorithms in its environments and by measuring the computational resources required to generate game data. They measured performance using the winning rate of the RL agents against random agents and in self-play tournaments. The team applied Deep Q-Network (DQN), Neural Fictitious Self-Play (NFSP), and Counterfactual Regret Minimization (CFR) algorithms to the environments and saw similar results against random agents. Although NFSP was stronger than DQN on most environments, both were highly unstable in larger games such as UNO, Mahjong and Dou Dizhu.
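To make the "winning rate against random agents" metric concrete, here is a minimal Python sketch. It does not use RLCard or any of the algorithms named above: the one-round card game, the `threshold_agent` (a hypothetical stand-in for a trained agent), and every function name are assumptions made purely for illustration.

```python
import random

# Toy 52-card deck: ranks 1-13, four copies each (suits are ignored).
RANKS = list(range(1, 14)) * 4


def random_agent(card, rng):
    """Baseline: keep or redraw with equal probability."""
    return rng.choice(["keep", "redraw"])


def threshold_agent(card, rng):
    """Hypothetical stand-in for a trained agent: redraw low cards."""
    return "keep" if card >= 8 else "redraw"


def play_game(agent_a, agent_b, rng):
    """Play one round; return 1 if A wins, -1 if B wins, 0 for a tie."""
    deck = RANKS[:]
    rng.shuffle(deck)
    final = []
    for agent in (agent_a, agent_b):
        card = deck.pop()  # private card, hidden from the opponent
        if agent(card, rng) == "redraw":
            card = deck.pop()  # swap once for a fresh card
        final.append(card)
    return (final[0] > final[1]) - (final[0] < final[1])


def win_rate(agent_a, agent_b, num_games=10000, seed=0):
    """Fraction of games won by agent_a (ties count as non-wins)."""
    rng = random.Random(seed)
    wins = sum(play_game(agent_a, agent_b, rng) == 1 for _ in range(num_games))
    return wins / num_games


if __name__ == "__main__":
    print(f"threshold vs random: {win_rate(threshold_agent, random_agent):.3f}")
    print(f"random vs random:    {win_rate(random_agent, random_agent):.3f}")
```

Comparing an agent's rate with the random-versus-random baseline shows how much of the result comes from the policy rather than from luck; a self-play tournament would apply the same loop to every pairing of agents.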

While RLCard is specifically designed to support RL in card games, other RL toolkits are available, such as OpenAI’s Gym and SC2LE (StarCraft II Learning Environment), introduced by DeepMind and Blizzard.

The first author of the research paper is Daochen Zha, a graduate research assistant at Texas A&M University. Zha told Synced he hopes the toolkit can stimulate research that helps improve RL performance not only in card games but also across other domains with multiple agents, large state and action spaces, and sparse rewards.

The paper RLCard: A Toolkit for Reinforcement Learning in Card Games is on arXiv. The open-source toolkit is available on GitHub.

Journalist: Fangyu Cai | Editor: Michael Sarazen

19 comments on “Texas A&M and Simon Fraser Universities Open-Source RL Toolkit for Card Games”

Mariya

2019-11-14

Thank you for the post.

djamila_st

2019-11-20

This is very valuable information

djamila_st

2020-01-13

Thanks a lot for the nice article.

ben azzi

2020-01-19

Thanks for giving me strong reason to continue my blog commenting strategy

soundos

2020-02-02

Thank you for publishing this.

soundos

2020-02-02

great article

soundos

2020-02-02

Thanks for sharing.

zizi

2020-02-22

It’s worth reading your article.

soundos

2020-02-23

It’s good, thanks for sharing.

ben azzi

2020-03-01

Really informative post. I have read some other posts on the site and found them very helpful, like this one.

zoli zoli

2020-03-07

Thank you.

djamila_st

2020-03-27

Thanks. This post is really amazing

Baker ST

2020-04-22

I was reading this and I really found what I was looking for. Your article is really informative, and I’ll be grateful if you keep writing in the future.

amel sm

2020-05-10

Thanks for sharing such an informative blog. It really contains great stuff.

zoulikha

2020-06-20

Thank you so much for writing this. I’ve never had such an eloquent description for what I do. Much to think about

ziane

2020-06-20

Thank you so much for writing this. I’ve never had such an eloquent description for what I do. Much to think about
