Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

artificial intelligence - In counterfactual regret minimization, why are additions to regret weighted by reach probability? - Computer Science Stack Exchange #967

Open
1 task
ShellLM opened this issue Jan 3, 2025 · 1 comment
Labels
Algorithms Sorting, Learning or Classifying. All algorithms go here. artificial-intelligence forum-discussion Quotes clipped from forums game-theory probability-theory

Comments

@ShellLM
Copy link
Collaborator

ShellLM commented Jan 3, 2025

In Counterfactual Regret Minimization, Why Are Additions to Regret Weighted by Reach Probability?

Context

The question explores the reasoning behind weighting additions to regret and strategy in Counterfactual Regret Minimization (CFR) algorithms.

Original Algorithm Lines

25.  rI[a] ← rI[a] + π−i.(vσI→a[a] − vσ[a])
26.  sI[a] ← sI[a] + πi.σt(I,a)

Key Variables

  • rI[a]: Accumulated regret for information set I and action a
  • sI[a]: Accumulated strategy for information set I and action a
  • πi: Probability of reaching game state for learning player
  • π−i: Probability of reaching game state for other player

Motivation

The weighting serves two primary purposes:

  1. Compute strategy with reasonable runtime
  2. Create an upper bound on overall regret that can be minimized

Theoretical Basis

The approach is grounded in the paper "Regret Minimization in Games with Incomplete Information", which proves that overall regret is bounded by cumulative counterfactual regret.

The goal is to approximate a Nash equilibrium strategy by systematically minimizing cumulative counterfactual regret across multiple iterations.

Tags

  • artificial-intelligence
  • probability-theory
  • game-theory

Suggested labels

None

@ShellLM ShellLM added Algorithms Sorting, Learning or Classifying. All algorithms go here. forum-discussion Quotes clipped from forums labels Jan 3, 2025
@ShellLM
Copy link
Collaborator Author

ShellLM commented Jan 3, 2025

Related content

#652 similarity score: 0.85
#882 similarity score: 0.85
#940 similarity score: 0.84
#810 similarity score: 0.82
#703 similarity score: 0.82
#953 similarity score: 0.82

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Algorithms Sorting, Learning or Classifying. All algorithms go here. artificial-intelligence forum-discussion Quotes clipped from forums game-theory probability-theory
Projects
None yet
Development

No branches or pull requests

2 participants