Skip to content

Game Theory

Bayesian game

Bayesian game

Coalition game

transferable utility

  • utility can be losslessly transfered between players
    • have a common currency with absolutely equal value
      • utility of the currency independent to anything else

hedonic game

  • non-transferable utility game
  • players don't care ahout the rest of the partition
  • coalitions don't allocate profit for players
  • e.g. stable marriage

The top coalition property of Banerjee et al. (2001) and the common ranking property of Farrell and Scotchmer (1988) are sufficient conditions for core stability in hedonic games.

top coalition

gt-88.png

defined in Core in a simple coalition formation game

shapley value

  • each player's marginal contribution
  • rules
  • dummy
  • additive
    • union of any 2 set >= sum
  • game-theory-1.jpg
  • 算各個成員的貢獻
  • e.g.
  • \(v(\{1\})=1\), \(v(\{2\})=2\), \(v(\{1,2\})=4\) if {1} → {1,2}, \(\phi_1=1\), \(\phi_2=3\) if {2} → {1,2}, \(\phi_1=2\), \(\phi_2=2\)\(\phi_1=1.5\) \(\phi_2=2.5\)
  • game-theory-2.jpg
    • game-theory-3.png
  • 3 player in majority vote, 1 can veto
    • core
    • \(x_1+x_2\geq1\), \(x_1+x_3\geq1\), \(x_1+x_2+x_3=1\), \(x_i\geq0\)
    • \(x_1=1\),\(x_2=x_3=0\)
    • UN 五理事國瓜分
    • shapley
    • \(x_1=\frac{2}{3}\), \(x_2=x_3=\frac{1}{6}\)

core

  • stable 的 coalition (s.t. none has the incentive to deviate and form a new coalition)
  • each coalition get at least what they can generate alone

convex

  • \(v(S\cup T)\geq v(S)+v(T)-v(S\cap T)\)
  • nonempty core
  • the shapley value is the core
  • → fair & stable

stability

  • Nash stable
    • with all players being selfish, none will move
  • Individually stable
    • with all players respecting pareto criterion, none will move
    • Nash stable \(\in\) Individually stable

Since agents are assumed to be rational, agents only engage in a unilateral deviation if it makes them better off. Any partition in which no such deviation is possible is called Nash stable (NS).

This type of deviation can be refined by additionally requiring that no agent in the welcoming coalition is worseoff when agent i joins. A partition in which no such deviation is possible is called individually stable (IS).

ref: Reaching Individually Stable Coalition Structures in Hedonic Games

rules

  • coalition unanimity
    • all members need to agree
  • equilibrium binding agreements
    • coalitions can only break into smaller coalitions
  • open membership
    • no rules

externality

an equilibrium coalition structure has positive externalities if forming a coalition or merging into a big coalition will make other coalitions worse off

social choice

voting scheme

  • plurality
  • most-preferred by most people
  • 最普通的那種
  • cumulative voting
  • multiple votes
  • approval voting
  • unlimited votes (?)
  • plurality with elimination
  • iteratively eliminate one with fewest votes and revotes until a majority winner
  • while no majority winner:
      eliminate one with fewest votes
      revote
    
  • France
  • Borda Rule, Borda Count
  • preference ranking
  • sum score, highest one wins
  • successive elimination
  • a,b,c,d,e → (((a vs. b) vs. c) vs. d) vs. e
  • many legislative process

Condorcet consistency

  • an outcome that wins every pairwise comparisons
  • Condorcet cycle
  • A>B, B>C, C>A

social welfare function

  • decide the entire preference ordering
  • pareto efficient
  • whenever everyone agrees on one ordering, the social welfare function select that ordering
  • independent of irrelevant alternatives, IIA
  • 我對 a&b 的排序只 depend on 其他人對 a&b 的排序,跟其他人對 a&c c&d 的排序都無關
  • dictatorship
  • return 一個事先決定的人的排序
  • satisfy pareto & IIA
    • everyone prefer a over b → dictator obviously prefer a over b too → social welfare function i.e. dictator itself select a over b - pareto efficient
    • only care about el dictator, so IIA
  • Arrow's Theorem
  • any social welfare function W with over 3 or more outcomes that is pareto efficient & IIA, is dictorial
  • 證明沒看

social choice function

  • decide only the winner
  • weak pareto efficiency
  • b 永遠不被選擇 if a > b for everyone
  • monotonicity
  • if A's already winning, A will remain the winner when increase the support for A
  • dictatorship
  • exist some agent j such that social choice function C always selects the top choice of j
  • Muller-Satterthwaite Theorem
  • any social choice function that is weakly pareto efficient & monotonic, is dictorial
  • plurality satisfies weak PE & non-dictorial, so it's not monotonic
    • game-theory-4.jpg

single-peaked preference

  • median voting
  • Condorcet winner exist whenever odd voters
    • game-theory-5.jpg
  • no incentive to lie

transitivity

x > y, y > z → x > z

mechanism design

  • inverse game theory
  • design a game
  • bayes-nash implementation
  • problems
    • could be more than one equilibrium
    • agents may mis-cooperate and plan none of the equilibria
    • asymmetric equilibria (?)
  • refinement
    • symmetric bayes-nash implementation
    • ex-post implementation
  • direct implementation
  • indirect implementation
  • Duverger's Law
  • 既存兩大派,你喜歡第三者你還是不會投
  • direct relevation mechanism
  • each agent is asked to report his individual preferences
  • simultaneous move Bayesian game
    • Bayesian version of normal form game
  • truthful
  • tell everything about your true preferences
  • indirect relevation mechanism
  • agents are asked to send messages other than preferences
  • imperfect information Bayesian extensive form game
  • relevation principle ^ad0bd2
  • if a social choice function can be implemented by any mechanism, then it can be also implemented by a truth-telling direct revelation mechanism
    • 在 original (complicated) mechanism 外面 fascade 一層,把 truth map 成各成員想要 play 的 strategy → 各成員會想要 play truthfully (對於 original mechanism 來說各成員 strategy 不變)
    • 那層 fascade 等同於 agent 的 mentira function
  • significance: 可以用 direct & truthful 的方式分析 without the loss of generality
  • Gibbard-Satterthwaite Theorem
  • 已知 at lease 3 outcomes & social choice function C is onto (每個 outcome O 都有被 map 到), truthful reporting is a dominant strategy iff C is dictorial
  • only consider dominant strategy implementation
  • 有 dominant strategy 者
  • single-peaked domains
    • median voting
    • max/min of peaks
  • trade
    • has private value for selling/buying goods
    • price fixed in advance
    • decide to buy/sell at that price

transferable utility mechanism

  • strictly pareto efficient
  • game-theory-6.png
  • mechanism itself is also an agent
  • budget balance
  • game-theory-7.png
  • 拿的錢跟給的錢一樣多
  • weak budge balance
    • >=
    • 不虧就好
  • individual rationality
  • ex interim individual rationality
    • game-theory-8.png
    • 參加這個 mechanism,average over the possible valuations for other agents 時,淨效益 >= 0
  • ex post individual rationality
    • game-theory-9.png
    • equilibrium 時淨效益 >= 0
    • stronger than ex interim
  • tractability
  • choice function & payment function \(\in O(n^k)\)
  • revenue maximization/minimization
  • maximum fairness
  • maximize the happiniess of the least happy person i.e. maximize the min utility
  • price-of-anarchy minimization
  • game-theory-10.jpg
  • minimize {best social welfare} \(\div\) {welfare of the worst equilibrium}
    • iterate through all equilibrium, select the max ratio of best social welfare & real equilibrium, and minimize it

VCG mechanism

  • has truth as a dominant strategy
  • make efficient choices
  • Groves mechanisms
  • game-theory-11.png
  • payment rule
    • depend on others
  • Vickrey-Clarke-Groves, VCG mechanism
  • VCG
  • pivotal mechanism
  • game-theory-12.jpg
  • payment function = max of everyone else's utility when excluding you - when including you (\(i\))
    • so if you're not pivotal, you don't pay anything (as you don't change anything)
    • can be interpreted as, you get paid the sum of others' utility when you exist, and you pay the sum of others' utility when you don't
    • generally >= 0
    • social cost of the individual \(i\)
    • > 0 → you make things worse by existing
  • significance
    • internalize the externality of your choice
    • how your choice affect others will impact your payment
  • Green-Laffont Theorem
    • game-theory-13.jpg
  • truthful reporting is a dominant strategy only if it's Groves mechanism
  • e.g.
  • selfish routing game-theory-14.jpg
    • 要走 A → F
    • without AB, min sum of others' cost = 6 (max utility = -6)
    • with AB, min sum of others' cost = 2 (max utility = -2)
    • so \(p_{AB} = -6-(-2) = -4\), AB get 4

limitations of VCG

  • require full disclosure
  • but in repeated games, agents may want to disclose their information, and VCG isn't a good mechanism in this case
  • susceptible to collusion game-theory-15.jpg
  • if 1&2 collude game-theory-16.jpg
    • choice is unchanged, but they're better off
    • so VCG is susceptible to collusion
  • not frugal
  • payment is unbounded, might be more than what an agent's willing to accept
  • game-theory-17.jpg
  • revenue monotonicity violated
  • game-theory-18.png game-theory-19.png
  • agent 2 can eliminate it's payment by submitting another vote
  • couldn't return all revenue to agents
  • if VCG return the money collected from agents, it might change agents' incentives
    • even if you do something else with the money, if some agents can somehow benefits from it, incentives are changed
  • Mayernon-Satterthwaite Theorem
  • game-theory-20.png
  • exist distributions s.t. no Bayesian incentive-compatibile mechanism is simultaneously efficient, weakly budget balanced & interim individual rational
  • game-theory-21.jpg
    • p(1, 0) 的情況,若要 sller & buyer 都不說謊,則結果矛盾
  • significance: there exist inefficient trades

breaking limitations

  • to satisfy individual rationality
  • choice-set monotonicity
    • the set of choices with one agent removed is a subset of when the agent is involved
    • game-theory-22.png
  • no negative externalities
    • when an agents is excluded, the mechanism won't do negative utitlity on the agent
    • game-theory-23.png
  • scenarios that satisfy both
    • road building
    • game-theory-24.jpg
    • the choices is independent of the numbers of agent → satisfy choice-set monotonicity
    • none would have negative utility under any choice → satisfy no negative externalities
    • trading
    • game-theory-25.jpg
  • VCG is ex-post individual rational when the choice-set monotonicity & no negative externalities are both satisfied
    • game-theory-26.png game-theory-27.png
    • \(x(v)\) = choice of the mechanism
    • \(u_i\) = utility under VCG's choice - payment
    • \(_{-i}\) = when exclude i
  • to satisfy weak budget balance
  • no single-agent effect
    • the welfare of agents other than \(i\) is weakly increased when dropping i
    • e.g. game-theory-28.png
  • VCG is weakly budget-balanced when the no single-agent effect is satisfied
    • game-theory-29.png
  • game-theory-30.png
  • VCG is as budget-balanced as any efficient mechanism can be if it's ex post individually rational
    • satisfies weak budget balance in any case where any dominant strategy, efficient & ex interim (weaker than ex post) is able to

auctions

  • types of auctions
  • English auction
    • ordinary one
  • Japanese auction
    • the auctioneer increases the price continously, people choose when to give up, last person standing wins
    • giving up is irreversible
    • analytically more tractable than English
  • Dutch auction
    • the actioneer decreaces the price continously, the first one call you mine wins
    • minimal communication
  • first-price auction
    • sealed bid
    • the one writing the highest price wins
    • strategically equivalent to Dutch auction
    • bid less than valuation
    • tradeoff between probability of winning & amount you pay
    • no dominant strategy
    • game-theory-31.png
      • game-theory-32.jpg game-theory-33.jpg
    • game-theory-34.png
      • more people, you bid closer to your valuation
    • async = True
  • second-price auction
    • sealed bid
    • the one writing the highest price wins, but pays what the second-highest bidder writes
    • is VCG
    • when pivotal, I win and others get nothing when I participate, and the one with the second highest utility wins when I don't participate, so I pay the second highest bid (??????)
    • dominant strategy is to bid your true value i.e. truth-telling
  • all-pay auction
    • sealed bid
    • the one writing the highest price wins, but everyone pays what they writes
  • definition
  • market-based (an exchange in terms of currency)
  • mediated (has an auctioneer)
  • well-specified (rules)
    • rules for bidding
    • rules for what information if revealed
    • rules for clearing
    • when it ends
    • allocation
    • payment
  • under independent private value (IPV) model, the dominant strategy is to bid your true value in English, Japenese & second-price auctions
  • you see other bidders in English & Japanese, but under IPV, it makes no difference

revenue equivalence

  • game-theory-35.jpg
  • kth order stastistics
  • expected value of kth largest draw = \(\dfrac{n+1-k}{n+1}v_{max}\)
  • 1st & 2nd price auctions satisfy the requirements of revenue equivalence theorem
  • symmetric game → symmetric equivalence
  • in a symmetric equilibrium, higher bid iff higher valuation
    • person with highest valuation wins
  • a 1st-price auction bidder bids his expected payment as he's the winner of a 2nd-price auction
    • other n-1 values are uninformly drawn from \([0, v_i]\)
    • \(E[\text{2nd-highest bid}] = \dfrac{(n-1)+1-1}{(n-1)+1}v_{i}=\dfrac{n-1}{n}v_i\)
  • proof
  • 沒看
  • https://www.coursera.org/learn/game-theory-2/lecture/ZfHhY/4-5-revenue-equivalence 8:40

optimal auctions

  • optimal reserve price in a second-price auction
  • reserve price also acts as another bidder
  • if 2 bids are uniformly drawn from [0,1], with reserve price = R
    • both < R, \(p=R^2\) → revenue = 0
    • 1 above 1 below R, \(p=(1-R)(R)\), revenue = R (2nd-highest bid)
    • both >= R, \(p=(1-R)^2\), revenue \(\dfrac{1+R}{3}\)
    • game-theory-36.png
    • expected revenue = \(\dfrac{1+3R^2-4R^3}{3}\)
    • max when R=\(\dfrac{1}{2}\)
    • https://math.stackexchange.com/a/2213498
  • virutal valuation
  • game-theory-37.png
    • your valuation, adjusted
  • game-theory-38.png
  • Mayerson's optimal auction
  • game-theory-39.jpg game-theory-40.png
  • sell to the agent with the highest virtual valuation
    • the winner pay the critical valuation of being the winner
  • the optimal is a second-price aunction
    • game-theory-41.png
  • not VCG, not efficient
    • can end up not selling
  • virtual valuation makes weak bidders more competitive → higher bidder need to bids higher, increases the competition

Ch3 Contract Theory

  • mechanism design for the interactions between employer/seller & employee
  • solve assymetric
  • give incentive to self-reveal
  • in wireless network
  • employer/seller/PU/CC
    • service provider
    • base station
  • employee/buyer/SU/ED
    • user
    • device
  • mobile crowdsourcing example
  • employee's contraints
  • incentive compatibility, IC
    • contract maximize utility
    • local downward incentive constraints (LDIC) (?)
  • individual rationality, IR
    • utility bigger than without contract
  • adverse selection
  • screening problem
    • uninformed party offer the contract
    • solve adverse selection ny using relevation principle
    • force the informed party to choose the contract that reveil its status
    • offer multiple contracts intended for employees with different skill levels
  • most incentive problems are a combination of both adverse selection & moral hazard
  • bilateral & multilateral
  • bilateral
    • one-to-one
  • multilateral
  • one & multi-dimensional
  • how many you consider
  • static & repeated
  • static
    • memoryless
    • one-shot deal, histories don't affect
  • repeated
    • histories will affect
    • more complex
    • renegotiation & designing long-term contract
    • equivalent to market equilibrium

CRN spectrum scheme

  • one dimensional, bilateral, static, adverse selection
  • cooperative spectrum sharing
  • PU 跟 SU 買不用的資源
  • PU = primary user
  • SU = secondary user
  • down payment, t
  • 頭款,簽訂合約時 SU 付的錢
  • installment payment, r
  • 尾款,事成後 SU 付的錢
  • hidden information → adverse selection
  • PU doesn't know SU's capability to complete the task
  • SU's type
    • higher → 成功率愈高 → PU decreases down payment, increase installment payment
    • lower → 成功率愈低 → PU increases down payment, decrease installment payment
    • trade-off between 拿到錢 & 更多勞工
  • solution: PU offers different types of contracs to different types of SUs
  • hidden action → moral hazard
  • PU doesn't know how much effort i.e. transmission power SU put in
  • SU's operation cost \(\psi=\frac{c}{2}e^2\)
    • c = cost coeff.
    • e = effort
  • solution: PU designs each contract carefullt
  • payoff
  • SU
    • \(U_{SU_i}=\theta_ie_i(R-r_i)-t_i-\frac{c}{2}e_i^2\)
    • \(\theta_i\) = \(SU_i\)'s capability
    • \(\theta_ie_i\) = \(SU_i\)'s probability to compete the task
      • make c high enough s.t. \(\theta_ie_i<1\)
    • R = revenue (if task completed)
    • literal meaning: expected profit - down payment - operation cost
  • PU
    • \(U_{PU_i}=t_i+\theta_ie_ir_i\)
    • literal meaning: down payment + operation installment payment
    • \(U_{PU}=\displaystyle{\sum_{i=1}^n}\beta_i(t_i+\theta_ie_ir_i)\)
    • \(\beta_i\) = \(SU_i\)'s type
  • social welfare
    • sum of the expected payoff of SU & PU
    • \(U=\displaystyle{\sum_{i=1}^n}\beta_i(\theta_ie_iR-\frac{c}{2}e_i^2)\)
    • installment payment & down payment 為內力
    • literal meaning: expected revenue - SU's operation cost
  • cases
  • general case
    • both moral hazard & adverse selection
    • PU's payoff maximization problem
    • \(\displaystyle{\max_{(t_i,r_i)}\sum_{i=1}^n}\beta_i(t_i+\theta_ie_ir_i)\)
    • (IC) \(\theta_ie_i(R-r_i)-t_i-\frac{c}{2}e_i^2\geq \theta_ie_i'(R-r_j)-t_j-\frac{c}{2}e_i'^2\)
      • SU 選擇最爽的合約簽
    • (IR) \(\theta_ie_i(R-r_i)-t_i-\frac{c}{2}e_i^2\geq 0\)
      • SU 簽合約比不簽還爽
    • SU's optimal effort \(e_i^*=\frac{1}{c}\theta_i(R-r_i)\)
      • 跟 down payment t 無關
      • 代進去後用 lagrange multiplier 得 t 的 iterative 解
  • extreme case
    • only moral hazard
    • no adverse selection i.e. PU knows SU's capability
    • PU's problem
      • \(\displaystyle{\max_{(t_i,r_i)}}\space t_i+\frac{1}{c}\theta_i^2(R-r_i)r_i\)
      • IR
    • solution
      • \(t_i=\frac{1}{2c}\theta_i^2R^2\)
      • \(r_i=0\)
      • 只需要 down payment 而不需 installment payment
    • only adverse selection
    • no moral hazard → SU's effort fixed at \(\hat{e}\)
    • solution
      • \(t_i=-\frac{1}{2}c\hat{e}^2<0\)
      • \(r_i=R\)
      • PU asks 100% future revenue from SU
      • PU pays the operation cost to SU for SU to join
  • discussion
  • adverse selection
    • SU has the incentive to choose the intended contract and reveal its true type
  • moral hazard
    • provide incentive for SU to reduce its operation cost
    • if no moral hazard, when SU reduces the effort, PU will notice and redesign the contract to seize the means of production
  • financing contract analysis when only 2 type
  • 2 types of SU
    • \(\theta_H\): high capability
    • \(\theta_L\): low capability
  • results
    • installment payment \(r_H=0\) regardless of cost coeff. c, revenue R or probability \(\beta\)
    • game-theory-42.png
    • \(\theta_H\) 多金
  • low down payment for \(\theta_L\) game-theory-43.png
    • \(\theta_L\) 窮 → 大部分 payment 在拿到 revenue 後收
  • system performance
  • only moral hazard & only adverse selection 的情況都把 SU payoff 壓成 0 bc SU 其中一項資訊透明
  • the 2 extreme cases can be the bounds of PU's payoff & social welfare
  • cost coeff. ↑ payoff & social welfare ↓
    • game-theory-44.png
  • revenue ↑ payoff & social welfare ↑
    • game-theory-45.png
  • distribution ↑ PU's payoff & social welfare ↑
    • PU will ask more money if SU is belived to be a highly capable one → \(\beta\) has a negative effect on SU'spayoff
    • game-theory-46.png

Ch4 stochastic game

  • absorbing state
  • 不會再變的 state
  • have a absorbing state → finite stages
  • payoffs at every stage depends on the state
  • repeated game: payoff matrix same for every stage, i.e. only has 1 state
  • only 1 player → markov decision process (like 交電)
  • transition probability depend on the state & the actions of the playsers
  • only depend on 1 player → single-controller stochastic game
  • the evolution of state can follow a Markov process or DEs
  • stationary strategies
  • mixed stategies depending only one the current state
    • not on time
    • not on past actions or states
  • always play the mixed action \(\sigma_{i,s}\) at state \(s\in S\)
  • finite-horizon stochastic game
  • finite stages
  • infinite-horizon stochastic game
  • infinite stages
  • discounted payoff
  • current money is better than future money
  • discount factor \(\delta \in (0,1)\)
  • \(\epsilon\)-Nash equilibrium
  • near-Nash equilibrium
  • \(\epsilon=1\) → Nash equilibrium
  • discounted infinite-horizon stochastic game
  • two-player, zero-sum stochastic game
  • at most 1 eq payoff at every initial state \(s_1\) (?)
    • the value of the game
  • every stochastic game with finite state and action space admits a \(\delta\)-discounted Nash equilibrium in stationary strategies

Ch6 Learning in Games

  • sequential
  • 不同 player 等上一個 player 算完再算 response, 再傳給下一個
  • 快速 converge
  • Gauss Seidel Method
  • 對角線最大 → converge
  • parallel
  • 不同 player 同時計算 best response
  • less sync, slower convergence
  • jacobi method
  • lloyd-max algorithm
  • supermodular game
  • increasing difference
    • 變大時 difference 也變大
  • potential game
  • 每個 iteration utitlity 都增加
  • potetential strictly increase
  • if potential 有 max → 趨近收斂於 nash
  • fictitious play
  • mixed stategy 用實際機率代替
  • dominance solvable game
  • 一直刪掉 dominant strategy → one left - nash
  • regret
  • coarse correlated equilibrium
  • coarse correlated equilibrium
  • 每個 block 的機率
    • 相對於 NE mixed 是 player 的策略機率
  • coarse-correlated equilibrium 包含了 mixed & pure 的 NE
  • game-theory-47.png
  • game-theory-48.png
    • \(|a_i\) 表知道在哪個 block
  • https://www.youtube.com/watch?v=sQOrIpARr5E
  • reinforcement
  • decentralized, 不用管他人的 mixed
  • iterate update strategy & utitlity
  • recurrent neural network, RNN
  • output 餵給 input
  • update all layers
  • hidden layers all connected
  • echo state networks, ESN
  • RNN 簡化版
  • 只 update 最後層
  • hidden layer 不 connect (sparse) 且有 randomization