Hiroki Naganuma

Algorithms for Solving Bilevel Optimization Problems

Implicit Function Theorem (in Stackelberg Games)

Introduction

Key Insights

Research Findings

References

Penalty Based Method

Overview of the Penalty-Based Method

  1. Definition:
    • A method that transforms a bilevel optimization problem into a single-objective minimization problem.
  2. Details of the Method:
    • Initial Approach:
      • Explained based on the Stackelberg game.
    • Transformation:
      • First, it’s converted into a constrained optimization problem. Then, using the Lagrange multiplier, it’s transformed into an unconstrained problem.
    • Optimality Condition:
      • The function h describes the violation of the optimality condition. f2* represents the optimal f2 value.
    • Updates:
      • The values of x and y are updated by a significant delta (Δ). The choice of Δ should not deviate significantly from the gradient of f1.
  3. Advantages:
    • Stabilizes the learning dynamics in games.
    • Allows the application of adaptive optimization methods.
  4. Disadvantages:
    • Inefficient: Requires many gradient descent steps for each update.
    • In cases with poor loss landscapes, the optimal f2* found in each iteration can vary significantly.

Notations:

Two Time Scale Update Rule (TTUR)

Application

Hyperparameter Optimization

Model-based Reinforcement Learning

GANs

Latent Diffusion Models

IRM

Misc