Stochastic Principal-Agent Problems: Computing and Learning Optimal History-Dependent Policies