Retrosynthesis Planning via Worst-path Policy Optimisation in Tree-structured MDPs | Read Paper on Bytez