Gradient Descent Only Converges to Minimizers: Non-Isolated Critical Points and Invariant Regions | Read Paper on Bytez