BCE vs. CE in Deep Feature Learning | Read Paper on Bytez