Learning Asynchronous and Sparse Human-Object Interaction in Videos | Read Paper on Bytez