Multimodal Transformer for Automatic 3D Annotation and Object Detection | Read Paper on Bytez