SAM-Guided Masked Token Prediction for 3D Scene Understanding | Read Paper on Bytez