MSCCL++: Rethinking GPU Communication Abstractions for AI Inference | Read Paper on Bytez