T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning