Cheaply Estimating Inference Efficiency Metrics for Autoregressive Transformer Models