Input throughput measures how fast models process tokens when reading/encoding text. Output throughput measures how fast they generate new tokens. The ratio line shows Falcon-H1/Qwen2.5 performance comparison.