Posts tagged with "mutual information"
1 post found
Mar 31, 2026 information theory Shannon channel capacity attention bounds mutual information rate-distortion
The Information-Theoretic Limits of Context Windows
There are fundamental limits on how much information a fixed-width attention mechanism can extract from n tokens. Here's the math from Shannon's channel capacity to attention bounds.