How do you know it swaps to ram vs on the TPU?
Would be interested in testing this on my pixel.
Because TPU has 2GB and weight + context needs more
Because TPU has 2GB and weight + context needs more