Re per-token: I had the same reaction, but given that both labs are economically advantaged by moving customers to per-token consumption... I almost want to avoid this on principle. Even if not intentional, benefitting from a degraded product is not something I want to accept or enable.
More now than ever (since the original ChatGPT release), the OSS models and open harnesses (e.g. Pi) are looking mighty attractive.
If pricing is per-token, then in theory the vendor can offer you modes that optimize for token usage or for quality, whereas all-you-can-eat encourages vendors to satisfy you just enough to keep you paying. But the trend is towards lower-quality responses.