A local, offline system you control is worth a lot. Introducing an external dependency guarantees you will have downtime outside of your control.
Right, but that doesn't answer why you'd need a fast 7b LLM rather than a slightly less fast 14b LLM.