I don't think you'll get that anytime soon, because a model needs a massive context window to work well with something like openclaw.
but but but but unified memory! (jk, I don't actually believe in Apple marketing words)
There might be future optimizations. For example, have your small model do chain-of-thought (CoT) to figure out where to look for the relevant memory, so it doesn't need everything in context at once.
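Rough sketch of what I mean, in Python. Everything here is made up for illustration: `MEMORY`, `keyword_router`, and `build_prompt` are hypothetical names, and the "router" is just keyword matching standing in for the small model's reasoning step. The point is only the shape: decide where to look first, then load just that slice into the prompt.

```python
from typing import Callable

# Hypothetical memory store: topic name -> stored notes.
MEMORY = {
    "calendar": "Dentist on Friday at 10am.",
    "projects": "Refactor the parser before the release.",
    "shopping": "Buy coffee beans and oat milk.",
}


def keyword_router(question: str, topics: list[str]) -> list[str]:
    """Stand-in for the small model's reasoning step.

    Assumption: a real setup would ask the model which topics matter;
    here we just pick topics whose name appears in the question.
    """
    words = set(question.lower().replace("?", "").split())
    return [t for t in topics if t in words]


def build_prompt(
    question: str,
    route: Callable[[str, list[str]], list[str]] = keyword_router,
) -> str:
    """Load only the routed memory entries instead of the whole store."""
    chosen = route(question, sorted(MEMORY))
    context = "\n".join(f"[{t}] {MEMORY[t]}" for t in chosen)
    return f"{context}\n\nQuestion: {question}"


if __name__ == "__main__":
    print(build_prompt("What is on my calendar?"))
```

Swap `keyword_router` for an actual model call and the context stays small no matter how big the memory store gets. Whether a small model routes well enough for that to work is the open question.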