> Circular reasoning: that's true only if the posterior is normal, or if your "optimal" is defined by second moments.
That doesn't sound right, it is an error minimising technique. Are we not talking about minimising mean square errors? Why would the posterior need to be normal? And why would optimal need to be defined by 2nd moments?