> You could run it on a cluster of nodes
Not sure this is an MBP either.
To my knowledge, not even a cluster of Mac Pros linked over RDMA could run a dense 5T-parameter model.
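
For a rough sense of scale, here is a back-of-envelope sketch of the weight memory alone. The 192 GB per-node figure is my assumption for a maxed-out Mac Pro, and the function names are made up for illustration. It ignores KV cache, activations, and interconnect bandwidth, so the real requirement would be higher.

```python
import math

def weight_memory_tb(params, bits_per_param):
    """Memory for the weights alone, in decimal terabytes."""
    return params * bits_per_param / 8 / 1e12

def nodes_needed(params, bits_per_param, node_memory_gb):
    """Minimum node count if every byte of node memory held weights."""
    total_gb = params * bits_per_param / 8 / 1e9
    return math.ceil(total_gb / node_memory_gb)

PARAMS = 5e12          # dense 5T-parameter model, from the comment above
NODE_MEMORY_GB = 192   # ASSUMPTION: unified memory per node, adjust as needed

for bits in (16, 8, 4):
    tb = weight_memory_tb(PARAMS, bits)
    n = nodes_needed(PARAMS, bits, NODE_MEMORY_GB)
    print(f"{bits}-bit weights: {tb:.1f} TB, at least {n} nodes")
```

Since the model is dense, every weight is read for every token, so the node count is only a lower bound on how painful this would be.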