Does this sort of thing scale? Would a 30B or higher model see similar performance/memory gains...

swiftcoder • today at 7:58 AM • 0 replies • view on HN

Does this sort of thing scale? Would a 30B or higher model see similar performance/memory gains under this scheme?

alt Hacker News