A3B is especially nice, MoE really shines on memory bandwidth contained platforms like the DGX Spark.
looks like MTP support has now been merged and also updated unsloth quants to go with it (not just the extras, all of 'em!)
looks like MTP support has now been merged and also updated unsloth quants to go with it (not just the extras, all of 'em!)