Oh hey, I just implemented this in Go. My implementation is heavily optimized for CPU.
Can you share your repo?