ParallelKittens: Simple and Fast Multi-GPU AI Kernels hazyresearch.stanford.edu 7 points by pella a day ago