On Improving Sparse Matrix-Matrix Multiplication On Gpus