▍1. Matrix_mulitiple
使用CUDA并行语言编写的矩阵乘法,与串行语句的对比试验,主要检测出CPU与GPU运行的时间,通过比较时间,达到比较效果。另外此程序可以随意更改矩阵的维数,增强程序的移植性。(CUDA parallel language use matrix multiplication, and serial statement comparison test, the main test of CPU and GPU to run time, by comparing the time, to compare results. Also this program can change the dimension of the matrix, and enhanced portability of the program.)