当然,得到的矩阵非常大,但是sgemm和在C-order中传递到dgemm都是有效的,>>> import scipy.linalg.blas#sgemm works, A.T is in F-order
>>> C = scipy.linalg.blas.sgemm(alpha=1.0, a=A.T, b=A.T, trans_
当它试图运行时,它会遇到一个CUBLAS_STATUS_ALLOC_FAILED错误。谷歌搜索什么都找不到。:372] failed to create cublas handle: CUBLAS_STATUS_ALLOC_FAILED
W c:\tf_jenkins\home\workspace\release-win\device\gpu\os\windows\tensorflow\stream_executor\stream.cc:1390] attempting to perform BLAS operationu