CUDA Networks
|
Implementation of the Matrix::transpose method for GPU-accelerated matrix transposition. More...
Go to the source code of this file.
Functions | |
__global__ void | matrixTransposeKernel (const double *input, double *output, int rows, int cols) |
CUDA kernel for matrix transposition. More... | |
Implementation of the Matrix::transpose method for GPU-accelerated matrix transposition.
Definition in file matrix_transpose.cu.
__global__ void matrixTransposeKernel | ( | const double * | input, |
double * | output, | ||
int | rows, | ||
int | cols | ||
) |
CUDA kernel for matrix transposition.
input | Pointer to the input matrix data. |
output | Pointer to the output (transposed) matrix data. |
rows | Number of rows in the input matrix. |
cols | Number of columns in the input matrix. |
Definition at line 18 of file matrix_transpose.cu.