Go to the source code of this file.
Functions | |
void | cs_blas_cuda_finalize (void) |
Finalize CUDA BLAS API. More... | |
double | cs_blas_cuda_asum (cs_lnum_t n, const cs_real_t x[]) |
Return the absolute sum of vector values using CUDA. More... | |
double | cs_blas_cuda_dot (cs_lnum_t n, const cs_real_t x[], const cs_real_t y[]) |
Return the dot product of 2 vectors: x.y using CUDA. More... | |
void | cs_blas_cuda_axpy (cs_lnum_t n, const cs_real_t *alpha, const cs_real_t *x, cs_real_t *y) |
void | cs_blas_cuda_scal (cs_lnum_t n, const cs_real_t *alpha, cs_real_t *x) |
Return the absolute sum of vector values using CUDA.
[in] | n | size of array x |
[in] | x | array of floating-point values (on device) |
Return the dot product of 2 vectors: x.y using CUDA.
[in] | n | size of arrays x and y |
[in] | x | array of floating-point values (on device) |
[in] | y | array of floating-point values (on device) |
void cs_blas_cuda_finalize | ( | void | ) |
Finalize CUDA BLAS API.
This frees resources such as the cuBLAS handle, if used.