Go to the source code of this file.
Functions | |
| void | cs_blas_cuda_finalize (void) |
| Finalize CUDA BLAS API. | |
| double | cs_blas_cuda_asum (cs_lnum_t n, const cs_real_t x[]) |
| Return the absolute sum of vector values using CUDA. | |
| double | cs_blas_cuda_dot (cs_lnum_t n, const cs_real_t x[], const cs_real_t y[]) |
| Return the dot product of 2 vectors: x.y using CUDA. | |
| void | cs_blas_cuda_axpy (cs_lnum_t n, const cs_real_t *alpha, const cs_real_t *x, cs_real_t *y) |
| void | cs_blas_cuda_scal (cs_lnum_t n, const cs_real_t *alpha, cs_real_t *x) |
Return the absolute sum of vector values using CUDA.
| [in] | n | size of array x |
| [in] | x | array of floating-point values (on device) |
Return the dot product of 2 vectors: x.y using CUDA.
| [in] | n | size of arrays x and y |
| [in] | x | array of floating-point values (on device) |
| [in] | y | array of floating-point values (on device) |
| void cs_blas_cuda_finalize | ( | void | ) |
Finalize CUDA BLAS API.
This frees resources such as the cuBLAS handle, if used.