8.2
general documentation
Loading...
Searching...
No Matches
cs_blas.c File Reference
#include "cs_defs.h"
#include <math.h>
#include <stdio.h>
#include "cs_base.h"
#include "cs_log.h"
#include "cs_parall.h"
#include "cs_blas.h"
Include dependency graph for cs_blas.c:

Functions

void cs_blas_library_info (cs_log_t log_type)
 Print information on BLAS libraries used.
void cs_blas_set_reduce_algorithm (cs_blas_reduce_t mode)
 Set the preferred BLAS reduction algorithm family.
void cs_axpy (cs_lnum_t n, double a, const cs_real_t *x, cs_real_t *restrict y)
 Constant times a vector plus a vector: y <– ax + y.
double cs_sum (cs_lnum_t n, const cs_real_t *x)
double cs_weighted_sum (cs_lnum_t n, const cs_real_t *w, const cs_real_t *x)
double cs_dot (cs_lnum_t n, const cs_real_t *x, const cs_real_t *y)
 Return the dot product of 2 vectors: x.y.
double cs_dot_xx (cs_lnum_t n, const cs_real_t *x)
 Return dot products of a vector with itself: x.x.
double cs_dot_wxx (cs_lnum_t n, const cs_real_t *w, const cs_real_t *x)
void cs_dot_xx_xy (cs_lnum_t n, const cs_real_t *restrict x, const cs_real_t *restrict y, double *xx, double *xy)
 Return 2 dot products of 2 vectors: x.x, and x.y.
void cs_dot_xy_yz (cs_lnum_t n, const cs_real_t *restrict x, const cs_real_t *restrict y, const cs_real_t *restrict z, double *xy, double *yz)
 Return 2 dot products of 3 vectors: x.y, and y.z.
void cs_dot_xx_xy_yz (cs_lnum_t n, const cs_real_t *restrict x, const cs_real_t *restrict y, const cs_real_t *restrict z, double *xx, double *xy, double *yz)
 Return 3 dot products of 3 vectors: x.x, x.y, and y.z.
void cs_dot_xx_yy_xy_xz_yz (cs_lnum_t n, const cs_real_t *restrict x, const cs_real_t *restrict y, const cs_real_t *restrict z, double *xx, double *yy, double *xy, double *xz, double *yz)
 Return 5 dot products of 3 vectors: x.x, y.y, x.y, x.z, and y.z.
double cs_gdot (cs_lnum_t n, const cs_real_t *x, const cs_real_t *y)
 Return the global dot product of 2 vectors: x.y.
double cs_gres (cs_lnum_t n, const cs_real_t *vol, const cs_real_t *x, const cs_real_t *y)
 Return the global residual of 2 intensive vectors: 1/sum(vol) . sum(x.y.vol)
double cs_gmean (cs_lnum_t n, const cs_real_t *vol, const cs_real_t *x)
 Return the global spacial average of an intensive vectors: 1/sum(vol) . sum(x.vol)

Detailed Description

BLAS (Basic Linear Algebra Subroutines) type functions

Function Documentation

◆ cs_axpy()

void cs_axpy ( cs_lnum_t n,
double a,
const cs_real_t * x,
cs_real_t *restrict y )

Constant times a vector plus a vector: y <– ax + y.

Parameters
[in]nsize of arrays x and y
[in]amultiplier for x
[in]xarray of floating-point values
[in,out]yarray of floating-point values

◆ cs_blas_library_info()

void cs_blas_library_info ( cs_log_t log_type)

Print information on BLAS libraries used.

Parameters
[in]log_typelog type

◆ cs_blas_set_reduce_algorithm()

void cs_blas_set_reduce_algorithm ( cs_blas_reduce_t mode)

Set the preferred BLAS reduction algorithm family.

This may not be enforced for all algorithms, though it should at least be enforced for the most general functions such as cs_dot.

Parameters
[in]modeBLAS mode to use

◆ cs_dot()

double cs_dot ( cs_lnum_t n,
const cs_real_t * x,
const cs_real_t * y )

Return the dot product of 2 vectors: x.y.

Parameters
[in]nsize of arrays x and y
[in]xarray of floating-point values
[in]yarray of floating-point values
Returns
dot product

◆ cs_dot_wxx()

double cs_dot_wxx ( cs_lnum_t n,
const cs_real_t * w,
const cs_real_t * x )

◆ cs_dot_xx()

double cs_dot_xx ( cs_lnum_t n,
const cs_real_t * x )

Return dot products of a vector with itself: x.x.

For better precision, a superblock algorithm is used.

Parameters
[in]nsize of array x
[in]xarray of floating-point values
Returns
dot product

◆ cs_dot_xx_xy()

void cs_dot_xx_xy ( cs_lnum_t n,
const cs_real_t *restrict x,
const cs_real_t *restrict y,
double * xx,
double * xy )

Return 2 dot products of 2 vectors: x.x, and x.y.

The products could be computed separately, but computing them simultaneously adds more optimization opportunities and possibly better cache behavior.

Parameters
[in]nsize of arrays x and y
[in]xarray of floating-point values
[in]yarray of floating-point values
[out]xxx.x dot product
[out]xyx.y dot product

◆ cs_dot_xx_xy_yz()

void cs_dot_xx_xy_yz ( cs_lnum_t n,
const cs_real_t *restrict x,
const cs_real_t *restrict y,
const cs_real_t *restrict z,
double * xx,
double * xy,
double * yz )

Return 3 dot products of 3 vectors: x.x, x.y, and y.z.

The products could be computed separately, but computing them simultaneously adds more optimization opportunities and possibly better cache behavior.

Parameters
[in]nsize of arrays x and y, and z
[in]xarray of floating-point values
[in]yarray of floating-point values
[in]zarray of floating-point values
[out]xxx.x dot product
[out]xyx.y dot product
[out]yzy.z dot product

◆ cs_dot_xx_yy_xy_xz_yz()

void cs_dot_xx_yy_xy_xz_yz ( cs_lnum_t n,
const cs_real_t *restrict x,
const cs_real_t *restrict y,
const cs_real_t *restrict z,
double * xx,
double * yy,
double * xy,
double * xz,
double * yz )

Return 5 dot products of 3 vectors: x.x, y.y, x.y, x.z, and y.z.

The products could be computed separately, but computing them simultaneously adds more optimization opportunities and possibly better cache behavior.

Parameters
[in]nsize of arrays x and y, and z
[in]xarray of floating-point values
[in]yarray of floating-point values
[in]zarray of floating-point values
[out]xxx.x dot product
[out]yyy.y dot product
[out]xyx.y dot product
[out]xzx.z dot product
[out]yzy.z dot product

◆ cs_dot_xy_yz()

void cs_dot_xy_yz ( cs_lnum_t n,
const cs_real_t *restrict x,
const cs_real_t *restrict y,
const cs_real_t *restrict z,
double * xy,
double * yz )

Return 2 dot products of 3 vectors: x.y, and y.z.

The products could be computed separately, but computing them simultaneously adds more optimization opportunities and possibly better cache behavior.

Parameters
[in]nsize of arrays x and y, and z
[in]xarray of floating-point values
[in]yarray of floating-point values
[in]zarray of floating-point values
[out]xyx.y dot product
[out]yzy.z dot product

◆ cs_gdot()

double cs_gdot ( cs_lnum_t n,
const cs_real_t * x,
const cs_real_t * y )

Return the global dot product of 2 vectors: x.y.

In parallel mode, the local results are summed on the default global communicator.

For better precision, a superblock algorithm is used.

Parameters
[in]nsize of arrays x and y
[in]xarray of floating-point values
[in]yarray of floating-point values
Returns
dot product

◆ cs_gmean()

double cs_gmean ( cs_lnum_t n,
const cs_real_t * vol,
const cs_real_t * x )

Return the global spacial average of an intensive vectors: 1/sum(vol) . sum(x.vol)

In parallel mode, the local results are summed on the default global communicator.

Parameters
[in]nsize of arrays x
[in]volarray of floating-point values
[in]xarray of floating-point values
Returns
global residual

◆ cs_gres()

double cs_gres ( cs_lnum_t n,
const cs_real_t * vol,
const cs_real_t * x,
const cs_real_t * y )

Return the global residual of 2 intensive vectors: 1/sum(vol) . sum(x.y.vol)

In parallel mode, the local results are summed on the default global communicator.

Parameters
[in]nsize of arrays x and y
[in]volarray of floating-point values
[in]xarray of floating-point values
[in]yarray of floating-point values
Returns
global residual

◆ cs_sum()

double cs_sum ( cs_lnum_t n,
const cs_real_t * x )

◆ cs_weighted_sum()

double cs_weighted_sum ( cs_lnum_t n,
const cs_real_t * w,
const cs_real_t * x )