Neko  0.8.1
A portable framework for high-order spectral element flow simulations
fusedcg_device Module Reference

Defines a fused Conjugate Gradient method for accelerators. More...

Data Types

type  fusedcg_device_t
 Fused preconditioned conjugate gradient method. More...
 
interface  cuda_fusedcg_update_p
 
interface  cuda_fusedcg_update_x
 
interface  cuda_fusedcg_part2
 

Functions/Subroutines

subroutine device_fusedcg_update_p (p_d, z_d, po_d, beta, n)
 
subroutine device_fusedcg_update_x (x_d, p_d, alpha, p_cur, n)
 
real(kind=rp) function device_fusedcg_part2 (a_d, b_d, c_d, alpha_d, alpha, p_cur, n)
 
subroutine fusedcg_device_init (this, n, max_iter, M, rel_tol, abs_tol)
 Initialise a fused PCG solver. More...
 
subroutine fusedcg_device_free (this)
 Deallocate a pipelined PCG solver. More...
 
type(ksp_monitor_t) function fusedcg_device_solve (this, Ax, x, f, n, coef, blst, gs_h, niter)
 Pipelined PCG solve. More...
 

Variables

integer, parameter device_fusedcg_p_space = 10
 

Detailed Description

Defines a fused Conjugate Gradient method for accelerators.

Function/Subroutine Documentation

◆ device_fusedcg_part2()

real(kind=rp) function fusedcg_device::device_fusedcg_part2 ( type(c_ptr), value  a_d,
type(c_ptr), value  b_d,
type(c_ptr), value  c_d,
type(c_ptr), value  alpha_d,
real(c_rp)  alpha,
integer  p_cur,
integer  n 
)
private

Definition at line 169 of file fusedcg_device.F90.

Here is the call graph for this function:
Here is the caller graph for this function:

◆ device_fusedcg_update_p()

subroutine fusedcg_device::device_fusedcg_update_p ( type(c_ptr), value  p_d,
type(c_ptr), value  z_d,
type(c_ptr), value  po_d,
real(c_rp)  beta,
integer(c_int)  n 
)
private

Definition at line 144 of file fusedcg_device.F90.

Here is the call graph for this function:
Here is the caller graph for this function:

◆ device_fusedcg_update_x()

subroutine fusedcg_device::device_fusedcg_update_x ( type(c_ptr), value  x_d,
type(c_ptr), value  p_d,
type(c_ptr), value  alpha,
integer(c_int)  p_cur,
integer(c_int)  n 
)
private

Definition at line 157 of file fusedcg_device.F90.

Here is the call graph for this function:
Here is the caller graph for this function:

◆ fusedcg_device_free()

subroutine fusedcg_device::fusedcg_device_free ( class(fusedcg_device_t), intent(inout)  this)
private

Deallocate a pipelined PCG solver.

Definition at line 247 of file fusedcg_device.F90.

◆ fusedcg_device_init()

subroutine fusedcg_device::fusedcg_device_init ( class(fusedcg_device_t), intent(inout), target  this,
integer, intent(in)  n,
integer, intent(in)  max_iter,
class(pc_t), intent(inout), optional, target  M,
real(kind=rp), intent(inout), optional  rel_tol,
real(kind=rp), intent(inout), optional  abs_tol 
)
private

Initialise a fused PCG solver.

Definition at line 194 of file fusedcg_device.F90.

◆ fusedcg_device_solve()

type(ksp_monitor_t) function fusedcg_device::fusedcg_device_solve ( class(fusedcg_device_t), intent(inout)  this,
class(ax_t), intent(inout)  Ax,
type(field_t), intent(inout)  x,
real(kind=rp), dimension(n), intent(inout)  f,
integer, intent(in)  n,
type(coef_t), intent(inout)  coef,
type(bc_list_t), intent(inout)  blst,
type(gs_t), intent(inout)  gs_h,
integer, intent(in), optional  niter 
)
private

Pipelined PCG solve.

Definition at line 308 of file fusedcg_device.F90.

Here is the call graph for this function:

Variable Documentation

◆ device_fusedcg_p_space

integer, parameter fusedcg_device::device_fusedcg_p_space = 10
private

Definition at line 50 of file fusedcg_device.F90.