The /iterative folder contains various iterative solution algorithms. Generally speaking, direct solvers favor robustness over speed, whereas iterative solvers lean the other direction. Often the two solution frameworks are combined. In finite element substructuring methods, for example, direct solvers are used to explicitly eliminate interior subdomain unknowns, and the leftover interface-matching problem is then solved using iterative techniques. Another noteworthy example is mixed precision refinement, wherein a low (single) precision direct solver is used as a preconditioner when solving a high (double) precision A*x=b problem. By providing both direct and iterative solution routines, MyraMath seeks to be an attractive "ecosystem" for implementing these kinds of algorithms.
All the iterative algorithms in MyraMath take Action's as their inputs. Action is an abstraction for a "callback" that applies a matrix-vector product (b=A*x), the key step in Krylov-type linear solution and eigensolution algorithms. Besides Krylov-type algorithms, simpler ones like backwards refinement and randomized low-rank sampling are also implemented in terms of Action.
Most of the classes we've used so far have been encapsulations of data. For example, Matrix encapsulates a 2D table of numbers, PatternBuilder encapsulates a bitmask of nonzero entries, and SparseRCholeskySolver encapsulates a (permuted) sparse triangular factor L such that A = L*L'. Action represents a slightly different concept: it encapsulates behavior. Specifically, Action encapsulates the behavior of applying a linear operator A to a column vector x to produce another column vector b. This is a fundamental operation behind many iterative solution algorithms: given a candidate solution x, the Action A can be applied to it to measure a residual, which is used in turn to improve the solution (ad infinitum).
Many functions and methods can readily be adapted into Action's. For instance, diagonal scaling (think dense/dimm.h), a sparse matrix-vector multiply (think sparse/gemm.h), and even the .solve() method of a Solver all model the concept of an Action by mapping one input vector x into another output vector b. Within /iterative there are many "adaptor" methods for making atomic Action's.
This language of composition can be used to rapidly prototype all sorts of preconditioning effects (additive/multiplicative combination, deflation, etc). Even third-party or end-user code can be encapsulated within an Action (see iterative/UserAction.h). Any process that maps input vectors to output vectors is a suitable model for the Action concept. See the Appendix below for a working example. Just like MatrixRange served as the "interface type" to get external algebraic data into direct routines (like getrf()), Action serves as the "interface type" to get external algebraic behavior into iterative routines (like gmres()).
Refinement
Backwards refinement (see iterative/refine.h) is a simple process for improving the accuracy of a direct solver. Each step entails:
Forward multiply to evaluate the residual, r = b-A*x
Backsolve to find the correction vector, A*c = r
Update the solution with the correction, x += c
Refinement delegates to two Action's: one to apply A when forming the residual, and another to solve by A when forming the correction (this "inverse" Action is traditionally called M). The example below (tutorial/iterative/refine.cpp) demonstrates how to use refine(). The forward multiply by A is accomplished by make_SymmAction(), and the backsolution M comes from wrapping a make_SolveAction() around an existing instance of an RLDLTSolver.
Most of the sparse direct solvers in /multifrontal already encapsulate this sequence of steps into a solver.refine() method, see Iterative Refinement for details.
It's possible to mix precision types (float and double) when performing backwards refinement, typically by applying a low-precision solver (M) to a high-precision system (A*x=b). The advantage is that a low-precision solver requires roughly half the memory of a high-precision one, but there is some risk that a low precision factorization is more likely to break down due to ill-conditioning. See iterative/RaiseAction.h and iterative/LowerAction.h for helper classes that adjust the precision of an Action, or iterative/mixed_refine.h for canned algorithms.
Krylov linear solvers
The conditions under which iterative refinement converges are fairly restrictive. In particular, the original factorization must be fairly accurate to guarantee convergence/improvement. Krylov schemes are an alternative framework for solving A*x=b, wherein the k'th solution x_k is sought as a linear combination of {b, A*b, A^2*b, A^3*b, .. A^k*b}, called the Krylov space of (A,b). Under some mild assumptions about A and b, this collection of vectors will completely span R^n after n iterations, guaranteeing it captures the exact solution to A*x=b (in exact arithmetic, at least).
In practice n iterations is still far too long to wait, but a more detailed analysis of the convergence properties of Krylov methods (too advanced to pursue here) reveals that they can converge quickly if A has favorable spectral properties (clustered eigenvalues, well conditioned). This observation leads to preconditioned Krylov schemes, in which another Action M is introduced and the alternative problem M^-1*A*x = M^-1*b is iterated instead. The preconditioner M^-1 serves to improve the spectral properties of A, and should be in some sense an approximation of A's inverse (such that M^-1*A is better conditioned and/or better clustered). Finding a good M^-1 is problem dependent, but preconditioning strategies often draw from (i) incomplete factorizations, (ii) classical smoothers, or (iii) cheap approximations of the underlying "physics" of A.
There are many Krylov-type solvers, but the most important ones are probably:
iterative/bicgstab.h, for Biconjugate Gradient Stabilized, suitable for most A. Inexpensive, but somewhat erratic convergence.
iterative/pcg.h, for Preconditioned Conjugate Gradients. Superior performance, but only suitable for symmetric positive A.
iterative/minres.h, for Minimum Residual. Almost as good as pcg(), still requires symmetric A but it can be indefinite.
iterative/gmres.h, for Generalized Minimum Residual, suitable for arbitrary A. Optimally convergent, but costly in memory/time.
All of these routines take an Action that applies A and another Action that applies M^-1, and they overwrite your initial guess for x with the final solution. They also return profiling data (convergence history, etc); check the source documentation for details. The example code below (tutorial/iterative/pcg.cpp) demonstrates pcg() on a symmetric positive definite A (a 2D graph laplacian), using an incomplete Cholesky factorization as the preconditioner M^-1. For comparison, the problem is also solved with no preconditioner (by setting M^-1=I).
Incomplete Cholesky preconditioner. Presents a solve() function, but it's only approximate.
-------- no preconditioner -------
|A*x-b|/|b| = 7.24724e-09
iterations = 28
------- with preconditioner ------
|A*x-b|/|b| = 3.55127e-09
iterations = 10
For sake of completeness, tutorial/iterative/bicgstab_gmres.cpp (shown below) compares the performance of bicgstab() and gmres() on a small 50x50 unsymmetric system.
The performance of the two methods is difficult to compare directly: gmres() converges better (monotonically) but requires more internal workspace, while each bicgstab() iteration requires two applications of A and M. End user experimentation is encouraged. It is likely that more Krylov solvers will be added to MyraMath over time.
Regardless of the method, successful Krylov solution hinges on good preconditioning. The .schur() and .partialsolve() methods on MyraMath's /multifrontal Solver's are intended to be the building blocks of substructuring preconditioners. This falls under strategy (iii), exploiting the physical intuition that "long range interactions" are less important, and that solving a collection of local subproblems instead can often strongly cluster the spectrum of a large sparse A that arises from discretizing a partial differential equation.
Krylov eigensolvers
Spans of Krylov vectors {b, A*b, A^2*b, A^3*b, .. A^k*b} are also good spaces for approximating extremal eigenpairs (consider their similarity to the iterates generated by the classical power method). Eigensolvers are considerably harder to implement than linear solvers. However, unstructured nested dissection requires solving eigenproblems over sparse graph laplacians, so MyraMath does implement a few simple Krylov eigensolvers (as details of the internal reorder() routine), and these algorithms are callable from user code.
Two algorithms are provided:
iterative/lanczos1.h, the Lanczos algorithm, suitable for finding a large eigenpair of a symmetric A
iterative/lopcg1.h, the locally optimal conjugate gradient algorithm, suitable for finding a small eigenpair of a symmetric A
The example below (tutorial/iterative/lanczos_lopcg.cpp) uses both these algorithms to find one large eigenpair and one small eigenpair of a sparse real symmetric A:
Note these methods are only suitable for real symmetric A (because graph laplacians are real symmetric). For general A, more sophisticated algorithms (like the Arnoldi method) are needed, which are not implemented here. Fortunately there are robust open source packages in this space that can help, like ARPACK and SLEPc.
Appendix 1: Writing your own Action
The algorithms in /iterative can be used with any Action. A variety of common Action's (sparse matrix-vector multiply, diagonal scaling, etc) and a language to compose them (sum, cascade, etc) are available. However, advanced users may discover a need to inject their own code as an Action (most likely as a preconditioner). Adaptor routines for this use case can be found in iterative/UserAction.h.
User code should be encapsulated within a class that provides the following:
A typedef for the Number type that the user code operates upon (float for example, or std::complex<double>)
A size() method that returns the size of the Action as a std::pair<int,int>
A multiply() method that maps b:=A*x, for (const) CMatrixRange x and (mutable) MatrixRange b
Then, call make_UserAction() to adapt an instance of your class into an Action. Any user code wrapped like this can be further composed (added, cascaded, scaled, etc) just like any other native Action. The following program (tutorial/iterative/action1.cpp) shows a working example of this process. The "user code" applies a 1D graph laplacian over N points and is encapsulated within the MyAction class. It is adapted into an Action using make_UserAction(), then composed with a rank-1 correction (a regularization), and finally fed into pcg() for linear solution.
Appendix 2: Writing an Action that calls third party code
Another notable use case for make_UserAction() is adapting third party code (like an external preconditioning algorithm) into an Action. This third party code probably isn't something you can easily modify or cut/paste into a MyAction.multiply() method, and it may only be available in compiled form. You can still make this work: you just need to write a class (similar to MyAction from before) that delegates into the external library. The following program (tutorial/iterative/action2.cpp) shows a working example of this process.
This is essentially the Adaptor design pattern: MyraMath is the Client, the third party code is the Adaptee, and you write MyAction to be the Adaptor. The pre-canned Action's for calling BLAS routines (gemm(), etc) basically work like this. You're encouraged to inject your own preconditioning routines this way, or to build new ones from the other building blocks in MyraMath.