MyraMath: Tutorial for /multifrontal solvers (part 1)

This tutorial covers the basic steps for using the direct solvers in /multifrontal to solve A*X=B when A is sparse and structurally symmetric. Unlike the dense case, sparse matrix decompositions generally can't be performed in-place due to fill-in (the creation of additional nonzeroes in the L and U factors). Careful reordering of A can greatly reduce fill-in and is a key step in efficient sparse direct solution.
After reordering comes a symbolic factorization step, wherein the patterns of the L and U factors are tabulated along with predictions of memory and flop cost. If the need arises to solve multiple matrices with identical nonzero patterns and similar numerical properties, the reordering and symbolic factorization can be precomputed once and then reused.
Numeric factorization is the most time-consuming part of the direct solution process, and is internally multithreaded for improved performance. Once numeric factorization is complete, the decomposed A can be used to solve for arbitrary/multiple right-hand sides B with relatively little effort. This can be the chief advantage of using a sparse direct solver like those in /multifrontal.
Reordering unknowns prior to sparse factorization can have a dramatic impact on the total storage and flop requirements. Multifrontal solvers (like all the ones in MyraMath) tend to perform best with nested dissection (ND) orderings. ND is a divide and conquer strategy in which a top-level "separator" of unknowns is identified, whose removal will split the remainder into two unconnected subgroups, "left" and "right". When the unknowns are reordered into (left, right, separator), no structural fill-in can propagate between the left and right groups. The algorithm then recurses, dissecting the left group further, dissecting the right group further, and so forth.
For Patterns arising from structured problems (like 2D/3D Cartesian grids), ND is extremely easy to perform because the separator is always a line/plane of unknowns at the "middle" of the grid. For the more important case of general unstructured Patterns, ND is more expensive and considerably harder to implement. MyraMath provides orderings for both cases: the 2D/3D structured cases are implemented by bisect2() and bisect3() (in sparse/laplacian2.h and sparse/laplacian3.h), while the unstructured case is implemented by reorder() (in multifrontal/symbolic/reorder.h).
The excerpt below (tutorial/multifrontal/reorder.cpp) applies both algorithms to a 7x7 structured grid. (However, see Sidenote 1.)
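In case the excerpt is unavailable, the workflow it demonstrates looks roughly like the sketch below. This is a hypothetical reconstruction, not the shipped file: the exact constructor/helper names and signatures are assumptions, only the routine names bisect2() and reorder() and the header paths come from the text above.

```cpp
// Hypothetical sketch (signatures assumed, not verified against MyraMath).
#include "sparse/laplacian2.h"             // bisect2() per the text above
#include "multifrontal/symbolic/reorder.h" // reorder() per the text above

void example()
{
  // Structured case: separators are grid midlines, so only the grid
  // dimensions are needed (argument list assumed).
  Permutation p1 = bisect2(7, 7);

  // Unstructured case: works from a Pattern alone (construction of the
  // 7x7 grid Pattern 'A' is elided here).
  Permutation p2 = reorder(A);
}
```

Either Permutation is then handed to the downstream symbolic/numeric factorization stages.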
For the sake of completeness, MyraMath also contains reordering algorithms based on other (non-ND) principles. There are two variants of the minimum-degree (MD) algorithm: amd() (approximate minimum degree) and mmd() (multiple minimum degree). When reordering/solving sparse systems that arise from a 2D/3D discretization of a partial differential equation, these algorithms are typically beaten by nested dissection. But they are sometimes useful for other/non-geometric sparse systems (analysis of circuit networks or unstructured graph Laplacians, for instance). See multifrontal/symbolic/amd.h and multifrontal/symbolic/mmd.h for details.
The output of the bisect2()/bisect3() and reorder() routines is a Permutation, which should be submitted to downstream symbolic/numeric factorization stages. It is anticipated that end users might wish to inject their own ordering, perhaps by calling into third-party reordering packages like Metis, AMD or Scotch. In that case, you can construct a Permutation from their output, a (permuted) index vector. See Permutation::from_perm() and Permutation::from_iperm() for details.
In an effort to streamline the numeric factorization and backsolution phases, sparse direct solvers perform extensive symbolic analysis that (i) explicitly tabulates all the fill-in for the L and U factors, (ii) aggregates unknowns with similar connectivity into "supernodes" for improved BLAS3 utilization, and (iii) performs dependency analysis for the purpose of exposing parallelism. In MyraMath, the symbolic factorization is encapsulated in a data structure called the AssemblyTree. An AssemblyTree is constructed from a Pattern and a Permutation, typically the output of bisect2()/bisect3() or reorder(). The code excerpt below (tutorial/multifrontal/symbolic.cpp) shows the workflow to produce an AssemblyTree. Alternatively, you can access the AssemblyTree of an existing factorization (to reuse on a second A with the same Pattern and similar numerical properties).
The AssemblyTree is primarily for internal use. The only member functions of any interest to an end user are probably .n_words() and .n_work(), which return space and time cost estimates for A=L*L' or A=L*U. For instance, the output of this example program indicates that the cost of direct factorization of a sparse matrix arising from discretizing a PDE in three dimensions scales with the unknown count n as O(n^2): each time the problem size doubles, the flop count quadruples.
There are currently seven solver classes inside /multifrontal, each applicable to a different type of linear A*X=B system:
SparseRCholeskySolver, for real symmetric positive definite A, within multifrontal/rcholesky/solver.h
SparseZCholeskySolver, for complex hermitian positive definite A, within multifrontal/zcholesky/solver.h
SparseRLDLTSolver, for real symmetric indefinite A, within multifrontal/rldlt/solver.h
SparseZLDLHSolver, for complex hermitian indefinite A, within multifrontal/zldlh/solver.h
SparseZLDLTSolver, for complex symmetric A, within multifrontal/zldlt/solver.h
SparseLUSolver, for symmetric-pattern, nonsymmetric-valued A, within multifrontal/lu/solver.h
SparseNormalSolver, solves the normal equations for nonsymmetric A with full column rank, within multifrontal/normal/solver.h
Generally speaking, solvers higher on this list are simpler, more efficient or more robust than solvers lower on the list. Pick the simplest solver that is still suitable for your A. Note that MyraMath currently offers little functionality for systems with nonsymmetric nonzero patterns. For such systems, try using SuperLU or UMFPACK instead.
Constructing any solver requires the input SparseMatrix A, and you can also optionally specify a Permutation for reordering (from a third-party package, for example) or a precomputed AssemblyTree (when solving multiple A's with the same Pattern, for example). The code excerpt below illustrates these cases (tutorial/multifrontal/solver.cpp):
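In case the excerpt is unavailable, the three construction variants it covers look roughly as follows. This is a hypothetical sketch, not the shipped file: the overload set and argument order are assumptions, only the class name, header path, and the three inputs (SparseMatrix, Permutation, AssemblyTree) come from the surrounding text.

```cpp
// Hypothetical sketch (overloads assumed, not verified against MyraMath).
#include "multifrontal/rcholesky/solver.h"

void construct(const SparseMatrix& A, const Permutation& p, const AssemblyTree& tree)
{
  SparseRCholeskySolver s1(A);       // default path: reorder + symbolic + numeric
  SparseRCholeskySolver s2(A, p);    // inject your own Permutation (e.g. from Metis)
  SparseRCholeskySolver s3(A, tree); // reuse a precomputed AssemblyTree
}
```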
A variety of control and tuning parameters can be injected into a solver constructor call through its trailing (and defaulted) multifrontal::Options pack. In the code excerpt above, example3d() uses this mechanism to pass in a callback function for progress visualization. For further details, see Multifrontal Options.
Each of the solver classes listed above offers a general-purpose .solve() method for solving op(A)*X=B or X*op(A)=B. Regardless of the op or side parameters, .solve() always has in-place semantics: you pass in B and the routine overwrites it with X. Below, tutorial/multifrontal/solve.cpp shows how to use .solve():
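In case the excerpt is unavailable, the in-place calling convention described above looks roughly like this hypothetical sketch (argument names and the exact .solve() signature are assumptions; only the method name and its in-place semantics come from the text):

```cpp
// Hypothetical sketch (signature assumed, not verified against MyraMath).
#include "multifrontal/rcholesky/solver.h"

void backsolve(const SparseRCholeskySolver& solver, Matrix& B)
{
  // On entry B holds the right-hand sides; on exit the same storage
  // holds X, the solution of A*X = B. No separate output buffer exists.
  solver.solve(B);
}
```

Because the factorization is already done, this call is cheap and can be repeated for as many right-hand sides as needed.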
Generally speaking, backsolution exhibits poorer thread-parallel scalability than factorization, so .solve() runs with only a single thread by default. This can be overridden through Options, but appreciable speedup should only be expected when the number of right-hand sides is comparable to the internal blocksize of the solver itself (a few dozen); at that point, the algorithm crosses over from BLAS2-dominated (memory bound) to BLAS3-dominated (compute bound).
External data can be injected into these A*X=B solvers by using a SparseMatrixRange to wrap around the external A stored in CSC format, and a MatrixRange to wrap around the external B stored in column-major format. The desired solution X will overwrite B. The example code below (tutorial/multifrontal/external1.cpp) solves the same system as before, but uses raw C-arrays as mock "external" data:
Continue to Tutorial for /multifrontal solvers (part 2), or go back to API Tutorials.