doc/manual/snes.md

*7f296bb3SBarry Smith(ch_snes)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith# SNES: Nonlinear Solvers
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe solution of large-scale nonlinear problems pervades many facets of
*7f296bb3SBarry Smithcomputational science and demands robust and flexible solution
*7f296bb3SBarry Smithstrategies. The `SNES` library of PETSc provides a powerful suite of
*7f296bb3SBarry Smithdata-structure-neutral numerical routines for such problems. Built on
*7f296bb3SBarry Smithtop of the linear solvers and data structures discussed in preceding
*7f296bb3SBarry Smithchapters, `SNES` enables the user to easily customize the nonlinear
*7f296bb3SBarry Smithsolvers according to the application at hand. Also, the `SNES`
*7f296bb3SBarry Smithinterface is *identical* for the uniprocess and parallel cases; the only
*7f296bb3SBarry Smithdifference in the parallel version is that each process typically forms
*7f296bb3SBarry Smithonly its local contribution to various matrices and vectors.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe `SNES` class includes methods for solving systems of nonlinear
*7f296bb3SBarry Smithequations of the form
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{F}(\mathbf{x}) = 0,
*7f296bb3SBarry Smith$$ (fx0)
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $\mathbf{F}: \, \Re^n \to \Re^n$. Newton-like methods provide the
*7f296bb3SBarry Smithcore of the package, including both line search and trust region
*7f296bb3SBarry Smithtechniques. A suite of nonlinear Krylov methods and methods based upon
*7f296bb3SBarry Smithproblem decomposition are also included. The solvers are discussed
*7f296bb3SBarry Smithfurther in {any}`sec_nlsolvers`. Following the PETSc design
*7f296bb3SBarry Smithphilosophy, the interfaces to the various solvers are all virtually
*7f296bb3SBarry Smithidentical. In addition, the `SNES` software is completely flexible, so
*7f296bb3SBarry Smiththat the user can at runtime change any facet of the solution process.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithPETSc’s default method for solving the nonlinear equation is Newton’s
*7f296bb3SBarry Smithmethod with line search, `SNESNEWTONLS`. The general form of the $n$-dimensional Newton’s method
*7f296bb3SBarry Smithfor solving {math:numref}`fx0` is
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{x}_{k+1} = \mathbf{x}_k - \mathbf{J}(\mathbf{x}_k)^{-1} \mathbf{F}(\mathbf{x}_k), \;\; k=0,1, \ldots,
*7f296bb3SBarry Smith$$ (newton)
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $\mathbf{x}_0$ is an initial approximation to the solution and
*7f296bb3SBarry Smith$\mathbf{J}(\mathbf{x}_k) = \mathbf{F}'(\mathbf{x}_k)$, the Jacobian, is nonsingular at each
*7f296bb3SBarry Smithiteration. In practice, the Newton iteration {math:numref}`newton` is
*7f296bb3SBarry Smithimplemented by the following two steps:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\begin{aligned}
*7f296bb3SBarry Smith1. & \text{(Approximately) solve} & \mathbf{J}(\mathbf{x}_k) \Delta \mathbf{x}_k &= -\mathbf{F}(\mathbf{x}_k). \\
*7f296bb3SBarry Smith2. & \text{Update} & \mathbf{x}_{k+1} &\gets \mathbf{x}_k + \Delta \mathbf{x}_k.
*7f296bb3SBarry Smith\end{aligned}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOther defect-correction algorithms can be implemented by using different
*7f296bb3SBarry Smithchoices for $J(\mathbf{x}_k)$.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snesusage)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## Basic SNES Usage
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithIn the simplest usage of the nonlinear solvers, the user must merely
*7f296bb3SBarry Smithprovide a C, C++, Fortran, or Python routine to evaluate the nonlinear function
*7f296bb3SBarry Smith{math:numref}`fx0`. The corresponding Jacobian matrix
*7f296bb3SBarry Smithcan be approximated with finite differences. For codes that are
*7f296bb3SBarry Smithtypically more efficient and accurate, the user can provide a routine to
*7f296bb3SBarry Smithcompute the Jacobian; details regarding these application-provided
*7f296bb3SBarry Smithroutines are discussed below. To provide an overview of the use of the
*7f296bb3SBarry Smithnonlinear solvers, browse the concrete example in {ref}`ex1.c <snes-ex1>` or skip ahead to the discussion.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(snes_ex1)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith:::{admonition} Listing: `src/snes/tutorials/ex1.c`
*7f296bb3SBarry Smith```{literalinclude} /../src/snes/tutorials/ex1.c
*7f296bb3SBarry Smith:end-before: /*TEST
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith:::
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithTo create a `SNES` solver, one must first call `SNESCreate()` as
*7f296bb3SBarry Smithfollows:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESCreate(MPI_Comm comm,SNES *snes);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe user must then set routines for evaluating the residual function {math:numref}`fx0`
*7f296bb3SBarry Smithand, *possibly*, its associated Jacobian matrix, as
*7f296bb3SBarry Smithdiscussed in the following sections.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithTo choose a nonlinear solution method, the user can either call
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSetType(SNES snes,SNESType method);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithor use the option `-snes_type <method>`, where details regarding the
*7f296bb3SBarry Smithavailable methods are presented in {any}`sec_nlsolvers`. The
*7f296bb3SBarry Smithapplication code can take complete control of the linear and nonlinear
*7f296bb3SBarry Smithtechniques used in the Newton-like method by calling
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSetFromOptions(snes);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis routine provides an interface to the PETSc options database, so
*7f296bb3SBarry Smiththat at runtime the user can select a particular nonlinear solver, set
*7f296bb3SBarry Smithvarious parameters and customized routines (e.g., specialized line
*7f296bb3SBarry Smithsearch variants), prescribe the convergence tolerance, and set
*7f296bb3SBarry Smithmonitoring routines. With this routine the user can also control all
*7f296bb3SBarry Smithlinear solver options in the `KSP`, and `PC` modules, as discussed
*7f296bb3SBarry Smithin {any}`ch_ksp`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAfter having set these routines and options, the user solves the problem
*7f296bb3SBarry Smithby calling
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSolve(SNES snes,Vec b,Vec x);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere `x` should be initialized to the initial guess before calling and contains the solution on return.
*7f296bb3SBarry SmithIn particular, to employ an initial guess of
*7f296bb3SBarry Smithzero, the user should explicitly set this vector to zero by calling
*7f296bb3SBarry Smith`VecZeroEntries(x)`. Finally, after solving the nonlinear system (or several
*7f296bb3SBarry Smithsystems), the user should destroy the `SNES` context with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESDestroy(SNES *snes);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snesfunction)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Nonlinear Function Evaluation
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithWhen solving a system of nonlinear equations, the user must provide a
*7f296bb3SBarry Smitha residual function {math:numref}`fx0`, which is set using
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSetFunction(SNES snes,Vec f,PetscErrorCode (*FormFunction)(SNES snes,Vec x,Vec f,void *ctx),void *ctx);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe argument `f` is an optional vector for storing the solution; pass `NULL` to have the `SNES` allocate it for you.
*7f296bb3SBarry SmithThe argument `ctx` is an optional user-defined context, which can
*7f296bb3SBarry Smithstore any private, application-specific data required by the function
*7f296bb3SBarry Smithevaluation routine; `NULL` should be used if such information is not
*7f296bb3SBarry Smithneeded. In C and C++, a user-defined context is merely a structure in
*7f296bb3SBarry Smithwhich various objects can be stashed; in Fortran a user context can be
*7f296bb3SBarry Smithan integer array that contains both parameters and pointers to PETSc
*7f296bb3SBarry Smithobjects.
*7f296bb3SBarry Smith<a href="PETSC_DOC_OUT_ROOT_PLACEHOLDER/src/snes/tutorials/ex5.c.html">SNES Tutorial ex5</a>
*7f296bb3SBarry Smithand
*7f296bb3SBarry Smith<a href="PETSC_DOC_OUT_ROOT_PLACEHOLDER/src/snes/tutorials/ex5f90.F90.html">SNES Tutorial ex5f90</a>
*7f296bb3SBarry Smithgive examples of user-defined application contexts in C and Fortran,
*7f296bb3SBarry Smithrespectively.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snesjacobian)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Jacobian Evaluation
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe user may also specify a routine to form some approximation of the
*7f296bb3SBarry SmithJacobian matrix, `A`, at the current iterate, `x`, as is typically
*7f296bb3SBarry Smithdone with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSetJacobian(SNES snes,Mat Amat,Mat Pmat,PetscErrorCode (*FormJacobian)(SNES snes,Vec x,Mat A,Mat B,void *ctx),void *ctx);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe arguments of the routine `FormJacobian()` are the current iterate,
*7f296bb3SBarry Smith`x`; the (approximate) Jacobian matrix, `Amat`; the matrix from
*7f296bb3SBarry Smithwhich the preconditioner is constructed, `Pmat` (which is usually the
*7f296bb3SBarry Smithsame as `Amat`); and an optional user-defined Jacobian context,
*7f296bb3SBarry Smith`ctx`, for application-specific data. The `FormJacobian()`
*7f296bb3SBarry Smithcallback is only invoked if the solver requires it, always
*7f296bb3SBarry Smith*after* `FormFunction()` has been called at the current iterate.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithNote that the `SNES` solvers
*7f296bb3SBarry Smithare all data-structure neutral, so the full range of PETSc matrix
*7f296bb3SBarry Smithformats (including “matrix-free” methods) can be used.
*7f296bb3SBarry Smith{any}`ch_matrices` discusses information regarding
*7f296bb3SBarry Smithavailable matrix formats and options, while {any}`sec_nlmatrixfree` focuses on matrix-free methods in
*7f296bb3SBarry Smith`SNES`. We briefly touch on a few details of matrix usage that are
*7f296bb3SBarry Smithparticularly important for efficient use of the nonlinear solvers.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithA common usage paradigm is to assemble the problem Jacobian in the
*7f296bb3SBarry Smithpreconditioner storage `B`, rather than `A`. In the case where they
*7f296bb3SBarry Smithare identical, as in many simulations, this makes no difference.
*7f296bb3SBarry SmithHowever, it allows us to check the analytic Jacobian we construct in
*7f296bb3SBarry Smith`FormJacobian()` by passing the `-snes_mf_operator` flag. This
*7f296bb3SBarry Smithcauses PETSc to approximate the Jacobian using finite differencing of
*7f296bb3SBarry Smiththe function evaluation (discussed in {any}`sec_fdmatrix`),
*7f296bb3SBarry Smithand the analytic Jacobian becomes merely the preconditioner. Even if the
*7f296bb3SBarry Smithanalytic Jacobian is incorrect, it is likely that the finite difference
*7f296bb3SBarry Smithapproximation will converge, and thus this is an excellent method to
*7f296bb3SBarry Smithverify the analytic Jacobian. Moreover, if the analytic Jacobian is
*7f296bb3SBarry Smithincomplete (some terms are missing or approximate),
*7f296bb3SBarry Smith`-snes_mf_operator` may be used to obtain the exact solution, where
*7f296bb3SBarry Smiththe Jacobian approximation has been transferred to the preconditioner.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOne such approximate Jacobian comes from “Picard linearization”, use `SNESSetPicard()`, which
*7f296bb3SBarry Smithwrites the nonlinear system as
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{F}(\mathbf{x}) := \mathbf{A}(\mathbf{x}) \mathbf{x} - \mathbf{b} = 0
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $\mathbf{A}(\mathbf{x})$ usually contains the lower-derivative parts of the
*7f296bb3SBarry Smithequation. For example, the nonlinear diffusion problem
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith- \nabla\cdot(\kappa(u) \nabla u) = 0
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwould be linearized as
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry SmithA(u) v \simeq -\nabla\cdot(\kappa(u) \nabla v).
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithUsually this linearization is simpler to implement than Newton and the
*7f296bb3SBarry Smithlinear problems are somewhat easier to solve. In addition to using
*7f296bb3SBarry Smith`-snes_mf_operator` with this approximation to the Jacobian, the
*7f296bb3SBarry SmithPicard iterative procedure can be performed by defining $\mathbf{J}(\mathbf{x})$
*7f296bb3SBarry Smithto be $\mathbf{A}(\mathbf{x})$. Sometimes this iteration exhibits better global
*7f296bb3SBarry Smithconvergence than Newton linearization.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithDuring successive calls to `FormJacobian()`, the user can either
*7f296bb3SBarry Smithinsert new matrix contexts or reuse old ones, depending on the
*7f296bb3SBarry Smithapplication requirements. For many sparse matrix formats, reusing the
*7f296bb3SBarry Smithold space (and merely changing the matrix elements) is more efficient;
*7f296bb3SBarry Smithhowever, if the matrix nonzero structure completely changes, creating an
*7f296bb3SBarry Smithentirely new matrix context may be preferable. Upon subsequent calls to
*7f296bb3SBarry Smiththe `FormJacobian()` routine, the user may wish to reinitialize the
*7f296bb3SBarry Smithmatrix entries to zero by calling `MatZeroEntries()`. See
*7f296bb3SBarry Smith{any}`sec_othermat` for details on the reuse of the matrix
*7f296bb3SBarry Smithcontext.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe directory `$PETSC_DIR/src/snes/tutorials` provides a variety of
*7f296bb3SBarry Smithexamples.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithSometimes a nonlinear solver may produce a step that is not within the domain
*7f296bb3SBarry Smithof a given function, for example one with a negative pressure. When this occurs
*7f296bb3SBarry Smithone can call `SNESSetFunctionDomainError()` or `SNESSetJacobianDomainError()`
*7f296bb3SBarry Smithto indicate to `SNES` the step is not valid. One must also use `SNESGetConvergedReason()`
*7f296bb3SBarry Smithand check the reason to confirm if the solver succeeded. See {any}`sec_vi` for how to
*7f296bb3SBarry Smithprovide `SNES` with bounds on the variables to solve (differential) variational inequalities
*7f296bb3SBarry Smithand how to control properties of the line step computed.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_nlsolvers)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## The Nonlinear Solvers
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAs summarized in Table {any}`tab-snesdefaults`, `SNES` includes
*7f296bb3SBarry Smithseveral Newton-like nonlinear solvers based on line search techniques
*7f296bb3SBarry Smithand trust region methods. Also provided are several nonlinear Krylov
*7f296bb3SBarry Smithmethods, as well as nonlinear methods involving decompositions of the
*7f296bb3SBarry Smithproblem.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithEach solver may have associated with it a set of options, which can be
*7f296bb3SBarry Smithset with routines and options database commands provided for this
*7f296bb3SBarry Smithpurpose. A complete list can be found by consulting the manual pages or
*7f296bb3SBarry Smithby running a program with the `-help` option; we discuss just a few in
*7f296bb3SBarry Smiththe sections below.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```{eval-rst}
*7f296bb3SBarry Smith.. list-table:: PETSc Nonlinear Solvers
*7f296bb3SBarry Smith   :name: tab-snesdefaults
*7f296bb3SBarry Smith   :header-rows: 1
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   * - Method
*7f296bb3SBarry Smith     - SNESType
*7f296bb3SBarry Smith     - Options Name
*7f296bb3SBarry Smith     - Default Line Search
*7f296bb3SBarry Smith   * - Line Search Newton
*7f296bb3SBarry Smith     - ``SNESNEWTONLS``
*7f296bb3SBarry Smith     - ``newtonls``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
*7f296bb3SBarry Smith   * - Trust region Newton
*7f296bb3SBarry Smith     - ``SNESNEWTONTR``
*7f296bb3SBarry Smith     - ``newtontr``
*7f296bb3SBarry Smith     - —
*7f296bb3SBarry Smith   * - Newton with Arc Length Continuation
*7f296bb3SBarry Smith     - ``SNESNEWTONAL``
*7f296bb3SBarry Smith     - ``newtonal``
*7f296bb3SBarry Smith     - —
*7f296bb3SBarry Smith   * - Nonlinear Richardson
*7f296bb3SBarry Smith     - ``SNESNRICHARDSON``
*7f296bb3SBarry Smith     - ``nrichardson``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHL2``
*7f296bb3SBarry Smith   * - Nonlinear CG
*7f296bb3SBarry Smith     - ``SNESNCG``
*7f296bb3SBarry Smith     - ``ncg``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHCP``
*7f296bb3SBarry Smith   * - Nonlinear GMRES
*7f296bb3SBarry Smith     - ``SNESNGMRES``
*7f296bb3SBarry Smith     - ``ngmres``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHL2``
*7f296bb3SBarry Smith   * - Quasi-Newton
*7f296bb3SBarry Smith     - ``SNESQN``
*7f296bb3SBarry Smith     - ``qn``
*7f296bb3SBarry Smith     - see :any:`tab-qndefaults`
*7f296bb3SBarry Smith   * - Full Approximation Scheme
*7f296bb3SBarry Smith     - ``SNESFAS``
*7f296bb3SBarry Smith     - ``fas``
*7f296bb3SBarry Smith     - —
*7f296bb3SBarry Smith   * - Nonlinear ASM
*7f296bb3SBarry Smith     - ``SNESNASM``
*7f296bb3SBarry Smith     - ``nasm``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * - ASPIN
*7f296bb3SBarry Smith     - ``SNESASPIN``
*7f296bb3SBarry Smith     - ``aspin``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
*7f296bb3SBarry Smith   * - Nonlinear Gauss-Seidel
*7f296bb3SBarry Smith     - ``SNESNGS``
*7f296bb3SBarry Smith     - ``ngs``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * - Anderson Mixing
*7f296bb3SBarry Smith     - ``SNESANDERSON``
*7f296bb3SBarry Smith     - ``anderson``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * -  Newton with constraints (1)
*7f296bb3SBarry Smith     - ``SNESVINEWTONRSLS``
*7f296bb3SBarry Smith     - ``vinewtonrsls``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
*7f296bb3SBarry Smith   * -  Newton with constraints (2)
*7f296bb3SBarry Smith     - ``SNESVINEWTONSSLS``
*7f296bb3SBarry Smith     - ``vinewtonssls``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
*7f296bb3SBarry Smith   * - Multi-stage Smoothers
*7f296bb3SBarry Smith     - ``SNESMS``
*7f296bb3SBarry Smith     - ``ms``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * - Composite
*7f296bb3SBarry Smith     - ``SNESCOMPOSITE``
*7f296bb3SBarry Smith     - ``composite``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * - Linear solve only
*7f296bb3SBarry Smith     - ``SNESKSPONLY``
*7f296bb3SBarry Smith     - ``ksponly``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * - Python Shell
*7f296bb3SBarry Smith     - ``SNESPYTHON``
*7f296bb3SBarry Smith     - ``python``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith   * - Shell (user-defined)
*7f296bb3SBarry Smith     - ``SNESSHELL``
*7f296bb3SBarry Smith     - ``shell``
*7f296bb3SBarry Smith     - –
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Line Search Newton
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe method `SNESNEWTONLS` (`-snes_type newtonls`) provides a
*7f296bb3SBarry Smithline search Newton method for solving systems of nonlinear equations. By
*7f296bb3SBarry Smithdefault, this technique employs cubic backtracking
*7f296bb3SBarry Smith{cite}`dennis:83`. Alternative line search techniques are
*7f296bb3SBarry Smithlisted in Table {any}`tab-linesearches`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```{eval-rst}
*7f296bb3SBarry Smith.. table:: PETSc Line Search Methods
*7f296bb3SBarry Smith   :name: tab-linesearches
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   ==================== =========================== ================
*7f296bb3SBarry Smith   **Line Search**      **SNESLineSearchType**      **Options Name**
*7f296bb3SBarry Smith   ==================== =========================== ================
*7f296bb3SBarry Smith   Backtracking         ``SNESLINESEARCHBT``        ``bt``
*7f296bb3SBarry Smith   (damped) step        ``SNESLINESEARCHBASIC``     ``basic``
*7f296bb3SBarry Smith   identical to above   ``SNESLINESEARCHNONE``      ``none``
*7f296bb3SBarry Smith   L2-norm Minimization ``SNESLINESEARCHL2``        ``l2``
*7f296bb3SBarry Smith   Critical point       ``SNESLINESEARCHCP``        ``cp``
*7f296bb3SBarry Smith   Bisection            ``SNESLINESEARCHBISECTION`` ``bisection``
*7f296bb3SBarry Smith   Shell                ``SNESLINESEARCHSHELL``     ``shell``
*7f296bb3SBarry Smith   ==================== =========================== ================
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithEvery `SNES` has a line search context of type `SNESLineSearch` that
*7f296bb3SBarry Smithmay be retrieved using
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESGetLineSearch(SNES snes,SNESLineSearch *ls);.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThere are several default options for the line searches. The order of
*7f296bb3SBarry Smithpolynomial approximation may be set with `-snes_linesearch_order` or
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESLineSearchSetOrder(SNESLineSearch ls, PetscInt order);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithfor instance, 2 for quadratic or 3 for cubic. Sometimes, it may not be
*7f296bb3SBarry Smithnecessary to monitor the progress of the nonlinear iteration. In this
*7f296bb3SBarry Smithcase, `-snes_linesearch_norms` or
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESLineSearchSetComputeNorms(SNESLineSearch ls,PetscBool norms);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithmay be used to turn off function, step, and solution norm computation at
*7f296bb3SBarry Smiththe end of the linesearch.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe default line search for the line search Newton method,
*7f296bb3SBarry Smith`SNESLINESEARCHBT` involves several parameters, which are set to
*7f296bb3SBarry Smithdefaults that are reasonable for many applications. The user can
*7f296bb3SBarry Smithoverride the defaults by using the following options:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith- `-snes_linesearch_alpha <alpha>`
*7f296bb3SBarry Smith- `-snes_linesearch_maxstep <max>`
*7f296bb3SBarry Smith- `-snes_linesearch_minlambda <tol>`
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithBesides the backtracking linesearch, there are `SNESLINESEARCHL2`,
*7f296bb3SBarry Smithwhich uses a polynomial secant minimization of $||F(x)||_2$, and
*7f296bb3SBarry Smith`SNESLINESEARCHCP`, which minimizes $F(x) \cdot Y$ where
*7f296bb3SBarry Smith$Y$ is the search direction. These are both potentially iterative
*7f296bb3SBarry Smithline searches, which may be used to find a better-fitted steplength in
*7f296bb3SBarry Smiththe case where a single secant search is not sufficient. The number of
*7f296bb3SBarry Smithiterations may be set with `-snes_linesearch_max_it`. In addition, the
*7f296bb3SBarry Smithconvergence criteria of the iterative line searches may be set using
*7f296bb3SBarry Smithfunction tolerances `-snes_linesearch_rtol` and
*7f296bb3SBarry Smith`-snes_linesearch_atol`, and steplength tolerance
*7f296bb3SBarry Smith`snes_linesearch_ltol`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithFor highly non-linear problems, the bisection line search `SNESLINESEARCHBISECTION`
*7f296bb3SBarry Smithmay prove useful due to its robustness. Similar to the critical point line search
*7f296bb3SBarry Smith`SNESLINESEARCHCP`, it seeks to find the root of $F(x) \cdot Y$.
*7f296bb3SBarry SmithWhile the latter does so through a secant method, the bisection line search
*7f296bb3SBarry Smithdoes so by iteratively bisecting the step length interval.
*7f296bb3SBarry SmithIt works as follows (with $f(\lambda)=F(x-\lambda Y) \cdot Y / ||Y||$ for brevity):
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith1. initialize: $j=1$, $\lambda_0 = \lambda_{\text{left}} = 0.0$, $\lambda_j = \lambda_{\text{right}} = \alpha$, compute $f(\lambda_0)$ and $f(\lambda_j)$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith2. check whether there is a change of sign in the interval: $f(\lambda_{\text{left}}) f(\lambda_j) \leq 0$; if not accept the full step length $\lambda_1$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith3. if there is a change of sign, enter iterative bisection procedure
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   1. check convergence/ exit criteria:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith      - absolute tolerance $f(\lambda_j) < \mathtt{atol}$
*7f296bb3SBarry Smith      - relative tolerance $f(\lambda_j) < \mathtt{rtol} \cdot f(\lambda_0)$
*7f296bb3SBarry Smith      - change of step length $\lambda_j - \lambda_{j-1} < \mathtt{ltol}$
*7f296bb3SBarry Smith      - number of iterations $j < \mathtt{max\_it}$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   2. if $j > 1$, determine direction of bisection
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   $$
*7f296bb3SBarry Smith   \begin{aligned}\lambda_{\text{left}} &= \begin{cases}\lambda_{\text{left}} &f(\lambda_{\text{left}}) f(\lambda_j) \leq 0\\\lambda_{j} &\text{else}\\ \end{cases}\\ \lambda_{\text{right}} &= \begin{cases} \lambda_j &f(\lambda_{\text{left}}) f(\lambda_j) \leq 0\\\lambda_{\text{right}} &\text{else}\\ \end{cases}\\\end{aligned}
*7f296bb3SBarry Smith   $$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   3. bisect the interval: $\lambda_{j+1} = (\lambda_{\text{left}} + \lambda_{\text{right}})/2$, compute $f(\lambda_{j+1})$
*7f296bb3SBarry Smith   4. update variables for the next iteration: $\lambda_j \gets \lambda_{j+1}$, $f(\lambda_j) \gets f(\lambda_{j+1})$, $j \gets j+1$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithCustom line search types may either be defined using
*7f296bb3SBarry Smith`SNESLineSearchShell`, or by creating a custom user line search type
*7f296bb3SBarry Smithin the model of the preexisting ones and register it using
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESLineSearchRegister(const char sname[],PetscErrorCode (*function)(SNESLineSearch));.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Trust Region Methods
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe trust region method in `SNES` for solving systems of nonlinear
*7f296bb3SBarry Smithequations, `SNESNEWTONTR` (`-snes_type newtontr`), is similar to the one developed in the
*7f296bb3SBarry SmithMINPACK project {cite}`more84`. Several parameters can be
*7f296bb3SBarry Smithset to control the variation of the trust region size during the
*7f296bb3SBarry Smithsolution process. In particular, the user can control the initial trust
*7f296bb3SBarry Smithregion radius, computed by
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\Delta = \Delta_0 \| F_0 \|_2,
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithby setting $\Delta_0$ via the option `-snes_tr_delta0 <delta0>`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Newton with Arc Length Continuation
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe Newton method with arc length continuation reformulates the linearized system
*7f296bb3SBarry Smith$K\delta \mathbf x = -\mathbf F(\mathbf x)$ by introducing the load parameter
*7f296bb3SBarry Smith$\lambda$ and splitting the residual into two components, commonly
*7f296bb3SBarry Smithcorresponding to internal and external forces:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf F(x, \lambda) = \mathbf F^{\mathrm{int}}(\mathbf x) - \mathbf F^{\mathrm{ext}}(\mathbf x, \lambda)
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOften, $\mathbf F^{\mathrm{ext}}(\mathbf x, \lambda)$ is linear in $\lambda$,
*7f296bb3SBarry Smithwhich can be thought of as applying the external force in proportional load
*7f296bb3SBarry Smithincrements. By default, this is how the right-hand side vector is handled in the
*7f296bb3SBarry Smithimplemented method. Generally, however, $\mathbf F^{\mathrm{ext}}(\mathbf x, \lambda)$
*7f296bb3SBarry Smithmay depend non-linearly on $\lambda$ or $\mathbf x$, or both.
*7f296bb3SBarry SmithTo accommodate this possibility, we provide the `SNESNewtonALGetLoadParameter()`
*7f296bb3SBarry Smithfunction, which allows for the current value of $\lambda$ to be queried in the
*7f296bb3SBarry Smithfunctions provided to `SNESSetFunction()` and `SNESSetJacobian()`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAdditionally, we split the solution update into two components:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\delta \mathbf x = \delta s\delta\mathbf x^F + \delta\lambda\delta\mathbf x^Q,
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $\delta s = 1$ unless partial corrections are used (discussed more
*7f296bb3SBarry Smithbelow). Each of $\delta \mathbf x^F$ and $\delta \mathbf x^Q$ are found via
*7f296bb3SBarry Smithsolving a linear system with the Jacobian $K$:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith- $\delta \mathbf x^F$ is the full Newton step for a given value of $\lambda$: $K \delta \mathbf x^F = -\mathbf F(\mathbf x, \lambda)$
*7f296bb3SBarry Smith- $\delta \mathbf x^Q$ is the variation in $\mathbf x$ with respect to $\lambda$, computed by $K \delta\mathbf x^Q = \mathbf Q(\mathbf x, \lambda)$, where $\mathbf Q(\mathbf x, \lambda) = -\partial \mathbf F (\mathbf x, \lambda) / \partial \lambda$ is the tangent load vector.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOften, the tangent load vector $\mathbf Q$ is constant within a load increment,
*7f296bb3SBarry Smithwhich corresponds to the case of proportional loading discussed above. By default,
*7f296bb3SBarry Smith$\mathbf Q$ is the full right-hand-side vector, if one was provided.
*7f296bb3SBarry SmithThe user can also provide a function which computes $\mathbf Q$ to
*7f296bb3SBarry Smith`SNESNewtonALSetFunction()`. This function should have the same signature as for
*7f296bb3SBarry Smith`SNESSetFunction`, and the user should use `SNESNewtonALGetLoadParameter()` to get
*7f296bb3SBarry Smith$\lambda$ if it is needed.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**The Constraint Surface.** Considering the $n+1$ dimensional space of
*7f296bb3SBarry Smith$\mathbf x$ and $\lambda$, we define the linearized equilibrium line to be
*7f296bb3SBarry Smiththe set of points for which the linearized equilibrium equations are satisfied.
*7f296bb3SBarry SmithGiven the previous iterative solution
*7f296bb3SBarry Smith$\mathbf t^{(j-1)} = [\mathbf x^{(j-1)}, \lambda^{(j-1)}]$,
*7f296bb3SBarry Smiththis line is defined by the point $\mathbf t^{(j-1)} + [\delta\mathbf x^F, 0]$ and
*7f296bb3SBarry Smiththe vector $\mathbf t^Q [\delta\mathbf x^Q, 1]$.
*7f296bb3SBarry SmithThe arc length method seeks the intersection of this linearized equilibrium line
*7f296bb3SBarry Smithwith a quadratic constraint surface, defined by
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith% math::L^2 = \|\Delta x\|^2 + \psi^2 (\Delta\lambda)^2,
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $L$ is a user-provided step size corresponding to the radius of the
*7f296bb3SBarry Smithconstraint surface, $\Delta\mathbf x$ and $\Delta\lambda$ are the
*7f296bb3SBarry Smithaccumulated updates over the current load step, and $\psi^2$ is a
*7f296bb3SBarry Smithuser-provided consistency parameter determining the shape of the constraint surface.
*7f296bb3SBarry SmithGenerally, $\psi^2 > 0$ leads to a hyper-sphere constraint surface, while
*7f296bb3SBarry Smith$\psi^2 = 0$ leads to a hyper-cylinder constraint surface.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithSince the solution will always fall on the constraint surface, the method will often
*7f296bb3SBarry Smithrequire multiple incremental steps to fully solve the non-linear problem.
*7f296bb3SBarry SmithThis is necessary to accurately trace the equilibrium path.
*7f296bb3SBarry SmithImportantly, this is fundamentally different from time stepping.
*7f296bb3SBarry SmithWhile a similar process could be implemented as a `TS`, this method is
*7f296bb3SBarry Smithparticularly designed to be used as a SNES, either standalone or within a `TS`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithTo this end, by default, the load parameter is used such that the full external
*7f296bb3SBarry Smithforces are applied at $\lambda = 1$, although we allow for the user to specify
*7f296bb3SBarry Smitha different value via `-snes_newtonal_lambda_max`.
*7f296bb3SBarry SmithTo ensure that the solution corresponds exactly to the external force prescribed by
*7f296bb3SBarry Smiththe user, i.e. that the load parameter is exactly $\lambda_{max}$ at the end
*7f296bb3SBarry Smithof the SNES solve, we clamp the value before computing the solution update.
*7f296bb3SBarry SmithAs such, the final increment will likely be a hybrid of arc length continuation and
*7f296bb3SBarry Smithnormal Newton iterations.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**Choosing the Continuation Step.** For the first iteration from an equilibrium
*7f296bb3SBarry Smithpoint, there is a single correct way to choose $\delta\lambda$, which follows
*7f296bb3SBarry Smithfrom the constraint equations. Specifically the constraint equations yield the
*7f296bb3SBarry Smithquadratic equation $a\delta\lambda^2 + b\delta\lambda + c = 0$, where
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\begin{aligned}
*7f296bb3SBarry Smitha &= \|\delta\mathbf x^Q\|^2 + \psi^2,\\
*7f296bb3SBarry Smithb &= 2\delta\mathbf x^Q\cdot (\Delta\mathbf x + \delta s\delta\mathbf x^F) + 2\psi^2 \Delta\lambda,\\
*7f296bb3SBarry Smithc &= \|\Delta\mathbf x + \delta s\delta\mathbf x^F\|^2 + \psi^2 \Delta\lambda^2 - L^2.
*7f296bb3SBarry Smith\end{aligned}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithSince in the first iteration, $\Delta\mathbf x = \delta\mathbf x^F = \mathbf 0$ and
*7f296bb3SBarry Smith$\Delta\lambda = 0$, $b = 0$ and the equation simplifies to a pair of
*7f296bb3SBarry Smithreal roots:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\delta\lambda = \pm\frac{L}{\sqrt{\|\delta\mathbf x^Q\|^2 + \psi^2}},
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere the sign is positive for the first increment and is determined by the previous
*7f296bb3SBarry Smithincrement otherwise as
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\text{sign}(\delta\lambda) = \text{sign}\big(\delta\mathbf x^Q \cdot (\Delta\mathbf x)_{i-1} + \psi^2(\Delta\lambda)_{i-1}\big),
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $(\Delta\mathbf x)_{i-1}$ and $(\Delta\lambda)_{i-1}$ are the
*7f296bb3SBarry Smithaccumulated updates over the previous load step.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithIn subsequent iterations, there are different approaches to selecting
*7f296bb3SBarry Smith$\delta\lambda$, all of which have trade-offs.
*7f296bb3SBarry SmithThe main difference is whether the iterative solution falls on the constraint
*7f296bb3SBarry Smithsurface at every iteration, or only when fully converged.
*7f296bb3SBarry SmithThis MR implements one of each of these approaches, set via
*7f296bb3SBarry Smith`SNESNewtonALSetCorrectionType()` or
*7f296bb3SBarry Smith`-snes_newtonal_correction_type <normal|exact>` on the command line.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**Corrections in the Normal Hyperplane.** The `SNES_NEWTONAL_CORRECTION_NORMAL`
*7f296bb3SBarry Smithoption is simpler and computationally less expensive, but may fail to converge, as
*7f296bb3SBarry Smiththe constraint equation is not satisfied at every iteration.
*7f296bb3SBarry SmithThe update $\delta \lambda$ is chosen such that the update is within the
*7f296bb3SBarry Smithnormal hyper-surface to the quadratic constraint surface.
*7f296bb3SBarry SmithMathematically, that is
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\delta \lambda = -\frac{\Delta \mathbf x \cdot \delta \mathbf x^F}{\Delta\mathbf x \cdot \delta\mathbf x^Q + \psi^2 \Delta\lambda}.
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis implementation is based on {cite}`LeonPaulinoPereiraMenezesLages_2011`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**Exact Corrections.** The `SNES_NEWTONAL_CORRECTION_EXACT` option is far more
*7f296bb3SBarry Smithcomplex, but ensures that the constraint is exactly satisfied at every Newton
*7f296bb3SBarry Smithiteration. As such, it is generally more robust.
*7f296bb3SBarry SmithBy evaluating the intersection of constraint surface and equilibrium line at each
*7f296bb3SBarry Smithiteration, $\delta\lambda$ is chosen as one of the roots of the above
*7f296bb3SBarry Smithquadratic equation $a\delta\lambda^2 + b\delta\lambda + c = 0$.
*7f296bb3SBarry SmithThis method encounters issues, however, if the linearized equilibrium line and
*7f296bb3SBarry Smithconstraint surface do not intersect due to particularly large linearized error.
*7f296bb3SBarry SmithIn this case, the roots are complex.
*7f296bb3SBarry SmithTo continue progressing toward a solution, this method uses a partial correction by
*7f296bb3SBarry Smithchoosing $\delta s$ such that the quadratic equation has a single real root.
*7f296bb3SBarry SmithGeometrically, this is selecting the point on the constraint surface closest to the
*7f296bb3SBarry Smithlinearized equilibrium line. See the code or {cite}`Ritto-CorreaCamotim2008` for a
*7f296bb3SBarry Smithmathematical description of these partial corrections.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Nonlinear Krylov Methods
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithA number of nonlinear Krylov methods are provided, including Nonlinear
*7f296bb3SBarry SmithRichardson (`SNESNRICHARDSON`), nonlinear conjugate gradient (`SNESNCG`), nonlinear GMRES (`SNESNGMRES`), and Anderson Mixing (`SNESANDERSON`). These
*7f296bb3SBarry Smithmethods are described individually below. They are all instrumental to
*7f296bb3SBarry SmithPETSc’s nonlinear preconditioning.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**Nonlinear Richardson.** The nonlinear Richardson iteration, `SNESNRICHARDSON`, merely
*7f296bb3SBarry Smithtakes the form of a line search-damped fixed-point iteration of the form
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{x}_{k+1} = \mathbf{x}_k - \lambda \mathbf{F}(\mathbf{x}_k), \;\; k=0,1, \ldots,
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere the default linesearch is `SNESLINESEARCHL2`. This simple solver
*7f296bb3SBarry Smithis mostly useful as a nonlinear smoother, or to provide line search
*7f296bb3SBarry Smithstabilization to an inner method.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**Nonlinear Conjugate Gradients.** Nonlinear CG, `SNESNCG`, is equivalent to linear
*7f296bb3SBarry SmithCG, but with the steplength determined by line search
*7f296bb3SBarry Smith(`SNESLINESEARCHCP` by default). Five variants (Fletcher-Reed,
*7f296bb3SBarry SmithHestenes-Steifel, Polak-Ribiere-Polyak, Dai-Yuan, and Conjugate Descent)
*7f296bb3SBarry Smithare implemented in PETSc and may be chosen using
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESNCGSetType(SNES snes, SNESNCGType btype);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith**Anderson Mixing and Nonlinear GMRES Methods.** Nonlinear GMRES (`SNESNGMRES`), and
*7f296bb3SBarry SmithAnderson Mixing (`SNESANDERSON`) methods combine the last $m$ iterates, plus a new
*7f296bb3SBarry Smithfixed-point iteration iterate, into an approximate residual-minimizing new iterate.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAll of the above methods have support for using a nonlinear preconditioner to compute the preliminary update step, rather than the default
*7f296bb3SBarry Smithwhich is the nonlinear function's residual, \$ mathbf\{F}(mathbf\{x}\_k)\$. The different update is obtained by solving a nonlinear preconditioner nonlinear problem, which has its own
*7f296bb3SBarry Smith`SNES` object that may be obtained with `SNESGetNPC()`.
*7f296bb3SBarry SmithQuasi-Newton Methods
*7f296bb3SBarry Smith^^^^^^^^^^^^^^^^^^^^
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithQuasi-Newton methods store iterative rank-one updates to the Jacobian
*7f296bb3SBarry Smithinstead of computing the Jacobian directly. Three limited-memory quasi-Newton
*7f296bb3SBarry Smithmethods are provided, L-BFGS, which are described in
*7f296bb3SBarry SmithTable {any}`tab-qndefaults`. These all are encapsulated under
*7f296bb3SBarry Smith`-snes_type qn` and may be changed with `snes_qn_type`. The default
*7f296bb3SBarry Smithis L-BFGS, which provides symmetric updates to an approximate Jacobian.
*7f296bb3SBarry SmithThis iteration is similar to the line search Newton methods.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe quasi-Newton methods support the use of a nonlinear preconditioner that can be obtained with `SNESGetNPC()` and then configured; or that can be configured with
*7f296bb3SBarry Smith`SNES`, `KSP`, and `PC` options using the options database prefix `-npc_`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```{eval-rst}
*7f296bb3SBarry Smith.. list-table:: PETSc quasi-Newton solvers
*7f296bb3SBarry Smith   :name: tab-qndefaults
*7f296bb3SBarry Smith   :header-rows: 1
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   * - QN Method
*7f296bb3SBarry Smith     - ``SNESQNType``
*7f296bb3SBarry Smith     - Options Name
*7f296bb3SBarry Smith     - Default Line Search
*7f296bb3SBarry Smith   * - L-BFGS
*7f296bb3SBarry Smith     - ``SNES_QN_LBFGS``
*7f296bb3SBarry Smith     - ``lbfgs``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHCP``
*7f296bb3SBarry Smith   * - “Good” Broyden
*7f296bb3SBarry Smith     - ``SNES_QN_BROYDEN``
*7f296bb3SBarry Smith     - ``broyden``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHBASIC`` (or equivalently ``SNESLINESEARCHNONE``
*7f296bb3SBarry Smith   * - “Bad” Broyden
*7f296bb3SBarry Smith     - ``SNES_QN_BADBROYDEN``
*7f296bb3SBarry Smith     - ``badbroyden``
*7f296bb3SBarry Smith     - ``SNESLINESEARCHL2``
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOne may also control the form of the initial Jacobian approximation with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESQNSetScaleType(SNES snes, SNESQNScaleType stype);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithand the restart type with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESQNSetRestartType(SNES snes, SNESQNRestartType rtype);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### The Full Approximation Scheme
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe Nonlinear Full Approximation Scheme (FAS) `SNESFAS`, is a nonlinear multigrid method. At
*7f296bb3SBarry Smitheach level, there is a recursive cycle control `SNES` instance, and
*7f296bb3SBarry Smitheither one or two nonlinear solvers that act as smoothers (up and down). Problems
*7f296bb3SBarry Smithset up using the `SNES` `DMDA` interface are automatically
*7f296bb3SBarry Smithcoarsened. FAS, `SNESFAS`, differs slightly from linear multigrid `PCMG`, in that the hierarchy is
*7f296bb3SBarry Smithconstructed recursively. However, much of the interface is a one-to-one
*7f296bb3SBarry Smithmap. We describe the “get” operations here, and it can be assumed that
*7f296bb3SBarry Smitheach has a corresponding “set” operation. For instance, the number of
*7f296bb3SBarry Smithlevels in the hierarchy may be retrieved using
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASGetLevels(SNES snes, PetscInt *levels);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThere are four `SNESFAS` cycle types, `SNES_FAS_MULTIPLICATIVE`,
*7f296bb3SBarry Smith`SNES_FAS_ADDITIVE`, `SNES_FAS_FULL`, and `SNES_FAS_KASKADE`. The
*7f296bb3SBarry Smithtype may be set with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASSetType(SNES snes,SNESFASType fastype);.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithand the cycle type, 1 for V, 2 for W, may be set with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASSetCycles(SNES snes, PetscInt cycles);.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithMuch like the interface to `PCMG` described in {any}`sec_mg`, there are interfaces to recover the
*7f296bb3SBarry Smithvarious levels’ cycles and smoothers. The level smoothers may be
*7f296bb3SBarry Smithaccessed with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASGetSmoother(SNES snes, PetscInt level, SNES *smooth);
*7f296bb3SBarry SmithSNESFASGetSmootherUp(SNES snes, PetscInt level, SNES *smooth);
*7f296bb3SBarry SmithSNESFASGetSmootherDown(SNES snes, PetscInt level, SNES *smooth);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithand the level cycles with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASGetCycleSNES(SNES snes,PetscInt level,SNES *lsnes);.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAlso akin to `PCMG`, the restriction and prolongation at a level may
*7f296bb3SBarry Smithbe acquired with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASGetInterpolation(SNES snes, PetscInt level, Mat *mat);
*7f296bb3SBarry SmithSNESFASGetRestriction(SNES snes, PetscInt level, Mat *mat);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithIn addition, FAS requires special restriction for solution-like
*7f296bb3SBarry Smithvariables, called injection. This may be set with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASGetInjection(SNES snes, PetscInt level, Mat *mat);.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe coarse solve context may be acquired with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESFASGetCoarseSolve(SNES snes, SNES *smooth);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Nonlinear Additive Schwarz
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithNonlinear Additive Schwarz methods (NASM) take a number of local
*7f296bb3SBarry Smithnonlinear subproblems, solves them independently in parallel, and
*7f296bb3SBarry Smithcombines those solutions into a new approximate solution.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESNASMSetSubdomains(SNES snes,PetscInt n,SNES subsnes[],VecScatter iscatter[],VecScatter oscatter[],VecScatter gscatter[]);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithallows for the user to create these local subdomains. Problems set up
*7f296bb3SBarry Smithusing the `SNES` `DMDA` interface are automatically decomposed. To
*7f296bb3SBarry Smithbegin, the type of subdomain updates to the whole solution are limited
*7f296bb3SBarry Smithto two types borrowed from `PCASM`: `PC_ASM_BASIC`, in which the
*7f296bb3SBarry Smithoverlapping updates added. `PC_ASM_RESTRICT` updates in a
*7f296bb3SBarry Smithnonoverlapping fashion. This may be set with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESNASMSetType(SNES snes,PCASMType type);.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith`SNESASPIN` is a helper `SNES` type that sets up a nonlinearly
*7f296bb3SBarry Smithpreconditioned Newton’s method using NASM as the preconditioner.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## General Options
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis section discusses options and routines that apply to all `SNES`
*7f296bb3SBarry Smithsolvers and problem classes. In particular, we focus on convergence
*7f296bb3SBarry Smithtests, monitoring routines, and tools for checking derivative
*7f296bb3SBarry Smithcomputations.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snesconvergence)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Convergence Tests
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithConvergence of the nonlinear solvers can be detected in a variety of
*7f296bb3SBarry Smithways; the user can even specify a customized test, as discussed below.
*7f296bb3SBarry SmithMost of the nonlinear solvers use `SNESConvergenceTestDefault()`,
*7f296bb3SBarry Smithhowever, `SNESNEWTONTR` uses a method-specific additional convergence
*7f296bb3SBarry Smithtest as well. The convergence tests involves several parameters, which
*7f296bb3SBarry Smithare set by default to values that should be reasonable for a wide range
*7f296bb3SBarry Smithof problems. The user can customize the parameters to the problem at
*7f296bb3SBarry Smithhand by using some of the following routines and options.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOne method of convergence testing is to declare convergence when the
*7f296bb3SBarry Smithnorm of the change in the solution between successive iterations is less
*7f296bb3SBarry Smiththan some tolerance, `stol`. Convergence can also be determined based
*7f296bb3SBarry Smithon the norm of the function. Such a test can use either the absolute
*7f296bb3SBarry Smithsize of the norm, `atol`, or its relative decrease, `rtol`, from an
*7f296bb3SBarry Smithinitial guess. The following routine sets these parameters, which are
*7f296bb3SBarry Smithused in many of the default `SNES` convergence tests:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSetTolerances(SNES snes,PetscReal atol,PetscReal rtol,PetscReal stol, PetscInt its,PetscInt fcts);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis routine also sets the maximum numbers of allowable nonlinear
*7f296bb3SBarry Smithiterations, `its`, and function evaluations, `fcts`. The
*7f296bb3SBarry Smithcorresponding options database commands for setting these parameters are:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith- `-snes_atol <atol>`
*7f296bb3SBarry Smith- `-snes_rtol <rtol>`
*7f296bb3SBarry Smith- `-snes_stol <stol>`
*7f296bb3SBarry Smith- `-snes_max_it <its>`
*7f296bb3SBarry Smith- `-snes_max_funcs <fcts>` (use `unlimited` for no maximum)
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithA related routine is `SNESGetTolerances()`. `PETSC_CURRENT` may be used
*7f296bb3SBarry Smithfor any parameter to indicate the current value should be retained; use `PETSC_DETERMINE` to restore to the default value from when the object was created.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithUsers can set their own customized convergence tests in `SNES` by
*7f296bb3SBarry Smithusing the command
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESSetConvergenceTest(SNES snes,PetscErrorCode (*test)(SNES snes,PetscInt it,PetscReal xnorm, PetscReal gnorm,PetscReal f,SNESConvergedReason reason, void *cctx),void *cctx,PetscErrorCode (*destroy)(void *cctx));
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe final argument of the convergence test routine, `cctx`, denotes an
*7f296bb3SBarry Smithoptional user-defined context for private data. When solving systems of
*7f296bb3SBarry Smithnonlinear equations, the arguments `xnorm`, `gnorm`, and `f` are
*7f296bb3SBarry Smiththe current iterate norm, current step norm, and function norm,
*7f296bb3SBarry Smithrespectively. `SNESConvergedReason` should be set positive for
*7f296bb3SBarry Smithconvergence and negative for divergence. See `include/petscsnes.h` for
*7f296bb3SBarry Smitha list of values for `SNESConvergedReason`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snesmonitor)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Convergence Monitoring
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithBy default the `SNES` solvers run silently without displaying
*7f296bb3SBarry Smithinformation about the iterations. The user can initiate monitoring with
*7f296bb3SBarry Smiththe command
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESMonitorSet(SNES snes, PetscErrorCode (*mon)(SNES snes, PetscInt its, PetscReal norm, void* mctx), void *mctx, (PetscCtxDestroyFn *)*monitordestroy);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe routine, `mon`, indicates a user-defined monitoring routine, where
*7f296bb3SBarry Smith`its` and `mctx` respectively denote the iteration number and an
*7f296bb3SBarry Smithoptional user-defined context for private data for the monitor routine.
*7f296bb3SBarry SmithThe argument `norm` is the function norm.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe routine set by `SNESMonitorSet()` is called once after every
*7f296bb3SBarry Smithsuccessful step computation within the nonlinear solver. Hence, the user
*7f296bb3SBarry Smithcan employ this routine for any application-specific computations that
*7f296bb3SBarry Smithshould be done after the solution update. The option `-snes_monitor`
*7f296bb3SBarry Smithactivates the default `SNES` monitor routine,
*7f296bb3SBarry Smith`SNESMonitorDefault()`, while `-snes_monitor_lg_residualnorm` draws
*7f296bb3SBarry Smitha simple line graph of the residual norm’s convergence.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOne can cancel hardwired monitoring routines for `SNES` at runtime
*7f296bb3SBarry Smithwith `-snes_monitor_cancel`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAs the Newton method converges so that the residual norm is small, say
*7f296bb3SBarry Smith$10^{-10}$, many of the final digits printed with the
*7f296bb3SBarry Smith`-snes_monitor` option are meaningless. Worse, they are different on
*7f296bb3SBarry Smithdifferent machines; due to different round-off rules used by, say, the
*7f296bb3SBarry SmithIBM RS6000 and the Sun SPARC. This makes testing between different
*7f296bb3SBarry Smithmachines difficult. The option `-snes_monitor_short` causes PETSc to
*7f296bb3SBarry Smithprint fewer of the digits of the residual norm as it gets smaller; thus
*7f296bb3SBarry Smithon most of the machines it will always print the same numbers making
*7f296bb3SBarry Smithcross-process testing easier.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe routines
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESGetSolution(SNES snes,Vec *x);
*7f296bb3SBarry SmithSNESGetFunction(SNES snes,Vec *r,void *ctx,int(**func)(SNES,Vec,Vec,void*));
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithreturn the solution vector and function vector from a `SNES` context.
*7f296bb3SBarry SmithThese routines are useful, for instance, if the convergence test
*7f296bb3SBarry Smithrequires some property of the solution or function other than those
*7f296bb3SBarry Smithpassed with routine arguments.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snesderivs)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith### Checking Accuracy of Derivatives
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithSince hand-coding routines for Jacobian matrix evaluation can be error
*7f296bb3SBarry Smithprone, `SNES` provides easy-to-use support for checking these matrices
*7f296bb3SBarry Smithagainst finite difference versions. In the simplest form of comparison,
*7f296bb3SBarry Smithusers can employ the option `-snes_test_jacobian` to compare the
*7f296bb3SBarry Smithmatrices at several points. Although not exhaustive, this test will
*7f296bb3SBarry Smithgenerally catch obvious problems. One can compare the elements of the
*7f296bb3SBarry Smithtwo matrices by using the option `-snes_test_jacobian_view` , which
*7f296bb3SBarry Smithcauses the two matrices to be printed to the screen.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAnother means for verifying the correctness of a code for Jacobian
*7f296bb3SBarry Smithcomputation is running the problem with either the finite difference or
*7f296bb3SBarry Smithmatrix-free variant, `-snes_fd` or `-snes_mf`; see {any}`sec_fdmatrix` or {any}`sec_nlmatrixfree`.
*7f296bb3SBarry SmithIf a
*7f296bb3SBarry Smithproblem converges well with these matrix approximations but not with a
*7f296bb3SBarry Smithuser-provided routine, the problem probably lies with the hand-coded
*7f296bb3SBarry Smithmatrix. See the note in {any}`sec_snesjacobian` about
*7f296bb3SBarry Smithassembling your Jabobian in the "preconditioner" slot `Pmat`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe correctness of user provided `MATSHELL` Jacobians in general can be
*7f296bb3SBarry Smithchecked with `MatShellTestMultTranspose()` and `MatShellTestMult()`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe correctness of user provided `MATSHELL` Jacobians via `TSSetRHSJacobian()`
*7f296bb3SBarry Smithcan be checked with `TSRHSJacobianTestTranspose()` and `TSRHSJacobianTest()`
*7f296bb3SBarry Smiththat check the correction of the matrix-transpose vector product and the
*7f296bb3SBarry Smithmatrix-product. From the command line, these can be checked with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith- `-ts_rhs_jacobian_test_mult_transpose`
*7f296bb3SBarry Smith- `-mat_shell_test_mult_transpose_view`
*7f296bb3SBarry Smith- `-ts_rhs_jacobian_test_mult`
*7f296bb3SBarry Smith- `-mat_shell_test_mult_view`
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## Inexact Newton-like Methods
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithSince exact solution of the linear Newton systems within {math:numref}`newton`
*7f296bb3SBarry Smithat each iteration can be costly, modifications
*7f296bb3SBarry Smithare often introduced that significantly reduce these expenses and yet
*7f296bb3SBarry Smithretain the rapid convergence of Newton’s method. Inexact or truncated
*7f296bb3SBarry SmithNewton techniques approximately solve the linear systems using an
*7f296bb3SBarry Smithiterative scheme. In comparison with using direct methods for solving
*7f296bb3SBarry Smiththe Newton systems, iterative methods have the virtue of requiring
*7f296bb3SBarry Smithlittle space for matrix storage and potentially saving significant
*7f296bb3SBarry Smithcomputational work. Within the class of inexact Newton methods, of
*7f296bb3SBarry Smithparticular interest are Newton-Krylov methods, where the subsidiary
*7f296bb3SBarry Smithiterative technique for solving the Newton system is chosen from the
*7f296bb3SBarry Smithclass of Krylov subspace projection methods. Note that at runtime the
*7f296bb3SBarry Smithuser can set any of the linear solver options discussed in {any}`ch_ksp`,
*7f296bb3SBarry Smithsuch as `-ksp_type <ksp_method>` and
*7f296bb3SBarry Smith`-pc_type <pc_method>`, to set the Krylov subspace and preconditioner
*7f296bb3SBarry Smithmethods.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithTwo levels of iterations occur for the inexact techniques, where during
*7f296bb3SBarry Smitheach global or outer Newton iteration a sequence of subsidiary inner
*7f296bb3SBarry Smithiterations of a linear solver is performed. Appropriate control of the
*7f296bb3SBarry Smithaccuracy to which the subsidiary iterative method solves the Newton
*7f296bb3SBarry Smithsystem at each global iteration is critical, since these inner
*7f296bb3SBarry Smithiterations determine the asymptotic convergence rate for inexact Newton
*7f296bb3SBarry Smithtechniques. While the Newton systems must be solved well enough to
*7f296bb3SBarry Smithretain fast local convergence of the Newton’s iterates, use of excessive
*7f296bb3SBarry Smithinner iterations, particularly when $\| \mathbf{x}_k - \mathbf{x}_* \|$ is large,
*7f296bb3SBarry Smithis neither necessary nor economical. Thus, the number of required inner
*7f296bb3SBarry Smithiterations typically increases as the Newton process progresses, so that
*7f296bb3SBarry Smiththe truncated iterates approach the true Newton iterates.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithA sequence of nonnegative numbers $\{\eta_k\}$ can be used to
*7f296bb3SBarry Smithindicate the variable convergence criterion. In this case, when solving
*7f296bb3SBarry Smitha system of nonlinear equations, the update step of the Newton process
*7f296bb3SBarry Smithremains unchanged, and direct solution of the linear system is replaced
*7f296bb3SBarry Smithby iteration on the system until the residuals
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{r}_k^{(i)} =  \mathbf{F}'(\mathbf{x}_k) \Delta \mathbf{x}_k + \mathbf{F}(\mathbf{x}_k)
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithsatisfy
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\frac{ \| \mathbf{r}_k^{(i)} \| }{ \| \mathbf{F}(\mathbf{x}_k) \| } \leq \eta_k \leq \eta < 1.
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithHere $\mathbf{x}_0$ is an initial approximation of the solution, and
*7f296bb3SBarry Smith$\| \cdot \|$ denotes an arbitrary norm in $\Re^n$ .
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithBy default a constant relative convergence tolerance is used for solving
*7f296bb3SBarry Smiththe subsidiary linear systems within the Newton-like methods of
*7f296bb3SBarry Smith`SNES`. When solving a system of nonlinear equations, one can instead
*7f296bb3SBarry Smithemploy the techniques of Eisenstat and Walker {cite}`ew96`
*7f296bb3SBarry Smithto compute $\eta_k$ at each step of the nonlinear solver by using
*7f296bb3SBarry Smiththe option `-snes_ksp_ew` . In addition, by adding one’s own
*7f296bb3SBarry Smith`KSP` convergence test (see {any}`sec_convergencetests`), one can easily create one’s own,
*7f296bb3SBarry Smithproblem-dependent, inner convergence tests.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_nlmatrixfree)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## Matrix-Free Methods
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe `SNES` class fully supports matrix-free methods. The matrices
*7f296bb3SBarry Smithspecified in the Jacobian evaluation routine need not be conventional
*7f296bb3SBarry Smithmatrices; instead, they can point to the data required to implement a
*7f296bb3SBarry Smithparticular matrix-free method. The matrix-free variant is allowed *only*
*7f296bb3SBarry Smithwhen the linear systems are solved by an iterative method in combination
*7f296bb3SBarry Smithwith no preconditioning (`PCNONE` or `-pc_type` `none`), a
*7f296bb3SBarry Smithuser-provided preconditioner matrix, or a user-provided preconditioner
*7f296bb3SBarry Smithshell (`PCSHELL`, discussed in {any}`sec_pc`); that
*7f296bb3SBarry Smithis, obviously matrix-free methods cannot be used with a direct solver,
*7f296bb3SBarry Smithapproximate factorization, or other preconditioner which requires access
*7f296bb3SBarry Smithto explicit matrix entries.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe user can create a matrix-free context for use within `SNES` with
*7f296bb3SBarry Smiththe routine
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatCreateSNESMF(SNES snes,Mat *mat);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis routine creates the data structures needed for the matrix-vector
*7f296bb3SBarry Smithproducts that arise within Krylov space iterative
*7f296bb3SBarry Smithmethods {cite}`brownsaad:90`.
*7f296bb3SBarry SmithThe default `SNES`
*7f296bb3SBarry Smithmatrix-free approximations can also be invoked with the command
*7f296bb3SBarry Smith`-snes_mf`. Or, one can retain the user-provided Jacobian
*7f296bb3SBarry Smithpreconditioner, but replace the user-provided Jacobian matrix with the
*7f296bb3SBarry Smithdefault matrix-free variant with the option `-snes_mf_operator`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith`MatCreateSNESMF()` uses
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatCreateMFFD(Vec x, Mat *mat);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhich can also be used directly for users who need a matrix-free matrix but are not using `SNES`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe user can set one parameter to control the Jacobian-vector product
*7f296bb3SBarry Smithapproximation with the command
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatMFFDSetFunctionError(Mat mat,PetscReal rerror);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe parameter `rerror` should be set to the square root of the
*7f296bb3SBarry Smithrelative error in the function evaluations, $e_{rel}$; the default
*7f296bb3SBarry Smithis the square root of machine epsilon (about $10^{-8}$ in double
*7f296bb3SBarry Smithprecision), which assumes that the functions are evaluated to full
*7f296bb3SBarry Smithfloating-point precision accuracy. This parameter can also be set from
*7f296bb3SBarry Smiththe options database with `-mat_mffd_err <err>`
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithIn addition, PETSc provides ways to register new routines to compute
*7f296bb3SBarry Smiththe differencing parameter ($h$); see the manual page for
*7f296bb3SBarry Smith`MatMFFDSetType()` and `MatMFFDRegister()`. We currently provide two
*7f296bb3SBarry Smithdefault routines accessible via `-mat_mffd_type <ds or wp>`. For
*7f296bb3SBarry Smiththe default approach there is one “tuning” parameter, set with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatMFFDDSSetUmin(Mat mat,PetscReal umin);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis parameter, `umin` (or $u_{min}$), is a bit involved; its
*7f296bb3SBarry Smithdefault is $10^{-6}$ . Its command line form is `-mat_mffd_umin <umin>`.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe Jacobian-vector product is approximated
*7f296bb3SBarry Smithvia the formula
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry SmithF'(u) a \approx \frac{F(u + h*a) - F(u)}{h}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $h$ is computed via
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smithh = e_{\text{rel}} \cdot \begin{cases}
*7f296bb3SBarry Smithu^{T}a/\lVert a \rVert^2_2                                 & \text{if $|u^T a| > u_{\min} \lVert a \rVert_{1}$} \\
*7f296bb3SBarry Smithu_{\min} \operatorname{sign}(u^{T}a) \lVert a \rVert_{1}/\lVert a\rVert^2_2  & \text{otherwise}.
*7f296bb3SBarry Smith\end{cases}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis approach is taken from Brown and Saad
*7f296bb3SBarry Smith{cite}`brownsaad:90`. The second approach, taken from Walker and Pernice,
*7f296bb3SBarry Smith{cite}`pw98`, computes $h$ via
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\begin{aligned}
*7f296bb3SBarry Smith        h = \frac{\sqrt{1 + ||u||}e_{rel}}{||a||}\end{aligned}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThis has no tunable parameters, but note that inside the nonlinear solve
*7f296bb3SBarry Smithfor the entire *linear* iterative process $u$ does not change
*7f296bb3SBarry Smithhence $\sqrt{1 + ||u||}$ need be computed only once. This
*7f296bb3SBarry Smithinformation may be set with the options
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatMFFDWPSetComputeNormU(Mat mat,PetscBool );
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithor `-mat_mffd_compute_normu <true or false>`. This information is used
*7f296bb3SBarry Smithto eliminate the redundant computation of these parameters, therefore
*7f296bb3SBarry Smithreducing the number of collective operations and improving the
*7f296bb3SBarry Smithefficiency of the application code. This takes place automatically for the PETSc GMRES solver with left preconditioning.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithIt is also possible to monitor the differencing parameters h that are
*7f296bb3SBarry Smithcomputed via the routines
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatMFFDSetHHistory(Mat,PetscScalar *,int);
*7f296bb3SBarry SmithMatMFFDResetHHistory(Mat,PetscScalar *,int);
*7f296bb3SBarry SmithMatMFFDGetH(Mat,PetscScalar *);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithWe include an explicit example of using matrix-free methods in {any}`ex3.c <snes_ex3>`.
*7f296bb3SBarry SmithNote that by using the option `-snes_mf` one can
*7f296bb3SBarry Smitheasily convert any `SNES` code to use a matrix-free Newton-Krylov
*7f296bb3SBarry Smithmethod without a preconditioner. As shown in this example,
*7f296bb3SBarry Smith`SNESSetFromOptions()` must be called *after* `SNESSetJacobian()` to
*7f296bb3SBarry Smithenable runtime switching between the user-specified Jacobian and the
*7f296bb3SBarry Smithdefault `SNES` matrix-free form.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(snes_ex3)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith:::{admonition} Listing: `src/snes/tutorials/ex3.c`
*7f296bb3SBarry Smith```{literalinclude} /../src/snes/tutorials/ex3.c
*7f296bb3SBarry Smith:end-before: /*TEST
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith:::
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithTable {any}`tab-jacobians` summarizes the various matrix situations
*7f296bb3SBarry Smiththat `SNES` supports. In particular, different linear system matrices
*7f296bb3SBarry Smithand preconditioning matrices are allowed, as well as both matrix-free
*7f296bb3SBarry Smithand application-provided preconditioners. If {any}`ex3.c <snes_ex3>` is run with
*7f296bb3SBarry Smiththe options `-snes_mf` and `-user_precond` then it uses a
*7f296bb3SBarry Smithmatrix-free application of the matrix-vector multiple and a user
*7f296bb3SBarry Smithprovided matrix-free Jacobian.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```{eval-rst}
*7f296bb3SBarry Smith.. list-table:: Jacobian Options
*7f296bb3SBarry Smith   :name: tab-jacobians
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   * - Matrix Use
*7f296bb3SBarry Smith     - Conventional Matrix Formats
*7f296bb3SBarry Smith     - Matrix-free versions
*7f296bb3SBarry Smith   * - Jacobian Matrix
*7f296bb3SBarry Smith     - Create matrix with ``MatCreate()``:math:`^*`.  Assemble matrix with user-defined routine :math:`^\dagger`
*7f296bb3SBarry Smith     - Create matrix with ``MatCreateShell()``.  Use ``MatShellSetOperation()`` to set various matrix actions, or use ``MatCreateMFFD()`` or ``MatCreateSNESMF()``.
*7f296bb3SBarry Smith   * - Preconditioning Matrix
*7f296bb3SBarry Smith     - Create matrix with ``MatCreate()``:math:`^*`.  Assemble matrix with user-defined routine :math:`^\dagger`
*7f296bb3SBarry Smith     - Use ``SNESGetKSP()`` and ``KSPGetPC()`` to access the ``PC``, then use ``PCSetType(pc, PCSHELL)`` followed by ``PCShellSetApply()``.
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$^*$ Use either the generic `MatCreate()` or a format-specific variant such as `MatCreateAIJ()`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$^\dagger$ Set user-defined matrix formation routine with `SNESSetJacobian()` or with a `DM` variant such as `DMDASNESSetJacobianLocal()`
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithSNES also provides some less well-integrated code to apply matrix-free finite differencing using an automatically computed measurement of the
*7f296bb3SBarry Smithnoise of the functions. This can be selected with `-snes_mf_version 2`; it does not use `MatCreateMFFD()` but has similar options that start with
*7f296bb3SBarry Smith`-snes_mf_` instead of `-mat_mffd_`. Note that this alternative prefix **only** works for version 2 differencing.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_fdmatrix)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## Finite Difference Jacobian Approximations
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithPETSc provides some tools to help approximate the Jacobian matrices
*7f296bb3SBarry Smithefficiently via finite differences. These tools are intended for use in
*7f296bb3SBarry Smithcertain situations where one is unable to compute Jacobian matrices
*7f296bb3SBarry Smithanalytically, and matrix-free methods do not work well without a
*7f296bb3SBarry Smithpreconditioner, due to very poor conditioning. The approximation
*7f296bb3SBarry Smithrequires several steps:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith- First, one colors the columns of the (not yet built) Jacobian matrix,
*7f296bb3SBarry Smith  so that columns of the same color do not share any common rows.
*7f296bb3SBarry Smith- Next, one creates a `MatFDColoring` data structure that will be
*7f296bb3SBarry Smith  used later in actually computing the Jacobian.
*7f296bb3SBarry Smith- Finally, one tells the nonlinear solvers of `SNES` to use the
*7f296bb3SBarry Smith  `SNESComputeJacobianDefaultColor()` routine to compute the
*7f296bb3SBarry Smith  Jacobians.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithA code fragment that demonstrates this process is given below.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithISColoring    iscoloring;
*7f296bb3SBarry SmithMatFDColoring fdcoloring;
*7f296bb3SBarry SmithMatColoring   coloring;
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith/*
*7f296bb3SBarry Smith  This initializes the nonzero structure of the Jacobian. This is artificial
*7f296bb3SBarry Smith  because clearly if we had a routine to compute the Jacobian we wouldn't
*7f296bb3SBarry Smith  need to use finite differences.
*7f296bb3SBarry Smith*/
*7f296bb3SBarry SmithFormJacobian(snes,x, &J, &J, &user);
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith/*
*7f296bb3SBarry Smith   Color the matrix, i.e. determine groups of columns that share no common
*7f296bb3SBarry Smith  rows. These columns in the Jacobian can all be computed simultaneously.
*7f296bb3SBarry Smith*/
*7f296bb3SBarry SmithMatColoringCreate(J, &coloring);
*7f296bb3SBarry SmithMatColoringSetType(coloring,MATCOLORINGSL);
*7f296bb3SBarry SmithMatColoringSetFromOptions(coloring);
*7f296bb3SBarry SmithMatColoringApply(coloring, &iscoloring);
*7f296bb3SBarry SmithMatColoringDestroy(&coloring);
*7f296bb3SBarry Smith/*
*7f296bb3SBarry Smith   Create the data structure that SNESComputeJacobianDefaultColor() uses
*7f296bb3SBarry Smith   to compute the actual Jacobians via finite differences.
*7f296bb3SBarry Smith*/
*7f296bb3SBarry SmithMatFDColoringCreate(J,iscoloring, &fdcoloring);
*7f296bb3SBarry SmithISColoringDestroy(&iscoloring);
*7f296bb3SBarry SmithMatFDColoringSetFunction(fdcoloring,(PetscErrorCode (*)(void))FormFunction, &user);
*7f296bb3SBarry SmithMatFDColoringSetFromOptions(fdcoloring);
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith/*
*7f296bb3SBarry Smith  Tell SNES to use the routine SNESComputeJacobianDefaultColor()
*7f296bb3SBarry Smith  to compute Jacobians.
*7f296bb3SBarry Smith*/
*7f296bb3SBarry SmithSNESSetJacobian(snes,J,J,SNESComputeJacobianDefaultColor,fdcoloring);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOf course, we are cheating a bit. If we do not have an analytic formula
*7f296bb3SBarry Smithfor computing the Jacobian, then how do we know what its nonzero
*7f296bb3SBarry Smithstructure is so that it may be colored? Determining the structure is
*7f296bb3SBarry Smithproblem dependent, but fortunately, for most structured grid problems
*7f296bb3SBarry Smith(the class of problems for which PETSc was originally designed) if one
*7f296bb3SBarry Smithknows the stencil used for the nonlinear function one can usually fairly
*7f296bb3SBarry Smitheasily obtain an estimate of the location of nonzeros in the matrix.
*7f296bb3SBarry SmithThis is harder in the unstructured case, but one typically knows where the nonzero entries are from the mesh topology and distribution of degrees of freedom.
*7f296bb3SBarry SmithIf using `DMPlex` ({any}`ch_unstructured`) for unstructured meshes, the nonzero locations will be identified in `DMCreateMatrix()` and the procedure above can be used.
*7f296bb3SBarry SmithMost external packages for unstructured meshes have similar functionality.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithOne need not necessarily use a `MatColoring` object to determine a
*7f296bb3SBarry Smithcoloring. For example, if a grid can be colored directly (without using
*7f296bb3SBarry Smiththe associated matrix), then that coloring can be provided to
*7f296bb3SBarry Smith`MatFDColoringCreate()`. Note that the user must always preset the
*7f296bb3SBarry Smithnonzero structure in the matrix regardless of which coloring routine is
*7f296bb3SBarry Smithused.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithPETSc provides the following coloring algorithms, which can be selected using `MatColoringSetType()` or via the command line argument `-mat_coloring_type`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```{eval-rst}
*7f296bb3SBarry Smith.. list-table::
*7f296bb3SBarry Smith   :header-rows: 1
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith   * - Algorithm
*7f296bb3SBarry Smith     - ``MatColoringType``
*7f296bb3SBarry Smith     - ``-mat_coloring_type``
*7f296bb3SBarry Smith     - Parallel
*7f296bb3SBarry Smith   * - smallest-last :cite:`more84`
*7f296bb3SBarry Smith     - ``MATCOLORINGSL``
*7f296bb3SBarry Smith     - ``sl``
*7f296bb3SBarry Smith     - No
*7f296bb3SBarry Smith   * - largest-first :cite:`more84`
*7f296bb3SBarry Smith     - ``MATCOLORINGLF``
*7f296bb3SBarry Smith     - ``lf``
*7f296bb3SBarry Smith     - No
*7f296bb3SBarry Smith   * - incidence-degree :cite:`more84`
*7f296bb3SBarry Smith     - ``MATCOLORINGID``
*7f296bb3SBarry Smith     - ``id``
*7f296bb3SBarry Smith     - No
*7f296bb3SBarry Smith   * - Jones-Plassmann :cite:`jp:pcolor`
*7f296bb3SBarry Smith     - ``MATCOLORINGJP``
*7f296bb3SBarry Smith     - ``jp``
*7f296bb3SBarry Smith     - Yes
*7f296bb3SBarry Smith   * - Greedy
*7f296bb3SBarry Smith     - ``MATCOLORINGGREEDY``
*7f296bb3SBarry Smith     - ``greedy``
*7f296bb3SBarry Smith     - Yes
*7f296bb3SBarry Smith   * - Natural (1 color per column)
*7f296bb3SBarry Smith     - ``MATCOLORINGNATURAL``
*7f296bb3SBarry Smith     - ``natural``
*7f296bb3SBarry Smith     - Yes
*7f296bb3SBarry Smith   * - Power (:math:`A^k` followed by 1-coloring)
*7f296bb3SBarry Smith     - ``MATCOLORINGPOWER``
*7f296bb3SBarry Smith     - ``power``
*7f296bb3SBarry Smith     - Yes
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithAs for the matrix-free computation of Jacobians ({any}`sec_nlmatrixfree`), two parameters affect the accuracy of the
*7f296bb3SBarry Smithfinite difference Jacobian approximation. These are set with the command
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithMatFDColoringSetParameters(MatFDColoring fdcoloring,PetscReal rerror,PetscReal umin);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe parameter `rerror` is the square root of the relative error in the
*7f296bb3SBarry Smithfunction evaluations, $e_{rel}$; the default is the square root of
*7f296bb3SBarry Smithmachine epsilon (about $10^{-8}$ in double precision), which
*7f296bb3SBarry Smithassumes that the functions are evaluated approximately to floating-point
*7f296bb3SBarry Smithprecision accuracy. The second parameter, `umin`, is a bit more
*7f296bb3SBarry Smithinvolved; its default is $10e^{-6}$ . Column $i$ of the
*7f296bb3SBarry SmithJacobian matrix (denoted by $F_{:i}$) is approximated by the
*7f296bb3SBarry Smithformula
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry SmithF'_{:i} \approx \frac{F(u + h*dx_{i}) - F(u)}{h}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhere $h$ is computed via:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smithh = e_{\text{rel}} \cdot \begin{cases}
*7f296bb3SBarry Smithu_{i}             &    \text{if $|u_{i}| > u_{\min}$} \\
*7f296bb3SBarry Smithu_{\min} \cdot \operatorname{sign}(u_{i})  & \text{otherwise}.
*7f296bb3SBarry Smith\end{cases}
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithfor `MATMFFD_DS` or:
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smithh = e_{\text{rel}} \sqrt(\|u\|)
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithfor `MATMFFD_WP` (default). These parameters may be set from the options
*7f296bb3SBarry Smithdatabase with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith-mat_fd_coloring_err <err>
*7f296bb3SBarry Smith-mat_fd_coloring_umin <umin>
*7f296bb3SBarry Smith-mat_fd_type <htype>
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithNote that `MatColoring` type `MATCOLORINGSL`, `MATCOLORINGLF`, and
*7f296bb3SBarry Smith`MATCOLORINGID` are sequential algorithms. `MATCOLORINGJP` and
*7f296bb3SBarry Smith`MATCOLORINGGREEDY` are parallel algorithms, although in practice they
*7f296bb3SBarry Smithmay create more colors than the sequential algorithms. If one computes
*7f296bb3SBarry Smiththe coloring `iscoloring` reasonably with a parallel algorithm or by
*7f296bb3SBarry Smithknowledge of the discretization, the routine `MatFDColoringCreate()`
*7f296bb3SBarry Smithis scalable. An example of this for 2D distributed arrays is given below
*7f296bb3SBarry Smiththat uses the utility routine `DMCreateColoring()`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithDMCreateColoring(da,IS_COLORING_GHOSTED, &iscoloring);
*7f296bb3SBarry SmithMatFDColoringCreate(J,iscoloring, &fdcoloring);
*7f296bb3SBarry SmithMatFDColoringSetFromOptions(fdcoloring);
*7f296bb3SBarry SmithISColoringDestroy( &iscoloring);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithNote that the routine `MatFDColoringCreate()` currently is only
*7f296bb3SBarry Smithsupported for the AIJ and BAIJ matrix formats.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_vi)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## Variational Inequalities
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith`SNES` can also solve (differential) variational inequalities with box (bound) constraints.
*7f296bb3SBarry SmithThese are nonlinear algebraic systems with additional inequality
*7f296bb3SBarry Smithconstraints on some or all of the variables:
*7f296bb3SBarry Smith$L_i \le u_i \le H_i$. For example, the pressure variable cannot be negative.
*7f296bb3SBarry SmithSome, or all, of the lower bounds may be
*7f296bb3SBarry Smithnegative infinity (indicated to PETSc with `SNES_VI_NINF`) and some, or
*7f296bb3SBarry Smithall, of the upper bounds may be infinity (indicated by `SNES_VI_INF`).
*7f296bb3SBarry SmithThe commands
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESVISetVariableBounds(SNES,Vec L,Vec H);
*7f296bb3SBarry SmithSNESVISetComputeVariableBounds(SNES snes, PetscErrorCode (*compute)(SNES,Vec,Vec))
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithare used to indicate that one is solving a variational inequality. Problems with box constraints can be solved with
*7f296bb3SBarry Smiththe reduced space, `SNESVINEWTONRSLS`, and semi-smooth `SNESVINEWTONSSLS` solvers.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe
*7f296bb3SBarry Smithoption `-snes_vi_monitor` turns on extra monitoring of the active set
*7f296bb3SBarry Smithassociated with the bounds and `-snes_vi_type` allows selecting from
*7f296bb3SBarry Smithseveral VI solvers, the default is preferred.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith`SNESLineSearchSetPreCheck()` and `SNESLineSearchSetPostCheck()` can also be used to control properties
*7f296bb3SBarry Smithof the steps selected by `SNES`.
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith(sec_snespc)=
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith## Nonlinear Preconditioning
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithThe mathematical framework of nonlinear preconditioning is explained in detail in {cite}`bruneknepleysmithtu15`.
*7f296bb3SBarry SmithNonlinear preconditioning in PETSc involves the use of an inner `SNES`
*7f296bb3SBarry Smithinstance to define the step for an outer `SNES` instance. The inner
*7f296bb3SBarry Smithinstance may be extracted using
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESGetNPC(SNES snes,SNES *npc);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithand passed run-time options using the `-npc_` prefix. Nonlinear
*7f296bb3SBarry Smithpreconditioning comes in two flavors: left and right. The side may be
*7f296bb3SBarry Smithchanged using `-snes_npc_side` or `SNESSetNPCSide()`. Left nonlinear
*7f296bb3SBarry Smithpreconditioning redefines the nonlinear function as the action of the
*7f296bb3SBarry Smithnonlinear preconditioner $\mathbf{M}$;
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{F}_{M}(x) = \mathbf{M}(\mathbf{x},\mathbf{b}) - \mathbf{x}.
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithRight nonlinear preconditioning redefines the nonlinear function as the
*7f296bb3SBarry Smithfunction on the action of the nonlinear preconditioner;
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith\mathbf{F}(\mathbf{M}(\mathbf{x},\mathbf{b})) = \mathbf{b},
*7f296bb3SBarry Smith$$
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithwhich can be interpreted as putting the preconditioner into “striking
*7f296bb3SBarry Smithdistance” of the solution by outer acceleration.
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithIn addition, basic patterns of solver composition are available with the
*7f296bb3SBarry Smith`SNESType` `SNESCOMPOSITE`. This allows for two or more `SNES`
*7f296bb3SBarry Smithinstances to be combined additively or multiplicatively. By command
*7f296bb3SBarry Smithline, a set of `SNES` types may be given by comma separated list
*7f296bb3SBarry Smithargument to `-snes_composite_sneses`. There are additive
*7f296bb3SBarry Smith(`SNES_COMPOSITE_ADDITIVE`), additive with optimal damping
*7f296bb3SBarry Smith(`SNES_COMPOSITE_ADDITIVEOPTIMAL`), and multiplicative
*7f296bb3SBarry Smith(`SNES_COMPOSITE_MULTIPLICATIVE`) variants which may be set with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESCompositeSetType(SNES,SNESCompositeType);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry SmithNew subsolvers may be added to the composite solver with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESCompositeAddSNES(SNES,SNESType);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smithand accessed with
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```
*7f296bb3SBarry SmithSNESCompositeGetSNES(SNES,PetscInt,SNES *);
*7f296bb3SBarry Smith```
*7f296bb3SBarry Smith
*7f296bb3SBarry Smith```{eval-rst}
*7f296bb3SBarry Smith.. bibliography:: /petsc.bib
*7f296bb3SBarry Smith   :filter: docname in docnames
*7f296bb3SBarry Smith```