| 12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182838485868788899091929394959697989910010110210310410510610710810911011111211311411511611711811912012112212312412512612712812913013113213313413513613713813914014114214314414514614714814915015115215315415515615715815916016116216316416516616716816917017117217317417517617717817918018118218318418518618718818919019119219319419519619719819920020120220320420520620720820921021121221321421521621721821922022122222322422522622722822923023123223323423523623723823924024124224324424524624724824925025125225325425525625725825926026126226326426526626726826927027127227327427527627727827928028128228328428528628728828929029129229329429529629729829930030130230330430530630730830931031131231331431531631731831932032132232332432532632732832933033133233333433533633733833934034134234334434534634734834935035135235335435535635735835936036136236336436536636736836937037137237337437537637737837938038138238338438538638738838939039139239339439539639739839940040140240340440540640740840941041141241341441541641741841942042142242342442542642742842943043143243343443543643743843944044144244344444544644744844945045145245345445545645745845946046146246346446546646746846947047147247347447547647747847948048148248348448548648748848949049149249349449549649749849950050150250350450550650750850951051151251351451551651751851952052152252352452552652752852953053153253353453553653753853954054154254354454554654754854955055155255355455555655755855956056156256356456556656756856957057157257357457557657757857958058158258358458558658758858959059159259359459559659759859960060160260360460560660760860961061161261361461561661761861962062162262362462562662762862963063163263363463563663763863964064164264364464564664764864965065165265365465565665765865966066166266366466566666766866967067167267367467567667767867968068168268368468568668768868969069169269369469569669769869970070170270370470570670770870971071171271371471571671771871972072172272372472572672772872973073173273373473573673773873974074174274374474574674774874975075175275375475575675775875976076176276376476576676776876977077177277377477577677777877978078178278378478578678778878979079179279379479579679779879980080180280380480580680780880981081181281381481581681781881982082182282382482582682782882983083183283383483583683783883984084184284384484584684784884985085185285385485585685785885986086186286386486586686786886987087187287387487587687787887988088188288388488588688788888989089189289389489589689789889990090190290390490590690790890991091191291391491591691791891992092192292392492592692792892993093193293393493593693793893994094194294394494594694794894995095195295395495595695795895996096196296396496596696796896997097197297397497597697797897998098198298398498598698798898999099199299399499599699799899910001001100210031004100510061007100810091010101110121013101410151016101710181019102010211022102310241025102610271028102910301031103210331034103510361037103810391040104110421043104410451046104710481049105010511052105310541055105610571058105910601061106210631064106510661067106810691070107110721073107410751076107710781079108010811082108310841085108610871088108910901091109210931094109510961097109810991100110111021103110411051106110711081109111011111112111311141115111611171118111911201121112211231124112511261127112811291130113111321133113411351136113711381139114011411142114311441145114611471148114911501151115211531154115511561157115811591160116111621163116411651166116711681169117011711172117311741175117611771178117911801181118211831184118511861187118811891190119111921193119411951196119711981199120012011202120312041205120612071208120912101211121212131214121512161217121812191220122112221223122412251226122712281229123012311232123312341235123612371238123912401241124212431244124512461247124812491250125112521253125412551256125712581259126012611262126312641265126612671268126912701271 | 
.. default-domain:: cpp.. cpp:namespace:: ceres.. _chapter-solving:==========Solver API==========Introduction============Effective use of Ceres requires some familiarity with the basiccomponents of a nonlinear least squares solver, so before we describehow to configure the solver, we will begin by taking a brief look athow some of the core optimization algorithms in Ceres work and thevarious linear solvers and preconditioners that power it.Let :math:`x \in \mathbb{R}^n` be an :math:`n`-dimensional vector ofvariables, and:math:`F(x) = \left[f_1(x), ... ,  f_{m}(x) \right]^{\top}` be a:math:`m`-dimensional function of :math:`x`.  We are interested insolving the following optimization problem [#f1]_ ... math:: \arg \min_x \frac{1}{2}\|F(x)\|^2\ .  :label: nonlinsqHere, the Jacobian :math:`J(x)` of :math:`F(x)` is an :math:`m\timesn` matrix, where :math:`J_{ij}(x) = \partial_j f_i(x)` and thegradient vector :math:`g(x) = \nabla \frac{1}{2}\|F(x)\|^2 = J(x)^\topF(x)`. Since the efficient global minimization of :eq:`nonlinsq` forgeneral :math:`F(x)` is an intractable problem, we will have to settlefor finding a local minimum.The general strategy when solving non-linear optimization problems isto solve a sequence of approximations to the original problem[NocedalWright]_. At each iteration, the approximation is solved todetermine a correction :math:`\Delta x` to the vector :math:`x`. Fornon-linear least squares, an approximation can be constructed by usingthe linearization :math:`F(x+\Delta x) \approx F(x) + J(x)\Delta x`,which leads to the following linear least squares problem:.. math:: \min_{\Delta x} \frac{1}{2}\|J(x)\Delta x + F(x)\|^2   :label: linearapproxUnfortunately, naively solving a sequence of these problems andupdating :math:`x \leftarrow x+ \Delta x` leads to an algorithm thatmay not converge.  To get a convergent algorithm, we need to controlthe size of the step :math:`\Delta x`. Depending on how the size ofthe step :math:`\Delta x` is controlled, non-linear optimizationalgorithms can be divided into two major categories [NocedalWright]_.1. **Trust Region** The trust region approach approximates the   objective function using using a model function (often a quadratic)   over a subset of the search space known as the trust region. If the   model function succeeds in minimizing the true objective function   the trust region is expanded; conversely, otherwise it is   contracted and the model optimization problem is solved again.2. **Line Search** The line search approach first finds a descent   direction along which the objective function will be reduced and   then computes a step size that decides how far should move along   that direction. The descent direction can be computed by various   methods, such as gradient descent, Newton's method and Quasi-Newton   method. The step size can be determined either exactly or   inexactly.Trust region methods are in some sense dual to line search methods:trust region methods first choose a step size (the size of the trustregion) and then a step direction while line search methods firstchoose a step direction and then a step size.Ceres implements multiple algorithms in both categories... _section-trust-region-methods:Trust Region Methods====================The basic trust region algorithm looks something like this.   1. Given an initial point :math:`x` and a trust region radius :math:`\mu`.   2. :math:`\arg \min_{\Delta x} \frac{1}{2}\|J(x)\Delta      x + F(x)\|^2` s.t. :math:`\|D(x)\Delta x\|^2 \le \mu`   3. :math:`\rho = \frac{\displaystyle \|F(x + \Delta x)\|^2 -      \|F(x)\|^2}{\displaystyle \|J(x)\Delta x + F(x)\|^2 -      \|F(x)\|^2}`   4. if :math:`\rho > \epsilon` then  :math:`x = x + \Delta x`.   5. if :math:`\rho > \eta_1` then :math:`\rho = 2  \rho`   6. else if :math:`\rho < \eta_2` then :math:`\rho = 0.5 * \rho`   7. Goto 2.Here, :math:`\mu` is the trust region radius, :math:`D(x)` is somematrix used to define a metric on the domain of :math:`F(x)` and:math:`\rho` measures the quality of the step :math:`\Delta x`, i.e.,how well did the linear model predict the decrease in the value of thenon-linear objective. The idea is to increase or decrease the radiusof the trust region depending on how well the linearization predictsthe behavior of the non-linear objective, which in turn is reflectedin the value of :math:`\rho`.The key computational step in a trust-region algorithm is the solutionof the constrained optimization problem.. math:: \arg\min_{\Delta x} \frac{1}{2}\|J(x)\Delta x +  F(x)\|^2\quad \text{such that}\quad  \|D(x)\Delta x\|^2 \le \mu   :label: trpThere are a number of different ways of solving this problem, eachgiving rise to a different concrete trust-region algorithm. CurrentlyCeres, implements two trust-region algorithms - Levenberg-Marquardtand Dogleg. The user can choose between them by setting:member:`Solver::Options::trust_region_strategy_type`... rubric:: Footnotes.. [#f1] At the level of the non-linear solver, the block and         structure is not relevant, therefore our discussion here is         in terms of an optimization problem defined over a state         vector of size :math:`n`... _section-levenberg-marquardt:Levenberg-Marquardt-------------------The Levenberg-Marquardt algorithm [Levenberg]_ [Marquardt]_ is themost popular algorithm for solving non-linear least squares problems.It was also the first trust region algorithm to be developed[Levenberg]_ [Marquardt]_. Ceres implements an exact step [Madsen]_and an inexact step variant of the Levenberg-Marquardt algorithm[WrightHolt]_ [NashSofer]_.It can be shown, that the solution to :eq:`trp` can be obtained bysolving an unconstrained optimization of the form.. math:: \arg\min_{\Delta x}& \frac{1}{2}\|J(x)\Delta x + F(x)\|^2 +\lambda  \|D(x)\Delta x\|^2Where, :math:`\lambda` is a Lagrange multiplier that is inverserelated to :math:`\mu`. In Ceres, we solve for.. math:: \arg\min_{\Delta x}& \frac{1}{2}\|J(x)\Delta x + F(x)\|^2 + \frac{1}{\mu} \|D(x)\Delta x\|^2   :label: lsqrThe matrix :math:`D(x)` is a non-negative diagonal matrix, typicallythe square root of the diagonal of the matrix :math:`J(x)^\top J(x)`.Before going further, let us make some notational simplifications. Wewill assume that the matrix :math:`\sqrt{\mu} D` has been concatenatedat the bottom of the matrix :math:`J` and similarly a vector of zeroshas been added to the bottom of the vector :math:`f` and the rest ofour discussion will be in terms of :math:`J` and :math:`f`, i.e, thelinear least squares problem... math:: \min_{\Delta x} \frac{1}{2} \|J(x)\Delta x + f(x)\|^2 .   :label: simpleFor all but the smallest problems the solution of :eq:`simple` ineach iteration of the Levenberg-Marquardt algorithm is the dominantcomputational cost in Ceres. Ceres provides a number of differentoptions for solving :eq:`simple`. There are two major classes ofmethods - factorization and iterative.The factorization methods are based on computing an exact solution of:eq:`lsqr` using a Cholesky or a QR factorization and lead to an exactstep Levenberg-Marquardt algorithm. But it is not clear if an exactsolution of :eq:`lsqr` is necessary at each step of the LM algorithmto solve :eq:`nonlinsq`. In fact, we have already seen evidencethat this may not be the case, as :eq:`lsqr` is itself a regularizedversion of :eq:`linearapprox`. Indeed, it is possible toconstruct non-linear optimization algorithms in which the linearizedproblem is solved approximately. These algorithms are known as inexactNewton or truncated Newton methods [NocedalWright]_.An inexact Newton method requires two ingredients. First, a cheapmethod for approximately solving systems of linearequations. Typically an iterative linear solver like the ConjugateGradients method is used for thispurpose [NocedalWright]_. Second, a termination rule forthe iterative solver. A typical termination rule is of the form.. math:: \|H(x) \Delta x + g(x)\| \leq \eta_k \|g(x)\|.   :label: inexactHere, :math:`k` indicates the Levenberg-Marquardt iteration number and:math:`0 < \eta_k <1` is known as the forcing sequence.  [WrightHolt]_prove that a truncated Levenberg-Marquardt algorithm that uses aninexact Newton step based on :eq:`inexact` converges for anysequence :math:`\eta_k \leq \eta_0 < 1` and the rate of convergencedepends on the choice of the forcing sequence :math:`\eta_k`.Ceres supports both exact and inexact step solution strategies. Whenthe user chooses a factorization based linear solver, the exact stepLevenberg-Marquardt algorithm is used. When the user chooses aniterative linear solver, the inexact step Levenberg-Marquardtalgorithm is used... _section-dogleg:Dogleg------Another strategy for solving the trust region problem :eq:`trp` wasintroduced by M. J. D. Powell. The key idea there is to compute twovectors.. math::        \Delta x^{\text{Gauss-Newton}} &= \arg \min_{\Delta x}\frac{1}{2} \|J(x)\Delta x + f(x)\|^2.\\        \Delta x^{\text{Cauchy}} &= -\frac{\|g(x)\|^2}{\|J(x)g(x)\|^2}g(x).Note that the vector :math:`\Delta x^{\text{Gauss-Newton}}` is thesolution to :eq:`linearapprox` and :math:`\Deltax^{\text{Cauchy}}` is the vector that minimizes the linearapproximation if we restrict ourselves to moving along the directionof the gradient. Dogleg methods finds a vector :math:`\Delta x`defined by :math:`\Delta x^{\text{Gauss-Newton}}` and :math:`\Deltax^{\text{Cauchy}}` that solves the trust region problem. Ceressupports two variants that can be chose by setting:member:`Solver::Options::dogleg_type`.``TRADITIONAL_DOGLEG`` as described by Powell, constructs two linesegments using the Gauss-Newton and Cauchy vectors and finds the pointfarthest along this line shaped like a dogleg (hence the name) that iscontained in the trust-region. For more details on the exact reasoningand computations, please see Madsen et al [Madsen]_.``SUBSPACE_DOGLEG`` is a more sophisticated method that considers theentire two dimensional subspace spanned by these two vectors and findsthe point that minimizes the trust region problem in this subspace[ByrdSchnabel]_.The key advantage of the Dogleg over Levenberg Marquardt is that ifthe step computation for a particular choice of :math:`\mu` does notresult in sufficient decrease in the value of the objective function,Levenberg-Marquardt solves the linear approximation from scratch witha smaller value of :math:`\mu`. Dogleg on the other hand, only needsto compute the interpolation between the Gauss-Newton and the Cauchyvectors, as neither of them depend on the value of :math:`\mu`.The Dogleg method can only be used with the exact factorization basedlinear solvers... _section-inner-iterations:Inner Iterations----------------Some non-linear least squares problems have additional structure inthe way the parameter blocks interact that it is beneficial to modifythe way the trust region step is computed. e.g., consider thefollowing regression problem.. math::   y = a_1 e^{b_1 x} + a_2 e^{b_3 x^2 + c_1}Given a set of pairs :math:`\{(x_i, y_i)\}`, the user wishes to estimate:math:`a_1, a_2, b_1, b_2`, and :math:`c_1`.Notice that the expression on the left is linear in :math:`a_1` and:math:`a_2`, and given any value for :math:`b_1, b_2` and :math:`c_1`,it is possible to use linear regression to estimate the optimal valuesof :math:`a_1` and :math:`a_2`. It's possible to analyticallyeliminate the variables :math:`a_1` and :math:`a_2` from the problementirely. Problems like these are known as separable least squaresproblem and the most famous algorithm for solving them is the VariableProjection algorithm invented by Golub & Pereyra [GolubPereyra]_.Similar structure can be found in the matrix factorization withmissing data problem. There the corresponding algorithm is known asWiberg's algorithm [Wiberg]_.Ruhe & Wedin present an analysis of various algorithms for solvingseparable non-linear least squares problems and refer to *VariableProjection* as Algorithm I in their paper [RuheWedin]_.Implementing Variable Projection is tedious and expensive. Ruhe &Wedin present a simpler algorithm with comparable convergenceproperties, which they call Algorithm II.  Algorithm II performs anadditional optimization step to estimate :math:`a_1` and :math:`a_2`exactly after computing a successful Newton step.This idea can be generalized to cases where the residual is notlinear in :math:`a_1` and :math:`a_2`, i.e.,.. math:: y = f_1(a_1, e^{b_1 x}) + f_2(a_2, e^{b_3 x^2 + c_1})In this case, we solve for the trust region step for the full problem,and then use it as the starting point to further optimize just `a_1`and `a_2`. For the linear case, this amounts to doing a single linearleast squares solve. For non-linear problems, any method for solvingthe `a_1` and `a_2` optimization problems will do. The only constrainton `a_1` and `a_2` (if they are two different parameter block) is thatthey do not co-occur in a residual block.This idea can be further generalized, by not just optimizing:math:`(a_1, a_2)`, but decomposing the graph corresponding to theHessian matrix's sparsity structure into a collection ofnon-overlapping independent sets and optimizing each of them.Setting :member:`Solver::Options::use_inner_iterations` to ``true``enables the use of this non-linear generalization of Ruhe & Wedin'sAlgorithm II.  This version of Ceres has a higher iterationcomplexity, but also displays better convergence behavior periteration.Setting :member:`Solver::Options::num_threads` to the maximum numberpossible is highly recommended... _section-non-monotonic-steps:Non-monotonic Steps-------------------Note that the basic trust-region algorithm described inAlgorithm~\ref{alg:trust-region} is a descent algorithm in that theyonly accepts a point if it strictly reduces the value of the objectivefunction.Relaxing this requirement allows the algorithm to be more efficient inthe long term at the cost of some local increase in the value of theobjective function.This is because allowing for non-decreasing objective function valuesin a princpled manner allows the algorithm to *jump over boulders* asthe method is not restricted to move into narrow valleys whilepreserving its convergence properties.Setting :member:`Solver::Options::use_nonmonotonic_steps` to ``true``enables the non-monotonic trust region algorithm as described by Conn,Gould & Toint in [Conn]_.Even though the value of the objective function may be largerthan the minimum value encountered over the course of theoptimization, the final parameters returned to the user are theones corresponding to the minimum cost over all iterations.The option to take non-monotonic steps is available for all trustregion strategies... _section-line-search-methods:Line Search Methods===================**The implementation of line search algorithms in Ceres Solver isfairly new and not very well tested, so for now this part of thesolver should be considered beta quality. We welcome reports of yourexperiences both good and bad on the mailinglist.**Line search algorithms   1. Given an initial point :math:`x`   2. :math:`\Delta x = -H^{-1}(x) g(x)`   3. :math:`\arg \min_\mu \frac{1}{2} \| F(x + \mu \Delta x) \|^2`   4. :math:`x = x + \mu \Delta x`   5. Goto 2.Here :math:`H(x)` is some approximation to the Hessian of theobjective function, and :math:`g(x)` is the gradient at:math:`x`. Depending on the choice of :math:`H(x)` we get a variety ofdifferent search directions -`\Delta x`.Step 4, which is a one dimensional optimization or `Line Search` along:math:`\Delta x` is what gives this class of methods its name.Different line search algorithms differ in their choice of the searchdirection :math:`\Delta x` and the method used for one dimensionaloptimization along :math:`\Delta x`. The choice of :math:`H(x)` is theprimary source of computational complexity in thesemethods. Currently, Ceres Solver supports three choices of searchdirections, all aimed at large scale problems.1. ``STEEPEST_DESCENT`` This corresponds to choosing :math:`H(x)` to   be the identity matrix. This is not a good search direction for   anything but the simplest of the problems. It is only included here   for completeness.2. ``NONLINEAR_CONJUGATE_GRADIENT`` A generalization of the Conjugate   Gradient method to non-linear functions. The generalization can be   performed in a number of different ways, resulting in a variety of   search directions. Ceres Solver currently supports   ``FLETCHER_REEVES``, ``POLAK_RIBIRERE`` and ``HESTENES_STIEFEL``   directions.3. ``LBFGS`` In this method, a limited memory approximation to the   inverse Hessian is maintained and used to compute a quasi-Newton   step [Nocedal]_, [ByrdNocedal]_.Currently Ceres Solver uses a backtracking and interpolation basedArmijo line search algorithm... _section-linear-solver:LinearSolver============Recall that in both of the trust-region methods described above, thekey computational cost is the solution of a linear least squaresproblem of the form.. math:: \min_{\Delta x} \frac{1}{2} \|J(x)\Delta x + f(x)\|^2 .   :label: simple2Let :math:`H(x)= J(x)^\top J(x)` and :math:`g(x) = -J(x)^\topf(x)`. For notational convenience let us also drop the dependence on:math:`x`. Then it is easy to see that solving :eq:`simple2` isequivalent to solving the *normal equations*... math:: H \Delta x = g   :label: normalCeres provides a number of different options for solving :eq:`normal`... _section-qr:``DENSE_QR``------------For small problems (a couple of hundred parameters and a few thousandresiduals) with relatively dense Jacobians, ``DENSE_QR`` is the methodof choice [Bjorck]_. Let :math:`J = QR` be the QR-decomposition of:math:`J`, where :math:`Q` is an orthonormal matrix and :math:`R` isan upper triangular matrix [TrefethenBau]_. Then it can be shown thatthe solution to :eq:`normal` is given by.. math:: \Delta x^* = -R^{-1}Q^\top fCeres uses ``Eigen`` 's dense QR factorization routines... _section-cholesky:``DENSE_NORMAL_CHOLESKY`` & ``SPARSE_NORMAL_CHOLESKY``------------------------------------------------------Large non-linear least square problems are usually sparse. In suchcases, using a dense QR factorization is inefficient. Let :math:`H =R^\top R` be the Cholesky factorization of the normal equations, where:math:`R` is an upper triangular matrix, then the solution to:eq:`normal` is given by.. math::    \Delta x^* = R^{-1} R^{-\top} g.The observant reader will note that the :math:`R` in the Choleskyfactorization of :math:`H` is the same upper triangular matrix:math:`R` in the QR factorization of :math:`J`. Since :math:`Q` is anorthonormal matrix, :math:`J=QR` implies that :math:`J^\top J = R^\topQ^\top Q R = R^\top R`. There are two variants of Choleskyfactorization -- sparse and dense.``DENSE_NORMAL_CHOLESKY``  as the name implies performs a denseCholesky factorization of the normal equations. Ceres uses``Eigen`` 's dense LDLT factorization routines.``SPARSE_NORMAL_CHOLESKY``, as the name implies performs a sparseCholesky factorization of the normal equations. This leads tosubstantial savings in time and memory for large sparseproblems. Ceres uses the sparse Cholesky factorization routines inProfessor Tim Davis' ``SuiteSparse`` or ``CXSparse`` packages [Chen]_... _section-schur:``DENSE_SCHUR`` & ``SPARSE_SCHUR``----------------------------------While it is possible to use ``SPARSE_NORMAL_CHOLESKY`` to solve bundleadjustment problems, bundle adjustment problem have a specialstructure, and a more efficient scheme for solving :eq:`normal`can be constructed.Suppose that the SfM problem consists of :math:`p` cameras and:math:`q` points and the variable vector :math:`x` has the blockstructure :math:`x = [y_{1}, ... ,y_{p},z_{1}, ... ,z_{q}]`. Where,:math:`y` and :math:`z` correspond to camera and point parameters,respectively.  Further, let the camera blocks be of size :math:`c` andthe point blocks be of size :math:`s` (for most problems :math:`c` =:math:`6`--`9` and :math:`s = 3`). Ceres does not impose any constancyrequirement on these block sizes, but choosing them to be constantsimplifies the exposition.A key characteristic of the bundle adjustment problem is that there isno term :math:`f_{i}` that includes two or more point blocks.  This inturn implies that the matrix :math:`H` is of the form.. math:: H = \left[ \begin{matrix} B & E\\ E^\top & C \end{matrix} \right]\ ,   :label: hblockwhere, :math:`B \in \mathbb{R}^{pc\times pc}` is a block sparse matrixwith :math:`p` blocks of size :math:`c\times c` and :math:`C \in\mathbb{R}^{qs\times qs}` is a block diagonal matrix with :math:`q` blocksof size :math:`s\times s`. :math:`E \in \mathbb{R}^{pc\times qs}` is ageneral block sparse matrix, with a block of size :math:`c\times s`for each observation. Let us now block partition :math:`\Delta x =[\Delta y,\Delta z]` and :math:`g=[v,w]` to restate :eq:`normal`as the block structured linear system.. math:: \left[ \begin{matrix} B & E\\ E^\top & C \end{matrix}                \right]\left[ \begin{matrix} \Delta y \\ \Delta z            	    \end{matrix} \right] = \left[ \begin{matrix} v\\ w                    \end{matrix} \right]\ ,   :label: linear2and apply Gaussian elimination to it. As we noted above, :math:`C` isa block diagonal matrix, with small diagonal blocks of size:math:`s\times s`.  Thus, calculating the inverse of :math:`C` byinverting each of these blocks is cheap. This allows us to eliminate:math:`\Delta z` by observing that :math:`\Delta z = C^{-1}(w - E^\top\Delta y)`, giving us.. math:: \left[B - EC^{-1}E^\top\right] \Delta y = v - EC^{-1}w\ .   :label: schurThe matrix.. math:: S = B - EC^{-1}E^\topis the Schur complement of :math:`C` in :math:`H`. It is also known asthe *reduced camera matrix*, because the only variablesparticipating in :eq:`schur` are the ones corresponding to thecameras. :math:`S \in \mathbb{R}^{pc\times pc}` is a block structuredsymmetric positive definite matrix, with blocks of size :math:`c\timesc`. The block :math:`S_{ij}` corresponding to the pair of images:math:`i` and :math:`j` is non-zero if and only if the two imagesobserve at least one common point.Now, eq-linear2 can be solved by first forming :math:`S`, solving for:math:`\Delta y`, and then back-substituting :math:`\Delta y` toobtain the value of :math:`\Delta z`.  Thus, the solution of what wasan :math:`n\times n`, :math:`n=pc+qs` linear system is reduced to theinversion of the block diagonal matrix :math:`C`, a few matrix-matrixand matrix-vector multiplies, and the solution of block sparse:math:`pc\times pc` linear system :eq:`schur`.  For almost allproblems, the number of cameras is much smaller than the number ofpoints, :math:`p \ll q`, thus solving :eq:`schur` issignificantly cheaper than solving :eq:`linear2`. This is the*Schur complement trick* [Brown]_.This still leaves open the question of solving :eq:`schur`. Themethod of choice for solving symmetric positive definite systemsexactly is via the Cholesky factorization [TrefethenBau]_ anddepending upon the structure of the matrix, there are, in general, twooptions. The first is direct factorization, where we store and factor:math:`S` as a dense matrix [TrefethenBau]_. This method has:math:`O(p^2)` space complexity and :math:`O(p^3)` time complexity andis only practical for problems with up to a few hundred cameras. Ceresimplements this strategy as the ``DENSE_SCHUR`` solver.But, :math:`S` is typically a fairly sparse matrix, as most imagesonly see a small fraction of the scene. This leads us to the secondoption: Sparse Direct Methods. These methods store :math:`S` as asparse matrix, use row and column re-ordering algorithms to maximizethe sparsity of the Cholesky decomposition, and focus their computeeffort on the non-zero part of the factorization [Chen]_. Sparsedirect methods, depending on the exact sparsity structure of the Schurcomplement, allow bundle adjustment algorithms to significantly scaleup over those based on dense factorization. Ceres implements thisstrategy as the ``SPARSE_SCHUR`` solver... _section-cgnr:``CGNR``--------For general sparse problems, if the problem is too large for``CHOLMOD`` or a sparse linear algebra library is not linked intoCeres, another option is the ``CGNR`` solver. This solver uses theConjugate Gradients solver on the *normal equations*, but withoutforming the normal equations explicitly. It exploits the relation.. math::    H x = J^\top J x = J^\top(J x)When the user chooses ``ITERATIVE_SCHUR`` as the linear solver, Ceresautomatically switches from the exact step algorithm to an inexactstep algorithm... _section-iterative_schur:``ITERATIVE_SCHUR``-------------------Another option for bundle adjustment problems is to apply PCG to thereduced camera matrix :math:`S` instead of :math:`H`. One reason to dothis is that :math:`S` is a much smaller matrix than :math:`H`, butmore importantly, it can be shown that :math:`\kappa(S)\leq\kappa(H)`.  Cseres implements PCG on :math:`S` as the``ITERATIVE_SCHUR`` solver. When the user chooses ``ITERATIVE_SCHUR``as the linear solver, Ceres automatically switches from the exact stepalgorithm to an inexact step algorithm.The cost of forming and storing the Schur complement :math:`S` can beprohibitive for large problems. Indeed, for an inexact Newton solverthat computes :math:`S` and runs PCG on it, almost all of its time isspent in constructing :math:`S`; the time spent inside the PCGalgorithm is negligible in comparison. Because PCG only needs accessto :math:`S` via its product with a vector, one way to evaluate:math:`Sx` is to observe that.. math::  x_1 &= E^\top x.. math::  x_2 &= C^{-1} x_1.. math::  x_3 &= Ex_2\\.. math::  x_4 &= Bx\\.. math::   Sx &= x_4 - x_3   :label: schurtrick1Thus, we can run PCG on :math:`S` with the same computational effortper iteration as PCG on :math:`H`, while reaping the benefits of amore powerful preconditioner. In fact, we do not even need to compute:math:`H`, :eq:`schurtrick1` can be implemented using just the columnsof :math:`J`.Equation :eq:`schurtrick1` is closely related to *DomainDecomposition methods* for solving large linear systems that arise instructural engineering and partial differential equations. In thelanguage of Domain Decomposition, each point in a bundle adjustmentproblem is a domain, and the cameras form the interface between thesedomains. The iterative solution of the Schur complement then fallswithin the sub-category of techniques known as IterativeSub-structuring [Saad]_ [Mathew]_... _section-preconditioner:Preconditioner--------------The convergence rate of Conjugate Gradients forsolving :eq:`normal` depends on the distribution of eigenvaluesof :math:`H` [Saad]_. A useful upper bound is:math:`\sqrt{\kappa(H)}`, where, :math:`\kappa(H)` is the conditionnumber of the matrix :math:`H`. For most bundle adjustment problems,:math:`\kappa(H)` is high and a direct application of ConjugateGradients to :eq:`normal` results in extremely poor performance.The solution to this problem is to replace :eq:`normal` with a*preconditioned* system.  Given a linear system, :math:`Ax =b` and apreconditioner :math:`M` the preconditioned system is given by:math:`M^{-1}Ax = M^{-1}b`. The resulting algorithm is known asPreconditioned Conjugate Gradients algorithm (PCG) and its worst casecomplexity now depends on the condition number of the *preconditioned*matrix :math:`\kappa(M^{-1}A)`.The computational cost of using a preconditioner :math:`M` is the costof computing :math:`M` and evaluating the product :math:`M^{-1}y` forarbitrary vectors :math:`y`. Thus, there are two competing factors toconsider: How much of :math:`H`'s structure is captured by :math:`M`so that the condition number :math:`\kappa(HM^{-1})` is low, and thecomputational cost of constructing and using :math:`M`.  The idealpreconditioner would be one for which :math:`\kappa(M^{-1}A)=1`. :math:`M=A` achieves this, but it is not a practical choice, asapplying this preconditioner would require solving a linear systemequivalent to the unpreconditioned problem.  It is usually the casethat the more information :math:`M` has about :math:`H`, the moreexpensive it is use. For example, Incomplete Cholesky factorizationbased preconditioners have much better convergence behavior than theJacobi preconditioner, but are also much more expensive.The simplest of all preconditioners is the diagonal or Jacobipreconditioner, i.e., :math:`M=\operatorname{diag}(A)`, which forblock structured matrices like :math:`H` can be generalized to theblock Jacobi preconditioner.For ``ITERATIVE_SCHUR`` there are two obvious choices for blockdiagonal preconditioners for :math:`S`. The block diagonal of thematrix :math:`B` [Mandel]_ and the block diagonal :math:`S`, i.e, theblock Jacobi preconditioner for :math:`S`. Ceres's implements both ofthese preconditioners and refers to them as ``JACOBI`` and``SCHUR_JACOBI`` respectively.For bundle adjustment problems arising in reconstruction fromcommunity photo collections, more effective preconditioners can beconstructed by analyzing and exploiting the camera-point visibilitystructure of the scene [KushalAgarwal]. Ceres implements the twovisibility based preconditioners described by Kushal & Agarwal as``CLUSTER_JACOBI`` and ``CLUSTER_TRIDIAGONAL``. These are fairly newpreconditioners and Ceres' implementation of them is in its earlystages and is not as mature as the other preconditioners describedabove... _section-ordering:Ordering--------The order in which variables are eliminated in a linear solver canhave a significant of impact on the efficiency and accuracy of themethod. For example when doing sparse Cholesky factorization, thereare matrices for which a good ordering will give a Cholesky factorwith :math:`O(n)` storage, where as a bad ordering will result in ancompletely dense factor.Ceres allows the user to provide varying amounts of hints to thesolver about the variable elimination ordering to use. This can rangefrom no hints, where the solver is free to decide the best orderingbased on the user's choices like the linear solver being used, to anexact order in which the variables should be eliminated, and a varietyof possibilities in between.Instances of the :class:`ParameterBlockOrdering` class are used tocommunicate this information to Ceres.Formally an ordering is an ordered partitioning of the parameterblocks. Each parameter block belongs to exactly one group, and eachgroup has a unique integer associated with it, that determines itsorder in the set of groups. We call these groups *Elimination Groups*Given such an ordering, Ceres ensures that the parameter blocks in thelowest numbered elimination group are eliminated first, and then theparameter blocks in the next lowest numbered elimination group and soon. Within each elimination group, Ceres is free to order theparameter blocks as it chooses. e.g. Consider the linear system.. math::  x + y &= 3\\  2x + 3y &= 7There are two ways in which it can be solved. First eliminating:math:`x` from the two equations, solving for y and then backsubstituting for :math:`x`, or first eliminating :math:`y`, solvingfor :math:`x` and back substituting for :math:`y`. The user canconstruct three orderings here.1. :math:`\{0: x\}, \{1: y\}` : Eliminate :math:`x` first.2. :math:`\{0: y\}, \{1: x\}` : Eliminate :math:`y` first.3. :math:`\{0: x, y\}`        : Solver gets to decide the elimination order.Thus, to have Ceres determine the ordering automatically usingheuristics, put all the variables in the same elimination group. Theidentity of the group does not matter. This is the same as notspecifying an ordering at all. To control the ordering for everyvariable, create an elimination group per variable, ordering them inthe desired order.If the user is using one of the Schur solvers (``DENSE_SCHUR``,``SPARSE_SCHUR``, ``ITERATIVE_SCHUR``) and chooses to specify anordering, it must have one important property. The lowest numberedelimination group must form an independent set in the graphcorresponding to the Hessian, or in other words, no two parameterblocks in in the first elimination group should co-occur in the sameresidual block. For the best performance, this elimination groupshould be as large as possible. For standard bundle adjustmentproblems, this corresponds to the first elimination group containingall the 3d points, and the second containing the all the camerasparameter blocks.If the user leaves the choice to Ceres, then the solver uses anapproximate maximum independent set algorithm to identify the firstelimination group [LiSaad]_... _section-solver-options::class:`Solver::Options`------------------------.. class:: Solver::Options  :class:`Solver::Options` controls the overall behavior of the  solver. We list the various settings and their default values below... member:: MinimizerType Solver::Options::minimizer_type   Default: ``TRUST_REGION``   Choose between ``LINE_SEARCH`` and ``TRUST_REGION`` algorithms. See   :ref:`section-trust-region-methods` and   :ref:`section-line-search-methods` for more details... member:: LineSearchDirectionType Solver::Options::line_search_direction_type   Default: ``LBFGS``   Choices are ``STEEPEST_DESCENT``, ``NONLINEAR_CONJUGATE_GRADIENT``   and ``LBFGS``... member:: LineSearchType Solver::Options::line_search_type   Default: ``ARMIJO``   ``ARMIJO`` is the only choice right now... member:: NonlinearConjugateGradientType Solver::Options::nonlinear_conjugate_gradient_type   Default: ``FLETCHER_REEVES``   Choices are ``FLETCHER_REEVES``, ``POLAK_RIBIRERE`` and   ``HESTENES_STIEFEL``... member:: int Solver::Options::max_lbfs_rank   Default: 20   The LBFGS hessian approximation is a low rank approximation to the   inverse of the Hessian matrix. The rank of the approximation   determines (linearly) the space and time complexity of using the   approximation. Higher the rank, the better is the quality of the   approximation. The increase in quality is however is bounded for a   number of reasons.     1. The method only uses secant information and not actual        derivatives.     2. The Hessian approximation is constrained to be positive        definite.   So increasing this rank to a large number will cost time and space   complexity without the corresponding increase in solution   quality. There are no hard and fast rules for choosing the maximum   rank. The best choice usually requires some problem specific   experimentation... member:: TrustRegionStrategyType Solver::Options::trust_region_strategy_type   Default: ``LEVENBERG_MARQUARDT``   The trust region step computation algorithm used by   Ceres. Currently ``LEVENBERG_MARQUARDT`` and ``DOGLEG`` are the two   valid choices. See :ref:`section-levenberg-marquardt` and   :ref:`section-dogleg` for more details... member:: DoglegType Solver::Options::dogleg_type   Default: ``TRADITIONAL_DOGLEG``   Ceres supports two different dogleg strategies.   ``TRADITIONAL_DOGLEG`` method by Powell and the ``SUBSPACE_DOGLEG``   method described by [ByrdSchnabel]_ .  See :ref:`section-dogleg`   for more details... member:: bool Solver::Options::use_nonmonotonic_steps   Default: ``false``   Relax the requirement that the trust-region algorithm take strictly   decreasing steps. See :ref:`section-non-monotonic-steps` for more   details... member:: int Solver::Options::max_consecutive_nonmonotonic_steps   Default: ``5``   The window size used by the step selection algorithm to accept   non-monotonic steps... member:: int Solver::Options::max_num_iterations   Default: ``50``   Maximum number of iterations for which the solver should run... member:: double Solver::Options::max_solver_time_in_seconds   Default: ``1e6``   Maximum amount of time for which the solver should run... member:: int Solver::Options::num_threads   Default: ``1``   Number of threads used by Ceres to evaluate the Jacobian... member::  double Solver::Options::initial_trust_region_radius   Default: ``1e4``   The size of the initial trust region. When the   ``LEVENBERG_MARQUARDT`` strategy is used, the reciprocal of this   number is the initial regularization parameter... member:: double Solver::Options::max_trust_region_radius   Default: ``1e16``   The trust region radius is not allowed to grow beyond this value... member:: double Solver::Options::min_trust_region_radius   Default: ``1e-32``   The solver terminates, when the trust region becomes smaller than   this value... member:: double Solver::Options::min_relative_decrease   Default: ``1e-3``   Lower threshold for relative decrease before a trust-region step is   acceped... member:: double Solver::Options::lm_min_diagonal   Default: ``1e6``   The ``LEVENBERG_MARQUARDT`` strategy, uses a diagonal matrix to   regularize the the trust region step. This is the lower bound on   the values of this diagonal matrix... member:: double Solver::Options::lm_max_diagonal   Default:  ``1e32``   The ``LEVENBERG_MARQUARDT`` strategy, uses a diagonal matrix to   regularize the the trust region step. This is the upper bound on   the values of this diagonal matrix... member:: int Solver::Options::max_num_consecutive_invalid_steps   Default: ``5``   The step returned by a trust region strategy can sometimes be   numerically invalid, usually because of conditioning   issues. Instead of crashing or stopping the optimization, the   optimizer can go ahead and try solving with a smaller trust   region/better conditioned problem. This parameter sets the number   of consecutive retries before the minimizer gives up... member:: double Solver::Options::function_tolerance   Default: ``1e-6``   Solver terminates if   .. math:: \frac{|\Delta \text{cost}|}{\text{cost} < \text{function_tolerance}}   where, :math:`\Delta \text{cost}` is the change in objective function   value (up or down) in the current iteration of Levenberg-Marquardt... member:: double Solver::Options::gradient_tolerance   Default: ``1e-10``   Solver terminates if   .. math:: \frac{\|g(x)\|_\infty}{\|g(x_0)\|_\infty} < \text{gradient_tolerance}   where :math:`\|\cdot\|_\infty` refers to the max norm, and :math:`x_0` is   the vector of initial parameter values... member:: double Solver::Options::parameter_tolerance   Default: ``1e-8``   Solver terminates if   .. math:: \|\Delta x\| < (\|x\| + \text{parameter_tolerance}) * \text{parameter_tolerance}   where :math:`\Delta x` is the step computed by the linear solver in the   current iteration of Levenberg-Marquardt... member:: LinearSolverType Solver::Options::linear_solver_type   Default: ``SPARSE_NORMAL_CHOLESKY`` / ``DENSE_QR``   Type of linear solver used to compute the solution to the linear   least squares problem in each iteration of the Levenberg-Marquardt   algorithm. If Ceres is build with ``SuiteSparse`` linked in then   the default is ``SPARSE_NORMAL_CHOLESKY``, it is ``DENSE_QR``   otherwise... member:: PreconditionerType Solver::Options::preconditioner_type   Default: ``JACOBI``   The preconditioner used by the iterative linear solver. The default   is the block Jacobi preconditioner. Valid values are (in increasing   order of complexity) ``IDENTITY``, ``JACOBI``, ``SCHUR_JACOBI``,   ``CLUSTER_JACOBI`` and ``CLUSTER_TRIDIAGONAL``. See   :ref:`section-preconditioner` for more details... member:: SparseLinearAlgebraLibrary Solver::Options::sparse_linear_algebra_library   Default:``SUITE_SPARSE``   Ceres supports the use of two sparse linear algebra libraries,   ``SuiteSparse``, which is enabled by setting this parameter to   ``SUITE_SPARSE`` and ``CXSparse``, which can be selected by setting   this parameter to ```CX_SPARSE``. ``SuiteSparse`` is a   sophisticated and complex sparse linear algebra library and should   be used in general. If your needs/platforms prevent you from using   ``SuiteSparse``, consider using ``CXSparse``, which is a much   smaller, easier to build library. As can be expected, its   performance on large problems is not comparable to that of   ``SuiteSparse``... member:: int Solver::Options::num_linear_solver_threads   Default: ``1``   Number of threads used by the linear solver... member:: bool Solver::Options::use_inner_iterations   Default: ``false``   Use a non-linear version of a simplified variable projection   algorithm. Essentially this amounts to doing a further optimization   on each Newton/Trust region step using a coordinate descent   algorithm.  For more details, see :ref:`section-inner-iterations`... member:: ParameterBlockOrdering*  Solver::Options::inner_iteration_ordering   Default: ``NULL``   If :member:`Solver::Options::use_inner_iterations` true, then the user has   two choices.   1. Let the solver heuristically decide which parameter blocks to      optimize in each inner iteration. To do this, set      :member:`Solver::Options::inner_iteration_ordering` to ``NULL``.   2. Specify a collection of of ordered independent sets. The lower      numbered groups are optimized before the higher number groups      during the inner optimization phase. Each group must be an      independent set.   See :ref:`section-ordering` for more details... member:: ParameterBlockOrdering* Solver::Options::linear_solver_ordering   Default: ``NULL``   An instance of the ordering object informs the solver about the   desired order in which parameter blocks should be eliminated by the   linear solvers. See section~\ref{sec:ordering`` for more details.   If ``NULL``, the solver is free to choose an ordering that it   thinks is best. Note: currently, this option only has an effect on   the Schur type solvers, support for the ``SPARSE_NORMAL_CHOLESKY``   solver is forth coming.   See :ref:`section-ordering` for more details... member:: bool Solver::Options::use_block_amd   Default: ``true``   By virtue of the modeling layer in Ceres being block oriented, all   the matrices used by Ceres are also block oriented.  When doing   sparse direct factorization of these matrices, the fill-reducing   ordering algorithms can either be run on the block or the scalar   form of these matrices. Running it on the block form exposes more   of the super-nodal structure of the matrix to the Cholesky   factorization routines. This leads to substantial gains in   factorization performance. Setting this parameter to true, enables   the use of a block oriented Approximate Minimum Degree ordering   algorithm. Settings it to ``false``, uses a scalar AMD   algorithm. This option only makes sense when using   :member:`Solver::Options::sparse_linear_algebra_library` = ``SUITE_SPARSE``   as it uses the ``AMD`` package that is part of ``SuiteSparse``... member:: int Solver::Options::linear_solver_min_num_iterations   Default: ``1``   Minimum number of iterations used by the linear solver. This only   makes sense when the linear solver is an iterative solver, e.g.,   ``ITERATIVE_SCHUR`` or ``CGNR``... member:: int Solver::Options::linear_solver_max_num_iterations   Default: ``500``   Minimum number of iterations used by the linear solver. This only   makes sense when the linear solver is an iterative solver, e.g.,   ``ITERATIVE_SCHUR`` or ``CGNR``... member:: double Solver::Options::eta   Default: ``1e-1``   Forcing sequence parameter. The truncated Newton solver uses this   number to control the relative accuracy with which the Newton step   is computed. This constant is passed to   ``ConjugateGradientsSolver`` which uses it to terminate the   iterations when   .. math:: \frac{Q_i - Q_{i-1}}{Q_i} < \frac{\eta}{i}.. member:: bool Solver::Options::jacobi_scaling   Default: ``true``   ``true`` means that the Jacobian is scaled by the norm of its   columns before being passed to the linear solver. This improves the   numerical conditioning of the normal equations... member:: LoggingType Solver::Options::logging_type   Default: ``PER_MINIMIZER_ITERATION``.. member:: bool Solver::Options::minimizer_progress_to_stdout   Default: ``false``   By default the :class:`Minimizer` progress is logged to ``STDERR``   depending on the ``vlog`` level. If this flag is set to true, and   :member:`Solver::Options::logging_type` is not ``SILENT``, the logging   output is sent to ``STDOUT``... member:: vector<int> Solver::Options::lsqp_iterations_to_dump   Default: ``empty``   List of iterations at which the optimizer should dump the linear   least squares problem to disk. Useful for testing and   benchmarking. If ``empty``, no problems are dumped... member:: string Solver::Options::lsqp_dump_directory   Default: ``/tmp``   If :member:`Solver::Options::lsqp_iterations_to_dump` is non-empty, then   this setting determines the directory to which the files containing   the linear least squares problems are written to... member:: DumpFormatType Solver::Options::lsqp_dump_format   Default: ``TEXTFILE``   The format in which linear least squares problems should be logged   when :member:`Solver::Options::lsqp_iterations_to_dump` is non-empty.   There are three options:   * ``CONSOLE`` prints the linear least squares problem in a human      readable format to ``stderr``. The Jacobian is printed as a      dense matrix. The vectors :math:`D`, :math:`x` and :math:`f` are      printed as dense vectors. This should only be used for small      problems.   * ``PROTOBUF`` Write out the linear least squares problem to the     directory pointed to by :member:`Solver::Options::lsqp_dump_directory` as     a protocol buffer. ``linear_least_squares_problems.h/cc``     contains routines for loading these problems. For details on the     on disk format used, see ``matrix.proto``. The files are named     ``lm_iteration_???.lsqp``. This requires that ``protobuf`` be     linked into Ceres Solver.   * ``TEXTFILE`` Write out the linear least squares problem to the     directory pointed to by member::`Solver::Options::lsqp_dump_directory` as     text files which can be read into ``MATLAB/Octave``. The Jacobian     is dumped as a text file containing :math:`(i,j,s)` triplets, the     vectors :math:`D`, `x` and `f` are dumped as text files     containing a list of their values.   A ``MATLAB/Octave`` script called ``lm_iteration_???.m`` is also   output, which can be used to parse and load the problem into memory... member:: bool Solver::Options::check_gradients   Default: ``false``   Check all Jacobians computed by each residual block with finite   differences. This is expensive since it involves computing the   derivative by normal means (e.g. user specified, autodiff, etc),   then also computing it using finite differences. The results are   compared, and if they differ substantially, details are printed to   the log... member:: double Solver::Options::gradient_check_relative_precision   Default: ``1e08``   Precision to check for in the gradient checker. If the relative   difference between an element in a Jacobian exceeds this number,   then the Jacobian for that cost term is dumped... member:: double Solver::Options::numeric_derivative_relative_step_size   Default: ``1e-6``   Relative shift used for taking numeric derivatives. For finite   differencing, each dimension is evaluated at slightly shifted   values, e.g., for forward differences, the numerical derivative is   .. math::     \delta &= numeric\_derivative\_relative\_step\_size\\     \Delta f &= \frac{f((1 + \delta)  x) - f(x)}{\delta x}   The finite differencing is done along each dimension. The reason to   use a relative (rather than absolute) step size is that this way,   numeric differentiation works for functions where the arguments are   typically large (e.g. :math:`10^9`) and when the values are small   (e.g. :math:`10^{-5}`). It is possible to construct *torture cases*   which break this finite difference heuristic, but they do not come   up often in practice... member:: vector<IterationCallback> Solver::Options::callbacks   Callbacks that are executed at the end of each iteration of the   :class:`Minimizer`. They are executed in the order that they are   specified in this vector. By default, parameter blocks are updated   only at the end of the optimization, i.e when the   :class:`Minimizer` terminates. This behavior is controlled by   :member:`Solver::Options::update_state_every_variable`. If the user wishes   to have access to the update parameter blocks when his/her   callbacks are executed, then set   :member:`Solver::Options::update_state_every_iteration` to true.   The solver does NOT take ownership of these pointers... member:: bool Solver::Options::update_state_every_iteration   Default: ``false``   Normally the parameter blocks are only updated when the solver   terminates. Setting this to true update them in every   iteration. This setting is useful when building an interactive   application using Ceres and using an :class:`IterationCallback`... member:: string Solver::Options::solver_log   Default: ``empty``   If non-empty, a summary of the execution of the solver is recorded   to this file.  This file is used for recording and Ceres'   performance. Currently, only the iteration number, total time and   the objective function value are logged. The format of this file is   expected to change over time as the performance evaluation   framework is fleshed out.:class:`ParameterBlockOrdering`-------------------------------.. class:: ParameterBlockOrdering   TBD:class:`IterationCallback`--------------------------.. class:: IterationCallback   TBD:class:`CRSMatrix`------------------.. class:: CRSMatrix   TBD:class:`Solver::Summary`------------------------.. class:: Solver::Summary   TBD:class:`GradientChecker`------------------------.. class:: GradientChecker
 |