Derivatives are required at the core of many numerical algorithms. Unfortunately, they are usually computed inefficiently and approximately by some variant of the finite difference approach
$$ f'(x) \approx \frac{f(x+h) - f(x)}{h}, \quad h \text{ small}. $$
This method is inefficient because it requires $\Omega(n)$ evaluations of $f : \mathbb{R}^n \to \mathbb{R}$ to compute the gradient $\nabla f(x)$, for example. It is approximate because we have to choose some finite, small value of the step length $h$, balancing floating-point precision with mathematical approximation error.
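The cost and accuracy trade-off described above can be seen in a small sketch. It is written in plain Python so that it is self-contained, and it is not taken from any package listed on this page; the names `fd_gradient`, `counted`, and `fd_error` are made up for illustration.

```python
import math

def fd_gradient(f, x, h=1e-6):
    """Forward-difference approximation of the gradient of f at x."""
    fx = f(x)                          # one baseline evaluation
    grad = []
    for i in range(len(x)):            # plus one evaluation per input coordinate
        xh = list(x)
        xh[i] += h
        grad.append((f(xh) - fx) / h)
    return grad

def counted(f):
    """Wrap f so that the number of calls can be inspected afterwards."""
    def wrapper(x):
        wrapper.calls += 1
        return f(x)
    wrapper.calls = 0
    return wrapper

# Cost: a function of n inputs needs n + 1 evaluations for one gradient.
sum_sq = counted(lambda x: sum(xi * xi for xi in x))
x0 = [1.0, 2.0, 3.0]
grad = fd_gradient(sum_sq, x0)         # the exact gradient is 2 * x0
n_calls = sum_sq.calls

# Accuracy: too large a step gives truncation error,
# too small a step gives floating-point cancellation error.
def fd_error(h):
    approx = (math.sin(1.0 + h) - math.sin(1.0)) / h
    return abs(approx - math.cos(1.0))

errors = {h: fd_error(h) for h in (1e-1, 1e-8, 1e-15)}
```

Printing `errors` shows the error shrinking and then growing again as the step length decreases, which is the balancing act referred to above.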
One option is to explicitly write down a function which computes the exact derivatives by using the rules that we know from calculus. However, this quickly becomes an error-prone and tedious exercise. There is another way! The field of automatic differentiation provides methods for automatically computing exact derivatives (up to floating-point error) given only the function $f$ itself. Some methods use many fewer evaluations of $f$ than would be required when using finite differences. In the best case, the exact gradient of $f$ can be evaluated for the cost of $O(1)$ evaluations of $f$ itself. The caveat is that $f$ cannot be considered a black box; instead, we require either access to the source code of $f$ or a way to plug in a special type of number using operator overloading.
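To make "a special type of number using operator overloading" concrete, here is a minimal sketch of forward-mode AD with dual numbers. It is plain Python for self-containment and only supports `+`, `*`, and `sin`; the `Dual` class and `derivative` helper are hypothetical names for this illustration and are not the API of any package listed here.

```python
import math

class Dual:
    """A number val + der * eps with eps**2 == 0; der carries the derivative."""
    def __init__(self, val, der=0.0):
        self.val = val
        self.der = der

    @staticmethod
    def lift(x):
        # Treat plain numbers as constants (zero derivative).
        return x if isinstance(x, Dual) else Dual(x)

    def __add__(self, other):
        other = Dual.lift(other)
        return Dual(self.val + other.val, self.der + other.der)
    __radd__ = __add__

    def __mul__(self, other):
        other = Dual.lift(other)
        # Product rule applied to the derivative part.
        return Dual(self.val * other.val,
                    self.der * other.val + self.val * other.der)
    __rmul__ = __mul__

def sin(x):
    x = Dual.lift(x)
    # Chain rule: d/dx sin(u) = cos(u) * u'
    return Dual(math.sin(x.val), math.cos(x.val) * x.der)

def derivative(f, x):
    """Evaluate f once on a dual number seeded with derivative 1."""
    return f(Dual(x, 1.0)).der

def f(x):
    return x * x * x + 2 * x + sin(x)

df = derivative(f, 1.5)   # analytically 3 * 1.5**2 + 2 + cos(1.5)
```

The function `f` is written as ordinary arithmetic; only the number type passed in changes, and no step length has to be chosen.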
JuliaDiff is an informal organization which aims to unify and document packages written in Julia for evaluating derivatives. The technical features of Julia (namely multiple dispatch, source code via reflection, JIT compilation, and first-class access to expression parsing) make implementing and using techniques from automatic differentiation easier than ever before (in our biased opinion).
This is a big list of Julia Automatic Differentiation (AD) packages and related tooling. As you can see, there is a lot going on here. As with any such big list, it rapidly becomes outdated. When you notice something that is out of date, or just plain wrong, please submit a PR.
This list aims to be comprehensive in coverage. By necessity, this means it is not comprehensive in detail. It is worth investigating each package yourself to really understand its ins and outs, and its pros and cons relative to its competitors.
ReverseDiff.jl: Operator overloading reverse-mode AD. Very well-established.
Nabla.jl: Operator overloading reverse-mode AD. Used in the systems of Invenia, its maintainer.
Tracker.jl: Operator overloading reverse-mode AD. Most well-known for having been the AD used in earlier versions of the machine learning package Flux.jl. No longer used by Flux.jl, but still used in several places in the Julia ecosystem.
Yota.jl: IR-level source to source reverse-mode AD.
XGrad.jl: AST-level source to source reverse-mode AD. Not currently in active development.
ReversePropagation.jl: Scalar, tracing-based source to source reverse-mode AD.
Enzyme.jl: Scalar, LLVM source to source reverse-mode AD. Experimental.
ForwardDiff.jl: Scalar, operator overloading forward-mode AD. Very stable. Very well-established.
ForwardDiff2: Experimental, non-scalar hybrid operator-overloading/source-to-source forward-mode AD. Not currently in development.
TaylorSeries.jl: Computes polynomial expansions, which generalize forward-mode AD to nth-order derivatives.
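Several entries above are described as operator overloading reverse-mode AD. As a rough illustration of that idea only, the following plain-Python sketch records the computation graph while the function runs and then propagates derivatives backwards from the output. The `Var` class and `backward` function are hypothetical and do not reflect how any listed package is implemented.

```python
class Var:
    """A value that remembers which values it was computed from."""
    def __init__(self, value, parents=()):
        self.value = value
        self.parents = parents   # pairs of (parent Var, local partial derivative)
        self.grad = 0.0

    def __add__(self, other):
        return Var(self.value + other.value, ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        return Var(self.value * other.value,
                   ((self, other.value), (other, self.value)))

def backward(output):
    """Accumulate d(output)/d(node) into node.grad for every node in the graph."""
    order, seen = [], set()

    def visit(node):
        # Depth-first post-order: inputs are listed before the values using them.
        if id(node) in seen:
            return
        seen.add(id(node))
        for parent, _ in node.parents:
            visit(parent)
        order.append(node)

    visit(output)
    output.grad = 1.0
    # Walk from the output towards the inputs, applying the chain rule.
    for node in reversed(order):
        for parent, local in node.parents:
            parent.grad += local * node.grad

x, y = Var(3.0), Var(4.0)
z = x * y + x * x        # dz/dx = y + 2x, dz/dy = x
backward(z)
```

A single backward pass fills in `x.grad` and `y.grad` together, which is why reverse mode is attractive for functions with many inputs and one output.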
Yes, we said at the start to stop approximating derivatives, but these packages are faster and more accurate than you would expect finite differencing to ever achieve. If you really need finite differencing, use these packages rather than implementing your own.
FiniteDifferences.jl: High-accuracy finite differencing with support for almost any type (not just arrays and numbers).
FiniteDiff.jl: High-accuracy finite differencing with support for efficient calculation of sparse Jacobians via coloring vectors.
Calculus.jl: Largely deprecated, legacy package. New users should look to FiniteDifferences.jl and FiniteDiff.jl instead.
Packages providing collections of derivatives of functions which can be used in AD packages.
ChainRules: Extensible, AD-independent rules.
DiffRules.jl: An earlier set of AD-independent rules. Largely deprecated in favor of the more extensible ChainRules.jl.
ZygoteRules.jl: For defining Zygote.jl-specific rules. Largely deprecated in favor of the AD-independent ChainRules.jl.
SparsityDetection.jl: Automatic Jacobian and Hessian sparsity pattern detection.
SparseDiffTools.jl: Exploiting sparsity to speed up FiniteDiff.jl and ForwardDiff.jl, as well as other algorithms.
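To give a feel for why a known sparsity pattern helps, here is a small plain-Python sketch of a finite-difference Jacobian that perturbs several columns at once. The sparsity pattern and the column coloring are supplied by hand for a tridiagonal example; the function names are hypothetical, and nothing here is the API of SparsityDetection.jl, SparseDiffTools.jl, or FiniteDiff.jl.

```python
def tridiagonal_f(x):
    """Each output i depends only on x[i-1], x[i], x[i+1]."""
    n = len(x)
    out = []
    for i in range(n):
        left = x[i - 1] if i > 0 else 0.0
        right = x[i + 1] if i < n - 1 else 0.0
        out.append(left - 2.0 * x[i] ** 2 + right)
    return out

def colored_fd_jacobian(f, x, colors, pattern, h=1e-6):
    """Forward-difference Jacobian using one extra evaluation per color.

    colors[j] is the color of column j; columns sharing a color must not
    have nonzeros in the same row. pattern is the set of (row, col) nonzeros.
    Returns the dense Jacobian and the number of evaluations of f used.
    """
    fx = f(x)
    evals = 1
    jac = [[0.0] * len(x) for _ in range(len(fx))]
    for c in sorted(set(colors)):
        # Perturb every column of this color simultaneously.
        xh = [xi + h if colors[j] == c else xi for j, xi in enumerate(x)]
        fh = f(xh)
        evals += 1
        for (i, j) in pattern:
            if colors[j] == c:
                jac[i][j] = (fh[i] - fx[i]) / h
    return jac, evals

n = 6
x0 = [float(k + 1) for k in range(n)]
pattern = {(i, j) for i in range(n) for j in range(n) if abs(i - j) <= 1}
colors = [j % 3 for j in range(n)]     # columns 3 apart never share a row
jac, evals = colored_fd_jacobian(tridiagonal_f, x0, colors, pattern)
```

For this pattern three colors are enough regardless of `n`, so the number of function evaluations stays at four instead of growing as `n + 1`.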
Discussions on JuliaDiff and its uses may be directed to the Julia Discourse forum. The autodiff.org site serves as a portal for the academic community, though it is often out of date. The ChainRules project maintains a list of recommended reading/watching for those after more information. Finally, automatic differentiation techniques have been implemented in a variety of languages. If you would prefer not to use Julia, see the Wikipedia page for a comprehensive list of available packages.