User Contributed Dictionary
Adjective
- Of, pertaining to, or caused by variation
Translations
- Italian: variazionale
Extensive Definition
Calculus of variations is a field of mathematics that deals with
functionals,
as opposed to ordinary calculus which deals with
functions.
Such functionals
can for example be formed as integrals involving an unknown
function and its derivatives. The interest is in extremal
functions: those making the functional attain a maximum or minimum
value.
Perhaps the simplest example of such a problem is
to find the curve of shortest length connecting two points. If
there are no constraints, the solution is obviously a straight line
between the points. However, if the curve is constrained to lie on
a surface in space, then the solution is less obvious, and possibly
many solutions may exist. Such solutions are known as geodesics. A related problem is
posed by Fermat's
principle: light follows the path of shortest optical length
connecting two points, where the optical length depends upon the
material of the medium. One corresponding concept in mechanics is the
principle of least action. The theory of optimal
control concerns a specific kind of problem in the calculus of
variations.
Many important problems involve functions of
several variables. Solutions of boundary value problems for the
Laplace
equation satisfy the Dirichlet
principle. Plateau's
problem requires finding a surface of minimal area that spans a
given contour in space: the solution or solutions may be found by
dipping a wire frame in a solution of soap suds. Although such
experiments are relatively easy to perform, their mathematical
interpretation is far from simple: there may be more than one
locally minimizing surface, and they may have non-trivial
topology.
The Euler–Lagrange equation
Under ideal conditions, the maxima and minima of a given function may be located by finding the points where its derivative vanishes. By analogy, solutions of smooth variational problems may be obtained by solving the associated Euler–Lagrange equation. In order to illustrate this process, consider the problem of finding the shortest curve in the plane that connects two points (x_1, y_1) and (x_2, y_2). The arc length is given by- A[f] = \int_^ \sqrt \, dx,
with
- f'(x) = \frac, \,
and where y=f(x), f(x_1)=y_1 and f(x_2)=y_2. The
function f should have at least one derivative in order to satisfy
the requirements for valid application of the function, further, if
f_0 is a local
minimum and f_1 is an arbitrary function that vanishes at the
endpoints x_1 and x_2 and with at least one derivative, then we
must have
- A[f_0] \le A[f_0 + \epsilon f_1]
for any number ε close to 0. Therefore, the
derivative of A[f_0 + \epsilon f_1] with respect to ε (the first
variation of A) must vanish at ε=0. Thus
- \int_^ \frac \,dx =0, \,
for any choice of the function f_1. We may
interpret this condition as the vanishing of all directional
derivatives of A[f_0] in the space of differentiable functions, and
this is formalized by requiring the Fréchet
derivative of A to vanish at f_0. If we assume that f_0 has two
continuous derivatives (or if we consider weak
derivatives), then we may use integration
by parts:
- \int_a^b u(x) v'(x)\,dx = \left[ u(x) v(x) \right]_^ - \int_a^b u'(x) v(x)\,dx
with the substitution
- u(x)=\frac , \quad v'(x)=f_1'(x),
then we have
- \left[ u(x) v(x) \right]_^ - \int_^ f_1(x) \frac\left[ \frac \right] \, dx =0,
but the first term is zero since v(x)=f_1(x) was
chosen to vanish at x_1 and x_2 where the evaluation is taken.
Therefore,
- \int_^ f_1(x) \frac\left[ \frac \right] \, dx =0
for any twice differentiable function f_1 that
vanishes at the endpoints of the interval. This is a special case
of the
fundamental lemma of calculus of variations:
- I =\int_^ f_1(x) H(x)\, dx =0, \,
for any differentiable function f_1(x) that
vanishes at the endpoints of the interval. Since f_1(x) is an
arbitrary function within the integration range, we conclude that
H(x) = 0. Therefore,
- \frac\left[ \frac \right] =0.\,
It follows from this equation that
- \frac=0,
and hence the extremals are straight lines.
A similar calculation holds in the general case
where
- A[f] = \int_^ L(x,f,f')\, dx . \,
and f is required to have two continuous
derivatives. Again, we find an extremal f_0 by setting f = f_0 +
\epsilon f_1, taking the derivative with respect to ε,
and setting \epsilon = 0 at the end:
\begin \left.\frac\right|_ & = \int_^
\left.\frac\right|_ dx \\ & = \int_^ \left(\frac f_1 + \frac
f'_1\right)\, dx \\ & = \int_^ \left(\frac f_1 - f_1 \frac\frac
\right)\, dx + \left.\frac f_1 \right|_^\\ & = \int_^ f_1
\left(\frac - \frac\frac \right)\, dx \\ & = 0, \end
where we have used the chain rule in
the second line and integration
by parts in the third. As before, the last term in the third
line vanishes due to our choice of f_1. Finally, according to the
fundamental lemma of calculus of variations, we find that L
will satisfy the Euler–Lagrange equation
- -\frac \frac + \frac=0,
In general this gives a second-order
ordinary differential equation which can be solved to obtain
the extremal f. The Euler–Lagrange equation is a necessary,
but not sufficient,
condition for an extremal. Sufficient conditions for an extremal
are discussed in the references.
The Beltrami Identity
Frequently in physical problems, it turns out
that \part L/\part x=0. In that case, the Euler-Lagrange equation
can be simplified using the Beltrami Identity
- L-f'\frac=C,
where C is a constant. http://planetmath.org/encyclopedia/BeltramiIdentity.html
du Bois Reymond's theorem
The discussion thus far has assumed that extremal
functions possess two continuous derivatives, although the
existence of the integral A requires only first derivatives of
trial functions. The condition that the first variation vanish at
an extremal may be regarded as a weak form of the Euler-Lagrange
equation. The theorem of du Bois Reymond asserts that this weak
form implies the strong form. If L has continuous first and second
derivatives with respect to all of its arguments, and if
- \frac \ne 0,
then f_0 has two continuous derivatives, and it
satisfies the Euler-Lagrange equation.
Fermat's principle
Fermat's
principle states that light takes a path that (locally)
minimizes the optical length between its endpoints. If the
x-coordinate is chosen as the parameter along the path, and y=f(x)
along the path, then the optical length is given by
- A[f] = \int_^ n(x,f(x)) \sqrt dx, \,
where the refractive index n(x,y) depends upon
the material. If we try f(x) = f_0 (x) + \epsilon f_1 (x) then the
first variation of A (the derivative of A with respect to ε)
is
- \delta A[f_0,f_1] = \int_^ \left[ \frac + n_y (x,f_0) f_1 \right] dx.
After integration by parts of the first term
within brackets, we obtain the Euler-Lagrange equation
- -\frac \left[\frac \right] + n_y (x,f_0) =0. \,
The light rays may be determined by integrating
this equation.
Snell's law
There is a discontinuity of the refractive index
when light enters or leaves a lens. Let
- n(x,y) = n_- \quad \hbox \quad x
- n(x,y) = n_+ \quad \hbox \quad x>0,\,
where n_- and n_+ are constants. Then the
Euler-Lagrange equation holds as before in the region where x0, and
in fact the path is a straight line there, since the refractive
index is constant. At the x=0, f must be continuous, but f' may be
discontinuous. After integration by parts in the separate regions
and using the Euler-Lagrange equations, the first variation takes
the form
- \delta A[f_0,f_1] = f_1(0)\left[ n_-\frac -n_+\frac \right].\,
The factor multiplying n_- is the sine of angle
of the incident ray with the x axis, and the factor multiplying n_+
is the sine of angle of the refracted ray with the x axis. Snell's law
for refraction requires that these terms be equal. As this
calculation demonstrates, Snell's law is equivalent to vanishing of
the first variation of the optical path length.
Fermat's principle in three dimensions
It is expedient to use vector notation: let
X=(x_1,x_2,x_3), let t be a parameter, let X(t) be the parametric
representation of a curve C, and let \dot X(t) be its tangent
vector. The optical length of the curve is given by
- A[C] = \int_^ n(X) \sqrt dt. \,
Note that this integral is invariant with respect
to changes in the parametric representation of C. The
Euler-Lagrange equations for a minimizing curve have the symmetric
form
- \frac P = \sqrt \nabla n, \,
where
- P = \frac.\,
It follows from the definition that P
satisfies
- P \cdot P = n(X)^2. \,
Therefore the integral may also be written
as
- A[C] = \int_^ P \cdot \dot X \, dt.\,
This form suggests that if we can find a function
ψ whose gradient is given by P, then the integral A is given by the
difference of ψ at the endpoints of the interval of integration.
Thus the problem of studying the curves that make the integral
stationary can be related to the study of the level surfaces of ψ.
In order to find such a function, we turn to the wave equation,
which governs the propagation of light.
Connection with the wave equation
The wave
equation for an inhomogeneous medium is
- u_ = c^2 \nabla \cdot \nabla u, \,
where c is the velocity, which generally depends
upon X. Wave fronts for light are characteristic surfaces for this
partial differential equation: they satisfy
- \varphi_t^2 = c(X)^2 \nabla \varphi \cdot \nabla \varphi. \,
We may look for solutions in the form
- \varphi(t,X) = t - \psi(X). \,
In that case, ψ satisfies
- \nabla \psi \cdot \nabla \psi = n^2, \,
where n=1/c. According to the theory of
first order partial differential equations, if P = \nabla \psi,
then P satisfies
- \frac = 2 n \nabla n, \,
along a system of curves (the light rays) that
are given by
- \frac = P. \,
These equations for solution of a first-order
partial differential equation are identical to the Euler-Lagrange
equations if we make the identification
- \frac = \frac. \,
We conclude that the function ψ is the value of
the minimizing integral A as a function of the upper end point.
That is, when a family of minimizing curves is constructed, the
values of the optical length satisfy the characteristic equation
corresponding the wave equation. Hence, solving the associated
partial differential equation of first order is equivalent to
finding families of solutions of the variational problem. This is
the essential content of the Hamilton-Jacobi
theory, which applies to more general variational
problems.
The action principle
The action
was defined by Hamilton to be the time integral of the Lagrangian,
L, which is defined as a difference of energies:
- L = T - U, \,
- A[C] = \int_^ L(X, \dot X) dt \,
- \frac \frac = \frac, \,
The conjugate momenta P are defined by
- P = \frac. \,
- T = \frac m \dot x^2, \,
- P = m \dot x. \,
- H(X,P) = -L(X,\dot X) + P \cdot \dot X.\,
- \frac + H(X,\nabla \psi) =0.\,
Functions of several variables
Variational problems that involve multiple integrals arise in numerous applications. For example, if φ(x,y) denotes the displacement of a membrane above the domain D in the x,y plane, then its potential energy is proportional to its surface area:- U[\varphi] = \iint_D \sqrt dx\,dy.\,
- \varphi_(1 + \varphi_y^2) + \varphi_(1 + \varphi_x^2) - 2\varphi_x \varphi_y \varphi_ = 0.\,
Dirichlet's principle
It is often sufficient to consider only small displacements of the membrane, whose energy difference from no displacement is approximated by- V[\varphi] = \frac\iint_D \nabla \varphi \cdot \nabla \varphi \, dx\, dy.\,
- \frac V[u + \epsilon v]|_ = \iint_D \nabla u \cdot \nabla v \, dx\,dy = 0.\,
- \iint_D \nabla \cdot (v \nabla u) \,dx\,dy =
- \iint_D v\nabla \cdot \nabla u \,dx\,dy =0 \,
- \nabla \cdot \nabla u= 0 \, in D.
The difficulty with this reasoning is the
assumption that the minimizing function u must have two
derivatives. Riemann argued that the existence of a smooth
minimizing function was assured by the connection with the physical
problem: membranes do indeed assume configurations with minimal
potential energy. Riemann named this
idea Dirichlet's principle in honor of his teacher Dirichlet.
However Weierstrass gave an example of a variational problem with
no solution: minimize
- W[\varphi] = \int_^ (x\varphi')^2 \, dx\,
Generalization to other boundary value problems
A more general expression for the potential
energy of a membrane is
- v[\varphi] = \iint_D \left[ \frac \nabla \varphi \cdot \nabla \varphi + f(x,y) \varphi \right] \, dx\,dy \, + \int_C \left[ \frac \sigma(s) \varphi^2 + g(s) \varphi \right] \, ds.
- \iint_D \left[ \nabla u \cdot \nabla v + f v \right] \, dx\, dy + \int_C \left[ \sigma u v + g v \right] \, ds =0. \,
- \iint_D \left[ -v \nabla \cdot \nabla u + v f \right] \, dx \, dy + \int_C v \left[ \frac + \sigma u + g \right] \, ds =0. \,
- - \nabla \cdot \nabla u + f =0 \,
- \frac + \sigma u + g =0, \,
The preceding reasoning is not valid if \sigma
vanishes identically on C. In such a case, we could allow a trial
function \varphi \equiv c, where c is a constant. For such a trial
function,
- V[c] = c\left[ \iint_D f \, dx\,dy + \int_C g ds \right].
- \iint_D f \, dx\,dy + \int_C g \, ds =0.\,
Eigenvalue problems
Both one-dimensional and multi-dimensional
eigenvalue problems can be formulated as variational
problems.
Sturm-Liouville problems
The Sturm-Liouville eigenvalue problem involves a
general quadratic form
- Q[\varphi] = \int_^ \left[ p(x) \varphi'(x)^2 + q(x) \varphi(x)^2 \right] \, dx, \,
- \varphi(x_1)=0, \quad \varphi(x_2)=0. \,
- R[\varphi] =\int_^ r(x)\varphi(x)^2 \, dx.\,
- -(pu')' +q u -\lambda r u =0, \,
- \lambda = \frac. \,
The next smallest eigenvalue and eigenfunction
can be obtained by minimizing Q under the additional constraint
- \int_^ r(x) u_1(x) \varphi(x) \, dx=0. \,
The variational problem also applies to more
general boundary conditions. Instead of requiring that φ vanish at
the endpoints, we may not impose any condition at the endpoints,
and set
- Q[\varphi] = \int_^ \left[ p(x) \varphi'(x)^2 + q(x)\varphi(x)^2 \right] \, dx + a_1 \varphi(x_1)^2 + a_2 \varphi(x_2)^2, \,
- V_1 = \frac \left( \int_^ \left[ p(x) u'(x)v'(x) + q(x)u(x)v(x) -\lambda u(x) v(x) \right] \, dx + a_1 u(x_1)v(x_1) + a_2 u(x_2)v(x_2) \right) , \,
- \frac V_1 = \int_^ v(x) \left[ -(p u')' + q u -\lambda r u \right] \, dx + v(x_1)[ -p(x_1)u'(x_1) + a_1 u(x_1)] + v(x_2) [p(x_2 u'(x_2) + a_2 u(x_2). \,
- -(p u')' + q u -\lambda r u =0 \quad \hbox \quad x_1
- -p(x_1)u'(x_1) + a_1 u(x_1)=0, \quad \hbox \quad p(x_2 u'(x_2) + a_2 u(x_2)=0.\,
Eigenvalue problems in several dimensions
Eigenvalue problems in higher dimensions are
defined in analogy with the one-dimensional case. For example,
given a domain D with boundary B in three dimensions we may
define
- Q[\varphi] = \iiint_D p(X) \nabla \varphi \cdot \nabla \varphi + q(X) \varphi^2 \, dx \, dy \, dz + \iint_B \sigma(S) \varphi^2 \, dS, \,
- R[\varphi] = \iiint_D r(X) \varphi(X)^2 \, dx \, dy \, dz.\,
- -\nabla \cdot (p(X) \nabla u) + q(x) u - \lambda r(x) u=0,\,
- \lambda = \frac.\,
- p(S) \frac + \sigma(S) u =0,
See also
Reference books
- Gelfand, I.M. and Fomin, S.V.: Calculus of Variations, Dover Publ., 2000
- Lebedev, L.P. and Cloud, M.J.: The Calculus of Variations and Functional Analysis with Optimal Control and Applications in Mechanics, World Scientific, 2003, pages 1-98
- Charles Fox: An Introduction to the Calculus of Variations, Dover Publ., 1987
- Forsyth, A.R.: Calculus of Variations, Dover, 1960
- Sagan, Hans: Introduction to the Calculus of Variations, Dover, 1992
- Weinstock, Robert: Calculus of Variations with Applications to Physics and Engineering, Dover, 1974
- Clegg, J.C.: Calculus of Variations, Interscience Publishers Inc., 1968
- Courant, R.: Dirichlet's principle, conformal mapping and minimal surfaces. Interscience, 1950.
- Courant, R. and D. Hilbert: Methods of Mathematical Physics, Vol I. Interscience Press, 1953.
- Elsgolc, L.E.: Calculus of Variations, Pergamon Press Ltd., 1962
- Jost, J. and X. Li-Jost: Calculus of Variations. Cambridge University Press, 1998.
References
- Johan Byström, Lars-Erik Persson, and Fredrik Strömberg, Chapter III: Introduction to the calculus of variations (undated).
- Calculus of variations example problems.
- Chapter 8: Calculus of Variations, from Optimization for Engineering Systems, by Ralph W. Pike, Louisiana State University
variational in Czech: Variační počet
variational in German: Variationsrechnung
variational in Spanish: Cálculo de
variaciones
variational in Persian: حسابان وردشها
variational in French: Calcul des
variations
variational in Italian: Calcolo delle
variazioni
variational in Hebrew: חשבון וריאציות
variational in Maltese: Kalkulu
tal-Varjazzjonijiet
variational in Japanese: 変分法
variational in Piemontese: Càlcol dle
variassion
variational in Polish: Rachunek wariacyjny
variational in Russian: Вариационное
исчисление
variational in Slovenian: Variacijski
račun
variational in Swedish: Variationskalkyl
variational in Ukrainian: Варіаційне
числення
variational in Chinese: 变分法