Directional Differentiability
Definition: Directional Derivative of a Real Scalar Field
Let be a real scalar field on an open subset , let and let be some unit vector.
The directional derivative of at along is the limit
if it exists.
NOTATION
Partial Differentiability
Definition: Partial Derivatives of a Real Scalar Field
Let be a real scalar field on an open subset , let , let be a coordinate system on and let be the coordinates of in .
The partial derivative of at with respect to the -th coordinate is the limit
NOTATION
Most commonly, the partial derivative of at with respect to the -th coordinate is denoted as:
If the coordinate system is evident from context, we can also write .
Note: Partial Derivative as a Function
When there is no specific mentioned, the term “partial derivative” refers to the the function which to each assigns the partial derivative of at with respect to the coordinate .
Note: Orders of Partial Derivatives
An -th order partial derivative of is a partial derivative of ‘s -th partial derivative function.
Definition: Partial Differentiability of Real Scalar Fields
We say that is -times (continuously) partially differentiable at if all of its -th order partial derivatives with respect to all coordinates of exist at (and are continuous there).
If the above is true for every in some , then we say that is -times (continuously) partially differentiable on . We can also omit “on ” when .
NOTE
If there is no specific coordinate system mentioned, then we mean that is -times (continuously) partially differentiable with respect to Cartesian coordinates.
Warning: Partial Differentiability's Dependence on Coordinate Systems
If is partially differentiable in one coordinate system, then that does not necessarily mean that it is partially differentiable in every coordinate system.
Example
Consider the function defined in Cartesian coordinates as
Let’s see what ‘s partial derivatives with respect to and are.
Partial derivative with respect to :
For ,
For , we have and thus
So, the partial derivative of with respect to exists at every point.
Partial derivative with respect to :
For ,
For , again we have and so
However, this limit does not exists, since it depends on whether approaches from the left or the right.
Therefore, is not partially differentiable with respect to on the points .
Now, let’s take a look at another coordinate system . The transformations between and are given below.
When , i.e. when , the function is also .
When , the function expressed in the coordinate system is
Therefore,
Let’s examine the partial derivatives of with respect to and .
Partial derivative with respect to :
For ,
For , we have
Therefore, the partial derivative of with respect to exists at every point.
Partial derivative with respect to :
For ,
For , we have
Therefore, the partial derivative of with respect to exists at every point.
This means that is partially differentiable at every point with respect to the coordinate system , but it is not partially differentiable with respect to .
Theorem: Partial Derivatives in Cartesian Coordinates
Let be a real scalar field on an open subset .
The partial derivative of a real scalar field at with respect to the -th Cartesian coordinate coincides with the directional derivative of along the -th standard basis vector :
PROOF
TODO
NOTATION
Cartesian coordinates in partial derivatives can be denoted using either subscripts and superscripts, i.e.
are equivalent. Writing the coordinates using subscripts is cleaner when partial derivatives of higher orders are involved.
Schwarz's Theorem: Symmetry of Second-Order Partial Derivatives
Let be a real scalar field on an open subset , let .
If is twice continuously partially differentiable at , then for all
PROOF
TODO
Gradient
Definition: Gradient
Let be a real scalar field on an open subset and let .
If is partially differentiable at with respect to Cartesian coordinates, then the gradient of at is the following vector:
NOTATION
The gradient at shows the directions in which small deviations from result in the largest increase and the largest decrease in the value of :
- An infinitesimally small deviation from in the direction of will result in the greatest possible increase in the value of . If the deviation from is in any other direction, then the increase in the value of will necessarily be smaller (closer to 0).
- An infinitesimally small deviation from in the direction of will result in the greatest possible decrease in the value of . If the deviation from is in any other direction, then the decrease in the value of will necessarily be smaller (closer to 0).
Hessian Matrix
Definition: Hessian Matrix
Let be a real scalar field which is twice partially differentiable with respect to Cartesian coordinates.
The Hessian matrix of is the -matrix whose columns are the gradients of ‘s partial derivatives:
NOTATION
The Hessian matrix is different for different , since the partial derivatives of depend on . The Hessian matrix at a particular is thus denoted as to make this dependency apparent.
Theorem: Symmetry of the Hessian Matrix
Let be a differentiable real scalar field with differentiable partial derivatives.
If all of second partial derivatives of ‘s second-order partial derivatives are also continuous, then the Hessian matrix of is symmetric for every .
PROOF
TODO
Differentiability
We can use partial derivatives to provide an alternative definition for the differentiability of real scalar fields.
Definition: Differentiability of Real Scalar Fields
Let be a real scalar field on an open subset and let .
We say that is differentiable at if and only if is partially differentiable at with respect to Cartesian coordinates and the following limit is zero:
where is the dot product between ‘s gradient at and .
Tip: Gradient and Jacobian
The gradient is then just the transpose of ‘s Jacobian matrix.
Definition: Critical Point
Let be a real scalar field.
We say that has a critical point at if is not differentiable at or it is differentiable at but its gradient there is zero.
Theorem: Differentiability implies Directional Differentiability
Let be a real scalar field on an open subset and let .
If is differentiable at , then its directional derivatives at exist along every direction and are equal to the dot product of ‘s gradient with :
PROOF
TODO
Theorem: Chain Rule for Scalar Fields
Let be a real scalar field on an open subset and let . Let be a vector-valued function on an open subset and let .
If is differentiable at and is differentiable at , then the derivative of the composition is the dot product of ‘s gradient and ‘s derivative .
PROOF
TODO
Theorem: Product Rule
Let and be real scalar fields and let .
If and are differentiable at , then the product is also differentiable at and its gradient is the following:
PROOF
TODO
Theorem: Quotient Rule
Let and be real scalar fields and let .
If and are differentiable at and , then the quotient is also differentiable at its gradient is the following:
PROOF
TODO