The Wolfram Mathematica notebook which contains the code that produces all the Mathematica output in this web page may be downloaded at this link.
We denote by 𝔽 one of the following sets of scalars: ℤ, the set of integers (which is a ring rather than a field); ℚ, the field of rational numbers; ℝ, the field of real numbers; and ℂ, the field of complex numbers. However, the definition of a norm involves only two of them: either ℝ or ℂ.
This section is devoted to one of the most important operations in all of linear algebra---the dot product. Many operations and algorithms involve the dot product, including convolution, correlation, matrix multiplication, duality, the Fourier transform, and signal filtering, among many others.
Dot Product
We met a special linear combination of numerical vectors many times in previous sections---for instance, in a linear equation in n unknowns.

Remark 1: Although textbooks on linear algebra define the dot product for vectors from the same vector space (mostly because this leads to a fruitful theory and geometric applications), our definition extends the dot product to vectors from different vector spaces, provided they have the same dimension and the same field 𝔽 of scalars. The importance of this definition stems from practical applications; for instance, in calculus you learn that the line integral involves the dot product of a vector field F with the infinitesimal displacement dr:
Remark 2: In applications, numerical vectors are usually associated with measurements and so inherit units. For instance, the integer 5 may stand for 5 million dollars in a Wall Street office, the same number is regarded as a 5-dollar bill by a bank clerk, a mechanical engineer may see it as 5 centimeters, and computer science folks treat it as 5 GB. Only mathematicians see in 5 an integer, a number without any unit. Therefore, vectors and scalars in linear algebra are not tied to any specific units of measurement. We can all appreciate the beauty of mathematical language when we enter our particular information into a computer---this device recognizes only electric pulses, on or off---there is no room for any unit. When Joseph Fourier (1768--1830) introduced the Fourier transform in 1822
If v represents a displacement (e.g., in meters) and f represents a force (e.g., in Newtons), then f • v represents work (in Newton-meters or Joules). Therefore, the force has units of "Joules per meter".
If v represents a velocity (e.g., in meters per second) and φ represents momentum (e.g., in kg·m/s), then φ(v) = p • v = m v • v represents twice the kinetic energy (in kg·m²/s², that is, Joules). Therefore, the dual vector (momentum) has units of kg·m/s. ■
Remark 3: Recall that two vector spaces V and U are isomorphic (denoted V ≌ U) if there is a bijective linear map between them. This bijection (a one-to-one and onto mapping) can be realized by choosing ordered bases α = [ a₁, a₂, … , an ] and β = [ b₁, b₂, … , bn ] in the vector spaces V and U, respectively. Then the components of every vector with respect to a chosen ordered basis can be identified uniquely with an n-tuple. Therefore, the algebraic formula \eqref{EqDot.2} is essentially applied to two isomorphic copies of the Cartesian product 𝔽n. The geometric interpretation of the dot product, which is coordinate independent and therefore conveys invariant properties of these products, is given in the Euclidean space section. The scalar product can then be defined as a bilinear form:
Note: The definition of the dot product does not prevent us from applying it to two distinct isomorphic versions of the Cartesian product 𝔽n. So you can find the dot product of a row vector with a column vector. However, we try to avoid writing it as matrix multiplication,
Mathematica does not distinguish rows from columns. Dot product can be accomplished with two Mathematica commands:
Dot[a, b]
a . b
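For readers cross-checking outside Mathematica, the same computation can be sketched in Python with NumPy (the sample vectors here are made up for illustration):

```python
import numpy as np

a = np.array([1, 2, 3])
b = np.array([4, 5, 6])

# Both forms compute a1*b1 + a2*b2 + a3*b3, mirroring Dot[a, b] and a . b
print(np.dot(a, b))   # 1*4 + 2*5 + 3*6 = 32
print(a @ b)          # the @ operator is equivalent for 1-D arrays
```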

One of the main and most fruitful applications of the dot product arises when the scalar product involves numerical vectors from 𝔽n or their isomorphic copies. Upon introducing an ordered basis α = [e₁, e₂, … , en] in a finite dimensional vector space V, each of its vectors v = c₁e₁ + c₂e₂ + ⋯ + cnen is uniquely identified with the corresponding coordinate vector ⟦v⟧α = (c₁, c₂, … , cn) ∈ 𝔽n.
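As a concrete illustration (with a basis chosen here purely for the example), the coordinate vector of v relative to an ordered basis can be found by solving a linear system whose coefficient columns are the basis vectors:

```python
import numpy as np

# Ordered basis of R^2, stored as the columns of B (chosen for illustration):
# e1 = (1, 0), e2 = (1, 1)
B = np.array([[1.0, 1.0],
              [0.0, 1.0]])
v = np.array([3.0, 2.0])

# The coordinates c satisfy c1*e1 + c2*e2 = v, i.e. B @ c = v
c = np.linalg.solve(B, v)
print(c)   # [1. 2.]  since v = 1*(1,0) + 2*(1,1)
```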
Solution: Using the component formula (1) for the dot product of three-dimensional vectors \[ \mathbf{a} \bullet \mathbf{b} = a_1 b_1 + a_2 b_2 + a_3 b_3 , \] we calculate the dot product to be \[ \mathbf{a} \bullet \mathbf{b} = 3 \cdot 4 - 2 \cdot 5 + 1 \cdot 2 = 12 - 10 + 2 = 4 . \]
a = {3, -2, 1}; b = {4, 5, 2};
Dot[a, b]
(* 4 *)
Not every curvilinear system of coordinates supports dot product, as the following example shows.
Properties of dot product
The following basic properties of the dot product are valid for vectors from the same vector space. They are all easily proven from the definition above. In the following properties, u, v, and w are n-dimensional vectors, and λ is a number (scalar):
- u • v = v • u (commutative law);
- (u + v) • w = u • w + v • w (distributive law);
- (λ u) • v = λ (u • v) ;
- for any two column vectors u, v ∈ 𝔽n and any square matrix A, the following equation holds: u • (A v) = (ATu) • v, where AT is the transpose of the square matrix A.
- Applying the definition of the dot product to u • v and v • u, we obtain \begin{align*} \mathbf{u} \bullet \mathbf{v} &= u_1 v_1 + u_2 v_2 + \cdots + u_n v_n , \\ \mathbf{v} \bullet \mathbf{u} &= v_1 u_1 + v_2 u_2 + \cdots + v_n u_n . \end{align*} Since the product of two numbers from the field 𝔽 is commutative, we conclude that u • v = v • u.
- Since every finite dimensional vector space is isomorphic to 𝔽n, we can assume that the vectors u, v, and w belong to the Cartesian product 𝔽n. Then \[ \mathbf{u} + \mathbf{v} = \left( u_1 , \ldots , u_n \right) + \left( v_1 , \ldots , v_n \right) = \left( u_1 + v_1 , \ldots , u_n + v_n \right) . \] Taking the dot product with w, we get \[ \left( \mathbf{u} + \mathbf{v} \right) \bullet \mathbf{w} = u_1 w_1 + v_1 w_1 + \cdots + u_n w_n + v_n w_n = \mathbf{u} \bullet \mathbf{w} + \mathbf{v} \bullet \mathbf{w} . \]
(u • v)² ≤ (u • u) · (v • v)
Now suppose that neither u nor v is zero. It follows that ∥u∥ > 0 and ∥v∥ > 0 because the dot product x • x > 0 for any nonzero vector x. We have \begin{align*} 0 &\le \left( \frac{{\bf u}}{\| {\bf u} \|} + \frac{{\bf v}}{\| {\bf v} \|} \right) \bullet \left( \frac{{\bf u}}{\| {\bf u} \|} + \frac{{\bf v}}{\| {\bf v} \|} \right) \\ &= \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf u}}{\| {\bf u} \|} \right) + 2 \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) + \left( \frac{{\bf v}}{\| {\bf v} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) \\ &= \frac{1}{\| {\bf u} \|^2} \left( {\bf u} \bullet {\bf u} \right) + \frac{2}{\| {\bf u} \| \cdot \| {\bf v} \|} \left( {\bf u} \bullet {\bf v} \right) + \frac{1}{\| {\bf v} \|^2} \left( {\bf v} \bullet {\bf v} \right) \\ &= \frac{1}{\| {\bf u} \|^2} \, \| {\bf u} \|^2 + 2 \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) + \frac{1}{\| {\bf v} \|^2} \, \| {\bf v} \|^2 \\ &= 1 + 2 \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) + 1 \end{align*} Hence 0 ≤ 2 + 2 (u • v)/(∥u∥ · ∥v∥), which rearranges to −∥u∥ · ∥v∥ ≤ u • v.
Similarly, \begin{align*} 0 &\le \left( \frac{{\bf u}}{\| {\bf u} \|} - \frac{{\bf v}}{\| {\bf v} \|} \right) \bullet \left( \frac{{\bf u}}{\| {\bf u} \|} - \frac{{\bf v}}{\| {\bf v} \|} \right) \\ &= \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf u}}{\| {\bf u} \|} \right) - 2 \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) + \left( \frac{{\bf v}}{\| {\bf v} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) \\ &= \frac{1}{\| {\bf u} \|^2} \left( {\bf u} \bullet {\bf u} \right) - \frac{2}{\| {\bf u} \| \cdot \| {\bf v} \|} \left( {\bf u} \bullet {\bf v} \right) + \frac{1}{\| {\bf v} \|^2} \left( {\bf v} \bullet {\bf v} \right) \\ &= \frac{1}{\| {\bf u} \|^2} \, \| {\bf u} \|^2 - 2 \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) + \frac{1}{\| {\bf v} \|^2} \, \| {\bf v} \|^2 \\ &= 1 - 2 \left( \frac{{\bf u}}{\| {\bf u} \|} \bullet \frac{{\bf v}}{\| {\bf v} \|} \right) + 1 \end{align*} Therefore, u • v ≤ ∥u∥ · ∥v∥. By combining the two inequalities, we obtain the Cauchy inequality.
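The properties above, including the transpose identity and the Cauchy inequality, can be sanity-checked numerically; a sketch in Python/NumPy with vectors chosen arbitrarily for the check:

```python
import numpy as np

u = np.array([3.0, -2.0, 1.0])
v = np.array([4.0, 5.0, 2.0])
w = np.array([1.0, 0.0, -2.0])
A = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 3.0],
              [4.0, 0.0, 1.0]])
lam = 2.5

assert np.isclose(u @ v, v @ u)                    # commutativity
assert np.isclose((u + v) @ w, u @ w + v @ w)      # distributivity
assert np.isclose((lam * u) @ v, lam * (u @ v))    # scalar factoring
assert np.isclose(u @ (A @ v), (A.T @ u) @ v)      # u . (A v) = (A^T u) . v

# Cauchy inequality: (u . v)^2 <= (u . u)(v . v)
print((u @ v) ** 2, (u @ u) * (v @ v))   # 16.0 630.0
assert (u @ v) ** 2 <= (u @ u) * (v @ v)
```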
Note: The inequality in part (4) was first proved by the French mathematician, engineer, and physicist Baron Augustin-Louis Cauchy in 1821. In 1859, Viktor Bunyakovsky extended this inequality to its integral version (that is, to the case of continuous summation). Almost thirty years later, in 1888, Hermann Schwarz established the general form of this inequality, valid in vector spaces endowed with a dot product. Therefore, this inequality is usually referred to as the Cauchy–Schwarz inequality, the Cauchy–Bunyakovsky–Schwarz inequality, or simply the CBS inequality.
[Portraits: Augustin-Louis Cauchy | Viktor Yakovlevich Bunyakovsky | Hermann Amandus Schwarz]
Duality
In physics, dual vectors appear in various contexts, such as in describing fields (like electromagnetic fields) and in quantum mechanics (using bra-ket notation). ■
At the beginning of the twentieth century, it was discovered that the dot product is needed for the definition of dual spaces (see the corresponding section in Part 3). Although the definition of the dot product is symmetric, it is convenient to distinguish the two partners, identifying one of them as a vector, called a contravariant vector or ket-vector. When an ordered basis [e₁, e₂, … , en] is chosen in a finite dimensional vector space, every vector can be uniquely expressed as a linear combination of the basis vectors:
Using the dual basis [e¹, e², … , en], a vector from the dual space can be written as
In geometry, to distinguish the two partners in Eq.\eqref{EqDot.2}, the vector y is called a contravariant vector, and the covector x is referred to as a covariant vector. To keep track of which is which, it is common to use superscripts for the coordinates of a contravariant vector, y = [ y¹, y², y³ ], and subscripts for covariant vectors, x = [ x₁, x₂, x₃ ]. In physics, covariant vectors are also called bra-vectors, while contravariant vectors are known as ket-vectors.
It is customary to omit the summation symbol in this definition (the Einstein summation convention) and simply write
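In NumPy the summation convention can be made explicit with einsum, where the repeated index i is summed over (components chosen here for illustration):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])   # covariant components x_i
y = np.array([4.0, 5.0, 6.0])   # contravariant components y^i

# x_i y^i: the repeated index i is contracted (summed) away
s = np.einsum('i,i->', x, y)
print(s)   # 1*4 + 2*5 + 3*6 = 32.0
```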
Dot Product and Linear Transformations
Metric via Dot Product
A vector space, by definition, has no metric inside it, which is a very desirable property. It turns out that the scalar product can be used to define the length of a vector and the distance between vectors, turning ℝn into a metric space known as Euclidean space. Upon introducing the norm (meaning length or magnitude) of a vector, \( \displaystyle \quad \| {\bf v} \| = +\sqrt{{\bf v} \bullet {\bf v}} , \quad \) the Cauchy inequality can be written as
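The norm induced by the dot product, and the distance it defines, can be sketched numerically (sample vectors made up for the check):

```python
import numpy as np

v = np.array([3.0, 4.0])
norm_v = np.sqrt(v @ v)          # ||v|| = sqrt(v . v)
print(norm_v)                    # 5.0
assert np.isclose(norm_v, np.linalg.norm(v))

# distance between two vectors induced by the norm: ||u - v||
u = np.array([0.0, 0.0])
print(np.sqrt((u - v) @ (u - v)))   # 5.0
```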
Geometric Properties of the Dot Product
Many years before Gibbs's definition, the ancient Greeks knew that, geometrically, the sum of the products of the corresponding entries of two sequences of numbers equals the product of their magnitudes and the cosine of the angle between them. This leads to the introduction of a metric (or length, or distance) in the Cartesian product ℝ³, turning it into Euclidean space. Originally this was three-dimensional physical space, but in modern mathematics there are Euclidean spaces of any positive integer dimension n, called Euclidean n-spaces.
Geometrical analysis yields further interesting properties of the dot product operation that can then be used in nongeometric applications. If we rewrite the Cauchy inequality as an equality with parameter k:
Dot product in coordinate systems
The concepts of angle and radius were already used by the ancient Greek astronomer and astrologer Hipparchus (190–120 BC). Grégoire de Saint-Vincent and Bonaventura Cavalieri independently introduced the system's concepts in the mid-17th century, though the actual term polar coordinates has been attributed to Gregorio Fontana in the 18th century.
The polar coordinate system specifies a given point P(x, y) in a plane by using a distance r and an angle θ as its two coordinates (r, θ), where r is the point's distance from a reference point called the pole, and θ is the point's direction from the pole relative to the direction of the abscissa. The distance r from the pole is called the radial coordinate, radial distance or simply radius, and the angle θ is called the angular coordinate, polar angle, or azimuth.
The polar coordinates r and θ can be converted to the Cartesian coordinates x and y by using the trigonometric functions sine and cosine or complex numbers:
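In polar coordinates the dot product of two plane vectors reduces to r₁r₂ cos(θ₁ − θ₂), which can be checked against the Cartesian formula (illustrative values):

```python
import numpy as np

r1, t1 = 2.0, np.pi / 3      # first vector: radius and polar angle
r2, t2 = 3.0, np.pi / 6      # second vector

# polar -> Cartesian: x = r cos(theta), y = r sin(theta)
u = np.array([r1 * np.cos(t1), r1 * np.sin(t1)])
v = np.array([r2 * np.cos(t2), r2 * np.sin(t2)])

cartesian = u @ v
polar = r1 * r2 * np.cos(t1 - t2)
assert np.isclose(cartesian, polar)
print(polar)   # 2*3*cos(pi/6) = 3*sqrt(3) ≈ 5.196
```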
The polar coordinate system is extended to three dimensions in two ways: the cylindrical coordinate system adds a second distance coordinate, and the spherical coordinate system adds a second angular coordinate.
When ρ₁, θ₁, ϕ₁ and ρ₂, θ₂, ϕ₂ are known for two vectors u and v, we have
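Assuming the physics convention (θ the polar angle measured from the z-axis, ϕ the azimuth), the dot product in spherical coordinates is ρ₁ρ₂[ sin θ₁ sin θ₂ cos(ϕ₁ − ϕ₂) + cos θ₁ cos θ₂ ]; a numerical check with made-up coordinates:

```python
import numpy as np

def spherical_to_cartesian(rho, theta, phi):
    # theta: polar angle from the z-axis, phi: azimuth (physics convention)
    return rho * np.array([np.sin(theta) * np.cos(phi),
                           np.sin(theta) * np.sin(phi),
                           np.cos(theta)])

r1, t1, p1 = 2.0, 0.4, 1.1
r2, t2, p2 = 3.0, 1.2, 0.3

u = spherical_to_cartesian(r1, t1, p1)
v = spherical_to_cartesian(r2, t2, p2)

formula = r1 * r2 * (np.sin(t1) * np.sin(t2) * np.cos(p1 - p2)
                     + np.cos(t1) * np.cos(t2))
assert np.isclose(u @ v, formula)
```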
Applications
Scalar products are intimately associated with a variety of physical concepts. For example, the work done by a force applied at a point is defined as the product of the displacement and the component of the force in the direction of displacement (i.e., the projection of the force onto the direction of the displacement). Thus the component of the force perpendicular to the displacement "does no work." If F is the force and s is the displacement, then the work W is by definition the dot product of the force acting on the object and the displacement vector:
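A small numerical illustration of W = F • s (the force and displacement values are made up):

```python
import numpy as np

F = np.array([10.0, 0.0, 0.0])   # force in newtons, along x
s = np.array([3.0, 4.0, 0.0])    # displacement in meters

W = F @ s                        # work in joules
print(W)                         # 30.0: only the x-component of s contributes

# A force perpendicular to the displacement does no work:
F_perp = np.array([0.0, 0.0, 7.0])
print(F_perp @ s)                # 0.0
```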
Naturally, other physical quantities can be expressed in such a way. For example, the electrostatic potential energy gained by moving a charge q along a path C in an electric field E is −q∫C E • dr. We may also note that Ampère's law for the magnetic field B associated with a current-carrying wire can be written as \[ \oint_C {\bf B} \bullet {\text d}{\bf r} = \mu_0 I , \] where I is the current enclosed by a closed path C traversed in a right-handed sense with respect to the current direction. ■
It is not always guaranteed that one can use such special coordinate systems (polar coordinates are an example in which the local orthonormal basis of vectors is not the coordinate basis). However, the dot product between a vector x and a covector y is invariant under all transformations because this product defines a functional generated by covector y. Then the given dot product is just one representation of this linear functional in particular coordinates. Making linear transformation with matrix A, we get
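This invariance can be checked numerically: if the coordinates of the vector transform as y ↦ Ay, the covector must transform with the inverse transpose, x ↦ (A⁻¹)ᵀx, so the pairing x • y is unchanged. A sketch with a randomly chosen (almost surely invertible) matrix:

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 3)) + 3 * np.eye(3)   # almost surely invertible
x = rng.standard_normal(3)   # covector components
y = rng.standard_normal(3)   # vector components

y_new = A @ y                     # vector components in the new coordinates
x_new = np.linalg.inv(A).T @ x    # covector transforms contragrediently

# (A^{-T} x) . (A y) = x^T A^{-1} A y = x . y
assert np.isclose(x @ y, x_new @ y_new)
```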
We can use the dot product to find the angle between two vectors. From the definition of the dot product, we get
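As a numerical sketch (reusing the vectors a = (3, −2, 1) and b = (4, 5, 2) from the worked example above), the angle follows from cos θ = u • v / (∥u∥ ∥v∥):

```python
import numpy as np

u = np.array([3.0, -2.0, 1.0])
v = np.array([4.0, 5.0, 2.0])

cos_theta = (u @ v) / (np.linalg.norm(u) * np.linalg.norm(v))
theta = np.arccos(cos_theta)    # angle in radians
print(np.degrees(theta))        # ≈ 80.8 degrees
```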
The prime example of dot operation is work that is defined as the scalar product of force and displacement. The presence of cos(θ) ensures the requirement that the work done by a force perpendicular to the displacement is zero.
The dot product is clearly commutative, a · b = b · a. Moreover, it distributes over vector addition:
One can use the distributive property of the dot product to show that if (ax, ay, az) and (bx, by, bz) represent the components of a and b along the axes x, y, and z, then
The dot product of any two vectors of the same dimension can be computed with the Dot command, Dot[vector1, vector2], or with a period: vector1 . vector2 .
Applications in Physics
- What is the angle between the vectors i + j and i + 3j?
- What is the area of the quadrilateral with vertices at (1, 1), (4, 2), (3, 7) and (2, 3)?
- Aldaz, J. M.; Barza, S.; Fujii, M.; Moslehian, M. S. (2015), "Advances in Operator Cauchy—Schwarz inequalities and their reverses", Annals of Functional Analysis, 6 (3): 275–295, doi:10.15352/afa/06-3-20
- Bunyakovsky, Viktor (1859), "Sur quelques inégalités concernant les intégrales ordinaires et aux différences finies" (PDF), Mémoires de l'Académie des sciences de St.-Pétersbourg, 7 (1): 6
- Cauchy, A.-L. (1821), "Sur les formules qui résultent de l'emploi du signe > ou <, et sur les moyennes entre plusieurs quantités", Cours d'Analyse, 1ère Partie: Analyse Algébrique, 1821; Œuvres, Ser. 2, III: 373–377
- Dray, T. and Manogue, C.A., The Geometry of the Dot and Cross Products, Journal of Online Mathematics and Its Applications, 6.
- Gibbs, J.W. and Wilson, E.B., Vector Analysis: A Text-Book for the Use of Students of Mathematics & Physics: Founded Upon the Lectures of J. W. Gibbs, Nabu Press, 2010.
- Schwarz, H. A. (1888), "Über ein Flächen kleinsten Flächeninhalts betreffendes Problem der Variationsrechnung" (PDF), Acta Societatis Scientiarum Fennicae, XV: 318, archived (PDF) from the original on 2022-10-09
- Solomentsev, E. D. (2001) [1994], "Cauchy inequality", Encyclopedia of Mathematics, EMS Press