4.4 The Mean Value Theorem

The Mean Value Theorem is one of the most important theorems in calculus. We look at some of its implications at the end of this section. First, let’s start with a special case of the Mean Value Theorem, called Rolle’s theorem.

Rolle’s Theorem

Informally, Rolle’s theorem states that if the outputs of a differentiable function f f are equal at the endpoints of an interval, then there must be an interior point c c where f ′ ( c ) = 0 . f ′ ( c ) = 0 . Figure 4.21 illustrates this theorem.

The figure is divided into three parts labeled a, b, and c. Figure a shows the first quadrant with values a, c, and b marked on the x-axis. A downward-<a href=facing parabola is drawn such that its values at a and b are the same. The point c is the global maximum, and it is noted that f’(c) = 0. Figure b shows the first quadrant with values a, c, and b marked on the x-axis. An upward-facing parabola is drawn such that its values at a and b are the same. The point c is the global minimum, and it is noted that f’(c) = 0. Figure c shows the first quadrant with points a, c1, c2, and b marked on the x-axis. One period of a sine wave is drawn such that its values at a and b are equal. The point c1 is the global maximum, and it is noted that f’(c1) = 0. The point c2 is the global minimum, and it is noted that f’(c2) = 0." width="887" height="311" />

Figure 4.21 If a differentiable function f satisfies f ( a ) = f ( b ) , f ( a ) = f ( b ) , then its derivative must be zero at some point(s) between a a and b . b .

Theorem 4.4

Rolle’s Theorem

Let f f be a continuous function over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) ( a , b ) such that f ( a ) = f ( b ) . f ( a ) = f ( b ) . There then exists at least one c ∈ ( a , b ) c ∈ ( a , b ) such that f ′ ( c ) = 0 . f ′ ( c ) = 0 .

Proof

Let k = f ( a ) = f ( b ) . k = f ( a ) = f ( b ) . We consider three cases:

  1. f ( x ) = k f ( x ) = k for all x ∈ ( a , b ) . x ∈ ( a , b ) .
  2. There exists x ∈ ( a , b ) x ∈ ( a , b ) such that f ( x ) > k . f ( x ) > k .
  3. There exists x ∈ ( a , b ) x ∈ ( a , b ) such that f ( x ) < k . f ( x ) < k .

Case 1: If f ( x ) = k f ( x ) = k for all x ∈ ( a , b ) , x ∈ ( a , b ) , then f ′ ( x ) = 0 f ′ ( x ) = 0 for all x ∈ ( a , b ) . x ∈ ( a , b ) .

Case 2: Since f f is a continuous function over the closed, bounded interval [ a , b ] , [ a , b ] , by the extreme value theorem, it has an absolute maximum. Also, since there is a point x ∈ ( a , b ) x ∈ ( a , b ) such that f ( x ) > k , f ( x ) > k , the absolute maximum is greater than k . k . Therefore, the absolute maximum does not occur at either endpoint. As a result, the absolute maximum must occur at an interior point c ∈ ( a , b ) . c ∈ ( a , b ) . Because f f has a maximum at an interior point c , c , and f f is differentiable at c , c , by Fermat’s theorem, f ′ ( c ) = 0 . f ′ ( c ) = 0 .

Case 3: The case when there exists a point x ∈ ( a , b ) x ∈ ( a , b ) such that f ( x ) < k f ( x ) < k is analogous to case 2, with maximum replaced by minimum.

An important point about Rolle’s theorem is that the differentiability of the function f f is critical. If f f is not differentiable, even at a single point, the result may not hold. For example, the function f ( x ) = | x | − 1 f ( x ) = | x | − 1 is continuous over [ −1 , 1 ] [ −1 , 1 ] and f ( −1 ) = 0 = f ( 1 ) , f ( −1 ) = 0 = f ( 1 ) , but f ′ ( c ) ≠ 0 f ′ ( c ) ≠ 0 for any c ∈ ( −1 , 1 ) c ∈ ( −1 , 1 ) as shown in the following figure.

The function f(x) = |x| − 1 is graphed. It is shown that f(1) = f(−1), but it is noted that there is no c such that f’(c) = 0.

Figure 4.22 Since f ( x ) = | x | − 1 f ( x ) = | x | − 1 is not differentiable at x = 0 , x = 0 , the conditions of Rolle’s theorem are not satisfied. In fact, the conclusion does not hold here; there is no c ∈ ( −1 , 1 ) c ∈ ( −1 , 1 ) such that f ′ ( c ) = 0 . f ′ ( c ) = 0 .

Let’s now consider functions that satisfy the conditions of Rolle’s theorem and calculate explicitly the points c c where f ′ ( c ) = 0 . f ′ ( c ) = 0 .

Example 4.14

Using Rolle’s Theorem

For each of the following functions, verify that the function satisfies the criteria stated in Rolle’s theorem and find all values c c in the given interval where f ′ ( c ) = 0 . f ′ ( c ) = 0 .

  1. f ( x ) = x 2 + 2 x f ( x ) = x 2 + 2 x over [ −2 , 0 ] [ −2 , 0 ]
  2. f ( x ) = x 3 − 4 x f ( x ) = x 3 − 4 x over [ −2 , 2 ] [ −2 , 2 ]

Solution

  1. Since f f is a polynomial, it is continuous and differentiable everywhere. In addition, f ( −2 ) = 0 = f ( 0 ) . f ( −2 ) = 0 = f ( 0 ) . Therefore, f f satisfies the criteria of Rolle’s theorem. We conclude that there exists at least one value c ∈ ( −2 , 0 ) c ∈ ( −2 , 0 ) such that f ′ ( c ) = 0 . f ′ ( c ) = 0 . Since f ′ ( x ) = 2 x + 2 = 2 ( x + 1 ) , f ′ ( x ) = 2 x + 2 = 2 ( x + 1 ) , we see that f ′ ( c ) = 2 ( c + 1 ) = 0 f ′ ( c ) = 2 ( c + 1 ) = 0 implies c = −1 c = −1 as shown in the following graph.

The function f(x) = x2 +2x is graphed. It is shown that f(0) = f(−2), and a dashed horizontal line is drawn at the absolute minimum at (−1, −1).

Figure 4.23 This function is continuous and differentiable over [ −2 , 0 ] , [ −2 , 0 ] , f ′ ( c ) = 0 f ′ ( c ) = 0 when c = −1 . c = −1 .

The function f(x) = x3 – 4x is graphed. It is obvious that f(2) = f(−2) = f(0). Dashed horizontal lines are drawn at x = ±2/square root of 3, which are the local maximum and minimum.

Figure 4.24 For this polynomial over [ −2 , 2 ] , [ −2 , 2 ] , f ′ ( c ) = 0 f ′ ( c ) = 0 at x = ± 2 / 3 . x = ± 2 / 3 .

Checkpoint 4.14

Verify that the function f ( x ) = 2 x 2 − 8 x + 6 f ( x ) = 2 x 2 − 8 x + 6 defined over the interval [ 1 , 3 ] [ 1 , 3 ] satisfies the conditions of Rolle’s theorem. Find all points c c guaranteed by Rolle’s theorem.

The Mean Value Theorem and Its Meaning

Rolle’s theorem is a special case of the Mean Value Theorem. In Rolle’s theorem, we consider differentiable functions f f defined on a closed interval [ a , b ] [ a , b ] with f ( a ) = f ( b ) f ( a ) = f ( b ) . The Mean Value Theorem generalizes Rolle’s theorem by considering functions that do not necessarily have equal value at the endpoints. Consequently, we can view the Mean Value Theorem as a slanted version of Rolle’s theorem (Figure 4.25). The Mean Value Theorem states that if f f is continuous over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) , ( a , b ) , then there exists a point c ∈ ( a , b ) c ∈ ( a , b ) such that the tangent line to the graph of f f at c c is parallel to the secant line connecting ( a , f ( a ) ) ( a , f ( a ) ) and ( b , f ( b ) ) . ( b , f ( b ) ) .

A vaguely sinusoidal function y = f(x) is drawn. On the x-axis, a, c1, c2, and b are marked. On the y-axis, f(a) and f(b) are marked. The function f(x) starts at (a, f(a)), decreases to c1, increases to c2, and then decreases to (b, f(b)). A secant line is drawn between (a, f(a)) and (b, f(b)), and it is noted that this line has slope (f(b) – f(a))/(b − a). The tangent lines at c1 and c2 are drawn, and these lines are parallel to the secant line. It is noted that the slopes of these tangent lines are f’(c1) and f’(c2), respectively.

Figure 4.25 The Mean Value Theorem says that for a function that meets its conditions, at some point the tangent line has the same slope as the secant line between the ends. For this function, there are two values c 1 c 1 and c 2 c 2 such that the tangent line to f f at c 1 c 1 and c 2 c 2 has the same slope as the secant line.

Theorem 4.5

Mean Value Theorem

Let f f be continuous over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) . ( a , b ) . Then, there exists at least one point c ∈ ( a , b ) c ∈ ( a , b ) such that

f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a .

Proof

The proof follows from Rolle’s theorem by introducing an appropriate function that satisfies the criteria of Rolle’s theorem. Consider the line connecting ( a , f ( a ) ) ( a , f ( a ) ) and ( b , f ( b ) ) . ( b , f ( b ) ) . Since the slope of that line is

f ( b ) − f ( a ) b − a f ( b ) − f ( a ) b − a

and the line passes through the point ( a , f ( a ) ) , ( a , f ( a ) ) , the equation of that line can be written as

y = f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) . y = f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) .

Let g ( x ) g ( x ) denote the vertical difference between the point ( x , f ( x ) ) ( x , f ( x ) ) and the point ( x , y ) ( x , y ) on that line. Therefore,

g ( x ) = f ( x ) − [ f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) ] . g ( x ) = f ( x ) − [ f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) ] .

A vaguely sinusoidal function y = f(x) is drawn. On the x-axis, a and b are marked. On the y-axis, f(a) and f(b) are marked. The function f(x) starts at (a, f(a)), decreases, then increases, and then decreases to (b, f(b)). A secant line is drawn between (a, f(a)) and (b, f(b)), and it is noted that this line has equation y = ((f(b) – f(a))/(b − a)) (x − a) + f(a). A line is drawn between the maximum of f(x) and the secant line and it is marked g(x).

Figure 4.26 The value g ( x ) g ( x ) is the vertical difference between the point ( x , f ( x ) ) ( x , f ( x ) ) and the point ( x , y ) ( x , y ) on the secant line connecting ( a , f ( a ) ) ( a , f ( a ) ) and ( b , f ( b ) ) . ( b , f ( b ) ) .

Since the graph of f f intersects the secant line when x = a x = a and x = b , x = b , we see that g ( a ) = 0 = g ( b ) . g ( a ) = 0 = g ( b ) . Since f f is a differentiable function over ( a , b ) , ( a , b ) , g g is also a differentiable function over ( a , b ) . ( a , b ) . Furthermore, since f f is continuous over [ a , b ] , [ a , b ] , g g is also continuous over [ a , b ] . [ a , b ] . Therefore, g g satisfies the criteria of Rolle’s theorem. Consequently, there exists a point c ∈ ( a , b ) c ∈ ( a , b ) such that g ′ ( c ) = 0 . g ′ ( c ) = 0 . Since

g ′ ( x ) = f ′ ( x ) − f ( b ) − f ( a ) b − a , g ′ ( x ) = f ′ ( x ) − f ( b ) − f ( a ) b − a , g ′ ( c ) = f ′ ( c ) − f ( b ) − f ( a ) b − a . g ′ ( c ) = f ′ ( c ) − f ( b ) − f ( a ) b − a .

Since g ′ ( c ) = 0 , g ′ ( c ) = 0 , we conclude that

f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a .

In the next example, we show how the Mean Value Theorem can be applied to the function f ( x ) = x f ( x ) = x over the interval [ 0 , 9 ] . [ 0 , 9 ] . The method is the same for other functions, although sometimes with more interesting consequences.

Example 4.15

Verifying that the Mean Value Theorem Applies

For f ( x ) = x f ( x ) = x over the interval [ 0 , 9 ] , [ 0 , 9 ] , show that f f satisfies the hypothesis of the Mean Value Theorem, and therefore there exists at least one value c ∈ ( 0 , 9 ) c ∈ ( 0 , 9 ) such that f ′ ( c ) f ′ ( c ) is equal to the slope of the line connecting ( 0 , f ( 0 ) ) ( 0 , f ( 0 ) ) and ( 9 , f ( 9 ) ) . ( 9 , f ( 9 ) ) . Find these values c c guaranteed by the Mean Value Theorem.

Solution

We know that f ( x ) = x f ( x ) = x is continuous over [ 0 , 9 ] [ 0 , 9 ] and differentiable over ( 0 , 9 ) . ( 0 , 9 ) . Therefore, f f satisfies the hypotheses of the Mean Value Theorem, and there must exist at least one value c ∈ ( 0 , 9 ) c ∈ ( 0 , 9 ) such that f ′ ( c ) f ′ ( c ) is equal to the slope of the line connecting ( 0 , f ( 0 ) ) ( 0 , f ( 0 ) ) and ( 9 , f ( 9 ) ) ( 9 , f ( 9 ) ) (Figure 4.27). To determine which value(s) of c c are guaranteed, first calculate the derivative of f . f . The derivative f ′ ( x ) = 1 ( 2 x ) . f ′ ( x ) = 1 ( 2 x ) . The slope of the line connecting ( 0 , f ( 0 ) ) ( 0 , f ( 0 ) ) and ( 9 , f ( 9 ) ) ( 9 , f ( 9 ) ) is given by

f ( 9 ) − f ( 0 ) 9 − 0 = 9 − 0 9 − 0 = 3 9 = 1 3 . f ( 9 ) − f ( 0 ) 9 − 0 = 9 − 0 9 − 0 = 3 9 = 1 3 .

We want to find c c such that f ′ ( c ) = 1 3 . f ′ ( c ) = 1 3 . That is, we want to find c c such that

1 2 c = 1 3 . 1 2 c = 1 3 .

Solving this equation for c , c , we obtain c = 9 4 . c = 9 4 . At this point, the slope of the tangent line equals the slope of the line joining the endpoints.

The function f(x) = the square root of x is graphed from (0, 0) to (9, 3). There is a secant line drawn from (0, 0) to (9, 3). At point (9/4, 3/2), there is a tangent line that is drawn, and this line is parallel to the secant line.

Figure 4.27 The slope of the tangent line at c = 9 / 4 c = 9 / 4 is the same as the slope of the line segment connecting ( 0 , 0 ) ( 0 , 0 ) and ( 9 , 3 ) . ( 9 , 3 ) .

One application that helps illustrate the Mean Value Theorem involves velocity. For example, suppose we drive a car for 1 h down a straight road with an average velocity of 45 mph. Let s ( t ) s ( t ) and v ( t ) v ( t ) denote the position and velocity of the car, respectively, for 0 ≤ t ≤ 1 0 ≤ t ≤ 1 h. Assuming that the position function s ( t ) s ( t ) is differentiable, we can apply the Mean Value Theorem to conclude that, at some time c ∈ ( 0 , 1 ) , c ∈ ( 0 , 1 ) , the speed of the car was exactly

v ( c ) = s ′ ( c ) = s ( 1 ) − s ( 0 ) 1 − 0 = 45 mph . v ( c ) = s ′ ( c ) = s ( 1 ) − s ( 0 ) 1 − 0 = 45 mph .

Example 4.16

Mean Value Theorem and Velocity

If a rock is dropped from a height of 100 ft, its position t t seconds after it is dropped until it hits the ground is given by the function s ( t ) = −16 t 2 + 100 . s ( t ) = −16 t 2 + 100 .

  1. Determine how long it takes before the rock hits the ground.
  2. Find the average velocity v avg v avg of the rock for when the rock is released and the rock hits the ground.
  3. Find the time t t guaranteed by the Mean Value Theorem when the instantaneous velocity of the rock is v avg . v avg .

Solution

  1. When the rock hits the ground, its position is s ( t ) = 0 . s ( t ) = 0 . Solving the equation −16 t 2 + 100 = 0 −16 t 2 + 100 = 0 for t , t , we find that t = ± 5 2 sec . t = ± 5 2 sec . Since we are only considering t ≥ 0 , t ≥ 0 , the ball will hit the ground 5 2 5 2 sec after it is dropped.
  2. The average velocity is given by

v avg = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = 0 − 100 5 / 2 = −40 ft/sec . v avg = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = 0 − 100 5 / 2 = −40 ft/sec .

s ′ ( c ) = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = −40 . s ′ ( c ) = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = −40 .


Taking the derivative of the position function s ( t ) , s ( t ) , we find that s ′ ( t ) = −32 t . s ′ ( t ) = −32 t . Therefore, the equation reduces to s ′ ( c ) = −32 c = −40 . s ′ ( c ) = −32 c = −40 . Solving this equation for c , c , we have c = 5 4 . c = 5 4 . Therefore, 5 4 5 4 sec after the rock is dropped, the instantaneous velocity equals the average velocity of the rock during its free fall: −40 −40 ft/sec.

The function s(t) = −16t2 + 100 is graphed from (0, 100) to (5/2, 0). There is a secant line drawn from (0, 100) to (5/2, 0). At the point corresponding to t = 5/4, there is a tangent line that is drawn, and this line is parallel to the secant line.

Figure 4.28 At time t = 5 / 4 t = 5 / 4 sec, the velocity of the rock is equal to its average velocity from the time it is dropped until it hits the ground.

Checkpoint 4.15

Suppose a ball is dropped from a height of 200 ft. Its position at time t t is s ( t ) = −16 t 2 + 200 . s ( t ) = −16 t 2 + 200 . Find the time t t when the instantaneous velocity of the ball equals its average velocity.

Corollaries of the Mean Value Theorem

Let’s now look at three corollaries of the Mean Value Theorem. These results have important consequences, which we use in upcoming sections.

At this point, we know the derivative of any constant function is zero. The Mean Value Theorem allows us to conclude that the converse is also true. In particular, if f ′ ( x ) = 0 f ′ ( x ) = 0 for all x x in some interval I , I , then f ( x ) f ( x ) is constant over that interval. This result may seem intuitively obvious, but it has important implications that are not obvious, and we discuss them shortly.

Theorem 4.6

Corollary 1: Functions with a Derivative of Zero

Let f f be differentiable over an interval I . I . If f ′ ( x ) = 0 f ′ ( x ) = 0 for all x ∈ I , x ∈ I , then f ( x ) = f ( x ) = constant for all x ∈ I . x ∈ I .

Proof

Since f f is differentiable over I , I , f f must be continuous over I . I . Suppose f ( x ) f ( x ) is not constant for all x x in I . I . Then there exist a , b ∈ I , a , b ∈ I , where a ≠ b a ≠ b and f ( a ) ≠ f ( b ) . f ( a ) ≠ f ( b ) . Choose the notation so that a < b . a < b . Therefore,

f ( b ) − f ( a ) b − a ≠ 0 . f ( b ) − f ( a ) b − a ≠ 0 .

Since f f is a differentiable function, by the Mean Value Theorem, there exists c ∈ ( a , b ) c ∈ ( a , b ) such that

f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a .

Therefore, there exists c ∈ I c ∈ I such that f ′ ( c ) ≠ 0 , f ′ ( c ) ≠ 0 , which contradicts the assumption that f ′ ( x ) = 0 f ′ ( x ) = 0 for all x ∈ I . x ∈ I .

From Corollary 1: Functions with a Derivative of Zero, it follows that if two functions have the same derivative, they differ by, at most, a constant.

Theorem 4.7

Corollary 2: Constant Difference Theorem

If f f and g g are differentiable over an interval I I and f ′ ( x ) = g ′ ( x ) f ′ ( x ) = g ′ ( x ) for all x ∈ I , x ∈ I , then f ( x ) = g ( x ) + C f ( x ) = g ( x ) + C for some constant C . C .

Proof

Let h ( x ) = f ( x ) − g ( x ) . h ( x ) = f ( x ) − g ( x ) . Then, h ′ ( x ) = f ′ ( x ) − g ′ ( x ) = 0 h ′ ( x ) = f ′ ( x ) − g ′ ( x ) = 0 for all x ∈ I . x ∈ I . By Corollary 1, there is a constant C C such that h ( x ) = C h ( x ) = C for all x ∈ I . x ∈ I . Therefore, f ( x ) = g ( x ) + C f ( x ) = g ( x ) + C for all x ∈ I . x ∈ I .

This fact is important because it means that for a given function f , f , if there exists a function F F such that F ′ ( x ) = f ( x ) ; F ′ ( x ) = f ( x ) ; then, the only other functions that have a derivative equal to f f are F ( x ) + C F ( x ) + C for some constant C . C . We discuss this result in more detail later in the chapter.

Figure 4.29 If a function has a positive derivative over some interval I , I , then the function increases over that interval I ; I ; if the derivative is negative over some interval I , I , then the function decreases over that interval I . I .

Theorem 4.8

Corollary 3: Increasing and Decreasing Functions

Let f f be continuous over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) . ( a , b ) .

  1. If f ′ ( x ) > 0 f ′ ( x ) > 0 for all x ∈ ( a , b ) , x ∈ ( a , b ) , then f f is an increasing function over [ a , b ] . [ a , b ] .
  2. If f ′ ( x ) < 0 f ′ ( x ) < 0 for all x ∈ ( a , b ) , x ∈ ( a , b ) , then f f is a decreasing function over [ a , b ] . [ a , b ] .

Proof

We will prove i.; the proof of ii. is similar. Suppose f f is continuous and differentiable over an interval I I and f ' x > 0 f ' x > 0 for all x ∈ I x ∈ I . By way of contradiction, suppose that f f is not an increasing function on I I . Then there exist a a and b b in I I such that a < b a < b , but f a >f b f a > f b . Since f f is a differentiable function over I I , the Mean Value Theorem guarantees that there is some value c ∈ a , b c ∈ a , b such that

f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a < 0 . f ′ ( c ) = f ( b ) − f ( a ) b − a < 0 .

However, f ′ ( x ) > 0 f ′ ( x ) > 0 for all x ∈ I x ∈ I including c c . This is a contradiction. The assumption that f f is not an increasing function on I I is false. Therefore, f f must be increasing throughout I I .

Section 4.4 Exercises

Why do you need continuity to apply the Mean Value Theorem? Construct a counterexample.

Why do you need differentiability to apply the Mean Value Theorem? Find a counterexample.

When are Rolle’s theorem and the Mean Value Theorem equivalent?

If you have a function with a discontinuity, is it still possible to have f ′ ( c ) ( b − a ) = f ( b ) − f ( a ) ? f ′ ( c ) ( b − a ) = f ( b ) − f ( a ) ? Draw such an example or prove why not.

For the following exercises, determine over what intervals (if any) the Mean Value Theorem applies. Justify your answer.

y = sin ( π x ) y = sin ( π x )