The Mean Value Theorem is one of the most important theorems in calculus. We look at some of its implications at the end of this section. First, let’s start with a special case of the Mean Value Theorem, called Rolle’s theorem.
Informally, Rolle’s theorem states that if the outputs of a differentiable function f f are equal at the endpoints of an interval, then there must be an interior point c c where f ′ ( c ) = 0 . f ′ ( c ) = 0 . Figure 4.21 illustrates this theorem.
facing parabola is drawn such that its values at a and b are the same. The point c is the global maximum, and it is noted that f’(c) = 0. Figure b shows the first quadrant with values a, c, and b marked on the x-axis. An upward-facing parabola is drawn such that its values at a and b are the same. The point c is the global minimum, and it is noted that f’(c) = 0. Figure c shows the first quadrant with points a, c1, c2, and b marked on the x-axis. One period of a sine wave is drawn such that its values at a and b are equal. The point c1 is the global maximum, and it is noted that f’(c1) = 0. The point c2 is the global minimum, and it is noted that f’(c2) = 0." width="887" height="311" />
Figure 4.21 If a differentiable function f satisfies f ( a ) = f ( b ) , f ( a ) = f ( b ) , then its derivative must be zero at some point(s) between a a and b . b .
Let f f be a continuous function over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) ( a , b ) such that f ( a ) = f ( b ) . f ( a ) = f ( b ) . There then exists at least one c ∈ ( a , b ) c ∈ ( a , b ) such that f ′ ( c ) = 0 . f ′ ( c ) = 0 .
Let k = f ( a ) = f ( b ) . k = f ( a ) = f ( b ) . We consider three cases:
Case 1: If f ( x ) = k f ( x ) = k for all x ∈ ( a , b ) , x ∈ ( a , b ) , then f ′ ( x ) = 0 f ′ ( x ) = 0 for all x ∈ ( a , b ) . x ∈ ( a , b ) .
Case 2: Since f f is a continuous function over the closed, bounded interval [ a , b ] , [ a , b ] , by the extreme value theorem, it has an absolute maximum. Also, since there is a point x ∈ ( a , b ) x ∈ ( a , b ) such that f ( x ) > k , f ( x ) > k , the absolute maximum is greater than k . k . Therefore, the absolute maximum does not occur at either endpoint. As a result, the absolute maximum must occur at an interior point c ∈ ( a , b ) . c ∈ ( a , b ) . Because f f has a maximum at an interior point c , c , and f f is differentiable at c , c , by Fermat’s theorem, f ′ ( c ) = 0 . f ′ ( c ) = 0 .
Case 3: The case when there exists a point x ∈ ( a , b ) x ∈ ( a , b ) such that f ( x ) < k f ( x ) < k is analogous to case 2, with maximum replaced by minimum.
An important point about Rolle’s theorem is that the differentiability of the function f f is critical. If f f is not differentiable, even at a single point, the result may not hold. For example, the function f ( x ) = | x | − 1 f ( x ) = | x | − 1 is continuous over [ −1 , 1 ] [ −1 , 1 ] and f ( −1 ) = 0 = f ( 1 ) , f ( −1 ) = 0 = f ( 1 ) , but f ′ ( c ) ≠ 0 f ′ ( c ) ≠ 0 for any c ∈ ( −1 , 1 ) c ∈ ( −1 , 1 ) as shown in the following figure.
Figure 4.22 Since f ( x ) = | x | − 1 f ( x ) = | x | − 1 is not differentiable at x = 0 , x = 0 , the conditions of Rolle’s theorem are not satisfied. In fact, the conclusion does not hold here; there is no c ∈ ( −1 , 1 ) c ∈ ( −1 , 1 ) such that f ′ ( c ) = 0 . f ′ ( c ) = 0 .
Let’s now consider functions that satisfy the conditions of Rolle’s theorem and calculate explicitly the points c c where f ′ ( c ) = 0 . f ′ ( c ) = 0 .
For each of the following functions, verify that the function satisfies the criteria stated in Rolle’s theorem and find all values c c in the given interval where f ′ ( c ) = 0 . f ′ ( c ) = 0 .
Figure 4.23 This function is continuous and differentiable over [ −2 , 0 ] , [ −2 , 0 ] , f ′ ( c ) = 0 f ′ ( c ) = 0 when c = −1 . c = −1 .
Figure 4.24 For this polynomial over [ −2 , 2 ] , [ −2 , 2 ] , f ′ ( c ) = 0 f ′ ( c ) = 0 at x = ± 2 / 3 . x = ± 2 / 3 .
Verify that the function f ( x ) = 2 x 2 − 8 x + 6 f ( x ) = 2 x 2 − 8 x + 6 defined over the interval [ 1 , 3 ] [ 1 , 3 ] satisfies the conditions of Rolle’s theorem. Find all points c c guaranteed by Rolle’s theorem.
Rolle’s theorem is a special case of the Mean Value Theorem. In Rolle’s theorem, we consider differentiable functions f f defined on a closed interval [ a , b ] [ a , b ] with f ( a ) = f ( b ) f ( a ) = f ( b ) . The Mean Value Theorem generalizes Rolle’s theorem by considering functions that do not necessarily have equal value at the endpoints. Consequently, we can view the Mean Value Theorem as a slanted version of Rolle’s theorem (Figure 4.25). The Mean Value Theorem states that if f f is continuous over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) , ( a , b ) , then there exists a point c ∈ ( a , b ) c ∈ ( a , b ) such that the tangent line to the graph of f f at c c is parallel to the secant line connecting ( a , f ( a ) ) ( a , f ( a ) ) and ( b , f ( b ) ) . ( b , f ( b ) ) .
Figure 4.25 The Mean Value Theorem says that for a function that meets its conditions, at some point the tangent line has the same slope as the secant line between the ends. For this function, there are two values c 1 c 1 and c 2 c 2 such that the tangent line to f f at c 1 c 1 and c 2 c 2 has the same slope as the secant line.
Let f f be continuous over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) . ( a , b ) . Then, there exists at least one point c ∈ ( a , b ) c ∈ ( a , b ) such that
f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a .The proof follows from Rolle’s theorem by introducing an appropriate function that satisfies the criteria of Rolle’s theorem. Consider the line connecting ( a , f ( a ) ) ( a , f ( a ) ) and ( b , f ( b ) ) . ( b , f ( b ) ) . Since the slope of that line is
f ( b ) − f ( a ) b − a f ( b ) − f ( a ) b − aand the line passes through the point ( a , f ( a ) ) , ( a , f ( a ) ) , the equation of that line can be written as
y = f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) . y = f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) .Let g ( x ) g ( x ) denote the vertical difference between the point ( x , f ( x ) ) ( x , f ( x ) ) and the point ( x , y ) ( x , y ) on that line. Therefore,
g ( x ) = f ( x ) − [ f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) ] . g ( x ) = f ( x ) − [ f ( b ) − f ( a ) b − a ( x − a ) + f ( a ) ] .
Figure 4.26 The value g ( x ) g ( x ) is the vertical difference between the point ( x , f ( x ) ) ( x , f ( x ) ) and the point ( x , y ) ( x , y ) on the secant line connecting ( a , f ( a ) ) ( a , f ( a ) ) and ( b , f ( b ) ) . ( b , f ( b ) ) .
Since the graph of f f intersects the secant line when x = a x = a and x = b , x = b , we see that g ( a ) = 0 = g ( b ) . g ( a ) = 0 = g ( b ) . Since f f is a differentiable function over ( a , b ) , ( a , b ) , g g is also a differentiable function over ( a , b ) . ( a , b ) . Furthermore, since f f is continuous over [ a , b ] , [ a , b ] , g g is also continuous over [ a , b ] . [ a , b ] . Therefore, g g satisfies the criteria of Rolle’s theorem. Consequently, there exists a point c ∈ ( a , b ) c ∈ ( a , b ) such that g ′ ( c ) = 0 . g ′ ( c ) = 0 . Since
g ′ ( x ) = f ′ ( x ) − f ( b ) − f ( a ) b − a , g ′ ( x ) = f ′ ( x ) − f ( b ) − f ( a ) b − a , g ′ ( c ) = f ′ ( c ) − f ( b ) − f ( a ) b − a . g ′ ( c ) = f ′ ( c ) − f ( b ) − f ( a ) b − a .Since g ′ ( c ) = 0 , g ′ ( c ) = 0 , we conclude that
f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a .In the next example, we show how the Mean Value Theorem can be applied to the function f ( x ) = x f ( x ) = x over the interval [ 0 , 9 ] . [ 0 , 9 ] . The method is the same for other functions, although sometimes with more interesting consequences.
For f ( x ) = x f ( x ) = x over the interval [ 0 , 9 ] , [ 0 , 9 ] , show that f f satisfies the hypothesis of the Mean Value Theorem, and therefore there exists at least one value c ∈ ( 0 , 9 ) c ∈ ( 0 , 9 ) such that f ′ ( c ) f ′ ( c ) is equal to the slope of the line connecting ( 0 , f ( 0 ) ) ( 0 , f ( 0 ) ) and ( 9 , f ( 9 ) ) . ( 9 , f ( 9 ) ) . Find these values c c guaranteed by the Mean Value Theorem.
We know that f ( x ) = x f ( x ) = x is continuous over [ 0 , 9 ] [ 0 , 9 ] and differentiable over ( 0 , 9 ) . ( 0 , 9 ) . Therefore, f f satisfies the hypotheses of the Mean Value Theorem, and there must exist at least one value c ∈ ( 0 , 9 ) c ∈ ( 0 , 9 ) such that f ′ ( c ) f ′ ( c ) is equal to the slope of the line connecting ( 0 , f ( 0 ) ) ( 0 , f ( 0 ) ) and ( 9 , f ( 9 ) ) ( 9 , f ( 9 ) ) (Figure 4.27). To determine which value(s) of c c are guaranteed, first calculate the derivative of f . f . The derivative f ′ ( x ) = 1 ( 2 x ) . f ′ ( x ) = 1 ( 2 x ) . The slope of the line connecting ( 0 , f ( 0 ) ) ( 0 , f ( 0 ) ) and ( 9 , f ( 9 ) ) ( 9 , f ( 9 ) ) is given by
f ( 9 ) − f ( 0 ) 9 − 0 = 9 − 0 9 − 0 = 3 9 = 1 3 . f ( 9 ) − f ( 0 ) 9 − 0 = 9 − 0 9 − 0 = 3 9 = 1 3 .
We want to find c c such that f ′ ( c ) = 1 3 . f ′ ( c ) = 1 3 . That is, we want to find c c such that
1 2 c = 1 3 . 1 2 c = 1 3 .Solving this equation for c , c , we obtain c = 9 4 . c = 9 4 . At this point, the slope of the tangent line equals the slope of the line joining the endpoints.
Figure 4.27 The slope of the tangent line at c = 9 / 4 c = 9 / 4 is the same as the slope of the line segment connecting ( 0 , 0 ) ( 0 , 0 ) and ( 9 , 3 ) . ( 9 , 3 ) .
One application that helps illustrate the Mean Value Theorem involves velocity. For example, suppose we drive a car for 1 h down a straight road with an average velocity of 45 mph. Let s ( t ) s ( t ) and v ( t ) v ( t ) denote the position and velocity of the car, respectively, for 0 ≤ t ≤ 1 0 ≤ t ≤ 1 h. Assuming that the position function s ( t ) s ( t ) is differentiable, we can apply the Mean Value Theorem to conclude that, at some time c ∈ ( 0 , 1 ) , c ∈ ( 0 , 1 ) , the speed of the car was exactly
v ( c ) = s ′ ( c ) = s ( 1 ) − s ( 0 ) 1 − 0 = 45 mph . v ( c ) = s ′ ( c ) = s ( 1 ) − s ( 0 ) 1 − 0 = 45 mph .
If a rock is dropped from a height of 100 ft, its position t t seconds after it is dropped until it hits the ground is given by the function s ( t ) = −16 t 2 + 100 . s ( t ) = −16 t 2 + 100 .
v avg = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = 0 − 100 5 / 2 = −40 ft/sec . v avg = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = 0 − 100 5 / 2 = −40 ft/sec .
s ′ ( c ) = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = −40 . s ′ ( c ) = s ( 5 / 2 ) − s ( 0 ) 5 / 2 − 0 = −40 .
Taking the derivative of the position function s ( t ) , s ( t ) , we find that s ′ ( t ) = −32 t . s ′ ( t ) = −32 t . Therefore, the equation reduces to s ′ ( c ) = −32 c = −40 . s ′ ( c ) = −32 c = −40 . Solving this equation for c , c , we have c = 5 4 . c = 5 4 . Therefore, 5 4 5 4 sec after the rock is dropped, the instantaneous velocity equals the average velocity of the rock during its free fall: −40 −40 ft/sec.
Figure 4.28 At time t = 5 / 4 t = 5 / 4 sec, the velocity of the rock is equal to its average velocity from the time it is dropped until it hits the ground.
Suppose a ball is dropped from a height of 200 ft. Its position at time t t is s ( t ) = −16 t 2 + 200 . s ( t ) = −16 t 2 + 200 . Find the time t t when the instantaneous velocity of the ball equals its average velocity.
Let’s now look at three corollaries of the Mean Value Theorem. These results have important consequences, which we use in upcoming sections.
At this point, we know the derivative of any constant function is zero. The Mean Value Theorem allows us to conclude that the converse is also true. In particular, if f ′ ( x ) = 0 f ′ ( x ) = 0 for all x x in some interval I , I , then f ( x ) f ( x ) is constant over that interval. This result may seem intuitively obvious, but it has important implications that are not obvious, and we discuss them shortly.
Let f f be differentiable over an interval I . I . If f ′ ( x ) = 0 f ′ ( x ) = 0 for all x ∈ I , x ∈ I , then f ( x ) = f ( x ) = constant for all x ∈ I . x ∈ I .
Since f f is differentiable over I , I , f f must be continuous over I . I . Suppose f ( x ) f ( x ) is not constant for all x x in I . I . Then there exist a , b ∈ I , a , b ∈ I , where a ≠ b a ≠ b and f ( a ) ≠ f ( b ) . f ( a ) ≠ f ( b ) . Choose the notation so that a < b . a < b . Therefore,
f ( b ) − f ( a ) b − a ≠ 0 . f ( b ) − f ( a ) b − a ≠ 0 .Since f f is a differentiable function, by the Mean Value Theorem, there exists c ∈ ( a , b ) c ∈ ( a , b ) such that
f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a .Therefore, there exists c ∈ I c ∈ I such that f ′ ( c ) ≠ 0 , f ′ ( c ) ≠ 0 , which contradicts the assumption that f ′ ( x ) = 0 f ′ ( x ) = 0 for all x ∈ I . x ∈ I .
From Corollary 1: Functions with a Derivative of Zero, it follows that if two functions have the same derivative, they differ by, at most, a constant.
If f f and g g are differentiable over an interval I I and f ′ ( x ) = g ′ ( x ) f ′ ( x ) = g ′ ( x ) for all x ∈ I , x ∈ I , then f ( x ) = g ( x ) + C f ( x ) = g ( x ) + C for some constant C . C .
Let h ( x ) = f ( x ) − g ( x ) . h ( x ) = f ( x ) − g ( x ) . Then, h ′ ( x ) = f ′ ( x ) − g ′ ( x ) = 0 h ′ ( x ) = f ′ ( x ) − g ′ ( x ) = 0 for all x ∈ I . x ∈ I . By Corollary 1, there is a constant C C such that h ( x ) = C h ( x ) = C for all x ∈ I . x ∈ I . Therefore, f ( x ) = g ( x ) + C f ( x ) = g ( x ) + C for all x ∈ I . x ∈ I .
This fact is important because it means that for a given function f , f , if there exists a function F F such that F ′ ( x ) = f ( x ) ; F ′ ( x ) = f ( x ) ; then, the only other functions that have a derivative equal to f f are F ( x ) + C F ( x ) + C for some constant C . C . We discuss this result in more detail later in the chapter.
Figure 4.29 If a function has a positive derivative over some interval I , I , then the function increases over that interval I ; I ; if the derivative is negative over some interval I , I , then the function decreases over that interval I . I .
Let f f be continuous over the closed interval [ a , b ] [ a , b ] and differentiable over the open interval ( a , b ) . ( a , b ) .
We will prove i.; the proof of ii. is similar. Suppose f f is continuous and differentiable over an interval I I and f ' x > 0 f ' x > 0 for all x ∈ I x ∈ I . By way of contradiction, suppose that f f is not an increasing function on I I . Then there exist a a and b b in I I such that a < b a < b , but f a >f b f a > f b . Since f f is a differentiable function over I I , the Mean Value Theorem guarantees that there is some value c ∈ a , b c ∈ a , b such that
f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a . f ′ ( c ) = f ( b ) − f ( a ) b − a < 0 . f ′ ( c ) = f ( b ) − f ( a ) b − a < 0 .However, f ′ ( x ) > 0 f ′ ( x ) > 0 for all x ∈ I x ∈ I including c c . This is a contradiction. The assumption that f f is not an increasing function on I I is false. Therefore, f f must be increasing throughout I I .
Why do you need continuity to apply the Mean Value Theorem? Construct a counterexample.
Why do you need differentiability to apply the Mean Value Theorem? Find a counterexample.
When are Rolle’s theorem and the Mean Value Theorem equivalent?
If you have a function with a discontinuity, is it still possible to have f ′ ( c ) ( b − a ) = f ( b ) − f ( a ) ? f ′ ( c ) ( b − a ) = f ( b ) − f ( a ) ? Draw such an example or prove why not.
For the following exercises, determine over what intervals (if any) the Mean Value Theorem applies. Justify your answer.
y = sin ( π x ) y = sin ( π x )