| You are viewing the html version of Optics, by Benjamin Crowell. This version is only designed for casual browsing, and may have some formatting problems. For serious reading, you want the printer-friendly Adobe Acrobat version. (c) 1998-2006 Benjamin Crowell, licensed under the Creative Commons Attribution-ShareAlike license, or, at your option, the GFDL license. Photo credits are given at the end of the Adobe Acrobat version. |

Breakfast Table, by Willem Clasz. de Heda, 17th century. The painting shows a variety of images, some of them distorted, resulting both from reflection and from refraction (ch. 4).
It sounds a bit odd when a scientist refers to a theory as “beautiful,” but to those in the know it makes perfect sense. One mark of a beautiful theory is that it surprises us by being simple. The mathematical theory of lenses and curved mirrors gives us just such a surprise. We expect the subject to be complex because there are so many cases: a converging mirror forming a real image, a diverging lens that makes a virtual image, and so on for a total of six possibilities. If we want to predict the location of the images in all these situations, we might expect to need six different equations, and six more for predicting magnifications. Instead, it turns out that we can use just one equation for the location of the image and one equation for its magnification, and these two equations work in all the different cases with no changes except for plus and minus signs. This is the kind of thing the physicist Eugene Wigner referred to as “the unreasonable effectiveness of mathematics.” Sometimes we can find a deeper reason for this kind of unexpected simplicity, but sometimes it almost seems as if God went out of Her way to make the secrets of universe susceptible to attack by the human thought-tool called math.

a / The relationship between the object's position and the image's can be expressed in terms of the angles θo and θi.

b / The geometrical interpretation of the focal angle.

c / Example 1, an alternative test for finding the focal angle. The mirror is the same as in figure b.

d / The object and image distances

e / Mirror 1 is weaker than mirror 2. It has a shallower curvature, a longer focal length, and a smaller focal angle. It reflects rays at angles not much different than those that would be produced with a flat mirror.
We will now derive the equation for the location of a real image formed by a converging mirror. We assume for simplicity that the mirror is spherical, but actually this isn't a restrictive assumption, because any shallow, symmetric curve can be approximated by a sphere. The shape of the mirror can be specified by giving the location of its center, C. A deeply curved mirror is a sphere with a small radius, so C is close to it, while a weakly curved mirror has C farther away. Given the point O where the object is, we wish to find the point I where the image will be formed.
To locate an image, we need to track a minimum of two rays coming from the same point. Since we have proved in the previous chapter that this type of image is not distorted, we can use an on-axis point, O, on the object, as in figure a/1. The results we derive will also hold for off-axis points, since otherwise the image would have to be distorted, which we know is not true. We let one of the rays be the one that is emitted along the axis; this ray is especially easy to trace, because it bounces straight back along the axis again. As our second ray, we choose one that strikes the mirror at a distance of 1 from the axis. “One what?” asks the astute reader. The answer is that it doesn't really matter. When a mirror has shallow curvature, all the reflected rays hit the same point, so 1 could be expressed in any units you like. It could, for instance, be 1 cm, unless your mirror is smaller than 1 cm!
The only way to find out anything mathematical about the rays is to use the sole mathematical fact we possess concerning specular reflection: the incident and reflected rays form equal angles with respect to the normal, which is shown as a dashed line. Therefore the two angles shown in figure a/2 are the same, and skipping some straightforward geometry, this leads to the visually reasonable result that the two angles in figure a/3 are related as follows:
(Note that θi and θo, which are measured
from the image and the object, not from the eye like the
angles we referred to in discussing angular magnification
on page 36.)
For example, move O farther from the
mirror. The top angle in figure a/2 is increased, so the
bottom angle must increase by the same amount, causing the
image point, I, to move closer to the mirror. In terms of
the angles shown in figure a/3, the more distant object has
resulted in a smaller angle θo, while the closer
image corresponds to a larger
One angle
increases by the same amount that the other decreases, so
their sum remains constant. These changes are summarized in figure a/4.
The sum θi+θo is a constant. What does this constant represent? Geometrically, we interpret it as double the angle made by the dashed radius line. Optically, it is a measure of the strength of the mirror, i.e., how strongly the mirror focuses light, and so we call it the focal angle, θf,
Suppose, for example, that we wish to use a quick and dirty optical test to determine how strong a particular mirror is. We can lay it on the floor as shown in figure c, and use it to make an image of a lamp mounted on the ceiling overhead, which we assume is very far away compared to the radius of curvature of the mirror, so that the mirror intercepts only a very narrow cone of rays from the lamp. This cone is so narrow that its rays are nearly parallel, and θo is nearly zero. The real image can be observed on a piece of paper. By moving the paper nearer and farther, we can bring the image into focus, at which point we know the paper is located at the image point. Since θo≈ 0, we have θi≈ θf, and we can then determine this mirror's focal angle either by measuring θi directly with a protractor, or indirectly via trigonometry. A strong mirror will bring the rays together to form an image close to the mirror, and these rays will form a blunt-angled cone with a large θi and θf.
◊ The object and image angles are the same; the angle labeled θ in the figure equals both of them. We therefore have θi+θo=θ =θf. Comparing figures b and c, it is indeed plausible that the angles are related by a factor of two.
At this point, we could consider our work to be done. Typically, we know the strength of the mirror, and we want to find the image location for a given object location. Given the mirror's focal angle and the object location, we can determine θo by trigonometry, subtract to find θi=θf-θo, and then do more trig to find the image location.
There is, however, a shortcut that can save us from doing so much work. Figure a/3 shows two right triangles whose legs of length 1 coincide and whose acute angles are θo and θi. These can be related by trigonometry to the object and image distances shown in figure d:
Ever since chapter 2, we've been assuming small angles. For small angles, we can use the small-angle approximation tan x≈ x (for x in radians), giving simply
We likewise define a distance called the focal length, f according to θf=1/f. In figure b, f is the distance from the mirror to the place where the rays cross. We can now reexpress the equation relating the object and image positions as

Figure e summarizes the interpretation of the focal length and focal angle.1
Which form is better, θf=θi+θo or
The angular form has in its favor its
simplicity and its straightforward visual interpretation,
but there are two reasons why we might prefer the second
version. First, the numerical values of the angles depend on
what we mean by “one unit” for the distance shown as 1 in
figure a/1. Second, it is usually easier to measure
distances rather than angles, so the distance form is more
convenient for number crunching. Neither form is superior
overall, and we will often need to use both to solve any
given problem.2
Suppose we need to create a parallel beam of light, as in a searchlight. Where should we place the lightbulb? A parallel beam has zero angle between its rays, so θi=0. To place the lightbulb correctly, however, we need to know a distance, not an angle: the distance do between the bulb and the mirror. The problem involves a mixture of distances and angles, so we need to get everything in terms of one or the other in order to solve it. Since the goal is to find a distance, let's figure out the image distance corresponding to the given angle θi=0. These are related by di=1/θi, so we have di=∞. (Yes, dividing by zero gives infinity. Don't be afraid of infinity. Infinity is a useful problem-solving device.) Solving the distance equation for do, we have



The bulb has to be placed at a distance from the mirror equal to its focal point.
An equation like di=1/θi really doesn't make sense in terms of units. Angles are unitless, since radians aren't really units, so the right-hand side is unitless. We can't have a left-hand side with units of distance if the right-hand side of the same equation is unitless. This is an artifact of my cavalier statement that the conical bundles of rays spread out to a distance of 1 from the axis where they strike the mirror, without specifying the units used to measure this 1. In real life, optometrists define the thing we're calling θi=1/di as the “dioptric strength” of a lens or mirror, and measure it in units of inverse meters (m-1), also known as diopters (1 D=1 m-1).
We have already discussed in the previous chapter how to find the magnification of a virtual image made by a curved mirror. The result is the same for a real image, and we omit the proof, which is very similar. In our new notation, the result is M=di/do. A numerical example is given in section 3.2.
The equation di= can easily produce a negative result, but we have been thinking of di as a distance, and distances can't be negative. A similar problem occurs with θi=θf-θo for θo>θf. What's going on here?
The interpretation of the angular equation is straightforward. As we bring the object closer and closer to the image, θo gets bigger and bigger, and eventually we reach a point where θo=θf and θi=0. This large object angle represents a bundle of rays forming a cone that is very broad, so broad that the mirror can no longer bend them back so that they reconverge on the axis. The image angle θi=0 represents an outgoing bundle of rays that are parallel. The outgoing rays never cross, so this is not a real image, unless we want to be charitable and say that the rays cross at infinity. If we go on bringing the object even closer, we get a virtual image.

f / A graph of the image distance di as a function of the object distance do.
To analyze the distance equation, let's look at a graph of di as a function of do. The branch on the upper right corresponds to the case of a real image. Strictly speaking, this is the only part of the graph that we've proven corresponds to reality, since we never did any geometry for other cases, such as virtual images. As discussed in the previous section, making do bigger causes di to become smaller, and vice-versa.Letting do be less than f is equivalent to
a virtual image is produced on the far side of the mirror.
This is the first example of Wigner's “unreasonable
effectiveness of mathematics” that we have encountered in
optics. Even though our proof depended on the assumption
that the image was real, the equation we derived turns out
to be applicable to virtual images, provided that we either
interpret the positive and negative signs in a certain way,
or else modify the equation to have different positive and negative signs.
self-check: Interpret the three places where, in physically realistic parts of the graph, the graph approaches one of the dashed lines. [This will come more naturally if you have learned the concept of limits in a math class.] (answer in the back of the PDF version of the book)
We can even apply the equation to a flat mirror. As a sphere gets bigger and bigger, its surface is more and more gently curved. The planet Earth is so large, for example, that we cannot even perceive the curvature of its surface. To represent a flat mirror, we let the mirror's radius of curvature, and its focal length, become infinite. Dividing by infinity gives zero, so we have


If we interpret the minus sign as indicating a virtual image on the far side of the mirror from the object, this makes sense.
It turns out that for any of the six possible combinations of real or virtual images formed by converging or diverging lenses or mirrors, we can apply equations of the form


with only a modification of plus or minus signs. There are two possible approaches here. The approach we have been using so far is the more popular approach in American textbooks: leave the equation the same, but attach interpretations to the resulting negative or positive values of the variables. The trouble with this approach is that one is then forced to memorize tables of sign conventions, e.g. that the value of di should be negative when the image is a virtual image formed by a converging mirror. Positive and negative signs also have to be memorized for focal lengths. Ugh! It's highly unlikely that any student has ever retained these lengthy tables in his or her mind for more than five minutes after handing in the final exam in a physics course. Of course one can always look such things up when they are needed, but the effect is to turn the whole thing into an exercise in blindly plugging numbers into formulas.
As you have gathered by now, there is another method which I think is better, and which I'll use throughout the rest of this book. In this method, all distances and angles are positive by definition, and we put in positive and negative signs in the equations depending on the situation. (I thought I was the first to invent this method, but I've been told that this is known as the European sign convention, and that it's fairly common in Europe.) Rather than memorizing these signs, we start with the generic equations


and then determine the signs by a two-step method that depends on ray diagrams. There are really only two signs to determine, not four; the signs in the two equations match up in the way you'd expect. The method is as follows:
1. Use ray diagrams to decide whether θo and θi vary in the same way or in opposite ways. (In other words, decide whether making θo greater results in a greater value of θi or a smaller one.) Based on this, decide whether the two signs in the angle equation are the same or opposite. If the signs are opposite, go on to step 2 to determine which is positive and which is negative.
2. If the signs are opposite, we need to decide which is the positive one and which is the negative. Since the focal angle is never negative, the smaller angle must be the one with a minus sign.
In step 1, many students have trouble drawing the ray diagram correctly. For simplicity, you should always do your diagram for a point on the object that is on the axis of the mirror, and let one of your rays be the one that is emitted along the axis and reflect straight back on itself, as in the figures in section 3.1. As shown in figure a/4 in section 3.1, there are four angles involved: two at the mirror, one at the object (θo), and one at the image (θi). Make sure to draw in the normal to the mirror so that you can see the two angles at the mirror. These two angles are equal, so as you change the object position, they fan out or fan in, like opening or closing a book. Once you've drawn this effect, you should easily be able to tell whether θo and θi change in the same way or in opposite ways.
Although focal lengths are always positive in the method used in this book, you should be aware that diverging mirrors and lenses are assigned negative focal lengths in the other method, so if you see a lens labeled f=-30 cm, you'll know what it means.
◊ As shown in ray diagram g/1, di is less than do. The magnification, M= di/do, will be less than one, i.e., the image is actually reduced rather than magnified.
Apply the method outlined above for determining the plus and minus signs. Step 1: The object is the point on the opposite wall. As an experiment, g/2, move the object closer. I did these drawings using illustration software, but if you were doing them by hand, you'd want to make the scale much larger for greater accuracy. Also, although I split figure g into two separate drawings in order to make them easier to understand, you're less likely to make a mistake if you do them on top of each other.
The two angles at the mirror fan out from the normal. Increasing θo has clearly made θi larger as well. (All four angles got bigger.) There must be a cancellation of the effects of changing the two terms on the right in the same way, and the only way to get such a cancellation is if the two terms in the angle equation have opposite signs:
or
Step 2: Now which is the positive term and which is negative? Since the image angle is bigger than the object angle, the angle equation must be
in order to give a positive result for the focal angle. The signs of the distance equation behave the same way:

Solving for di, we find


The image of the store is reduced by a factor of 2.1/7.0=0.3, i.e., it is smaller by 70%.

h / A diverging mirror in the shape of a sphere. The image is reduced (M<1). This is similar to example 5, but here the image is distorted because the mirror's curve is not shallow.
In the case of a real image, there is a shortcut for step 1, the determination of the signs. In a real image, the rays cross at both the object and the image. We can therefore time-reverse the ray diagram, so that all the rays are coming from the image and reconverging at the object. Object and image swap roles. Due to this time-reversal symmetry, the object and image cannot be treated differently in any of the equations, and they must therefore have the same signs. They are both positive, since they must add up to a positive result.
An imperfection or distortion in an image is called an aberration. An aberration can be produced by a flaw in a lens or mirror, but even with a perfect optical surface some degree of aberration is unavoidable. To see why, consider the mathematical approximation we've been making, which is that the depth of the mirror's curve is small compared to do and di. Since only a flat mirror can satisfy this shallow-mirror condition perfectly, any curved mirror will deviate somewhat from the mathematical behavior we derived by assuming that condition. There are two main types of aberration in curved mirrors, and these also occur with lenses.
(1) An object on the axis of the lens or mirror may be imaged correctly, but off-axis objects may be out of focus or distorted. In a camera, this type of aberration would show up as a fuzziness or warping near the sides of the picture when the center was perfectly focused. An example of this is shown in figure i, and in that particular example, the aberration is not a sign that the equipment was of low quality or wasn't right for the job but rather an inevitable result of trying to flatten a panoramic view; in the limit of a 360-degree panorama, the problem would be similar to the problem of representing the Earth's surface on a flat map, which can't be accomplished without distortion.

i / This photo was taken using a “fish-eye lens,” which gives an extremely large field of view.
(2) The image may be sharp when the object is at certain distances and blurry when it is at other distances. The blurriness occurs because the rays do not all cross at exactly the same point. If we know in advance the distance of the objects with which the mirror or lens will be used, then we can optimize the shape of the optical surface to make in-focus images in that situation. For instance, a spherical mirror will produce a perfect image of an object that is at the center of the sphere, because each ray is reflected directly onto the radius along which it was emitted. For objects at greater distances, however, the focus will be somewhat blurry. In astronomy the objects being used are always at infinity, so a spherical mirror is a poor choice for a telescope. A different shape (a parabola) is better specialized for astronomy.
j / Spherical mirrors are the cheapest to make, but parabolic mirrors are better for making images of objects at infinity. A sphere has equal curvature everywhere, but a parabola has tighter curvature at its center and gentler curvature at the sides.
One way of decreasing aberration is to use a small-diameter mirror or lens, or block most of the light with an opaque screen with a hole in it, so that only light that comes in close to the axis can get through. Either way, we are using a smaller portion of the lens or mirror whose curvature will be more shallow, thereby making the shallow-mirror (or thin-lens) approximation more accurate. Your eye does this by narrowing down the pupil to a smaller hole. In a camera, there is either an automatic or manual adjustment, and narrowing the opening is called “stopping down.” The disadvantage of stopping down is that light is wasted, so the image will be dimmer or a longer exposure must be used.
k / Even though the spherical mirror (solid line) is not well adapted for viewing an object at infinity, we can improve its performance greatly by stopping it down. Now the only part of the mirror being used is the central portion, where its shape is virtually indistinguishable from a parabola (dashed line).
What I would suggest you take away from this discussion for the sake of your general scientific education is simply an understanding of what an aberration is, why it occurs, and how it can be reduced, not detailed facts about specific types of aberrations.
l / The Hubble Space Telescope was placed into orbit with faulty optics in 1990. Its main mirror was supposed to have been nearly parabolic, since it is an astronomical telescope, meant for producing images of objects at infinity. However, contractor Perkin Elmer had delivered a faulty mirror, which produced aberrations. The large photo shows astronauts putting correcting mirrors in place in 1993. The two small photos show images produced by the telescope before and after the fix.
focal length — a property of a lens or mirror, equal to the distance from the lens or mirror to the image it forms of an object that is infinitely far away
f — the focal length
do — the distance of the object from the mirror
di — the distance of the image from the mirror
θf — the focal angle, defined as 1/f
θo — the object angle, defined as 1/do
θi — the image angle, defined as 1/di
f>0 — describes a converging lens or mirror; in this book, all focal lengths are positive, so there is no such implication
f<0 — describes a diverging lens or mirror; in this book, all focal lengths are positive
M<0 — indicates an inverted image; in this book, all magnifications are positive
Every lens or mirror has a property called the focal length, which is defined as the distance from the lens or mirror to the image it forms of an object that is infinitely far away. A stronger lens or mirror has a shorter focal length.
The relationship between the locations of an object and its image formed by a lens or mirror can always be expressed by equations of the form


The choice of plus and minus signs depends on whether we are dealing with a lens or a mirror, whether the lens or mirror is converging or diverging, and whether the image is real or virtual. A method for determining the plus and minus signs is as follows:
Once the correct form of the equation has been determined, the magnification can be found via the equation

This equation expresses the idea that the entire image-world is shrunk consistently in all three dimensions.
2. (solution in the pdf version of the book) Use the method described in the text to derive the equation relating object distance to image distance for the case of a virtual image produced by a converging mirror.
3. (a) Make up a numerical example of a virtual image formed by a converging mirror with a certain focal length, and determine the magnification. (You will need the result of problem 2.) Make sure to choose values of do and f that would actually produce a virtual image, not a real one. Now change the location of the object a little bit and redetermine the magnification, showing that it changes. At my local department store, the cosmetics department sells mirrors advertised as giving a magnification of 5 times. How would you interpret this?
(b) Suppose a Newtonian telescope is being used for astronomical observing. Assume for simplicity that no eyepiece is used, and assume a value for the focal length of the mirror that would be reasonable for an amateur instrument that is to fit in a closet. Is the angular magnification different for objects at different distances? For example, you could consider two planets, one of which is twice as far as the other.
4. (a) Find a case where the magnification of a curved mirror is infinite. Is the angular magnification infinite from any realistic viewing position? (b) Explain why an arbitrarily large magnification can't be achieved by having a sufficiently small value of do.
5. The figure shows a device for constructing a realistic optical illusion. Two mirrors of equal focal length are put against each other with their silvered surfaces facing inward. A small object placed in the bottom of the cavity will have its image projected in the air above. The way it works is that the top mirror produces a virtual image, and the bottom mirror then creates a real image of the virtual image. (a) Show that if the image is to be positioned as shown, at the mouth of the cavity, then the focal length of the mirrors is related to the dimension h via the equation

(b) Restate the equation in terms of a single variable x=h/f, and show that there are two solutions for x. Which solution is physically consistent with the assumptions of the calculation?
6. A concave surface that reflects sound waves can act just like a converging mirror. Suppose that, standing near such a surface, you are able to find a point where you can place your head so that your own whispers are focused back on your head, so that they sound loud to you. Given your distance to the surface, what is the surface's focal length?
9. (a) A converging mirror is being used to create a virtual image. What is the range of possible magnifications? (b) Do the same for the other types of images that can be formed by curved mirrors (both converging and diverging).
10. (solution in the pdf version of the book) (a) A converging mirror with a focal length of 20 cm is used to create an image, using an object at a distance of 10 cm. Is the image real, or is it virtual? (b) How about f=20 cm and do=30 cm? (c) What if it was a diverging mirror with f=20 cm and do=10 cm? (d) A diverging mirror with f=20 cm and do=30 cm?
11. A diverging mirror of focal length f is fixed, and faces down.
An object is dropped from the surface of the mirror, and falls away
from it with acceleration g. The goal of the problem is to find
the maximum velocity of the image.
(a) Describe the motion of the image verbally, and explain why we should
expect there to be a maximum velocity.
(b) Use arguments based on units to determine the form of the solution,
up to an unknown unitless multiplicative constant.
(c) Complete the solution by determining the unitless constant.
∫