Quantum Physics
From Lm
Simple Nature by Benjamin Crowell
This is a version of the book that has been converted to wiki format from the original at lightandmatter.com, where it is available in PDF, HTML, and LaTeX formats. The version at lightandmatter.com is the one that is actively maintained by the author. Please see this page for information about the purpose of this wiki version.
Chapter 13 - Quantum Physics
Quantum Physics

a / In 1980, the continental U.S. got its first taste of active volcanism in recent memory with the eruption of Mount St. Helens.
Contents
|
Rules of Randomness
Given for one instant an intelligence which could comprehend all the forces by which nature is animated and the respective positions of the things which compose it...nothing would be uncertain, and the future as the past would be laid out before its eyes. -- Pierre Simon de Laplace, 1776 The energy produced by the atom is a very poor kind of thing. Anyone who expects a source of power from the transformation of these atoms is talking moonshine. -- Ernest Rutherford, 1933 The Quantum Mechanics is very imposing. But an inner voice tells me that it is still not the final truth. The theory yields much, but it hardly brings us nearer to the secret of the Old One. In any case, I am convinced that He does not play dice. -- Albert Einstein However radical Newton's clockwork universe seemed to his contemporaries, by the early twentieth century it had become a sort of smugly accepted dogma. Luckily for us, this deterministic picture of the universe breaks down at the atomic level. The clearest demonstration that the laws of physics contain elements of randomness is in the behavior of radioactive atoms. Pick two identical atoms of a radioactive isotope, say the naturally occurring uranium 238, and watch them carefully. They will decay at different times, even though there was no difference in their initial behavior. We would be in big trouble if these atoms' behavior was as predictable as expected in the Newtonian world-view, because radioactivity is an important source of heat for our planet. In reality, each atom chooses a random moment at which to release its energy, resulting in a nice steady heating effect. The earth would be a much colder planet if only sunlight heated it and not radioactivity. Probably there would be no volcanoes, and the oceans would never have been liquid. The deep-sea geothermal vents in which life first evolved would never have existed. But there would be an even worse consequence if radioactivity was deterministic: after a few billion years of peace, all the uranium 238 atoms in our planet would presumably pick the same moment to decay. The huge amount of stored nuclear energy, instead of being spread out over eons, would all be released at one instant, blowing our whole planet to Kingdom Come.1 The new version of physics, incorporating certain kinds of randomness, is called quantum physics (for reasons that will become clear later). It represented such a dramatic break with the previous, deterministic tradition that everything that came before is considered “classical,” even the theory of relativity. This chapter is a basic introduction to quantum physics. ”Discussion Question” ◊ I said “Pick two identical atoms of a radioactive isotope.” Are two atoms really identical? If their electrons are orbiting the nucleus, can we distinguish each atom by the particular arrangement of its electrons at some instant in time?
Randomness isn't random.
Einstein's distaste for randomness, and his association of determinism with divinity, goes back to the Enlightenment conception of the universe as a gigantic piece of clockwork that only had to be set in motion initially by the Builder. Many of the founders of quantum mechanics were interested in possible links between physics and Eastern and Western religious and philosophical thought, but every educated person has a different concept of religion and philosophy. Bertrand Russell remarked, “Sir Arthur Eddington deduces religion from the fact that atoms do not obey the laws of mathematics. Sir James Jeans deduces it from the fact that they do.” Russell's witticism, which implies incorrectly that mathematics cannot describe randomness, remind us how important it is not to oversimplify this question of randomness. You should not simply surmise, “Well, it's all random, anything can happen.” For one thing, certain things simply cannot happen, either in classical physics or quantum physics. The conservation laws of mass, energy, momentum, and angular momentum are still valid, so for instance processes that create energy out of nothing are not just unlikely according to quantum physics, they are impossible. A useful analogy can be made with the role of randomness in evolution. Darwin was not the first biologist to suggest that species changed over long periods of time. His two new fundamental ideas were that (1) the changes arose through random genetic variation, and (2) changes that enhanced the organism's ability to survive and reproduce would be preserved, while maladaptive changes would be eliminated by natural selection. Doubters of evolution often consider only the first point, about the randomness of natural variation, but not the second point, about the systematic action of natural selection. They make statements such as, “the development of a complex organism like Homo sapiens via random chance would be like a whirlwind blowing through a junkyard and spontaneously assembling a jumbo jet out of the scrap metal.” The flaw in this type of reasoning is that it ignores the deterministic constraints on the results of random processes. For an atom to violate conservation of energy is no more likely than the conquest of the world by chimpanzees next year. ”Discussion Question” ◊ Economists often behave like wannabe physicists, probably because it seems prestigious to make numerical calculations instead of talking about human relationships and organizations like other social scientists. Their striving to make economics work like Newtonian physics extends to a parallel use of mechanical metaphors, as in the concept of a market's supply and demand acting like a self-adjusting machine, and the idealization of people as economic automatons who consistently strive to maximize their own wealth. What evidence is there for randomness rather than mechanical determinism in economics?

b / Normalization: the probability of picking land plus the probability of picking water adds up to 1.
Calculating randomness
You should also realize that even if something is random, we can still understand it, and we can still calculate probabilities numerically. In other words, physicists are good bookmakers. A good bookmaker can calculate the odds that a horse will win a race much more accurately that an inexperienced one, but nevertheless cannot predict what will happen in any particular race.
Statistical independence
As an illustration of a general technique for calculating odds, suppose you are playing a 25-cent slot machine. Each of the three wheels has one chance in ten of coming up with a cherry. If all three wheels come up cherries, you win $100. Even though the results of any particular trial are random, you can make certain quantitative predictions. First, you can calculate that your odds of winning on any given trial are 1/10×1/10×1/10=1/1000=0.001. Here, I am representing the probabilities as numbers from 0 to 1, which is clearer than statements like “The odds are 999 to 1,” and makes the calculations easier. A probability of 0 represents something impossible, and a probability of 1 represents something that will definitely happen. Also, you can say that any given trial is equally likely to result in a win, and it doesn't matter whether you have won or lost in prior games. Mathematically, we say that each trial is statistically independent, or that separate games are uncorrelated. Most gamblers are mistakenly convinced that, to the contrary, games of chance are correlated. If they have been playing a slot machine all day, they are convinced that it is “getting ready to pay,” and they do not want anyone else playing the machine and “using up” the jackpot that they “have coming.” In other words, they are claiming that a series of trials at the slot machine is negatively correlated, that losing now makes you more likely to win later. Craps players claim that you should go to a table where the person rolling the dice is “hot,” because she is likely to keep on rolling good numbers. Craps players, then, believe that rolls of the dice are positively correlated, that winning now makes you more likely to win later. My method of calculating the probability of winning on the slot machine was an example of the following important rule for calculations based on independent probabilities:
The Law of Independent Probabilities
If the probability of one event happening is PA, and the probability of a second statistically independent event happening is PB, then the probability that they will both occur is the product of the probabilities, PAPB. If there are more than two events involved, you simply keep on multiplying. Note that this only applies to independent probabilities. For instance, if you have a nickel and a dime in your pocket, and you randomly pull one out, there is a probability of 0.5 that it will be the nickel. If you then replace the coin and again pull one out randomly, there is again a probability of 0.5 of coming up with the nickel, because the probabilities are independent. Thus, there is a probability of 0.25 that you will get the nickel both times. Suppose instead that you do not replace the first coin before pulling out the second one. Then you are bound to pull out the other coin the second time, and there is no way you could pull the nickel out twice. In this situation, the two trials are not independent, because the result of the first trial has an effect on the second trial. The law of independent probabilities does not apply, and the probability of getting the nickel twice is zero, not 0.25. Experiments have shown that in the case of radioactive decay, the probability that any nucleus will decay during a given time interval is unaffected by what is happening to the other nuclei, and is also unrelated to how long it has gone without decaying. The first observation makes sense, because nuclei are isolated from each other at the centers of their respective atoms, and therefore have no physical way of influencing each other. The second fact is also reasonable, since all atoms are identical. Suppose we wanted to believe that certain atoms were “extra tough,” as demonstrated by their history of going an unusually long time without decaying. Those atoms would have to be different in some physical way, but nobody has ever succeeded in detecting differences among atoms. There is no way for an atom to be changed by the experiences it has in its lifetime.
Addition of index{probabilities!addition of}probabilities
The law of independent probabilities tells us to use multiplication to calculate the probability that both A and B will happen, assuming the probabilities are independent. What about the probability of an “or” rather than an “and”? If two events A and B are mutually exclusive, then the probability of one or the other occurring is the sum PA+PB. For instance, a bowler might have a 30% chance of getting a strike (knocking down all ten pins) and a 20% chance of knocking down nine of them. The bowler's chance of knocking down either nine pins or ten pins is therefore 50%. It does not make sense to add probabilities of things that are not mutually exclusive, i.e., that could both happen. Say I have a 90% chance of eating lunch on any given day, and a 90% chance of eating dinner. The probability that I will eat either lunch or dinner is not 180%.
Normalization
If I spin a globe and randomly pick a point on it, I have
about a 70% chance of picking a point that's in an ocean
and a 30% chance of picking a point on land. The probability
of picking either water or land is
. Water
and land are mutually exclusive, and there are no other
possibilities, so the probabilities had to add up to 100%.
It works the same if there are more than two possibilities
--- if you can classify all possible outcomes into a list
of mutually exclusive results, then all the probabilities
have to add up to 1, or 100%. This property of probabilities
is known as normalization.
Averages
Another way of dealing with randomness is to take averages. The casino knows that in the long run, the number of times you win will approximately equal the number of times you play multiplied by the probability of winning. In the slot-machine game described on page 773, where the probability of winning is 0.001, if you spend a week playing, and pay $2500 to play 10,000 times, you are likely to win about 10 times (10,000×0.001=10), and collect $1000. On the average, the casino will make a profit of $1500 from you. This is an example of the following rule.
Rule for Calculating Averages
If you conduct N identical, statistically independent trials, and the probability of success in each trial is P, then on the average, the total number of successful trials will be NP. If N is large enough, the relative error in this estimate will become small. The statement that the rule for calculating averages gets more and more accurate for larger and larger N(known popularly as the “law of averages”) often provides a correspondence principle that connects classical and quantum physics. For instance, the amount of power produced by a nuclear power plant is not random at any detectable level, because the number of atoms in the reactor is so large. In general, random behavior at the atomic level tends to average out when we consider large numbers of atoms, which is why physics seemed deterministic before physicists learned techniques for studying atoms individually. We can achieve great precision with averages in quantum physics because we can use identical atoms to reproduce exactly the same situation many times. If we were betting on horses or dice, we would be much more limited in our precision. After a thousand races, the horse would be ready to retire. After a million rolls, the dice would be worn out. self-check: Which of the following things must be independent, which could be independent, and which definitely are not independent? (1) the probability of successfully making two free-throws in a row in basketball; (2) the probability that it will rain in London tomorrow and the probability that it will rain on the same day in a certain city in a distant galaxy; (3) your probability of dying today and of dying tomorrow. (answer in the back of the PDF version of the book) ”Discussion Questions” ◊ Newtonian physics is an essentially perfect approximation for describing the motion of a pair of dice. If Newtonian physics is deterministic, why do we consider the result of rolling dice to be random? ◊ Why isn't it valid to define randomness by saying that randomness is when all the outcomes are equally likely? ◊ The sequence of digits 121212121212121212 seems clearly nonrandom, and 41592653589793 seems random. The latter sequence, however, is the decimal form of pi, starting with the third digit. There is a story about the Indian mathematician Ramanujan, a self-taught prodigy, that a friend came to visit him in a cab, and remarked that the number of the cab, 1729, seemed relatively uninteresting. Ramanujan replied that on the contrary, it was very interesting because it was the smallest number that could be represented in two different ways as the sum of two cubes. The Argentine author Jorge Luis Borges wrote a short story called “The Library of Babel,” in which he imagined a library containing every book that could possibly be written using the letters of the alphabet. It would include a book containing only the repeated letter “a;” all the ancient Greek tragedies known today, all the lost Greek tragedies, and millions of Greek tragedies that were never actually written; your own life story, and various incorrect versions of your own life story; and countless anthologies containing a short story called “The Library of Babel.” Of course, if you picked a book from the shelves of the library, it would almost certainly look like a nonsensical sequence of letters and punctuation, but it's always possible that the seemingly meaningless book would be a science-fiction screenplay written in the language of a Neanderthal tribe, or the lyrics to a set of incomparably beautiful love songs written in a language that never existed. In view of these examples, what does it really mean to say that something is random?

d / Probability distribution for the result of rolling a single die.

e / Rolling two dice and adding them up.

f / A probability distribution for height of human adults (not real data).

h / The average of a probability distribution.

i / The full width at half maximum (FWHM) of a probability distribution.
Probability distributions
So far we've discussed random processes having only two possible outcomes: yes or no, win or lose, on or off. More generally, a random process could have a result that is a number. Some processes yield integers, as when you roll a die and get a result from one to six, but some are not restricted to whole numbers, for example the number of seconds that a uranium-238 atom will exist before undergoing radioactive decay. Consider a throw of a die. If the die is “honest,” then we expect all six values to be equally likely. Since all six probabilities must add up to 1, then probability of any particular value coming up must be 1/6. We can summarize this in a graph, d. Areas under the curve can be interpreted as total probabilities. For instance, the area under the curve from 1 to 3 is 1/6+1/6+1/6=1/2, so the probability of getting a result from 1 to 3 is 1/2. The function shown on the graph is called the probability distribution. Figure e shows the probabilities of various results obtained by rolling two dice and adding them together, as in the game of craps. The probabilities are not all the same. There is a small probability of getting a two, for example, because there is only one way to do it, by rolling a one and then another one. The probability of rolling a seven is high because there are six different ways to do it: 1+6, 2+5, etc. If the number of possible outcomes is large but finite, for example the number of hairs on a dog, the graph would start to look like a smooth curve rather than a ziggurat. What about probability distributions for random numbers that are not integers? We can no longer make a graph with probability on the y axis, because the probability of getting a given exact number is typically zero. For instance, there is zero probability that a radioactive atom will last for exactly 3 seconds, since there is are infinitely many possible results that are close to 3 but not exactly three: 2.999999999999999996876876587658465436, for example. It doesn't usually make sense, therefore, to talk about the probability of a single numerical result, but it does make sense to talk about the probability of a certain range of results. For instance, the probability that an atom will last more than 3 and less than 4 seconds is a perfectly reasonable thing to discuss. We can still summarize the probability information on a graph, and we can still interpret areas under the curve as probabilities. But the y axis can no longer be a unitless probability scale. In radioactive decay, for example, we want the x axis to have units of time, and we want areas under the curve to be unitless probabilities. The area of a single square on the graph paper is then Failed to parse (unknown function\begin): \begin{gather} \text{(unitless area of a square)} \\ = \text{(width of square with time units)}\\ \times \text{(height of square)} \qquad . \end{gather}
If the units are to cancel out, then the height of the square must evidently be a quantity with units of inverse time. In other words, the y axis of the graph is to be interpreted as probability per unit time, not probability. Figure f shows another example, a probability distribution for people's height. This kind of bell-shaped curve is quite common. self-check: Compare the number of people with heights in the range of 130-135 cm to the number in the range 135-140. (answer in the back of the PDF version of the book)
Example 1: Looking for tall basketball players
◊ A certain country with a large population wants to find very tall people to be on its Olympic basketball team and strike a blow against western imperialism. Out of a pool of 108 people who are the right age and gender, how many are they likely to find who are over 225 cm (7 feet 4 inches) in height? Figure g gives a close-up of the “tail” of the distribution shown previously in figure f. ◊ The shaded area under the curve represents the probability that a given person is tall enough. Each rectangle represents a probability of 0.2×10-7 cm-1 × 1 cm=2×10-8. There are about 35 rectangles covered by the shaded area, so the probability of having a height greater than 225 cm is 7×10-7 , or just under one in a million. Using the rule for calculating averages, the average, or expected number of people this tall is (108)×(7×10-7)=70.
Average and width of a probability distribution
If the next Martian you meet asks you, “How tall is an adult human?,” you will probably reply with a statement about the average human height, such as “Oh, about 5 feet 6 inches.” If you wanted to explain a little more, you could say, “But that's only an average. Most people are somewhere between 5 feet and 6 feet tall.” Without bothering to draw the relevant bell curve for your new extraterrestrial acquaintance, you've summarized the relevant information by giving an average and a typical range of variation. The average of a probability distribution can be defined geometrically as the horizontal position at which it could be balanced if it was constructed out of cardboard. A convenient numerical measure of the amount of variation about the average, or amount of uncertainty, is the full width at half maximum, or FWHM, shown in figure i. A great deal more could be said about this topic, and indeed an introductory statistics course could spend months on ways of defining the center and width of a distribution. Rather than force-feeding you on mathematical detail or techniques for calculating these things, it is perhaps more relevant to point out simply that there are various ways of defining them, and to inoculate you against the misuse of certain definitions. The average is not the only possible way to say what is a typical value for a quantity that can vary randomly; another possible definition is the median, defined as the value that is exceeded with 50% probability. When discussing incomes of people living in a certain town, the average could be very misleading, since it can be affected massively if a single resident of the town is Bill Gates. Nor is the FWHM the only possible way of stating the amount of random variation; another possible way of measuring it is the standard deviation (defined as the square root of the average squared deviation from the average value).
Exponential decay and half-life
Half-life
Most people know that radioactivity “lasts a certain amount
of time,” but that simple statement leaves out a lot. As an
example, consider the following medical procedure used to
diagnose thyroid function. A very small quantity of the
isotope 131I, produced in a nuclear reactor, is fed to
or injected into the patient. The body's biochemical systems
treat this artificial, radioactive isotope exactly the same
as 127I, which is the only naturally occurring type.
(Nutritionally, iodine is a necessary trace element. Iodine
taken into the body is partly excreted, but the rest becomes
concentrated in the thyroid gland. Iodized salt has had
iodine added to it to prevent the nutritional deficiency
known as goiters, in which the iodine-starved
thyroid becomes swollen.) As the 131I undergoes beta
decay, it emits electrons, neutrinos, and gamma rays. The
gamma rays can be measured by a detector passed over the
patient's body. As the radioactive iodine becomes concentrated
in the thyroid, the amount of gamma radiation coming from
the thyroid becomes greater, and that emitted by the rest of
the body is reduced. The rate at which the iodine concentrates
in the thyroid tells the doctor about the health of the thyroid.
If you ever undergo this procedure, someone will presumably
explain a little about radioactivity to you, to allay your
fears that you will turn into the Incredible Hulk, or that
your next child will have an unusual number of limbs. Since
iodine stays in your thyroid for a long time once it gets
there, one thing you'll want to know is whether your thyroid
is going to become radioactive forever. They may just tell
you that the radioactivity “only lasts a certain amount of
time,” but we can now carry out a quantitative derivation
of how the radioactivity really will die out.
Let Psurv(t) be the probability that an iodine atom
will survive without decaying for a period of at least t.
It has been experimentally measured that half all 131I
atoms decay in 8 hours, so we have
Psurv(8 hr) = 0.5 .
Now using the law of independent probabilities, the
probability of surviving for 16 hours equals the probability
of surviving for the first 8 hours multiplied by the
probability of surviving for the second 8 hours,
Similarly we have
Generalizing from this pattern, the probability of surviving
for any time t that is a multiple of 8 hours is
Psurv(t) = 0.5t/8 hr .
We now know how to find the probability of survival at
intervals of 8 hours, but what about the points in time in
between? What would be the probability of surviving for 4
hours? Well, using the law of independent probabilities again, we have
Psurv(8 hr) = Psurv(4 hr) × Psurv(4 hr) ,
which can be rearranged to give
This is exactly what we would have found simply by plugging
in Psurv(t)=0.5t/8 hr and ignoring the restriction to
multiples of 8 hours. Since 8 hours is the amount of time
required for half of the atoms to decay, it is known as the
half-life, written t1/2. The general rule is as follows:
Exponential Decay Equation
Psurv(t) = 0.5t/t1/2 Using the rule for calculating averages, we can also find the number of atoms, N(t), remaining in a sample at time Failed to parse (lexing error): t\/
N(t) = N(0) × 0.5t/t1/2 Both of these equations have graphs that look like dying-out exponentials, as in the example below.
Example 2: Radioactive contamination at Chernobyl
◊ One of the most dangerous radioactive isotopes
released by the Chernobyl disaster in 1986 was 90Sr,
whose half-life is 28 years. (a) How long will it be before
the contamination is reduced to one tenth of its original
level? (b) If a total of 1027 atoms was released,
about how long would it be before not a single atom was left?
◊ (a) We want to know the amount of time that a
90Sr nucleus has a probability of 0.1 of surviving.
Starting with the exponential decay formula,
Psurv = 0.5t/t1/2 ,
we want to solve for t. Taking natural logarithms of both sides,
so
Plugging in P=0.1 and t1/2=28 years, we get t=93 years.
(b) This is just like the first part, but P=10-27 . The
result is about 2500 years.

j / Calibration of the 14C dating method using tree rings and artifacts whose ages were known from other methods. Redrawn from Emilio Segrè, Nuclei and Particles, 1965.
Example 3: 14C Dating
text{C}Dating}^{12}text{C}, but not quite.^{14}text{C}, with a half-life of 5600 years, is^{14}text{C}in the atmosphere remains constant, at1.3\times10^{-12}. Living plants and animals take in both^{12}text{C}and^{14}text{C}from the atmosphere and incorporate^{14}text{C}gradually falls off as it undergoes^{14}text{C}$ in various objects. Similar methods, using longer-lived isotopes, provided the first firm proof that the earth was billions of years old, not a few thousand as some had claimed on religious grounds.
Rate of decay
If you want to find how many radioactive decays occur within
a time interval lasting from time t to time t+Δ t,
the most straightforward approach is to calculate it like this:
A problem arises when Δ t is small compared to t1/2.
For instance, suppose you have a hunk of 1022 atoms of
235U, with a half-life of 700 million years, which is
2.2×1016 s. You want to know how many decays will
occur in Δ t=1 s. Since we're specifying the
current number of atoms, t=0. As you plug in to the
formula above on your calculator, the quantity 0.5Δ t/t1/2 comes out on
your calculator to equal one, so the final result is zero.
That's incorrect, though. In reality, 0.5Δ t/t1/2 should equal
0.999999999999999968, but your calculator only gives eight
digits of precision, so it rounded it off to one. In other
words, the probability that a 235U atom will survive
for 1 s is very close to one, but not equal to one. The
number of decays in one second is therefore 3.2×105, not zero.
Well, my calculator only does eight digits of precision,
just like yours, so how did I know the right answer? The way
to do it is to use the following approximation:
(The symbol << means “is much less than.”) Using it, we
can find the following approximation:
Failed to parse (lexing error): \begin{align} \text{(number of }&\text{decays between $t$ and $t+\Delta t$) } \\ &= N(0)\left[1-0.5^{\Delta t/t_{1/2}}\right]0.5^{t/t_{1/2}} \\ &\approx N(0)\left[1-\left(1+\frac{\Delta t}{t_{1/2}}\ln 0.5\right)\right]0.5^{t/t_{1/2}} \\ &\approx (\ln 2)N(0)0.5^{t/t_{1/2}}\frac{\Delta t}{t_{1/2}} \end{align}
This also gives us a way to calculate the rate of decay, i.e., the number of decays per unit time. Dividing by Δ t on both sides, we have Failed to parse (lexing error): \text{(decays per unit time)} \approx \frac{(\ln 2)N(0)}{t_{1/2}}0.5^{t/t_{1/2}} \quad , \quad \text{if $\Delta t \ll t_{1/2}$}
Example 4: The hot potato
◊ A nuclear physicist with a demented sense of humor tosses you a cigar box, yelling “hot potato.” The label on the box says “contains 1020 atoms of 17F, half-life of 66 s, produced today in our reactor at 1 p.m.” It takes you two seconds to read the label, after which you toss it behind some lead bricks and run away. The time is 1:40 p.m. Will you die? ◊ The time elapsed since the radioactive fluorine was produced in the reactor was 40 minutes, or 2400 s. The number of elapsed half-lives is therefore t/t1/2= 36. The initial number of atoms was N(0)=1020 . The number of decays per second is now about 107 s-1, so it produced about 2×107 high-energy electrons while you held it in your hands. Although twenty million electrons sounds like a lot, it is not really enough to be dangerous. By the way, none of the equations we've derived so far was the actual probability distribution for the time at which a particular radioactive atom will decay. That probability distribution would be found by substituting N(0)=1 into the equation for the rate of decay. If the sheer number of equations is starting to seem formidable, let's pause and think for a second. The simple equation for Psurv is something you can derive easily from the law of independent probabilities any time you need it. From that, you can quickly find the exact equation for the rate of decay. The derivation of the approximate equations for Δ t<< t is a little hairier, but note that except for the factors of ln 2, everything in these equations can be found simply from considerations of logic and units. For instance, a longer half-life will obviously lead to a slower rate of decays, so it makes sense that we divide by it. As for the ln 2 factors, they are exactly the kind of thing that one looks up in a book when one needs to know them. ”Discussion Questions” ◊ In the medical procedure involving 131I, why is it the gamma rays that are detected, not the electrons or neutrinos that are also emitted? ◊ For 1 s, Fred holds in his hands 1 kg of radioactive stuff with a half-life of 1000 years. Ginger holds 1 kg of a different substance, with a half-life of 1 min, for the same amount of time. Did they place themselves in equal danger, or not? ◊ How would you interpret it if you calculated N(t), and found it was less than one? ◊ Does the half-life depend on how much of the substance you have? Does the expected time until the sample decays completely depend on how much of the substance you have?
Applications of calculus
The area under the probability distribution is of course an integral. If we call the random number x and the probability distribution D(x), then the probability that x lies in a certain range is given by Failed to parse (lexing error): \text{(probability of $a\le x \le b$)}=\int_a^b D(x) d x \qquad .
What about averages? If x had a finite number of equally probable values, we would simply add them up and divide by how many we had. If they weren't equally likely, we'd make the weighted average x1P1+x2P2+... But we need to generalize this to a variable x that can take on any of a continuum of values. The continuous version of a sum is an integral, so the average is Failed to parse (lexing error): \text{(average value of $x$)} = \int x D(x) d x \qquad ,
where the integral is over all possible values of x.
Example 5: Probability distribution for radioactive decay
Here is a rigorous justification for the statement in subsection 13.1.4 that the probability distribution for radioactive decay is found by substituting N(0)=1 into the equation for the rate of decay. We know that the probability distribution must be of the form D(t) = k 0.5t/t1/2 , where k is a constant that we need to determine. The atom is guaranteed to decay eventually, so normalization gives us Failed to parse (lexing error): \begin{align} \text{(probability of $0\le t < \infty$)} &= 1 \\ &= \int_0^\infty D(t) d t \qquad . \end{align}
The integral is most easily evaluated by converting the
function into an exponential with e as the base
which gives an integral of the familiar form
. We thus have
which gives the desired result:
Example 6: Average lifetime
You might think that the half-life would also be the average lifetime of an atom, since half the atoms' lives are shorter and half longer. But the half whose lives are longer include some that survive for many half-lives, and these rare long-lived atoms skew the average. We can calculate the average lifetime as follows: Failed to parse (lexing error): (\text{average lifetime}) = \int_0^\infty t\: D(t)\: d t
Using the convenient base-e form again, we have
This integral is of a form that can either be attacked with
integration by parts or by looking it up in a table. The
result is
,
and the first term can be ignored for our
purposes because it equals zero at both limits of integration. We end up with
Failed to parse (lexing error): \begin{align} \text{(average lifetime)} &= \frac{\ln 2}{t_{1/2}}\left(\frac{t_{1/2}}{\ln 2}\right)^2 \\ &= \frac{t_{1/2}}{\ln 2} \\ &= 1.443 \: t_{1/2} \qquad , \end{align}
which is, as expected, longer than one half-life.

k / In recent decades, a huge hole in the ozone layer has spread out from Antarctica. Left: November 1978. Right: November 1992
Light As a Particle
The only thing that interferes with my learning is my education. -- Albert Einstein Radioactivity is random, but do the laws of physics exhibit randomness in other contexts besides radioactivity? Yes. Radioactive decay was just a good playpen to get us started with concepts of randomness, because all atoms of a given isotope are identical. By stocking the playpen with an unlimited supply of identical atom-toys, nature helped us to realize that their future behavior could be different regardless of their original identicality. We are now ready to leave the playpen, and see how randomness fits into the structure of physics at the most fundamental level. The laws of physics describe light and matter, and the quantum revolution rewrote both descriptions. Radioactivity was a good example of matter's behaving in a way that was inconsistent with classical physics, but if we want to get under the hood and understand how nonclassical things happen, it will be easier to focus on light rather than matter. A radioactive atom such as uranium-235 is after all an extremely complex system, consisting of 92 protons, 143 neutrons, and 92 electrons. Light, however, can be a simple sine wave. However successful the classical wave theory of light had been --- allowing the creation of radio and radar, for example --- it still failed to describe many important phenomena. An example that is currently of great interest is the way the ozone layer protects us from the dangerous short-wavelength ultraviolet part of the sun's spectrum. In the classical description, light is a wave. When a wave passes into and back out of a medium, its frequency is unchanged, and although its wavelength is altered while it is in the medium, it returns to its original value when the wave reemerges. Luckily for us, this is not at all what ultraviolet light does when it passes through the ozone layer, or the layer would offer no protection at all!

b / A wave is partially absorbed.

c / A stream of particles is partially absorbed.

d / Einstein and Seurat: twins separated at birth? Seine Grande Jatte by Georges Seurat (19th century).
Evidence for light as a particle
For a long time, physicists tried to explain away the
problems with the classical theory of light as arising from
an imperfect understanding of atoms and the interaction of
light with individual atoms and molecules. The ozone
paradox, for example, could have been attributed to the
incorrect assumption that one could think of the ozone layer
as a smooth, continuous substance, when in reality it was
made of individual ozone molecules. It wasn't until 1905
that Albert Einstein threw down the gauntlet, proposing that
the problem had nothing to do with the details of light's
interaction with atoms and everything to do with the
fundamental nature of light itself.

a / Digital camera images of dimmer and dimmer sources of light. The dots are records of individual photons.
In those days the data were sketchy, the ideas vague, and the experiments difficult to interpret; it took a genius like Einstein to cut through the thicket of confusion and find a simple solution. Today, however, we can get right to the heart of the matter with a piece of ordinary consumer electronics, the digital camera. Instead of film, a digital camera has a computer chip with its surface divided up into a grid of light-sensitive squares, called “pixels.” Compared to a grain of the silver compound used to make regular photographic film, a digital camera pixel is activated by an amount of light energy orders of magnitude smaller. We can learn something new about light by using a digital camera to detect smaller and smaller amounts of light, as shown in figure a. Figure a/1 is fake, but a/2 and a/3 are real digital-camera images made by Prof. Lyman Page of Princeton University as a classroom demonstration. Figure a/1 is what we would see if we used the digital camera to take a picture of a fairly dim source of light. In figures a/2 and a/3, the intensity of the light was drastically reduced by inserting semitransparent absorbers like the tinted plastic used in sunglasses. Going from a/1 to a/2 to a/3, more and more light energy is being thrown away by the absorbers. The results are drastically different from what we would expect based on the wave theory of light. If light was a wave and nothing but a wave, b, then the absorbers would simply cut down the wave's amplitude across the whole wavefront. The digital camera's entire chip would be illuminated uniformly, and weakening the wave with an absorber would just mean that every pixel would take a long time to soak up enough energy to register a signal. But figures a/2 and a/3 show that some pixels take strong hits while others pick up no energy at all. Instead of the wave picture, the image that is naturally evoked by the data is something more like a hail of bullets from a machine gun, c. Each “bullet” of light apparently carries only a tiny amount of energy, which is why detecting them individually requires a sensitive digital camera rather than an eye or a piece of film. Although Einstein was interpreting different observations, this is the conclusion he reached in his 1905 paper: that the pure wave theory of light is an oversimplification, and that the energy of a beam of light comes in finite chunks rather than being spread smoothly throughout a region of space. We now think of these chunks as particles of light, and call them “photons,” although Einstein avoided the word “particle,” and the word “photon” was invented later. Regardless of words, the trouble was that waves and particles seemed like inconsistent categories. The reaction to Einstein's paper could be kindly described as vigorously skeptical. Even twenty years later, Einstein wrote, “There are therefore now two theories of light, both indispensable, and --- as one must admit today despite twenty years of tremendous effort on the part of theoretical physicists --- without any logical connection.” In the remainder of this chapter we will learn how the seeming paradox was eventually resolved. ”Discussion Questions” ◊ Suppose someone rebuts the digital camera data in figure a, claiming that the random pattern of dots occurs not because of anything fundamental about the nature of light but simply because the camera's pixels are not all exactly the same --- some are just more sensitive than others. How could we test this interpretation? ◊ Discuss how the correspondence principle applies to the observations and concepts discussed in this section.

e / Apparatus for observing the photoelectric effect. A beam of light strikes a capacitor plate inside a vacuum tube, and electrons are ejected (black arrows).

f / The hamster in her hamster ball is like an electron emerging from the metal (tiled kitchen floor) into the surrounding vacuum (wood floor). The wood floor is higher than the tiled floor, so as she rolls up the step, the hamster will lose a certain amount of kinetic energy, analogous to Es. If her kinetic energy is too small, she won't even make it up the step.

g / A different way of studying the photoelectric effect.

h / The quantity Es+eΔ V indicates the energy of one photon. It is found to be proportional to the frequency of the light.
How much light is one photon?
The photoelectric effect
We have seen evidence that light energy comes in little chunks, so the next question to be asked is naturally how much energy is in one chunk. The most straightforward experimental avenue for addressing this question is a phenomenon known as the photoelectric effect. The photoelectric effect occurs when a photon strikes the surface of a solid object and knocks out an electron. It occurs continually all around you. It is happening right now at the surface of your skin and on the paper or computer screen from which you are reading these words. It does not ordinarily lead to any observable electrical effect, however, because on the average free electrons are wandering back in just as frequently as they are being ejected. (If an object did somehow lose a significant number of electrons, its growing net positive charge would begin attracting the electrons back more and more strongly.) Figure e shows a practical method for detecting the photoelectric effect. Two very clean parallel metal plates (the electrodes of a capacitor) are sealed inside a vacuum tube, and only one plate is exposed to light. Because there is a good vacuum between the plates, any ejected electron that happens to be headed in the right direction will almost certainly reach the other capacitor plate without colliding with any air molecules. The illuminated (bottom) plate is left with a net positive charge, and the unilluminated (top) plate acquires a negative charge from the electrons deposited on it. There is thus an electric field between the plates, and it is because of this field that the electrons' paths are curved, as shown in the diagram. However, since vacuum is a good insulator, any electrons that reach the top plate are prevented from responding to the electrical attraction by jumping back across the gap. Instead they are forced to make their way around the circuit, passing through an ammeter. The ammeter allows a measurement of the strength of the photoelectric effect.
An unexpected dependence on frequency
The photoelectric effect was discovered serendipitously by Heinrich Hertz in 1887, as he was experimenting with radio waves. He was not particularly interested in the phenomenon, but he did notice that the effect was produced strongly by ultraviolet light and more weakly by lower frequencies. Light whose frequency was lower than a certain critical value did not eject any electrons at all. (In fact this was all prior to Thomson's discovery of the electron, so Hertz would not have described the effect in terms of electrons --- we are discussing everything with the benefit of hindsight.) This dependence on frequency didn't make any sense in terms of the classical wave theory of light. A light wave consists of electric and magnetic fields. The stronger the fields, i.e., the greater the wave's amplitude, the greater the forces that would be exerted on electrons that found themselves bathed in the light. It should have been amplitude (brightness) that was relevant, not frequency. The dependence on frequency not only proves that the wave model of light needs modifying, but with the proper interpretation it allows us to determine how much energy is in one photon, and it also leads to a connection between the wave and particle models that we need in order to reconcile them.
To make any progress, we need to consider the physical process by which a photon would eject an electron from the metal electrode. A metal contains electrons that are free to move around. Ordinarily, in the interior of the metal, such an electron feels attractive forces from atoms in every direction around it. The forces cancel out. But if the electron happens to find itself at the surface of the metal, the attraction from the interior side is not balanced out by any attraction from outside. In popping out through the surface the electron therefore loses some amount of energy Es, which depends on the type of metal used. Suppose a photon strikes an electron, annihilating itself and giving up all its energy to the electron. (We now know that this is what always happens in the photoelectric effect, although it had not yet been established in 1905 whether or not the photon was completely annihilated.) The electron will (1) lose kinetic energy through collisions with other electrons as it plows through the metal on its way to the surface; (2) lose an amount of kinetic energy equal to Es as it emerges through the surface; and (3) lose more energy on its way across the gap between the plates, due to the electric field between the plates. Even if the electron happens to be right at the surface of the metal when it absorbs the photon, and even if the electric field between the plates has not yet built up very much, Es is the bare minimum amount of energy that it must receive from the photon if it is to contribute to a measurable current. The reason for using very clean electrodes is to minimize Es and make it have a definite value characteristic of the metal surface, not a mixture of values due to the various types of dirt and crud that are present in tiny amounts on all surfaces in everyday life. We can now interpret the frequency dependence of the photoelectric effect in a simple way: apparently the amount of energy possessed by a photon is related to its frequency. A low-frequency red or infrared photon has an energy less than Es, so a beam of them will not produce any current. A high-frequency blue or violet photon, on the other hand, packs enough of a punch to allow an electron to make it to the other plate. At frequencies higher than the minimum, the photoelectric current continues to increase with the frequency of the light because of effects (1) and (3).
Numerical relationship between energy and frequency
Prompted by Einstein's photon paper, Robert Millikan (whom we first encountered in chapter 8) figured out how to use the photoelectric effect to probe precisely the link between frequency and photon energy. Rather than going into the historical details of Millikan's actual experiments (a lengthy experimental program that occupied a large part of his professional career) we will describe a simple version, shown in figure g, that is used sometimes in college laboratory courses.2 The idea is simply to illuminate one plate of the vacuum tube with light of a single wavelength and monitor the voltage difference between the two plates as they charge up. Since the resistance of a voltmeter is very high (much higher than the resistance of an ammeter), we can assume to a good approximation that electrons reaching the top plate are stuck there permanently, so the voltage will keep on increasing for as long as electrons are making it across the vacuum tube. At a moment when the voltage difference has a reached a value ΔV, the minimum energy required by an electron to make it out of the bottom plate and across the gap to the other plate is Es+eΔV. As Δ V increases, we eventually reach a point at which Es+eΔ V equals the energy of one photon. No more electrons can cross the gap, and the reading on the voltmeter stops rising. The quantity Es+eΔ V now tells us the energy of one photon. If we determine this energy for a variety of wavelengths, h, we find the following simple relationship between the energy of a photon and the frequency of the light: E = hf , where h is a constant with the value 6.63×10-34 J⋅s. Note how the equation brings the wave and particle models of light under the same roof: the left side is the energy of one particle of light, while the right side is the frequency of the same light, interpreted as a wave. The constant h is known as Planck's constant, for historical reasons explained in the footnote beginning on the preceding page. self-check: How would you extract h from the graph in figure h? What if you didn't even know Es in advance, and could only graph eΔ V versus f? (answer in the back of the PDF version of the book) Since the energy of a photon is hf, a beam of light can only have energies of hf, 2hf, 3hf, etc. Its energy is quantized --- there is no such thing as a fraction of a photon. Quantum physics gets its name from the fact that it quantizes quantities like energy, momentum, and angular momentum that had previously been thought to be smooth, continuous and infinitely divisible.
Example 7: Number of photons emitted by a lightbulb per second
◊ Roughly how many photons are emitted by a 100-W
lightbulb in 1 second?
◊ People tend to remember wavelengths rather than
frequencies for visible light. The bulb emits photons with a
range of frequencies and wavelengths, but let's take 600 nm
as a typical wavelength for purposes of estimation. The
energy of a single photon is
A power of 100 W means 100 joules per second, so the
number of photons is
Example 8: Momentum of a photon
◊ According to the theory of relativity, the
momentum of a beam of light is given by p=E/c (see homework
problem 16 on page 409).
Apply this to find the momentum of a
single photon in terms of its frequency, and in terms of its wavelength.
◊ Combining the equations p=E/c and E=hf, we find
To reexpress this in terms of wavelength, we use c=fλ:
The second form turns out to be simpler.
”Discussion Questions”
◊
The photoelectric effect only ever ejects a
very tiny percentage of the electrons available
near the surface of an object.
How well does this agree with the wave
model of light, and how well with the particle model?
Consider the two different distance scales involved:
the wavelength of the light, and the size of an atom,
which is on the order of 10-10 or 10-9 m.
◊
What is the significance of the fact that Planck's
constant is numerically very small? How would our everyday
experience of light be different if it was not so small?
◊
How would the experiments described above be affected if
a single electron was likely to get hit by more than one photon?
◊
Draw some representative trajectories of electrons for
Δ V=0, Δ V less than the maximum value, and
Δ V greater than the maximum value.
◊
Explain based on the photon theory of light why
ultraviolet light would be more likely than visible or
infrared light to cause cancer by damaging DNA molecules.
How does this relate to discussion question C?
◊
Does E=hf imply that a photon changes its energy
when it passes from one transparent material into another
substance with a different index of refraction?

j / Bullets pass through a double slit.

k / A water wave passes through a double slit.

l / A single photon can go through both slits.
Wave-particle duality
How can light be both a particle and a wave? We are now ready to resolve this seeming contradiction. Often in science when something seems paradoxical, it's because we (1) don't define our terms carefully, or (2) don't test our ideas against any specific real-world situation. Let's define particles and waves as follows:
- Waves exhibit superposition, and specifically interference phenomena.
- Particles can only exist in
whole numbers, not fractions
As a real-world check on our philosophizing, there is one
particular experiment that works perfectly. We set up a
double-slit interference experiment that we know will
produce a diffraction pattern if light is an honest-to-goodness
wave, but we detect the light with a detector that is
capable of sensing individual photons, e.g., a digital
camera. To make it possible to pick out individual dots due to
individual photons, we must use filters to cut down the
intensity of the light to a very low level, just as in the
photos by Prof. Page on page 787. The whole thing is
sealed inside a light-tight box. The results are shown in
figure i. (In fact, the similar
figures in on page 787
are simply cutouts from these figures.)

i / Wave interference patterns photographed by Prof. Lyman Page with a digital camera. Laser light with a single well-defined wavelength passed through a series of absorbers to cut down its intensity, then through a set of slits to produce interference, and finally into a digital camera chip. (A triple slit was actually used, but for conceptual simplicity we discuss the results in the main text as if it was a double slit.) In panel 2 the intensity has been reduced relative to 1, and even more so for panel 3.
Neither the pure wave theory nor the pure particle theory can explain the results. If light was only a particle and not a wave, there would be no interference effect. The result of the experiment would be like firing a hail of bullets through a double slit, j. Only two spots directly behind the slits would be hit. If, on the other hand, light was only a wave and not a particle, we would get the same kind of diffraction pattern that would happen with a water wave, k. There would be no discrete dots in the photo, only a diffraction pattern that shaded smoothly between light and dark. Applying the definitions to this experiment, light must be both a particle and a wave. It is a wave because it exhibits interference effects. At the same time, the fact that the photographs contain discrete dots is a direct demonstration that light refuses to be split into units of less than a single photon. There can only be whole numbers of photons: four photons in figure i/3, for example.
A wrong interpretation: photons interfering with each other
One possible interpretation of wave-particle duality that occurred to physicists early in the game was that perhaps the interference effects came from photons interacting with each other. By analogy, a water wave consists of moving water molecules, and interference of water waves results ultimately from all the mutual pushes and pulls of the molecules. This interpretation was conclusively disproved by G.I. Taylor, a student at Cambridge. The demonstration by Prof. Page that we've just been discussing is essentially a modernized version of Taylor's work. Taylor reasoned that if interference effects came from photons interacting with each other, a bare minimum of two photons would have to be present at the same time to produce interference. By making the light source extremely dim, we can be virtually certain that there are never two photons in the box at the same time. In figure i/3, however, the intensity of the light has been cut down so much by the absorbers that if it was in the open, the average separation between photons would be on the order of a kilometer! At any given moment, the number of photons in the box is most likely to be zero. It is virtually certain that there were never two photons in the box at once. ===The concept of a photon's path is undefined.=== If a single photon can demonstrate double-slit interference, then which slit did it pass through? The unavoidable answer must be that it passes through both! This might not seem so strange if we think of the photon as a wave, but it is highly counterintuitive if we try to visualize it as a particle. The moral is that we should not think in terms of the path of a photon. Like the fully human and fully divine Jesus of Christian theology, a photon is supposed to be 100% wave and 100% particle. If a photon had a well defined path, then it would not demonstrate wave superposition and interference effects, contradicting its wave nature. (In subsection 13.3.4 we will discuss the Heisenberg uncertainty principle, which gives a numerical way of approaching this issue.)
Another wrong interpretation: the pilot wave hypothesis
A second possible explanation of wave-particle duality was taken seriously in the early history of quantum mechanics. What if the photon particle is like a surfer riding on top of its accompanying wave? As the wave travels along, the particle is pushed, or “piloted” by it. Imagining the particle and the wave as two separate entities allows us to avoid the seemingly paradoxical idea that a photon is both at once. The wave happily does its wave tricks, like superposition and interference, and the particle acts like a respectable particle, resolutely refusing to be in two different places at once. If the wave, for instance, undergoes destructive interference, becoming nearly zero in a particular region of space, then the particle simply is not guided into that region. The problem with the pilot wave interpretation is that the only way it can be experimentally tested or verified is if someone manages to detach the particle from the wave, and show that there really are two entities involved, not just one. Part of the scientific method is that hypotheses are supposed to be experimentally testable. Since nobody has ever managed to separate the wavelike part of a photon from the particle part, the interpretation is not useful or meaningful in a scientific sense.
The probability interpretation
The correct interpretation of wave-particle duality is suggested by the random nature of the experiment we've been discussing: even though every photon wave/particle is prepared and released in the same way, the location at which it is eventually detected by the digital camera is different every time. The idea of the probability interpretation of wave-particle duality is that the location of the photon-particle is random, but the probability that it is in a certain location is higher where the photon-wave's amplitude is greater. More specifically, the probability distribution of the particle must be proportional to the square of the wave's amplitude, ∝ (amplitude)2 . This follows from the correspondence principle and from the fact that a wave's energy density is proportional to the square of its amplitude. If we run the double-slit experiment for a long enough time, the pattern of dots fills in and becomes very smooth as would have been expected in classical physics. To preserve the correspondence between classical and quantum physics, the amount of energy deposited in a given region of the picture over the long run must be proportional to the square of the wave's amplitude. The amount of energy deposited in a certain area depends on the number of photons picked up, which is proportional to the probability of finding any given photon there.
Example 9: A microwave oven
◊ The figure shows two-dimensional (top) and one-dimensional (bottom) representations of the standing wave inside a microwave oven. Gray represents zero field, and white and black signify the strongest fields, with white being a field that is in the opposite direction compared to black. Compare the probabilities of detecting a microwave photon at points A, B, and C. ◊ A and C are both extremes of the wave, so the probabilities of detecting a photon at A and C are equal. It doesn't matter that we have represented C as negative and A as positive, because it is the square of the amplitude that is relevant. The amplitude at B is about 1/2 as much as the others, so the probability of detecting a photon there is about 1/4 as much. The probability interpretation was disturbing to physicists who had spent their previous careers working in the deterministic world of classical physics, and ironically the most strenuous objections against it were raised by Einstein, who had invented the photon concept in the first place. The probability interpretation has nevertheless passed every experimental test, and is now as well established as any part of physics. An aspect of the probability interpretation that has made many people uneasy is that the process of detecting and recording the photon's position seems to have a magical ability to get rid of the wavelike side of the photon's personality and force it to decide for once and for all where it really wants to be. But detection or measurement is after all only a physical process like any other, governed by the same laws of physics. We will postpone a detailed discussion of this issue until the following chapter, since a measuring device like a digital camera is made of matter, but we have so far only discussed how quantum mechanics relates to light.
Example 10: What is the proportionality constant?
◊ What is the proportionality constant that would
make an actual equation out of
?
◊ The probability that the photon is in a certain
small region of volume v should equal the fraction of the
wave's energy that is within that volume:
Failed to parse (lexing error): \begin{align} P &= \frac{\text{energy in volume $v$}}{\text{energy of photon}} \\ &= \frac{\text{energy in volume $v$}}{hf} \end{align}
We assume v is small enough so that the electric and magnetic fields are nearly constant throughout it. We then have Failed to parse (unknown function\vc): P = \frac{\left(\frac{1}{8\pi k}|\vc{E}|^2 +\frac{1}{2\mu_text{o}}|\vc{B}|^2\right)v}{hf} \qquad .
We can simplify this formidable looking expression by recognizing that in an electromagnetic wave, |E| and |B| are related by |E|=c|B|. With some algebra, it turns out that the electric and magnetic fields each contribute half the total energy, so we can simplify this to Failed to parse (unknown function\vc): \begin{align} P &= 2\frac{\left(\frac{1}{8\pi k}|\vc{E}|^2\right)v}{hf} \\ &= \frac{v}{4\pi khf}|\vc{E}|^2 \qquad . \end{align}
As advertised, the probability is proportional to the square of the wave's amplitude. ”Discussion Questions” ◊ Referring back to the example of the carrot in the microwave oven, show that it would be nonsensical to have probability be proportional to the field itself, rather than the square of the field. ◊ Einstein did not try to reconcile the wave and particle theories of light, and did not say much about their apparent inconsistency. Einstein basically visualized a beam of light as a stream of bullets coming from a machine gun. In the photoelectric effect, a photon “bullet” would only hit one atom, just as a real bullet would only hit one person. Suppose someone reading his 1905 paper wanted to interpret it by saying that Einstein's so-called particles of light are simply short wave-trains that only occupy a small region of space. Comparing the wavelength of visible light (a few hundred nm) to the size of an atom (on the order of 0.1 nm), explain why this poses a difficulty for reconciling the particle and wave theories. ◊ Can a white photon exist? ◊ In double-slit diffraction of photons, would you get the same pattern of dots on the digital camera image if you covered one slit? Why should it matter whether you give the photon two choices or only one?

n / Probability is the volume under a surface defined by D(x,y).
Photons in three dimensions
Up until now I've been sneaky and avoided a full discussion of the three-dimensional aspects of the probability interpretation. The example of the carrot in the microwave oven, for example, reduced to a one-dimensional situation because we were considering three points along the same line and because we were only comparing ratios of probabilities. The purpose of bringing it up now is to head off any feeling that you've been cheated conceptually rather than to prepare you for mathematical problem solving in three dimensions, which would not be appropriate for the level of this course. A typical example of a probability distribution in section 13.1 was the distribution of heights of human beings. The thing that varied randomly, height, h, had units of meters, and the probability distribution was a graph of a function D(h). The units of the probability distribution had to be m-1 (inverse meters) so that areas under the curve, interpreted as probabilities, would be unitless: (area)=(height)(width)=m-1⋅m. Now suppose we have a two-dimensional problem, e.g., the probability distribution for the place on the surface of a digital camera chip where a photon will be detected. The point where it is detected would be described with two variables, x and y, each having units of meters. The probability distribution will be a function of both variables, D(x,y). A probability is now visualized as the volume under the surface described by the function D(x,y), as shown in figure n. The units of D must be m-2 so that probabilities will be unitless: (probability)=(depth)(length)(width). In terms of calculus, we have Failed to parse (lexing error): P\:=\:\int D\: d x \:d y . Generalizing finally to three dimensions, we find by analogy that the probability distribution will be a function of all three coordinates, D(x,y,z), and will have units of m-3. It is unfortunately impossible to visualize the graph unless you are a mutant with a natural feel for life in four dimensions. If the probability distribution is nearly constant within a certain volume of space v, the probability that the photon is in that volume is simply vD. If not, then we can use an integral, Failed to parse (lexing error): P\:=\:\int D\: d x \:d y\: d z . \inlinefignocaption{melting-witch}
Matter As a Wave
[In] a few minutes I shall be all melted... I have been wicked in my day, but I never thought a little girl like you would ever be able to melt me and end my wicked deeds. Look out --- here I go! -- The Wicked Witch of the West As the Wicked Witch learned the hard way, losing molecular cohesion can be unpleasant. That's why we should be very grateful that the concepts of quantum physics apply to matter as well as light. If matter obeyed the laws of classical physics, molecules wouldn't exist. Consider, for example, the simplest atom, hydrogen. Why does one hydrogen atom form a chemical bond with another hydrogen atom? Roughly speaking, we'd expect a neighboring pair of hydrogen atoms, A and B, to exert no force on each other at all, attractive or repulsive: there are two repulsive interactions (proton A with proton B and electron A with electron B) and two attractive interactions (proton A with electron B and electron A with proton B). Thinking a little more precisely, we should even expect that once the two atoms got close enough, the interaction would be repulsive. For instance, if you squeezed them so close together that the two protons were almost on top of each other, there would be a tremendously strong repulsion between them due to the 1/r2 nature of the electrical force. The repulsion between the electrons would not be as strong, because each electron ranges over a large area, and is not likely to be found right on top of the other electron. Thus hydrogen molecules should not exist according to classical physics. Quantum physics to the rescue! As we'll see shortly, the whole problem is solved by applying the same quantum concepts to electrons that we have already used for photons.

b / These two electron waves are not distinguishable by any measuring device.
Electrons as waves
We started our journey into quantum physics by studying the
random behavior of matter in radioactive decay, and
then asked how randomness could be linked to the basic laws
of nature governing light. The probability interpretation
of wave-particle duality was strange and hard to accept, but
it provided such a link. It is now natural to ask whether
the same explanation could be applied to matter. If the
fundamental building block of light, the photon, is a
particle as well as a wave, is it possible that the basic
units of matter, such as electrons, are waves as well as particles?
A young French aristocrat studying physics, Louis de Broglie (pronounced “broylee”), made
exactly this suggestion in his 1923 Ph.D. thesis. His idea
had seemed so farfetched that there was serious doubt about
whether to grant him the degree. Einstein was asked for his
opinion, and with his strong support, de Broglie got his degree.
Only two years later, American physicists C.J. Davisson
and L. Germer confirmed de Broglie's idea
by accident. They had been studying the scattering of
electrons from the surface of a sample of nickel, made of
many small crystals. (One can often see such a crystalline
pattern on a brass doorknob that has been polished by
repeated handling.) An accidental explosion occurred, and
when they put their apparatus back together they observed
something entirely different: the scattered electrons were
now creating an interference pattern! This dramatic proof of
the wave nature of matter came about because the nickel
sample had been melted by the explosion and then resolidified
as a single crystal. The nickel atoms, now nicely arranged
in the regular rows and columns of a crystalline lattice,
were acting as the lines of a diffraction grating. The new
crystal was analogous to the type of ordinary diffraction
grating in which the lines are etched on the surface of a
mirror (a reflection grating) rather than the kind in which
the light passes through the transparent gaps between the
lines (a transmission grating).

a / A double-slit interference pattern made with neutrons. (A. Zeilinger, R. G\"{a}hler, C.G. Shull, W. Treimer, and W. Mampe, Reviews of Modern Physics, Vol. 60, 1988.)
Although we will concentrate on the wave-particle duality of electrons because it is important in chemistry and the physics of atoms, all the other “particles” of matter you've learned about show wave properties as well. Figure a, for instance, shows a wave interference pattern of neutrons. It might seem as though all our work was already done for us, and there would be nothing new to understand about electrons: they have the same kind of funny wave-particle duality as photons. That's almost true, but not quite. There are some important ways in which electrons differ significantly from photons:
- Electrons have mass, and photons don't.
- Photons always move at the speed of light, but electrons
can move at any speed less than c.
- Photons don't have electric charge, but electrons do, so
electric forces can act on them. The most important example is the atom, in which the electrons are held by the electric force of the nucleus.
- Electrons cannot be absorbed or emitted as photons are.
Destroying an electron or creating one out of nothing would violate conservation of charge.
(In section 13.4 we will learn of one more fundamental way in
which electrons differ from photons, for a total of five.)
Because electrons are different from photons, it is not
immediately obvious which of the photon equations from
chapter 11 can be applied to electrons as well. A
particle property, the energy of one photon, is related to
its wave properties via E=hf or, equivalently,
E=hc/λ. The momentum of a photon was given
by p=hf/c or p=h/λ. Ultimately it was a
matter of experiment to determine which of these equations,
if any, would work for electrons, but we can make a quick
and dirty guess simply by noting that some of the equations
involve c, the speed of light, and some do not. Since c
is irrelevant in the case of an electron, we might guess
that the equations of general validity are those that do
not have c in them:
This is essentially the reasoning that de Broglie went
through, and experiments have confirmed these two equations
for all the fundamental building blocks of light and matter,
not just for photons and electrons.
The second equation, which I soft-pedaled in the previous
chapter, takes on a greater important for electrons. This is
first of all because the momentum of matter is more likely
to be significant than the momentum of light under ordinary
conditions, and also because force is the transfer of
momentum, and electrons are affected by electrical forces.
Example 11: The wavelength of an elephant
◊ What is the wavelength of a trotting elephant? ◊ One may doubt whether the equation should be applied to an elephant, which is not just a single particle but a rather large collection of them. Throwing caution to the wind, however, we estimate the elephant's mass at 103 kg and its trotting speed at 10 m/s. Its wavelength is therefore roughly Failed to parse (unknown function\unitdot): \begin{align} \lambda &= \frac{h}{p} \\ &= \frac{h}{mv} \\ &= \frac{6.63\times10^{-34}\ text{J}\unitdot\text{s}}{(10^3\ \text{kg})(10\ \text{m}/\text{s})} \\ &\sim 10^{-37}\ \frac{\left(\text{kg}\unitdot\text{m}^2/\text{s}^2\right)\unitdot\text{s}}{\text{kg}\unitdot\text{m}/\text{s}} \\ &= 10^{-37}\ \text{m} \end{align}
The wavelength found in this example is so fantastically small that we can be sure we will never observe any measurable wave phenomena with elephants or any other human-scale objects. The result is numerically small because Planck's constant is so small, and as in some examples encountered previously, this smallness is in accord with the correspondence principle. Although a smaller mass in the equation λ =h/mv does result in a longer wavelength, the wavelength is still quite short even for individual electrons under typical conditions, as shown in the following example.
Example 12: The typical wavelength of an electron
◊ Electrons in circuits and in atoms are typically
moving through voltage differences on the order of 1 V,
so that a typical energy is (e)(1 V), which is on the
order of 10-19 J. What is the wavelength of an electron
with this amount of kinetic energy?
◊ This energy is nonrelativistic, since it is much
less than mc2. Momentum and energy are therefore related
by the nonrelativistic equation K=p2/2m. Solving
for p and substituting in to the equation for the wavelength, we find
This is on the same order of magnitude as the size of an
atom, which is no accident: as we will discuss in the next
chapter in more detail, an electron in an atom can be
interpreted as a standing wave. The smallness of the
wavelength of a typical electron also helps to explain why
the wave nature of electrons wasn't discovered until a
hundred years after the wave nature of light. To scale the
usual wave-optics devices such as diffraction gratings down
to the size needed to work with electrons at ordinary
energies, we need to make them so small that their parts are
comparable in size to individual atoms. This is essentially
what Davisson and Germer did with their nickel crystal.
self-check:
These remarks about the inconvenient smallness of electron
wavelengths apply only under the assumption that the
electrons have typical energies. What kind of energy would
an electron have to have in order to have a longer
wavelength that might be more convenient to work with?
(answer in the back of the PDF version of the book)
What kind of wave is it?
If a sound wave is a vibration of matter, and a photon is a
vibration of electric and magnetic fields, what kind of a
wave is an electron made of? The disconcerting answer is
that there is no experimental “observable,” i.e., directly
measurable quantity, to correspond to the electron wave
itself. In other words, there are devices like microphones
that detect the oscillations of air pressure in a sound
wave, and devices such as radio receivers that measure the
oscillation of the electric and magnetic fields in a light
wave, but nobody has ever found any way to measure the
electron wave directly.
We can of course detect the energy (or momentum) possessed
by an electron just as we could detect the energy of a
photon using a digital camera. (In fact I'd imagine that an
unmodified digital camera chip placed in a vacuum chamber
would detect electrons just as handily as photons.) But this
only allows us to determine where the wave carries high
probability and where it carries low probability. Probability
is proportional to the square of the wave's amplitude, but
measuring its square is not the same as measuring the wave
itself. In particular, we get the same result by squaring
either a positive number or its negative, so there is no way
to determine the positive or negative sign of an electron wave.
Most physicists tend toward the school of philosophy known
as operationalism, which says that a concept is only
meaningful if we can define some set of operations for
observing, measuring, or testing it. According to a strict
operationalist, then, the electron wave itself is a
meaningless concept. Nevertheless, it turns out to be one of
those concepts like love or humor that is impossible to
measure and yet very useful to have around. We therefore
give it a symbol, Ψ (the capital Greek letter psi),
and a special name, the electron wavefunction
(because it is a function of the coordinates x, y, and z
that specify where you are in space). It would be impossible,
for example, to calculate the shape of the electron wave in
a hydrogen atom without having some symbol for the wave. But
when the calculation produces a result that can be compared
directly to experiment, the final algebraic result will turn
out to involve only Ψ2, which is what is observable, not Ψ itself.
Since Ψ, unlike E and B, is not directly
measurable, we are free to make the probability equations
have a simple form: instead of having the probability
density equal to some funny constant multiplied by Ψ2,
we simply define Ψ so that the constant of
proportionality is one:
Since the probability distribution has units of m-3, the units
of Ψ must be m-3/2.
”Discussion Question”
◊
Frequency is oscillations per second, whereas wavelength is
meters per oscillation. How could the equations E=hf
and p=h/λ be made to look more alike by
using quantities that were more closely analogous?
(This more symmetric treatment makes it easier to
incorporate relativity into quantum mechanics, since
relativity says that space and time are not entirely
separate.)

c / Part of an infinite sine wave.

d / A finite-length sine wave.

e / A beat pattern created by superimposing two sine waves with slightly different wavelengths.
Dispersive waves
A colleague of mine who teaches chemistry loves to tell the
story about an exceptionally bright student who, when told
of the equation p=h/λ, protested, “But when I
derived it, it had a factor of 2!” The issue that's
involved is a real one, albeit one that could be glossed
over (and is, in most textbooks) without raising any alarms
in the mind of the average student. The present optional
section addresses this point; it is intended for the student
who wishes to delve a little deeper.
Here's how the now-legendary student was presumably
reasoning. We start with the equation v=fλ, which
is valid for any sine wave, whether it's quantum or
classical. Let's assume we already know E=hf, and
are trying to derive the relationship between wavelength and momentum:
The reasoning seems valid, but the result does contradict
the accepted one, which is after all solidly based on experiment.
The mistaken assumption is that we can figure everything out
in terms of pure sine waves. Mathematically, the only wave
that has a perfectly well defined wavelength and frequency
is a sine wave, and not just any sine wave but an infinitely
long sine wave, c.
The unphysical thing about such a wave
is that it has no leading or trailing edge, so it can never
be said to enter or leave any particular region of space.
Our derivation made use of the velocity, v, and if
velocity is to be a meaningful concept, it must tell us how
quickly stuff (mass, energy, momentum, ...) is transported
from one region of space to another. Since an infinitely
long sine wave doesn't remove any stuff from one region and
take it to another, the “velocity of its stuff” is not a
well defined concept.
Of course the individual wave peaks do travel through space,
and one might think that it would make sense to associate
their speed with the “speed of stuff,” but as we will see,
the two velocities are in general unequal when a wave's
velocity depends on wavelength. Such a wave is called a
dispersive wave, because a wave pulse consisting of a
superposition of waves of different wavelengths will
separate (disperse) into its separate wavelengths as the
waves move through space at different speeds. Nearly all
the waves we have encountered have been nondispersive. For
instance, sound waves and light waves (in a vacuum) have
speeds independent of wavelength. A water wave is one good
example of a dispersive wave. Long-wavelength water waves
travel faster, so a ship at sea that encounters a storm
typically sees the long-wavelength parts of the wave first.
When dealing with dispersive waves, we need symbols and
words to distinguish the two speeds. The speed at which
wave peaks move is called the phase velocity, vp, and the
speed at which “stuff” moves is called the group velocity, vg.
An infinite sine wave can only tell us about the phase
velocity, not the group velocity, which is really what we
would be talking about when we refer to the speed of an
electron. If an infinite sine wave is the simplest possible
wave, what's the next best thing? We might think the runner
up in simplicity would be a wave train consisting of a
chopped-off segment of a sine wave, d. However, this kind
of wave has kinks in it at the end. A simple wave should be
one that we can build by superposing a small number of
infinite sine waves, but a kink can never be produced by
superposing any number of infinitely long sine waves.
Actually the simplest wave that transports stuff from place
to place is the pattern shown in figure
e. Called a beat
pattern, it is formed by superposing two sine waves whose
wavelengths are similar but not quite the same. If you have
ever heard the pulsating howling sound of musicians in the
process of tuning their instruments to each other, you have
heard a beat pattern. The beat pattern gets stronger and
weaker as the two sine waves go in and out of phase with
each other. The beat pattern has more “stuff” (energy, for
example) in the areas where constructive interference
occurs, and less in the regions of cancellation. As the
whole pattern moves through space, stuff is transported from
some regions and into other ones.
If the frequency of the two sine waves differs by 10%, for
instance, then ten periods will be occur between times when
they are in phase. Another way of saying it is that the
sinusoidal “envelope” (the dashed lines in figure e) has
a frequency equal to the difference in frequency between the
two waves. For instance, if the waves had frequencies of 100
Hz and 110 Hz, the frequency of the envelope would be 10 Hz.
To apply similar reasoning to the wavelength, we must define
a quantity z=1/λ that relates to wavelength in the
same way that frequency relates to period. In terms of this
new variable, the z of the envelope equals the difference
between the z's of the two sine waves.
The group velocity is the speed at which the envelope moves
through space. Let Δ f and Δ z be the
differences between the frequencies and z's of the two
sine waves, which means that they equal the frequency and
z of the envelope. The group velocity is vg=fenvelopeλenvelope=Δz. If Δ f and Δ z are sufficiently
small, we can approximate this expression as a derivative,
This expression is usually taken as the definition of the
group velocity for wave patterns that consist of a
superposition of sine waves having a narrow range of
frequencies and wavelengths. In quantum mechanics, with
f=E/h and z=p/h, we have vg=d E/d p. In the case of a
nonrelativistic electron the relationship between energy and
momentum is E=p2/2m, so the group velocity is d E/d p=p/m=v,
exactly what it should be. It is only the phase velocity
that differs by a factor of two from what we would have expected,
but the phase velocity is not the physically important thing.

f / Three possible standing-wave patterns for a particle in a box.
![]()
g / The spectrum of the light from the star Sirius.

h / Two hydrogen atoms bond to form an H2 molecule. In the molecule, the two electrons' wave patterns overlap , and are about twice as wide.
Bound states
Electrons are at their most interesting when they're in atoms, that is, when they are bound within a small region of space. We can understand a great deal about atoms and molecules based on simple arguments about such bound states, without going into any of the realistic details of atom. The simplest model of a bound state is known as the particle in a box: like a ball on a pool table, the electron feels zero force while in the interior, but when it reaches an edge it encounters a wall that pushes back inward on it with a large force. In particle language, we would describe the electron as bouncing off of the wall, but this incorrectly assumes that the electron has a certain path through space. It is more correct to describe the electron as a wave that undergoes 100% reflection at the boundaries of the box. Like a generation of physics students before me, I rolled my eyes when initially introduced to the unrealistic idea of putting a particle in a box. It seemed completely impractical, an artificial textbook invention. Today, however, it has become routine to study electrons in rectangular boxes in actual laboratory experiments. The “box” is actually just an empty cavity within a solid piece of silicon, amounting in volume to a few hundred atoms. The methods for creating these electron-in-a-box setups (known as “quantum dots”) were a by-product of the development of technologies for fabricating computer chips. For simplicity let's imagine a one-dimensional electron in a box, i.e., we assume that the electron is only free to move along a line. The resulting standing wave patterns, of which the first three are shown in the figure, are just like some of the patterns we encountered with sound waves in musical instruments. The wave patterns must be zero at the ends of the box, because we are assuming the walls are impenetrable, and there should therefore be zero probability of finding the electron outside the box. Each wave pattern is labeled according to n, the number of peaks and valleys it has. In quantum physics, these wave patterns are referred to as “states” of the particle-in-the-box system. The following seemingly innocuous observations about the particle in the box lead us directly to the solutions to some of the most vexing failures of classical physics: The particle's energy is quantized (can only have certain values). Each wavelength corresponds to a certain momentum, and a given momentum implies a definite kinetic energy, E=p2/2m. (This is the second type of energy quantization we have encountered. The type we studied previously had to do with restricting the number of particles to a whole number, while assuming some specific wavelength and energy for each particle. This type of quantization refers to the energies that a single particle can have. Both photons and matter particles demonstrate both types of quantization under the appropriate circumstances.) The particle has a minimum kinetic energy. Long wavelengths correspond to low momenta and low energies. There can be no state with an energy lower than that of the n=1 state, called the ground state. The smaller the space in which the particle is confined, the higher its kinetic energy must be. Again, this is because long wavelengths give lower energies.
Example 13: Spectra of thin gases
A fact that was inexplicable by classical physics was that thin gases absorb and emit light only at certain wavelengths. This was observed both in earthbound laboratories and in the spectra of stars. The figure on the left shows the example of the spectrum of the star Sirius, in which there are “gap teeth” at certain wavelengths. Taking this spectrum as an example, we can give a straightforward explanation using quantum physics.
Energy is released in the dense interior of the star, but the outer layers of the star are thin, so the atoms are far apart and electrons are confined within individual atoms. Although their standing-wave patterns are not as simple as those of the particle in the box, their energies are quantized.
When a photon is on its way out through the outer layers, it can be absorbed by an electron in an atom, but only if the amount of energy it carries happens to be the right amount to kick the electron from one of the allowed energy levels to one of the higher levels. The photon energies that are missing from the spectrum are the ones that equal the difference in energy between two electron energy levels. (The most prominent of the absorption lines in Sirius's spectrum are absorption lines of the hydrogen atom.)
Example 14: The stability of atoms
In many Star Trek episodes the Enterprise, in orbit around a planet, suddenly lost engine power and began spiraling down toward the planet's surface. This was utter nonsense, of course, due to conservation of energy: the ship had no way of getting rid of energy, so it did not need the engines to replenish it.
Consider, however, the electron in an atom as it orbits the nucleus. The electron does have a way to release energy: it has an acceleration due to its continuously changing direction of motion, and according to classical physics, any accelerating charged particle emits electromagnetic waves. According to classical physics, atoms should collapse!
The solution lies in the observation that a bound state has a minimum energy. An electron in one of the higher-energy atomic states can and does emit photons and hop down step by step in energy. But once it is in the ground state, it cannot emit a photon because there is no lower-energy state for it to go to.
Example 15: Chemical bonds
I began this section with a classical argument that chemical bonds, as in an H2 molecule, should not exist. Quantum physics explains why this type of bonding does in fact occur. When the atoms are next to each other, the electrons are shared between them. The “box” is about twice as wide, and a larger box allows a smaller energy. Energy is required in order to separate the atoms. (A qualitatively different type of bonding is discussed on page 837. Example 20 on page 832 revisits the H2 bond in more detail.) ”Discussion Questions” ◊ Neutrons attract each other via the strong nuclear force, so according to classical physics it should be possible to form nuclei out of clusters of two or more neutrons, with no protons at all. Experimental searches, however, have failed to turn up evidence of a stable two-neutron system (dineutron) or larger stable clusters. These systems are apparently not just unstable in the sense of being able to beta decay but unstable in the sense that they don't hold together at all. Explain based on quantum physics why a dineutron might spontaneously fly apart. ◊ The following table shows the energy gap between the ground state and the first excited state for four nuclei, in units of picojoules. (The nuclei were chosen to be ones that have similar structures, e.g., they are all spherical in shape.)
| nucleus | energy gap (picojoules) |
| 4zuHe | 3.234 |
| 16zuO | 0.968 |
| 40zuCa | 0.536 |
| 208zuPb | 0.418 |
Explain the trend in the data.
The uncertainty principle and measurement
Eliminating randomness through measurement?
A common reaction to quantum physics, among both early-twentieth-century physicists and modern students, is that we should be able to get rid of randomness through accurate measurement. If I say, for example, that it is meaningless to discuss the path of a photon or an electron, one might suggest that we simply measure the particle's position and velocity many times in a row. This series of snapshots would amount to a description of its path. A practical objection to this plan is that the process of measurement will have an effect on the thing we are trying to measure. This may not be of much concern, for example, when a traffic cop measure's your car's motion with a radar gun, because the energy and momentum of the radar pulses are insufficient to change the car's motion significantly. But on the subatomic scale it is a very real problem. Making a videotape through a microscope of an electron orbiting a nucleus is not just difficult, it is theoretically impossible. The video camera makes pictures of things using light that has bounced off them and come into the camera. If even a single photon of visible light was to bounce off of the electron we were trying to study, the electron's recoil would be enough to change its behavior significantly.
The heisenberg uncertainty principle
This insight, that measurement changes the thing being measured, is the kind of idea that clove-cigarette-smoking intellectuals outside of the physical sciences like to claim they knew all along. If only, they say, the physicists had made more of a habit of reading literary journals, they could have saved a lot of work. The anthropologist Margaret Mead has recently been accused of inadvertently encouraging her teenaged Samoan informants to exaggerate the freedom of youthful sexual experimentation in their society. If this is considered a damning critique of her work, it is because she could have done better: other anthropologists claim to have been able to eliminate the observer-as-participant problem and collect untainted data. The German physicist Werner Heisenberg, however, showed that in quantum physics, any measuring technique runs into a brick wall when we try to improve its accuracy beyond a certain point. Heisenberg showed that the limitation is a question of what there is to be known, even in principle, about the system itself, not of the ability or inability of a specific measuring device to ferret out information that is knowable but not previously hidden. Suppose, for example, that we have constructed an electron in a box (quantum dot) setup in our laboratory, and we are able to adjust the length L of the box as desired. All the standing wave patterns pretty much fill the box, so our knowledge of the electron's position is of limited accuracy. If we write Δ x for the range of uncertainty in our knowledge of its position, then Δ x is roughly the same as the length of the box: Δ x ≈ L If we wish to know its position more accurately, we can certainly squeeze it into a smaller space by reducing L, but this has an unintended side-effect. A standing wave is really a superposition of two traveling waves going in opposite directions. The equation p=h/λ really only gives the magnitude of the momentum vector, not its direction, so we should really interpret the wave as a 50/50 mixture of a right-going wave with momentum p=h/λ and a left-going one with momentum p=-h/λ. The uncertainty in our knowledge of the electron's momentum is Δ p=2h/λ, covering the range between these two values. Even if we make sure the electron is in the ground state, whose wavelength λ =2L is the longest possible, we have an uncertainty in momentum of Δ p=h/L. In general, we find Δ p >rsim h/L , with equality for the ground state and inequality for the higher-energy states. Thus if we reduce L to improve our knowledge of the electron's position, we do so at the cost of knowing less about its momentum. This trade-off is neatly summarized by multiplying the two equations to give Δ pΔ x >rsim h . Although we have derived this in the special case of a particle in a box, it is an example of a principle of more general validity:
The Heisenberg uncertainty principle
It is not possible, even in principle, to know the momentum and the position of a particle simultaneously and with perfect accuracy. The uncertainties in these two quantities are always such that Δ pΔ x >rsim h.
(This approximation can be made into a strict inequality, Δ pΔ x>h/4π, but only with more careful definitions, which we will not bother with.) Note that although I encouraged you to think of this derivation in terms of a specific real-world system, the quantum dot, no reference was ever made to any specific laboratory equipment or procedures. The argument is simply that we cannot know the particle's position very accurately unless it has a very well defined position, it cannot have a very well defined position unless its wave-pattern covers only a very small amount of space, and its wave-pattern cannot be thus compressed without giving it a short wavelength and a correspondingly uncertain momentum. The uncertainty principle is therefore a restriction on how much there is to know about a particle, not just on what we can know about it with a certain technique.
Example 16: An estimate for electrons in atoms
◊ A typical energy for an electron in an atom is on
the order of
, which corresponds to a speed of
about 1% of the speed of light. If a typical atom has a
size on the order of 0.1 nm, how close are the electrons to
the limit imposed by the uncertainty principle?
◊ If we assume the electron moves in all directions
with equal probability, the uncertainty in its momentum is
roughly twice its typical momentum. This only an order-of-magnitude
estimate, so we take Δ p to be the same as a typical momentum:
Failed to parse (unknown function\unitdottext): \begin{align} \Delta p \Delta x &= p_{typical} \Delta x \\ &= (m_{electron}) (0.01c) (0.1\times10^{-9}\ \text{m}) \\ &= 3\times 10^{-34}\ text{J}\unitdottext{s} \end{align}
This is on the same order of magnitude as Planck's constant, so evidently the electron is “right up against the wall.” (The fact that it is somewhat less than h is of no concern since this was only an estimate, and we have not stated the uncertainty principle in its most exact form.) self-check: If we were to apply the uncertainty principle to human-scale objects, what would be the significance of the small numerical value of Planck's constant? (answer in the back of the PDF version of the book)
Optional topic: Historical Note
The true nature of Heisenberg's role in the Nazi atomic bomb effort is a fascinating question, and dramatic enough to have inspired a well-received 1998 theatrical play, “Copenhagen.” The real story, however, may never be completely unraveled. Heisenberg was the scientific leader of the German bomb program up until its cancellation in 1942, when the German military decided that it was too ambitious a project to undertake in wartime, and too unlikely to produce results. Some historians believe that Heisenberg intentionally delayed and obstructed the project because he secretly did not want the Nazis to get the bomb. Heisenberg's apologists point out that he never joined the Nazi party, and was not anti-Semitic. He actively resisted the government's Deutsche-Physik policy of eliminating supposed Jewish influences from physics, and as a result was denounced by the S.S. as a traitor, escaping punishment only because Himmler personally declared him innocent. One strong piece of evidence for this view is a secret message carried to the U.S. in 1941, by one of the last Jews to escape from Berlin, and eventually delivered to the chairman of the Uranium Committee, which was then studying the feasibility of a bomb. The message states “...that a large number of German physicists are working intensively on the problem of the uranium bomb under the direction of Heisenberg, [and] that Heisenberg himself tries to delay the work as much as possible, fearing the catastrophic results of success. But he cannot help fulfilling the orders given to him, and if the problem can be solved, it will be solved probably in the near future. So he gave the advice to us to hurry up if U.S.A. will not come too late.” The message supports the view that Heisenberg intentionally misled his government about the bomb's technical feasibility; Nazi Minister of Armaments Albert Speer wrote that he was convinced to drop the project after a 1942 meeting with Heisenber because “the physicists themselves didn't want to put too much into it.” Heisenberg also may have warned Danish physicist Niels Bohr personally in September 1941 about the existence of the Nazi bomb effort. On the other side of the debate, critics of Heisenberg say that he clearly wanted Germany to win the war, that he visited German-occupied territories in a semi-official role, and that he may simply have not been very good at his job directing the bomb project. On a visit to the occupied Netherlands in 1943, he told a colleague, “Democracy cannot develop sufficient energy to rule Europe. There are, therefore, only two alternatives: Germany and Russia. And then a Europe under German leadership would be the lesser evil.” Some historians3 argue that the real point of Heisenberg's meeting with Bohr was to try to convince the U.S. not to try to build a bomb, so that Germany, possessing a nuclear monopoly, would defeat the Soviets --- this was after the June 1941 entry of the U.S.S.R. into the war, but before the December 1941 Pearl Harbor attack brought the U.S. in. Bohr apparently considered Heisenberg's account of the meeting, published after the war was over, to be inaccurate.4 The secret 1941 message also has a curious moral passivity to it, as if Heisenberg was saying “I hope you stop me before I do something bad,” but we should also consider the great risk Heisenberg would have been running if he actually originated the message.
Measurement and Schr\"odinger's cat
In the previous chapter I briefly mentioned an issue concerning measurement that we are now ready to address carefully. If you hang around a laboratory where quantum-physics experiments are being done and secretly record the physicists' conversations, you'll hear them say many things that assume the probability interpretation of quantum mechanics. Usually they will speak as though the randomness of quantum mechanics enters the picture when something is measured. In the digital camera experiments of the previous chapter, for example, they would casually describe the detection of a photon at one of the pixels as if the moment of detection was when the photon was forced to “make up its mind.” Although this mental cartoon usually works fairly well as a description of things they experience in the lab, it cannot ultimately be correct, because it attributes a special role to measurement, which is really just a physical process like all other physical processes. If we are to find an interpretation that avoids giving any special role to measurement processes, then we must think of the entire laboratory, including the measuring devices and the physicists themselves, as one big quantum-mechanical system made out of protons, neutrons, electrons, and photons. In other words, we should take quantum physics seriously as a description not just of microscopic objects like atoms but of human-scale (“macroscopic”) things like the apparatus, the furniture, and the people. The most celebrated example is called the Schr\"odinger's cat experiment. Luckily for the cat, there probably was no actual experiment --- it was simply a “thought experiment” that the physicist the German theorist Schr\"odinger discussed with his colleagues. Schr\"odinger wrote:
One can even construct quite burlesque cases. A cat is shut up in a steel container, together with the following diabolical apparatus (which one must keep out of the direct clutches of the cat): In a Geiger tube [radiation detector] there is a tiny mass of radioactive substance, so little that in the course of an hour perhaps one atom of it disintegrates, but also with equal probability not even one; if it does happen, the counter [detector] responds and ... activates a hammer that shatters a little flask of prussic acid [filling the chamber with poison gas]. If one has left this entire system to itself for an hour, then one will say to himself that the cat is still living, if in that time no atom has disintegrated. The first atomic disintegration would have poisoned it.
Now comes the strange part. Quantum mechanics describes the particles the cat is made of as having wave properties, including the property of superposition. Schr\"odinger describes the wavefunction of the box's contents at the end of the hour: The wavefunction of the entire system would express this situation by having the living and the dead cat mixed ... in equal parts [50/50 proportions]. The uncertainty originally restricted to the atomic domain has been transformed into a macroscopic uncertainty...
At first Schr\"odinger's description seems like nonsense. When you opened the box, would you see two ghostlike cats, as in a doubly exposed photograph, one dead and one alive? Obviously not. You would have a single, fully material cat, which would either be dead or very, very upset. But Schr\"odinger has an equally strange and logical answer for that objection. In the same way that the quantum randomness of the radioactive atom spread to the cat and made its wavefunction a random mixture of life and death, the randomness spreads wider once you open the box, and your own wavefunction becomes a mixture of a person who has just killed a cat and a person who hasn't. ”Discussion Questions” ◊ Compare Δ p and Δ x for the two lowest energy levels of the one-dimensional particle in a box, and discuss how this relates to the uncertainty principle. ◊ On a graph of Δ p versus Δx, sketch the regions that are allowed and forbidden by the Heisenberg uncertainty principle. Interpret the graph: Where does an atom lie on it? An elephant? Can either p or x be measured with perfect accuracy if we don't care about the other?

i / An electron in a gentle electric field gradually shortens its wavelength as it gains energy.

j / The wavefunction's tails go where classical physics says they shouldn't.
Electrons in electric fields
So far the only electron wave patterns we've considered have been simple sine waves, but whenever an electron finds itself in an electric field, it must have a more complicated wave pattern. Let's consider the example of an electron being accelerated by the electron gun at the back of a TV tube. Newton's laws are not useful, because they implicitly assume that the path taken by the particle is a meaningful concept. Conservation of energy is still valid in quantum physics, however. In terms of energy, the electron is moving from a region of low voltage into a region of higher voltage. Since its charge is negative, it loses electrical energy by moving to a higher voltage, so its kinetic energy increases. As its electrical energy goes down, its kinetic energy goes up by an equal amount, keeping the total energy constant. Increasing kinetic energy implies a growing momentum, and therefore a shortening wavelength, i The wavefunction as a whole does not have a single well-defined wavelength, but the wave changes so gradually that if you only look at a small part of it you can still pick out a wavelength and relate it to the momentum and energy. (The picture actually exaggerates by many orders of magnitude the rate at which the wavelength changes.)
But what if the electric field was stronger? The electric
field in a TV is only ∼105 N/C, but the electric field
within an atom is more like 1012 N/C. In figure k,
the wavelength changes so rapidly that there is nothing that
looks like a sine wave at all. We could get a rough idea of
the wavelength in a given region by measuring the distance
between two peaks, but that would only be a rough approximation.
Suppose we want to know the wavelength at point P. The
trick is to construct a sine wave, like the one shown with
the dashed line, which matches the curvature of the actual
wavefunction as closely as possible near P. The sine wave
that matches as well as possible is called the “osculating”
curve, from a Latin word meaning “to kiss.” The wavelength
of the osculating curve is the wavelength that will relate
correctly to conservation of energy.

k / A typical wavefunction of an electron in an atom (heavy curve) and the osculating sine wave (dashed curve) that matches its curvature at point P.
Tunneling
We implicitly assumed that the particle-in-a-box wavefunction would cut off abruptly at the sides of the box, j/1, but that would be unphysical. A kink has infinite curvature, and curvature is related to energy, so it can't be infinite. A physically realistic wavefunction must always “tail off” gradually, j/2. In classical physics, a particle can never enter a region in which its interaction energy U would be greater than the amount of energy it has available. But in quantum physics the wavefunction will always have a tail that reaches into the classically forbidden region. If it was not for this effect, called tunneling, the fusion reactions that power the sun would not occur due to the high electrical energy nuclei need in order to get close together! Tunneling is discussed in more detail in the following subsection.

l / Tunneling through a barrier.

m / The electrical, nuclear, and total interaction energies for an alpha particle escaping from a nucleus.
The Schr\"odinger equation
In subsection 13.3.5 we were able to apply conservation
of energy to an electron's wavefunction, but only by using
the clumsy graphical technique of osculating sine waves as a
measure of the wave's curvature. You have learned a more
convenient measure of curvature in calculus: the second
derivative. To relate the two approaches, we take the second
derivative of a sine wave:
Taking the second derivative gives us back the same
function, but with a minus sign and a constant out in front
that is related to the wavelength. We can thus relate the
second derivative to the osculating wavelength:
Failed to parse (unknown function\label): \label{eq:schreqna} \frac{d^2\Psi}{d x^2} = -\left(\frac{2\pi}{\lambda}\right)^2\Psi
This could be solved for λ in terms of Ψ, but it will turn out below to be more convenient to leave it in this form. Applying this to conservation of energy, we have Failed to parse (unknown function\label): \begin{align}\label{eq:schreqnb} E &= K + U \\ &= \frac{p^2}{2m} + U \\ &= \frac{(h/\lambda)^2}{2m} + U \end{align}
Note that both equation \eqref{eq:schreqna} and equation \eqref{eq:schreqnb} have λ2
in the denominator. We can simplify our algebra by
multiplying both sides of equation \eqref{eq:schreqnb} by Ψ to make it
look more like equation \eqref{eq:schreqna}:
Further simplification is achieved by using the symbol
(h
with a slash through it, read “h-bar”) as an abbreviation
for h/2π. We then have the important result known as
the \labelimportantintext{Schr\"odinger equation}:
(Actually this is a simplified version of the Schr\"odinger equation, applying only to standing waves in one dimension.) Physically it is a statement of conservation of energy. The total energy E must be constant, so the equation tells us that a change in interaction energy U must be accompanied by a change in the curvature of the wavefunction. This change in curvature relates to a change in wavelength, which corresponds to a change in momentum and kinetic energy. self-check: Considering the assumptions that were made in deriving the Schr\"odinger equation, would it be correct to apply it to a photon? To an electron moving at relativistic speeds? (answer in the back of the PDF version of the book) Usually we know right off the bat how U depends on x, so the basic mathematical problem of quantum physics is to find a function Ψ (x) that satisfies the Schr\"odinger equation for a given interaction-energy function U(x). An equation, such as the Schr\"odinger equation, that specifies a relationship between a function and its derivatives is known as a differential equation. The detailed study of the solution of the Schr\"odinger equation is beyond the scope of this book, but we can gain some important insights by considering the easiest version of the Schr\"odinger equation, in which the interaction energy U is constant. We can then rearrange the Schr\"odinger equation as follows: Failed to parse (unknown function\intertext): \begin{align} \frac{d^2\Psi}{d x^2} &= \frac{2m(U-E)}{\hbar^2} \Psi\qquad , \intertext{which boils down to} \frac{d^2\Psi}{d x^2} &= a\Psi \qquad , \end{align}
where, according to our assumptions, a is independent of x. We need to find a function whose second derivative is the same as the original function except for a multiplicative constant. The only functions with this property are sine waves and exponentials:
Failed to parse (lexing error): \begin{align} \frac{d^2}{d x^2}\left[\:q\sin(rx+s)\:\right] &= -qr^2\sin(rx+s) \\ \frac{d^2}{d x^2}\left[qe^{rx+s}\right] &= qr^2e^{rx+s} \end{align}
The sine wave gives negative values of a, a=-r2, and the exponential gives positive ones, a=r2. The former applies to the classically allowed region with U<E.
This leads us to a quantitative calculation of the tunneling
effect discussed briefly in the preceding subsection. The
wavefunction evidently tails off exponentially in the
classically forbidden region. Suppose, as shown in
figure l,
a wave-particle traveling to the right encounters a
barrier that it is classically forbidden to enter. Although
the form of the Schr\"odinger equation we're using technically
does not apply to traveling waves (because it makes no
reference to time), it turns out that we can still use it to
make a reasonable calculation of the probability that the
particle will make it through the barrier. If we let the
barrier's width be w, then the ratio of the wavefunction
on the left side of the barrier to the wavefunction on the right is
Probabilities are proportional to the squares of wavefunctions,
so the probability of making it through the barrier is
self-check:
If we were to apply this equation to find the probability
that a person can walk through a wall, what would the small
value of Planck's constant imply?
(answer in the back of the PDF version of the book)
Example 17: Tunneling in alpha decay
Naively, we would expect alpha decay to be a very fast process. The typical speeds of neutrons and protons inside a nucleus are extremely high (see problem 20). If we imagine an alpha particle coalescing out of neutrons and protons inside the nucleus, then at the typical speeds we're talking about, it takes a ridiculously small amount of time for them to reach the surface and try to escape. Clattering back and forth inside the nucleus, we could imagine them making a vast number of these “escape attempts” every second. Consider figure m, however, which shows the interaction energy for an alpha particle escaping from a nucleus. The electrical energy is kq1q2/r when the alpha is outside the nucleus, while its variation inside the nucleus is proportional to r, as a consequence of the shell theorem. The nuclear energy is constant when the alpha is inside the nucleus, because the forces from all the neighboring neutrons and protons cancel out; it rises sharply near the surface, and flattens out to zero over a distance of ∼ 1 fm, which is the maximum distance scale at which the strong force can operate. There is a classically forbidden region immediately outside the nucleus, so the alpha particle can only escape by quantum mechanical tunneling. (It's true, but somewhat counterintuitive, that a repulsive electrical force can make it more difficult for the alpha to get out.) In reality, alpha-decay half-lives are often extremely long --- sometimes billions of years --- because the tunneling probability is so small. Although the shape of the barrier is not a rectangle, the equation for the tunneling probability on page 818 can still be used as a rough guide to our thinking. Essentially the tunneling probability is so small because U-E is fairly big, typically about 30 MeV at the peak of the barrier.
Three dimensions
For simplicity, we've been considering the Schr\"odinger equation in one dimension, so that Ψ is only a function of x, and has units of m-1/2 rather than m-3/2. Since the Schr\"odinger equation is a statement of conservation of energy, and energy is a scalar, the generalization to three dimensions isn't particularly complicated. The total energy term E⋅Ψ and the interaction energy term U⋅Ψ involve nothing but scalars, and don't need to be changed at all. In the kinetic energy term, however, we're essentially basing our computation of the kinetic energy on the squared magnitude of the momentum, px2, and in three dimensions this would clearly have to be generalized to px2+py2+pz2. The obvious way to achieve this is to replace the second derivative d2Ψ/d x2 with the sum ∂2Ψ/∂ x2+ ∂2Ψ/∂ y2+ ∂2Ψ/∂ z2. Here the partial derivative symbol ∂, introduced on page 211, indicates that when differentiating with respect to a particular variable, the other variables are to be considered as constants. This operation on the function Ψ is notated ∇2Ψ, and the derivative-like operator ∇2=∂2/∂ x2+ ∂2/∂ y2+ ∂2/∂ z2 is called the Laplacian. It occurs elswehere in physics. For example, in classical electrostatics, the voltage in a region of vacuum must be a solution of the equation ∇2V=0. Like the second derivative, the Laplacian is essentially a measure of curvature.
Example 18: Examples of the Laplacian in two dimensions
◊ Compute the Laplacians of the following functions in two dimensions, and interpret them: A=x2+y2, B=-x2-y2, C=x2-y2. ◊ The first derivative of function A with respect to x is ∂ A/∂ x=2x. Since y is treated as a constant in the computation of the partial derivative ∂/∂ x, the second term goes away. The second derivative of A with respect to x is ∂2 A/∂ x2=2. Similarly we have ∂2 A/∂ y2=2, so ∇2 A=4. All derivative operators, including ∇2, have the linear property that multiplying the input function by a constant just multiplies the output function by the same constant. Since B=-A, and we have ∇2 B=-4. For function C, the x term contributes a second derivative of 2, but the y term contributes -2, so ∇2 C=0. The interpretation of the positive sign in ∇2 A=4 is that A's graph is shaped like a trophy cup, and the cup is concave up. The negative sign in the result for ∇2 B is because B is concave down. Function C is shaped like a saddle. Since its curvature along one axis is concave up, but the curvature along the other is down and equal in magnitude, the function is considered to have zero concavity over all.
Example 19: A classically allowed region with constant U
In a classically allowed region with constant U, we expect the solutions
to the Schr\"odinger equation to be sine waves. A sine wave in three dimensions
has the form
When we compute ∂2Ψ/∂ x2, double differentiation of
sin gives -sin, and the chain rule brings out a factor of kx2.
Applying all three second derivative operators, we get
The Schr\"odinger equation gives
which can be satisfied since we're in a classically allowed region with E-U>0, and the right-hand
side is manifestly positive.
Use of complex numbers
In a classically forbidden region, a particle's total
energy, U+K, is less than its U, so
its K must be negative. If we want to keep believing
in the equation K=p2/2m, then apparently the
momentum of the particle is the square root of a negative
number. This is a symptom of the fact that the Schr\"odinger
equation fails to describe all of nature unless the
wavefunction and various other quantities are allowed to be
complex numbers. In particular it is not possible to
describe traveling waves correctly without using complex wavefunctions.
This may seem like nonsense, since real numbers are the only
ones that are, well, real! Quantum mechanics can always be
related to the real world, however, because its structure is
such that the results of measurements always come out to be
real numbers. For example, we may describe an electron as
having non-real momentum in classically forbidden regions,
but its average momentum will always come out to be real
(the imaginary parts average out to zero), and it can never
transfer a non-real quantity of momentum to another particle.
A complete investigation of these issues is beyond the scope
of this book, and this is why we have normally limited
ourselves to standing waves, which can be described with
real-valued wavefunctions.
-
The Atom
You can learn a lot by taking a car engine apart, but you will have learned a lot more if you can put it all back together again and make it run. Half the job of reductionism is to break nature down into its smallest parts and understand the rules those parts obey. The second half is to show how those parts go together, and that is our goal in this chapter. We have seen how certain features of all atoms can be explained on a generic basis in terms of the properties of bound states, but this kind of argument clearly cannot tell us any details of the behavior of an atom or explain why one atom acts differently from another. The biggest embarrassment for reductionists is that the job of putting things back together job is usually much harder than the taking them apart. Seventy years after the fundamentals of atomic physics were solved, it is only beginning to be possible to calculate accurately the properties of atoms that have many electrons. Systems consisting of many atoms are even harder. Supercomputer manufacturers point to the folding of large protein molecules as a process whose calculation is just barely feasible with their fastest machines. The goal of this chapter is to give a gentle and visually oriented guide to some of the simpler results about atoms.

a / Eight wavelengths fit around this circle (
).
Classifying states
We'll focus our attention first on the simplest atom,
hydrogen, with
one proton and one electron. We know in advance a little of
what we should expect for the structure of this atom. Since
the electron is bound to the proton by electrical forces, it
should display a set of discrete energy states, each
corresponding to a certain standing wave pattern. We need to
understand what states there are and what their properties are.
What properties should we use to classify the states? The
most sensible approach is to used conserved quantities.
Energy is one conserved
quantity, and we already know to expect each state to have a
specific energy. It turns out, however, that energy alone is
not sufficient. Different standing wave patterns of the atom
can have the same energy.
Momentum is also a
conserved quantity, but it is not particularly appropriate
for classifying the states of the electron in a hydrogen
atom. The reason is that the force between the electron and
the proton results in the continual exchange of momentum
between them. (Why wasn't this a problem for energy as well?
Kinetic energy and momentum are related by K=p2/2m,
so the much more massive proton never has very much kinetic
energy. We are making an approximation by assuming all the
kinetic energy is in the electron, but it is quite a
good approximation.)
Angular momentum
does help with classification. There is no transfer of
angular momentum between the proton and the electron, since
the force between them is a center-to-center force,
producing no torque.
Like energy, angular momentum is quantized in quantum
physics. As an example, consider a quantum wave-particle
confined to a circle, like a wave in a circular moat
surrounding a castle. A sine wave in such a “quantum moat”
cannot have any old wavelength, because an integer number of
wavelengths must fit around the circumference, C, of the
moat. The larger this integer is, the shorter the wavelength,
and a shorter wavelength relates to greater momentum and
angular momentum. Since this integer is related to angular
momentum, we use the symbol for it:
The angular momentum is
L = rp .
Here, r=C/2π, and
, so
In the example of the quantum moat, angular momentum is
quantized in units of h/2π. This makes h/2π a
pretty important number, so we define the abbreviation
. This symbol is read “h-bar.”
In fact, this is a completely general fact in quantum
physics, not just a fact about the quantum moat:
Quantization of angular momentum
The angular momentum of a particle due to its motion through
space is quantized in units of
.
self-check:
What is the angular momentum of the wavefunction shown at
the beginning of the section?
(answer in the back of the PDF version of the book)

b / The angular momentum vector of a spinning top.

c / Reconciling the uncertainty principle with the definition of angular momentum.
Angular momentum in three dimensions
Up until now we've only worked with angular momentum in the context of rotation in a plane, for which we could simply use positive and negative signs to indicate clockwise and counterclockwise directions of rotation. A hydrogen atom, however, is unavoidably three-dimensional. Before we worry about three-dimensional angular momentum in quantum mechanics, we need to understand how it works in classical mechanics; the following discussion is a summary of the most important facts from optional section 4.3.
Three-dimensional angular momentum in classical physics
If we are to completely specify the angular momentum of a classical object like a top, b, in three dimensions, it's not enough to say whether the rotation is clockwise or counterclockwise. We must also give the orientation of the plane of rotation or, equivalently, the direction of the top's axis. The convention is to specify the direction of the axis. There are two possible directions along the axis, and as a matter of convention we use the direction such that if we sight along it, the rotation appears clockwise. Angular momentum can, in fact, be defined as a vector pointing along this direction. This might seem like a strange definition, since nothing actually moves in that direction, but it wouldn't make sense to define the angular momentum vector as being in the direction of motion, because every part of the top has a different direction of motion. Ultimately it's not just a matter of picking a definition that is convenient and unambiguous: the definition we're using is the only one that makes the total angular momentum of a system a conserved quantity if we let “total” mean the vector sum. As with rotation in one dimension, we cannot define what we mean by angular momentum in a particular situation unless we pick a point as an axis. This is really a different use of the word “axis” than the one in the previous paragraphs. Here we simply mean a point from which we measure the distance r. In the hydrogen atom, the nearly immobile proton provides a natural choice of axis.
Three-dimensional angular momentum in quantum physics
Once we start to think more carefully about the role of
angular momentum in quantum physics, it may seem that there
is a basic problem: the angular momentum of the electron in
a hydrogen atom depends on both its distance from the proton
and its momentum, so in order to know its angular momentum
precisely it would seem we would need to know both its
position and its momentum simultaneously with good accuracy.
This, however, might seem to be forbidden by the Heisenberg
uncertainty principle.
Actually the uncertainty principle does place limits on what
can be known about a particle's angular momentum vector, but
it does not prevent us from knowing its magnitude as an
exact integer multiple of
. The reason is that in three
dimensions, there are really three separate uncertainty principles:
Now consider a particle, c/1, that is moving along the x
axis at position x and with momentum px. We may not be
able to know both x and px with unlimited accurately,
but we can still know the particle's angular momentum about
the origin exactly: it is zero, because the particle is
moving directly away from the origin.
Suppose, on the other hand, a particle finds itself, c/2, at
a position x along the x axis, and it is moving parallel
to the y axis with momentum py. It has angular momentum
xpy about the z axis, and again we can know its angular
momentum with unlimited accuracy, because the uncertainty
principle on relates x to px and y to py. It does
not relate x to py.
As shown by these examples, the uncertainty principle does
not restrict the accuracy of our knowledge of angular
momenta as severely as might be imagined. However, it does
prevent us from knowing all three components of an angular
momentum vector simultaneously. The most general statement
about this is the following theorem, which we present without proof:
The angular momentum vector in quantum physics
The most the can be known about an angular momentum vector
is its magnitude and one of its three vector components.
Both are quantized in units of
.

d / A cross-section of a hydrogen wavefunction.

e / The energy of a state in the hydrogen atom depends only on its n quantum number.
The hydrogen atom
Deriving the wavefunctions of the states of the hydrogen
atom from first principles would be mathematically too
complex for this book, but it's not hard to understand the
logic behind such a wavefunction in visual terms. Consider
the wavefunction from the beginning of the section, which is
reproduced in figure d.
Although the graph looks three-dimensional,
it is really only a representation of the part of the
wavefunction lying within a two-dimensional plane. The third
(up-down) dimension of the plot represents the value of the
wavefunction at a given point, not the third dimension of
space. The plane chosen for the graph is the one perpendicular
to the angular momentum vector.
Each ring of peaks and valleys has eight wavelengths going
around in a circle, so this state has
, i.e., we label
it
. The wavelength is shorter near the center, and this
makes sense because when the electron is close to the
nucleus it has a lower electrical energy, a higher kinetic energy,
and a higher momentum.
Between each ring of peaks in this wavefunction is a nodal
circle, i.e., a circle on which the wavefunction is zero. The
full three-dimensional wavefunction has nodal spheres: a
series of nested spherical surfaces on which it is zero. The
number of radii at which nodes occur, including r=∞,
is called n, and n turns out to be closely related to
energy. The ground state has n=1 (a single node only at
r=∞), and higher-energy states have higher n
values. There is a simple equation relating n to energy,
which we will discuss in subsection 13.4.4.
The numbers n and
, which identify the state, are called
its quantum numbers.
A state of a
given n and
can be oriented in a variety of directions in
space. We might try to indicate the orientation using the
three quantum numbers
,
, and
.
But we have already seen
that it is impossible to know all three of these simultaneously.
To give the most complete possible description of a state,
we choose an arbitrary axis, say the z axis, and label the
state according to n,
, and
.5
Angular momentum requires motion, and motion implies kinetic
energy. Thus it is not possible to have a given amount of
angular momentum without having a certain amount of kinetic
energy as well. Since energy relates to the n quantum
number, this means that for a given n value there will be
a maximum possible . It turns out that this maximum
value of equals n-1.
In general, we can list the possible combinations of
quantum numbers as follows:
| n can equal 1, 2, 3, … |
| ell can range from 0 ton − 1, in steps of 1 |
| ellz can range fromell toell, in steps of 1 |
Applying these rules, we have the following list of states:
| n = 1, | ell0, | ellz0 | one state |
| n = 2, | ell0, | ellz0 | one state |
| n = 2, | ell1, | ellz1, 0, or 1 | three states |
| … | |||
self-check:
Continue the list for n=3.
(answer in the back of the PDF version of the book)
Figure f on page 828
shows the lowest-energy states
of the hydrogen atom. The left-hand column of graphs
displays the wavefunctions in the x-y plane, and the
right-hand column shows the probability distribution in a
three-dimensional representation.

f / The three states of the hydrogen atom having the lowest energies.
”Discussion Questions”
◊
The quantum number n is defined as the number of radii
at which the wavefunction is zero, including r=∞.
Relate this to the features of the figures on the facing page.
◊
Based on the definition of n, why can't there be any
such thing as an n=0 state?
◊
Relate the features of the wavefunction plots in figure f
to the corresponding features of the probability distribution pictures.
◊
How can you tell from the wavefunction plots in figure f
which ones have which angular momenta?
◊
Criticize the following incorrect statement: “The
wavefunction in figure d has a shorter wavelength
in the center because in the center the electron is in a
higher energy level.”
◊
Discuss the implications of the fact that the probability
cloud in of the n=2,
state is split into two parts.

g / The energy levels of a particle in a box, contrasted with those of the hydrogen atom.
Energies of states in hydrogen
History
The experimental technique for measuring the energy levels
of an atom accurately is spectroscopy: the study of the
spectrum of light emitted (or absorbed) by the atom. Only
photons with certain energies can be emitted or absorbed by
a hydrogen atom, for example, since the amount of energy
gained or lost by the atom must equal the difference in
energy between the atom's initial and final states.
Spectroscopy had actually become a highly developed art
several decades before Einstein even proposed the photon,
and the Swiss spectroscopist Johann Balmer
determined in 1885 that there was a simple equation that
gave all the wavelengths emitted by hydrogen. In modern
terms, we think of the photon wavelengths merely as indirect
evidence about the underlying energy levels of the atom, and
we rework Balmer's result into an equation for these
atomic energy levels:
This energy includes both
the kinetic energy of the electron and the electrical
energy. The zero-level of the electrical energy
scale is chosen to be the energy of an electron and a proton
that are infinitely far apart. With this choice, negative
energies correspond to bound states and positive energies to unbound ones.
Where does the mysterious numerical factor of 2.2×10-18 J
come from? In 1913 the Danish theorist Niels Bohr
realized that it was exactly numerically equal to a certain
combination of fundamental physical constants:
where m is the mass of the electron, and k is the Coulomb
force constant for electric forces.
Bohr was able to cook up a derivation of this equation based
on the incomplete version of quantum physics that had been
developed by that time, but his derivation is today mainly
of historical interest. It assumes that the electron follows
a circular path, whereas the whole concept of a path for a
particle is considered meaningless in our more complete
modern version of quantum physics. Although Bohr was able to
produce the right equation for the energy levels, his model
also gave various wrong results, such as predicting that the
atom would be flat, and that the ground state would have
rather than the correct
.
Approximate treatment
Rather than leaping straight into a full mathematical treatment, we'll start by looking for some physical insight,
which will lead to an approximate argument that correctly reproduces the form of the Bohr equation.
A typical standing-wave pattern for the electron
consists of a central oscillating area surrounded by a
region in which the wavefunction tails off. As discussed in
subsection 13.3.6, the oscillating type of pattern is typically
encountered in the classically allowed region, while the
tailing off occurs in the classically forbidden region where
the electron has insufficient kinetic energy to penetrate
according to classical physics. We use the symbol r for
the radius of the spherical boundary between the classically
allowed and classically forbidden regions.
Classically, r would be the distance from the proton at which
the electron would have to stop,
turn around, and head back in.
If r had the same value for every standing-wave pattern, then we'd
essentially be solving the particle-in-a-box problem in three dimensions,
with the box being a spherical cavity. Consider the energy levels of the particle in
a box compared to those of the hydrogen atom, g.
They're qualitatively different. The energy levels of the particle in a box
get farther and farther apart as we go higher in energy, and this feature
doesn't even depend on the details of whether the box is two-dimensional or three-dimensional,
or its exact shape. The reason for the spreading is that the box is taken to be
completely impenetrable, so its size, r, is fixed. A wave pattern with n humps
has a wavelength proportional to r/n, and therefore a momentum proportional to n,
and an energy proportional to n2. In the hydrogen atom, however, the force keeping
the electron bound isn't an infinite force encountered when it bounces off of a wall, it's
the attractive electrical force from the nucleus. If we put more energy into the electron,
it's like throwing a ball upward with a higher energy --- it will get farther out before
coming back down. This means that in the hydrogen atom, we expect r to increase as we
go to states of higher energy. This tends to keep the wavelengths of the high energy
states from getting too short, reducing their kinetic energy. The closer and closer
crowding of the energy levels in hydrogen also makes sense because we know that there
is a certain energy that would be enough to make the electron escape completely, and
therefore the sequence of bound states cannot extend above that energy.
When the electron is at the maximum classically allowed distance r from the proton, it
has zero kinetic energy.
Thus when the
electron is at distance r, its energy is purely electrical:
Now comes the approximation. In reality, the electron's
wavelength cannot be constant in the classically allowed
region, but we pretend that it is. Since n is the number
of nodes in the wavefunction, we can interpret it approximately
as the number of wavelengths that fit across the diameter
2r. We are not even attempting a derivation that would
produce all the correct numerical factors like 2 and π
and so on, so we simply make the approximation
Finally we assume that the typical kinetic energy of the
electron is on the same order of magnitude as the absolute
value of its total energy. (This is true to within a factor
of two for a typical classical system like a planet in a
circular orbit around the sun.) We then have
\begin{subequations}
\renewcommand{\theequation}{\theparentequation}
Failed to parse (unknown function\notag): \begin{align} \text{absolute}&\text{ value of total energy} \\ &= \frac{ke^2}{r} \notag \\ &\sim K \notag \\ &= p^2/2m \notag \\ &= (h/\lambda)^2/2m \notag \\ &~ h^2n^2/2mr^2 \notag \end{align}
\end{subequations}
We now solve the equation ke2/r ∼ h2n2 / 2mr2 for r
and throw away numerical factors we can't hope to have
gotten right, yielding
Plugging n=1 into this equation gives r=2 nm, which is
indeed on the right order of magnitude. Finally we combine
equations [4] and [1] to find
which is correct except for the numerical factors we
never aimed to find.
Exact treatment of the ground state
The general proof of the Bohr equation for all values of n is beyond the mathematical
scope of this book, but it's fairly straightforward to verify it for a particular n,
especially given a lucky guess as to what functional form to try for the wavefunction.
The form that works for the ground state is
Ψ = he-r/a ,
where
is the electron's distance from the proton, and h provides for normalization.
In the following, the result ∂ r/∂ x=x/r comes in handy.
Computing the partial derivatives that occur in the Laplacian, we obtain for the
x term
Failed to parse (unknown function\intertext): \begin{align} \frac{\partial\Psi}{\partial x} &= \frac{\partial \Psi}{\partial r} \frac{\partial r}{\partial x} \\ &= -\frac{x}{ar} \Psi \\ \frac{\partial^2\Psi}{\partial x^2} &= -\frac{1}{ar} \Psi -\frac{x}{a}\left(\frac{\partial}{d x}\frac{1}{r}\right)\Psi+ \left( \frac{x}{ar}\right)^2 \Psi\\ &= -\frac{1}{ar} \Psi +\frac{x^2}{ar^3}\Psi+ \left( \frac{x}{ar}\right)^2 \Psi \qquad , \intertext{so} \nabla^2\Psi &= \left( -\frac{2}{ar} + \frac{1}{a^2} \right) \Psi \qquad . \end{align}
The Schr\"odinger equation gives
If we require this equation to hold for all r, then we must have equality for both the terms of the form
(constant)×Ψ and for those of the form (constant/r)×Ψ. That means
Failed to parse (unknown function\intertext): \begin{align} E &= -\frac{\hbar^2}{2ma^2} \\ \intertext{and} 0 &= \frac{\hbar^2}{mar} -\frac{ke^2}{r} \qquad . \end{align}
These two equations can be solved for the unknowns a and E, giving Failed to parse (unknown function\intertext): \begin{align} a &= \frac{\hbar^2}{mke^2} \\ \intertext{and} E &= -\frac{mk^2e^4}{2\hbar^2} \qquad , \end{align}
where the result for the energy agrees with the Bohr equation for n=1. The calculation of the normalization constant h is relegated to homework problem 36. self-check: We've verified that the function Ψ = he-r/a is a solution to the Schr\"odinger equation, and yet it has a kink in it. What's going on here? Didn't I argue before that kinks are unphysical? (answer in the back of the PDF version of the book)
Example 20: Wave phases in the hydrogen molecule
In example 15 on page 809, I argued that the existence of the H2 molecule
could essentially be explained by a particle-in-a-box argument: the molecule is a bigger box than an
individual atom, so each electron's wavelength can be longer, its kinetic energy lower.
Now that we're in possession of a mathematical expression for the wavefunction of the hydrogen
atom in its ground state, we can make this argument a little more rigorous and detailed.
Suppose that two hydrogen atoms are in a relatively cool sample of monoatomic hydrogen gas.
Because the gas is cool, we can assume that the atoms are in their ground states. Now suppose
that the two atoms approach one another. Making use again of the assumption that the gas is cool, it
is reasonable to imagine that the atoms approach one another slowly. Now the atoms come a little
closer, but still far enough apart that the region between them is classically forbidden. Each
electron can tunnel through this classically forbidden region, but the tunneling probability is
small. Each one is now found with, say, 99% probability in its original home, but with 1%
probability in the other nucleus. Each electron is now in a state consisting of a superposition
of the ground state of its own atom with the ground state of the other atom. There are two peaks
in the superposed wavefunction, but one is a much bigger peak than the other.
An interesting question now arises. What are the relative phases of the two electrons?
As discussed on page 803, the absolute phase of an electron's
wavefunction is not really a meaningful concept. Suppose atom A contains electron Alice, and
B electron Bob. Just before the collision, Alice may have wondered, “Is my phase
positive right now, or is it negative? But of course I shouldn't ask myself such silly questions,”
she adds sheepishly.

h / Example 20.
But relative phases are well defined. As the two atoms draw closer and closer together, the tunneling probability rises, and eventually gets so high that each electron is spending essentially 50% of its time in each atom. It's now reasonable to imagine that either one of two possibilities could obtain. Alice's wavefunction could either look like h/1, with the two peaks in phase with one another, or it could look like h/2, with opposite phases. Because relative phases of wavefunctions are well defined, states 1 and 2 are physically distinguishable. In particular, the kinetic energy of state 2 is much higher; roughly speaking, it is like the two-hump wave pattern of the particle in a box, as opposed to 1, which looks roughly like the one-hump pattern with a much longer wavelength. Not only that, but an electron in state 1 has a large probability of being found in the central region, where it has a large negative electrical energy due to its interaction with both protons. State 2, on the other hand, has a low probability of existing in that region. Thus state 1 represents the true ground-state wavefunction of the H2 molecule, and putting both Alice and Bob in that state results in a lower energy than their total energy when separated, so the molecule is bound, and will not fly apart spontaneously. State h/3, on the other hand, is not physically distinguishable from h/2, nor is h/4 from h/1. Alice may say to Bob, “Isn't it wonderful that we're in state 1 or 4? I love being stable like this.” But she knows it's not meaningful to ask herself at a given moment which state she's in, 1 or 4. ”Discussion Questions” ◊ States of hydrogen with n greater than about 10 are never observed in the sun. Why might this be? ◊ Sketch graphs of r and E versus n for the hydrogen, and compare with analogous graphs for the one-dimensional particle in a box.

i / The top has angular momentum both because of the motion of its center of mass through space and due to its internal rotation. Electron spin is roughly analogous to the intrinsic spin of the top.
Electron spin
It's disconcerting to the novice ping-pong player to
encounter for the first time a more skilled player who can
put spin on the ball. Even though you can't see that the
ball is spinning, you can tell something is going on by the
way it interacts with other objects in its environment. In
the same way, we can tell from the way electrons interact
with other things that they have an intrinsic spin of their
own. Experiments show that even when an electron is not moving through
space, it still has angular momentum amounting to
.
This may seem paradoxical because the quantum moat, for
instance, gave only angular momenta that were integer
multiples of
, not half-units, and I claimed that angular
momentum was always quantized in units of
, not just in the
case of the quantum moat. That whole discussion, however,
assumed that the angular momentum would come from the motion
of a particle through space. The
angular momentum of the
electron is simply a property of the particle, like its
charge or its mass. It has nothing to do with whether the
electron is moving or not, and it does not come from any
internal motion within the electron. Nobody has ever
succeeded in finding any internal structure inside the
electron, and even if there was internal structure, it would
be mathematically impossible for it to result in a half-unit
of angular momentum.
We simply have to accept this
angular momentum, called the
“spin” of the electron --- Mother Nature rubs our noses in it as
an observed fact.
Protons and neutrons have
the same
spin, while photons have an intrinsic spin of
.
In general, half-integer spins are typical of material particles.
Integral values are found for the particles that carry forces: photons,
which embody the electric and magnetic fields of force, as well as the
more exotic messengers of the nuclear and gravitational forces.
As was the case with ordinary angular momentum, we can
describe spin angular momentum in terms of its magnitude,
and its component along a given axis.
We write s and sz for these quantities, expressed in
units of
, so an
electron has s=1/2 and sz=+1/2 or -1/2.
Taking electron spin into account, we need a total of four
quantum numbers to label a state of an electron in the
hydrogen atom: n,
,
, and sz. (We omit s because it
always has the same value.) The symbols and include only
the angular momentum the electron has because it is moving
through space, not its spin angular momentum. The availability
of two possible spin states of the electron leads to a
doubling of the numbers of states:
| n = 1, | ell0, | ellz0, | sz = + 1 / 2 or − 1 / 2 | two states |
| n = 2, | ell0, | ellz0, | sz = + 1 / 2 or − 1 / 2 | two states |
| n = 2, | ell1, | ellz1, 0, or 1, | sz = + 1 / 2 or − 1 / 2 | six states |
| … | ||||
A note about notation
There are unfortunately two inconsistent systems of notation for the quantum numbers we've been discussing. The notation I've been using is the one that is used in nuclear physics, but there is a different one that is used in atomic physics.
| nuclear physics | atomic physics |
| n | same |
| ell | same |
| ellx | no notation |
| elly | no notation |
| ellz | m |
| s = 1 / 2 | no notation (sometimesσ) |
| sx | no notation |
| sy | no notation |
| sz | s |
{}The nuclear physics notation is more logical (not giving
special status to the z axis) and more memorable (
rather than the obscure m), which is why I use it consistently
in this book, even though nearly all the applications we'll
consider are atomic ones.
We are further encumbered with
the following historically derived
letter labels, which deserve to be
eliminated in favor of the simpler numerical ones:
| ell0 | ell1 | ell2 | ell3 |
| s | p | d | f |
| n = 1 | n = 2 | n = 3 | n = 4 | n = 5 | n = 6 | n = 7 |
| K | L | M | N | O | P | Q |
{}The spdf labels are used in both nuclear6
and atomic physics, while
the KLMNOPQ letters are used only to refer to states of electrons.
And finally, there is a piece of notation that is good and useful,
but which I simply haven't mentioned yet. The vector
Failed to parse (unknown function\vc): \vc{j}=\vc{\ell}+\vc{s}
stands for the total angular momentum of a particle in units of
, including
both orbital and spin parts. This quantum number turns out to be
very useful in nuclear physics, because nuclear forces tend to
exchange orbital and spin angular momentum, so a given energy level
often contains a mixture of
and s values, while remaining
fairly pure in terms of j.

j / The beginning of the periodic table.

k / Hydrogen is highly reactive.
Atoms with more than one electron
What about other atoms besides hydrogen? It would seem that things would get much more complex with the addition of a second electron. A hydrogen atom only has one particle that moves around much, since the nucleus is so heavy and nearly immobile. Helium, with two, would be a mess. Instead of a wavefunction whose square tells us the probability of finding a single electron at any given location in space, a helium atom would need to have a wavefunction whose square would tell us the probability of finding two electrons at any given combination of points. Ouch! In addition, we would have the extra complication of the electrical interaction between the two electrons, rather than being able to imagine everything in terms of an electron moving in a static field of force created by the nucleus alone. Despite all this, it turns out that we can get a surprisingly good description of many-electron atoms simply by assuming the electrons can occupy the same standing-wave patterns that exist in a hydrogen atom. The ground state of helium, for example, would have both electrons in states that are very similar to the n=1 states of hydrogen. The second-lowest-energy state of helium would have one electron in an n=1 state, and the other in an n=2 states. The relatively complex spectra of elements heavier than hydrogen can be understood as arising from the great number of possible combinations of states for the electrons. A surprising thing happens, however, with lithium, the three-electron atom. We would expect the ground state of this atom to be one in which all three electrons settle down into n=1 states. What really happens is that two electrons go into n=1 states, but the third stays up in an n=2 state. This is a consequence of a new principle of physics:
The Pauli Exclusion Principle
Only one electron can ever occupy a given state. There are two n=1 states, one with sz=+1/2 and one with sz=-1/2, but there is no third n=1 state for lithium's third electron to occupy, so it is forced to go into an n=2 state. It can be proved mathematically that the Pauli exclusion principle applies to any type of particle that has half-integer spin. Thus two neutrons can never occupy the same state, and likewise for two protons. Photons, however, are immune to the exclusion principle because their spin is an integer.
Deriving the periodic table
We can now account for the structure of the periodic table, which seemed so mysterious even to its inventor Mendeleev. The first row consists of atoms with electrons only in the n=1 states:
| H | 1 electron in ann = 1 state |
| He | 2 electrons in the twon = 1 states |
The next row is built by filling the n=2 energy levels:
| Li | 2 electrons
inn = 1 states, 1 electron in ann = 2 state |
| Be | 2 electrons
inn = 1 states, 2 electrons inn = 2 states |
| … | |
| O | 2 electrons
inn = 1 states, 6 electrons inn = 2 states |
| F | 2 electrons
inn = 1 states, 7 electrons inn = 2 states |
| Ne | 2 electrons
inn = 1 states, 8 electrons inn = 2 states |
In the third row we start in on the n=3 levels:
| Na | 2 electrons
inn = 1 states, 8 electrons inn = 2 states, 1 electron in ann = 3 state |
| ... |
We can now see a logical link between the filling of the
energy levels and the structure of the periodic table.
Column 0, for example, consists of atoms with the right
number of electrons to fill all the available states up to a
certain value of n. Column I contains atoms like lithium
that have just one electron more than that.
This shows that the columns relate to the filling of energy levels, but why does that have anything to do with chemistry? Why, for example, are the elements in columns I and VII dangerously reactive? Consider, for example, the element sodium (Na), which is so reactive that it may burst into flames when exposed to air. The electron in the n=3 state has an unusually high energy. If we let a sodium atom come in contact with an oxygen atom, energy can be released by transferring the n=3 electron from the sodium to one of the vacant lower-energy n=2 states in the oxygen. This energy is transformed into heat. Any atom in column I is highly reactive for the same reason: it can release energy by giving away the electron that has an unusually high energy. Column VII is spectacularly reactive for the opposite reason: these atoms have a single vacancy in a low-energy state, so energy is released when these atoms steal an electron from another atom. It might seem as though these arguments would only explain reactions of atoms that are in different rows of the periodic table, because only in these reactions can a transferred electron move from a higher-n state to a lower-n state. This is incorrect. An n=2 electron in fluorine (F), for example, would have a different energy than an n=2 electron in lithium (Li), due to the different number of protons and electrons with which it is interacting. Roughly speaking, the n=2 electron in fluorine is more tightly bound (lower in energy) because of the larger number of protons attracting it. The effect of the increased number of attracting protons is only partly counteracted by the increase in the number of repelling electrons, because the forces exerted on an electron by the other electrons are in many different directions and cancel out partially.
Exercises
Exercise A: Quantum Versus Classical Randomness
1. Imagine the classical version of the particle in a one-dimensional box. Suppose you insert the particle in the box and give it a known, predetermined energy, but a random initial position and a random direction of motion. You then pick a random later moment in time to see where it is. Sketch the resulting probability distribution by shading on top of a line segment. Does the probability distribution depend on energy?
2. Do similar sketches for the first few energy levels of the quantum mechanical particle in a box, and compare with 1.
3. Do the same thing as in 1, but for a classical hydrogen atom in two dimensions, which acts just like a miniature solar system. Assume you're always starting out with the same fixed values of energy and angular momentum, but a position and direction of motion that are otherwise random. Do this for L=0, and compare with a real L=0 probability distribution for the hydrogen atom.
4. Repeat 3 for a nonzero value of L, say L=
.
5. Summarize: Are the classical probability distributions accurate? What qualitative features are possessed by the classical diagrams but not by the quantum mechanical ones, or vice-versa?
Footnotes
atoms were created at the same time. In reality, we have only a general idea of the processes that might have created the heavy elements in the nebula from which our solar system condensed. Some portion of them may have come from nuclear reactions in supernova explosions in that particular nebula, but some may have come from previous supernova explosions throughout our galaxy, or from exotic events like collisions of white dwarf
stars.What I'm presenting in this chapter is a simplified explanation of how the photon could have been discovered. The actual history is more complex. Max Planck (1858-1947) began the photon saga with a theoretical investigation of the spectrum of light emitted by a hot, glowing object. He introduced quantization of the energy of light waves, in multiples of hf, purely as a mathematical trick that happened to produce the right results. Planck did not believe that his procedure could have any physical significance. In his 1905 paper Einstein took Planck's quantization as a description of reality, and applied it to various theoretical and experimental puzzles, including the photoelectric effect. Millikan then subjected Einstein's ideas to a series of rigorous experimental tests. Although his results matched Einstein's predictions perfectly, Millikan was skeptical about photons, and his papers conspicuously omit any reference to them. Only in his autobiography did Millikan rewrite history and claim that he had given experimental proof for photons.
never published them for fear of hurting Heisenberg and his family. Bohr's papers were to be sealed for 50 years after his death, but his family released them early, in February 2002. The texts,
including English translations, are available at http://www.nbi.dk/NBA/papers/docs/cover.html.835 for a note about the two different
systems of notations that are used for quantum numbers.the series continues in alphabetical order. In nuclei that are spinning rapidly enough that they are almost breaking apart, individual protons and neutrons can be stirred up to
PROTECT_TEX_MATH_FOR_MEDIAWIKI3c665edbZZZ values as high as 7, which is j.







