Psychology 458/558
Judgment and Decision Making
Prof. Bertram Malle
Fall 1995
You would normally expect a probability of 0.167 for each side. Would you
rather bet on odd or even numbers, or would you not care?
G: She schedules a garden party for Saturday
To test her superstition we have to find the conditional probability that it
will rain on Saturdays given that she schedules a party for Saturday,
p(rain|Sat) and see whether it is greater than the probability that it will
rain Saturday, p(rain).
The event we are considering is quite specific: She schedules the party for
Saturday and it will rain that day, p(Sat &rain). By itself, this
probability does not tell us much. Conditionalizing on the class of events in
which she does schedule the party on Saturday allows us to see in how
many cases of scheduling a party on Saturday it actually rains. We can then
compare this proportion to the proportion of rainy days overall and see whether
it rains more often if she she schedules the party than if she doesn't.
That is, we take p(rain & Sat) and divide it by p(Sat) to get the
conditional probability p(rain|Sat). If there is no relation between her party
scheduling and rain, then the following must hold: At any probability for her
scheduling the party on Saturday and any probability that it will rain
Saturday, the conditional probability p(rain|Sat) should be exactly the same
as p(rain). Why? Remember chi-square tests. Under the condition of
independence between rain and her party scheduling, p(rain & Sat) is the
product of p(rain) and p(Sat). This product is If we now divide this product
p(rain & Sat) by p(Sat) to get p(rain|Sat), we get p(rain) again. In
numbers, this is even more obvious. Below you find an example in which there
is no correlation between scheduling and rain, so the chi-square is 0, and
the conditional probability, p(rain|Sat) must be identical to the base rate,
p(rain). Test: .27/.90 = .30, which is the base rate.
The general relation between conditional probability and the joint probability
of the two events is as follows:
Plous points out that people have great difficulites distinguishing between the
two conditional probabilities ("conditionals"): p(A|B) and p(B|A). In the
equation above you see that the two have one element in common: p(A & B).
But they differ in the other element: p(B) for p(A|B) and p(A) for p(B|A). The
element in which they differ is the base rate of the event on which you
conditionalize. The base rates are therefore crucial for distinguishing
between the two conditionalss.
Through simple transformation of the formula in the top part of the
previous slide, you can get the ratio in the bottom part of the slide
that tells you when the two conditionals must be
different--namely, when the two base rates are different.
In medical diagnosis, personality, psychopathology, and technological risk
prevention, people rely on signs (symptoms, indicators) to make inferences
about an underlying state, such as a disease, a trait, or a technical failure.
We often use tests to diagnose or predict the probability that such an
underlying state exists. How should we do that, following the rules of
probability and statistics?
Let's look at a first example from the domain of personality.
Given that Harold talks to strangers, is he an extravert?
Assumptions:
We can now calculate two conditionals. One is p(talks|ext), which is p(talks
& ext)/p(ext), and that is .60/.60 = 1. So if you know that someone is
extraverted, you know that he or she talks to stragners. But what about the
reverse? You know someone who talks to strangers, what's the probability that
he or she is an extravert?
This is the inverse conditional to the one before, and it is far from 1.0:
In medical diagnosis, we find very similar problems with conditionals. We
often know the reliability of a test to indicate a "positive" result if the
illness exists. Validity studies would point, for example, to a hit rate of
90%. This is the probability that the test shows positive given
that the person has the illness, p(t|i). But all tests sometimes
indicate a positive result even if the person doesn't have the illness. This is
called the test's false alarm rate, and it might be, say, 11%. It is the
probability that the test shows positive given that the person
does not have the illness, p(t|~i).
Now let's ask the reverse question: If one of your patients shows a positive
test result, what is the probability that the patient has the illness? The
answer to this question depends crucially on the prevalence of the illness. So
let's look at two different illnesses, one with a prevalence of 0.10, one with
0.70.
Note first that the test's hit rate is 9/10 = 90%, and the test's false alarm
rate is 10/90 = 11%. Now, given that the patient had a positive test result,
what is the probability that he has the illness? The answer is p(i|t) = p(i
& t)/p(t) = 9/19 = 0.53. Why so low? Because the illness itself is rare,
so the test result must improve our initially low confidence of 0.10. Or think
about it another way. The test rarely has a false alarm relatively speaking:
only 11% of all healthy people show a positive result. But because most people
are healthy, you will still sample a lot of people who show a positive result
even though they are healthy. The large number of false alarms "contaminates"
your sample of people with positive test results.
Now let's look at a common illness.
We have a test with the same hit rate as before, namely, 63/70 = 90%, and a
similar false alarm rate, namely, 3/30 = 10%. Now, given that the patient had a
positive test result, what is the probability that he has this common illness?
In this case, given that the patient showed a positive test result (she is one
out of the 66), the probability that she has the illness is 63/66 = 0.95. This
time the estimate is very high because the prevalence of the illness is already
high, namely 0.70. Or put another way: The same false alarm as before now
samples from a much smaller pool of healthy people, so the absolute number of
false alarms is low, thus not contaminating the group of people with positive
test results.
1. Probabilities and frequencies:
2. Conditional probabilities
You have a superstitious friend who claims, for example, that the weather
always turns bad after she makes plans. To test her belief, you consider the
following two events:
R: It will rain on Saturday
3. Judgments of diagnosticity
These more technical explanations are important for understanding the
subtleties of diagnostic judgments, as discussed next.
p(talks) = .85; that is, 85% of all people talk to strangers
p(ext) = .60; that is, 60% of all people are extraverts
p(talks & ext) = .60; that is, 60% of all people are
extraverts who talk to strangers
p(ext | talks) = p(ext & talks)/p(talks) = .60/.85 = .71 If you don't know
whether someon talks to strangers or not, your diagnostic judgement is .60; if
you know the person talks to strangers, your diagnostic judgment rises to .71.
But never do you reach 1.0. Why? Because some of the people who talk to
strangers are not extraverts.
Implications
Applications
Bayes theorem: A more formal approach
If we know a test's reliability (or hit rate), i.e., p(t|e), how can we infer
the inverse probability, namely, the test's diagnosticity, i.e., p(e|t)?
Bayes' theorem, about 200 years old, allows us to do this algebraically if we
know the base rate of the event in question, p(i), and the hit rate and false
alarm rate of the test, p(t|i) and p(t|~i), respectively. We can derive the
theorem ourselves through simple transformations of conditional and joint
probabilities: