Noise: A Flaw in Human Judgment
Rate it:
Open Preview
Read between July 27 - October 21, 2021
11%
Flag icon
In any system in which judges are assumed to be interchangeable and assigned quasi-randomly, large disagreements about the same case violate expectations of fairness and consistency.
12%
Flag icon
System noise is inconsistency, and inconsistency damages the credibility of the system.
12%
Flag icon
All we need to measure noise is multiple judgments of the same problem.
12%
Flag icon
An important question, therefore, is how, and how much, bias and noise contribute to error.
12%
Flag icon
in professional judgments of all kinds, whenever accuracy is the goal, bias and noise play the same role in the calculation of overall error.
12%
Flag icon
Bias is simply the average of errors,
13%
Flag icon
measure of overall error—called mean squared error (MSE)—is the average of the squares of the individual errors of measurement.
13%
Flag icon
This is a key feature of MSE: squaring gives large errors a far greater weight than it gives small ones.
13%
Flag icon
They are independent of each other and equally weighted in the determination of overall error.
14%
Flag icon
In terms of overall error, noise and bias are independent: the benefit of reducing noise is the same, regardless of the amount of bias.
14%
Flag icon
all of which aim to approach a true value with maximum accuracy (the least bias) and precision (the least noise).
14%
Flag icon
The error equation does not apply to evaluative judgments, however, because the concept of error, which depends on the existence of a true value, is far more difficult to apply.
15%
Flag icon
Predictive judgments will be improved by procedures that reduce noise, as long as they do not increase bias to a larger extent.
15%
Flag icon
variability in judgments of the same case is still undesirable—it is system noise.
15%
Flag icon
For the average case, the mean sentence was 7.0 years, and the standard deviation around that mean was 3.4 years.
16%
Flag icon
New information, unless it is decisive, provides more opportunities for judges to differ from one another.
16%
Flag icon
variation among judges
16%
Flag icon
these deviations as level errors.
16%
Flag icon
The general conclusion is that the average level of sentencing functions like a personality trait.
16%
Flag icon
We use the term level noise for the variability of the judges’ average judgments, which is identical to the variability of level errors.
16%
Flag icon
differences in average severity across individual judges.
16%
Flag icon
they are harsher than their personal average in some and more lenient in others. We call these residual deviations pattern errors.
16%
Flag icon
proper statistical term for pattern noise is judge × case interaction—pronounced “judge-by-case.”
16%
Flag icon
random error.
17%
Flag icon
Our name for the variability that is due to transient effects is occasion noise.
17%
Flag icon
The variability in a shooter’s performance is a form of noise.
17%
Flag icon
It is less easy to observe the variability of our minds.
17%
Flag icon
we do not always produce identical judgments when faced with the same facts on two occasions.
17%
Flag icon
direct measurements of occasion noise are hard to obtain whenever cases are easily memorable.
17%
Flag icon
test-retest reliability, or reliability
19%
Flag icon
mood has a measurable influence on what you think: what you notice in your environment, what you retrieve from your memory, how you make sense of these signals.
19%
Flag icon
mood has another, more surprising effect: it also changes how you think.
19%
Flag icon
People who are in a good mood are more likely to let their biases affect their thinking.
19%
Flag icon
Inducing good moods makes people more receptive to bullshit and more gullible in general; they are less apt to detect deception or identify misleading information.
19%
Flag icon
an important truth: you are not the same person at all times.
19%
Flag icon
stress and fatigue.
19%
Flag icon
cognitive bias known as the gambler’s fallacy: we tend to underestimate the likelihood that streaks will occur by chance.
20%
Flag icon
But it pales in comparison with the variability between judges:
20%
Flag icon
Even in this tightly controlled setting, exactly what factors drive occasion noise was a mystery.
21%
Flag icon
They were testing for a particular driver of noise: social influence.
21%
Flag icon
The key finding was that group rankings were wildly disparate: across different groups, there was a great deal of noise.
21%
Flag icon
In politics, as in music, a great deal depends on social influences and, in particular, on whether people see that other people are attracted or repelled.
21%
Flag icon
political positions can be just like songs, in the sense that their ultimate fate can depend on initial popularity.
21%
Flag icon
independence is a prerequisite for the wisdom of crowds. If people are not making their own judgments and are relying instead on what other people think, crowds might not be so wise after all.
21%
Flag icon
social influences are a problem because they reduce “group diversity without diminishing the collective error.”
22%
Flag icon
two problems. First, people tend to neglect the possibility that most of the people in the crowd are in a cascade,
22%
Flag icon
Second, informational cascades can lead groups of people in truly terrible directions.
22%
Flag icon
Internal discussions often create greater confidence, greater unity, and greater extremism, frequently in the form of increased enthusiasm.
23%
Flag icon
Deliberation had the effect of increasing noise.
23%
Flag icon
noise is a major factor in the inferiority of human judgment.