Quantifying 'Evil Vibes': An Analysis of the Race to the Bottom Experiment

The quest to quantify human values is often fraught with subjectivity, but a recent project titled "Race to the Bottom" attempts to do exactly that. By presenting users with a series of pairwise comparisons, the project asks participants to vote for which of two options—ranging from industrial sectors to societal habits—is "worse" or more harmful to society overall.

While the premise is a simple "Hot or Not" style interface for societal harm, the resulting data and the community reaction on Hacker News reveal a deeper conversation about cognitive bias, the nuance of industrial impact, and the difficulty of measuring public sentiment.

The Mechanics of Pairwise Comparison

At its core, "Race to the Bottom" utilizes a ranking system based on relative preference (or in this case, relative dislike). Instead of asking users to rate an industry on a scale of 1 to 10, it forces a choice between two specific options. This method is designed to eliminate the "everything is a 5" problem common in traditional surveys, forcing the user to make a definitive judgment call.

However, as users noted, this approach can lead to interesting mathematical and psychological phenomena. One commenter, @jjmarr, suggested that the experiment could track "intransitive preferences," where a user might find A worse than B, B worse than C, but then find C worse than A. Such a loop would highlight the inherent subjectivity of "evil vibes" and the lack of a consistent moral hierarchy in the human mind.

The Framing Problem: Bias and Language

One of the most significant critiques of the experiment is the role of framing. In data collection, the way a question is phrased can radically alter the outcome. Several participants pointed out that the descriptions provided for the industries were not neutral.

For instance, @hjkl0 noted a stark contrast in how different options were described:

Cannabis was described as "Cannabis cultivation, dispensaries, and marijuana-related businesses."
Sugary Food & Beverage was described as "Products associated with obesity, diabetes, and health concerns."

By linking sugary drinks directly to obesity and diabetes in the description, the system primes the user to vote for it as the "worse" option, whereas the cannabis description remains largely descriptive. This "injection of bias" transforms the experiment from a neutral snapshot of opinion into what some called "bias confirmation theater."

The Nuance of "Harm"

Another recurring theme in the discussion was the inability of a blanket term like "worst" to capture the complexity of modern industry. Many of the options presented—such as Pharmaceuticals or Chemical Manufacturing—contain both life-saving innovations and destructive practices.

As @danpalmer observed:

"Animal testing of cosmetics: bad, animal testing of the safety of a new drug that millions of humans will take: probably good. Chemical manufacturing that produces plastic packaging for things that could use paper packaging: not great, chemical manufacturing for chemicals used in healthcare, probably good."

This highlights a fundamental tension in the project: the desire for a clean, ranked list versus the messy reality of industrial utility. When users are forced to choose between "Dating Apps" and "Weapons Manufacturing," the comparison can feel absurd or like "performance art" because the scales of harm are so vastly different.

Societal Reflections in the Leaderboard

Despite the critiques, the resulting leaderboard serves as a fascinating, if flawed, snapshot of the current cultural zeitgeist. The rankings reveal what the public perceives as the most immediate or visceral threats.

Some users expressed shock at the results, such as the fact that social media was ranked as more harmful than oil and gas or weapons manufacturing in some snapshots. Others were surprised by the relatively low ranking of factory farming, which @lobofta argued is "arguably the worst thing happening on this planet right now" from the perspective of sentient beings.

Conclusion: Lessons for Opinion Mining

"Race to the Bottom" demonstrates the power of simple interfaces to engage users in complex moral questioning. However, it also serves as a cautionary tale for anyone attempting to quantify qualitative values. To move from a "vibes-based" ranking to a meaningful data set, the project would likely need to incorporate:

Neutral Framing: Removing loaded language from descriptions to avoid priming.
Nuanced Input: Allowing for "neutral" or "both are equally bad" options to avoid forced choice bias.
Contextual Data: Integrating regional or demographic data to see how "harm" is defined differently across cultures.

Ultimately, the project succeeds not as a scientific study, but as a mirror reflecting how we perceive the world—and how easily those perceptions can be manipulated by the way a question is asked.

Quantifying 'Evil Vibes': An Analysis of the Race to the Bottom Experiment

Quantifying 'Evil Vibes': An Analysis of the Race to the Bottom Experiment

The Mechanics of Pairwise Comparison

The Framing Problem: Bias and Language

The Nuance of "Harm"

Societal Reflections in the Leaderboard

Conclusion: Lessons for Opinion Mining

References

HN Stories