Why Personality Tests Lie
(And What Actually Works)
The $2 Billion Industry Built On Pseudoscience
Every year, 2 million people take the Myers-Briggs Type Indicator (MBTI). Fortune 500 companies pay consultants thousands of dollars to administer it. Dating apps use it for matching. People put their four-letter type in social media bios.
There’s just one problem: The MBTI doesn’t work.
Not “works imperfectly” or “has limitations.” It fundamentally fails to do what personality tests should do: measure stable traits that predict real-world outcomes.
And most people have no idea.
What Makes A Personality Test Valid?
Before we can evaluate personality tests, we need criteria. What should a good test do?
1. Test-Retest Reliability
If measuring stable personality traits, retaking the test should yield similar results. Your personality shouldn’t change drastically week to week.
Standard: 80%+ of people get same result when retested after several weeks.
2. Internal Consistency
Questions measuring the same trait should correlate with each other. If Question 3 and Question 47 both supposedly measure “extraversion,” people who answer one way on Q3 should answer consistently on Q47.
Standard: Cronbach’s alpha of 0.70 or higher (statistical measure of internal consistency).
3. Predictive Validity
Test results should predict real outcomes. A test measuring “conscientiousness” should predict job performance, academic achievement, or other behaviors requiring discipline and organization.
Standard: Significant correlation (r > 0.30) with relevant life outcomes.
4. Construct Validity
The trait being measured should be distinguishable from other traits. “Extraversion” should be a separate dimension from “conscientiousness,” not just measuring the same thing with different words.
Standard: Factor analysis showing distinct dimensions.
Why The MBTI Fails Every Test
Test-Retest Reliability: FAILED
Research by Pittenger (2005) found that 50% of people get a different type when retaking the MBTI after just 5 weeks. That’s coin-flip accuracy.
If your “personality type” changes like your mood, it’s not measuring personality—it’s measuring how you felt that day.
The Dichotomy Problem: FAILED
The MBTI forces binary choices:
- Introvert OR Extravert
- Thinking OR Feeling
- Judging OR Perceiving
- Sensing OR Intuition
Reality: Most people fall in the middle. Studies show personality traits distribute on a bell curve (normal distribution), not bimodal distribution (two peaks at extremes with nothing in middle).
Forcing continuous traits into categories loses information. It’s like categorizing everyone as either “tall” or “short” with no middle ground.
Predictive Validity: FAILED
Multiple studies show the MBTI does NOT predict:
- Job performance (Grant, 2013)
- Career satisfaction (Pittenger, 2005)
- Relationship compatibility (no peer-reviewed evidence)
- Academic achievement (no correlation found)
If it doesn’t predict anything measurable, what’s it measuring?
The Barnum Effect: CORE PROBLEM
“You have a need for other people to like and admire you, and yet you tend to be critical of yourself. While you have some personality weaknesses you are generally able to compensate for them. You have considerable unused capacity that you have not turned to your advantage.”
Sound familiar? It should—it describes everyone. This is the Barnum Effect (named after P.T. Barnum): vague statements that feel personally true but apply universally.
MBTI descriptions are carefully worded to feel accurate while being non-specific enough that anyone can see themselves.
Why The Enneagram Is Better (But Still Limited)
The Enneagram Type Indicator has some advantages over MBTI:
What It Gets Right
- Better reliability: ~75% consistency on retest (Hook et al., 2020)
- Motivational focus: Examines WHY people behave, not just surface behavior
- Growth framework: Provides paths for development based on type
- Some research backing: Correlates with established personality dimensions
What’s Still Problematic
- Limited research: Far fewer peer-reviewed studies than needed for confidence
- Self-typing issues: Requires high self-awareness; people frequently mistype
- Arbitrary categories: Nine types may not reflect natural personality clusters
- Commercial origins: Developed by spiritual teachers, not empirical psychologists
Verdict: More useful than MBTI for personal growth, but not scientifically validated enough for high-stakes decisions (hiring, clinical diagnosis, etc.).
What Actually Works: The Big Five
While MBTI and Enneagram were created by individuals theorizing about personality, the Big Five was discovered through data.
How it was developed: Researchers collected thousands of personality-descriptive adjectives, had people rate themselves and others, then used statistical analysis (factor analysis) to identify underlying dimensions. The same five factors emerged across cultures, languages, and time periods.
The Five Dimensions (OCEAN)
1. Openness to Experience
High: Curious, creative, intellectual, imaginative, appreciates art and novelty
Low: Practical, conventional, prefers routine, focuses on concrete facts
Predicts: Creative careers, political liberalism, entrepreneurship, artistic interests
2. Conscientiousness
High: Organized, disciplined, goal-directed, reliable, plans ahead
Low: Spontaneous, flexible, may procrastinate, less concerned with order
Predicts: Job performance, academic achievement, longevity, financial success
3. Extraversion
High: Outgoing, energetic, seeks social stimulation, talkative, assertive
Low (Introversion): Reserved, prefers solitude or small groups, reflective, independent
Predicts: Leadership roles, sales performance, social network size, happiness (weakly)
4. Agreeableness
High: Compassionate, cooperative, trusting, values harmony, empathetic
Low: Competitive, skeptical, direct, values honesty over tact, independent-minded
Predicts: Relationship satisfaction, helping behavior, conflict resolution style
5. Neuroticism (Emotional Stability)
High: Emotionally reactive, experiences anxiety/depression more readily, sensitive to stress
Low: Emotionally stable, calm under pressure, resilient, even-tempered
Predicts: Mental health, stress response, relationship satisfaction, job satisfaction
Why Big Five Works
Test-Retest Reliability: 80-90% consistency over decades
Cross-Cultural Validity: Same five factors emerge in 50+ countries
Predictive Power: Correlates with job performance, relationship outcomes, health behaviors, political orientation, criminal behavior, and life satisfaction
Genetic Component: ~40-60% heritable (twin studies)
Big Five is the only personality system with strong scientific consensus.
Where To Take It (Free)
1. IPIP-NEO (International Personality Item Pool)
- 120-300 questions depending on version
- Free, public domain
- Correlates .95+ with commercial NEO-PI-R
- Available at: openpsychometrics.org
2. Big Five Inventory (BFI-2)
- 60 questions
- Shorter, still accurate
- Free for research/personal use
3. HEXACO (Big Five + Honesty-Humility)
- Adds sixth dimension: Honesty-Humility
- Better predicts counterproductive workplace behavior
- Free assessment online at hexaco.org
Using Big Five To Understand Your Child
Why Personality Matters In Parenting
Traditional parenting advice assumes one approach works for all children. It doesn’t. A discipline strategy that works for a conscientious, agreeable child might backfire with a child high in openness and low in agreeableness.
Understanding your child’s personality helps you:
- Match teaching methods to their learning style
- Set appropriate expectations (not fighting their nature)
- Identify genuine problems vs. personality differences
- Adapt communication and discipline strategies
When Can You Assess Children?
Age 3-5: Temperament becomes stable enough to measure
Age 6-12: Personality traits solidify but still developing
Age 13+: Adult-like personality structure emerges
Important: Children’s personalities are less fixed than adults. Traits shift more during development. Use as guide, not deterministic label.
Matching Parenting Style to Child’s Personality
Authoritative parenting (warmth + boundaries) works for most children, but implementation varies:
For High Openness
- Explain reasoning behind rules
- Allow negotiation within limits
- Provide choices
For High Conscientiousness
- Clear expectations and consequences
- Consistent follow-through
- Acknowledge their responsibility
For High Extraversion
- Discipline through discussion
- Allow verbal processing
- Social consequences (time with friends) as rewards
For Low Extraversion
- Discipline privately (not in front of others)
- Written communication can work better than verbal
- Alone time as reward, forced socializing as consequence
For High Agreeableness
- Gentle corrections (they’re already hard on themselves)
- Teach it’s okay to disappoint others sometimes
- Model healthy disagreement
For Low Agreeableness
- Logical consequences, not emotional appeals
- Respect their directness
- Teach empathy explicitly
For High Neuroticism
- Calm, predictable responses
- Emotional support before problem-solving
- Help regulate before disciplining
For Low Neuroticism
- Natural consequences (they won’t be devastated)
- Let them experience manageable failures
- Don’t rescue constantly
The Most Important Thing
Your child’s personality is NOT a problem to fix.
High Neuroticism isn’t “wrong”—it’s a different emotional experience that requires different support.
Low Agreeableness isn’t “mean”—it’s a strength in careers requiring directness and competition.
High Openness isn’t “scattered”—it’s intellectual curiosity that needs channeling, not suppressing.
Your job as parent:
- Understand their natural wiring
- Provide environment where they can thrive
- Teach skills that don’t come naturally to their type
- Accept them as they are while helping them grow
The goal isn’t to change their personality. It’s to help them become the best version of who they already are.