When the Same Problem Follows You Everywhere

What if it isn’t you? What if it’s the measurement?

Jun 07, 2026

I. Before You Say a Word

There is a moment most people have experienced but few have named. You walk into a room — a classroom, a meeting, a disciplinary hearing, a doctor’s office, a job interview — and you can feel, before a single word is spoken, whether you will be believed.

It is not always visible as a single event. Sometimes it accumulates slowly, across many rooms, many years, many institutions. You notice that some people seem to arrive with a kind of authority already attached to them. They speak and the room listens. Their accounts of events are received as plausible. Their distress is treated as proportionate. When conflict arises, the default assumption tilts in their favor.

Other people walk in carrying the opposite. They have to establish credibility from scratch, in real time, under conditions that are already working against them. Their accounts are questioned before they finish speaking. Their distress reads as instability. When conflict arises, the default assumption tilts against them — not because of anything they’ve done, but because of how they are read.

This is not paranoia. It is not a distortion of experience. It is a feature of how institutions allocate legitimacy — and it operates, largely, before any assessment of what a person has actually said or done.

Some people arrive with credibility already assigned. Others spend their entire lives trying to earn what was never going to be given.

For many neurodivergent people, this experience is not occasional. It is the background condition of institutional life. The same dynamic — speaking clearly, being heard as confused; reporting accurately, being treated as unreliable; asking reasonable questions, being read as difficult — repeats across different settings with different people, generating the same outcome so consistently that it begins to feel like a personal failing.

It is not a personal failing. It is a measurement problem. And understanding it as a measurement problem changes everything about what the solution looks like.

II. What Credibility Is Actually Measuring

Credibility is the currency of institutional life. It determines who is listened to, who is trusted, who is believed when accounts conflict. And institutions talk about credibility as though it were a neutral, accurate measure of truthfulness and reliability.

It is not.

Credibility, as it functions in most institutional settings, is a social judgment — assembled from cues that correlate with social fluency far more reliably than they correlate with accuracy. Calm delivery. Confident eye contact. Linear, coherent narrative. Emotional responses that match what the evaluating institution expects. The ability to mirror the communication style of the person assessing you. These are the signals institutions read as evidence that someone is credible.

None of them measure honesty. They measure performance. And the performance that reads as credible was calibrated around a specific kind of communicator — one whose neurological and social profile happens to match the assumptions built into the evaluation.

Credibility, as it functions in most institutions, does not measure truthfulness. It measures how closely your presentation matches the institution’s template for a trustworthy person.

The consequences of this miscalibration are not evenly distributed. Neurodivergent people — whose genuine responses often include flat affect, atypical eye contact, intense or fragmented emotional expression, direct communication that reads as aggression, non-linear narrative — are read as less credible not because they are less honest, but because their honest responses don’t match the template.

Meanwhile, people who understand credibility signals and can perform them strategically — regardless of whether their account is accurate — receive the benefit of the doubt the system was always going to give to whoever performed most fluently.

The institution believes it is assessing truthfulness. It is assessing social performance. That gap between what is claimed and what is measured is where a great deal of injustice quietly lives.

III. The Proxies We Mistake for the Thing Itself

Credibility is not the only place this happens. Across nearly every domain where institutions claim to evaluate human capability, the same pattern appears: an institution chooses a proxy for something it wants to measure, the proxy is easier to observe than the thing itself, and over time the institution begins treating the proxy as though it were the real thing.

The gap between the proxy and the thing it was meant to represent does not disappear. It widens — quietly, systematically, in ways the institution rarely examines.

Education

Schools claim to measure learning, intelligence, and understanding. What they more often measure is compliance, tolerance for particular task formats, speed of information recall under time pressure, and the ability to sit still and attend to externally imposed structures for extended periods.

A student who deeply understands a concept but cannot demonstrate it under timed exam conditions is measured as less capable than one who has memorized information and can reproduce it quickly. The system calls this achievement. It may actually be measuring compatibility with the assessment format.

Hiring

Hiring claims to measure job ability. Most interviews are, functionally, measures of social performance under pressure — the candidate’s ability to mirror the interviewer’s communication style, produce confident and coherent self-narrative on demand, fill silence with the right kind of words, and display the non-verbal cues the interviewer reads as trustworthy.

A brilliant engineer and a mediocre engineer can receive opposite outcomes from the same interview process, because the interview is not measuring engineering. It is measuring interviewing. Those are different skills, and conflating them produces predictable errors at scale.

Performance Reviews

Performance reviews claim to measure contribution and output. Research on workplace evaluation consistently finds they more often measure visibility, self-promotion skill, political acumen, and manager affinity. The employee producing the most value is not always the employee receiving the best review, because the review process is not calibrated to capture value. It is calibrated to capture impression.

Professionalism

Professionalism is presented as a measure of competence and reliability. In practice, it is largely a set of conformity signals: eye contact norms, tone of voice, body language, small talk fluency, office etiquette. None of these necessarily correlate with ability to do the job. All of them influence evaluations of whether someone is capable of doing the job.

Framed differently: professionalism often measures how closely a person resembles the dominant social group’s definition of a capable person. It is not a neutral standard. It is an inherited one — and its costs fall unevenly on everyone who did not grow up inside the norms it encodes.

Leadership

Leadership selection claims to identify the most capable people to guide organizations. Research on leadership perception and advancement consistently finds that charisma and confidence are the traits most reliably rewarded — and that they are systematically mistaken for competence and effectiveness.

Some of the worst leaders look like leaders. Some of the best leaders don’t. The institution continues selecting for the appearance because the appearance is legible and the actual quality is difficult to see until the consequences arrive, by which point the selection has already been made.

Credibility

And here the circle closes. Credibility claims to measure truthfulness and reliability. It measures social legibility — the ease with which a person is read and processed by the institution evaluating them. The person who fits the template is believed. The person who does not has to argue for standing they were never going to be given on the same terms.

The pattern is consistent across every institution that claims to evaluate human capability: the proxy is not the thing. And the proxy was not chosen neutrally.

IV. The Consequences Are Not Limited to Neurodivergent People

This is where the argument opens into something much larger.

If a measurement system is measuring the wrong thing, the consequences are not limited to the people most obviously excluded by it. They radiate outward — to the institutions doing the measuring, to the teams and communities inheriting the outcomes, and eventually to the broader society those institutions are meant to serve.

For Neurodivergent People

The most visible consequence is exclusion. Talented people are filtered out of hiring processes that measure interview performance instead of job capability. Capable students are labeled deficient by schools measuring compliance instead of understanding. People spend years masking — performing the social signals that grant institutional access — rather than contributing from a position of genuine fit.

The individual cost is unemployment, underemployment, burnout, misdiagnosis, and a persistent erosion of confidence in one’s own perception and capability. These are not small costs. They accumulate across a lifetime.

For Institutions

Institutions believe they are selecting the best people. If the measurement is flawed, they are selecting the people who perform best on the measurement — which is a different thing entirely.

The consequence is systematic talent misidentification: missed innovators, specialists whose expertise doesn’t survive the interview, deep thinkers who don’t self-promote effectively, pattern recognizers who cannot perform the social rituals that signal belonging. Over time, institutions optimized for proxy performance become populated by people who are good at looking competent. That is not the same as being competent, and the difference eventually shows up in the outcomes.

For Teams

Teams inherit the consequences of poor measurement at every level. An organization that consistently promotes confidence over competence, visibility over contribution, and political skill over expertise does not simply promote the wrong individuals. It creates an environment where the most capable people — the ones least likely to self-promote, least likely to perform belonging, most likely to find the political dynamics alienating — eventually leave.

The remaining team absorbs the cost: lower capability, higher workload, greater dependence on people selected for the wrong reasons. The organization rarely traces this back to the measurement system that created it.

For Society

Every institution functions as a filter. Schools filter students. Hiring processes filter workers. Licensing systems filter professionals. Promotion structures filter leaders. If those filters are systematically measuring the wrong things, the aggregate effect is a society that misallocates human potential at scale.

The result is talent shortages in fields that claim to want talent but filter for performance. Leadership failures in organizations that selected for charisma rather than capability. Innovation loss from the systematic exclusion of thinkers whose cognitive profiles don’t translate into institutional legibility. These are not hypothetical costs. They are measurable, and they compound.

For Truth

This consequence is the most consequential of all.

If institutions systematically mistake confidence for competence and social fluency for truthfulness, then over time they begin rewarding people who are skilled at appearing right rather than being right. That changes who gets listened to, who gets promoted, who gets funded, who gets believed when it matters.

A system optimized for appearances does not simply make mistakes. It becomes unable to recognize its mistakes — because the measurement keeps producing the results the system expects.

The most capable people look like evidence that the selection was wrong. The selected people look like proof that the system chose correctly. The institution never examines what it was actually measuring, because the outcomes appear to confirm the method.

This is the deepest design failure: not that flawed measurement produces bad outcomes, but that it produces bad outcomes while convincing everyone the outcomes are correct. The excluded people become evidence that the system is working. The system becomes unfalsifiable from inside itself.

V. This Is a Design Problem

The gap between what institutions claim to measure and what they actually measure was not designed with malicious intent. It emerged from the natural human tendency to rely on observable proxies for things that are harder to see — and from the failure to examine those proxies against the outcomes they were meant to predict.

But the fact that the design failure was not intentional does not make it neutral. Unexamined systems do not sit still. They calcify. The proxies become the standard. The standard becomes the definition. And eventually, the institution loses the ability to distinguish between the signal it was trying to read and the noise it has been reading all along.

Understanding this as a design problem rather than a bias problem changes what the solution looks like. Bias interventions ask people to see past their assumptions. Design interventions change the measurement instrument. The first requires individual effort, conversation by conversation, assessment by assessment, against a current that is always running the other way. The second changes the current.

That means making the assumptions explicit. Writing down what a process claims to measure. Auditing whether the observable signals being used as proxies actually correlate with the claimed measure. Asking who is systematically excluded by the current instrument and whether that exclusion is evidence of a population problem or a measurement problem.

In most cases, when the audit is done honestly, the answer is the same: the measurement instrument was not designed for the full range of human capability. It was designed, largely without awareness, around a narrower model — one that treated a particular kind of social performance as universal, and read everything that diverged from it as deficiency.

The people excluded by a measurement system are not evidence that they failed. They are evidence that the measurement was not designed for them.

This reframe matters enormously for the individuals who have spent years inside the standard explanation — the one that locates the problem in them. The legitimacy gap is not something that can be closed by becoming more fluent in the signals the institution rewards. It can be closed by redesigning the instruments that mistake those signals for the thing itself.

That is a harder task. It requires institutions to examine something they have largely taken for granted: the assumption that their methods of evaluation are measuring what they say they are measuring.

Some institutions are beginning to ask that question. More need to.

Conclusion: What the System Was Built to See

The legitimacy gap is not a new problem. It is an old design running without examination. Institutions inherited assumptions about what trustworthy, capable, intelligent, and reliable people look like — and built their measurement systems around those assumptions so thoroughly that the assumptions became invisible.

The result is a set of filters that claim to be selecting for capability while selecting, in practice, for a narrow set of social performances that correlate with capability only weakly, under conditions that were never examined, in populations that were never representative.

For neurodivergent people, the gap is most visible and most costly. But the gap does not affect only neurodivergent people. It affects everyone whose genuine capability does not translate cleanly into the proxy the institution has decided to measure. It affects the institutions doing the measuring, which believe they are selecting the best and are often doing something else. It affects the societies those institutions serve, which are absorbing the compounding cost of misallocated human potential without being able to trace it to its source.

The question worth asking is not why certain people keep failing institutional measurements. That question accepts the measurement as accurate and looks for the problem in the person.

The question worth asking is what the measurement was actually built to see — and whether what it was built to see has anything to do with what the institution claims to value.

Most institutions are not failing at inclusion. They are succeeding at something else — a narrower, older task they inherited without examining. The work is not to make the excluded more legible to the system. The work is to build systems that can see.

Nothing is neutral. Normal was designed.

The people excluded by a system are not evidence that they failed. They are evidence that the design did.

Toby

I didn't think the systems actually want to change this, even when they become aware of it. For my whole career I misunderstood the workplace. They asked for innovation, I gave them brilliant innovation, not in my opinion, but by objective measures, and they rejected it because it meant a loss of status or authority. They used various versions of "If we did that, we would be admitting we were wrong about this all along". This was eventually followed by, "your services are no longer required...". I realise now that when they asked for innovation, they were actually asking me to make them look good.

1 reply by Eva Redford

Erin Pyper, MSW

Jun 8

“Credibility is the currency of institutional life. It determines who is listened to, who is trusted, who is believed when accounts conflict. And institutions talk about credibility as though it were a neutral, accurate measure of truthfulness and reliability.” You hit the nail on the head in this article. It should not have to be that way.

4 more comments...

Institutional Legitimacy™

Discussion about this post

Ready for more?