How to Measure Employee Performance in Remote and Hybrid Work
Why outcomes aren't enough, and what to measure instead: Remote and hybrid work have forced organizations to rethink how they measure performance, especially for knowledge workers and creative roles where contribution isn't visible, linear, or easily reduced to metrics.
REMOTE WORK | I/O PSYCHOLOGY | PERFORMANCE
Jess Sumerak
12/17/2025 · 7 min read


Why Outcome-Based Performance Reviews Are Crippling Innovation
To survive, human brains rely on cognitive shortcuts: heuristics that help us make fast sense of a world far more layered than we can consciously process. But when we apply those shortcuts to modern organizations, which are complex, probabilistic, multi-agent systems, they distort how we judge people, performance, and worth.
One of the most common distortions is our tendency to mistake probabilistic participation for personal causation. Someone succeeds, so we assume they caused the success. Someone struggles, so we assume they caused the failure. We then moralize those conclusions, assigning positive or negative character traits to outcomes as if results were owned by individuals rather than produced by systems.
This instinct feels intuitive. It is also deeply flawed.
Outcomes Are Emergent, Not Owned
In systems thinking, outcomes are not the product of a single actor. They emerge from interaction. Any result, good or bad, arises from a convergence of factors: individual effort, starting conditions, access to resources, timing, health, randomness, decisions made by others, systemic constraints, macroeconomic forces, and variables no one can see in real time.
No human controls that entire field.
Yet organizational cultures routinely behave as if people do.
When we say “they succeeded because they’re better” or “they failed because they didn’t try hard enough,” we are committing a category error. We are treating a system output as if it were a personal possession. From an epistemic standpoint, this is invalid. It requires knowledge of total causal conditions that are not available in probabilistic, multi-agent systems. Even with good data and good intentions, leaders simply cannot know enough to make those claims honestly.
This matters, because once outcomes are moralized, compassion erodes and inequity hardens.
What People Actually Control (and Why That Distinction Matters)
Industrial-organizational psychology has long distinguished between behaviors, processes, and results. This distinction becomes even more important in knowledge work and remote environments.
People can influence effort, attention, preparation, collaboration, communication, learning, and how they respond to constraints. They cannot command outcomes. They cannot control timing, how others respond, market shifts, organizational bottlenecks, health disruptions, or the compounding interactions of complex systems.
The most honest statement is this: people influence probabilities; they do not command results. Anything beyond that is storytelling.
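To make that concrete, here is a minimal simulation sketch in Python. The skill level, noise scale, and number of periods are illustrative assumptions, not empirical values: two people with identical controllable contribution whose results diverge purely because of system noise.

```python
import random

# Minimal sketch: two "employees" with identical skill operating in a
# noisy system. Each period's outcome = personal influence + factors
# no one controls (timing, dependencies, other actors, randomness).
random.seed(7)

SKILL = 0.6          # identical controllable contribution for both people
SYSTEM_NOISE = 0.8   # spread contributed by conditions outside their control
QUARTERS = 12

def quarterly_outcome(skill: float) -> float:
    """One period's result: skill shifts the odds, noise decides the rest."""
    return skill + random.gauss(0, SYSTEM_NOISE)

a = [quarterly_outcome(SKILL) for _ in range(QUARTERS)]
b = [quarterly_outcome(SKILL) for _ in range(QUARTERS)]

# An outcome-only review would rank whoever drew better conditions as
# "better", even though both influenced probabilities identically.
print(f"A average result: {sum(a) / QUARTERS:+.2f}")
print(f"B average result: {sum(b) / QUARTERS:+.2f}")
```

Run it with different seeds and the "top performer" flips back and forth. That instability is the category error in numerical form: the gap between A and B is pure noise, yet an outcome-only review would moralize it.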
When organizations ignore this distinction, they unintentionally punish employees for variables outside their control and reward others for alignment with favorable conditions. Over time, this degrades trust, increases burnout, and incentivizes risk-avoidant behavior rather than learning.
It also pushes those who are succeeding under current conditions to work hard at sustaining the status quo, which suppresses innovation and blocks change management efforts.
Why Outcome-Based Performance Reviews Fail in Modern Work
Traditional performance reviews assume a deterministic world. They presume that if expectations are clear and effort is sufficient, outcomes will reliably follow. This assumption may have held more weight in repetitive, tightly controlled environments. It does not hold in adaptive, knowledge-based, remote, or cross-functional work.
Research in I/O psychology and organizational behavior consistently shows that outcome-only evaluation increases defensiveness, impression management, and blame-shifting while decreasing psychological safety and experimentation. Employees learn quickly that visibility matters more than substance and that failure, even intelligent failure, is dangerous.
This is especially misaligned in remote or hybrid environments, where “butts in seats” are no longer visible and should no longer be the proxy for contribution. Measuring presence rather than impact is not just outdated; it actively undermines autonomy, trust, and work-life balance.
A More Human, More Accurate Way to Measure Performance
A systems-aligned approach to performance evaluation shifts the focus from outcomes to quality of participation within constraint.
Instead of asking only “What happened?” leaders ask:
How did this person approach the work given the conditions they were operating under?
What decisions did they make with the information they had at the time?
How did they collaborate, adapt, communicate, and learn?
Did they surface risks early?
Did they improve the system, even if the outcome wasn’t ideal?
This aligns with decades of research on procedural justice, psychological safety, and self-determination theory. People perform better when they are evaluated on effort, growth, contribution, and integrity of process rather than on outcomes distorted by factors they cannot control.
Importantly, this does not mean abandoning accountability. It means relocating accountability to where it is epistemically valid.
Systems Inevitably Drift Into Chaos
Systems naturally oscillate between order and disorder. As complexity increases, new variables are introduced faster than the system can adapt to them.
Cultures built on outcome moralization will always fracture under complexity. Cultures built on systems awareness can adapt.
When leaders stop pretending they can fully control results, they stop needing to assign blame to preserve the illusion of competence. Trust increases. Learning accelerates. People take responsible risks. Performance becomes more sustainable, not because pressure increases, but because accuracy does.
Human injustice in organizations rarely originates in malice. More often, it originates in cognitive simplification applied to irreducibly complex systems.
The work of modern leadership is not to eliminate uncertainty, but to design cultures that can function ethically and effectively within it.
A Systems-Aligned Performance Review Framework
If outcomes are emergent, then performance reviews have to stop treating results as moral proof. A better review system evaluates what is actually knowable: how someone participated in the system, what they contributed, how they handled constraints, and how reliably they created value over time.
In I/O psychology terms, this moves reviews away from a single, outcome-heavy snapshot and toward a more valid view of job performance as a combination of task performance, contextual performance, and adaptive performance. Classic performance models (Campbell’s model of performance; Borman & Motowidlo’s distinction between task and contextual performance; Pulakos’s work on adaptive performance) all support the idea that what makes someone valuable at work is bigger than “did the metric go up.”
What to measure instead of “did you hit the number”
Start with results, but treat them as signals, not verdicts. Results still matter, but they don’t stand alone.
Then add three measures that capture what outcomes can’t: process quality, relational contribution, and adaptability.
Process quality is the reliability of someone’s judgment and execution within the conditions they were operating under. Leaders can evaluate whether someone planned appropriately, escalated risk early, chose good tradeoffs, used evidence, and followed through consistently. This aligns with decades of research showing that structured, behavior-focused evaluation reduces bias and increases appraisal accuracy compared to vague trait ratings.
Relational contribution measures how someone affects system functioning. This includes collaboration, knowledge sharing, psychological safety behaviors, and whether they reduce friction for others or create it. I/O research on contextual performance and organizational citizenship behaviors supports the idea that these contributions strongly affect team effectiveness even when they don’t show up on a spreadsheet.
Adaptability measures how someone responds when reality changes. Learning speed, flexibility, recovery after setbacks, and ability to function amid ambiguity are not “nice-to-haves” anymore; they’re core performance in modern work. Adaptive performance research exists precisely because stable job environments are no longer the norm.
A practical way to frame this for leaders is: results show what happened; process shows how it happened; relational contribution shows how the person impacted others; adaptability shows how they handle change. Together, these are more predictive and more just than outcomes alone.
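As a sketch of how those four lenses could sit side by side in a single review record, here is one hypothetical data structure in Python. The field names, rating scale, and example entries are assumptions for illustration, not a prescribed standard.

```python
from dataclasses import dataclass, field

@dataclass
class ReviewEvidence:
    """One observable behavior plus the concrete example behind the rating."""
    behavior: str
    example: str
    rating: int  # 1-5, anchored to written behavioral descriptions

@dataclass
class SystemsAlignedReview:
    """Results stay in the record as a signal; three behavior-based
    dimensions capture what outcomes can't."""
    results_summary: str  # what happened: a signal, not a verdict
    process_quality: list[ReviewEvidence] = field(default_factory=list)
    relational_contribution: list[ReviewEvidence] = field(default_factory=list)
    adaptability: list[ReviewEvidence] = field(default_factory=list)
    constraints_noted: list[str] = field(default_factory=list)  # conditions outside the person's control

review = SystemsAlignedReview(
    results_summary="Launch slipped two weeks; adoption ran ahead of forecast.",
    process_quality=[ReviewEvidence(
        behavior="Chooses tradeoffs deliberately",
        example="Cut scope on reporting to protect the migration deadline",
        rating=4,
    )],
    relational_contribution=[ReviewEvidence(
        behavior="Shares knowledge proactively",
        example="Documented the vendor workaround for two sister teams",
        rating=5,
    )],
    constraints_noted=["Vendor API outage", "Mid-quarter scope change"],
)
```

Recording constraints next to ratings is the point: the review captures the conditions the person operated under, not just what the system emitted.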
How to measure it without turning it into bureaucracy
The most defensible approach is structured ratings anchored in observable behaviors, paired with short evidence summaries. Behaviorally anchored rating approaches (like BARS) have a long history in I/O psychology because they reduce “vibe ratings” and make feedback more consistent across managers.
In practice, that looks like a small set of role-specific behaviors defined in plain language, reviewed quarterly with concrete examples. Leaders ask for a brief “evidence packet” that includes a few completed work samples, a short reflection on constraints faced, and a summary of what was learned or improved. This is especially useful in remote work, where contribution isn’t always visible.
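For the behaviorally anchored piece, a single rubric entry might look like the sketch below. The behavior and anchor wording are invented examples of the "plain language" idea, not a validated scale.

```python
# Hypothetical BARS-style rubric entry: each rating level is anchored to
# an observable behavior rather than a trait adjective like "proactive".
RISK_ESCALATION_ANCHORS = {
    1: "Risks surface only after they have caused visible damage.",
    2: "Mentions risks when asked, but rarely volunteers them.",
    3: "Raises known risks in regular check-ins with some context.",
    4: "Flags emerging risks early and proposes at least one mitigation.",
    5: "Anticipates risks before work begins and builds mitigations into the plan.",
}

def rate(level: int, evidence: str) -> dict:
    """Pair a rating with the concrete example that justifies it."""
    return {"level": level,
            "anchor": RISK_ESCALATION_ANCHORS[level],
            "evidence": evidence}

print(rate(4, "Flagged the vendor delay in week 2 and proposed a fallback"))
```

Because each level names something a manager could actually observe, two raters looking at the same evidence are far more likely to land on the same number.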
To reduce bias and rater idiosyncrasies (a major known problem in performance appraisal research), add calibration. Calibration is where leaders compare ratings across teams to make sure “excellent” means roughly the same thing everywhere. This supports fairness and improves decision quality.
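One simple way to ground a calibration conversation, sketched below with made-up ratings, is to standardize each manager's scores against their own distribution before comparing across teams, so a lenient rater's 4 and a strict rater's 4 land on a common scale.

```python
from statistics import mean, stdev

# Illustrative ratings only: each manager rates their own reports 1-5.
ratings_by_manager = {
    "mgr_lenient": [5, 5, 4, 5, 4],
    "mgr_strict":  [3, 2, 4, 3, 3],
}

def calibrated(scores: list[int]) -> list[float]:
    """Z-score ratings against the rater's own distribution so
    'excellent' is measured relative to how that rater actually rates."""
    mu, sigma = mean(scores), stdev(scores)
    return [round((s - mu) / sigma, 2) for s in scores]

for manager, scores in ratings_by_manager.items():
    print(manager, calibrated(scores))
# The strict manager's lone 4 now scores above the lenient manager's 4s,
# which is exactly the kind of gap a calibration meeting should discuss.
```

The math doesn't replace the conversation; it just surfaces where "excellent" means different things so leaders know where to look.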
For roles that are highly cross-functional, selective 360 input can help, but it should be used carefully. Research and practice both suggest multi-rater feedback is most useful for development, and less reliable when used as the sole basis for compensation decisions. The key is to treat peer input as additional evidence, not a popularity contest.
What changes in the actual review conversation
This framework shifts the conversation from “defend your outcome” to “understand your contribution.”
Instead of asking employees to justify why the number didn’t happen, leaders ask what constraints were present, what decisions were made, what risks were surfaced, what tradeoffs were chosen, and what will be done differently next cycle. This aligns with feedback and motivation research showing that people improve more when feedback is specific, behavior-linked, and autonomy-supportive rather than globally evaluative or identity-threatening.
There’s also a direct link here to organizational justice research: when employees perceive performance processes as fair, consistent, and transparent (procedural justice), trust and commitment rise, and cynicism and withdrawal behaviors decrease. That’s culture, but it’s also business.
How this ties to the bottom line
This approach improves performance because it improves the system conditions that create performance.
When employees are evaluated on what they can actually control, you reduce learned helplessness and fear-based behavior. That increases discretionary effort, knowledge sharing, and intelligent risk-taking. Psychological safety research supports this: teams perform better when people can speak up early, admit uncertainty, and surface problems before they become expensive.
This approach also reduces turnover. Performance review injustice is one of the fastest ways to lose high performers and retain the wrong people, because opaque, outcome-only judgment rewards visibility and luck rather than contribution. Turnover is an enormous cost center, and even conservative estimates put replacement costs far above what most leaders intuit, especially in skilled roles.
Finally, you get better forecasting. Outcome-only systems tend to create performance theater: people manage perceptions, sandbag targets, and hide risk. Systems-aligned reviews make it safer to share real constraints, which gives leaders better information and prevents avoidable failures. That shows up directly in quality, customer experience, project delivery, and operational efficiency.
Pulling It All Together
Many organizations are currently shifting away from physical presence as a measure of contribution and toward outcome-based performance. That shift is overdue, but it is incomplete.
Outcome-only evaluation carries the same structural flaw as presence-based evaluation: it ignores the probabilistic, multi-variable nature of real work. Organizations that stop here will soon be correcting course again, once the limits of outcome moralization become impossible to ignore.
The research already points further. Performance is best understood as contribution to organizational goals through sound judgment, collaboration, adaptability, and learning within constraint. Leaders who understand this now can design performance systems that are not just modern, but durable.
Don’t get stuck in the lane everyone else is about to outgrow. Build a culture aligned with how work actually functions, before the next correction cycle begins.
