Rethinking Performance: Part 4 Of 5 Why Judgement Matters In Performance Evaluation
Rethinking Performance - Part 4
In Part 4 of the five-part Rethinking Performance series, written by Camille Rabier, Consultant at 21st Century, the focus shifts from design to real-world application. While the model provides a way of making behaviour evaluable, its effectiveness ultimately depends on how it is applied in practice. This places managerial judgement at the centre of behavioural evaluation, which is the focus of this article.
Making behavioural evaluation work in practice
The translation model provides a structured way of making behaviour evaluable within performance systems. By linking values to observable behaviours and supporting their assessment through indicators, it creates a clearer and more consistent basis for evaluating how work is carried out.
The question, then, is how this model translates into practice. While the model provides structure, its effectiveness depends on how it is applied. In practice, behavioural evaluation breaks down not at the level of definition, but at the point of interpretation. The same indicators may be applied differently across managers, contexts, and situations, leading to variation in how behaviour is understood and assessed. Understanding this variation is essential to applying behavioural evaluation consistently and meaningfully.
Integrating behaviour and outcomes
The challenge in practice is not only how behaviour is evaluated, but how it is integrated with outcomes to form a coherent view of performance. Outcomes and behaviour are not interchangeable, nor can one compensate for the absence of the other. Strong outcomes achieved through unsustainable or detrimental behaviours do not represent the same level of performance as those achieved through effective and enabling behaviours. Equally, behaviour that is well aligned but does not result in any delivery cannot be considered sufficient on its own, regardless of how well aligned it may be.
Performance must therefore be understood through the interdependent relationship between what is delivered and how it is achieved. This requires evaluating outcomes in light of the behaviours that produced them, rather than treating the two as separate components. While this provides a more complete way of understanding performance, it also introduces a new layer of complexity. If performance depends on how outcomes and behaviour are interpreted together, evaluation cannot be reduced to predefined rules or fixed weightings. It requires judgement to determine how different elements should be weighed in context.
In practice, even when behaviour is clearly defined and supported by indicators, consistency in evaluation is not guaranteed. The same outcomes and observable behaviours may be interpreted differently across managers, depending on how context, intent and impact are assessed.
Where application breaks down
Although the model provides a common language for describing behaviour, it does not, on its own, ensure alignment in how that behaviour is interpreted and evaluated. In practice, this is where discrepancies arise. Consider two managers assessing similar performance. Both observe comparable outcomes and similar patterns of behaviour, supported by the same set of indicators. One manager places greater emphasis on the strength of the outcome, viewing the behaviour as sufficient given the result achieved, while the other focuses more closely on how the outcome was delivered, identifying gaps in collaboration or sustainability that affect their overall assessment. In both cases, the same evidence is available, yet the conclusions differ. The variation does not arise from the absence of structure, but from differences in how that structure is interpreted and applied.
This divergence is further reinforced when indicators are applied mechanically and treated as checklists. In such cases, the presence of certain behaviours is taken as sufficient evidence, without fully considering their relevance, quality, impact, or the context in which they occur. The same behaviour may be appropriate in one situation and less effective in another. Without a shared understanding of how these factors should influence evaluation, assessments can diverge significantly, undermining comparability across individuals and introducing inconsistency in how performance is judged. As a result, even well-defined behavioural frameworks can produce inconsistent outcomes in practice because the model provides structure, but not alignment in how it is used.
The role of managerial judgement
The variation observed in practice is not a flaw of the model itself. It reflects the role of judgement in interpreting behaviour within context. Behavioural evaluation cannot be applied mechanically. While indicators provide a structured way of identifying how behaviour is expressed, they do not determine how it should be assessed. Managers must interpret what they observe, considering the circumstances in which behaviour occurred, the intent behind it, and its impact on others.
This highlights the distinction between applying a model and interpreting it. When applied mechanically, indicators can be reduced to checklists, creating the impression that behaviour has been fully assessed once certain elements are observed. However, effective evaluation requires more than identification. It requires judgement to determine the relevance, quality, and significance of what is observed. The presence of judgement is therefore not a limitation of behavioural evaluation, but a requirement. What differentiates effective from ineffective application is not whether judgement is used, but how it is applied. When anchored in shared indicators, judgement becomes more consistent, transparent and defensible.
This is why it is critical that judgement is not only applied but aligned across those responsible for evaluating performance.
Why calibration matters
Consistency in behavioural evaluation does not come from definitions or from the framework alone, but from alignment in how behaviour is interpreted and weighted in practice. While indicators provide a shared reference point, they do not guarantee that managers will assess behaviour in the same way. Differences in experience, perspective, and context can lead to variation in how behaviour is understood and evaluated.
Calibration provides a mechanism to address this by aligning judgement. It creates a space for managers to compare evaluations, discuss reasoning, and align on what constitutes effective behaviour in practice. Through this process, differences in interpretation become visible and can be reconciled over time. This alignment is not about enforcing uniformity or eliminating judgement. Rather, it is about developing a shared understanding of how behaviour should be assessed, including how it should be weighed relative to outcomes and how context should influence evaluation.
In this way, consistency is achieved not by standardising judgement, but by making it collective, transparent, and continuously aligned.
Designing systems that support judgement
If behavioural evaluation depends on judgement, performance systems must be designed to support it, rather than replace it. Indicators should function as reference points that guide evaluation, not as scoring mechanisms that determine it. When treated as checklists, they risk reducing behaviour to observable fragments, rather than enabling a more complete assessment of how work is carried out.
Performance systems should therefore create space for interpretation, discussion, and alignment. This includes encouraging managers to articulate the reasoning behind their assessments, compare perspectives, and refine their understanding of what effective behaviour looks like in practice.
The role of the system is therefore to support more consistent judgement through shared understanding, rather than enforce it through rigid criteria.
Conclusion
Performance systems have traditionally focused on what can be measured most easily, often at the expense of what matters most.
While behaviour can be made more visible through structured models and indicators, it cannot be reduced to objective measurement. The effectiveness of behavioural evaluation depends not only on how behaviour is defined, but on how it is interpreted, aligned, and applied in practice.
Performance systems do not eliminate judgement but shape how it is exercised, and it is through structuring and aligning this judgement that performance can be understood more fully and evaluated more meaningfully.
This becomes most visible when differences in evaluation emerge in practice. Situations where managers assess similar performance differently are not uncommon, even when outcomes and behaviours appear consistent. How these differences are addressed is critical and is explored further in Part 5.
Total Words: 1280
Submitted on behalf of
- Company: 21st Century
- Contact #: 0760781723
- Website
Media Contact
- Agency/PR Company: The Lime Envelope
- Contact person: Bronwyn Levy
- Contact #: 0760781723
- Website
Social Media Post
Rethinking Performance: Part 4 Of 5 Why Judgement Matters In Performance Evaluation
Part 4 explores how behavioural evaluation works in practice, highlighting the role of managerial judgement, challenge of consistency, and why performance depends on how behaviour and outcomes are interpreted together...