Performance management is not capability building

A strong appraisal cycle can run flawlessly beside leaders who do not develop. Performance management sorts people. It rarely builds the capability that produces the performance. They are different jobs.

Dr Eric Albertini · Co-Founder, CapabilityFX8 June 2026 · 8 min read

a woman standing at a desk in front of a laptop computer

Photo by TheStandingDesk on Unsplash

A well-run appraisal cycle that builds nothing

Most organisations have a performance management system that works. Objectives are set at the start of the year and cascaded down. Mid-year check-ins happen. Ratings are calibrated across a panel so that no manager can inflate their own team. Bonuses and increases flow from the outcome. The whole machine runs on schedule, and at the end of it everyone knows where they stand.

What the machine does not do, despite the widespread assumption that it does, is build the capability that produced the performance in the first place. It measures. It sorts. It rewards and corrects. These are real and necessary functions. They are not development, and treating them as if they were is one of the most common and expensive category errors in the way organisations think about their leaders.

Two different jobs wearing the same clothes

Performance management and capability building get conflated because they overlap in calendar, vocabulary, and ownership. The same HR function often runs both. The annual review talks about "development areas" alongside ratings. A development plan is stapled to the appraisal form. Because the activities sit next to each other, it is easy to assume they are the same activity. They are not.

The job of performance management is measurement and allocation. It exists to answer a set of organisational questions: who delivered against what they agreed, how do people compare, who should be promoted, paid more, managed out. These are sorting questions. A performance system is, at root, an instrument for distinguishing between people and distributing consequences accordingly. Done well, it is fair, consistent, and defensible.

The job of capability building is change in the person. It exists to answer a different question: what would make this leader more capable next year than they are now, in a way that holds when conditions are hard. That is not a sorting question. It is a developmental one, and it operates on a different timescale, with different evidence, and through different means.

The trouble is that the sorting machine produces a number, and the number feels like it ought to drive the change. A leader rated "meets expectations" on stakeholder management is told to improve stakeholder management. The rating has named a gap. But naming a gap is not the same as closing it, and the appraisal process contains nothing that closes it. The number describes the destination. It does not lay the road.

This is a different point from the one we made in why measuring development well requires a deliberate design. That piece is about how to measure development properly when you are actually developing. This one is about a prior error: mistaking the measurement system itself for the development. You can have excellent metrics and still be measuring a performance process that builds nothing.

Why a strong process coexists with flat capability

The evidence that the two jobs are distinct is visible in any organisation that has run a mature performance system for years and still complains that its leadership bench is thin. The process is not broken. It is doing exactly what it is designed to do. It was simply never designed to develop anyone.

There are three reasons a high-functioning performance system can sit on top of stagnant capability.

Ratings reward the performance, not the growth. A leader who is already strong gets a high rating and a bonus. A leader who is genuinely stretching, taking on work at the edge of their capability and occasionally getting it wrong, may rate lower than the safe performer who never left their comfort zone. The system pays for delivery, which is reasonable, but delivery and development are not the same thing, and a system that pays only for delivery quietly teaches leaders to stop stretching. The most developmentally important behaviour, attempting something you cannot yet reliably do, is the behaviour the rating penalises.

The conversation is evaluative, which closes the door development needs open. A genuine developmental conversation requires a leader to look honestly at their own patterns, including the uncomfortable ones. An appraisal conversation requires them to defend their rating, because the rating attaches to their pay. These two stances are incompatible. The moment a conversation determines money, the person in it manages the impression rather than examines the truth. We have written about why the inside-out work is the half that determines whether anything holds in leadership development keeps training the wrong half; the appraisal room is structurally the worst possible place for that work, because evaluation and honest self-examination cannot occupy the same conversation.

Development plans are an afterthought with no architecture. The development section of most appraisal forms is filled in last, in the final five minutes, with a generic line about "improving executive presence" or "delegating more." There is no diagnosis of why the leader operates the way they do, no sequence of practice, no one accountable for the change between now and the next review. A development plan that is a sentence on a form is not a development plan. It is a note that the box was ticked.

None of this means performance management is wrong. It means performance management is not development, and an organisation that relies on its appraisal cycle to build leaders is relying on a tool for a job it was never built to do.

What it looks like in practice

The category error becomes concrete in specific cases. Two are worth setting out.

The HR director with an immaculate system and a thin bench. An HR director at a South African financial services group had spent four years professionalising the performance cycle. Calibration was rigorous. Ratings were distributed sensibly, free of grade inflation. Pay was demonstrably linked to delivery. By any audit, the system was sound. Yet when the executive team mapped its succession risk, the same problem surfaced that had surfaced three years earlier: a layer of competent operators who could run what existed but could not be trusted to lead through genuine ambiguity. The performance system had measured this gap accurately every year. The "leads through uncertainty" line item had sat at "developing" for the same group of leaders cycle after cycle. The rating was correct and it changed nothing, because nothing in the cycle acted on it. The number was a thermometer, read annually, attached to no treatment. When she stopped expecting the appraisal process to develop people and stood up a separate, deliberately non-evaluative stream of capability work for that group, the line item finally began to move, not because the measurement improved, but because something other than measurement was now happening.

The high performer who was rated, never developed, and left. A divisional finance lead in a manufacturing business was the organisation's reliable top performer. She rated "exceeds expectations" every year, was paid accordingly, and was told repeatedly that she was on the succession shortlist for a group role. What no one noticed, because the system was not built to notice it, was that her strong ratings rested entirely on outside-in capability that suited the role she already had. She delivered because the work matched her strengths. The harder, inside-out development that the group role would demand, holding her nerve in genuinely contested executive debate, leading peers rather than direct reports, sitting with the discomfort of decisions she could not control, was never touched, because the performance system had no mechanism to touch it. It only ever confirmed she was excellent at her current job. She was promoted on the strength of her ratings, struggled in the larger role precisely on the dimensions the ratings had never tested, and left within eighteen months. The appraisal data had been accurate throughout. It had also been silent on the one thing that mattered for the next step, because measuring current performance tells you almost nothing about capability for a role with different demands. We make the wider version of this argument in assessing potential is not the same as assessing performance.

What both cases share is a leader who trusted the performance system to do a developmental job, and a system that, however well run, was constitutionally unable to do it. The fix in neither case was a better appraisal process. It was a separate process aimed at the thing appraisal does not reach.

What genuine capability building actually requires

If performance management sorts and rewards, what does the other job, the one that actually builds capability, require? Three things, none of which a rating cycle supplies.

A real diagnosis, not a rating

A rating tells you a leader is weak at something. It does not tell you why, and without the why there is nothing to work on. A leader who avoids difficult conversations may be conflict-averse, may lack a structure for the conversation, or may have an unexamined need to be liked. These have different remedies. Genuine development begins with a diagnosis of pattern and cause, not a score on a scale. This is where validated instruments, used honestly, earn their place: not to grade the leader, but to surface the patterns worth working on. CapabilityFX uses Ennea International's Five Lens Development Platform and Tomorrows Compass's future-readiness assessment for exactly this, as inputs to a diagnosis rather than verdicts on a person. Our assessments page sets out what each one does and where it fits.

A developmental relationship that is not evaluative

The conversation that builds capability cannot be the conversation that sets pay. It has to be safe enough for a leader to admit what they cannot yet do, which means it has to be uncoupled from consequence. This is precisely why the work belongs outside the line, with someone whose only agenda is the leader's development, not their rating. The DUAL model (Discover, Understand, Accept, Lead) describes the movements such a relationship moves a leader through, and the first two, genuinely discovering and understanding one's own patterns, are simply not available inside an evaluative process.

Time and an architecture across it

Capability does not change in a conversation. It changes through a sequence of practice, reflection, and challenge held across months, with someone accountable for the arc rather than the annual snapshot. This is the difference between a development plan that is a line on a form and one that has structure. Our 4D method is built around exactly this kind of arc, deliberately on a different rhythm from the annual cycle, because the annual cycle measures at a frequency that is useful for sorting and useless for development.

Put plainly: performance management asks "how did this leader do," once a year, to allocate consequence. Capability building asks "who is this leader becoming," continuously, to change the answer. Both matter. Neither substitutes for the other.

The reader's next step

If you are responsible for leadership capability, three questions will tell you whether your organisation has made the category error.

If you removed the development section from your appraisal form, would any actual development stop? If the honest answer is no, the development section was never doing developmental work. It was documenting an intention. That is worth knowing, because it means your real development capacity is whatever exists outside the form, and for many organisations that is close to nothing.

Can a leader in your organisation admit a genuine weakness without it costing them? If every conversation that touches a leader's gaps also touches their pay or their standing, you have no safe developmental space, and leaders are managing their image rather than building capability. The honesty development requires is structurally unavailable.

When a leader is rated weak on something for three years running, what happens? If the answer is "they get rated weak on it again," the system is measuring a gap it has no mechanism to close. The thermometer works. There is no treatment attached.

These questions do not call for a better performance system. They call for the recognition that development is a separate job needing a separate process. Our services and use cases describe how that work takes shape alongside, not inside, an organisation's existing performance machinery.

Keep the system. Build the leaders separately.

The argument here is not against performance management. A fair, consistent appraisal cycle is a genuinely valuable thing, and most organisations should keep theirs and run it well. The argument is against asking it to do a job it cannot do. Rating a leader is not developing them. Ranking a leader is not developing them. Setting a leader objectives and reviewing them is not developing them. These are acts of measurement and allocation, and they leave the underlying capability exactly where they found it.

The organisations that build leaders who hold under pressure are the ones that stopped expecting the appraisal cycle to do it, and stood up something else. If you want to think through what that something else looks like for your organisation, start a conversation with us. It is a different conversation from the one your performance system is having, and it is the one that actually changes who your leaders are.

The leaders described here are representative composites drawn from patterns we observe in practice, not identifiable individuals.

Dr Eric Albertini · Co-Founder, CapabilityFX

Originator of the DUAL model, developed through his doctoral research at the University of Johannesburg. Eric has spent his career building leadership capability inside executive teams.