Security Awareness Metrics: What to Track, What to Ignore

Key Insights

Completion rates prove training was delivered but cannot show whether employees retained material, changed habits, or would recognize a real threat.

Verizon's 2025 DBIR found employees with recent training reported phishing at roughly four times the rate of those without recent training.

Low reporting rates may reflect tool friction or punitive simulation cultures rather than low awareness, making context essential for interpretation.

Only inbox-delivered emails should form the reporting-rate denominator, since gateway-blocked messages are never actionable by employees.

Metrics without predefined out-of-range responses are decorative, not actionable; every dashboard measure needs a documented action plan.

Security awareness metrics in most organizations begin and end with who finished the training. That number tells you almost nothing about whether anyone will actually report a suspicious email, question an unexpected wire transfer request, or follow through on a security policy when it matters.

The metrics that belong on a dashboard should answer a harder question: is employee behavior changing in ways that reduce real risk? This guide explains which measures belong on that dashboard, which ones distort the picture, and how to build an approach grounded in evidence rather than habit.

Key Takeaways

Training completion rates and quiz scores are useful for compliance and diagnostics, but they should not stand alone as measures of program effectiveness.
Reporting behavior, incident trends, and other outcome signals provide a clearer view of whether security habits are changing.
Separating process, behavior, and outcome metrics makes dashboards easier to interpret and act on.
Culture and behavior change take time, so useful measurement depends on trends tracked over multiple periods rather than one-time snapshots.

Security Awareness Metrics: What They Measure and Why Most Dashboards Miss the Point

Most programs rely on the narrowest slice of available signals and miss the categories that connect to real risk.

Security Awareness Metrics as Process, Behavior, and Outcome Signals

Security awareness metrics are often grouped into three categories, and confusing them can lead to poor decisions:

Process Metrics: Track whether activities happened, such as course completions and attendance logs.
Behavioral Metrics: Track what people do when faced with a real or simulated decision, such as reporting a suspicious message or clicking a link.
Outcome Metrics: Track organizational results, such as employee-generated incidents or changes in mean time to detect a threat.

NIST formalizes a similar breakdown in SP 800-55 Revision 1, separating implementation measures from effectiveness, efficiency, and impact measures. When a dashboard mixes these categories without labeling them, a high completion rate can sit next to a rising incident count and nobody questions the contradiction.

Why Completion Rates and Quiz Scores Cannot Stand Alone

Completion rates prove that training was delivered. They do not prove anyone retained the material, changed a habit, or would recognize a real threat. Quiz scores suffer from the same problem. They measure knowledge at one moment in time, under test conditions, without the cognitive load or time pressure that accompany real security decisions.

Compliance metrics still matter for audits and regulatory filings, but they belong in an appendix, not as the headline story of program performance.

Distinguish Vanity Metrics from Decision-Making Metrics

A vanity metric looks good in a report but does not change what the security team does next. A decision-making metric triggers a specific action when it moves. The test for any metric on a dashboard is straightforward: if this number changes significantly, does the team know what to do differently? If the answer is no, the metric belongs in an appendix, not on the front page.

Which Security Awareness Metrics Actually Matter

The metrics worth tracking reflect real security behavior, predict future incident trends, and respond meaningfully to program changes.

Reporting Rate: A Leading Behavioral Metric

Reporting rate measures the percentage of employees who flag a suspicious message through official channels, whether during a simulation or in response to a real threat. According to the Verizon 2025 DBIR, users with recent training reported phishing at roughly four times the rate of those without recent training. Reporting rate is more operationally useful than click rate because a reported phishing email gives the security operations team something to act on immediately.

Reporting rate has confounders. If employees lack a one-click reporting tool, low reporting may reflect tool friction rather than low awareness. Punitive simulation cultures also suppress reporting.

Click Rate: A Contested and Difficulty-Sensitive Signal

Phishing simulation click rate is widely used and often misunderstood. Click rates can vary based on factors such as simulation difficulty, training, and security controls. The NIST Phish Scale was developed precisely because the same workforce will show very different click rates depending on lure sophistication.

Without recording the difficulty rating for every campaign, a team cannot reliably interpret movement in click rate over time. Click rate still has a role in identifying departments or roles that need additional support, but it should not be the headline success metric.

Add Longitudinal Behavior Change and Employee-Generated Incidents

Single-campaign snapshots miss the trajectory. Tracking behavior across multiple periods reveals whether a program is producing durable change or just short-term compliance spikes that fade between training cycles.

Employee-generated incident rates sit at the other end of the spectrum as lagging indicators. They count actual security events caused by human actions and connect directly to organizational risk. A program that reduces employee-generated incidents over time has stronger evidence of genuine impact.

How to Build a Security Awareness Metrics Framework

A useful security awareness metrics framework keeps metric selection tied to decisions instead of habit and makes gaps in measurement easier to spot.

Map Metrics to NIST SP 800-50r1 and SP 800-55v1 Categories

NIST provides complementary guidance for security awareness and measurement, including:

SP 800-50 Rev. 1 for building a cybersecurity and privacy learning program and SP 800-55 for information security measurement.
SP 800-50r1 separates awareness from training and emphasizes measuring program effectiveness through metrics such as incident reporting and changes in incident trends after targeted awareness efforts.
SP 800-55v1 adds the implementation-effectiveness-efficiency-impact framework.

Mapping existing metrics against these categories often reveals clustering. Most programs stack heavily on implementation measures and have little in the impact column. Walking each metric through this four-category test shows which categories have adequate coverage and which remain empty.

Separate Leading Indicators from Lagging Indicators

Leading indicators help teams adjust early, while lagging indicators show whether those adjustments reduced risk over time.

Leading: Reporting rates, self-efficacy survey results, and behavioral trend data. These signals can change before incident volume does.
Lagging: Employee-generated incidents, successful phishing breaches, and data loss events. These are better suited to periodic review.

A balanced framework includes both, with clear ownership for who monitors each type and how often.

Tie Each Metric to a Clear Decision or Action Playbook

Every metric on the dashboard should have a documented response for when it moves beyond an expected range. Without this linkage, metrics become decoration. Teams collect data and build charts, but the numbers do not change how the program operates. If a metric does not have a realistic action associated with it, it may not belong on the active dashboard.

What Can Distort Security Awareness Metrics

Even well-chosen security awareness metrics can mislead if the data behind them contains uncontrolled variables or measurement artifacts.

Difficulty and Timing Effects

A campaign using a well-crafted, context-specific lure will produce higher click rates than one using a generic template, regardless of workforce readiness. Teams should standardize difficulty using the NIST Phish Scale where appropriate, or report the difficulty level of phishing simulations alongside results. Campaign timing and message context can also influence how people respond. Without controlling for lure difficulty, teams cannot infer training effectiveness from click-rate changes.

Tool Availability and Psychological Safety

Reporting rate can only measure behavior if employees have a frictionless way to report. If the reporting button is buried in a submenu, or if it works differently across email clients, low reporting may reflect a user-experience problem rather than a behavioral one.

Organizations that use simulation results punitively, including naming individuals in team meetings or tying results to performance reviews, suppress the very reporting behavior they claim to measure. A written non-punitive policy helps keep reporting rates interpretable as a genuine behavioral signal.

Solve Denominator and Delivery Problems Before Reporting Trends

Reporting metrics depend on clearly defining what counts as a real malicious message and which population of messages is included in the calculation. Messages blocked by the email gateway (SEG) never reach employees, so they do not belong in the denominator.

If malicious messages target the organization but the gateway blocks many of them, only the messages that actually reach inboxes belong in the denominator. Using all targeted messages instead of delivered ones can make reporting performance look much worse than it is. Before reporting trends, teams should document their calculation methodology, including what counts as a confirmed threat and how delivery data is sourced.

How to Measure Security Culture Without Guesswork

Culture measurement works best when it is separated from training activity and paired with behavioral evidence over longer periods.

Treat Culture, Behavior, and Training Activity as Separate Layers

Training activity tracks program delivery. Behavior tracks individual actions like reporting rates and secure file-sharing choices. Culture captures shared beliefs, norms, and attitudes toward security across the organization.

Collapsing them into a single score hides the dynamics between layers. Measuring each layer independently may reveal that training activity scores are high and behavioral metrics are strong, but culture survey data shows employees still view security as someone else's job. That finding points to a cultural intervention rather than more training modules.

Use Qualitative Feedback and Self-Efficacy Signals Carefully

Surveys, focus groups, and structured interviews are the primary tools for measuring culture and self-efficacy, the belief that one can perform secure behaviors when it matters. Self-efficacy matters because it captures confidence and perceived capability rather than just information recall.

Self-report data is subject to social desirability bias. Triangulating survey results with behavioral data strengthens both signals. When stated confidence diverges sharply from observed behavior, that gap identifies a priority for investigation.

Set Realistic Time Horizons for Behavior and Culture Change

Setting longer-term measurement goals with interim behavioral milestones gives leadership more realistic expectations. Communicating these interim indicators demonstrates progress within the longer timeline and protects the program from being judged too quickly.

How to Report Security Awareness Metrics to Different Stakeholders

Security awareness metrics are more useful when the same data is translated into the decisions each audience actually needs to make.

Translate Operational Metrics for Security Teams

Security operations teams need granular, filterable data updated continuously. They use simulation results segmented by department, reporting timestamps, and repeat-clicker lists to guide targeted interventions. Executives need risk exposure trends, program trajectory over time, and connections between awareness investments and incident reduction. Board members need business-level framing that shows how awareness metrics relate to risk posture and oversight.

Presenting click rates to a board without translating them into business risk language undermines the security team's credibility.

Reframe Security Awareness Metrics for Executives and Boards

The National Association of Corporate Directors (NACD) 2026 Director's Handbook recommends framing cyber risk in business terms, including how it affects strategy, core value drivers, financial exposure, and mission-critical parts of the enterprise. Framing reporting rate as a leading indicator of how quickly the organization detects threats that bypass technical controls translates an operational number into a board-level risk signal.

Reports should be visual, concise, and comparable across reporting periods, with the goal of supporting strategic oversight rather than presenting raw data.

Keep Compliance Evidence, but Demote It in the Narrative

Completion rates, policy acknowledgments, and quiz evidence still matter for audits and regulatory filings, but they should not lead the narrative when the goal is to explain program effectiveness. They support the record of program activity; they do not replace behavior and outcome measures.

A Practical Starting Point for Better Security Awareness Metrics

Programs need a defensible starting point and a plan to improve it, not a perfect measurement system on day one.

Establish a Baseline Without Overclaiming Early Results

Before introducing any program changes, teams should capture current reporting rates, click rates with lure difficulty noted, and whatever incident data exists. This baseline becomes the reference point for all future comparisons. If reporting tool availability varies across the organization or if incident tracking systems changed recently, note those conditions alongside the baseline numbers. A flawed but documented baseline is far more useful than no baseline at all.

Start With a Small Metric Set and Expand Gradually

A reasonable starting set contains a small group of metrics with clear decision links: simulation reporting rate, click rate with difficulty annotation, repeat-clicker percentage, and one lagging indicator like employee-generated incident count. That small set is more useful than a crowded dashboard that nobody acts on. Each metric should have a documented owner and a defined response for when it moves beyond normal range.

Revisit the Framework as Risks and Programs Mature

Measurement needs change as programs mature. Early-stage programs focus on establishing behavioral baselines and confirming that reporting tools work. As programs mature, they can begin tracking longitudinal trends, comparing cohorts, and eventually layering in culture survey instruments and correlations between awareness metrics and incident outcomes. The NIST SP 800-55 Rev. 2 initial public draft introduces a four-category framework that can be used as a check over time: if every metric still sits in the implementation column after the program has matured, the measurement approach has not kept pace with the program.

Measurement That Earns Its Place

The most useful dashboards connect security awareness metrics to decisions, behavior, and risk instead of activity counts alone. A small set of well-defined measures, tracked consistently and interpreted with context, gives leadership a clearer view of whether the program is changing anything that matters. Over time, that discipline turns reporting from a compliance exercise into a more credible measure of impact.