
Incident Investigation Done Right: Getting to the Root Cause
Incident Investigation Done Right: Getting to the Root Cause
When a workplace incident occurs, the immediate priority is to manage the situation and provide care. But what happens next is critical. Too often, the subsequent investigation stops at the immediate, observable cause—a worker slipping, a machine malfunctioning, a procedural step missed. This superficial analysis, while seemingly efficient, is a missed opportunity. It’s like treating the symptom without diagnosing the disease. True, sustainable safety improvement requires a deeper, more systemic approach to incident investigation, one that uncovers not just what happened, but why it was possible for it to happen in the first place.
Beyond the Blame Game: A Systems-Thinking Approach
The conventional approach to incident investigation often ends with identifying a single point of failure, frequently culminating in “human error.” This is a dangerously incomplete conclusion. While an individual’s action may have been the final step in the chain of events, it is rarely the sole cause. To prevent recurrence, we must shift from a person-centric blame model to a systems-thinking approach. This means looking at the entire operational environment—the processes, the equipment, the training, the culture—to understand how it contributed to the incident.
One of the most powerful mental models for this type of analysis is James Reason’s “Swiss Cheese Model” of accident causation [1]. Reason, a pioneer in the field of human error, proposed that organizations have multiple layers of defense to prevent catastrophic failures. These defenses can be technological (e.g., safety guards, alarms), procedural (e.g., checklists, work permits), or human (e.g., training, supervision). In an ideal world, these layers are solid. In reality, they all have inherent weaknesses, or “holes,” like the holes in a slice of Swiss cheese.
The Swiss Cheese Model in Action
Imagine each layer of defense as a slice of cheese. The holes in each slice are constantly opening, closing, and shifting. These holes represent latent failures—dormant weaknesses in the system that may lie hidden for days, months, or even years. Examples of latent failures include:
- Inadequate training: Employees are not fully equipped to handle non-routine situations.
- Poorly designed equipment: A machine has a known design flaw that creates a workaround.
- Time pressure: Production goals consistently override safety procedures.
- Ineffective supervision: Supervisors lack the training or authority to enforce safety rules.
An incident occurs when the holes in multiple slices of cheese momentarily align, allowing a hazard to pass through all the layers of defense and cause a loss. The individual at the sharp end of the incident—the one who made the “error”—is simply the last line of defense. Blaming them ignores the alignment of holes that set them up for failure.
From “Who?” to “Why?”: The 5-Why Technique
To uncover these latent failures, a simple but effective tool is the 5-Why analysis. Developed by Sakichi Toyoda and used within the Toyota Production System, this technique involves repeatedly asking the question “Why?” to peel back the layers of an issue and get to its root cause. For example:
- Problem: A worker slipped and fell.
- Why? There was oil on the floor.
- Why? A machine was leaking oil.
- Why? The gasket on the machine was worn out.
- Why? The machine was not on a regular preventive maintenance schedule.
- Why? The maintenance department is understaffed and can only respond to emergency breakdowns.
In this example, a simple slip-and-fall investigation moves from blaming the worker for not being careful to identifying a systemic issue with maintenance resource allocation. The corrective action is not to “tell workers to be more careful,” but to implement a preventive maintenance program and address staffing levels. This is how real prevention happens.
Actionable Takeaways for Leaders
Improving your incident investigation process is one of the highest-leverage activities you can undertake to improve workplace safety. Here’s how to get started:
- Train your investigators: Ensure that anyone leading an investigation is trained in root cause analysis techniques like the Swiss Cheese Model and the 5-Whys.
- Go beyond the immediate cause: Mandate that every investigation identifies contributing factors and systemic weaknesses, not just the active failure.
- Focus on corrective and preventive actions (CAPAs): The output of an investigation should be a set of actions that address the identified root causes. These actions should be specific, measurable, achievable, relevant, and time-bound (SMART).
- Verify effectiveness: Don’t just implement corrective actions; circle back to verify that they have been effective in preventing recurrence.
By adopting a systems-thinking approach to incident investigation, you can move your organization from a reactive cycle of blame and repeat incidents to a proactive culture of learning and continuous improvement.
Further Reading and Resources
For a deeper dive into building robust safety systems, explore the comprehensive resources available at Trident Business Group. From our detailed guide on implementing Behavior-Based Safety programs to our supervisor safety leadership checklists, we provide the practical tools you need to move from a reactive to a proactive safety culture. Visit https://tridentbusinessgroup.biz/resources to learn more.
References
[1] Reason, J. (2000). Human error: models and management. BMJ, 320(7237), 768–770.
[2] Wiegmann, D. A., & Shappell, S. A. (2017). A human error approach to aviation accident analysis: The human factors analysis and classification system. Ashgate Publishing, Ltd.
[3] Larouzee, J., & Le Coze, J. C. (2020). Good and bad reasons: The Swiss cheese model and its critics. Safety science, 126, 104660.