Managing the problems associated with the global supply chain of major products these days requires a flexible, adaptable, and consistent approach.
One need look no further than the latest commercial aircraft designed by Boeing to see just how far a company is willing to go to realize the numerous and enormous benefits of a decentralized, extended global supply chain. Expertise and specialization is focused directly on individual components. Profit and loss responsibility is concentrated into smaller, more manageable (and thereby accountable) business units. Risk is diversified across multiple "baskets" of suppliers. The benefits of local markets (such as cheaper labor and the proximity to raw materials) can be exploited. The list goes on.
But Boeing's strategy also shows some of the inherent risks and unfortunate consequences associated with managing such a supply chain. Unfortunately, it is one thing to map out all the potential benefits of a diversified global supply chain, yet quite another for the company and its managers to actually make it out alive given the risks involved.
While the new 787 by Boeing may be one of the most ambitious attempts at wringing benefits from an extended global supply chain, we can find other examples of all sizes, shapes, and flavors. Globalization is just another way to say "global supply chain." Like it or not, it is here to stay.
Companies that learn to manage the risks of a global supply chain can expect to reap, at the very least, the reward of survival. But those that learn to proactively manage the problems encountered in such a diverse system can expect to rule their sectors. The key component to proactive problem solving is a robust Solution Management System (SMS) built on a solid, adaptable root cause analysis program.
To proactively minimize the risk of future failures by learning from failures in the supply chain, read the rest of our article published in the February issue of Industrial Engineer magazine.
]]>
When something goes wrong in your organization, is human error often the identified cause? Is "more (task-related) training" for the people involved often the designated solution? As a result, have employees in your organization reluctantly endured many kinds of training - often referred to as "flavor of the month"? And do the problems you are trying to prevent continue to occur? Given limited resources, are you getting the most out of those resources?
Organizations have an opportunity to look more closely at the results they are trying achieve, and evaluate what actions will help achieve those results.
Consider all the time and energy it takes to provide training - from the people who initiate and drive it, to the people who plan it, to the people who deliver it, to the people who partake in it, not to mention the work that is not getting done while those people are away from their jobs, or the temps that must be hired to fill in - it is a huge commitment and investment. Especially in the current economic climate, when everyone needs to be very careful with the way they invest their resources - time, people and money - it is all the more important to carefully focus training and make sure there's payback on the investment.
Training in an effective root cause analysis methodology and careful program implementation can resolve these issues and help accomplish training goals. For best practices, go to our article published as the cover story in the February 2010 issue of Plant Engineering.
Link to cover and contents:
http://www.plantengineering.com/archive/2010/20100201.php
Link directly to article text:
http://www.plantengineering.com/article/447578-Is_more_training_the_solution_to_human_error_.php
]]>
One thing is for certain, you must compete for executives' attention. A compelling case that stands out from the many other initiatives is necessary.
Executives often don't pay attention to root cause analysis programs. Why? It's not because they don't care. Rather, they don't understand the lost value that an effective RCA program could easily scoop up. Leaders these days are simply swamped with significant problems and little time is available for them to dig in and understand what is going on, or at least that is the perception.
The reality is that if they can invest one or two hours of time and help you solve just one system-wide problem, they can not only endorse your efforts, they can help save significant money and demonstrate a proactive change that will improve the work processes or system. You need to set the stage before this can happen. So, where do you begin?
First, let's understand the systemic causes that enable system-wide problems. Systemic problems are pervasive throughout all work processes in the entire organization, including reliability, manufacturing, EH&S and supply chain. Systemic problems have considerable negative impact on overall performance and competitiveness - they generate many of the causes (problems) you have been working hard to eliminate. Left unchecked, systemic problems will always generate new problems.
Question: How have you successfully eliminated systemic causes?
Question: What kind of ROI have you helped your organization to achieve?
Question: How have you involved executives? Suppliers? Customers? And how has that impacted leadership support of your program?
Share your experiences and best practices. We're building an online community to support professional development and continuous improvement.
Posted 11 December 2009 by Chris Eckert, Apollo President
]]>Many emerging ITIL programs either don't have a formal root cause tool in place or take an ad hoc approach to identifying problems. Either way, most don't know how to separate incidents from problems in order to right-size the investigation response, so that time and money is not spent needlessly.
It is most important to ensure that your root cause analyses are focused on the events that have -- or could have -- the greatest impact on the most important functions of your organization. That way, your RCA effort is aligned with the organization's business goals. To differentiate between incidents and problems -- or incidents and major incidents -- draw a line in the sand where the cost of the unwanted event outweighs the cost of a formal investigation. Set threshold criteria that specifically outline the kind of response that is appropriate for different scenarios, make it easy for anyone including help desk staff to make the decision, and remember to clearly outline the rationale to justify your criteria.
One way to create threshold criteria that define what Problem Managers should formally investigate would be to set up a table.
Engage people in the organization to help you define what level of service interruption or deviation could compromise the various business goals. Be aware that each person's definition of what constitutes a problem or major incident will be different.
Another way to approach problem threshold criteria, especially in larger organizations, would be to:
In order to do this successfully, you may need to more specifically define the general terms that are contained in many service agreements. For example:
How does your organization decide when to perform a root cause analysis?
Question: What threshold criteria do you have in place?
Question: What best practices did you follow to you create them?
Question: Who leads the effort? What role to systems and business analysts play?
Share your experiences and best practices. We're building an online community to support professional development and continuous improvement.
Posted by Mark Hall | Canada Account Manager, Instructor, & Investigator | 12/2/2009
[1] Within the incident management process, an incident is defined as an event that interrupts or reduces the Quality of Service. An incident may affect a single employee (i.e., user) but it does not affect the overall business in a significant way. Within the problem management process, a problem is defined as a series of incidents that adversely affects the business because it affects a group of employees.
]]>Question: When you present the results of a root cause analysis, do decision makers question the conclusions, thinking they know better?
Question: Are you denied budget for solution implementation?
These symptoms may indicate lack of management support.
Before you can change corporate culture, you need to know:
Eliminate the bad, reinforce the good. Sounds simple enough. But we all know it can be political, complicated and tricky.
Put yourself in the executive's shoes. Play to their perspective and needs.
Question: How is the level of management support in your organization affecting your job?
Question: What attitudes and behaviors do you need to eliminate or reinforce?
Question: How have you improved the corporate quality culture?
Share your experiences and best practices. We're building an online community to support professional development and continuous improvement.
I look forward to hearing from you,
John Stiller | November 17 2009
]]>
It was a dark and stormy night, and all the vampires were just getting to work. As they gathered around the blood cooler drinking their cups of red 'go-juice', the conversation was dominated by the subject of a chilling event that happened a few nights prior. Apparently, Vlad (not his real name) sprained his ankle while jumping down off a flat bed truck trailer. He had climbed up onto the trailer to replace a tarp that had been ripped sometime during previous usage. He threw the ripped tarp onto the ground below. Then, instead of climbing carefully down the permanent access ladder at the back of the truck, he sat down on the edge and gently pushed himself off. He landed on the discarded tarp, which slipped under his foot. He then twisted his ankle. Ouch! The witch doctor concluded he had a badly sprained ankle, which forced the safety manager to reset the lost-time accident counter to zero.
During the interview, Vlad offered the following:
"Well, that turned out to be a stupid mistake. I did this work as a human for five years as a contractor in Moldova before my abrupt introduction to the realm of the undead. You think vampires are thick in Transylvania? Well, after Gorbachev took the lid off the Eastern Block, they fanned out all over the place. Just like Bald Eagles - you used to go your whole life without seeing them... now they're competing for road kill with the crows. Hmmm, speaking of road kill, is it lunch time yet? Anyway, now I've been doing the same kind of work - albeit on the night shift - for the past fifteen years. But my undead bones must be getting weak. I've never been hurt before. I thought that I was being careful by sitting down first before jumping off the trailer. If only I hadn't landed on that tarp, I wouldn't have slipped and we wouldn't be here."
Management wasn't buying it. They knew that Vlad had been trained recently in best safety practices because they pulled the training roster. Right there, between Keith Richards and Joan Rivers (two longtime members of the undead) was Vlad's scrawled signature. He should have known that that his actions of jumping off the trailer weren't acceptable.
The rules were very clear. If an employee violates a safety procedure, they are to be bolted into their coffin for two days with a sunlamp and a bag of beef jerky. That's what the conversation at the blood cooler was all about. Sure, Vlad may have violated a rule. But did he really deserve to be punished in such a way? The punishment wasn't perceived as just by the other vampires onsite because Vlad had a reputation for being safe. He had never been hurt before. And, everyone took shortcuts from time to time. The supervisors seemed to be more interested in meeting their demonic production targets than making sure that every little safety rule was followed. If Vlad hadn't been hurt, no one would have cared.
Now I know what you're thinking. If you've got enough vampires to work the night shift, and you're worried about a sprained ankle, your priorities could use some fine-tuning. Agreed, but since I'm already invested in the metaphor now we've got to stick it out...
Recently, Cory Boisoneau and I attended the National Safety Congress conference in Orlando, Florida. I had the opportunity (thanks to Hilda Koskiewicz of NSC) to host a professional development session titled "Does Disciplinary Action Increase the Risk of Human Error?". This was a panel discussion where I gave a presentation and then we opened up a dialogue with the panel members and the audience about the topic. I was very fortunate to have Beth Lay, Manager of Risk Management and Loss Control and Mark Thomasson, Six Sigma Master Black Belt, both of Siemens Energy, as panel members. Also, thanks to the attendees who contributed great stuff to the conversation.
My take on this, as you can see from my previous blog on this subject, as well as my recent article in the October 2009 issue of Occupational Health and Safety, is that systemic risk can be found in conditional causes of events and non-systemic risk can be found in the action causes of events. Most investigations focus on action causes, and therefore only on the non-systemic risk elements in an event. However, as we stress in our root cause analysis courses, the best solutions are often found in controlling conditional causes. This is because controlling conditions reduces the systemic risk of recurrence to which all employees are subject... not just the single actions of any one employee.
Disciplinary action is intended to control action causes of an event. It attempts to control worker's choices by providing a disincentive to acting outside the established rules. However, my experience has shown me that disciplinary action is often ineffective in reducing risk. This experience is apparently mirrored by many others. In James Reason's latest book
"The Human Contribution: Unsafe Acts, Accidents, and Heroic Recoveries" page 74 discusses the concept of blame (the act of attributing to which occurs immediately before handing down disciplinary action) in the context of what he calls "the fundamental attribution error", which occurs when we assume that an error is caused by some character flaw, rather than the fact that they made choices based upon past experiences along with information available at the time of the incident. In other words, the conditions at the time of the event were eminently important to the decisions made by the blamed (and subsequently disciplined) individual.
Disciplinary action often not only fails to reduce risk of future recurrence, it also can actually increase the risk of future errors. When discipline is applied programmatically (as in Vlad's case above - the supervisors were 'forced' into discipline by their own rules) regardless of conditions at the time of the incident, the action will be perceived as unjust by both the person experiencing it and that person's peers. This is a nightmare (remember, it's Halloween) scenario for an investigator of future incidents because when the time comes to understand the causes, no one's talking. No one saw anything. No one knows what happened. That's spooky stuff.
This was the point I attempted to make in my presentation. But I learned that I needed to account better for times when punitive disciplinary action actually does help to reduce risk... primarily when dealing with cardinal rule violations, such as horseplay, drinking, drugs, etc. Beth Lay of Siemens shared a great decision tree based on the work of James Reason that they created in order to help recognize the difference between an error that resides primarily with the employee or with management systems. I am convinced that this is the right way to go. An organization needs guidance to know how to act, and in the future we at Apollo will work to develop our own guidelines to help decide when disciplinary action will be effective without actually adding to systemic risk.
A great article on this subject by Kevin Sharpe of Science and Spirit Magazine can be found here for those interested in reading more.
Have a happy Halloween! Be safe! Or be boring like me...I think I'll spend this year on the couch watching baseball.
Brian Hughes
]]>These are typical thoughts when developing a team to investigate a problem. Who should be on the team? In today's environment, most employees perform many tasks and are responsible for various activities, or in other words wear multiple "hats" - so how will they react to being assigned yet another task?
RCA investigations are often eye opening, and when performed correctly very effective at preventing problem recurrence. Most RCA team members take great pride in participating in an analysis, and value the company's decision to include them on such a pressing issue. Putting together an effective team is crucial to the success or failure of an RCA analysis.
There are usually 2 basic roles in an investigation; a facilitator and a participant. Here's a look at what these are and the differences between them:
Facilitator - This is the person leading the group. Ideally this individual is knowledgeable of the incident, but would not be considered the "expert" on the issue. However, the facilitator should be proficient in the RCA method being applied to the analysis. Having an education on the corporate process involved in the incident is extremely helpful, but often not a requirement. Managing a team is the responsibility of the facilitator, and the rest of the team looks to the facilitator for direction and guidance on what to do in the investigation.
Participant - A participant is an individual selected to be actively involved in the investigation. An effective team is generally made up 4-6 participants with different roles. Industry or corporate experts are usually involved, and offer information on the process and equipment involved in incidents. Other participants include individuals with direct knowledge of what happened and outside personnel with no direct knowledge of the incident or process.
You might ask "why have someone in the room with no expertise or knowledge of the problem?". It is important to have someone that has no vested interest in the causes or solutions, other than self-preservation (i.e. employment). You need someone to ask what experts might say are "dumb" or "ignorant" questions, because those questions are valuable when conducting an analysis. Thoroughly identifying the causes of an incident include those simple causes that experts might not even consider as they might have already dismissed those as contributing causes. As the old saying goes "there are no dumb questions" is true when applying those questions to an RCA analysis.
Another attribute of an effective RCA team is the diversity of the group. We've briefly discussed the roles of the investigation team and what value they bring to the investigation, but having a diverse group both in education and background also helps contribute to an effective team. You never want a team made up of similar educations and backgrounds because these similarities could contribute to the old "group think" mentality.
I suppose what I'm trying to say is: mix it up. Expertise is great, but creativity in a group can be just as valuable.
What are your thoughts? Have you had any successful RCA teams? What were some of the attributes of the team you found intriguing and worth mentioning?
Marcus McCoy - Apollo Account Manager/Investigator
]]>Fundamentally, the Apollo RCA method is a collection of critical thinking principles designed to help you create a strong argument that explains why a problem occurred and shows the logical connection between solutions and the actual causes of a problem. In an Apollo RCA the logic of your analysis is visually depicted via the Realitychart. The Realitychart also becomes the platform from which effective solutions are identified and the place where the logical connections between causes and effective solutions are evident. Without a solid Realitychart, a problem solvers' ability to identify solution options and to select the most effective solutions is diminished.
The principles of logical thinking teach us that an argument is valid when the conclusion logically flows from the premises AND there are no other circumstances that would invalidate the argument. A sound argument is a valid argument when the premises are true. Try thinking of your next RCA as an exercise in constructing a logical argument and bringing clarity to an otherwise complex scenario as your primary job. With that in mind, here are some tips to help you build an inductively strong argument into your next Apollo RCA:
1. Think of your entire analysis as an argument with the primary effect being your main conclusion and the causes as the premises which support that conclusion.
2. Be sure that the primary effect logically flows from the causes you have identified. Use Baby Steps (i.e., inserting causes between causes) to improve the logical flow of your analysis.
3. Be sure your analysis will resonate with your colleagues. If you have to, run it by them - they will point out areas that do not make sense.
4. Be sure that all the causes are true. Use robust forms of evidence to validate each cause. Use more than one piece of evidence to increase the certainty for any causes you feel require a greater level of validation.
5. Avoid vague or ambiguous wording when describing causes such as: inadequate, poor, lack of, or failed. Be specific what those terms mean but don't get too wordy (Extra Hint: use the new "Notes" or "Reference" feature in RealityCharting® version 4.0 to clarify your terms).
6. Add any additional causes to your analysis that strengthen your argument. Try to anticipate any counter arguments or "Yes, but what about..." scenarios and address them by adding additional causes.
7. Try focusing on the logic of what your Realitychart is saying, not on labeling the actions and conditions. Label your actions and conditions after you are confident the analysis is sound.
8. Spend enough time on the analysis so that it describes your problem adequately enough for you to see a number of different solution opportunities.
9. Ensure your final solutions are specific and detailed enough that they can be easily implemented by a third person by avoiding vague or ambiguous wording.
10. Follow the four steps of the Apollo method but remember that the "ends" - creating a logical analysis, is more important than the "means" - i.e., how you get there.
What specific tips or hints do you have that have helped you to improve the strength of your analysis or success of your solutions?
Regards,
Apollo RCA Instructor
]]>
Brian Hughes | Vice President
]]>Posted by: Cory Boisoneau | May 28 2009
]]>o If so, are your training and mentoring systems robust enough to handle the transition?
o What steps are you putting in place to address the shortage?
I would be interested in hearing your thoughts and experiences.
By Chris Eckert, 5/1/2009
]]>| Original Phrase | New Phrase | |
| “Management Culture” |
becomes.. | “Management Behavior” |
| “Problem-Solving Culture” |
becomes.. | “Problem-Solving Behavior” |
| “Culture that encourages risk-taking” | becomes.. | “Behavior that encourages risk-taking” |
| “Culture that places the highest value on production” | becomes.. | “Behavior that places the highest value on production” |
You get the picture right away. It even works outside of the workplace. I guess I always knew this intuitively, but it’s never been directly on the surface the way it is now. A culture is really defined by the behaviors of the members of that culture.
So why did this make an impact on me? I’m a sucker for specifics. I don’t have a lot of value for categorical phrases, such as ‘inadequate maintenance’ or ‘wrong procedure’ because phrases like this don’t provide me with the specifics I need to put an effective solution in place. I always have to ask a follow up question, such as: “Why was the maintenance inadequate?” The same is true with a word like ‘Culture’. What can I do with a word like ‘Culture’? What can my clients do with a when I tell them they would be better off by developing a ‘Problem-Solving Culture’?
The value for me is the fact that the word ‘Behavior’ quickly gets me to specifics. What behaviors do I want to change? What conditional causes allow and encourage these behaviors? What corrective actions can we recommend to change the conditional environment in a way that encourages the behaviors we want to see?
Maybe I’m making too much of this – but I’ve incorporated this new way of thinking about culture into my investigations and classes and the feedback has been very positive. Give it some thought and let me know what you think…
Brian Hughes, 4/17/09
]]>
According to the Privacy Rights Clearinghouse there has been 255 million security record breaches involving sensitive personal information in the U.S. since 2005. 26 million personal medical records were breached from the U.S. Department of Veterans Affairs in 2006. In 2005, CardSystems suffered the breach of 40 million credit card holder's personal account information and in January 2007 TJ Stores (TJX) suffered the breach of over 46 million credit and debit card holders account information. The TJX incident alone impacted 100 million accounts to the tune of $94 million dollars. It has been estimated that the exposure of 46,000 records of sensitive personal information costs an organization on average - $76 million.
So what is being done to protect the public? There is a dizzying array of reports on the topic - each offering various solutions to safeguard sensitive data. There seems to be a reliance on the statistical analyses of industry trends to drive solutions much like a Pareto Analysis. Information from actual data security breaches have been categorized by business sector, type of data breached, proportion attributed to malicious acts, theft, hacking, careless/untrained employee so on and so on. Solutions are then recommended based on the trend data exhibiting the highest percentages or greatest threats.
Is implementing solutions based on industry trends a proactive way to solve the problem of data security breaches or for that fact any problem? Well....maybe. It depends on how you use the knowledge. If you simply use recommended solutions without understanding what the actual causes of your own internal problems are you are certainly taking a chance that the borrowed solutions may not be addressing the causes of your own problems. In other words if the solutions are ineffective you remain vulnerable to the consequences of another breach. When you blindly accept solutions from external sources, the risk to you comes from assuming that 1) the borrowed solution ideas are controlling known causes of other problems, and 2) your problems are caused by the same things experience by others. Solutions generated from data categories used in trending are less likely to be effective than solutions that that are controlling specifically defined individual causes.
However, industry trend data can be useful if you use it in conjunction with your own RCAs to direct the search for causes within your own system boundaries. Industry trend data can also help an RCA team decide if it has been diligent enough in the depth of its analysis. When conducting a multiple event analysis (i.e., an analysis of multiple RCAs used to identify common causes among multiple mutually exclusive problems) industry trends can help bring focus to the areas in which common causes might be found.
What are your thoughts - is implementing solutions based on industry trends a proactive way to solve existing problems?
Sincerely,
Mark Hall, Account Manager
Apollo Associated Services
Note: The organizations referred to in this article are not clients of Apollo Associated Services, LLC.
Brian Hughes, Vice President - Apollo Associated Services - Posted 18 February 2009
*Note: Peanut Corporation of America is not a client of Apollo Associated Services, LLC or Artemis Investigations LLC.

As the cartoon created by Sid Harris depicts, risk assessment (RA) is about quantifying and describing a hazard which is perceived to have the potential ability to cause harm to those exposed to it should the hazard be released. Risk management is the actions taken to eliminate or mitigate harmful consequences associated with that risk.
In the paper we describe how root cause analysis (RCA) can benefit the overall risk management process by helping define and quantify risk, understand the causes of risk, and identify effective risk management actions.
One of our premises for this paper is that risk assessment focuses on anticipating events and RCA focuses on reacting to them. Because of this attitude RCA and risk assessment are separate programs led by people from different disciplines with separate training and different tool kits.
It is our belief that RCA is fundamentally designed to minimize or eliminate risk by reactively and proactively solving problems and removing causes that contribute to risk.
What are your thoughts?
1) Are Risk Assessment and RCA separate entities in your organization?
2) Do you think RCA can be leveraged to help improve the overall Risk Management process in your organization?
3) What barriers would have to be removed to more effectively integrate RCA and Risk Management functions?
Or -
Do you think RCA, used proactively on identified hazards, could simply replace the effort duplicated by the risk assessment / risk management process and one could simply rely on hazard analysis and RCA to management risk?
Sincerely,
Mark Hall
Author: Brian Hughes, Apollo Vice President
]]>