Most teams don’t struggle to find problems, they struggle to figure out which problems actually matter. You run a root cause analysis, generate a long list of potential contributors, and then waste resources chasing issues that barely move the needle. That’s exactly where Pareto chart root cause analysis changes the game, giving you a visual, data-backed way to separate the critical few from the trivial many.
The concept is grounded in the 80/20 principle: roughly 80% of consequences stem from 20% of causes. A Pareto chart makes this ratio visible so you can direct your improvement efforts where they’ll generate the greatest return. At Lean Six Sigma Experts, we’ve used this tool across manufacturing floors, corporate operations, and multi-site programs since 2011, and it remains one of the most reliable prioritization methods in our engineering-based consulting and training toolkit.
This guide walks you through the step-by-step process of building a Pareto chart, applying it to root cause analysis, and using the results to prioritize action. Whether you’re an operations manager tackling defect reduction or a Green Belt working through a DMAIC project, you’ll leave with a practical framework you can put to work immediately.
What a Pareto chart does in root cause analysis
A Pareto chart is a bar chart ranked in descending order, combined with a cumulative percentage line that runs across the top. Each bar represents a category of defects, failures, complaints, or whatever problem type you’re investigating. The chart’s power in pareto chart root cause analysis comes from making the relative weight of each cause immediately visible, so you stop guessing and start directing resources where they’ll produce the greatest impact.
A Pareto chart doesn’t find root causes for you. It tells you which causes deserve your investigation time first.
How the chart structure directs your focus
The tallest bar on the left always represents your most frequent or most costly problem category. As you move right, bar heights drop and the cumulative line climbs toward 100%. Your job is to find the point where the cumulative line crosses approximately 80%, because the bars sitting to the left of that threshold are your critical few. Everything to the right still deserves a note, but those categories rarely justify the same level of resource commitment during your initial investigation.

A practical example: if you’re analyzing customer complaint data across five categories and two categories account for 78% of all complaints, that’s where your root cause investigation should concentrate first. Chasing the remaining three categories would be like fixing the guest bathroom while the kitchen is flooding.
Where Pareto analysis fits in DMAIC
In a DMAIC project, you typically build a Pareto chart during the Measure or Analyze phase. You use it to narrow a broad problem statement down to the specific sub-categories driving the most variation or loss. This prevents your team from spreading investigation effort thin across every possible contributor when only a fraction of them are responsible for the bulk of the damage.
Teams that skip this step often spend weeks analyzing causes that together represent less than 15% of the total problem. The chart forces discipline: it converts a long, flat list of issues into a ranked hierarchy with a clear visual cutoff. That cutoff becomes your starting point for deeper root cause tools like fishbone diagrams or 5 Whys analysis.
Step 1. Define the problem, scope, and categories
Before you build anything, you need a clear problem statement and a defined scope. Vague problems produce vague charts. If you’re investigating defects, specify the product line, the time window, and the location. If you’re analyzing complaints, define the customer segment and the channel. Without these boundaries, your categories will overlap and your Pareto chart root cause analysis will scatter your team’s focus across issues that may not even share the same root cause.
How to write a problem statement that holds up
Your problem statement should answer three things: what is failing, where it’s failing, and how you’re measuring it. Avoid statements like "quality is bad" or "customers are unhappy." Instead, write: "Defects on Line 3 increased by 22% in Q1 2026, measured by units rejected at final inspection." That level of specificity tells you exactly what data to collect in the next step.
A good problem statement eliminates ambiguity before data collection starts, not after.
Choosing your categories before you collect data
Pick your defect or failure categories before you start counting, not after. Defining categories after the fact introduces bias because you may unconsciously group data to match a preferred narrative. A solid starting point is to list every known failure type from your quality records, maintenance logs, or complaint database. Then consolidate any category representing less than 2% of total occurrences into an "Other" bucket to keep the chart clean. Aim for five to eight categories total.
| Category Element | Example |
|---|---|
| Defect Type | Surface scratch, incorrect dimension, missing component |
| Process Source | Assembly, incoming inspection, packaging |
| Measurement Basis | Count, cost, or hours of downtime |
Step 2. Collect clean data and tally it correctly
Clean data is non-negotiable in pareto chart root cause analysis. If you feed noisy, inconsistent, or incomplete records into your chart, the bars will mislead you and you’ll prioritize the wrong problems. Before you start tallying, confirm that your data source is consistent, meaning every record uses the same category definitions you established in Step 1.
Garbage data produces a confident-looking chart that points you in the wrong direction.
Choose the right data source and timeframe
Your data source should be operationally linked to the problem you defined. Quality inspection logs, warranty claims, help desk tickets, and maintenance work orders are all valid sources depending on your process. Pull data from a single, defined time window, typically 30 to 90 days, to avoid seasonal variation distorting your category totals. If you pull six months of data but your process changed mid-period, split the data sets and analyze each one separately.
| Data Source | Best For |
|---|---|
| Inspection logs | Manufacturing defects |
| Customer complaint records | Service quality issues |
| Maintenance work orders | Equipment failure analysis |
| Help desk tickets | IT or process errors |
Tally counts by category accurately
Once you have your raw data, count every occurrence and assign it to exactly one category. If a record could fit two categories, define a tiebreaker rule before you start, not midway through. Total all counts, then calculate the percentage each category represents out of the full count. This percentage column is what drives the chart and the cumulative line in Step 3.
Step 3. Build the Pareto chart and cumulative line
With your tallied data in hand, you’re ready to assemble the chart. The build process follows a fixed sequence: sort, calculate cumulative percentages, then plot. Skipping or reordering any step will break the visual logic that makes pareto chart root cause analysis effective.
Sort and calculate before you plot
Sort your categories from highest count to lowest before opening any charting tool. Then add two new columns: a running total and a cumulative percentage. The cumulative percentage for each row equals the running total divided by the grand total, multiplied by 100. Here’s what that looks like with a simple defect dataset:
| Category | Count | Running Total | Cumulative % |
|---|---|---|---|
| Surface scratch | 142 | 142 | 47.3% |
| Wrong dimension | 88 | 230 | 76.7% |
| Missing component | 41 | 271 | 90.3% |
| Discoloration | 19 | 290 | 96.7% |
| Other | 10 | 300 | 100% |
Plot the bars and overlay the cumulative line
Use a dual-axis chart: the left y-axis tracks count, the right y-axis tracks cumulative percentage from 0% to 100%. Plot each category as a vertical bar in sorted order, then overlay the cumulative percentage as a line starting at the top of the first bar and ending at 100% above the last.

Draw a horizontal reference line at 80% across the cumulative axis to make your cutoff point immediately visible to everyone reading the chart.
Add a clear chart title that includes the problem category, process location, and data timeframe. Anyone who picks up the chart without context should understand exactly what they’re looking at within ten seconds.
Step 4. Read the chart, find root causes, and act
Your chart is built, but the work isn’t finished yet. Reading it correctly is what transforms a visual summary into a focused action plan. Locate the point where your cumulative line crosses 80% and draw a vertical line down to the x-axis. Every category to the left of that line belongs on your priority list for root cause investigation.
Identify your critical few categories
The categories inside your 80% threshold are your primary targets. In the example from Step 3, surface scratches and wrong dimensions together account for 76.7% of total defects. You don’t need to solve all five categories at once; solve those two first. Use the table below to match each critical category to the right investigation tool:
| Category Type | Recommended Root Cause Tool |
|---|---|
| Recurring defect type | Fishbone (Ishikawa) diagram |
| Process step failure | 5 Whys analysis |
| Equipment-related failure | Failure Mode and Effects Analysis (FMEA) |
| Human error pattern | Process observation and error-proofing review |
Connect findings to corrective actions
Once your root cause tools confirm the underlying driver, document a corrective action with an owner, a deadline, and a measurable success criterion. This is precisely where pareto chart root cause analysis pays off, because you’re directing your team’s time and budget toward verified, high-impact problems rather than assumptions.
Verify your corrective actions by re-running the Pareto chart after 30 to 60 days to confirm the targeted categories have dropped in frequency.

Wrap it up and keep the gains
A Pareto chart root cause analysis gives you a repeatable, data-backed system for cutting through noise and directing your team toward the problems that actually drive results. You define the scope, collect clean data, build the chart, and use the 80/20 cutoff to decide where root cause investigation belongs before a single hour of analysis time gets spent.
The gains only stick if you re-run the analysis after your corrective actions go live. Pull fresh data 30 to 60 days after implementation and rebuild the chart from scratch. If the critical categories from your first pass have dropped significantly, your fixes are working. If they haven’t moved, the updated chart tells you exactly where to look next.
Process improvement compounds when you make this a habit, not a one-time exercise. If you want structured support applying these methods inside your organization, connect with our Lean Six Sigma consulting team to get started.
