Process OptimizationIntermediateJanuary 18, 202620 min read

Solder Defect Root Cause Analysis Using SPI and AOI Data

A practical methodology for correlating solder paste inspection and automated optical inspection data to identify root causes of solder defects, implement corrective actions, and prevent recurrence.

Key Takeaways

  • 60–70% of post-reflow solder defects can be traced back to paste printing issues detectable by SPI
  • Correlating SPI data (pre-reflow) with AOI data (post-reflow) creates a powerful diagnostic tool for identifying systematic process issues
  • A structured defect categorization framework prevents misdiagnosis and ensures corrective actions target the true root cause
  • The Pareto principle applies strongly: typically 3–5 root causes account for 80% of all solder defects

1. Introduction — The Value of Data-Driven Root Cause Analysis

When solder defects appear at post-reflow AOI or functional test, the natural reaction is to ask “what went wrong?” Too often, the answer is based on intuition or incomplete information, leading to corrective actions that don't address the real problem.

Modern SPI and AOI systems generate rich datasets that, when properly correlated, can pinpoint defect root causes with far greater accuracy than visual observation alone. The key insight is that SPI data captures the state of the process before reflow, while AOI captures the result after reflow. Comparing these two datasets reveals whether a defect originated at printing, placement, or reflow.

What Is Root Cause Analysis?

Root cause analysis (RCA) is a systematic method of identifying the fundamental reason a defect occurred, rather than just treating the symptom. For solder defects in electronics manufacturing, RCA involves:

  • Accurately categorizing the defect type
  • Collecting relevant data from all process stages
  • Analyzing potential contributing factors
  • Identifying the most likely root cause through data correlation
  • Implementing and validating corrective actions

2. Defect Categorization Framework

Accurate defect categorization is the essential first step. Misclassifying a defect leads to investigating the wrong root cause. Use the following framework to systematically categorize solder defects:

Category 1: Paste Volume Defects

These defects relate to the amount of solder paste deposited and are primarily detectable by SPI.

DefectSPI IndicatorAOI IndicatorPrimary Root Cause
Insufficient solderLow volume (<60%)Thin or incomplete filletClogged aperture, worn stencil, low pressure
Excess solderHigh volume (>180%)Solder balls, tombstoningStencil gasketing failure, excess paste on stencil
No solderZero or near-zero volumeMissing or open jointCompletely clogged aperture, stencil damage

Category 2: Paste Position Defects

These defects relate to misalignment between paste deposits and pads.

DefectSPI IndicatorAOI IndicatorPrimary Root Cause
Offset (global)Uniform X/Y shift all padsComponent misalignmentStencil-to-board alignment error
Offset (local)Offset in specific area onlyDefects in one board zoneBoard warpage, local stencil distortion
Rotation errorOffset increases with distance from centerCorner components affected mostFiducial recognition error, stencil theta misalignment

Category 3: Bridging Defects

These defects involve unintended solder connections between adjacent pads.

DefectSPI IndicatorAOI IndicatorPrimary Root Cause
Paste bridgePaste between pads detectedSolder bridgePaste slump, stencil aperture issues, excess volume
Solder bridge (no paste bridge)Normal paste at SPISolder bridge detectedComponent placement force, reflow profile, pad design

Category 4: Reflow-Related Defects

These defects appear at AOI but have no corresponding SPI anomaly, indicating the defect was introduced after printing.

DefectSPI IndicatorAOI IndicatorPrimary Root Cause
TombstoningImbalanced volume between pads of same componentComponent standing on endUnequal paste volume, unequal pad sizes, reflow profile
Head-in-pillowNormal paste depositsIncomplete wetting on BGABoard warpage during reflow, oxidation
VoidingOften normal at SPIX-ray required for detectionPaste chemistry, reflow profile, pad finish

3. Correlating SPI and AOI Data

The power of root cause analysis comes from comparing data at the same pad location across both inspection stages. This requires board-level traceability.

Prerequisites for Data Correlation

  • Unique board identification: Each board must have a unique barcode or serial number read by both SPI and AOI
  • Common coordinate system: Both systems must use the same pad reference designators (from the same CAD data)
  • Data export format: Both systems should export to a common format (CSV, XML, or through an MES database)
  • Time synchronization: Timestamps on both systems should be synchronized for accurate process-time correlation

Correlation Analysis Method

For each AOI defect, look up the corresponding SPI data for the same pad on the same board. Classify the correlation into one of four outcomes:

OutcomeSPI ResultAOI ResultInterpretation
True positiveDefect flaggedDefect confirmedSPI correctly predicted the defect — printing root cause
SPI escapeNo defect flaggedDefect foundEither SPI thresholds need tightening OR defect introduced after SPI
False callDefect flaggedNo defectSPI thresholds too tight or measuring artifact — the reflow process “healed” the anomaly
True negativeNo defectNo defectNormal — no action needed

Key Insight

The most valuable root cause information comes from “SPI escape” cases. When AOI finds a defect that SPI missed, determine whether the SPI data was actually within limits (true escape, requiring threshold adjustment) or whether the defect was introduced after printing (placement or reflow issue, requiring process investigation).

4. Fishbone Diagram Analysis for Common Defects

The fishbone (Ishikawa) diagram is a standard tool for organizing potential root causes into categories. For solder defects in SMT, the six standard categories map to:

Six M Categories for Solder Defects

CategoryFactors to Investigate
MachinePrinter alignment accuracy, squeegee condition, SPI calibration, AOI lighting, reflow oven profile uniformity
MaterialPaste age/viscosity, paste lot variation, PCB pad finish (HASL/ENIG/OSP), solder mask quality, stencil condition
MethodPrint parameters (speed, pressure, separation), reflow profile, cleaning frequency, stencil wiping method
MeasurementSPI thresholds, AOI sensitivity settings, Gage R&R, reference standard validity
Man (People)Operator training level, shift-to-shift variation, manual handling practices, verification procedures
Mother Nature (Environment)Temperature, humidity, vibration, air flow near printer, ESD protection

Example: Fishbone for Insufficient Solder on Fine-Pitch QFN

When insufficient solder is detected on QFN peripheral pads, a typical fishbone analysis reveals these high-probability root causes (listed by category):

  • Machine: Squeegee blade worn or nicked at the QFN location; printer vision not compensating for local board warp
  • Material: Stencil apertures partially clogged (check print count since last cleaning); paste viscosity too high (age or temperature)
  • Method: Separation speed too fast for small aperture aspect ratio; insufficient stencil cleaning frequency
  • Measurement: SPI volume threshold too loose, not catching gradual volume decline

5. Data Analysis Techniques

Pareto Analysis

Start every root cause investigation with a Pareto chart of defects by type, location, and frequency. The 80/20 rule almost always applies:

  1. Export all AOI defects for the investigation period (minimum 1 week of production)
  2. Sort by defect type and count occurrences
  3. Create a Pareto chart — the top 3–5 defect types typically account for 80%+ of all defects
  4. Focus root cause investigation on the top defect types first

Spatial Analysis (Heat Maps)

Plot defect locations on a board map to identify spatial patterns:

  • Uniform distribution: Suggests a global process issue (paste viscosity, printer settings)
  • Edge concentration: Indicates stencil gasketing issues or board support problems
  • Corner concentration: Points to board warpage or stencil alignment rotation error
  • Single-component clustering: Suggests component-specific issue (pad design, aperture design, thermal mass)

Trend Analysis

Plot SPI volume or offset data over time (by board sequence number) to identify trends:

  • Gradual decline: Aperture clogging, squeegee wear, paste aging
  • Step change: Process parameter change, new paste lot, stencil change
  • Cyclical pattern: Environmental variation (temperature/humidity cycles), cleaning cycle effects
  • Random variation: Normal process noise — may not require action if within limits

Process Capability Analysis

Calculate Cpk for critical SPI measurements to quantify process capability:

  • Cpk ≥ 1.67: Excellent — process well-centered and capable
  • 1.33 ≤ Cpk < 1.67: Acceptable — monitor for drift
  • 1.0 ≤ Cpk < 1.33: Marginal — improvement needed, defects likely
  • Cpk < 1.0: Incapable — process is producing defects, immediate action required

6. Corrective Action Procedures

Once the root cause is identified, implement corrective actions using the following structured approach:

Immediate Containment (within hours)

  1. Tighten SPI thresholds on the affected pad group to prevent further defective boards from proceeding
  2. If the defect rate is high (>1%), consider stopping production until the root cause is confirmed
  3. Inspect and quarantine any boards produced between the last known good board and defect detection

Root Cause Correction (within days)

  1. Implement the fix targeting the identified root cause (e.g., replace damaged stencil, adjust print parameters, increase cleaning frequency)
  2. Run a validation lot (minimum 25 boards) and verify defect rate drops to acceptable levels
  3. Monitor SPI and AOI data for the next 100 boards to confirm sustained improvement

Systemic Prevention (within weeks)

  1. Update preventive maintenance procedures to address the root cause (e.g., more frequent stencil inspection, paste viscosity checks)
  2. Adjust SPI program thresholds based on the new process understanding
  3. Update operator training materials if the issue was related to handling or setup
  4. Consider implementing closed-loop feedback if the root cause was printer drift (see our companion application note on Closed-Loop SPI-Printer Feedback)

Tip

Document every root cause investigation and its outcome, even for minor issues. This builds an institutional knowledge base that accelerates future investigations. Over time, your team will recognize patterns and resolve new issues faster.

7. Prevention and Continuous Improvement

The goal of root cause analysis is not just to fix today's problem but to prevent tomorrow's. Build these practices into your standard operating procedures:

Daily Monitoring

  • Review SPI volume SPC charts at shift start and shift end
  • Check AOI defect Pareto from the previous shift
  • Verify SPI-AOI correlation rate (what percentage of AOI defects were predicted by SPI)

Weekly Review

  • Run Pareto analysis on all defects from the past week
  • Compare against previous weeks to identify emerging trends
  • Review any boards that had SPI calls but passed AOI (false call analysis)
  • Update SPI thresholds if process capability has shifted

Monthly Deep Dive

  • Full SPI-AOI correlation study on a representative product
  • Cpk analysis on critical components
  • Stencil inspection and aperture measurement on high-volume programs
  • Review and update the root cause investigation log

8. Worked Examples

Example 1: Intermittent Insufficient Solder on 0402 Passives

Observation: AOI reports insufficient solder on 0402 resistors, affecting approximately 0.3% of pads. Defects are scattered across the board with no clear spatial pattern.

SPI correlation: 80% of affected pads showed SPI volume in the 55–65% range (borderline low). 20% showed normal volume at SPI but defective joint at AOI.

Investigation:

  • Trend analysis showed a gradual volume decline over 50 boards, correlating with print count since last stencil cleaning
  • Paste age was 6 hours (within spec but at the upper end of the recommended window)
  • Stencil inspection revealed partial clogging on approximately 5% of 0402 apertures

Root cause: Aperture clogging rate was exceeding the cleaning frequency. As paste aged, viscosity increased, accelerating clogging.

Corrective actions:

  1. Increased stencil under-wipe frequency from every 10 prints to every 5 prints
  2. Reduced maximum paste age on the stencil from 8 hours to 5 hours
  3. Tightened SPI volume threshold for 0402s from <50% to <60%
  4. Result: Insufficient solder rate dropped from 0.3% to 0.02%

Example 2: Solder Bridging on 0.5mm Pitch QFP

Observation: AOI reports solder bridges on QFP pins, always between pins 25–30 on one specific QFP. Defect occurs on approximately 1 in 20 boards.

SPI correlation: SPI showed elevated volume (140–160%) on the affected pins on all boards, but only flagged boards exceeding the 180% threshold. No paste bridges detected at SPI.

Investigation:

  • Stencil inspection revealed slight aperture deformation on pins 25–30 (apertures were 15% wider than design)
  • The excess paste volume was within SPI limits but, combined with component placement force, was sufficient to cause bridging during reflow on some boards

Root cause: Stencil damage from handling, creating oversized apertures on specific pins.

Corrective actions:

  1. Replaced the stencil
  2. Added a component-specific volume upper limit of 150% for the QFP pins
  3. Implemented stencil aperture measurement during incoming inspection
  4. Result: Bridging eliminated

Related ASC Products

ASC International offers both 3D SPI and AOI systems with integrated data export capabilities, enabling the SPI-AOI correlation analysis described in this application note.

Published by

ASC International Applications Engineering

Last updated: January 18, 2026

Need Help With Defect Analysis?

Our applications engineers can help you set up SPI-AOI data correlation, conduct root cause investigations, and implement corrective actions in your production environment.

Contact Our Engineers →