Which errors dominate
Over 40%
of failures fall into two categories: SQL Syntax (~44%) and Name Resolution (~36%)
What’s actually expensive
<0.2%
of errors (Resources + Cancellation) drive 83%+ of wasted slot-hours
How to reduce failures fast
Practical controls to stop retry storms, catch issues earlier, and protect warehouse capacity in production
Who it’s for
Data Engineers
Fewer broken pipelines + faster root cause
Platform / Heads of Data
Prioritize reliability work by impact, not noise
Data FinOps
Identify which failures waste slots vs. scans—and what to fix first


