Accuracy Benchmarks
Published precision and recall data for VertaaUX audit findings, measured against expert-labeled ground truth.
Looking for our methodology? See our Accuracy page.
92.4%
Precision
87.5%
Recall
89.8%
F1 Score
84.3%
Precision
79.4%
Recall
81.6%
F1 Score
92.2%
Precision
89.3%
Recall
90.5%
F1 Score
Accessibility (WCAG)
Precision and recall for WCAG-related audit findings, measured against expert-labeled ground truth datasets. These benchmarks cover core accessibility checks including contrast, semantics, and interaction patterns.
Each benchmark entry is evaluated using independent expert review. See individual entry tooltips for methodology details.
UX Heuristics
Precision and recall for UX heuristic violations detected by the audit engine. These checks cover layout, typography, visual hierarchy, and interaction state quality assessed against expert UX review.
Each benchmark entry is evaluated using independent expert review. See individual entry tooltips for methodology details.
Performance UX
Precision and recall for performance-related UX findings including layout shift, paint timing, and interaction responsiveness. Benchmarked against Chrome DevTools and WebPageTest ground truth data.
Each benchmark entry is evaluated using independent expert review. See individual entry tooltips for methodology details.
Transparent accuracy, verifiable results
We publish our accuracy data because trust is earned through transparency. See how we measure, or try an audit yourself.