Table 3

Triage levels assigned to each clinical-vignette, where safe is defined as maximum one level less conservative than gold-standard, expressed per vignette provided with advice.

App/ tested GPPercentage of safe adviceP value (difference to GP mean)
Ada97.0NS
Babylon95.1NS
Buoy80.0<0.001*
K Health81.3<0.001*
Mediktor87.31.3×10–3*
Symptomate97.8NS
Your.MD92.6NS
App mean±SD.90.1±7.4
GP mean±SD.97.0±2.5
GP196.0NS
GP296.9NS
GP394.0NS
GP499.0NS
GP5100.0NS
GP693.9NS
GP799.5NS
  • *P<0.05. For two of these apps (K Health & Your.MD), one app-entry-Dr (#4) did not record all screenshots needed for source data verification—see online supplemental table 6 for a subanalysis of fully verified data, which shows the same trend of results and no significant difference to the data recorded here). This analysis is for those vignettes for which urgency advice was provided (ie, a ‘provided answer) analysis.

  • GP, general practitioner; NS, no significant difference.