Unweighted and weighted mean correlation were estimated using 39 studies (62 oximeter trials). Bias and precision estimates were calculated using 23 studies (82 oximeter trials).
The quality rating ranged from 3 to 11 (mean 8.0, SD 1.75). Lower ratings were due to a failure to report the reliability of criterion variable and a failure to conduct cross validation or include reliability estimates. Based on 39 studies (62 trials), 69% of oximeter trials tested finger probes, 23.7% ear lobes, 4.1% multiple probes and 0.6% forehead probes. Most of the participants were healthy adult volunteers (25.7%).
The mean correlation was 0.910 (variance 0.011) unweighted and 0.895 (variance 0.014) weighted. The absolute mean bias was 1.99% (SD=0.23). The variance caused by sampling error failed to account for most (75%) of the overall observed variance, and was considered to indicate that the correlations were not constant across the studies.
The mean correlation for studies scoring a quality rating of 9 (22 studies, 43 oximeter trials) was 0.908 (variance 0.011) unweighted and 0.883 (variance 0.016) weighted.
The correlation was 0.957 for healthy adult volunteers (13 studies, 318 participants, 32 oximeter trials), 0.950 for anaesthetised patients (1 study, 34 patients, 1 oximeter trial), 0.948 for athletes (2 studies, 21 athletes, 2 oximeter trials), 0.930 for thoracic surgical patients (2 studies, 15 patients, 2 oximeter trials), 0.904 for cardiac surgical patients (2 studies, 72 patients, 2 oximeter trials), 0.880 for respiratory patients (8 studies, 558 patients, 11 oximeter trials), and 0.760 for critically ill or intensive care unit patients (8 studies, 329 patients, 8 oximeter trials).
The mean correlation by probe location was: for the ear (3 studies, 3 oximeters), 0.938 (variance 0.002) unweighted and 0.934 (variance 0.001) weighted; and for the finger (3 studies, 3 oximeters), 0.963 (variance 0.001) unweighted and 0.967 (variance 0.001; P<0.0001) weighted.
The mean correlations for factors affecting pulse oximeter accuracy were as follows.
Hypoxia (5 studies, 15 oximeter trials; oxygen saturation: 67.6 to 87.8%): 0.924 (variance 0.008) unweighted and 0.938 (variance 0.006) weighted. The data showed substantial differences in bias and precision estimates between pulse oximeters at low saturation, the most common being an underestimation of saturation and falling precision.
Perfusion(3 studies, 3 oximeter trials): 0.717 (variance 0.049) unweighted and 0.582 (variance 0.004) weighted.
Dyshaemoglobinaemia (5 studies, 6 oximeter trials; carboxyhaemoglobin: 5.87 to 9.10%): 0.817 (variance 0.028) unweighted and 0.717 (variance 0.035) weighted. There was a tendency to overestimate oxygen saturation with increasing carboxyhaemoglobin.
Temperature (3 studies, 3 oximeter trials; mean temperature: 28.6 to 34.8 degrees C): 0.760 (variance 0.043) unweighted and 0.665 (variance 0.024) weighted.
Skin pigment (1 study, 2 oximeter trials): 0.800 (variance 0.0002) unweighted and 0.800 (variance 0.0002) weighted.
Hyperbilirubinaemia (1 study, 1 oximeter trial; serum bilirubin: 2.7 to 35 mg/100 mL): 0.850 unweighted. Oxygen saturation was significantly underestimated.
The 21 pulse oximeters used in the studies were ranked by correlation with oxygen saturation. The correlations ranged from 0.986 for the Datascope Accusat finger/flex probe (2 oximeter trials, 25 participants; 245 data points) to 0.591 for the Ohmeda BIOX 111 finger/multiple probe (2 oximeter trials, 76 participants; 464 data points).