Twenty-four diagnostic accuracy studies (2,278 procedures) were included in the review. Twenty were prospective and the others were retrospective.
The mean quality score of all the included studies was 7.4 (range: 3 to 11). The number of failed procedures ranged from 0 to 23%.
The 24 included studies were not pooled due to statistical heterogeneity. However, the data were pooled for the diagnosis of normal versus abnormal uterine cavity for the following subgroups of studies: where the 'gold' standard was hysterectomy (2 studies, 96 patients); where verification bias had been avoided (16 studies, 877 patients); where there had been independent interpretation of the reference test (8 studies, 562 patients); where a retrospective design had been used (4 studies, 151 patients); and where women had been included due to failed medical treatment (2 studies, 130 patients).
For studies where the 'gold' standard was hysterectomy, the positive LR was 16.8 (95% confidence interval, CI: 5.6, 50.7), the negative LR 0.05 (95% CI: 0.01, 0.19), the positive post-test probability 0.93 (95% CI: 0.86, 1.00), and the negative post-test probability 0.04 (95% CI: -0.01, 0.09).
For studies where verification bias had been avoided, the positive LR was 8.23 (95% CI: 6.22, 10.9), the negative LR 0.06 (95% CI: 0.04, 0.09), the positive post-test probability 0.91 (95% CI: 0.89, 0.94), and the negative post-test probability 0.07 (95% CI: 0.04, 0.10).
For the diagnosis of endometrial polyps (15 studies), the positive LR was 5.23 (95% CI: 3.98, 6.90) and the negative LR 0.12 (95% CI: -0.08, 0.17); the post-test probability was not reported.
For the diagnosis of intra-uterine fibroids (15 studies): the positive LR was 11.0 (95% CI: 6.86, 17.6) and the negative LR was 0.07 (95% CI: 0.03, 0.11); the post-test probability was not reported.