Relevant data was available for forty studies (2,104 participants, range 10 to 145). The mean quality score was 57.4 (standard deviation 12.3, range 35 to 87). Internal validity problems included: small sample size (60% studies); lack of independent randomisation (70% studies); lack of treatment fidelity assessment (80% studies) and group allocation masking (73% studies). More recent studies tended to be larger and of higher quality. Mean attrition rate was 11.0% (range 0% to 47.5%). Drop-out was higher than 15% in 12 studies. There was no evidence for publication bias. If a meta-analysis showed significant heterogeneity, then p for heterogeneity was reported. Results were mostly reported post-treatment with few after follow-up.
Cognitive remediation therapy had a significant beneficial effect on global cognition (d 0.448, 95% CI 0.306 to 0.590; 38 studies; p= 0.00 for heterogeneity). The effect was still significant at follow-up (d 0.428, 95% CI 0.184 to 0.671; 11 studies).
Cognitive domains: There were significant benefits for: attention/vigilance (d 0.250, 95% CI 0.080 to 0.419; 16 studies); speed of processing (d 0.258, 95% CI 0.072 to 0.445; 24 studies; p=0.000 for heterogeneity); verbal working memory (d 0.346, 95% CI 0.186 to 0.506; 20 studies); verbal learning and memory (d 0.410, 95% CI 0.273 to 0.548; 23 studies); reasoning/problem solving (d 0.572, 95% CI 0.222 to 0.922; 25 studies; p=0.00 for heterogeneity); and social cognition (d 0.651, 95% CI 0.331 to 0.972; seven studies; p=0.03 for heterogeneity); but with no significant effect on visual learning and memory (10 studies; p=0.05 for heterogeneity).
Specific cognitive tests: There were significant benefits for Digit Span (d 0.422, 95% CI 0.226 to 0.617; 13 studies), the Trail Making Test (d 0.319, 95% CI 0.077 to 0.560; 13 studies; p=0.03 for heterogeneity) and the Wisconsin Card Sorting Test (d 0.335, 95% CI 0.188 to 0.481; 13 studies). There was no significant effect for the Continuous Performance Test (10 studies).
Other outcomes: There were significant benefits for: symptoms (d 0.177, 95% CI 0.034 to 0.321; 20 studies); and functioning (d 0.418, 95% CI 0.216 to 0.620; 19 studies; p=0.003 for heterogeneity). At follow-up the effect was still significant for function (12 studies) but not for symptoms (eight studies).
Mediator and moderator effects were reported. Excluding individual studies did not affect the overall effects. There was significant evidence for an increased effect on psychosocial functioning for interventions with adjunctive psychiatric remediation (d 0.59, 95% CI 0.30 to 0.88) and for those with a strategic approach.