The Consensus on Exercise Reporting Template (CERT) in a systematic review of exercise-based rehabilitation effectiveness: completeness of reporting, rater agreement, and utility

E. Jean Hay-Smith, Kadri Englas, Chantale Dumoulin, Cristine H. Ferreira, Helena Frawley, Mark Weatherall

Research output: Contribution to journalArticleResearchpeer-review

Abstract

BACKGROUND: Rehabilitation interventions are diverse - making decisions about pooling data in meta-analyses challenging. Intervention reporting templates such as the Consensus on Exercise Reporting Template (CERT) may help reviewers document intervention variability. AIM: To assess inter-rater agreement and utility of CERT used to assess completeness of reporting of one rehabilitation exercise intervention: pelvic floor muscle training (PFMT). DESIGN: A non-experimental agreement study. SETTING: Update of the Cochrane systematic review comparing different approaches to PFMT for urinary incontinence in women. POPULATION: Two PFMT arms from 21 newly identified trials. METHODS: Five raters independently used CERT to assess sufficiency of reporting of each arm (experimental PFMT and control PFMT) of each trial. One rater, PFMT non-expert, rated all trials. Four raters, all PFMT experts, assessed a mutually exclusive subgroup of the trials. In addition to rating sufficiency - "Yes" compared to No" or "Uncertain" - raters also reported on CERT utility. Expert ratings were used to determine the proportion of CERT items rated as sufficiently reported. Rater agreement was estimated using coefficient kappa and McNemar's test. RESULTS: The range of CERT items rated as sufficiently reported was 0 to 15 of 19 items, and the mean for both trial arms was 5.5. For agreement, 11 of 19 items had sufficient data to estimate coefficient kappa and only 3 of 11 had a kappa >0.4 (moderate agreement). From the 12 of 19 items for which McNemar's test could be performed, five had evidence that PFMT experts more often rated the reporting as sufficient than the non-expert. Raters reported the CERT template was comprehensive but not complete and needed contextualizing for PFMT. CONCLUSIONS: Completeness of reporting was poor for this example of a rehabilitation exercise intervention, and equally poor in both trial arms. Inter-rater agreement of completeness of reporting was also poor. Using a data extraction tool with poor rater-agreement may add unnecessary burden in a review. However, using a data extraction tool that enables assessment of intervention homogeneity has benefits in making decisions about which data to pool or not. CLINICAL REHABILITATION IMPACT: Researchers reporting clinical trials must pay more attention to completeness of rehabilitation exercise reporting.

Original languageEnglish
Pages (from-to)342-352
Number of pages11
JournalEuropean Journal of Physical and Rehabilitation Medicine
Volume55
Issue number3
DOIs
Publication statusPublished - 1 Jun 2019

Cite this

@article{6ede7ca3e88a447d89190987ea06c521,
title = "The Consensus on Exercise Reporting Template (CERT) in a systematic review of exercise-based rehabilitation effectiveness: completeness of reporting, rater agreement, and utility",
abstract = "BACKGROUND: Rehabilitation interventions are diverse - making decisions about pooling data in meta-analyses challenging. Intervention reporting templates such as the Consensus on Exercise Reporting Template (CERT) may help reviewers document intervention variability. AIM: To assess inter-rater agreement and utility of CERT used to assess completeness of reporting of one rehabilitation exercise intervention: pelvic floor muscle training (PFMT). DESIGN: A non-experimental agreement study. SETTING: Update of the Cochrane systematic review comparing different approaches to PFMT for urinary incontinence in women. POPULATION: Two PFMT arms from 21 newly identified trials. METHODS: Five raters independently used CERT to assess sufficiency of reporting of each arm (experimental PFMT and control PFMT) of each trial. One rater, PFMT non-expert, rated all trials. Four raters, all PFMT experts, assessed a mutually exclusive subgroup of the trials. In addition to rating sufficiency - {"}Yes{"} compared to No{"} or {"}Uncertain{"} - raters also reported on CERT utility. Expert ratings were used to determine the proportion of CERT items rated as sufficiently reported. Rater agreement was estimated using coefficient kappa and McNemar's test. RESULTS: The range of CERT items rated as sufficiently reported was 0 to 15 of 19 items, and the mean for both trial arms was 5.5. For agreement, 11 of 19 items had sufficient data to estimate coefficient kappa and only 3 of 11 had a kappa >0.4 (moderate agreement). From the 12 of 19 items for which McNemar's test could be performed, five had evidence that PFMT experts more often rated the reporting as sufficient than the non-expert. Raters reported the CERT template was comprehensive but not complete and needed contextualizing for PFMT. CONCLUSIONS: Completeness of reporting was poor for this example of a rehabilitation exercise intervention, and equally poor in both trial arms. Inter-rater agreement of completeness of reporting was also poor. Using a data extraction tool with poor rater-agreement may add unnecessary burden in a review. However, using a data extraction tool that enables assessment of intervention homogeneity has benefits in making decisions about which data to pool or not. CLINICAL REHABILITATION IMPACT: Researchers reporting clinical trials must pay more attention to completeness of rehabilitation exercise reporting.",
author = "Hay-Smith, {E. Jean} and Kadri Englas and Chantale Dumoulin and Ferreira, {Cristine H.} and Helena Frawley and Mark Weatherall",
year = "2019",
month = "6",
day = "1",
doi = "10.23736/S1973-9087.19.05791-5",
language = "English",
volume = "55",
pages = "342--352",
journal = "European Journal of Physical and Rehabilitation Medicine",
issn = "1973-9087",
publisher = "Edizioni Minerva Medica",
number = "3",

}

The Consensus on Exercise Reporting Template (CERT) in a systematic review of exercise-based rehabilitation effectiveness : completeness of reporting, rater agreement, and utility. / Hay-Smith, E. Jean; Englas, Kadri; Dumoulin, Chantale; Ferreira, Cristine H.; Frawley, Helena; Weatherall, Mark.

In: European Journal of Physical and Rehabilitation Medicine, Vol. 55, No. 3, 01.06.2019, p. 342-352.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - The Consensus on Exercise Reporting Template (CERT) in a systematic review of exercise-based rehabilitation effectiveness

T2 - completeness of reporting, rater agreement, and utility

AU - Hay-Smith, E. Jean

AU - Englas, Kadri

AU - Dumoulin, Chantale

AU - Ferreira, Cristine H.

AU - Frawley, Helena

AU - Weatherall, Mark

PY - 2019/6/1

Y1 - 2019/6/1

N2 - BACKGROUND: Rehabilitation interventions are diverse - making decisions about pooling data in meta-analyses challenging. Intervention reporting templates such as the Consensus on Exercise Reporting Template (CERT) may help reviewers document intervention variability. AIM: To assess inter-rater agreement and utility of CERT used to assess completeness of reporting of one rehabilitation exercise intervention: pelvic floor muscle training (PFMT). DESIGN: A non-experimental agreement study. SETTING: Update of the Cochrane systematic review comparing different approaches to PFMT for urinary incontinence in women. POPULATION: Two PFMT arms from 21 newly identified trials. METHODS: Five raters independently used CERT to assess sufficiency of reporting of each arm (experimental PFMT and control PFMT) of each trial. One rater, PFMT non-expert, rated all trials. Four raters, all PFMT experts, assessed a mutually exclusive subgroup of the trials. In addition to rating sufficiency - "Yes" compared to No" or "Uncertain" - raters also reported on CERT utility. Expert ratings were used to determine the proportion of CERT items rated as sufficiently reported. Rater agreement was estimated using coefficient kappa and McNemar's test. RESULTS: The range of CERT items rated as sufficiently reported was 0 to 15 of 19 items, and the mean for both trial arms was 5.5. For agreement, 11 of 19 items had sufficient data to estimate coefficient kappa and only 3 of 11 had a kappa >0.4 (moderate agreement). From the 12 of 19 items for which McNemar's test could be performed, five had evidence that PFMT experts more often rated the reporting as sufficient than the non-expert. Raters reported the CERT template was comprehensive but not complete and needed contextualizing for PFMT. CONCLUSIONS: Completeness of reporting was poor for this example of a rehabilitation exercise intervention, and equally poor in both trial arms. Inter-rater agreement of completeness of reporting was also poor. Using a data extraction tool with poor rater-agreement may add unnecessary burden in a review. However, using a data extraction tool that enables assessment of intervention homogeneity has benefits in making decisions about which data to pool or not. CLINICAL REHABILITATION IMPACT: Researchers reporting clinical trials must pay more attention to completeness of rehabilitation exercise reporting.

AB - BACKGROUND: Rehabilitation interventions are diverse - making decisions about pooling data in meta-analyses challenging. Intervention reporting templates such as the Consensus on Exercise Reporting Template (CERT) may help reviewers document intervention variability. AIM: To assess inter-rater agreement and utility of CERT used to assess completeness of reporting of one rehabilitation exercise intervention: pelvic floor muscle training (PFMT). DESIGN: A non-experimental agreement study. SETTING: Update of the Cochrane systematic review comparing different approaches to PFMT for urinary incontinence in women. POPULATION: Two PFMT arms from 21 newly identified trials. METHODS: Five raters independently used CERT to assess sufficiency of reporting of each arm (experimental PFMT and control PFMT) of each trial. One rater, PFMT non-expert, rated all trials. Four raters, all PFMT experts, assessed a mutually exclusive subgroup of the trials. In addition to rating sufficiency - "Yes" compared to No" or "Uncertain" - raters also reported on CERT utility. Expert ratings were used to determine the proportion of CERT items rated as sufficiently reported. Rater agreement was estimated using coefficient kappa and McNemar's test. RESULTS: The range of CERT items rated as sufficiently reported was 0 to 15 of 19 items, and the mean for both trial arms was 5.5. For agreement, 11 of 19 items had sufficient data to estimate coefficient kappa and only 3 of 11 had a kappa >0.4 (moderate agreement). From the 12 of 19 items for which McNemar's test could be performed, five had evidence that PFMT experts more often rated the reporting as sufficient than the non-expert. Raters reported the CERT template was comprehensive but not complete and needed contextualizing for PFMT. CONCLUSIONS: Completeness of reporting was poor for this example of a rehabilitation exercise intervention, and equally poor in both trial arms. Inter-rater agreement of completeness of reporting was also poor. Using a data extraction tool with poor rater-agreement may add unnecessary burden in a review. However, using a data extraction tool that enables assessment of intervention homogeneity has benefits in making decisions about which data to pool or not. CLINICAL REHABILITATION IMPACT: Researchers reporting clinical trials must pay more attention to completeness of rehabilitation exercise reporting.

UR - http://www.scopus.com/inward/record.url?scp=85067268842&partnerID=8YFLogxK

U2 - 10.23736/S1973-9087.19.05791-5

DO - 10.23736/S1973-9087.19.05791-5

M3 - Article

VL - 55

SP - 342

EP - 352

JO - European Journal of Physical and Rehabilitation Medicine

JF - European Journal of Physical and Rehabilitation Medicine

SN - 1973-9087

IS - 3

ER -