Baseline dependent minimally important differences for clinical outcomes of pulmonary rehabilitation in people with COPD

Jenkins, A.R.; Groenen, M.T.J.; Vaes, A.W.; Janssen, D.J.A.; Wouters, E.F.M.; Franssen, F.M.E.; Spruit, M.A.

doi:10.1016/j.pulmoe.2023.06.004

Article information

Abstract

Full Text

Bibliography

Download PDF

Statistics

Tables (6)

Table 1. Baseline population characteristics.

Table 2. MIDs for outcomes following pulmonary rehabilitation.

Table 3. MIDs following stratification for baseline outcome values.

Table 4. MIDs following stratification for GOLD severity.

Table 5. MIDs following stratification for mMRC.

Table 6. MIDs following stratification for FFMI.

Show moreShow less

Abstract

Introduction

Minimally important differences (MIDs) for common outcomes of pulmonary rehabilitation are well documented for people with chronic obstructive pulmonary disease (COPD). It is not known whether MIDs differ based on COPD disease characteristics. This study aimed to estimate MIDs for clinical outcomes of pulmonary rehabilitation dependent upon baseline characteristics.

Methods

A database containing 2791 people with COPD was split into derivation (n=2245; age 66±9 years; 50% males; FEV1 47±20% predicted) and comparator (n=546; age 66±9 years; 47% males; FEV1 46±21% predicted) cohorts. MIDs were estimated using 0.5 x SD (symmetrically distributed) or 0.5 x IQR (non-symmetrically distributed) for: 6-minute walk test (6MWT), constant work rate test (CWRT), COPD assessment test (CAT), St. George's respiratory questionnaire (SGRQ), hospital anxiety and depression scale (HADS), and fat-free mass index (FFMI). MIDs were estimated based on baseline outcome scores, lung function, modified medical research council (mMRC) grade and FFMI.

Results

MID estimates were comparable to previously reported values. MIDs for SGRQ domains (Symptom=8.7 points, Activity=7.1 points, Impact=8.1 points) and FFMI were produced (0.36kg/m2). There was greater variation of change in 6MWT, SGRQ-activity, SGRQ-impact, HADS and FFMI on which the MIDs were determined when categorising for baseline values (all, p<0.05). Greater variation of change in 6MWT on which the MIDs were determined was evident with COPD disease severity grouping (p<0.05). The magnitude of change in 6MWT, CAT, CWRT, SGRQ-activity, and FFMI with baseline mMRC score categorisation resulted in greater variation on which the MIDs were determined (all, p<0.05). Baseline stratification for FFMI resulted in greater variation of change in CWRT (p<0.001) and HADS-depression (p = 0.043) on which MIDs were determined.

Discussion

Findings suggest that baseline presentation should be considered for people with COPD when assessing the efficacy of pulmonary rehabilitation. However, clinical significance of the variation underpinning MIDs is yet to be determined.

Keywords:

Minimally important differences

Pulmonary rehabilitation

Clinical outcomes

Full Text

Introduction

Pulmonary rehabilitation is considered a cornerstone treatment in the management of Chronic Obstructive Pulmonary Disease (COPD) for inducing improvements in exercise capacity and health-related quality of life.1 People with COPD undergoing pulmonary rehabilitation present with a diverse range of health care needs as defined by variability in disease severity, functional exercise capacity and psychological well-being.2 The effectiveness of pulmonary rehabilitation programmes is commonly determined by changes in exercise capacity (e.g., 6-minute walk test (6MWT), constant work rate test (CWRT)), health-related quality of life measures (e.g., COPD Assessment Tool (CAT), St. Georges Respiratory Questionnaire (SGRQ)), mental health (e.g., Hospital Anxiety & Depression Scale (HADS)), and body composition (fat free mass index (FFMI)).3 However, statistical analysis of changes in clinical outcome measures can often be misleading and may not be clinically relevant to patients and/or healthcare professionals.4

The minimally important difference (MID) was defined to address misleading statistics by calculating the smallest difference in a measured clinical parameter that is assumed to reflect a clinically meaningful impact in a patient's condition, for better or worse, as perceived by the patient, clinician, or investigator.5 MIDs can be determined using different approaches; anchor-based, distribution-based, and Delphi methods.4 The MIDs developed in respiratory research thus far have been highlighted with differing thresholds and/or ranges defined using either anchor-based or distribution-based methods.6 Distribution-based methods have been suggested to have an advantage of being simpler to use as they do not require an external criterion as observed with the anchor-based method.7

There is established evidence, especially in the field of medical statistics, stating that baseline differences could be key to the variety in the magnitude of effect seen with intervention.8 This has recently been demonstrated to be the case with pulmonary rehabilitation,9-11 whereby people with COPD presenting with poorer exercise capacity and higher symptom burden pre-rehabilitation stand to achieve greater benefits with pulmonary rehabilitation. Several MIDs used in practice are based on distribution-based approaches covering the COPD population as a whole, whereby the variation in response to pulmonary rehabilitation is used to determine MIDs. It is important to reassess the current state of MIDs to evaluate programme efficacy accounting for baseline differences. Currently available MIDs are disease specific and not patient specific. These MIDs may lack specificity leading to the potential over- or under-estimation of effects seen with pulmonary rehabilitation depending on patient presentation. Therefore, it is important to begin to build on the development of MIDs in the context of pulmonary rehabilitation to offer more personalised and contextualised MIDs for common clinical outcomes.

This study aimed to develop new MIDs for clinical outcomes (6MWT, CWRT, HADS, CAT & SGRQ, FFMI) of pulmonary rehabilitation in people with COPD based on stratification for baseline values and disease characteristics (disease severity according to GOLD grade, modified Medical Research Council dyspnea scale (mMRC) grade, body composition according to FFMI) using the distribution-based approach.

Methods

This study was conducted using the Ciro clinical pulmonary rehabilitation database for which patients had not objected to the storing of data for research purposes. Ethical approval for this study was waived by the medical ethics committee of the University Hospital Maastricht and Maastricht University (METC azM/UM) because the Medical Research Involving Human Subjects Act (WMO) does not apply to this study (METC 2020-1542).

Population

Data were extracted from the electronic patient files consisting of 2791 people with a clinical diagnosis of COPD (International Classification of Diseases and Related Health Problems (ICD-10) J43 or J44, mMRC >0, post-bronchodilator FEV1/FVC <0.70 according to GOLD guidelines12). Patients were evaluated at initial assessment of a comprehensive pulmonary rehabilitation programme at Ciro, a tertiary care centre for people with complex chronic respiratory diseases in Horn (The Netherlands) which they completed between July 2013 and August 2020.

Intervention

The pulmonary rehabilitation programme delivered by Ciro for people with COPD is comprehensive and multidisciplinary in nature. Performed in line with the American Thoracic Society & European Respiratory Society guidance,13 the programme consists of supervised exercise training, education, psychosocial counselling, nutritional counselling, COPD exacerbation management, and occupational therapy.10 Ciro offers pulmonary rehabilitation in both the inpatient (8 weeks, 5 sessions per week; 40 sessions in total) and outpatient (8 weeks, 3 sessions per week, followed by 8 weeks, 2 sessions per week; total of 40 sessions) setting.10 Exercise training was performed at a moderate-high intensity to achieve an overload stimulus. Intensity was increased during rehabilitation based on dyspnoea and fatigue symptom scores. The exercise programme comprised of flexibility exercises, general physical exercise for lower and upper extremities, and daily supervised 30-min outdoor walks. The most dyspneic and frail inpatients were offered neuromuscular electrical stimulation of lower-limb muscles instead of the exercise training, as described before.14 A detailed psychosocial and physical assessment of each patient was undertaken during the initial and final assessments for pulmonary rehabilitation.

Measurements

Demographics, body mass index (BMI), body composition (FFMI: determined by dual-energy x-ray absorptiometry), and degree of breathlessness (mMRC grade) were assessed pre- and post-pulmonary rehabilitation. Post-bronchodilator spirometry was performed to confirm COPD diagnosis (FEV1/FVC <0.70) and divide patients into GOLD stages of disease severity; mild, moderate, severe, and very severe.12 Health status was assessed using the CAT15 (score range: 0-40 points) and Dutch version of the COPD-specific SGRQ16 (score range: 0-100 points). Symptoms of anxiety and depression were assessed using the HADS scale17 (score range: 0-21 points). Functional exercise capacity was assessed using the 6MWT, performed in accordance with ERS/ATS standards,18 and the CWRT, set at 75% of the determined peak work rate derived from a maximal incremental cycle test.19

MID calculation

Distribution-based MIDs were calculated for each clinical outcome using 0.5 x standard deviation (SD) of change from pre- to post-rehabilitation for symmetrically distributed outcomes. Where outcomes were non-symmetrically distributed, 0.5 x interquartile range (IQR) was used. To calculate personalised MIDs for CWRT, SGRQ-S, SGRQ-A, and SGRQ-I, data were split into tertiles for each outcome using baseline outcome scores (T1 = low, T2 = moderate, T3 = high). For 6MWT (<350m vs ≥350m),20 CAT (<18 vs ≥18 points),21 SGRQ-T (<46 vs ≥46 points),22 HADS-A (<8 vs ≥8 points),22 HADS-D (<8 vs ≥8 points),22 and FFMI (‘abnormal’ <15kg/m2 for females and <17kg/m2 for males vs ‘normal’ ≥15kg/m2 for females and ≥17kg/m2 for males)23 patients were split into ‘abnormal’ or ‘normal’ baseline outcome values using clinically relevant cut-offs. Further subset MIDs for these outcomes were calculated based on disease characteristics at baseline (mild, moderate, severe, very severe COPD; mMRC 1, 2, 3, 4; ‘abnormal’ FFMI vs ‘normal’ FFMI (in line with the criteria above)).

Statistical analysis

Statistical analyses were performed using SPSS (v25.00; SPSS Inc., Chicago, IL, USA). Data was randomly partitioned with an 80/20 ratio, as is commonly used and recommended in large datasets,24 into two groups to provide a derivation (n = 2245) and comparator (n = 546) cohort for the calculating of MIDs. All statistical analyses were undertaken primarily using the derivation cohort, with estimated MIDs compared with the comparator cohort. Baseline demographics and outcomes of the derivation cohort are presented as mean and SD for symmetrically distributed outcomes, and as median and IQR for non-symmetrically distributed outcomes. All data were tested for symmetry by assessing skewness scores with values lower than -0.5 or above 0.5 considered as non-symmetrically distributed. If one MID was not normally distributed, all MIDs for the outcome were assessed using 0.5 x IQR. To assess differences in the MIDs between groups in both cohorts (low (T1) vs moderate (T2) vs high (T3); abnormal vs normal; mild vs moderate vs severe vs very severe; mMRC 1 vs 2 vs 3 vs 4; abnormal FFMI vs normal FFMI), the homogeneity of variances was tested with Levene's test to assess whether the variances were equal between groups. Statistical significance was accepted at p < 0.05. If the assumption of Levene's was violated when including >2 groups, post-hoc Levene's independent t-test was used to further explore differences between subgroups. An adjustment for multiple testing was made with a Bonferroni correction. Statistical significance following correction was accepted at p < 0.0167.

Results

The demographics of the derivation and comparator cohorts who completed a pre-rehabilitation assessment are presented in Table 1.

Table 1.

Baseline population characteristics.

Characteristic	Derivation cohort	Comparator cohort
Age (years)	65.7 ± 8.7	65.6 ± 8.5
Male, n (%)	1132 (50.4%)	259 (47.4%)
FEV1 % predicted	43.6 [31.4,60.2]	42.7 [29.8,59.3]
Mild COPD, n (%)	167 (7.5%)	36 (6.6%)
Moderate COPD, n (%)	688 (31.1%)	171 (31.4%)
Severe COPD, n (%)	862 (38.9%)	200 (36.8%)
Very Severe COPD, n (%)	498 (22.5%)	137 (25.2%)
GOLDa
A, n (%)	114 (5.2%)	28 (5.2%)
B, n (%)	506 (23.2%)	139 (25.8%)
C, n (%)	142 (6.5%)	29 (5.4%)
D, n (%)	1423 (65.1%)	342 (63.6%)
mMRC
mMRC 1, n (%)	259 (11.7%)	58 (10.7%)
mMRC 2, n (%)	871 (39.3%)	206 (37.9%)
mMRC 3, n (%)	574 (25.9%)	154 (28.4%)
mMRC 4, n (%)	513 (23.1%)	125 (23.0%)
6MWT (m)	380 ± 120	381 ± 117
CWRT (secs)	213 [160,302]	215 [163,308]
CAT (points)	21.7 ± 6.5	21.6 ± 6.6
SGRQ-Total (points)	59.4 ± 14.4	59.9 ± 16.5
SGRQ-Symptom (points)	62.9 ± 17.7	63.0 ± 20.2
SGRQ-Activity (points)	80.3 [67.7,86.9]	79.9 [66.7,94]
SGRQ-Impact (points)	47.3 ± 17.7	48.0 ± 19.8
HADS-A (points)	7.7 ± 4.2	7.6 ± 4.5
HADS-D (points)	7.5 ± 4.0	7.6 ± 4.1
FFMI (kg/m2)	16.6 ± 2.5	16.6 ± 2.5

Data presented as mean ± SD, % of population or median [IQR].

a

GOLD grade based on mMRC grade. 6MWT, six-minute walk test; CAT, COPD assessment tool; COPD, chronic obstructive pulmonary disease; CWRT, constant work rate test; FEV1, forced expiratory volume in one second; FFMI, fat-free mass index; GOLD, global initiative for chronic obstructive lung disease; HADS, hospital anxiety and depression scale; mMRC, modified medical research council dyspnoea scale; SGRQ- St. George's Respiratory Questionnaire.

MIDs for whole population

There were no significant differences between the derivation and comparator cohorts in terms of MIDs for each outcome (all, p > 0.05) (Table 2).

Table 2.

MIDs for outcomes following pulmonary rehabilitation.

Variable	Derivation cohort	Comparator cohort	p-values
6MWT (m)	32	27	0.178
CWRT (secs)	170*	146*	0.091
CAT (points)	3.1	3.1	0.862
SGRQ-T (points)	6.4	6.0	0.523
SGRQ-S (points)	8.7	8.6	0.907
SGRQ-A (points)	7.1*	7.2*	0.590
SGRQ-I (points)	8.1	7.4	0.266
HADS-A (points)	1.5*	2.0*	0.811
HADS-D (points)	2.0*	2.0*	0.964
FFMI (kg/m2)	0.36*	0.37*	0.610

⁎

Due to non-symmetrical distribution, MIDs presented as 0.5 x IQR. 6MWT, six-minute walk test; CAT, COPD assessment tool; CWRT, constant work rate test; FFMI, fat-free mass index; HADS, hospital anxiety (A) and depression (D) scale; SGRQ, St. George's Respiratory Questionnaire. CAT (Total, n =1879; Derivation, n =1507; Comparator, n =372), SGRQ (Total, Symptom, Activity, Impact domains: Total, n =716; Derivation, n =566; Comparator, n =150), HADS (Total, n =1833; Derivation, n =1469; Comparator, n =364), 6MWT (Total, n =2018; Derivation, n =1613; Comparator, n =405), CWRT (Total, n =1797; Derivation, n =1431; Comparator, n =366), and FFMI (Total, n =2079; Derivation, n =1654; Comparator, n =425).

MIDs stratified for baseline values

Variation of change on which MIDs were based for CWRT were not significantly different across tertiles for the derivation cohort (p = 0.281) but were for the comparator cohort (p < 0.001). Post-hoc analyses in the comparator cohort showed greater variation of change on which the MID was determined for CWRT in the high (T3) group compared to the moderate (T2) (p = 0.001) and low (T1) (p < 0.001) groups. No significant differences were observed between low (T1) and moderate (T2) groups (p = 0.225).

Variation of change across tertiles on which MIDs were determined for SGRQ-A was evident in both cohorts (p < 0.05). Post-hoc analyses showed greater variation of change on which the MID was determined for SGRQ-A in the low (T1) compared to the high group (T3) (derivation, p = 0.014; comparator, p = 0.016). In the comparator cohort only, the variation of change was greater for determining the MID for SGRQ-A in the moderate (T2) compared to the high (T3) group (p = 0.008). All other tertile comparisons were found to not be statistically significant in the post-hoc analysis (p > 0.0167). Variation of change across tertiles on which MIDs were determined for SGRQ-I was evident in both cohorts (p < 0.05). Post-hoc analyses showed greater variation of change for determining the MID for SGRQ-I in the high (T3) compared to low (T1) (derivation, p < 0.001; comparator, p = 0.003) group. All other tertile comparisons were found to not be statistically significant in the post-hoc analysis (p > 0.0167). No significant differences between tertiles were seen for SGRQ-S in both cohorts (p > 0.05).

Variation of change on which MIDs were determined was evident in people categorised as abnormal according to clinical cut-offs for the outcomes of 6MWT (derivation, p < 0.001; comparator, p = 0.001), HADS-A (both, p < 0.001) and HADS-D (both, p < 0.001) when compared to people categorised as normal in both cohorts. Greater variation of change on which MIDs were determined was seen in people categorised as normal for the outcome of FFMI when compared to people categorised as abnormal in both cohorts (derivation, p = 0.006; comparator, p = 0.014). No significant differences between abnormal and normal baseline scores were seen for CAT and SGRQ-T in either cohort (all, p > 0.05) (Table 3).

Table 3.

MIDs following stratification for baseline outcome values.

Variable		Low(T1)	Moderate(T2)		High(T3)	p-values
CWRT (secs)	Derivation	110*	142*		275*	0.281
	Comparator	85*	103*		270*	<0.001
	Tertile cut-off	≤178.00	178.01-270.67		≥270.68	-
SGRQ-S (points)	Derivation	9.8*	8.6*		11.7*	0.173
	Comparator	12.5*	8.3*		11.8*	0.180
	Tertile cut-off	≤55.50	55.51-70.90		≥70.91	-
SGRQ-A (points)	Derivation	7.3*	7.2*		7.2*	0.040
	Comparator	7.4*	7.5*		7.2*	0.018
	Tertile cut-off	≤73.00	73.01-86.80		≥86.81	-
SGRQ-I (points)	Derivation	7.8*	10.0*		11.9*	<0.001
SGRQ-I (points)	Comparator	8.2*	9.3*		12.6*	0.010
	Tertile cut-off	≤37.60	37.61-54.03		≥54.04	-
		Abnormal		Normal		p-values
6MWT (m)	Derivation	39*		27*		<0.001
	Comparator	41*		28*		0.001
	Clinical cut-off	<350		≥350		-
SGRQ-T (points)	Derivation	8.5*		7.0*		0.188
	Comparator	7.8*		10.3*		0.993
	Clinical cut-off	<46		≥46		-
CAT (points)	Derivation	4.0*		3.5*		0.196
	Comparator	3.5*		3.5*		0.750
	Clinical cut-off	<18		≥18		-
HADS-A (points)	Derivation	2.0*		1.5*		<0.001
	Comparator	2.5*		1.5*		<0.001
	Clinical cut-off	<8		≥8		-
HADS-D (points)	Derivation	1.8		1.3		<0.001
	Comparator	1.9		1.2		<0.001
	Clinical cut-off	<8		≥8		-
FFMI (kg/m2)	Derivation	0.34*		0.38*		0.006
	Comparator	0.32*		0.40*		0.014
	Clinical cut-off	<15 (female)<17 (male)		≥15 (female)≥17 (male)		-

⁎

Due to non-symmetrical distribution, MIDs presented as 0.5 x IQR. 6MWT, six-minute walk test; CAT, COPD assessment tool; CWRT, constant work rate test; FFMI, fat-free mass index; HADS, hospital anxiety (A) and depression (D) scale; SGRQ, St. George's Respiratory Questionnaire.

MIDs stratified for lung function

Variation of change on which MIDs were determined across disease severities for the outcome of 6MWT was evident in the derivation cohort only (p = 0.024). Post-hoc analyses showed greater variation of change on which to determine the MID for 6MWT in very-severe COPD when compared to moderate COPD (p = 0.008). No other significant differences were observed between disease severities (p > 0.0167). No significant differences between disease severities were observed in the comparator cohort (p = 0.807).

No significant differences between disease severity groups were seen for CWRT, CAT, SGRQ-T, SGRQ-S, SGRQ-A, SGRQ-I, HADS-A, HADS-D, and FFMI (Table 4).

Table 4.

MIDs following stratification for GOLD severity.

Variable		Mild	Moderate	Severe	Very Severe	p-values
6MWT (m)	Derivation	26*	28*	30*	36*	0.024
6MWT (m)	Comparator	27*	29*	30*	35*	0.807
CWRT (secs)	Derivation	183*	177*	177*	123*	0.188
CWRT (secs)	Comparator	238*	184*	104*	155*	0.199
CAT (points)	Derivation	4.5*	4.0*	4.0*	3.5*	0.242
CAT (points)	Comparator	5.0*	5.0*	3.0*	3.0*	0.320
SGRQ-T (points)	Derivation	8.4*	9.0*	7.6*	8.3*	0.951
SGRQ-T (points)	Comparator	7.2*	9.5*	6.8*	8.3*	0.764
SGRQ-S (points)	Derivation	9.9	9.0	8.1	8.3	0.314
SGRQ-S (points)	Comparator	9.2	8.2	8.6	9.4	0.956
SGRQ-A (points)	Derivation	8.3*	8.7*	6.9*	6.7*	0.173
SGRQ-A (points)	Comparator	7.3⁎⁎	10.3*	7.2*	7.2*	0.067
SGRQ-I (points)	Derivation	12.2*	10.4*	10.3*	10.3*	0.986
SGRQ-I (points)	Comparator	8.4*	10.6*	8.8*	12.0*	0.849
HADS-A (points)	Derivation	2.0*	2.0*	1.5*	2.0*	0.250
HADS-A (points)	Comparator	1.1*	2.0*	1.5*	2.5*	0.301
HADS-D (points)	Derivation	1.5*	2.0*	2.0*	2.0*	0.931
HADS-D (points)	Comparator	3.0*	2.0*	2.0*	2.0*	0.899
FFMI (kg/m2)	Derivation	0.34*	0.36*	0.37*	0.38*	0.053
FFMI (kg/m2)	Comparator	0.31*	0.38*	0.36*	0.42*	0.808

⁎

Due to non-symmetrical distribution, MIDs presented as 0.5 x IQR. 6MWT, six-minute walk test; CAT, COPD assessment tool; CWRT, constant work rate test; FFMI, fat-free mass index; HADS, hospital anxiety (A) and depression (D) scale; SGRQ, St. George's Respiratory Questionnaire.

MIDs stratified for mMRC grade

Variation of change on which the MIDs were determined was evident across mMRC scores for 6MWT in the derivation cohort (p < 0.001). Post-hoc analyses showed greater variation of change on which the MID was determined in people with an mMRC of 4 for 6MWT when compared to people with an mMRC of 1 (p < 0.001), mMRC of 2 (p < 0.001), and mMRC of 3 (p = 0.013). Greater variation of change on which the MID was based was also evident in people with an mMRC score of 3 for 6MWT when compared to people with an mMRC of 1 (p = 0.002). No other significant differences were observed between mMRC scores (p > 0.0167). No significant differences between mMRC scores were observed in the comparator cohort.

Variation of change on which the MIDs were determined was evident across mMRC scores for CWRT in the derivation cohort (p = 0.030). Post-hoc analyses showed greater variation of change on which the MID was determined in people with an mMRC of 1 for CWRT when compared to people with an mMRC of 3 (p = 0.013). No other significant differences were observed between mMRC scores (p > 0.0167). No significant differences between mMRC scores were observed in the comparator cohort.

Variation of change on which the MIDs were determined was evident across mMRC scores for CAT in the derivation cohort (p = 0.021). Post-hoc analyses showed greater variation of change on which the MID was determined in people with an mMRC of 1 for CAT when compared to people with an mMRC of 3 (p = 0.007) and mMRC of 2 (p = 0.015). No other significant differences were observed between mMRC scores (p > 0.0167). No significant differences between mMRC scores were observed in the comparator cohort.

Variation of change on which the MIDs were determined was evident across mMRC scores for CAT in both cohorts (derivation, p = 0.001; p = 0.005). Post-hoc analyses showed greater variation of change on which the MID was determined in people with an mMRC of 1 for SGRQ-A when compared to people with an mMRC of 2 (p = 0.011), mMRC of 3 (p < 0.001), and mMRC of 4 (p = 0.002) in the derivation cohort. In the comparator cohort, less variation of change on which the MID was determined was evident in people with an mMRC of 3 when compared to people with an mMRC of 2 (p = 0.010) and mMRC of 1 (p = 0.001). No other significant differences were observed between mMRC scores across derivation and comparator cohorts (p > 0.0167).

Variation of change on which the MIDs were determined was evident across mMRC scores for SGRQ-I in the comparator cohort (p = 0.039). Post-hoc analyses showed greater variation of change on which the MID was determined in people with an mMRC of 3 for SGRQ-I when compared to people with an mMRC of 2 (p = 0.004). No other significant differences were observed between mMRC scores (p > 0.0167). No significant differences between mMRC scores were observed in the derivation cohort.

Variation of change on which the MIDs were determined was evident across mMRC scores for FFMI in the derivation cohort (p < 0.001). Post-hoc analyses showed greater variation of change on which the MID was determined in people with an mMRC of 4 for FFMI when compared to people with an mMRC of 1 (p = 0.002) and mMRC of 2 (p < 0.001). No other significant differences were observed between mMRC scores (p > 0.0167). No significant differences between mMRC scores were observed in the comparator cohort.

No significant differences in MIDs between mMRC groups were seen for SGRQ-T, SGRQ-S, HADS-A, and HADS-D (Table 5).

Table 5.

MIDs following stratification for mMRC.

Variable		mMRC 1	mMRC 2	mMRC 3	mMRC 4	p-values
6MWT (m)	Derivation	26*	27*	31*	40*	<0.001
6MWT (m)	Comparator	24*	31*	31*	39*	0.361
CWRT (secs)	Derivation	229*	172*	147*	178*	0.030
CWRT (secs)	Comparator	245*	165*	144*	89*	0.458
CAT (points)	Derivation	4.0*	4.0*	3.0*	4.0*	0.021
CAT (points)	Comparator	5.0*	4.1*	3.0*	3.6*	0.088
SGRQ-T (points)	Derivation	9.8*	7.4*	7.3*	8.6*	0.542
SGRQ-T (points)	Comparator	11.2*	6.8*	9.0*	9.4*	0.075
SGRQ-S (points)	Derivation	12.7*	10.3*	10.9*	11.6*	0.213
SGRQ-S (points)	Comparator	12.2*	11.2*	10.3*	11.9*	0.537
SGRQ-A (points)	Derivation	12.3*	9.4*	6.8*	6.9*	0.001
SGRQ-A (points)	Comparator	11.0*	7.5*	3.7*	7.2*	0.005
SGRQ-I (points)	Derivation	10.7*	9.2*	10.9*	12.5*	0.318
SGRQ-I (points)	Comparator	10.7*	8.8*	11.6*	9.6*	0.039
HADS-A (points)	Derivation	2.0*	2.0*	1.5*	2.0*	0.057
HADS-A (points)	Comparator	2.0*	2.0*	2.3*	2.5*	0.334
HADS-D (points)	Derivation	1.5*	2.0*	2.0*	2.0*	0.985
HADS-D (points)	Comparator	2.5*	1.5*	2.0*	2.5*	0.152
FFMI (kg/m2)	Derivation	0.31*	0.36*	0.40*	0.40*	<0.001
FFMI (kg/m2)	Comparator	0.32*	0.34*	0.38*	0.41*	0.167

⁎

Due to non-symmetrical distribution, MIDs presented as 0.5 x IQR as opposed to 0.5 x SD. 6MWT, six-minute walk test; CAT, COPD assessment tool; CWRT, constant work rate test; FFMI, fat-free mass index; HADS, hospital anxiety (A) and depression (D) scale; mMRC, modified medical research council dyspnoea scale; SGRQ, St. George's Respiratory Questionnaire.

MIDs stratified for baseline FFMI

There was greater variation of change on which the MID was determined in people categorised as having normal FFMI for the outcome of CWRT when compared to people categorised as having abnormal FFMI at baseline (derivation, p < 0.001; comparator, p = 0.001) in both cohorts. Greater variation of change on which the MID was determined was seen in people categorised as having abnormal FFMI for HADS-D when compared to people categorised as having normal FFMI at baseline in the derivation cohort (p = 0.043), but not the comparator cohort (p = 0.297). No significant differences between abnormal and normal baseline FFMI groups were seen for 6MWT, CAT, SGRQ-T, SGRQ-S, SGRQ-A, SGRQ-I, and HADS-A in either cohort (all, p > 0.05) (Table 6).

Table 6.

MIDs following stratification for FFMI.

		Abnormal	Normal	p-values
6MWT (m)	Derivation	32	31	0.464
6MWT (m)	Comparator	26	27	0.439
CWRT (secs)	Derivation	126*	205*	<0.001
CWRT (secs)	Comparator	74*	199*	<0.001
CAT (points)	Derivation	3.0	3.1	0.517
CAT (points)	Comparator	3.0	3.1	0.136
SGRQ-T (points)	Derivation	7.4*	8.3*	0.412
SGRQ-T (points)	Comparator	8.4*	8.0*	0.696
SGRQ-S (points)	Derivation	8.2	8.9	0.255
SGRQ-S (points)	Comparator	9.1	8.3	0.285
SGRQ-A (points)	Derivation	7.0*	7.2*	0.322
SGRQ-A (points)	Comparator	7.0*	7.3*	0.540
SGRQ-I (points)	Derivation	7.7	8.3	0.573
SGRQ-I (points)	Comparator	6.7	7.7	0.429
HADS-A (points)	Derivation	1.5*	1.5*	0.214
HADS-A (points)	Comparator	2.5*	2.0*	0.620
HADS-D (points)	Derivation	2.0*	2.0*	0.043
HADS-D (points)	Comparator	2.0*	2.0*	0.297

⁎

Due to non-symmetrical distribution, MIDs presented as 0.5 x IQR. 6MWT, six-minute walk test; CAT, COPD assessment tool; CWRT, constant work rate test; HADS, hospital anxiety (A) and depression (D) scale; SGRQ, St. George's Respiratory Questionnaire.

Discussion

To the best of our knowledge, this is the first study in the context of pulmonary rehabilitation in COPD to calculate MIDs for commonly used clinical outcomes based on baseline disease characteristics. Firstly, the present study corroborates previous MID estimates for 6MWT (32m vs 30m18,25), CAT (-3.1 points vs -3.0 to -2.0 points26,27), SGRQ-total (-6.4 points vs -7.43 to -4.0 points6,26), HADS-A (-1.5 points vs -1.8 to -1.3 points27), HADS-D (-2.0 points vs -1.7 to -1.5 points27), and CWRT (170s vs 100-200s28). New MIDs for pulmonary rehabilitation outcomes proposed because of this study include SGRQ-S (-8.7 points), SGRQ-A (-7.1 points), SGRQ-I (-8.1 points), and FFMI (0.36 kg/m2). This is the first study to show that MID estimates differentiate statistically based upon baseline outcome values (6MWT, SGRQ-A, SGRQ-I, HADS-A, HADS-D, FFMI), GOLD disease severity (6MWT), mMRC dyspnoea score (6MWT, CAT, CWRT, SGRQ-A, FFMI), and baseline FFMI (CWRT, HADS-D). SGRQ-T and SGRQ-S MID estimates were not found to be statistically different in terms of variation dependent upon baseline characteristics.

Recently, there has been evidence emerging suggesting that baseline characteristics, in terms of disease, psychological and physical health, have an impact on the magnitude of response to pulmonary rehabilitation in terms of exercise-based and health-related quality of life outcomes.9 It is not implausible to suggest that those with higher baseline exercise capacity and/or better self-reported health-related quality of life might experience a ‘ceiling effect’ in response to any intervention, let alone pulmonary rehabilitation, when such used constructs are restricted to a scoring limit which could lead to less variation in responses seen. Whilst the results reported in this study corroborate previously reported MIDs, this study is the first to demonstrate that MIDs can differ based on baseline characteristics of people with COPD presenting to pulmonary rehabilitation.

In terms of exercise capacity, 6MWT and CWRT did not follow a similar path. The MID for 6MWT was amenable to change dependent upon baseline values and lung function whereas the MID for CWRT was not. However, the MID for CWRT was amenable to change dependent upon baseline FFMI values whereas the MID for 6MWT was not. This study demonstrates for the first time that the MID for 6MWT is statistically higher for people with COPD with heightened disease burden (i.e., low exercise capacity, poorer lung function, heightened dyspnea). Given the MIDs used within this study have been determined by distribution-based methods (i.e., 0.5 x SD or 0.5 x IQR), the higher MID in those with heightened disease burden at baseline for pulmonary rehabilitation reflects greater variation in responses to pulmonary rehabilitation consequently resulting in a higher MID. It was interesting to report the relative stability of the MID for CWRT when accounting for differing disease characteristics, with FFMI and mMRC being the only factors to impact the MID for CWRT. The lack of concordance between 6MWT and CWRT may be attributed to the differing nature of the tests with the 6MWT being a self-paced test where stopping for rest is an option, whilst the CWRT is conducted at a set workload with the test stopped if effort at the set intensity cannot be maintained.

For quality-of-life measures, the MIDs for HADS, CAT and SGRQ domains of activity and impact were amenable to change based upon baseline values. However, this was not the case for SGRQ domains of total score and symptoms. This lack of concordance within the SGRQ suggests a need to examine the tool on both a domain basis, as well as total score. Interestingly, lung function nor mMRC appeared to impact the MIDs for SGRQ or HADS, apart from in the SGRQ-A domain which was impacted by mMRC as was CAT. All MIDs for health-related quality of outcomes remained stable when accounting for baseline FFMI. The lack of differences observed in the SGRQ domains of total and symptoms when accounting for all baseline factors suggests the MIDs for these outcomes are stable across the COPD disease spectrum. However, this was not the case for HADS, CAT and SGRQ domains of activity and impact which need to be assessed on a baseline characteristic dependent level. All in all, as seen with exercise capacity, the observed differences suggested that people with COPD with heightened disease burden (i.e., poorer self-reported quality of life and mood status) had statistically higher MIDs.

In terms of body composition, the MID for FFMI appeared to be amenable to change based on baseline values and dyspnea. The MID for FFMI was higher in those with an mMRC score of 4. In keeping with the other outcomes, people with COPD presenting with poorer health (i.e., lower FFMI) had heightened MIDs. However, people with COPD with higher baseline FFMI had a higher MID. This study for the first time has produced an MID for use with pulmonary rehabilitation for a body composition outcome based on a large dataset.

The most influential factor on MIDs in the context of people with COPD in pulmonary rehabilitation is the baseline values of clinical outcomes. Lung function and mMRC appeared to have a modest impact on the MIDs of certain clinical outcomes but were far less prominent than baseline values themselves. Body composition had little impact on the MIDs of clinical outcomes. Some outcomes, mainly SGRQ-total and symptom domain appeared to have robust MIDs which were not amenable to certain disease characteristics in this large cohort. The 6MWT seemed to be the most consistently amenable outcome to a change in MID based on disease characteristics.

When interpreting the findings of this study, it is important to consider the limitations. Firstly, as far as we are aware there is no comparative literature across diseases which has statistically analysed estimated MIDs dependent upon baseline disease characteristics using a distribution-based approach. We consulted and opted for the Levene's test to measure the variance between clinically relevant cut-offs/tertiles/disease characteristic groups as our MID was based on distribution. It is also worth noting that whilst statistical differences between certain MIDs were seen, it is not possible to determine the clinical relevance of such differences between groups. This combined with the use of a large dataset also increase the possibility of relatively small changes leading to statistically significant differences. However, it is important to note that this study has implications for clinical practice in that MIDs have been produced based on a wide range of disease characteristics allowing service providers to contextualize responses to pulmonary rehabilitation in people with COPD in a variety of different ways which are more relevant to individuals. In turn, these MIDs may also be used in the design of future trials involving pulmonary rehabilitation to assess interventional efficacy through more specified MIDs. This study was not able to include an anchor to weight the changes in outcomes against self-reported improvements. It is important to highlight that there is still ongoing debate as to the accuracy of using differing approaches for estimating the MID.28-32 Due to the large nature of the dataset, there were outcomes which were found to be non-symmetrically distributed which posed challenges for determining the MID based on the standard deviation of data. Therefore, we opted to present a non-parametric equivalent in the form of 0.5 x IQR alongside 0.5 x SD for comparison purposes, and visually, MIDs appeared to be largely similar between approaches. It is important to note that some observations were not replicated in the comparator cohort, and vice versa meaning some results should be interpreted with caution.

In conclusion, this study further confirms the currently available MIDs for the COPD population, whilst also demonstrating that disease characteristics such as baseline outcome values, GOLD disease severity, mMRC score, and baseline FFMI can result in differing MIDs, but not necessarily for all outcomes. The findings suggest a potential need to shift from umbrella MIDs for measuring intervention efficacy with pulmonary rehabilitation and move towards individually tailored MIDs.

Funding information

Not applicable.

Acknowledgements

We would like to acknowledge the contribution of Dr. Iain Stewart and Dr. Qichen Deng for providing input into the statistical analysis plan.

References

[1]

B McCarthy, D Casey, D Devane, K Murphy, E Murphy, Y. Lacasse.

Pulmonary rehabilitation for chronic obstructive pulmonary disease.

Cochrane Database Syst Rev, (2015),

http://dx.doi.org/10.1002/14651858.CD003793.pub3

[2]

CE Bolton, EF Bevan-Smith, JD Blakey, P Crowe, SL Elkin, R Garrod, et al.

British Thoracic Society guideline on pulmonary rehabilitation in adults: accredited by NICE.

Thorax, 68 (2013), pp. ii1-ii30

http://dx.doi.org/10.1136/thoraxjnl-2013-203808 | Medline

[3]

S Souto-Miranda, G Rodrigues, MA Spruit, A. Marques.

Pulmonary rehabilitation outcomes in individuals with chronic obstructive pulmonary disease: a systematic review.

Ann Phys Rehabil Med, 65 (2022),

http://dx.doi.org/10.1016/j.rehab.2021.101564

[4]

SK Rai, J Yazdany, PR Fortin, JA. Aviña-Zubieta.

Approaches for estimating minimal clinically important differences in systemic lupus erythematosus.

Arthritis Res Ther, 17 (2015), pp. 143

http://dx.doi.org/10.1186/2Fs13075-015-0658-6 | Medline

[5]

JP Kiley, J Sri Ram, TL Croxton, GG Weinmann.

Challenges associated with estimating minimal clinically important differences in COPD-the NHLBI perspective.

COPD, 2 (2005), pp. 43-46

http://dx.doi.org/10.1081/copd-200050649 | Medline

[6]

PW Jones, KM Beeh, KR Chapman, M Decramer, DA Mahler, JA. Wedzicha.

Minimal clinically important differences in pharmacological trials.

Am J Respir Crit Care Med, 189 (2014), pp. 250-255

http://dx.doi.org/10.1164/rccm.201310-1863pp | Medline

[7]

A Ousmen, C Touraine, N Deliu, F Cottone, F Bonnetain, F Efficace, et al.

Distribution- and anchor-based methods to determine the minimally important difference on patient-reported outcome questionnaires in oncology: a structured review.

Health Qual Life Outcomes, 16 (2018), pp. 228

http://dx.doi.org/10.1186/s12955-018-1055-z | Medline

[8]

M Eccles, J Grimshaw, M Campbell, C. Ramsay.

Research designs for studies evaluating the effectiveness of change and improvement strategies.

Qual Saf Health Care, 12 (2003), pp. 47-52

http://dx.doi.org/10.1136/qhc.12.1.47 | Medline

[9]

IML Augustin, FME Franssen, S Houben-Wilke, DJA Janssen, S Gaffron, HJ Pennings, et al.

Multidimensional outcome assessment of pulmonary rehabilitation in traits-based clusters of COPD patients.

PLoS One, 17 (2022),

http://dx.doi.org/10.1371/journal.pone.0263657

[10]

MA Spruit, IM Augustin, LE Vanfleteren, DJ Janssen, S Gaffron, HJ Pennings, et al.

Differential response to pulmonary rehabilitation in COPD: multidimensional profiling.

Eur Respir J, 46 (2015), pp. 1625-1635

http://dx.doi.org/10.1183/13993003.00350-2015 | Medline

[11]

S Souto-Miranda, V Rocha, MA Mendes, P Simão, V Martins, MA Spruit, et al.

The presence of extra-pulmonary treatable traits increases the likelihood of responding to pulmonary rehabilitation.

Respir Med, 206 (2023),

http://dx.doi.org/10.1016/j.rmed.2022.107086

[12]

CF Vogelmeier, GJ Criner, FJ Martinez, A Anzueto, PJ Barnes, J Bourbeau, et al.

Global Strategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Lung Disease 2017 Report. GOLD Executive Summary.

Am J Respir Crit Care Med, 195 (2017), pp. 557-582

http://dx.doi.org/10.1164/rccm.201701-0218pp | Medline

[13]

MA Spruit, SJ Singh, C Garvey, R ZuWallack, L Nici, C Rochester, et al.

An official American Thoracic Society/European Respiratory Society statement: key concepts and advances in pulmonary rehabilitation.

Am J Respir Crit Care Med, 188 (2013), pp. e13-e64

http://dx.doi.org/10.1164/rccm.201309-1634st | Medline

[14]

MJH Sillen, FME Franssen, JML Delbressine, AW Vaes, EFM Wouters, MA. Spruit.

Efficacy of lower-limb muscle training modalities in severely dyspnoeic individuals with COPD and quadriceps muscle weakness: results from the DICES trial.

Thorax, 69 (2014), pp. 525-531

http://dx.doi.org/10.1136/thoraxjnl-2013-204388 | Medline

[15]

PW Jones, G Harding, P Berry, I Wiklund, WH Chen, N. Kline Leidy.

Development and first validation of the COPD Assessment Test.

Eur Respir J, 34 (2009), pp. 648-654

http://dx.doi.org/10.1183/09031936.00102509 | Medline

[16]

PW Jones, FH Quirk, CM. Baveystock.

The St George's respiratory questionnaire.

Respir Med, 85 (1991), pp. 25-31

http://dx.doi.org/10.1016/s0954-6111(06)80166-6 | Medline

[17]

AS Zigmond, RP. Snaith.

The hospital anxiety and depression scale.

Acta Psychiatr Scand, 67 (1983), pp. 361-370

http://dx.doi.org/10.1111/j.1600-0447.1983.tb09716.x | Medline

[18]

AE Holland, MA Spruit, T Troosters, MA Puhan, V Pepin, D Saey, et al.

An official European respiratory society/American thoracic society technical standard: field walking tests in chronic respiratory disease.

Eur Respir J, 44 (2014), pp. 1428-1446

http://dx.doi.org/10.1183/09031936.00150314 | Medline

[19]

A van ‘t Hul, R Gosselink, G. Kwakkel.

Constant-load cycle endurance performance: test-restest reliability and validity in patients with COPD.

J Cardiopulm Rehabil, 23 (2003), pp. 143-150

http://dx.doi.org/10.1097/00008483-200303000-00012 | Medline

[20]

MA Spruit, MI Polkey, B Celli, LD Edwards, ML Watkins, V Pinto-Plata, et al.

Predicting outcomes from 6-minute walk distance in chronic obstructive pulmonary disease.

J Am Med Dir Assoc, 13 (2012), pp. 291-297

http://dx.doi.org/10.1016/j.jamda.2011.06.009 | Medline

[21]

DE Smid, FME Franssen, M Gonik, M Miravitlles, C Casanova, BG Cosio, et al.

Redefining cut-points for high symptom burden of the global initiative for chronic obstructive lung disease classification in 18,577 patients with chronic obstructive pulmonary disease.

J Am Med Dir Assoc, 18 (2017),

https://doi.org/10.1016/j.jamda.2017.09.003

[22]

I Bjelland, AA Dahl, TT Haug, D. Neckelmann.

The validity of the hospital anxiety and depression scale: an updated literature review.

J Psychosom Res, 52 (2002), pp. 69-77

http://dx.doi.org/10.1016/s0022-3999(01)00296-3 | Medline

[23]

AM Schols, IM Ferreira, FM Franssen, HR Gosker, W Janssens, M Muscaritoli, et al.

Nutritional assessment and therapy in COPD: a European Respiratory Society statement.

Eur Respir J, 44 (2014), pp. 1504-1520

http://dx.doi.org/10.1183/09031936.00070914 | Medline

[24]

A Gholamy, V Kreinovich, O. Kosheleva.

Why 70/30 or 80/20 relation between training and testing sets: a pedagogical explanation.

Departmental Technical Reports (CS), (2018), pp. 1209

[25]

SJ Singh, MA Puhan, V Andrianopoulos, NA Hernandes, KE Mitchell, CJ Hill, et al.

An official systematic review of the European Respiratory Society/American Thoracic Society: measurement properties of field walking tests in chronic respiratory disease.

Eur Respir J, 44 (2014), pp. 1447-1478

http://dx.doi.org/10.1183/09031936.00150414 | Medline

[26]

H Alma, C de Jong, I Tsiligianni, R Sanderman, J Kocks, T. van der Molen.

Clinically relevant differences in COPD health status: systematic review and triangulation.

Eur Respir J, 52 (2018),

http://dx.doi.org/10.1183/13993003.00412-2018

[27]

DE Smid, FME Franssen, S Houben-Wilke, LEGW Vanfleteren, DJA Janssen, EFM Wouters, et al.

Responsiveness and MCID estimates for CAT, CCQ, and HADS in patients with COPD undergoing pulmonary rehabilitation: a prospective analysis.

J Am Med Dir Assoc, 18 (2017), pp. 53-58

http://dx.doi.org/10.1016/j.jamda.2016.08.002 | Medline

[28]

L Laviolette, J Bourbeau, S Bernard, Y Lacasse, V Pepin, MJ Breton, et al.

Assessing the impact of pulmonary rehabilitation on functional status in COPD.

Thorax, 63 (2008), pp. 115-121

http://dx.doi.org/10.1136/thx.2006.076844 | Medline

[29]

GR Norman, JA Sloan, KW. Wyrwich.

Interpretation of changes in health-related quality of life: the remarkable universality of half a standard deviation.

Med Care, 41 (2003), pp. 582-592

http://dx.doi.org/10.1097/01.mlr.0000062554.74615.4c | Medline

[30]

JA Sloan, CL Loprinzi, SA Kuross, AW Miser, JR O'Fallon, MR Mahoney, et al.

Randomized comparison of four tools measuring overall quality of life in patients with advanced cancer.

J Clin Oncol, 16 (1998), pp. 3662-3673

http://dx.doi.org/10.1200/jco.1998.16.11.3662 | Medline

[31]

MA. Testa.

Interpreting quality-of-life clinical trial data for use in the clinical practice of antihypertensive therapy.

J Hypertens Suppl, 5 (1987), pp. S9-13

Medline

[32]

AR. Feinstein.

Indexes of contrast and quantitative significance for comparisons of two groups.

Stat Med, 18 (1999), pp. 2557-2581

http://dx.doi.org/10.1002/(sici)1097-0258(19991015)18:19%3C2557::aid-sim361%3E3.0.co;2-r | Medline

Indexed in:

Follow us:

Indexed in:

Follow us:

Subscribe to our newsletter