Journal Information
Download PDF
More article options
Original article
Open Access
Available online 4 December 2022
Reliability and validity of the Chester step test in patients with interstitial lung disease
A. Alvesa,e, A. Oliveiraa,b,c,d, P.G. Ferreiraf,g, V. Martinsh, A. Marquesa,b,
Corresponding author

Corresponding author at: Lab3R - Respiratory Research and Rehabilitation Laboratory, School of Health Sciences (ESSUA) and Institute of Biomedicine (iBiMED), University of Aveiro, Agras do Crasto - Campus Universitário de Santiago, Edifício 30, 3810-193 Aveiro, Portugal.
a Lab3R – Respiratory Research and Rehabilitation Laboratory, School of Health Sciences, University of Aveiro (ESSUA), Aveiro, Portugal
b iBiMED – Institute of Biomedicine, Department of Medical Sciences, University of Aveiro, Aveiro, Portugal
c Respiratory Medicine, West Park Healthcare Centre, Toronto, ON, Canada
d School of Rehabilitation Science, McMaster University, Hamilton, ON, Canada
e REMEO Home Department, Linde Healthcare, Porto, Portugal
f ILD Outpatient Clinic, Pulmonology Department – Centro Hospitalar Universitário de Coimbra (CHUC), Coimbra, Portugal
g Faculty of Medicine of the University of Coimbra (FMUC), Coimbra, Portugal
h Pulmonology Department – Hospital Distrital da Figueira da Foz, Figueira da Foz, Portugal
Received 05 May 2022. Accepted 22 October 2022
Article information
Full Text
Download PDF
Figures (3)
Show moreShow less

The Chester Step Test (CST) is a simple and inexpensive field test, which requires minimal physical space to assess exercise capacity. Such characteristics make the CST suitable to be used in different settings, however, its measurement properties in patients with interstitial lung diseases (ILD) are unknown.


A cross-sectional study was conducted in patients with ILD. First, a CST-1 and a 6-minute walk test (6MWT) were performed. After 48-72 hours, a CST-2 was repeated. A 2nd rater was present in one of the sessions. Relative reliability was measured using intraclass correlation coefficient (ICC1,1 and ICC2,1). Absolute reliability was determined using standard error of measurement (SEM), minimal detectable change at 95% confidence interval (MDC95) and the Bland-Altman method. The values of SEM and MDC95 were also expressed as a percentage of the mean. Construct validity was explored using Spearman correlation coefficient (rs) between the number of steps taken in the best CST and the distance performed in the 6MWT.


Sixty-six patients with ILD (65.5±12.9 years; 48.5%men; FVC 79.4±18.8pp; DLCO 49.0±18.3pp) participated in the study. Relative (ICC 0.95-1.0) and absolute reliability were excellent without evidence of systematic bias. The SEM and MDC95 were 11.8 (14.7%) and 32.6 steps (40.7%), respectively. The correlation between CST and 6MWT was significant, positive, and high (rs=0.85, p=0.001).


The CST is a reliable and valid test and might be especially useful to assess exercise capacity in patients with ILD in limited space environments.

Interstitial lung disease
Measurement properties
Functional capacity
Full Text

Interstitial lung diseases (ILD) are a heterogeneous group of diffuse parenchymal lung disorders which, despite presenting diverse etiologies, share several clinical features.1,2 Although comprising various degrees of inflammation and/or fibrosis, a percentage of ILD patients can develop a progressive self-sustaining fibrosis, namely those with idiopathic nonspecific interstitial pneumonia, unclassifiable idiopathic interstitial pneumonia, connective tissue disease-associated ILDs, hypersensitivity pneumonitis, sarcoidosis and ILDs related to other occupational exposures (e.g. silicosis, asbestosis).3 This progressive-fibrosing phenotype may lead to worsening of symptoms (mainly exertional dyspnea, dry cough and fatigue), progressive impairment in gas exchange and lung function decline, reduced exercise capacity, muscle dysfunction and reduced quality of life.4,5 Some of these patients can also present a median survival rate below 5 years after diagnosis and generally exhibit high associated socioeconomic costs, high dependency on others to perform daily living activities and a massive burden on healthcare systems.6-9

In patients with ILD, reduced exercise capacity is associated with poor health-related quality of life and increased hospital admissions.10,11 Finding valid, feasible and standardized tests which enable comparison with standard values and provide agreement between healthcare professionals is, thus, a priority in order to provide the best quality of care to this population.11 The gold standard to evaluate exercise capacity is cardiopulmonary exercise testing (CPET).12 However, CPET is not easily available in clinical practice as it requires expensive equipment, the presence of specialized human resources and is time-consuming.13 To overcome these limitations, field tests such as the 6-minute walk test (6MWT), the incremental shuttle walk test (ISWT) and, more recently, the Chester Step Test (CST) have been used to assess exercise capacity in patients with chronic respiratory diseases.13-15 Field tests are more affordable and simpler to apply than CPET and are better related to patients’ demands during activities of daily living.13,16 Particularly, the CST requires less space than the other field tests, which allows it to be easily applied in different settings, including inpatient, outpatient and home-based settings.16

The CST is an externally paced, incremental and multistage test, designed to assess exercise capacity in healthy individuals.15,16 Recently, it has been validated to assess exercise capacity in patients with chronic obstructive pulmonary disease (COPD).15 However, the physiological mechanisms of exercise limitation in patients with COPD differ significantly from those in patients with ILD, in whom exercise intolerance is mostly due to impaired gas exchange and circulation limitation.10,17 Thus, it is imperative to test the measurement properties of the CST in ILD to assure that the selection of this instrument for research and clinical practice is evidence-based and its results can be reliably interpreted.18

The authors hypothesize that the number of steps in the CST will present: (1) excellent intra and inter-rater reliability; (2) significant, positive, and high correlation with the distance covered in the 6MWT (6MWD). The main purpose of this study was to assess the reliability (relative and absolute) and construct validity of the CST in patients with ILD.

Materials and methodsStudy design and population

This cross-sectional study was integrated in a larger trial (POCI-01-0145-FEDER-007628, POCI-01-0145-FEDER-028806 and PTDC/SAU-SER/28806/2017), with ethical approval from the Unidade de Investigação em Ciências da Saúde: Enfermagem (UICISA: E) of the Escola Superior de Enfermagem de Coimbra, Portugal (N°P517-08/2018), from the Ethics Committee for Health of the Centro Hospitalar do Baixo Vouga, EPE, Aveiro, Portugal (N/Ref 0863926) and from the Hospital Distrital da Figueira da Foz, EPE, Leiria, Portugal (March 15th 2019). All participants signed an informed consent.

The study initiated in January 2019, and it was completed in November 2020. Patients were considered eligible if they were diagnosed by their pulmonologist with any ILD, according with the internationally accepted guidelines,11920 and were clinically stable over the past month (i.e., no hospital admissions, exacerbations – i.e., acute, clinically significant respiratory deterioration, typically less than 1 month in duration, with new bilateral glass opacity and/or consolidation superimposed on a background pattern consistent with fibrosing ILD21 - or changes in their pharmacological treatment strategy). They were excluded if they had other lung diseases, signs of cognitive impairment or substance abuse, or presence of a significant cardiovascular, neurological, or musculoskeletal disease that precluded their participation in data collection.

This study was conducted and reported according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) initiative and a minimum of fifty participants were aimed to be included in order to achieve a good sample size for measurement properties assessment.18,22

Data collection

Participants were asked to attend to two assessment sessions, with at least 48-72 hours apart. Sociodemographic, anthropometric, and clinical data were first obtained to characterize the sample. Lung function tests (spirometry and diffusion capacity for carbon monoxide - DLCO) were conducted during patients´ routine medical appointments and collected from their most recent clinical records. The Self-Administered Comorbidities Questionnaire (SCQ) was used to score the severity of comorbidities.23,24 This questionnaire is composed of 12 medical and 3 optional conditions and attributes a maximum of 3 points to each condition (1 point for the presence of the problem, 1 point if receiving treatment for it and 1 point if the medical condition limits the person activities). Scores range from 0 to 45, with higher scores indicating more severe comorbidities.23,24 Then participants performed a CST-1 and a 6MWT, in this specific order. A resting period of at least 30 minutes between tests was given to allow for recovery of vital signs, fatigue, and dyspnea to their baseline values (a longer period was given, after the 30 min, if participants had not returned to their baselines levels). In the second session, a CST-2 was performed. A 2nd rater was present in one of the sessions to assess the CST.

The CST was performed using a digital recording with timed metronome rhythms and a 20 cm tall single-step device.16 The digital recording also gives the standardized instructions of the test and the chance to practice the test briefly.16 The CST has 5 stages, lasting 2 minutes each. The timed metronome set the step cadence, which starts at 15 steps/minute and increases 5 steps/minute every 2 minutes: stage 1 (15 steps/minute); stage 2 (20 steps/minute); stage 3 (25 steps/minute); stage 4 (30 steps/minute); stage 5 (35 steps/minute). The maximum test duration is 10 minutes, corresponding to the final stage 5. Heart rate (HR) and peripheral arterial oxygen saturation (SpO2) were monitored continuously with a pulse oximeter (Konica Minolta, Pulsox-300i, Japan) and recorded on paper by the rater every minute or at the end of each 2-minute stage, respectively.25 The perceived dyspnea and fatigue were recorded using the modified Borg scale every minute.25 The CST ended when: the participant reached 80% of the reserve HR25; if the SpO2 dropped below 85%; or if the participant was unable to maintain the step cadence for 15 seconds. Moreover, if the participant showed signs of intolerable dyspnea, being over-tired or dizzy, the CST was immediately terminated. The main outcome measure of the CST was the total number of steps taken. The best of the two CST, where the participant performed the highest number of steps, was selected for validity analysis.

The 6MWT was performed on a flat, straight, 30 meters length corridor with a hard surface, according to the European Respiratory Society/American Thoracic Society guidelines.26 Before each 6MWT, participants rested on a chair, located near the starting position, for at least 10 minutes. Then, participants were instructed to walk as fast as possible, without running or jogging, for 6 minutes.26 If participants requested to pause during the test or their SpO2 dropped below 85%, they could sit on the chairs placed along the corridor.26 Participants were encouraged to resume walking as soon as they could or when their SpO2 reached at least 88%. Criteria for immediately terminating the test included chest pain, intolerable dyspnea, leg cramps, staggering, diaphoresis and pale or ashen appearance.26 Standard encouragement was given each minute.26 The 6MWD was the main outcome measure.

Statistical analysis

Data analysis was performed using IBM SPSS Statistics (version 25.0, IBM Corporation, Armonk, NY, USA).

Descriptive statistics, i.e., relative frequencies (percentage), mean ± standard deviation (SD) or median [interquartile range] were used to describe the sample. The Kolmogorov-Smirnov test (KS) was used to determine the normality of data distribution.27 Outliers were identified, through the inspection of extreme points on the plotted graphs of the variables in study and analysis were performed with and without their presence. We decided not to remove outliers since their presence did not affect results significantly. Tests with a p<0.05 were considered statistically significant.

Relative reliability was measured using intraclass correlation coefficient (ICC)(29). ICC1,1 and ICC2,1 models were used to determine intra-rater and inter-rater reliability,29 respectively, according to the following equations:

where BMS is between-subjects mean squares, WMS is within-subjects mean squares, EMS is the error (residual) of mean squares, RMS is between raters mean squares, k is the number of measurements/raters (k=1), and n is the number of participants. An ICC lower than 0.50 was considered of poor reliability, 0.5-0.75 moderate, 0.75-0.90 good and greater than 0.9 excellent reliability.18,28

Absolute reliability was determined by calculating the standard error of measurement (SEM) and the minimal detectable change at 95% confidence interval (MDC95)(29). The SEM was measured according to the following equation: SEM=SDdifference/2), where SDdifference is the SD of the differences between the CST-1 and CST-2.18,30 The MDC95 was calculated as follows: MDC95=1.96×2×SEM.18 The values of SEM and MDC95 were also expressed as a percentage of the mean and calculated as follow: SEM%=(SEM/mean)×100andMDC95%=(MDC95/mean)×100, where mean is the mean of the number of steps taken in CST-1 and CST-2. A MDC95% of less than 30% was considered acceptable.31

The Bland-Altman method was applied to calculate absolute reliability.18,22 First, we plotted the difference between the number of steps taken in CST-1 and CST-2 against the mean of the number of steps taken in CST-1 and CST-2.32 Then, we calculated the mean and SD of the differences between CST-1 and CST-2, the closer the mean difference is to zero and the smaller the SD of the differences, the more reliable is the measure.32 Finally, we calculated the 95% limits of agreement (LoA95) as follows: LoA95=meandifference±1.96×SDdifferences.32

Construct validity was assessed by analysing the relationship between the number of steps taken in the best CST and the 6MWD using Spearman correlation coefficient (rs)(18). A correlation of 0-0.3 was considered poor, 0.3-0.5 weak, 0.5-0.7 moderate, 0.7-0.9 strong, and 0.9-1.0 excellent.18,27


Seventy-eight patients with ILD were screened to be included in the study. Sixty-six were eligible to participate and twelve were excluded for the following reasons: decline to participate (n=4), drop out for no reason given (n=1), presence of a significant cardiovascular disease (ischemic cardiomyopathy associated with myocardial infarction, n=4), presence of a significant musculoskeletal disease (severe gonarthrosis, n=2), presence of cognitive impairment (Alzheimer's disease, n=1). Sixty-six participants were included for the construct validity study and fifty-three for the reliability study, since thirteen individuals did not attend the second assessment session due to participants unavailability. A flow diagram of recruitment is provided in Fig. 1.

Fig. 1.

Flow diagram of participants with interstitial lung disease included in the study.


Eligible participants were on average 65.5±12.9 years old, slightly overweight (body mass index= 28.9±5.2kg/m2) and 48.5% were male (n=32). Most prevalent types of ILD were chronic hypersensitivity pneumonitis (n=29, 43.9%), followed by idiopathic pulmonary fibrosis (n=16, 24.2%) and sarcoidosis (n=6, 9.1%). Participants presented a mean forced expiratory volume in one second (FEV1) of 81.9±19.9% of predicted and mean forced vital capacity (FVC) of 79.4±18.8% of predicted. Nineteen participants (28.8%) presented mild ILD (DLCO >60% predicted), 18 participants (27.3%) moderate ILD (40%≤DLCO≤60% predicted) and 19 participants (28.8%) severe ILD (DLCO<40% predicted). We were unable to access DLCO values for 10 patients (15.1%). Thirty-one participants used long-term oxygen therapy (47%) and five participants used non-invasive ventilation during sleep (7.6%). The mean number of steps taken in CST-1 and CST-2 were 77.7±50.2, and 82.4±55.7, respectively. The main reason for CST termination was the inability to maintain the required step cadence. The mean 6MWD was 399.4±128.2 meters (83.1±26.4% of predicted). A detailed sample characterization is summarized in Table 1.

Table 1.

Sample characterization (n=66).

Characteristics  Eligible participants (n=66) 
Age, years  65.5±12.9 
Gender, male n (%)  32 (48.5) 
BMI, kg/m2  28.9±5.2 
Smoking status, n (%)   
Current  2 (3) 
Former  24 (36.4) 
Never  42 (63.6) 
Packs/year  35.0 [7.0-55.8] 
Exacerbations/year, n (%)   
0  48 (72.7) 
1  14 (21.2) 
≥2  4 (6) 
Lung function   
FEV1, L  2.1±0.7 
FEV1, %predicted*1  83.7±20.8 
FVC, L  2.5±0.8 
FVC, %predicted*1  79.4±18.8 
FEV1/FVC, %  82.7±9.2 
DLCO, %predicted*1  49.0±18.3 
 DLCO >60%predicted, n (%)  19 (28.8) 
 40%≤DLCO ≤60%predicted, n (%)  18 (27.3) 
 DLCO <40%predicted, n (%)  19 (28.8) 
Long-term oxygen therapy, n (%)  31 (47) 
Non-invasive ventilation, n (%)  5 (7.6) 
Pharmacological treatment for ILD, n (%)   
Glucocorticoids  43 (65.2) 
Immunosuppressant  30 (45.5) 
Antifibrotics  5 (7.6) 
ILD types, n (%)   
IPF  16 (24.2) 
Sarcoidosis  6 (9.1) 
Chronic hypersensitivity pneumonitis  29 (43.9) 
NSIP secondary to systemic sclerosis  2 (3) 
UIP secondary to systemic sclerosis  4 (6.1) 
UIP secondary to rheumatoid arthritis  2 (3) 
Anti-synthetase syndrome  2 (3) 
Desquamative interstitial pneumonia  1 (1.5) 
LIP related to Sjogren's syndrome  1 (1.5) 
Silicosis  1 (1.5) 
Respiratory bronchiolitis ILD  1 (1.5) 
Follicular bronchiolitis related to Sjogren's syndrome  1 (1.5) 
SCQ  9±3.9 
CST-1  77.7±50.2 
CST-2  82.4±55.7 
6MWT  399.4±128.2 
6MWT, % of predicted⁎2  83.1±26.4 

Notes: Values are presented as mean±standard deviation or median [interquartile range].

Legend: BMI, body mass index; CST, Chester Step Test; FEV1, forced expiratory volume in one second; FVC, forced vital capacity; DLCO, diffusion capacity for carbon monoxide; ILD, interstitial lung disease; IPF, idiopathic pulmonary fibrosis; LIP, lymphocytic interstitial pneumonia; NSIP, non‐specific interstitial pneumonia; UIP, usual interstitial pneumonia; SCQ, self-administered comorbidities questionnaire; 6MWT, 6-minute walk test.


computed according to Standardization of Spirometry 2019 Update41;


computed according to Marques et al 202042


The CST demonstrated excellent relative reliability, for both intra-rater reliability (ICC1,1=0.95; 95%CI 0.91-0.97) and inter-rater reliability (ICC2,1=1.0; 95%CI 0.99-1.0). Regarding absolute reliability, SEM and MDC95 values were 11.8 steps (SEM%=14.7%) and 32.6 steps (MDC95%=40.7%), respectively. The Bland-Altman plot was created and a mean difference of -4.72 steps was observed with the LoA95 ranging from -37.28 and 27.84 steps (Fig. 2).

Fig. 2.

Bland and Altman plot of the difference between number of steps in the Chester Step Test-1 (CST-1) and CST-2 against the mean of the number of steps in test -1 and test -2 in patients with interstitial lung disease (n=66). The dashed horizontal line represents the mean difference, and the solid horizontal lines represent the 95% upper and lower limits of agreement.


The correlation between the number of steps of the best CST and the 6MWD was significant, positive, and strong (rs=0.85, p=0.001) (Fig. 3).

Fig. 3.

Correlation between the number of steps of the best Chester Step Test (CST) and the 6-minute walk distance (6MWD) in patients with interstitial lung disease (n=66).


Excellent intra-rater and inter-rater reliability were found for the CST. A similar study, which focused on patients with COPD and assessed the reliability of the CST also showed excellent relative reliability (ICC of 0.99; 95%CI 0.97-0.99)(15). This finding indicates that CST provides consistent results and excellent agreement between healthcare professionals, which allows for comparisons of patients’ results even when CST is applied by different raters on different occasions, i.e., pre and post PR program.11,33 Moreover, it also suggests that only minimal training is required for healthcare professionals to apply the CST.

Our findings suggest that it is necessary to improve above 32.6 steps to assume that the change observed was above the measurement error.18 Although this cut-off is informative, whether that change is clinically meaningful remains unknown. Moreover, our MDC95% was above the 30% acceptable limit.31 It is likely that this finding was influenced by the high heterogeneity of symptoms and exercise capacity observed in our sample of patients within disease subgroups (mild DLCO 28.8%; moderate DLCO 27.3% and severe DLCO 28.8%), thus increasing the MDC95 value.1,34 Therefore, future studies determining the minimal clinically important differences of the CST in patients with ILD, ideally using homogeneous samples, and using both anchor (i.e., mean change, the receiver operating characteristic curves and linear regression analysis) and distribution-based methods (i.e., 0.5 times the SD, SEM, 1.96 times the SEM; MDC95 and ES)14 are needed.

In the present study, no evidence of systematic bias was observed. In patients with COPD, the CST showed smaller and narrower values of mean difference and LoA95 (mean difference of -1.1 steps with LoA95 ranging from -20.2 to 17.9 steps).15 It is likely that the high heterogeneity observed in our sample of patients within disease subgroups may have contributed to the wider range of the LoA95, especially those patients that achieved higher stages on the CST.

The CST has adequate construct validity as shown by the strong correlation between the number of steps and the 6MWD. Similar results were found for the ISWT in the same population (r=0.76, p<0.0001), while a lower correlation was observed in patients with COPD (r=0.60, p=0.001).13,15 In addition to the adequate validity, the CST presents some advantages over the ISWT and the 6MWT, such as the apparent absence of a learning effect and requiring minimal space for its application.35 As reported previously, the 6MWT requires a 30 meters length corridor, and the ISWT a 10 meters corridor, which is often difficult to find in home and clinical settings.35,36

The CST may also be useful in other contexts and populations. During the COVID-19 pandemic, the need to transfer patients to rehabilitation facilities or discharge them to home increased exponentially and so did the demand for home-based and/or remote rehabilitation.37,38 Worldwide, healthcare professionals and researchers were looking for validated, easy and practical to implement exercise tests that allowed assessing and prescribing exercise in non-clinical settings.39 Step tests emerged as validated measures, however up until now only the 3-minute step test had been used in such settings.39 To the best of the authors´ knowledge, the current study is the first to show that the CTS is safe, reliable, and valid to be conducted in a community setting. Futures studies should be conducted to assess its applicability in other populations, such as patients post-COVID-19.

Assessment of exercise capacity is of paramount importance as exercise capacity in patients with ILD has been associated with a more reserved prognosis and a poorer health-related quality of life.11,17,1 Clinicians and researchers now have at their disposal a field test that is valid and reliable to assess exercise capacity in patients with ILD even in limited space settings.


This study has some limitations that need to be acknowledged. First, we only assessed construct and not criterion validity (i.e., correlation with the gold standard – CPET). Second, we did not evaluate the effects of the disease severity and variability on the MDC95, since our sample size was too small to perform subgroup analysis. Third, our study only included patients with clinical stable ILD and participants with any other significant impairment or disease were excluded, thus our results cannot be generalized to all patients. Modified versions of the CST may be needed either to assess patients only with mild ILD or to safely assess patients with more severe impairments.


The CST is a reliable and valid test to evaluate exercise capacity in patients with clinical stable ILD. Due to its characteristics, the CST may constitute an appropriate alternative to the 6MWT and the ISWT in limited space environments.


Authors’ contributions: AM obtained the funding, had full access to all data in the study and takes responsibility for the data and the accuracy of data analysis, including and especially any adverse effects. AM and AO conceived the idea. All authors contributed to the design and interpretation of data. AA, AO and PGF contributed to data acquisition. AA performed the analysis and drafted the paper. All authors critically revised the manuscript and approved the final version.

Financial/ nonfinancial disclosures: none declared

Role of sponsors: The sponsors had no role in the design of the study, the collection and analysis of the data, or the preparation of the manuscript.

Other contributions: The authors would like to thank Cátia Paixão, Filipa Machado, Patrícia Rebelo, Liliana Santos and Sara Souto-Miranda for their contribution in data collection.

KM Antoniou, GA Margaritopoulos, S Tomassetti, F Bonella, U Costabel, V. Poletti.
Interstitial lung disease.
Eur Respirat Rev, 23 (2014), pp. 40-54
AL Olson, AH Gifford, N Inase, ER Fernández Pérez, T Suda.
The epidemiology of idiopathic pulmonary fibrosis and interstitial lung diseases at risk of a progressive-fibrosing phenotype.
Eur Respirat Rev, 27 (2018),
V Cottin, NA Hirani, DL Hotchkin, AM Nambiar, T Ogura, M Otaola, et al.
Presentation, diagnosis and clinical course of the spectrum of progressive-fibrosing interstitial lung diseases.
Eur Respirat Rev, 27 (2018),
AU Wells, KK Brown, KR Flaherty, M Kolb, J. Thannickal Victor.
What's in a name? That which we call IPF, by any other name would act the same.
Eur Respir J, 51 (2018),
V Cottin, L Wollin, A Fischer, M Quaresma, S Stowasser, S. Harari.
Fibrosing interstitial lung diseases: knowns and unknowns.
Eur Respirat Rev, 28 (2019),
M Algamdi, M Sadatsafavi, JH Fisher, J Morisset, KA Johannson, CD Fell, et al.
Costs of workplace productivity loss in patients with fibrotic interstitial lung disease.
A Belkin, K Albright, JJ. Swigris.
A qualitative study of informal caregivers’ perspectives on the effects of idiopathic pulmonary fibrosis.
BMJ Open Respira Res, 1 (2014), pp. e000007
C Holtze, K Flaherty, M Kreuter, F Luppi, T Moua, C Vancheri, et al.
Healthcare utilisation and costs in the diagnosis and treatment of progressive-fibrosing interstitial lung diseases.
Eur Respirat Rev, 27 (2018),
G Raghu, HR Collard, JJ Egan, FJ Martinez, J Behr, KK Brown, et al.
An official ATS/ERS/JRS/ALAT statement: idiopathic pulmonary fibrosis: evidence-based guidelines for diagnosis and management.
Am J Respir Crit Care Med, 183 (2011), pp. 788-824
AE. Holland.
Exercise limitation in interstitial lung disease-mechanisms, significance and therapeutic options.
Chronic Respirat Dis, 7 (2010), pp. 101-111
L. Puente-Maestu.
Physiological rationale of commonly used clinical exercise tests.
Pulmonol, 26 (2020), pp. 159-165
American Thoracic Society.
ATS/ACCP statement on cardiopulmonary exercise testing.
Am J Respir Crit Care Med, 167 (2003), pp. 211-277
S Singh, JA Moiz, MS Ali, D. Talwar.
Reliability, validity, and responsiveness of the incremental shuttle walk test in patients with interstitial lung disease.
J Cardiopulmonary Rehabil Prevent, 38 (2018), pp. 425-429
Bois RMd, Weycker D, C Albera, WZ Bradford, U Costabel, A Kartashov, et al.
Six-minute-walk test in idiopathic pulmonary fibrosis: test validation and minimal clinically important difference.
Am J Respir Crit Care Med, 183 (2011), pp. 1231-1237
AA de Camargo, T Justino, CHS de Andrade, C Malaguti, S. Dal Corso.
Chester step test in patients with COPD: reliability and correlation with pulmonary function test results.
Respir Care, 56 (2011), pp. 995-1001
K Sykes, A. Roberts.
The Chester step test—a simple yet effective tool for the prediction of aerobic capacity.
Physiotherapy, 90 (2004), pp. 183-188
MA Spruit, SJ Singh, C Garvey, R ZuWallack, L Nici, C Rochester, et al.
An official American Thoracic Society/European Respiratory Society statement: key concepts and advances in pulmonary rehabilitation.
Am J Respir Crit Care Med, 188 (2013), pp. e13-e64
CB Terwee, SD Bot, MR de Boer, DA van der Windt, DL Knol, J Dekker, et al.
Quality criteria were proposed for measurement properties of health status questionnaires.
J Clin Epidemiol, 60 (2007), pp. 34-42
J. Hutchinson.
Statement on sarcoidosis. Joint Statement of the American Thoracic Society (ATS), the European Respiratory Society (ERS) and the World Association of Sarcoidosis and Other Granulomatous Disorders (WASOG) adopted by the ATS Board of Directors and by the ER.
Am J Respir Crit Care Med, 160 (1999), pp. 736-755
WD Travis, U Costabel, DM Hansell, TE King Jr, DA Lynch, AG Nicholson, et al.
An official American Thoracic Society/European Respiratory Society statement: update of the international multidisciplinary classification of the idiopathic interstitial pneumonias.
Am J Respir Crit Care Med, 188 (2013), pp. 733-748
M Kolb, B Bondue, A Pesci, Y Miyazaki, JW Song, NY Bhatt, et al.
Acute exacerbations of progressive-fibrosing interstitial lung diseases.
Eur Respirat Rev, 27 (2018), pp. 180071
Mokkink LB, Prinsen CA, Patrick DL, Alonso J, Bouter LM, de Vet HC, et al. COSMIN Study Design checklist for Patient-reported outcome measurement instruments. 2019.
A Molto, M. Dougados.
Comorbidity indices.
Clin Exp Rheumatol, 32 (2014), pp. S131-S1S4
O Sangha, G Stucki, MH Liang, AH Fossel, JN. Katz.
The Self-Administered Comorbidity Questionnaire: a new method to assess comorbidity for clinical and health services research.
Arthritis Care Res, 49 (2003), pp. 156-163
K Sykes.
Chester aerobic test software. Cardiac rehabilitation ed.
Cartwright Fitness Ltd 2013 - 2018, (2016),
AE Holland, MA Spruit, T Troosters, MA Puhan, V Pepin, D Saey, et al.
An official European Respiratory Society/American Thoracic Society technical standard: field walking tests in chronic respiratory disease.
6th, Eur Respir J, (2014), pp. 1428-1446
J. Cohen.
Statistical power analysis for the behavioral sciences.
Routledge, (2013),
A Bruton, JH Conway, ST. Holgate.
Reliability: what is it, and how is it measured?.
Physiotherapy, 86 (2000), pp. 94-99
PE Shrout, JL. Fleiss.
Intraclass correlations: uses in assessing rater reliability.
Psychol Bull, 86 (1979), pp. 420
HC De Vet, CB Terwee, LB Mokkink, DL. Knol.
Measurement in medicine: a practical guide.
Cambridge University Press, (2011),
C Jácome, J Cruz, A Oliveira, A. Marques.
Validity, reliability, and ability to identify fall status of the Berg Balance Scale, BESTest, Mini-BESTest, and Brief-BESTest in patients with COPD.
Phys Ther, 96 (2016), pp. 1807-1815
JM Bland, DG. Altman.
Measuring agreement in method comparison studies.
Stat Methods Med Res, 8 (1999), pp. 135-160
H. Coolican.
Research methods and statistics in psychology.
Routledge, (2018),
L Dowman, CF McDonald, C Hill, A Lee, K Barker, C Boote, et al.
The benefits of exercise training in interstitial lung disease: protocol for a multicentre randomised controlled trial.
BMC Pulmonary Med, 13 (2013), pp. 1-8
F Coll, K Hill, S Burrows, C Watson, D. Edgar.
Modified Chester step test in a healthy adult population: measurement properties and development of a regression equation to estimate test duration.
Phys Ther, 100 (2020), pp. 1411-1418
M Maddocks, CM Nolan, WD. Man.
Simple functional tests in COPD: stand up and be counted!.
Eur Respirat J, 49 (2017), pp. 1700104
Polastri M, Lazzeri M, Jácome C, Vitacca M, Costi S, Clini E, et al. Rehabilitative practice in Europe: roles and competencies of physiotherapists. Are we learning something new from COVID-19 pandemic? 2021.
M Polastri, A Ciasca, S Nava, E. Andreoli.
Two years of COVID-19: trends in rehabilitation.
Elsevier, (2022),
AE Holland, C Malaguti, M Hoffman, A Lahham, AT Burge, L Dowman, et al.
Home-based or remote exercise testing in chronic respiratory disease, during the COVID-19 pandemic and beyond: a rapid review.
Chronic Respirat Dis, 17 (2020),
BL Graham, I Steenbruggen, MR Miller, IZ Barjaktarevic, BG Cooper, GL Hall, et al.
Standardization of spirometry 2019 update. An official American thoracic society and European respiratory society technical statement.
Am J Respir Crit Care Med, 200 (2019), pp. e70-e88
A Marques, P Rebelo, C Paixão, S Almeida, C Jácome, J Cruz, et al.
Enhancing the assessment of cardiorespiratory fitness using field tests.
Physiotherapy, 109 (2020), pp. 54-64

Funding information: This work, was funded by Programa Operacional de Competitividade e Internacionalização – POCI, through Fundo Europeu de Desenvolvimento Regional - FEDER (POCI-01-0145-FEDER-007628 and POCI-01-0145-FEDER-028806), Fundação para a Ciência e Tecnologia (PTDC/SAU-SER/28806/2017) and under the project UIDB/04501/2020.

Copyright © 2022. Sociedade Portuguesa de Pneumologia

Subscribe to our newsletter

Article options

Are you a health professional able to prescribe or dispense drugs?