Manual Therapy Journal - Volume 14, Issue 2, Pages 117-240 (April 2009)


Author: Editors: Ann Moore and Gwen Jull


VOLUME 14 NUMBER 2 PAGES 117–240 April 2009

Editors

Ann Moore PhD, GradDipPhys, FCSP, CertEd, FMACP
Clinical Research Centre for Health Professions, University of Brighton, Aldro Building, 49 Darley Road, Eastbourne BN20 7UR, UK

Gwendolen Jull PhD, MPhty, Grad Dip ManTher, FACP
Department of Physiotherapy, University of Queensland, Brisbane QLD 4072, Australia

International Advisory Board

K. Bennell (Victoria, Australia) K. Burton (Huddersfield, UK) B. Carstensen (Frederiksberg, Denmark) M. Coppieters (Queensland, Australia) E. Cruz (Setubal, Portugal) L. Danneels (Maríakerke, Belgium) S. Durrell (London, UK) S. Edmondston (Perth, Australia) J. Endresen (Flaktvei, Norway) L. Exelby (Biggleswade, UK) D. Falla (Aalborg, Denmark) J. Greening (London, UK) C. J. Groen (Utrecht, The Netherlands) A. Gross (Hamilton, Canada) T. Hall (West Leederville, Australia) W. Hing (Auckland, New Zealand) M. Jones (Adelaide, Australia) S. King (Glamorgan, UK) B.W. Koes (Amsterdam, The Netherlands) J. Langendoen (Kempten, Germany) D. Lawrence (Davenport, IA, USA) D. Lee (Delta, Canada) R. Lee (London, UK) C. Liebenson (Los Angeles, CA, USA) L. Maffey-Ward (Calgary, Canada) E. Maheu (Quebec, Canada) C. McCarthy (Coventry, UK) J. McConnell (Northbridge, Australia) S. Mercer (Queensland, Australia) D. Newham (London, UK) J. Ng (Hung Hom, Hong Kong) S. O’Leary (Queensland, Australia) L. Ombregt (Kanegem-Tielt, Belgium) N. Osbourne (Bournemouth, UK) M. Paatelma (Jyvaskyla, Finland) N. Petty (Eastbourne, UK) A. Pool-Goudzwaard (The Netherlands) M. Pope (Aberdeen, UK) G. Rankin (London, UK) E. Rasmussen Barr (Stockholm, Sweden) D. Reid (Auckland, New Zealand) A. Rushton (Birmingham, UK) C. Shacklady (Manchester, UK) M. Shacklock (Adelaide, Australia) D. Shirley (Lidcombe, Australia) W. Smeets (Tongeren, Belgium) C. Snijders (Rotterdam, The Netherlands) R. Soames (Dundee, UK) P. Spencer (Barnstaple, UK) M. Sterling (St Lucia, Australia) P. Tehan (Victoria, Australia) M. Testa (Alassio, Italy) M. Uys (Tygerberg, South Africa) P. van der Wurff (Doorn, The Netherlands) P. van Roy (Brussels, Belgium) B.Vicenzino (St Lucia, Australia) H.J.M. Von Piekartz (Wierden, The Netherlands) M. Wessely (Paris, France) A. Wright (Perth, Australia) M. Zusman (Mount Lawley, Australia)

Associate Editors

Darren A. Rivett PhD, MAppSc (ManipPhty), GradDipManTher, BAppSc (Phty)
Discipline of Physiotherapy, Faculty of Health, The University of Newcastle, Callaghan, NSW 2308, Australia
E-mail: [email protected]

Deborah Falla PhD, BPhty(Hons)
Department of Health Science and Technology, Aalborg University, Fredrik Bajers Vej 7, D-3, DK-9220 Aalborg, Denmark
Email:deborahfvhst.aau.dk

Tim McClune D.O.
Spinal Research Unit, University of Huddersfield, 30 Queen Street, Huddersfield HD12SP, UK
E-mail: [email protected]

Editorial Committee

Timothy W Flynn PhD, PT, OCS, FAAOMPT
RHSHP-Department of Physical Therapy, Regis University, Denver, CO 80221-1099, USA
Email: [email protected]

Masterclass Editor
Karen Beeton PhD, MPhty, BSc(Hons), MCSP
MACP ex officio member
Associate Head of School (Professional Development), School of Health and Emergency Professions, University of Hertfordshire, College Lane, Hatfield AL10 9AB, UK
E-mail: [email protected]

Case reports & Professional Issues Editor
Jeffrey D. Boyling MSc, BPhty, GradDipAdvManTher, MCSP, MErgS
Jeffrey Boyling Associates, Broadway Chambers, Hammersmith Broadway, London W6 7AF, UK
E-mail:[email protected]

Book Review Editor
Raymond Swinkels MSc, PT, MT
Ulenpas 80, 5655 JD Eindhoven, The Netherlands
E-mail: [email protected]

Visit the journal website at http://www.elsevier.com/math doi:10.1016/S1356-689X(09)00009-5

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 117–118 www.elsevier.com/math

Editorial

Bring back the biopsychosocial model for neck pain disorders

The traditional pathoanatomical (biomedical) approach to the diagnosis of neck pain disorders is widely acknowledged as inadequate. It is well recognised that in the vast majority of individuals no pathology can be imaged which can reliably account for symptoms. Equally, pain is commonly the patient’s presenting complaint and certainly its dimensions cannot be imaged with conventional radiological techniques. The biopsychosocial model was introduced as a diagnostic and management paradigm to recognise, correctly, the multidimensional nature of pain. The model retained the biomedical aspect and added the role that psychological and social factors could contribute to pain perception and activity limitation.

There is no argument about the multidimensional nature of neck pain. However, in the absence of ‘red flags’ or demonstrable pathoanatomy, it now appears to be becoming commonly accepted, often without supporting data, that psychosocial features are the strongest drivers of neck pain, especially in compensable cases of neck pain. No imageable pathoanatomy seems to be commonly and incorrectly interpreted as the lack of any injury or biological event. The consequence is that the management advocated for these individuals then focuses on psychosocial aspects with little or no regard to biological features. While the pendulum was held at the far left in the pathoanatomical (biomedical) model, it now seems to have swung to the far right in a psychosocial model for both acute and persistent neck pain. Our question is: where has the middle ground of the biopsychosocial model gone?

There is undoubtedly an association between psychological features and neck pain, but this association is often not as strong as is commonly believed. For example, Kyhlback et al. (2002) found that baseline psychosocial factors of gender, age and self efficacy accounted for only 24% of the variance in pain and 36% of the variance in disability 12 months following whiplash injury. Similarly, a model proposed by Young Casey et al. (2008), which included standardised psychosocial measures of cumulative exposure to trauma, baseline depression, pain and pain beliefs in people with acute neck and back pain, accounted for only 28% of the variation in pain intensity and 58% of the variance in disability 3 months later. Despite these relatively weak relationships, it is almost considered a fact that psychosocial factors play a stronger role than physical ones in the presentation and development of neck pain, although few studies have actually included physical or biological factors in their analysis.

The magnitude of the contribution of psychosocial features can only be evaluated, and perhaps interpreted appropriately, with an understanding of concomitant biological features. This is clearly in evidence in studies which have measured both psychosocial and biological features simultaneously. Data from such studies demonstrate that some biological (physical) features are stronger predictors of pain and disability than psychosocial factors, or at least contribute significantly to predictive models that include measures of both biological and psychosocial substrates. In a cross-sectional study of office workers with and without neck pain, Johnston et al. (in press) demonstrated that when psychosocial domains, individual factors, task demands, quantitative sensory measures and measures of motor function were considered concomitantly, sensory and motor impairments had stronger influences on pain and disability than workplace and psychosocial features. In the case of whiplash, sensory disturbances, in particular increased cold sensitivity, are predictive of poor functional recovery (Kasch et al., 2005; Sterling et al., 2005). The inclusion of the physical variables of movement loss and sensory disturbances in a predictive model comprising psychosocial factors almost doubled the percentage of successful classification of individuals with persistent symptoms 12 months post whiplash injury (Sterling et al., 2005).

We support the biopsychosocial model. We advocate that future research addresses biological (physical), psychological and social features concurrently to more fully understand the interactions between these features in a pain state and in recovery. Studies must prestate hypotheses to test whether nominated biological or psychological features are mediators or moderators of the presenting pain and functional states to better understand their respective roles. Importantly, the clinician’s assessments of individuals with neck pain must follow a similar approach, and individual patients should not be fitted to a predetermined management approach. Current evidence for the management of neck pain disorders does not support any singular line of management, whether biologically or psychologically based. Rather, the evidence supports multimodal approaches, and a clearer understanding of the interactions between biological, psychological and social features of various neck pain disorders will inform better management, which is the aim of the biopsychosocial model.

1356-689X/$ - see front matter © 2009 Published by Elsevier Ltd. doi:10.1016/j.math.2009.01.004

References

Johnston V, Jimmieson NL, Jull G, Souvlis T. Contribution of individual, workplace, psychosocial and physiological factors to neck pain in female office workers. European Journal of Pain, in press.
Kasch H, Qerama E, Bach F, Jensen T. Reduced cold pressor pain tolerance in non-recovered whiplash patients: a 1 year prospective study. European Journal of Pain 2005;9:561–9.
Kyhlback M, Thierfelder T, Soderlund A. Prognostic factors in whiplash associated disorders. International Journal of Rehabilitation 2002;25:181–7.
Sterling M, Jull G, Vicenzino B, Kenardy J, Darnell R. Physical and psychological factors predict outcome following whiplash injury. Pain 2005;114:141–8.
Young Casey C, Greenberg M, Nicassio P, Harpin R, Hubbard D. Transition from acute to chronic pain and disability: a model including cognitive, affective and trauma factors. Pain 2008;134:69–79.

Gwendolen Jull
NHMRC Centre of Spinal Pain Injury and Health, School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Qld 4072, Australia

Michele Sterling
NHMRC Centre of Spinal Pain Injury and Health, School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Qld 4072, Australia
Centre of National Research on Disability and Rehabilitation, The University of Queensland, St Lucia, Qld 4072, Australia


Manual Therapy 14 (2009) 119–130 www.elsevier.com/math

Systematic Review

The validity and accuracy of clinical tests used to detect labral pathology of the shoulder – A systematic review

Wendy Munro*, Raymond Healy
University of Salford, Salford, Greater Manchester M6 6PU, UK
Received 16 July 2007; received in revised form 8 August 2008; accepted 27 August 2008

Abstract

Labral tears frequently require repair [Kim S, Ha K, Han K. Biceps Load test: a clinical test for superior labrum anterior and posterior lesions in shoulders with recurrent anterior dislocations. The American Journal of Sports Medicine 1999;27(3):300–3]. Physiotherapists need confidence in the clinical tests used to detect labral pathology in order to identify this condition accurately. This review systematically evaluates the evidence for the accuracy of these tests with reference to study quality and key biases. Cochrane, Medline, Cinahl, AMED, DARE and HTA databases were searched to identify 15 studies evaluating 15 clinical tests for labral pathology against magnetic resonance imaging (MRI) or surgery. Two independent reviewers assessed methodological quality using Quality Assessment of Diagnostic Accuracy Studies (QUADAS). Meta Disc was used to calculate likelihood ratios (a positive LR > 10 providing convincing diagnostic evidence for ruling a condition in; a negative LR < 0.2 providing large to moderate evidence for ruling the condition out) and to plot true positive rates (TPRs) against false positive rates (FPRs) in receiver operator characteristic (ROC) plots and summary receiver operator curves (SROCs). Probable overestimation of accuracy was caused by use of case control design, verification bias and use of a lesser reference standard. Six accurate tests, the Biceps Load I (+LR: 29.09; −LR: 0.09), Biceps Load II (+LR: 26.32; −LR: 0.11), Internal Rotation Resistance (IRRT) (+LR: 24.77; −LR: 0.12), Crank (+LR: 13.59 and 6.46; −LR: 0.1 and 0.22), Kim (+LR: 12.62; −LR: 0.21) and Jerk (+LR: 34.71; −LR: 0.27) tests, were identified from high quality single studies in selected populations. Subgroup analysis identified varying results of accuracy in the Crank test and the Active Compression (AC) test when evaluated in more than one study. Further evaluation is needed before these tests can be used with confidence.
© 2008 Elsevier Ltd. All rights reserved.
Keywords: Labral pathology; Screening; Sensitivity and specificity; Likelihood ratios

* Corresponding author. Directorate of Physiotherapy, Mary Seacole Building, Frederick Road Campus, University of Salford, Salford, Greater Manchester M6 6PU, UK. Tel.: +44 0161 295 2502; fax: +44 0161 295 2432. E-mail address: [email protected] (W. Munro).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.08.008

1. Introduction

Assessment and diagnosis has become an increasingly important aspect of the physiotherapist’s role in clinical specialist and extended scope roles. Differential diagnosis of the shoulder is a problematic area, with no standardised definitions and with diagnostic criteria for defining disorders being inconsistent and unreliable (Green et al., 2003). Hanchard et al. (2004) advocate an evidence based conservative management approach which does not differentiate between subacromial impingement syndrome (SIS), posterior superior glenoid impingement (PSGI) and superior labral anterior posterior (SLAP) lesions, suggesting that such clear cut diagnosis is unnecessary. However, the presence of signs possibly indicating glenoid labral damage, e.g. pain on overhead activities, deep shoulder pain, painful catching and popping or clicking (Musgrave and Rodosky, 2001), should lead the clinician to consider further management outside the scope of physiotherapy, such as arthroscopy or surgery. The symptoms of labral pathology can make it difficult to differentiate from other shoulder pathologies such as impingement and acromio-clavicular joint arthritis (Musgrave and Rodosky, 2001). Knowledge of the tests available to assist in the differentiation of this diagnosis, the validity of these tests and the skills to perform them are therefore required. Physical examination has been described as more of an art than a science, although carefully planned diagnostic test accuracy studies will provide more of a science to this art (Reider, 2004).

Although SLAP lesions commonly occur in the young active overhead athlete (Andrews et al., 1985) and following a compressive or distraction force on the shoulder (Andrews et al., 1985; Snyder et al., 1990; Maffet et al., 1995), labral pathology may also result from a sudden fall onto the outstretched hand or elbow with the shoulder in a somewhat adducted and extended position. This can lead to secondary symptoms of impingement caused by superior translation of the humeral head (Kumar et al., 1989; Altchek et al., 1992; Schmitz, 1999). Hasan (2006) has suggested that the superior labrum has a more meniscoid attachment to the glenoid than the rest of the labrum, making it susceptible to degenerative as well as traumatic lesions. Tests for labral pathology therefore need to be accurate in both general and athletic population settings and across a wide age range of patients.

Liume et al. (2004) and Jones and Galluch (2007) have systematically reviewed studies relating to clinical tests for instability and labral lesions and for superior glenoid labral lesions respectively. Liume et al. (2004) reviewed 17 studies evaluating clinical tests for shoulder instability or labral lesion, suggesting the Relocation test and the Anterior Release test to be most clinically relevant in diagnosing instability, and the Biceps Load tests I and II, the Pain Provocation test and the Internal Rotation Resistance test (IRRT) to be most promising for labral tears.
Jones and Galluch (2007) reviewed 12 studies and concluded that SLAP specific physical examination results cannot be used alone to diagnose SLAP lesions. This review, including additional studies, focuses on studies evaluating tests for labral pathology and adds to the previous literature with a thorough quality assessment of the included studies using Quality Assessment of Diagnostic Accuracy Studies (QUADAS), receiver operating characteristic plots and forest plots. Previous studies have used either QUADAS only or levels of evidence to control for study quality. Subgroup analysis is carried out on single tests evaluated in different studies.

2. Methods

2.1. Search strategy

Publications were identified by searching the following databases: Cochrane (1995–2007), Medline (1996–June 2007), Cinahl (1982–June 2007), AMED (1985–June 2007), Health Technology Assessment (1995–June 2007) and the Database of Abstracts of Reviews of Effectiveness (1995–June 2007). A combination of MeSH terms (exp ‘sensitivity and specificity’/, exp shoulder joint/, exp joint instability/, exp shoulder injuries/, exp shoulder pain/) and text words (specificity, false negative, accuracy, screening, labral pathology, SLAP lesions, SLAP, glenoid labrum, instability and individual test names) based on Devillé et al.’s (2000) optimal search strategy was used. The search was limited to English-language articles.

2.2. Inclusion and exclusion criteria

The titles of the articles were screened and filtered, and the abstracts of the filtered articles were screened by one reviewer (WM) for fulfilment of the inclusion and exclusion criteria. Inclusion criteria were: cohort and case control design, shoulder pain, clinical examination tests used to evaluate labral pathology, comparison against a reference standard, and inclusion of sensitivity and specificity values. Exclusions were: other pathologies leading to shoulder pain (e.g. pain referred from the spine or internal organs, cerebrovascular accident (CVA)) and studies omitting values of either sensitivity or specificity. Where the first reviewer was uncertain whether a study should be included, a second reviewer (RH) was consulted and a decision made by consensus. To ensure completeness of the literature search, the references of the included studies were hand searched for further references and a citation search was carried out. No further studies were identified.

2.3. Data extraction and quality assessment

A standardised extraction form was piloted and then used independently by two reviewers (WM and RH) to maintain quality and objectivity (Deeks and Morris, 1996). Any disagreements were decided by consensus. Quality assessment was carried out on all studies which met the inclusion and exclusion criteria using the QUADAS tool (Whiting et al., 2003). This ensured that all studies were evaluated for individual quality items rather than being given a quality score, as advocated by Whiting et al. (2005). The QUADAS tool has been developed based on expert consensus and empirical evidence.
It has been shown to have varied reliability, with agreement on individual checklist items of 90% (Whiting et al., 2006), 76% (Davis et al., 2007) and 78% (Hollingworth et al., 2006) and kappa scores of 0.65 (Whiting et al., 2006), 0.39 (Davis et al., 2007) and 0.22 (Hollingworth et al., 2006), demonstrating good, fair and fair inter-rater reliability respectively (Altman, 1999). Differences appear to be due to the number of reviewers, the working proximity of the reviewers and their experience in diagnostic accuracy systematic reviews (Hollingworth et al., 2006). The QUADAS tool gained positive feedback in a pilot study by twenty reviewers, with eighteen considering the tool to cover all important items (Whiting et al., 2006). The tool includes questions relevant to spectrum bias, selection bias, disease progression bias, verification bias, incorporation bias, execution of the index and reference test, index test and reference standard test review bias, uninterpretable tests and withdrawals from the studies. These terms are explained in Table 1. Each item was scored yes, no or unclear according to the scoring guidelines of the tool (Whiting et al., 2003). The meaning of the questions relevant to clinical applicability was discussed and agreed by the reviewers prior to use of the tool. For the purpose of the review, it was assumed that in prospective studies the index test would be interpreted without knowledge of the reference standard. Retrospective studies were marked as unclear unless it was stated that the diagnosis was given prior to the interpretation of the reference standard. An appropriate spectrum of patients was considered to be a sample consisting of a wide age range of male and female patients with a range of conditions.

Table 1
Glossary (QUADAS tool).

Spectrum bias: An appropriate sample of patients is considered to have a range of mild to severe, treated and untreated disease and different but commonly confused disorders. This would rule out the chance of spectrum bias occurring by minimising the prevalence of the condition for which the index test is testing.
Selection bias: The ideal study sample is a consecutive series of randomly selected patients. Selection bias may occur if patients are selected in a non-random manner, e.g. only those who are having surgery.
Disease progression bias: The index and reference test should be carried out at the same time to avoid bias due to the progression of the disease.
Verification bias: This occurs when not all those who have the index test have the reference standard (partial verification bias) or when the index tests are verified by different reference standards (differential verification bias).
Incorporation bias: The index and reference tests are required to have an independent result. Bias occurs when the index test is used as part of the reference standard.
Execution of the tests: The test details are required to be sufficient to be able to perform the tests again.
Index test and reference standard review bias: This occurs when the examiner is not blinded to the result of either the index or the reference standard when interpreting the results of the other test.
Uninterpretable test results: Lack of inclusion of uncertain test results can bias the assessment of the test characteristics.
Withdrawals: Lack of reporting of withdrawals from the study can introduce bias.

2.4. Data analysis

The objectives of the data analysis were guided by the recommendations of the Cochrane Methods Group on Systematic Review of Screening and Diagnostic Tests (1996):
To identify the number, quality and scope of primary studies;
To provide an overall summary of the diagnostic accuracy of tests studied;
To compare different tests in terms of their accuracy;
To determine if accuracy estimates depended on the study quality;
To determine whether accuracy varied in subgroups according to patient and test characteristics;
To determine further areas for research.

All studies with raw data available which were evaluated using QUADAS were included in the data analysis in order to assess the effect of study quality on the accuracy of the tests. The raw data included the numbers of true positives (disease present and positive index test), false negatives (disease present and negative index test), false positives (disease absent with positive index test) and true negatives (disease absent with negative index test). These values were extracted from the individual studies, presented in 2 × 2 contingency tables and inputted into Meta Disc (Zamora et al., 2006) to provide results of sensitivity, specificity, and positive and negative likelihood ratios (LR) with their confidence intervals.

2.5. Sensitivity, specificity and LR

Sensitivity is the proportion of patients with the pathology correctly identified by a positive index test (Peat et al., 2002), calculated using the formula TP/(TP + FN). Specificity is the proportion of patients without the pathology correctly diagnosed by a negative index test (Peat et al., 2002). This is calculated by TN/(FP + TN). LR are particularly relevant to clinicians, with the positive LR corresponding to the clinical concept of ruling in a condition and the negative LR to ruling out the condition. The advantages of the LR are that they do not vary as the underlying probability of the disease varies and that they provide richer information to the clinician (Ebell, 1998). Positive LR are calculated using the formula sensitivity/(1 − specificity) and negative LR are calculated using (1 − sensitivity)/specificity. Where the raw data were unavailable to calculate the above results, the studies were excluded from data analysis (Guanche and Jones, 2003; Myers et al., 2005; Nakagawa et al., 2005; Parentis et al., 2006). Values of sensitivity, specificity and positive and negative LR from these studies are reported in Table 4 where available. The sensitivity and specificity values for each test in each study were evaluated in relation to each other in a receiver operating characteristic (ROC) plot.
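The formulas for sensitivity, specificity and LR given in this section can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class and function names are hypothetical, the counts in the usage note are invented, and the code is not the Meta Disc implementation. The final function applies the standard odds form of Bayes' theorem to show how an LR moves a pre-test probability to a post-test probability.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TwoByTwo:
    """Counts from a 2 x 2 table of index test against reference standard.

    The field names are illustrative and do not come from the review.
    """
    tp: int  # disease present, index test positive
    fn: int  # disease present, index test negative
    fp: int  # disease absent, index test positive
    tn: int  # disease absent, index test negative


def sensitivity(t: TwoByTwo) -> float:
    # TP / (TP + FN)
    return t.tp / (t.tp + t.fn)


def specificity(t: TwoByTwo) -> float:
    # TN / (FP + TN)
    return t.tn / (t.fp + t.tn)


def positive_lr(t: TwoByTwo) -> float:
    # sensitivity / (1 - specificity); raises ZeroDivisionError when
    # specificity is exactly 1 (no false positives).
    return sensitivity(t) / (1 - specificity(t))


def negative_lr(t: TwoByTwo) -> float:
    # (1 - sensitivity) / specificity; raises ZeroDivisionError when
    # specificity is exactly 0 (no true negatives).
    return (1 - sensitivity(t)) / specificity(t)


def post_test_probability(pre_test_probability: float, lr: float) -> float:
    # Convert probability to odds, multiply by the LR, convert back.
    pre_odds = pre_test_probability / (1 - pre_test_probability)
    post_odds = pre_odds * lr
    return post_odds / (1 + post_odds)
```

With invented counts such as TwoByTwo(tp=45, fn=5, fp=10, tn=90), sensitivity and specificity both come out at 0.9, the positive LR at about 9 and the negative LR at about 0.11. A pre-test probability of 0.5 combined with an LR of 9 gives a post-test probability of about 0.9.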
Analysis was carried out according to the quality of the studies such that it was possible to determine from the graph (Fig. 1) which were the most valid and accurate tests. The ROC plot (Fig. 1) indicates the relationship between the true positive rate (TPR) and the false positive rate (FPR) of each test. Using Sackett’s (1992) rule of SpPin and SnNout, tests which are in the top left corner show high accuracy, with high sensitivity and high specificity. This means that the tests in this area are useful for ruling a disorder in when positive and for ruling a disorder out when negative. Tests close to the diagonal line discriminate between positive and negative results no better than chance. LR were presented in forest plots (Figs. 2 and 3). An overall summary of the diagnostic accuracy of individual tests in varying population groups was provided using summary receiver operator curves (SROCs) (Figs. 4 and 5).

3. Results

3.1. Study selection

Searches retrieved 1924 references from which, following assessment against the review inclusion/exclusion criteria, 19 articles were obtained for closer examination. Four studies were excluded (Bennett, 1998; Berg and Ciullo, 1998; Holtby and Razmjou, 2004; Liu et al., 1996b) (see Fig. 6).

3.2. Quality assessment

Fifteen studies were included in the review, evaluating various tests for labral pathology (Table 2). Results of quality assessment are presented in Table 3. The major limitations in the quality of the studies related to spectrum and selection bias, verification bias, clinical test review bias, diagnostic test review bias, reporting of reference test details, availability of clinical information and information regarding disease progression between tests. Strengths in the studies related to avoidance of incorporation bias, index test details and avoidance of withdrawal bias. Uninterpretable results were explained in the majority of studies.

Fig. 1. Plot in receiver operating characteristic space of estimates of the FPR and TPR of 11 individual tests used to detect labral pathology on clinical examination. These tests are represented in relation to the quality of the studies according to key sources of bias in diagnostic accuracy studies. Marker categories: 2 or fewer key sources of bias; 3 key sources of bias; 5 or more key sources of bias present or unclear. AC1, Active Compression (O’Brien 1998); AC2, Active Compression (Stetson 2002); AC3, Active Compression (McFarland 2002); C1, Crank (Mimori 1999); C2, Crank (Liu 1996); C3, Crank (Stetson 2002); AS1, Anterior Slide (Kibler 1995); AS2, Anterior Slide (McFarland 2002); BLI, Biceps Load I (Kim 1999); BLII, Biceps Load II (Kim 2001); NPPT, New Pain Provocation (Mimori 1999); IRRT, Internal Rotation Resistance (Zaslav 2001); K, Kim (Kim 2005); J, Jerk (Kim 2005); PIS, Posterior Impingement (Meister 2005); CR, Compression Rotation (McFarland 2002).
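As a rough illustration of how a test is placed in the ROC space of Fig. 1, the sketch below converts a sensitivity and specificity pair into (FPR, TPR) coordinates and measures how far the point sits above the chance diagonal. The function names are hypothetical. The use of Youden's index (TPR − FPR) is an assumption added here for illustration and is not a statistic reported by the review.

```python
def roc_point(sensitivity: float, specificity: float) -> tuple[float, float]:
    """Return the (FPR, TPR) coordinates of a test in ROC space."""
    fpr = 1.0 - specificity  # false positive rate
    tpr = sensitivity        # true positive rate
    return fpr, tpr


def youden_index(sensitivity: float, specificity: float) -> float:
    """Vertical distance of the ROC point above the chance diagonal.

    0 means the point lies on the diagonal (no better than chance);
    1 means the point lies in the top left corner.
    """
    fpr, tpr = roc_point(sensitivity, specificity)
    return tpr - fpr


# Illustrative values only, not taken from the included studies.
if __name__ == "__main__":
    for label, sens, spec in [("near top left", 0.95, 0.95),
                              ("near diagonal", 0.55, 0.50)]:
        print(label, roc_point(sens, spec), round(youden_index(sens, spec), 2))
```

A point close to the top left corner has an index near 1, which matches the high sensitivity and high specificity profile described in the methods. A point near the diagonal has an index near 0.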

3.3. Relationship of test results to study quality Lack of raw data from the primary studies meant that tests from only 11 of the fifteen studies were included in the data analysis (Fig. 1). Results demonstrating the plotting of the tests in ROC space in relation to the quality of the studies using Lijmer et al.’s (1999) and Whiting et al.’s (2004) evidence of key sources of bias which can overestimate the accuracy of tests are presented in Fig. 1. These key sources of bias are case control design, partial and differential verification bias, absent/inappropriate reference standard, clinical test and diagnostic test review bias, and availability of clinical information. Tests with high TPRs (sensitivity) against minimal FPRs (1 specificity) with minimal bias (red) in the studies demonstrate an optimum profile (Fig. 1). These were the Biceps Load tests I and II (Kim et al., 1999, 2001), the IRRT (Zaslav, 2001), the Crank test (Liu et al., 1996a) as well

Kibler 1995 (AS) Kim 1999 (BLI ) Kim 2001 (BLII) Kim 2005 (K) Kim 2005 (J) Liu 1996 (C) Meister 2004 (PIS) McFarland 2002 (CR) McFarland 2002 (AS) McFarland 2002 (AC) Mimori 1999 (NPPT) Mimori 1999 (C) O’Brien 1998 (AC) Steson 2002 (C) Stetson 2002 (AC) Zaslav 2001 (IRRT)

0.01

1

4.26 (2.16-8.39) 29.09 (7.34- 115.27)* 26.32 (8.61-80.45)* 12.62 (6.54-24.35)* 34.71 (11.10-108.55)* 13.59 (3.55-52.10)* 5.03 (1.75-14.46) 0.99 (0.50-1.94) 0.49 (0.16-1.47) 1.05 (0.73 -1.49) 7.17 (1.62-31.78) 1.06 (0.61-1.83) 43.59 (15.47-122.84) 1.06 (0.51-1.18)* 0.78 (0.51-1.18)* 24.77 (8.08-75.90)*

100.0

Positive LR Fig. 2. Positive Likelihood Ratios with 95% CI. Positive likelihood ratios demonstrating large and conclusive changes from pre-test to post test probability are marked in bold (Jaeschke 1994). * Studies identified with 2 or less items of key bias using the QUADAS tool.

W. Munro, R. Healy / Manual Therapy 14 (2009) 119e130

123

Kibler 1995 (AS) 0.26 (0.17-0.41) Kim 1999 (BLI ) 0.09 (0.01-0.61)* Kim 2001 (BLII) 0.11 (0.04 -0.27) Kim 2005 (K) 0.21 (0.10-0.44) Kim 2005 (J) 0.27 (0.15-0.49) Liu 1996 (C) 0.10(0.03-0.30)* Meister 2004 (PIS) 0.29 (0.17-0.49) McFarland 2002 (CR) 1.00 (0.81-1.25) McFarland 2002 (AS) 1.10 (0.99-1.22) McFarland 2002 (AC) 0.96 (0.70-1.32) Mimori 1999 (NPPT) 0.03 (0.00-0.39)* Mimori 1999 (C) 0.22 (0.07-0.71) O’Brien 1998 (AC) 0.01 (0.00-0.15) Steson 2002 (C) 0.95 (0.61-1.50)* Stetson 2002 (AC) 1.50 (0.80-2.81)* Zaslav 2001 (IRRT) 0.12 (0.04-0.35)*

0.01

1

100.0

Negative LR Fig. 3. Negative Likelihood Ratios with 95% CI. Negative likelihood ratios demonstrating large and conclusive changes from pre-test to post test probability are marked in bold (Jaeschke 1994). * Studies identified with 2 or less items of key bias using the QUADAS tool.

as the Kim and Jerk tests (Kim et al., 2005). These results are reinforced by the positive and negative LR (Figs. 2 and 3). The Biceps Load test I (Kim et al., 1999), Biceps Load test II (Kim et al., 2001), Crank test (Liu et al., 1996a), IRRT (Zaslav, 2001), the Kim and Jerk tests (Kim et al., 2005) and the AC test (O’Brien et al., 1998) had high positive LR (>10) showing large and conclusive changes from pre-test to post-test probability of the target disorder. The Biceps Load test I (Kim et al., 1999), New Pain Provocation test (NPPT) (Mimori et al., 1999), AC test (O’Brien et al., 1998), all had negative LR less than 0.1 again providing large and conclusive changes from pre-test to post-test probability of the target disorder. However O’Brien et al.’s (1998) study evaluating the AC test and Mimori et al.’s (1999) study evaluating the NPPT and the

Crank test were subject to many of the key biases suggested by Lijmer et al. (1999) and Whiting et al. (2004) to overestimate the accuracy of test results. Studies without raw data and therefore not inputted into Meta Disc demonstrated three (Myers et al., 2005; Parentis et al., 2006), two (Guanche and Jones, 2003) and one (Nakagawa et al., 2005) key sources of bias. Of these, only the Resisted Supination External Rotation (RSER) test (Myers et al., 2005) demonstrated sensitivity and specificity values over 80%.

3.4. Subgroup analysis

Tests evaluated in more than one study were the AC (O’Brien et al., 1998; McFarland et al., 2002; Guanche and

Fig. 4. SROC of the Crank test carried out in different studies (Liu et al., 1996a; Mimori et al., 1999; Stetson and Templin, 2002).


Fig. 5. SROC of the AC test carried out in different studies (O’Brien et al., 1998; McFarland et al., 2002; Stetson and Templin, 2002).

Jones, 2003; Myers et al., 2005; Nakagawa et al., 2005; Parentis et al., 2006), Anterior Slide (AS) (Kibler, 1995; McFarland et al., 2002; Nakagawa et al., 2005), Compression Rotation (CR) (McFarland et al., 2002; Nakagawa et al., 2005), Crank (Liu et al., 1996a; Mimori et al., 1999; Stetson and Templin, 2002; Myers et al., 2005; Nakagawa et al., 2005; Parentis et al., 2006), and the NPPT (Mimori et al., 1999; Parentis et al., 2006). Of these, only the Crank and AC tests had sufficient raw data to input into Meta Disc (Zamora et al., 2006) to provide SROCs. Within Meta Disc, 0.5 was added to all cells in the table to allow for calculation of statistics where there were 0 values in any cells as suggested by Cox (1970) cited in Zamora et al. (2006). These SROCs are demonstrated in Figs. 4 and 5. These graphs demonstrate the Crank test to be the better test, with the area under the curve being closer to 1, although there is one outlier. The area under the curve for the AC test is closer to 0.5 demonstrating this test to be no more discriminating between a positive and negative result than chance (Hopley and van Scalkwyk, 2007).

4. Discussion

The findings of the review are that 6 tests for labral pathology, which demonstrated both high sensitivity and specificity values and LR, were identified (Table 5). These were found to come from studies of moderately sound methodological quality and results provided convincing or moderately strong diagnostic accuracy (ranging between 91 and 96%). The tests were:

Biceps Load test I (Kim et al., 1999, n = 75);
Biceps Load test II (Kim et al., 2001, n = 127);
IRRT (Zaslav, 2001, n = 110);
Crank test (Liu et al., 1996a, n = 62);
Kim test (Kim et al., 2005, n = 172);
Jerk test (Kim et al., 2005, n = 172).

Confidence intervals for sensitivity and specificity were clinically acceptable (0.70–1.0), except for the Biceps Load test I and the Kim and Jerk tests which ranged between 0.54 and 1.00. Although encouraging, the results should be treated with caution as test accuracy has been based on single studies, with the tests performed by the people who developed them (and therefore expected to be unusually skilled) in specialist settings on people referred for surgery. It cannot be assumed that the tests will produce the same results when carried out by less skilled examiners in unselected populations. Where tests were evaluated by more than one study, the results were less consistent (Figs. 4 and 5, Table 4). Overestimation of results appears to occur in studies demonstrating key bias (Kibler, 1995; O’Brien et al., 1998; Mimori et al., 1999) and where there are skilled practitioners who have developed the tests (Kibler, 1995; Liu et al., 1996a; O’Brien et al., 1998). Variations in thresholds used, mean age of the population and quality of the studies may account for the differences in results between O’Brien et al. (1998), McFarland et al. (2002) and Stetson and Templin (2002) on evaluation of the AC test (Fig. 5). Similarly, on analysis of the Crank test, although the same index test description and threshold were used by all authors (Liu et al., 1996a; Mimori et al., 1999; Stetson and Templin, 2002), the main difference between the studies was the quality of the methodology and the mean age of the patients. Lower accuracy levels are apparent when a number of tests are evaluated at the same time (McFarland et al., 2002; Stetson and Templin, 2002; Guanche and Jones, 2003; Nakagawa et al., 2005; Parentis et al., 2006; Myers et al., 2005) and where there is a higher mean age (McFarland et al., 2002; Stetson and Templin, 2002; Guanche and Jones, 2003; Myers et al., 2005; Parentis et al., 2006). The IRRT (Zaslav, 2001), Kim and Jerk tests (Kim et al., 2005) in studies by their developers have however performed well on patients with an older mean age. Unsurprisingly, other tests when re-evaluated

Fig. 6. Flow chart of literature search.

Articles retrieved (n=1924)
  Excluded: not specific to area of review (n=1902)
  Excluded: systematic reviews (n=3)
Articles obtained for closer examination (n=19)
  Excluded: studies of tests not traditionally used to detect labral pathology (n=3)
  Excluded: studies omitting values of either sensitivity or specificity (n=1)
Studies identified for quality appraisal (n=15)
  Studies with 2 or less key sources of bias (n=9):
    Abduction Inferior Stability (n=1), Active Compression (n=3), Anterior Slide (n=1), Biceps Load I (n=1), Biceps Load II (n=1), Crank (n=4), Clunk (n=1), Compression Rotation (n=1), Forced Abduction (n=1), Internal Rotation Resistance test (n=1), Jerk (n=1), Kim (n=1), Posterior Jerk (n=1)
  Studies with 2 key sources of bias (n=4):
    Active Compression (n=3), Anterior Slide (n=2), Compression Rotation (n=1), Crank (n=2), New Pain Provocation (n=1), Posterior Impingement (n=1), Resisted Supination External Rotation (n=1)
  Studies with 5 or more key sources of bias (n=3):
    Active Compression (n=1), Anterior Slide (n=1), Crank (n=1), New Pain Provocation (n=1)

in this age group have not been so accurate, as patients are more likely to have co-existing pathologies such as rotator cuff pathology and gleno-humeral arthritis. All the studies came from specialist settings (people referred to hospital consultants for surgery), making the studies subject to selection bias. It cannot be assumed that similar results would be seen in a population with a wider spectrum of shoulder problems of lesser severity and co-existence of pathologies, such as that found in primary care. In all of the studies, doctors performed the testing but none reported the experience of the examiners. Therefore the results cannot be assumed to be applicable to other health care professionals, such as physiotherapists, or doctors regardless of their level of experience.

4.1. Considerations for future research

In considering the results of this review, the analytical methods must be considered. Devillé et al. (2002) advocates

Table 2 Tests reviewed.

Abduction Inferior Stability test (ABIS)
Active Compression (AC)
Anterior Slide (AS)
Biceps Load I (BLI)
Biceps Load II (BLII)
Compression Rotation (CR)
Clunk (Cl)
Crank (C)
Forced Abduction (FA)
Internal Rotation Resistance (IRRT)
Jerk (J)
Kim (K)
New Pain Provocation (NPPT)
Posterior Impingement Sign (PIS)
Resisted Supination External Rotation (RSER)

Table 3 Quality assessment.

Study characteristics

Study (first author) | Design | Sample size | Mean age | % Males | Test details
Guanche 2003 | P, C | 61 | 38 | 77 | Crank, Active Compression
Kibler 1995 | U, U | 226 | 24.6 | 75 | Anterior Slide
Kim 1999 | P, C | 75 | 24.8 | 85 | Biceps Load I
Kim 2001 | P, C | 127 | 30.6 | 70 | Biceps Load II
Kim 2005 | U, C | 172 | 43 | 61 | Kim, Jerk
Liu 1996a | P, U | 62 | 28 | 65 | Crank
McFarland 2002 | R, C | 426 | 44 | 42 | Compression Rotation, Anterior Slide, Active Compression
Meister 2004 | P, C | 69 | 23 | U | Posterior Impingement
Mimori 1999 | P, U | 32 | 20.9 | 94 | New Pain Provocation, Crank
Myers 2005 | P | 40 | 23.9 | 97.5 | Resisted Supination External Rotation, Crank, Active Compression
Nakagawa 2005 | P | 54 | 23 | 96 | Forced Abduction, Compression Rotation, Abduction Inferior Stability, Clunk, Anterior Slide, Crank, Active Compression, Posterior Jerk
O’Brien 1998 | P | 268 | U | U | Active Compression
Parentis 2006 | C P C | 132 | 45 | 74 | Active Compression, Anterior Slide, New Pain Provocation, Crank
Stetson 2002 | P, NR | 65 | 45.9 | 73 | Active Compression, Crank
Zaslav 2001 | P, C | 110 | 44 | 58 | Internal Rotation Resistance

Quality checks

Study (first author) | Reference test | Appropriate spectrum | Inclusion, exclusion criteria | Appropriate reference test | Disease progression bias avoided | Partial verification bias avoided | Differential verification bias avoided | Incorporation bias avoided | Index test details | Reference test details | Test review bias avoided | Diagnostic test review bias avoided | Clinical info available | Uninterpretable results explained | Withdrawal bias avoided
Guanche 2003 | A | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | U | U | U | Y
Kibler 1995 | A | N | U | Y | U | N | N | Y | Y | Y | U | U | U | N | Y
Kim 1999 | A | N | Y | Y | Y | Y | Y | Y | Y | N | Y | U | U | Y | Y
Kim 2001 | A | N | Y | Y | U | Y | Y | Y | Y | N | Y | U | N | Y | Y
Kim 2005 | A | Y | Y | Y | U | Y | Y | Y | Y | N | U | Y | U | Y | Y
Liu 1996a | A | N | Y | Y | U | Y | Y | Y | Y | Y | Y | U | U | Y | Y
McFarland 2002 | A | Y | Y | Y | Y | Y | Y | Y | Y | Y | U | U | U | Y | Y
Meister 2004 | A | N | N | Y | U | Y | Y | Y | Y | N | U | U | U | Y | Y
Mimori 1999 | A, M | N | Y | Y/N | U | N | N | Y | Y | Y | Y | U | U | Y | Y
Myers 2005 | A | N | N | Y | Y | Y | U | Y | Y | N | U | Y | U | U | U
Nakagawa 2005 | A | N | Y | Y | Y | Y | Y | Y | Y | N | Y | Y | U | U | U
O’Brien 1998 | R, M, CD, A | U | N | N | U | N | N | U | Y | N | U | U | N | Y | Y
Parentis 2006 | | Y | U | Y | U | Y | Y | Y | Y | N | U | U | U | Y | Y
Stetson 2002 | A | Y | N | Y | N | Y | Y | Y | Y | Y | Y | U | U | Y | Y
Zaslav 2001 | S | Y | Y | Y | U | Y | Y | Y | Y | N | Y | U | U | Y | Y

Design: P, prospective; U, unreported; C, consecutive; NR, non-randomised.
Reference test: A, arthroscopy; R, radiography; M, MRI; S, surgical findings; CD, clinical data.
Response to quality checks: Y, yes; N, no; U, unclear/unreported.
For glossary of terms, refer to Table 1.
Key sources of bias as identified by Lijmer et al. (1999) and Whiting et al. (2004) are: case control design, partial and differential verification bias, absent/inappropriate reference standard, clinical test and diagnostic test review bias and availability of clinical information.

Table 4 Results. Sensitivity (95% CI)a

Specificity (95% CI)a

O & SM

0.290

0.900

SLAP lesions

O & SM

Guanche 2003

Glenoid labral lesions

O

0.474 (0.310–0.642) 0.630

0.547 (0.495–0.599) 0.730

Active compression

Myers 2005

SLAP lesions

SM

0.778

0.111

Active Compression Active Compression

Nakagawa 2005 O’Brien 1998

SLAP lesions Labral tears

O & SM O & SM

Active Compression

Parentis 2006

SLAP lesions

O & SM

0.540 1.000 (0.933–1.000) 0.652

0.600 0.985 (0.957–0.997) 0.486

Active Compression

Stetson 2002

Labral tears

O & SM

Anterior Slide

Kibler 1995

SM

Anterior Slide

McFarland 2002

Superior glenoid labral tears SLAP lesions

O & SM

Anterior Slide

Nakagawa 2005

SLAP lesions

O & SM

0.538 (0.334–0.734) 0.784 (0.684–0.865) 0.079 (0.017–0.214) 0.050

Biceps Load I

Kim 1999

SLAP lesions

O

Biceps Load II

Kim 2001

SLAP lesions

O & SM

Clunk

Nakagawa 2005

SLAP lesions

O & SM

Compression Rotation Compression Rotation

McFarland 2002

SLAP lesions

O & SM

Nakagawa 2005

SLAP lesions

O & SM

Crank

Guanche 2003

Glenoid labral lesions

O

Crank

Liu 1996a

O & SM

Crank

Mimori 1999

Glenoid labral tears SLAP lesions

O

Crank

Myers 2005

SLAP lesions

SM

Crank

Nakagawa 2005

SLAP lesions

Crank

Parentis 2006

Crank

Stetson and Templin 2002 Nakagawa 2005

Test

First author

To evaluate

Setting

Abduction Inferior Stability

Nakagawa 2005

SLAP lesions

Active Compression

McFarland 2002

Active Compression

Forced Abduction

+LR (95% CI)a

−LR (95% CI)a

Accuracy (%)a 62.0

1.046 (0.735–1.489) 2.340

0.962 (0.702–1.319) 0.506

54.0

59.5 57.0 98.8

57.746 (20.432–163.20)

0.009 (0.001–1.149)

0.308 (0.170–0.476) 0.816 (0.657–0.923) 0.837 (0.796–0.873) 0.930

0.778 (0.550–1.175) 4.256 (2.161–8.385) 0.485 (0.160–1.472)

1.500 (0.801–2.810) 0.240 (0.16–0.36) 1.100 (0.992–1.220)

40.0

0.909 (0.587–0.998) 0.897 (0.758–0.971) 0.440

0.969 (0.892–0.996) 0.966 (0.904–0.993) 0.680

29.091 (7.342–115.27) 26.325 (8.613–80.455)

0.094 (0.014–0.608) 0.106 (0.042–0.269)

96.0

0.241 (0.103–0.435) 0.250

0.755 (0.700–0.805) 1.000

0.987 (0.501–1.945)

1.004 (0.809–1.246)

0.400

0.730

0.906 (0.750–0.980) 0.833 (0.516–0.979) 0.346

0.933 (0.779–0.992) 1.000 (0.292–1.000) 0.700

O & SM

0.580

0.720

SLAP lesions

O & SM

0.087

0.826

Labral tears

O & SM

0.955 (0.608–1.497)

O & SM

0.564 (0.396–0.722) 0.400

1.059 (0.612–1.831)

SLAP lesions

0.462 (0.266–0.666) 0.670

O

0.885 (0.698–0.976)

0.964 (0.899–0.993)

24.769 (8.083–75.902)

0.120 (0.041–0.347)

94.5

O

0.733 (0.541–0.877)

0.979 (0.940–0.996)

34.711 (11.099–108.55)

0.272 (0.150–0.493)

93.6

O

0.800 (0.614–0.923)

0.937 (0.883–0.971)

12.622 (6.543–24.351)

0.214 (0.104–0.437)

91.3

O

1.000 (0.846–1.000) 0.174

0.900 (0.555–0.997) 0.899

7.174 (1.619–31.782)

0.025 (0.002–0.394)

96.8

5.034 (1.752–14.463)

0.288 (0.170–0.487)

95.8

85.8 76.8 54.0

94.5 57.0 70.6 63.0

1.481

0.821

13.594 (3.547–52.099) 6.462 (0.477–87.549)

0.100 (0.034–0.296) 0.220 (0.068–0.711)

91.9 86.6 44.4 66.0 33.8 67.0

Internal Rotation Resistance

Zaslav 2001

Jerk

Kim 2005

Kim

Kim 2005

New Pain Provocation New Pain Provocation

Mimori 1999

Differentiate intra-articular pathology from impingement Posterior–inferior labral lesion Posterior–inferior labral lesion SLAP lesions

Parentis 2006

SLAP lesions

O & SM

Posterior Impingement Sign Posterior Jerk

Meister 2004

O & SM

0.755 (0.611–0.867)

0.850 (0.621–0.968)

Nakagawa 2005

Posterior labral tears and rotator cuff tears SLAP lesions

O & SM

0.250

0.800

56.0

Myers 2005

SLAP lesions

SM

0.828

0.818

82.5

Resisted Supination External Rotation

O, orthopaedic; SM, sports medicine. Shading indicates tests not included in data analysis due to lack of availability of raw data from the primary studies. a Gaps indicate that confidence intervals and values are not calculable due to lack of raw data.
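The footnote notes that confidence intervals cannot be calculated without raw data. As an illustration of why the counts are needed, the sketch below computes an exact (Clopper–Pearson) 95% interval for a proportion by bisection on the binomial distribution. The choice of method is an assumption, since the review does not state which interval its software used, and the counts (10 true positives among 11 diseased shoulders) are hypothetical, chosen only because they give a proportion that rounds to 0.909.

```python
from math import comb


def binomial_cdf(k: int, n: int, p: float) -> float:
    """Return P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k + 1))


def _bisect_increasing(func, target: float, iterations: int = 60) -> float:
    """Find p in [0, 1] with func(p) == target, for func increasing in p."""
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if func(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def exact_binomial_ci(successes: int, trials: int, confidence: float = 0.95):
    """Clopper-Pearson interval for a proportion (e.g. sensitivity)."""
    if trials <= 0 or not 0 <= successes <= trials:
        raise ValueError("need 0 <= successes <= trials and trials > 0")
    alpha = 1.0 - confidence
    if successes == 0:
        lower = 0.0
    else:
        # P(X >= successes | p) rises with p; the lower bound is where it equals alpha/2.
        lower = _bisect_increasing(
            lambda p: 1.0 - binomial_cdf(successes - 1, trials, p), alpha / 2.0
        )
    if successes == trials:
        upper = 1.0
    else:
        # P(X <= successes | p) falls with p; the upper bound is where it equals alpha/2.
        upper = _bisect_increasing(
            lambda p: 1.0 - binomial_cdf(successes, trials, p), 1.0 - alpha / 2.0
        )
    return lower, upper


# Hypothetical counts: 10 true positives among 11 diseased shoulders.
sensitivity_ci = exact_binomial_ci(10, 11)
print(f"sensitivity 10/11, 95% CI {sensitivity_ci[0]:.3f} to {sensitivity_ci[1]:.3f}")
```

For these assumed counts the interval comes out close to the 0.587 to 0.998 printed beside 0.909 in Table 4, which is consistent with an exact method having been used but does not confirm it.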


Table 5 Test descriptions.

Biceps Load test I (BLI) to identify SLAP lesions
Description: Patient is supine. Examiner sits adjacent to patient on the same side as affected arm, grasps wrist and elbow. Arm is abducted to 90° with the forearm supinated. External rotation is applied to the point of apprehension. Patient is asked to flex the elbow while the examiner resists with one hand and observes for any change in symptoms.
Threshold: Positive if apprehension increases or pain reproduced. Negative if apprehension decreases or discomfort decreases.

Biceps Load test II (BLII) to identify SLAP lesions
Description: The arm is elevated to 120° and externally rotated to its maximum point. The elbow is positioned in 90° flexion with the forearm supinated. The patient is asked to flex the elbow against resistance.
Threshold: Test is positive if there is pain during resisted elbow flexion or if there is an increase in the pain already present. Test is negative if no pain is elicited or if the pre-existing pain is unchanged or diminished by resisted elbow flexion.

Internal Rotation Resistance test (IRRT) to differentiate between intra-articular pathology and impingement
Description: Standing, the examiner is behind the patient. Arm in 90° abduction in coronal plane and 80° ER. Manual isometric muscle test for ER compared with that for internal rotation.
Threshold: Positive if patient with positive impingement has good strength ER with weakness IR; this is predictive of non-outlet impingement.

Kim test (K) to identify postero-inferior labral lesions
Description: Patient is sitting. With the arm in 90° of abduction, the examiner holds the elbow and lateral aspect of the proximal arm and a strong axial load is applied. The arm is moved into 45° of elevation while the axial force is maintained and a posterior and inferior force is applied to the proximal arm.
Threshold: A positive test is indicated by a sudden onset of posterior shoulder pain, regardless of a clunk.

Jerk test (J) to identify postero-inferior labral lesions
Description: Patient is sitting. The scapula is stabilised by the examiner with one hand and the patient’s arm is abducted to 90° and internally rotated 90°. An axial force is applied with the examiner’s other hand holding the elbow and a simultaneous horizontal adduction movement is applied.
Threshold: A positive test is indicated by a sharp pain with or without a clunk or click.

Crank test (C) to identify glenoid labral tears
Description: Patient is upright with the arm elevated to 160° in the scapular plane. Joint load is applied along the axis of the humerus with one hand whilst the other hand performs humeral rotation. The test can be repeated in supine.
Threshold: A positive test is indicated during the manoeuvre (usually external rotation) with or without a click or if there is reproduction of symptoms as felt by the patient during overhead or work activities.

the construction of 2 × 2 tables by extracting the raw data for data analysis and the reporting of pairs of complementary outcome measures. Although this was performed in the majority of studies, several studies (Guanche and Jones, 2003; Myers et al., 2005; Nakagawa et al., 2005; Parentis et al., 2006) omitted raw data. More sophisticated and accurate techniques such as reporting the relationship between TPR and FPR in ROC space and reporting LR are less frequently used. This is unfortunate, as LR have been identified as the most powerful results to judge the clinical utility of a test (Hayden and Brown, 1999), and it has been suggested that authors may overstate the value of test results in the absence of LR (Honest and Khan, 2002). Several of the studies included in this review demonstrate flaws consistent with sources identified by Lijmer et al. (1999) and Whiting et al. (2004) to overestimate diagnostic accuracy, which may have inflated their estimations of accuracy. Future research needs to address these methodological issues in order to provide confidence in the results. The quality of appraisal of study methodologies is often hindered by insufficient detail in the reporting (Devillé et al., 2002; Mallet et al., 2003), and so it was found in this review in relation to availability of clinical information and demographic details of the study populations.
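The dependence of these paired outcome measures on the raw 2 × 2 table can be sketched in a few lines. The example below is a minimal illustration, not the procedure of any particular software: the class name, field names and all counts are hypothetical. The 0.5 continuity correction follows the review's own description of adding 0.5 to all cells when any cell is zero.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TwoByTwo:
    """Raw counts of index test result against reference standard."""
    tp: float  # test positive, disorder present
    fp: float  # test positive, disorder absent
    fn: float  # test negative, disorder present
    tn: float  # test negative, disorder absent

    def with_continuity_correction(self, increment: float = 0.5) -> "TwoByTwo":
        """Add `increment` to every cell, but only if some cell is zero."""
        if 0 in (self.tp, self.fp, self.fn, self.tn):
            return TwoByTwo(
                self.tp + increment,
                self.fp + increment,
                self.fn + increment,
                self.tn + increment,
            )
        return self


def summarise(table: TwoByTwo) -> dict:
    """Sensitivity, specificity, likelihood ratios and accuracy from raw counts."""
    t = table.with_continuity_correction()
    sensitivity = t.tp / (t.tp + t.fn)
    specificity = t.tn / (t.tn + t.fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "positive_lr": sensitivity / (1.0 - specificity),
        "negative_lr": (1.0 - sensitivity) / specificity,
        "accuracy": (t.tp + t.tn) / (t.tp + t.fp + t.fn + t.tn),
    }


# Hypothetical counts, not taken from any study in the review.
ordinary = summarise(TwoByTwo(tp=45, fp=10, fn=5, tn=90))
zero_cell = summarise(TwoByTwo(tp=20, fp=2, fn=0, tn=78))

for name, result in (("ordinary", ordinary), ("zero cell", zero_cell)):
    print(name, {key: round(value, 3) for key, value in result.items()})
```

Without the correction, the second table would give a negative LR of exactly zero and, had the empty cell been a false positive, an undefined positive LR. This is one reason why studies that report only sensitivity and specificity, without raw counts, cannot be analysed fully.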

5. Limitations to the review

Like all systematic reviews, the results are dependent on the articles identified in the searches. The authors undertook exhaustive searches of the published English-language literature; however, searches for unpublished studies and foreign language studies were not carried out, which means a few relevant papers may have been missed. The power of the analysis is also dependent on the number of studies identified for each individual test. This was low; for many tests there was only one paper on which to base the analysis and hence the SROCs of the Crank and the AC tests may be of limited value. The authors’ decision to include case control studies could be seen as a weakness as this is not the optimal design. However, it was felt necessary to include for quality appraisal studies which have regularly been reported in narrative reviews to have high accuracy (Kibler, 1995; O’Brien et al., 1998).

6. Conclusion

There is limited evidence from single well carried out studies to suggest that the Biceps Load tests I and II, the IRRT, the Kim test and the Jerk test are accurate in differentiating


labral pathology from other pathologies in selected populations. However other tests for labral pathology (AC, AS and Crank) when re-evaluated in studies not carried out by the developers of the tests have not produced such accurate results. There is a need therefore for further evaluation of labral pathology tests to see whether these tests are as accurate when carried out in different populations by less skilled examiners. Physiotherapists working in extended roles are in an ideal position to do this. Further to this, future research needs to address the key sources of bias relative to diagnostic test screening and provide more detailed demographic information and adequate raw data in order to produce clinically relevant LR and allow for results to be analysed fully.

Acknowledgements

The authors would like to thank Dr Sarah Tyson from the University of Salford for her comments assisting with the redrafting of this review.

References

Altchek DW, Warren RF, Wickiewicz TL. Arthroscopic labral debridement: a three year follow up study. The American Journal of Sports Medicine 1992;20(6):702–6.
Altman DG. Inter-rater agreement, Practical statistics for medical research. 1st ed. London: Chapman & Hall; 1999. 14.3, p. 403–8.
Andrews JR, Carson WG, McLeod WD. Glenoid labrum tears related to the long head of biceps. The American Journal of Sports Medicine 1985;13:337–41.
Bennett WF. Specificity of the Speed’s test: arthroscopic technique for evaluating the biceps tendon at the level of the bicipital groove. Arthroscopy 1998;14(8):789–96.
Berg EE, Ciullo JV. A clinical test for superior glenoid labral or ‘SLAP’ lesions. Clinical Journal of Sport Medicine 1998;8(2):121–3.
Davis P, Fitzgerald A, Alderson P. Feasibility of the QUADAS tool for quality assessment of diagnostic studies in guideline development. In: 4th Annual guidelines international network conference. http://www.g-i-n.net/download/files/b33_davies.pdf; 2007 [accessed 25.01.08].
Deeks JJ, Morris JM. Evaluating diagnostic tests. Bailliere’s Clinical Obstetrics and Gynaecology 1996;10(4):613–30.
Devillé WLJM, Bezemer PD, Bouter LM. Publications on diagnostic test evaluation in family medicine journals: an optimal search strategy. Journal of Clinical Epidemiology 2000;53:65–9.
Devillé WLJM, Buntinx F, Bouter LM, Montori VM, De Vet HC, Van Der Windt DA, et al. Conducting systematic reviews of diagnostic studies: didactic guidelines. BMC Medical Research Methodology:1, http://www.biomedcentral.com/content/pdf/1471-2288-2-9.pdf, 2002;2 [accessed 26.06.07].
Ebell MH. An introduction to information mastery: reading an article about diagnosis. Department of Family Medicine, Michigan State University, http://www.poems.msu.edu/InfoMastery/Diagnosis/Diagnosis.htm; 1998 [accessed 12.03.07].
Green S, Buchbinder R, Glazier R, Forbes A. Interventions for shoulder pain (Cochrane review), The Cochrane Library. Chichester, UK: John Wiley & Sons, Ltd; 2003. Issue 4.
Guanche C, Jones DC. Clinical testing of the glenoid labrum, arthroscopy. The Journal of Arthroscopic and Related Surgery 2003;19(5):517–23.
Hanchard N, Cummins J, Jeffries C. Evidence based clinical guidelines for the diagnosis, assessment and physiotherapy management of shoulder impingement syndrome. London, UK: Chartered Society of Physiotherapy; 2004. Section 7, p. 41–56.
Hasan SA. Superior labral lesions. Emedicine, http://www.emedicine.com/orthoped/topic317.htm 2006 [accessed 26.06.07].
Hayden SR, Brown MD. Likelihood ratio: a powerful tool for incorporating the results of a diagnostic test into clinical decision making. Annals of Emergency Medicine 1999;33(5):575–80.
Holtby R, Razmjou H. Accuracy of the Speed’s and Yergason’s tests in detecting biceps pathology and slap lesions: comparison with arthroscopic findings. Arthroscopy: The Journal of Arthroscopic and Related Surgery 2004;20(3):231–6.
Hollingworth W, Lenkinski R, Shibata DK, Bernal B, Zurakowski D, Comstock B, et al. Interrater reliability in assessing quality of diagnostic accuracy studies using the QUADAS tool: a preliminary assessment. Academic Radiology 2006;13(7):803–10.
Honest H, Khan KS. Reporting of measures of accuracy in systematic reviews of diagnostic literature. BMC Health Services Research(4), http://www.biomedcentral.com/1472-6963/2/4, 2002;2 [accessed 26.06.07].
Hopley L, van Scalkwyk J. The magnificent ROC (receiver operating characteristic curve), http://www.anaesthetist.com/mnm/stats/roc/Findex.htm; 2007 [accessed 12.03.08].
Jaeschke R. Users guide to the medical literature III. How to use an article about a diagnostic test A. Are the results of the study valid? JAMA 1994;271(5):389–91.
Jones GL, Galluch DB. Clinical assessment of superior glenoid labral lesions: a systematic review. Clinical Orthopaedics and Related Research 2007;455:45–51.
Kibler WB. Specificity and sensitivity of the anterior slide test in throwing athletes with superior glenoid labral tears. Arthroscopy 1995;11(3):296–300.
Kim S, Ha K, Han K. Biceps load test: a clinical test for superior labrum anterior and posterior lesions in shoulders with recurrent anterior dislocations. The American Journal of Sports Medicine 1999;27(3):300–3.
Kim SH, Ha KI, Ahn JH, Choi HJ. Biceps load test II: a clinical test for SLAP lesions of the shoulder. Arthroscopy 2001;17(2):160–4.
Kim SH, Park JS, Jeong WK, Shin SK. The Kim test: a novel test for posteroinferior labral lesion of the shoulder – a comparison to the Jerk test. The American Journal of Sports Medicine 2005;33:1188–92.
Kumar VP, Satku K, Balasubramaniam P. The role of the long head of biceps brachii in the stabilization of the head of the humerus. Clinical Orthopaedics 1989;244:172–5.
Lijmer JG, Moll WM, Heisterkamp S, Bonsel GJ, Prins MH, Van der Meulen J, et al. Empirical evidence of design-related bias in studies of diagnostic tests. JAMA 1999;282(11):1061–6.
Liu SH, Henry MH, Nuccion S, Shapiro MS, Dorey F. Diagnosis of glenoid labral tears. A comparison between magnetic resonance imaging and clinical examinations. The American Journal of Sports Medicine 1996;24(2):149–54.
Liu SH, Henry MH, Nuccion SL. A prospective evaluation of a new physical examination in predicting glenoid labral tears. The American Journal of Sports Medicine 1996;24(6):721–5.
Luime JJ, Verhagen AP, Miedema HS, Kuiper JI, Burdorf A, Verhaar JAN, et al. Does this patient have an instability of the shoulder or a labrum lesion? JAMA 2004;292(16):1989–99.
Maffet MW, Gartsman GM, Moseley B. Superior labrum–biceps tendon complex lesions of the shoulder. The American Journal of Sports Medicine 1995;23(1):93–8.
Mallet S, Summerton N, Deeks J, Halligan S, Altman D. Systematic reviews of diagnostic tests in cancer: assessment of methodology and reporting quality [abstract]. In: XI Cochrane colloquium: evidence, health care and culture; 2003, Oct 26–31. Barcelona, Spain.
McFarland EG, Kim TK, Savino RM. Clinical assessment of three common tests for superior labral anterior–posterior lesions. The American Journal of Sports Medicine 2002;30(6):810–5.
Meister K, Buckley B, Batts J. The posterior impingement sign: diagnosis of rotator cuff and posterior labral tears secondary to internal impingement in overhand athletes. The American Journal of Orthopaedics 2004;33(8):412–5.
Mimori K, Muneta T, Nakagawa T, Shinomiya K. A new pain provocation test for superior labral tears of the shoulder. The American Journal of Sports Medicine 1999;27(2):137–42.
Musgrave DS, Rodosky MW. SLAP lesions: current concepts. The American Journal of Orthopaedics 2001;1:29–38.
Myers TH, Zemanovic JR, Andrews JR. The resisted supination external rotation test. The American Journal of Sports Medicine 2005;33(9):1315–20.
Nakagawa S, Yoneda M, Hayashida K, Obata M, Fukushima S, Miyazaki Y. Forced abduction and elbow flexion test: a new simple clinical test to detect superior labral injury in the throwing shoulder. The Journal of Arthroscopic and Related Surgery 2005;21(11):1290–5.
O’Brien SJ, Pagnani MJ, Fealy S, McGlynn SR, Wilson JB. The active compression test: a new and effective test for diagnosing labral tears and acromioclavicular joint abnormality. The American Journal of Sports Medicine 1998;26(5):610–3.
Parentis MA, Glousman RE, Mohr KS, Yocum LA. An evaluation of the provocative tests for superior labral anterior posterior lesions. The American Journal of Sports Medicine 2006;34:265–8.
Peat J, Mellis C, Williams K, Xuan W. Health science research, a handbook of quantitative methods. London: Sage Publications Ltd; 2002. Section 2, p. 237.
Reider B. Physical examination. The American Journal of Sports Medicine 2004;32(2):299–300.
Sackett DL. A primer on the precision and accuracy of the clinical examination. JAMA 1992;267(19):2638–44.
Schmitz MA. The recognition and treatment of superior labral anterior–posterior (SLAP) lesions in the shoulder. Medscape General Medicine(1), www.medscape.com/viewarticle/408488, 1999;1 [accessed 30.10.02].
Snyder SJ, Karzel RP, Del Pizzo W, Ferkel RD, Friedman MJ. SLAP lesions of the shoulder. Arthroscopy 1990;6:274–9.
Stetson WB, Templin K. The Crank test, the O’Brien test, and routine magnetic resonance imaging scans in the diagnosis of labral tears. American Journal of Sports Medicine 2002;30(6):806–9.
The Cochrane methods working group on systematic review of screening and diagnostic tests: recommended methods, http://www.nihs.go.jp/dig/cochrane/cochrane/sadtdoc1.htm; 1996 [accessed 26.06.07].
Whiting P, Rutjes AWS, Reitsma JB, Bossuyt PM, Kleijnen J. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Medical Research Methodology:25, http://www.biomedcentral.com/1471-2288/3/25, 2003;3 [accessed 26.06.07].
Whiting P, Rutjes AW, Reitsma JB, Glas AS, Bossuyt PM, Kleijnen J. Sources of variation and bias in studies of diagnostic accuracy: a systematic review. Annals of Internal Medicine 2004;140(3):189–202.
Whiting P, Harbord R, Kleijnen J. No role for quality scores in systematic reviews of diagnostic accuracy studies. BMC Medical Research Methodology:19, http://www.biomedcentral.com/1471-2288/5/19, 2005;5 [accessed 26.06.07].
Whiting PF, Weswood ME, Rutjes AWS, Reitsma JB, Bossuyt PNM, Kleijnen J. Evaluation of QUADAS, a tool for the quality assessment of diagnostic accuracy studies. BMC Medical Research Methodology:9, http://www/biomedcentral.com/1471-2288/6/9, 2006;6 [accessed 23.01.08].
Zamora J, Abraira V, Muriel A, Khan KS, Coomarasamy A. Meta-DiSc: a software for meta-analysis of test accuracy data. BMC Medical Research Methodology 2006;6:31.
Zaslav KR. Internal rotation resistance strength test: a new diagnostic test to differentiate intra-articular pathology from outlet (Neer) impingement syndrome in the shoulder. Journal of Shoulder & Elbow Surgery 2001;10(1):23–7.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 131–137 www.elsevier.com/math

Original Article

Physiotherapists’ treatment approach towards neck pain and the influence of a behavioural graded activity training: An exploratory study

Frieke Vonk a,*, Jan J.M. Pool b, Raymond W.J.G. Ostelo b, Arianne P. Verhagen a

a Department of General Practice, Erasmus MC, PO Box 2040, 3000 CA Rotterdam, The Netherlands
b Institute for Research in Extramural Medicine (EMGO), VU University Medical Centre, Amsterdam, The Netherlands

Received 15 April 2007; received in revised form 15 November 2007; accepted 21 December 2007

Abstract

Physiotherapists’ treatment approach might influence their behaviour during practice and, consequently, patients’ treatment outcome; however, an explicit description of the treatment approach is often missing in trials. The purpose of this prospective exploratory study was to evaluate whether the treatment approach differs between therapists who favour a behavioural graded activity (BGA) program, conservative exercise (CE) or manual therapy, and whether BGA training has influence on the treatment approach. Forty-two therapists participated. BGA therapists received a 2-day training. Treatment approach was measured at baseline and at 3-month follow-up, using the Pain Attitude and Beliefs Scale for Physiotherapists (PABS-PTs). By this method data on the adoption of biomedical or biopsychosocial approaches were generated. Differences were examined with analysis of variance (ANOVA) and independent Student’s t-test. Influence of the BGA training was examined with linear regression. At baseline, there were no significant differences between BGA, CE or manual therapists’ use of biomedical or biopsychosocial approaches, but there was a trend for BGA therapists to score higher on the biopsychosocial approach. At follow-up, their biopsychosocial score remained higher and their biomedical score was lower compared to CE therapists. Corrected regression analysis showed a 4.4 points (95%CI 7.9; 0.8) higher decrease for therapists who followed the BGA training compared to therapists who did not. Our results indicate no significant differences in treatment approach at baseline, and that BGA training might influence therapists’ treatment approach since the scores on the biomedical approach decreased.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Attitude; Treatment approach; Neck pain; Physiotherapists

* Corresponding author. Tel.: +31 10 4087550; fax: +31 10 4089491. E-mail address: [email protected] (F. Vonk).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2007.12.005
F. Vonk et al. / Manual Therapy 14 (2009) 131–137

1. Introduction In the Netherlands, neck pain is one of the three most reported musculoskeletal pains and entails considerable costs for health care (Picavet and Schouten, 2003). Because generally no specific underlying pathology can be found, neck pain is often designated as non-specific (Bogduk, 1984). When musculoskeletal pain cannot be explained by an obvious physical cause and when only few guidelines are available, treatment regimens may reflect the clinicians’ beliefs (Foster et al., 2003a). Therapists’ attitudes influence their actual behaviour, which could have implications for the effectiveness of the treatment (Rainville et al., 2000; Linton et al., 2002; Houben et al., 2004). An observational study showed that the treatment style of clinicians (concerning prescription of pain medication or bed rest) was related to treatment outcome in low back pain (Von Korff et al., 1994). Health care providers who were fear avoidant were also more likely to advise a patient to avoid painful movements (Linton et al., 2002). Further, it is argued that therapists’ allegiance and adherence to treatment protocols are plausible contributors to differences in treatment outcome (Morley and Williams, 2006). Therefore, understanding therapists’ beliefs or treatment approach seems fundamental in developing better ways of managing pain complaints (Foster et al., 2003b). Insight into therapists’ treatment approaches, and into whether or not training can modify them, could have implications for the education of therapists and for daily practice.

Two different treatment approaches are described in the literature. The first is the traditional biomedical approach, in which treatment is focused on pain caused by physiological pathology or impairment (Turk and Flor, 1984). Therapists who adopt it support a pain-contingent treatment approach, where treatment is guided by the amount of pain the patient experiences. The second is the biopsychosocial treatment approach, in which psychological and social factors are assumed to be important determinants in the development and maintenance of complaints, and in which pain can persist long after the initial pathology has healed. Therapists who adopt it support a time-contingent approach in which patients’ activities are systematically increased (Fordyce, 1976; Lindstrom et al., 1992).

To measure physiotherapists’ treatment approach, Ostelo et al. (2003) developed the questionnaire ‘Pain Attitudes and Beliefs Scale for Physiotherapists’ (PABS-PT), which was further validated by Houben et al. (2005b). From this questionnaire two categories can be generated: a biomedical approach and a biopsychosocial approach. The categories are not opposite ends of the same scale, but both are important in determining therapists’ treatment approach (Houben et al., 2005b). The questionnaire has been used to examine the treatment approach of different therapists, physiotherapy students, and general practitioners (GPs) (Houben et al., 2005a,b; Jellema et al., 2005). A recent review of five measurement tools for health care providers’ attitudes and beliefs concluded that the PABS-PT was one of the two tools that had undergone the most thorough testing to date (Bishop et al., 2007).

Although physiotherapists’ treatment approach may be important, an explicit description is often missing in trials. The aim of this exploratory study was to appraise the treatment approach of therapists in two ongoing trials (Vonk et al., 2004; Pool et al., 2006). We formulated three research questions. First, do therapists who favour a behavioural graded activity (BGA) program differ in their treatment approach from those therapists who favour conservative exercise (CE) or manual therapy? Second, does the primary specialisation (physiotherapy/manual therapy) influence the treatment approach? This influence is assumed because in the Netherlands certified manual therapists are specialised in manipulation techniques and are allowed to use them, whereas physiotherapists are not. Third, can BGA training, based on the principles of behavioural change as described by Fordyce (1976) and as applied by Lindstrom et al. (1992), influence therapists’ treatment approach?

2. Methods 2.1. Physiotherapists Therapists included in this study (n = 45) were involved in one of two ongoing randomised clinical trials (RCTs), i.e. Ephysion (Vonk et al., 2004) or the Neck Trial (Pool et al., 2006). In these trials a BGA program was compared with either conventional exercise (Ephysion) or manual therapy (Neck Trial) in sub-acute or chronic neck pain patients. Before assessment of the treatment approach, participating therapists could choose the treatment arm they were most comfortable delivering within the trial. As a result, both the BGA and the CE treatment arm in the Ephysion study consisted of both physiotherapists and manual therapists. Three therapists from the Ephysion study were excluded: two applied after baseline measurement and one did not complete the baseline measurement. The BGA therapists from the Neck Trial were excluded because their treatment approach was only assessed after the BGA training, so the influence of that training on their treatment approach could not be examined. The 42 remaining therapists consisted of 30 therapists from the Ephysion study (13 CE therapists and 17 BGA therapists) and 12 manual therapists from the Neck Trial (see Fig. 1). All participating manual therapists were certified and registered by the Royal Dutch Association for Physical Therapy (KNGF). After baseline measurement, the BGA therapists received a 2-day training on the BGA approach. The remaining therapists participated in a consensus meeting to standardise their treatments (Vonk et al., 2004; Pool et al., 2006).

[Fig. 1. Overview of the compilation of the groups of therapists analysed to answer the research questions. Flowchart: the Ephysion trial (n = 30) contributed CE therapists (n = 13, of which 3 are manual therapists) and BGA therapists (n = 17); the Neck Trial contributed manual therapists (n = 12). Research question 1 (Do therapists who favour a different treatment arm differ in their treatment approach?): BGA therapists (n = 17) vs. CE therapists (n = 13) vs. manual therapists (n = 12). Research question 2 (Does primary specialisation influence the treatment approach?): 15 manual therapists vs. 10 physiotherapists. Research question 3 (Could a BGA training influence therapists’ treatment approach?): BGA therapists (n = 17) vs. CE therapists (n = 13).]

2.2. Questionnaires First, therapists’ characteristics were measured by a questionnaire, including gender, age, primary specialisation, work setting, and years of working experience. Second, therapists’ treatment approach towards neck pain was measured with the PABS-PT (Houben et al., 2005b). The PABS-PT is a 19-item questionnaire developed by Ostelo et al. (2003) and further validated by Houben et al. (2005b). It was designed to determine physiotherapists’ treatment approach towards chronic low back pain. To make the questionnaire suitable for the present study we replaced ‘low back pain’ with ‘neck pain’. Therapists were asked to rate every item on a six-point Likert scale ranging from ‘totally disagree (1)’ to ‘totally agree (6)’. From this, two factors were generated, i.e. (1) a biomedical approach including 10 items, and (2) a biopsychosocial approach including nine items (Houben et al., 2005b). The score for each approach is the sum of its item ratings, ranging from 10 to 60 on factor 1 and from 9 to 54 on factor 2. Higher scores on factor 1 indicate a biomedical treatment approach, and higher scores on factor 2 indicate a biopsychosocial treatment approach.
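The scoring rule described above (two sums of six-point ratings, with ranges 10 to 60 and 9 to 54) can be sketched as follows. This is only an illustration: the text does not state which of the 19 items belong to which factor, so the item numbering used here (items 1 to 10 biomedical, items 11 to 19 biopsychosocial) and the function name `score_pabs_pt` are assumptions made for the example, not the published scoring key.

```python
from typing import Dict

# ASSUMPTION: the item-to-factor assignment below is hypothetical. The paper only
# states that factor 1 has 10 items and factor 2 has 9 items.
BIOMEDICAL_ITEMS = tuple(range(1, 11))        # 10 illustrative item numbers
BIOPSYCHOSOCIAL_ITEMS = tuple(range(11, 20))  # 9 illustrative item numbers


def score_pabs_pt(responses: Dict[int, int]) -> Dict[str, int]:
    """Sum six-point Likert ratings (1-6) into the two factor scores."""
    expected = set(BIOMEDICAL_ITEMS) | set(BIOPSYCHOSOCIAL_ITEMS)
    if set(responses) != expected:
        raise ValueError("expected ratings for all 19 items")
    if any(rating not in range(1, 7) for rating in responses.values()):
        raise ValueError("each rating must be an integer from 1 to 6")
    return {
        "biomedical": sum(responses[i] for i in BIOMEDICAL_ITEMS),
        "biopsychosocial": sum(responses[i] for i in BIOPSYCHOSOCIAL_ITEMS),
    }


# The extreme response patterns reproduce the score ranges given in the text.
lowest = score_pabs_pt({item: 1 for item in range(1, 20)})
highest = score_pabs_pt({item: 6 for item in range(1, 20)})
```

Rating every item 1 gives the minimum scores and rating every item 6 gives the maximum scores, which is a quick way to check that a scoring key has the right number of items per factor.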

2.3. Data collection The therapists in the Ephysion study received the PABS-PT twice: once at baseline (1 week before either the consensus meeting or the BGA training), and 3 months after the trial started. In the Neck Trial, therapists’ treatment approach was evaluated only 3 months after the trial started. Because the manual therapists from the Neck Trial showed no diﬀerences in demographics or characteristics compared with BGA and CE therapists and because they did not receive any training, their data were regarded as baseline data.

2.4. Statistical analysis 2.4.1. Research question 1 First, descriptive statistics (number, mean, and standard deviation, SD) were calculated for demographics and characteristics of the participating therapists. To examine baseline differences in treatment approach we calculated scores for the biomedical and biopsychosocial approach and tested them using a one-way analysis of variance (ANOVA, research question 1). Fig. 1 shows which therapists were compared per research question. For further exploration of research question 1, we calculated a global treatment attitude at baseline by dividing the scores on the biomedical and biopsychosocial treatment approaches into quartiles and then combining them. Five different global treatment attitudes were derived: (1) therapists were considered to have a purely biomedical treatment attitude when their score was in the highest quartile on the biomedical treatment approach and in the lowest quartile on the biopsychosocial treatment approach; (2) they were considered to have a more biomedical treatment attitude when their score on the biomedical treatment approach was one quartile higher than their biopsychosocial score. The same applies vice versa for a (3) ‘purely’ or (4) ‘more’ biopsychosocial treatment attitude; and (5) therapists were considered to have a neutral treatment attitude when they scored in the same quartile on both treatment approaches. The division into global attitudes is descriptive; no further statistical analyses were carried out because of the small sample size. 2.4.2. Research question 2 Because of educational differences, we assumed that primary specialisation (physiotherapy/manual therapy) could influence the treatment approach (research question 2). To examine this, the manual therapists from the CE treatment arm (n = 3) were added to the manual therapists (n = 12) of the Neck Trial. Then mean scores on the biomedical and biopsychosocial approach were calculated, and both groups were compared with an independent Student’s t-test (α = 0.05). 2.4.3. Research question 3 Finally, we evaluated whether BGA training could influence the treatment approach (research question 3). We calculated follow-up scores of the treatment approaches and the within-person changes between baseline and follow-up. Differences in follow-up scores were examined with independent Student’s t-tests and differences from baseline scores with dependent (paired) Student’s t-tests (α = 0.05). Then the possible influence of the BGA training on the within-person changes was evaluated with linear regression. Confounding was checked by separately adding variables that were assumed to influence the treatment approach. Variables were subsequently added to the multivariate model when they were related to both the BGA training (determinant) and the within-person change (outcome), and when they changed the regression coefficient of the BGA training by at least 10%; they were added in a block using the method ‘enter’. The examined variables were age (cut-off point 43 years, mean), gender, primary specialisation (physiotherapist/manual therapist), other trainings followed (biomedical/biopsychosocial training), experience of neck pain (yes/no), and work experience (cut-off point 18 years, mean) (Ostelo et al., 2003; Houben et al., 2005b).
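The global treatment attitude defined under research question 1 can be sketched as a small classifier. Several details are assumptions made for illustration: the text describes "more biomedical" as one quartile higher, and this sketch extends that to any non-extreme case in which the biomedical quartile exceeds the biopsychosocial quartile; a score equal to a cut point is placed in the higher quartile; and the function names and the cut points in the usage lines are hypothetical.

```python
from bisect import bisect_right
from typing import Sequence


def quartile(score: float, cut_points: Sequence[float]) -> int:
    """Return the quartile (1-4) of a score given three ascending cut points.

    ASSUMPTION: a score exactly on a cut point falls in the higher quartile.
    """
    if len(cut_points) != 3:
        raise ValueError("three cut points are needed to form four quartiles")
    return bisect_right(cut_points, score) + 1


def global_attitude(biomedical_q: int, biopsychosocial_q: int) -> str:
    """Combine two quartile indices into one of the five global attitudes."""
    if biomedical_q == 4 and biopsychosocial_q == 1:
        return "purely biomedical"
    if biopsychosocial_q == 4 and biomedical_q == 1:
        return "purely biopsychosocial"
    # ASSUMPTION: any remaining imbalance counts as a 'more ...' attitude.
    if biomedical_q > biopsychosocial_q:
        return "more biomedical"
    if biopsychosocial_q > biomedical_q:
        return "more biopsychosocial"
    return "neutral"


# Hypothetical cut points, used only to exercise the functions.
biomedical_cuts = (24.0, 27.0, 29.0)
biopsychosocial_cuts = (34.0, 36.5, 39.0)
example = global_attitude(
    quartile(30.0, biomedical_cuts), quartile(33.0, biopsychosocial_cuts)
)
```

Working on quartile indices rather than raw scores keeps the classification independent of the different score ranges of the two factors.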

3. Results 3.1. Research question 1 In total, 42 baseline questionnaires were completed. Table 1 presents the baseline demographics, characteristics and treatment approaches of the three treatment arms. There were no significant differences in characteristics between the therapists. The overall mean age was 43.7 (SD 8.3) years and overall work experience was 19.1 (SD 7.5) years. In general, BGA therapists scored lower on the biomedical approach and higher on the biopsychosocial approach compared to CE therapists and manual therapists. However, when tested with ANOVA, these differences were not significant for either the biomedical approach (p = 0.46) or the biopsychosocial approach (p = 0.14). The quartile borders (for calculating the global treatment attitude) lay at 24.2 and 29.0 points for the biomedical treatment approach and at 34.0 and 39.0 points for the biopsychosocial treatment approach, respectively. With these, the therapists were divided into five global treatment attitudes (Table 2). Table 2 shows that the majority of the CE therapists and manual therapists have a global biomedical attitude (76.9% and 58.3%, respectively) and the majority of the BGA therapists have a global biopsychosocial attitude (56.3%).

Table 1
Baseline data on therapists’ gender/age, work characteristics and scores on treatment approach.

                                       Ephysion        Ephysion         Neck Trial
                                       CE therapists   BGA therapists   manual therapists
                                       (n = 13)        (n = 17)         (n = 12)
Male (n)                               11              14               9
Age in years, mean (SD)                42.6 (10.8)     44.3 (6.8)       44.2 (7.5)
Registered as manual therapist (n)     3               6                12
Work experience in years, mean (SD)    17.1 (8.6)      19.8 (7.1)       20.3 (7.1)
Weekly hours work, mean (SD)           35.2 (9.7)      40.9 (12.0)      36.9 (11.7)
Biomedical, mean (SD)                  27.6 (4.7)      25.6 (5.4)       28.4 (8.7)
Biopsychosocial, mean (SD)             35.1 (4.7)      38.7 (4.5)       36.0 (6.4)

BGA = behavioural graded activity program.

Table 2
The five different global treatment attitudes at baseline and the number (percentage) of therapists with that attitude per treatment arm.

                                   CE therapists   BGA therapists   Manual therapists
                                   (n = 13)        (n = 17)         (n = 12)
Purely biomedical attitude         3 (23.1%)       2 (12.5%)        6 (50%)
More biomedical attitude           7 (53.8%)       3 (18.8%)        1 (8.3%)
Neutral attitude                   0               3 (18.6%)        1 (8.3%)
More biopsychosocial attitude      1 (7.6%)        2 (12.5%)        1 (8.3%)
Purely biopsychosocial attitude    2 (15.4%)       7 (43.8%)        3 (25%)

The global treatment attitude was revealed by calculation of one overall score, which was done by combining the quartile scores of the biomedical and the biopsychosocial approach.

3.2. Research question 2 No differences were found for the influence of primary specialisation (physiotherapy/manual therapy) on the treatment approach. The mean biomedical score of the manual therapists (n = 15) was 27.6 (SD 8.0) compared with 28.6 (SD 4.8) for the physiotherapists (n = 10) (mean difference [MD] 1.0, 95% CI −4.8; 6.8). The scores on the biopsychosocial approach were 35.7 (SD 5.9) and 35.3 (SD 5.1), respectively (MD −0.4, 95% CI −5.1; 4.4).
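As a rough check on the comparison for research question 2, the confidence interval of the mean difference in biomedical scores can be approximated from the reported summary statistics alone. The sketch below assumes a pooled-variance (classic Student) interval, which the text does not spell out, and hard-codes the two-sided 95% critical value of Student's t for 23 degrees of freedom (about 2.069) to stay within the standard library. The function name is hypothetical.

```python
import math


def pooled_mean_difference_ci(m1, sd1, n1, m2, sd2, n2, t_crit):
    """Mean difference (group 1 minus group 2) with a pooled-variance CI."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    md = m1 - m2
    return md, md - t_crit * se, md + t_crit * se


# ASSUMPTION: critical value for 10 + 15 - 2 = 23 degrees of freedom.
T_CRIT_DF23 = 2.069

# Physiotherapists (n = 10) minus manual therapists (n = 15), biomedical score.
md, ci_low, ci_high = pooled_mean_difference_ci(
    28.6, 4.8, 10, 27.6, 8.0, 15, T_CRIT_DF23
)
```

With the rounded inputs from the text this gives a difference of 1.0 with an interval of roughly -4.85 to 6.85, which agrees with the reported bounds when the lower bound is read as negative.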

3.3. Research question 3 At 3-month follow-up, 27 questionnaires were returned in the Ephysion study. Three therapists (10%) did not return the follow-up questionnaire. They did not differ in demographics, characteristics and treatment approach at baseline compared to the other therapists. The treatment approach scores at follow-up are presented in Table 3. Table 3 shows significantly lower scores at follow-up on the biomedical approach for BGA therapists compared to CE therapists (MD −6.2 points, 95% CI −11.1; −1.3). The scores on the biopsychosocial approach for BGA therapists compared with CE therapists were significantly higher (MD 5.8 points, 95% CI 1.8; 9.9). With regard to the within-person changes from baseline to follow-up, the BGA therapists showed a significant decrease of 4.6 (95% CI 1.8; 7.4) points on the biomedical approach but no changes on the biopsychosocial approach. The CE therapists showed no within-person changes on either approach. Univariately, the BGA training was significantly related to the within-person change on the biomedical approach (B = −3.8, 95% CI −7.4; −0.3). The variables work experience and age were found to be confounders. However, because they were strongly correlated (r = 0.88) they could not be considered as separate variables. We considered work experience in physiotherapy a more important contributor to the development of a treatment approach than age and therefore added this variable to the multivariate model.

Table 3
Mean scores on the biomedical and biopsychosocial approach at 3-month follow-up and change scores from baseline to follow-up.

                              Follow-up scores                    Change scores from baseline to follow-up
                              CE therapists    BGA therapists     CE therapists    BGA therapists
                              (n = 12)         (n = 15)
Biomedical, mean (SD)         26.9 (4.5)       20.7 (7.1)a        0.8 (3.7)        4.6 (4.9)b
Biopsychosocial, mean (SD)    34.5 (4.3)       40.4 (5.6)a        0.8 (3.5)        0.7 (4.8)

a BGA therapists’ scores on both approaches are significantly different from CE therapists’ scores.
b BGA therapists’ biomedical score has significantly decreased from the baseline score in Table 1.

Table 4
Final multivariate models of the influence of the BGA training on the within-person change on the biomedical and biopsychosocial approaches corrected for work experience.

Outcome / Variables                                     Ba       SE      95% CI
Within-person change on the biomedical approach
  Constant                                              0.81
  BGA training                                          −4.37    1.73    −7.95, −0.79
  Work experience (years)                               2.43     1.73    −1.15, 6.01
Within-person change on the biopsychosocial approach
  Constant                                              6.99
  BGA training                                          0.67     1.46    −2.35, 3.69
  Work experience (years)                               3.87     1.46    0.85, 6.89

BGA training (1) vs. no BGA training (0); work experience ≥18 years (1) vs. work experience <18 years (0).
a B = regression coefficient as estimated with multiple linear regression analysis and corrected for work experience; SE = standard error; CI = 95% confidence interval.

Table 4 presents the multivariate models for both approaches corrected for work experience. The first model shows that the decrease on the biomedical approach was 4.4 points larger in therapists who followed the BGA training than in therapists who did not follow the training. Further, the second model shows that work experience is a more important variable than the BGA training in explaining the small changes in the biopsychosocial approach. The explained variance of both models is small: 17% for the biomedical and 20% for the biopsychosocial model.
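The confidence intervals in Table 4 follow the usual pattern for a regression coefficient, B plus or minus a critical t value times its standard error. The sketch below illustrates this for the BGA training coefficient in the biomedical model. Two things are assumptions rather than statements from the paper: the coefficient is entered as a decrease (B = -4.37), and the degrees of freedom are taken as 27 therapists minus 3 estimated parameters, giving a critical value of about 2.064.

```python
def coefficient_ci(b, se, t_crit):
    """Two-sided confidence interval for a regression coefficient."""
    half_width = t_crit * se
    return b - half_width, b + half_width


# ASSUMPTION: 27 follow-up questionnaires - 3 parameters = 24 degrees of freedom.
T_CRIT_DF24 = 2.064

# BGA training, biomedical model: B taken as -4.37 (a decrease), SE 1.73.
bga_low, bga_high = coefficient_ci(-4.37, 1.73, T_CRIT_DF24)

# Work experience, biomedical model: B 2.43, SE 1.73.
work_low, work_high = coefficient_ci(2.43, 1.73, T_CRIT_DF24)
```

With the rounded table values this reproduces the printed bounds to within about 0.01 (roughly -7.94 to -0.80 for BGA training, and roughly -1.14 to 6.00 for work experience), so an interval that excludes zero corresponds to the significant effect of the training described in the text.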

4. Discussion This study shows that, at baseline, there were no significant differences between BGA, CE and manual therapists’ use of biomedical or biopsychosocial approaches. However, there was a trend for BGA therapists to score higher on the biopsychosocial approach, and for CE and manual therapists to score higher on the biomedical approach. No significant differences were found between physiotherapists and manual therapists in the treatment approach at baseline. Our results further indicate that the BGA training might influence the therapists’ treatment approach, as the scores on the biomedical approach decreased. 4.1. Possible limitations Our study had an observational design and our findings are based on a small sample. Therefore, we consider our analysis to be exploratory; one should be careful in generalising the results. No significant differences in treatment approach were found at baseline, but this could be due to a power problem. ANOVA corrects for multiple testing and is therefore less sensitive in small sample sizes.
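To give a feel for the size of the baseline test statistic discussed above, the one-way ANOVA F ratio can be approximated from the group sizes, means and SDs in Table 1. This is a sketch from rounded summary statistics, not a re-analysis of the raw data, and the function name is hypothetical. It computes only F and its degrees of freedom; no p value is derived here.

```python
def anova_f_from_summary(groups):
    """One-way ANOVA F ratio from (n, mean, sd) triples, one per group."""
    groups = list(groups)
    total_n = sum(n for n, _, _ in groups)
    grand_mean = sum(n * mean for n, mean, _ in groups) / total_n
    # Between-group and within-group sums of squares.
    ss_between = sum(n * (mean - grand_mean) ** 2 for n, mean, _ in groups)
    ss_within = sum((n - 1) * sd ** 2 for n, _, sd in groups)
    df_between = len(groups) - 1
    df_within = total_n - len(groups)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Baseline biomedical scores from Table 1: CE, BGA and manual therapists.
biomedical_groups = [(13, 27.6, 4.7), (17, 25.6, 5.4), (12, 28.4, 8.7)]
f_ratio, df_between, df_within = anova_f_from_summary(biomedical_groups)
```

For the biomedical scores this yields an F ratio of about 0.77 on 2 and 39 degrees of freedom, a small value that is in line with the non-significant baseline result reported for this approach.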


The questionnaire used to measure treatment approach focussed on neck complaints in general, and does not discriminate between acute and chronic complaints. However, in our aim to measure a general treatment approach we chose not to make the questionnaire more speciﬁc. Furthermore, the original PABS-PT also makes no distinction between acute and chronic complaints even though it was constructed for chronic low back pain. Although the questionnaire was constructed for chronic low back pain we considered it suitable for chronic neck pain as well, because the treatment approach is considered to be based on the physiotherapists’ beliefs on chronic musculoskeletal problems in general and on their general preference for either the biomedical or biopsychosocial approach. This assumption is supported by a review on chronic pain, in which a heterogeneous group of pain problems was accepted as a whole, because neither the diagnosis, nor the site of pain, nor the medical ﬁndings were found to be major sources of variance in the targets of treatment (Morley et al., 1999). The suitability of the PABS-PT is further supported in our results by showing that the questionnaire can indicate diﬀerences between therapists on both the biomedical and the biopsychosocial approach for neck pain as well. The scores found in this study are similar to those found for back pain (Houben et al., 2005a). However, because the PABS-PT is newly developed no reference data were available, making it diﬃcult to interpret whether the (signiﬁcant) diﬀerences in treatment approach are clinically relevant. To our knowledge this is the ﬁrst study to use the PABS-PT longitudinally among physiotherapists. Recently, an adjusted pain attitudes and belief scale (PABS) was used longitudinally to measure the treatment approach among GPs (Jellema et al., 2005), but the questionnaire has not yet been validated for longitudinal use. 
Nevertheless, both studies indicate that the questionnaire seems suitable and sensitive to change. Finally, socially desirable answers cannot be ruled out, particularly at follow-up in BGA therapists, because the BGA training could have made them aware of desirable answers. However, despite promotion of a more biopsychosocial way of thinking in the training, the scores on this approach did not increase. 4.2. Comparison with other studies The impact of treatment approach on actual behaviour has not been evaluated so far, but our study is the first to show an association between therapists’ treatment approach and the treatment they chose to perform in the trials. This could be a relevant factor when performing that particular treatment and for future research. In earlier studies it was argued that the two-factor structure of the PABS-PT provides more detailed information on a therapist’s treatment approach than a measure with only one outcome dimension (Ostelo et al., 2003; Houben et al., 2005b). Although we agree, we additionally combined the two treatment approaches into one global treatment attitude because we consider this to provide better insight into which treatment approach the therapist actually favours; it might therefore be an important predictor of their behaviour. In the present study we found no influence of the primary specialisation (physiotherapy/manual therapy) on the treatment approach, which is contrary to the findings of Ostelo et al. (2003), but similar to those of Houben et al. (2005b). Ostelo et al. (2003) found a significantly higher biomedical treatment approach for therapists with a biomedical specialty; however, they included both manual therapists and McKenzie therapists in the biomedical specialty. Another explanation for the contrasting findings might be that they used an earlier version of the PABS-PT; although differences between the PABS-PT versions are small, they might have caused the different results. The present study differs from previous studies in that it evaluates whether a 2-day BGA training influences the therapists’ treatment approach. As expected, we found that therapists who followed the BGA training had a larger decrease in their biomedical approach than therapists who did not follow the training. Contrary to our expectations, the biopsychosocial approach was not affected by the training; work experience seemed to be a stronger contributor to the biopsychosocial change. Perhaps therapists with several years of practice were more biomedically educated and needed to decrease their biomedical treatment approach before being able to adopt a more biopsychosocial one. However, because our study is not an RCT, the results should be further evaluated in larger samples. In a recent RCT (Jellema et al., 2005) a similar trend was found in the change of the treatment approaches of GPs. At follow-up, they also found a decrease in the biomedical approach for GPs who were randomised to the treatment aimed at psychosocial factors, and also found minimal changes in the biopsychosocial approach. However, they evaluated a different type of training, and had a follow-up period of 31 months. Finally, the question remains what magnitude of change in treatment approach is needed to show a clinically relevant change in therapists’ behaviour and, even more important, in patient outcome. Earlier studies found only small effects of a short training on the attitude towards cognitive behavioural treatment compared to those not attending training (King et al., 2002; Jellema et al., 2005). Consequently, the training had no discernible impact on patient treatment outcome. These latter studies, however, used (slightly) different measurements and examined different health care providers and complaints compared to the present study. Whether the change in treatment approach, as found in this study, is large enough to change behaviour needs to be investigated.


5. Conclusions and recommendations Despite the limitations, this study shows no significant differences between BGA, CE and manual therapists’ use of biomedical or biopsychosocial approaches at baseline. However, there was a trend for BGA therapists to score higher on the biopsychosocial approach, and for CE and manual therapists to score higher on the biomedical approach. Further, therapists specialised in physiotherapy or manual therapy do not differ in treatment approach at baseline. Finally, BGA training might influence the therapists’ treatment approach, as the scores on the biomedical approach decreased. Based on the possible trend, it might be advisable in future research to have the participating therapists choose which treatment they want to perform. This could prove beneficial for the performance of that treatment; however, evaluation of our findings in larger samples is recommended. Whether a change in treatment approach causes changes in therapists’ actual behaviour should be further explored. If it does, the magnitude of change in treatment approach needed to produce a change in therapists’ behaviour and in patient outcome needs to be determined. Finally, evaluation of the usage of the PABS-PT is recommended, i.e. to determine whether therapists’ actual behaviour corresponds best with the two separate approach scores from the PABS-PT, or whether it is better to calculate one global treatment attitude, based on combining the quartile scores of both treatment approaches.

6. Trial registration An international registration number has been assigned to the two ongoing trials Ephysion (ISRCTN88733332) and Neck Trial (ISRCTN81350628).

Acknowledgements We would like to thank all therapists for their participation in this study.

References
Bishop A, Thomas E, Foster NE. Health care practitioners’ attitudes and beliefs about low back pain: a systematic search and critical review of available measurement tools. Pain 2007. doi:10.1026/j.pain.2007.01.028.
Bogduk N. Neck pain. Australian Fam Physician 1984;13:26–30.
Fordyce WE. Behavioral Methods for Chronic Pain and Illness. St. Louis: C.V. Mosby; 1976.
Foster NE, Pincus T, Underwood MR, Vogel S, Breen A, Harding G. Understanding the process of care for musculoskeletal conditions – why a biomedical approach is inadequate. Rheumatology 2003a;42(3):401–4.
Foster NE, Pincus T, Underwood MR, Vogel S, Breen A, Harding G. Treatment and the process of care in musculoskeletal conditions. A multidisciplinary perspective and integration. Orthop Clin North Am 2003b;34(2):239–44.
Houben RM, Gijsen A, Peterson J, de Jong PJ, Vlaeyen JW. Do health care providers’ attitudes towards back pain predict their treatment recommendations? Differential predictive validity of implicit and explicit attitude measures. Pain 2005a;114:491–8.
Houben RM, Ostelo RW, Vlaeyen JW, Wolters PM, Peters M, Stomp-van den Berg SG. Health care providers’ orientations towards common low back pain predict perceived harmfulness of physical activities and recommendations regarding return to normal activity. Eur J Pain 2005b;9:173–83.
Houben RM, Vlaeyen JW, Peters M, Ostelo RW, Wolters PM, Stomp-van den Berg SG. Health care providers’ attitudes and beliefs towards common low back pain: factor structure and psychometric properties of the HC-PAIRS. Clin J Pain 2004;20:37–44.
Jellema P, van der Windt DA, van der Horst HE, Blankenstein AH, Bouter LM, Stalman WA. Why is a treatment aimed at psychosocial factors not effective in patients with (sub)acute low back pain? Pain 2005;118:350–9.
King M, Davidson O, Taylor F, Haines A, Sharp D, Turner R. Effectiveness of teaching general practitioners skills in brief cognitive behaviour therapy to treat patients with depression: randomised controlled trial. BMJ 2002;324:947–50.
Lindstrom I, Ohlund C, Eek C, Wallin L, Peterson LE, Fordyce WE, et al. The effect of graded activity on patients with subacute low back pain: a randomized prospective clinical study with an operant-conditioning behavioral approach. Phys Ther 1992;72:279–90 [discussion 291–3].
Linton SJ, Vlaeyen J, Ostelo R. The back pain beliefs of health care providers: are we fear-avoidant? J Occup Rehabil 2002;12:223–32.
Morley S, Eccleston C, Williams A. Systematic review and meta-analysis of randomized controlled trials of cognitive behaviour therapy and behaviour therapy for chronic pain in adults, excluding headache. Pain 1999;80:1–13.
Morley S, Williams AC. RCTs of psychological treatments for chronic pain: progress and challenges. Pain 2006;121:171–2.
Ostelo RW, Stomp-van den Berg SG, Vlaeyen JW, Wolters PM, de Vet HC. Health care provider’s attitudes and beliefs towards chronic low back pain: the development of a questionnaire. Man Ther 2003;8:214–22.
Picavet HS, Schouten JS. Musculoskeletal pain in the Netherlands: prevalences, consequences and risk groups, the DMC(3)-study. Pain 2003;102:167–78.
Pool JJM, Ostelo RWJG, Köke AJ, Bouter LM, de Vet HCW. Comparison of the effectiveness of a behavioural graded activity program and manual therapy in patients with sub-acute neck pain: design of a randomized clinical trial. Man Ther 2006;11:297–305.
Rainville J, Carlson N, Polatin P, Gatchel RJ, Indahl A. Exploration of physicians’ recommendations for activities in chronic low back pain. Spine 2000;25:2210–20.
Turk DC, Flor H. Etiological theories and treatments for chronic back pain. II. Psychological models and interventions. Pain 1984;19(3):209–33.
Vonk F, Verhagen AP, Geilen M, Vos CJ, Koes BW. Effectiveness of behavioural graded activity compared with physiotherapy treatment in chronic neck pain: design of a randomised clinical trial [ISRCTN88733332]. BMC Musculoskelet Disord 2004;5:34.
Von Korff M, Barlow W, Cherkin D, Deyo RA. Effects of practice style in managing back pain. Ann Intern Med 1994;121:187–95.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 138–146 www.elsevier.com/math

Original Article

Hypoaesthesia occurs with sensory hypersensitivity in chronic whiplash – Further evidence of a neuropathic condition
Andy Chien a, Eli Eliav b, Michele Sterling a,c,*
a Division of Physiotherapy, The University of Queensland, QLD 4072, Australia
b University of Medicine and Dentistry New Jersey, Newark, NJ 07101, United States
c Centre of National Research on Disability and Rehabilitation Medicine (CONROD), The University of Queensland, Mayne Medical School, Herston QLD 4006, Australia

Received 9 May 2007; received in revised form 8 November 2007; accepted 21 December 2007

Abstract Hypersensitivity to a variety of stimuli has been shown in whiplash associated disorders and may be indicative of peripheral nerve involvement. This cross-sectional study utilised quantitative sensory testing (QST), including vibration, thermal and electrical detection thresholds, as an indirect measure of primary afferents that mediate innocuous and painful sensation. Pain thresholds and psychological distress (SCL-90-R) were also measured. Thirty-one subjects with chronic whiplash (>3 months, NDI: 49 ± 17) and 31 controls participated. The whiplash group demonstrated elevated vibration, heat and electrical detection thresholds at most hand sites compared to controls (p < 0.05). Electrical detection thresholds in the lower limb were no different from controls (p = 0.83). Mechanical and cold pain thresholds were lower in the whiplash group (p < 0.05) with no group difference in heat pain thresholds (p > 0.1). SCL-90-R scores were higher in the whiplash group but did not impact on any of the sensory measures. A combination of pain threshold and detection measures best predicted membership of the whiplash group. Sensory hypoaesthesia and hypersensitivity co-exist in the chronic whiplash condition. These findings may indicate peripheral afferent nerve fibre involvement but could be a further manifestation of disordered central pain processing.
© 2008 Elsevier Ltd. All rights reserved.
Keywords: Whiplash injury; Sensory hypersensitivity; Hypoaesthesia; Quantitative sensory testing

* Corresponding author. Centre of National Research on Disability and Rehabilitation Medicine (CONROD), The University of Queensland, Mayne Medical School, Herston QLD 4006, Australia. Tel.: +61 7 3365 5344; fax: +61 7 3346 4603. E-mail address: [email protected] (M. Sterling).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2007.12.004

1. Introduction

Whiplash associated disorders (WADs) remain one of the most debated musculoskeletal conditions. Sensory disturbances including hypersensitive responses to mechanical, thermal and electrical stimulation have been consistently shown to be a feature of both the acute and chronic stages of the whiplash condition (Curatolo et al., 2001; Moog et al., 2002; Sterling et al., 2003a). Importantly, some of the sensory changes have been shown to be associated with poor functional recovery (Kasch et al., 2005; Sterling et al., 2005). It is generally acknowledged that the sensory hypersensitivity represents augmented central nervous system pain processing mechanisms (Curatolo et al., 2001; Sterling et al., 2003a). However, some of the changes, particularly cold hyperalgesia and sympathetic nervous system (SNS) dysfunction, may be indicative of peripheral nerve pathology (Sterling et al., 2003a). This proposal has some basis as animal and cadaver models simulating whiplash injury have shown that the nonphysiological kinematic movement during the impact induces stresses in cervical neural tissue such as the nerve roots and spinal ganglia resulting in mechanical

A. Chien et al. / Manual Therapy 14 (2009) 138–146


compromise suﬃcient to cause structural damage (Ortengren et al., 1996; Taylor and Taylor, 1996; Cusick et al., 2001). Furthermore, mechanosensitivity has been demonstrated with clinical tests designed to provoke the brachial plexus as well as mechanical hyperalgesia over upper limb nerve trunks (Ide et al., 2001; Sterling et al., 2002a; Greening et al., 2005). Despite these ﬁndings, standard clinical neurological examination is often normal and deﬁcits in nerve conduction studies are rarely found (Barnsley et al., 1998; Alpar et al., 2002). Although nerve conduction studies are reliable and reproducible when carried out by a single examiner (Chaudhry et al., 1994), they are limited by their ability to assess only large myelinated nerve ﬁbres and the invasive nature of the technique. Quantitative sensory testing (QST) is proving to be a valuable tool to advance the classiﬁcation of speciﬁc disorders and may be useful in illuminating the underlying mechanism of pain disorders (Edwards et al., 2005). Rolke et al. (2006) have demonstrated the validity of using comprehensive QST to obtain a complete somatosensory proﬁle in order to characterize patients with suspected neuropathic conditions but such testing has never been undertaken in a WAD cohort. In a cross-sectional study design, comprehensive QST was used to further investigate the sensory presentation of chronic WAD. Diﬀerent modalities were incorporated to provide an indirect measure of primary aﬀerents that mediate both innocuous and painful sensation. We hypothesised that patients with chronic WAD would demonstrate elevated detection thresholds as well as widespread sensory hypersensitivity.

The study was approved by the institutional medical research ethics committee. All the subjects were unpaid volunteers and all gave written informed consent before inclusion.

2. Materials and methods

2.3.2. Thermal (hot, cold) pain thresholds (TPTs)

TPTs were measured using the Thermotest system (Somedic AB, Farsta, Sweden) over the mid-cervical spine and the distal aspect of C7/8 dermatomes (dorsal aspect of the hand). The temperature was preset to either increase or decrease at a rate of 1 °C/s from a baseline of 30 °C. The subject pressed a switch when the cold or warm sensation first became painful (Hurtig et al., 2001). The mean of three trials at each site was calculated for analysis.

2.1. Subjects

Thirty-one volunteers (25 females, mean (SD) age 35.3 ± 10.7 years) with neck pain (3 months to 3 years duration) as a result of a motor vehicle crash were recruited. Subjects fulfilled the Quebec Task Force Classification criteria of WAD II, neck complaints and musculoskeletal signs but without conduction loss on clinical neurological examination (Spitzer et al., 1995). Subjects were excluded if they experienced concussion, loss of consciousness or head injury as a result of the accident, a previous history of neck or upper quadrant pain that required treatment and/or a diagnosed psychiatric disorder. The whiplash subjects were recruited via primary care practices and from advertisement within radio and print media. Thirty-one healthy volunteers (25 females, mean age 31.4 ± 8.9) also participated in the study. The control group was recruited from the general community provided they had never experienced trauma or injuries to the cervical spine, head, and upper quadrant requiring treatment.

2.2. Brachial plexus provocation test (BPPT)

The BPPT, which has been used in previous studies of whiplash (Sterling et al., 2002b; Sterling et al., 2003a), was performed. The angle of elbow extension was measured at pain threshold using a standard goniometer aligned along the mid-humeral shaft, medial epicondyle and ulnar styloid (Balster and Jull, 1997; Sterling et al., 2002b). Subjects indicated their pain during the test on a 10 cm visual analogue scale (VAS) where 0 indicated no pain and 10 was the worst pain imaginable.

2.3. QST

2.3.1. Pressure pain thresholds (PPTs)

PPTs were measured using a pressure algometer (Somedic AB, Farsta, Sweden) with a probe size of 1 cm² and application rate of 40 kPa/s. Test sites included the articular pillars of C5/6, nerve trunk of the median nerve at the elbow bilaterally (palpated on the medial side of the biceps just before it forms its tendon) and at a bilateral remote site (muscle belly of tibialis anterior). The subjects depressed a button when the sensation under the probe changed from one of pressure alone to one of pressure and pain (Sterling et al., 2002b). Triplicate recordings were taken at each site and the mean values used for analysis.

2.3.3. Vibration detection thresholds (VTs)

A vibrometer (Somedic AB, Sweden) with a tissue displacement range of 0.1–400 µm and a constant frequency of 120 Hz was used. In order to familiarise the subjects with the vibration stimulus, three trials of the test stimuli, or until the subject was able to consistently indicate the onset of the stimulus, were applied over the muscle belly of brachioradialis. Measures were taken over areas of the hand innervated by distal aspect of the C6 (palmar aspect of the 1st metacarpal), C7 (palmar aspect of 2nd metacarpal; dorsum of the


2nd metacarpal) and C8 dermatomes (dorsum of the 5th metacarpal). Subjects indicated when the vibration ﬁrst appeared, the perception threshold (VPT), and when it disappeared, the disappearance threshold (VDT). The vibration threshold (VT) was the average of VPT and VDT. Triplicate recordings were taken at each site and the mean values used for analysis.
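The vibration threshold calculation described above can be sketched in a few lines. This is a minimal illustration only, not the authors' software: the function name and the example readings are hypothetical, and it assumes each of the three trials yields one perception reading (VPT) and one disappearance reading (VDT).

```python
from statistics import mean


def vibration_threshold(trials):
    """Return the site VT from a list of (VPT, VDT) pairs, one pair per trial.

    Each trial's VT is the average of its perception and disappearance
    readings; the site value is the mean of the per-trial VTs.
    """
    per_trial = [(vpt + vdt) / 2 for vpt, vdt in trials]
    return mean(per_trial)


# Hypothetical displacement readings for three trials at one hand site.
example_trials = [(0.50, 0.30), (0.46, 0.34), (0.42, 0.38)]
site_vt = vibration_threshold(example_trials)
```

Because the per-trial average is taken before the site mean, a single trial with an unusually late disappearance reading shifts the site value by only one third of its deviation.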

2.3.4. Thermal (hot, cold) detection thresholds (TDTs)

TDTs assess the function of afferent small myelinated A-delta fibres (cold sense) and unmyelinated C-fibres (warm sense) (Hallin et al., 1982; Adriaensen et al., 1983; Fowler et al., 1988). Incorporating the method of limits, the Thermotest (Somedic AB, Sweden) was used to measure TDTs over areas of the hand innervated by the C7 (dorsum over the 2nd metacarpal) and C8 (dorsum of the 5th metacarpal) dermatomes. The temperature was preset to either increase or decrease at a rate of 1 °C/s from a baseline of 30 °C. The patient pressed a switch when they first detected the sensation of warmth or cold.
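As an illustration of the method of limits with a linear ramp, the temperature at which the switch is pressed follows directly from the baseline, the ramp rate and the elapsed time. The sketch below is an assumption-laden simplification: the function name and the example press times are hypothetical, and reaction-time lag (which inflates thresholds measured this way) is ignored.

```python
def threshold_temperature(press_time_s, direction,
                          baseline_c=30.0, rate_c_per_s=1.0):
    """Temperature reached when the subject presses the switch.

    direction is "warm" (ramp up) or "cold" (ramp down). The defaults
    mirror the 30 degree baseline and 1 degree-per-second ramp in the text.
    """
    if direction not in ("warm", "cold"):
        raise ValueError("direction must be 'warm' or 'cold'")
    sign = 1.0 if direction == "warm" else -1.0
    return baseline_c + sign * rate_c_per_s * press_time_s


# Hypothetical press times: 4.9 s into a warming ramp, 1.0 s into a cooling ramp.
warm_threshold = threshold_temperature(4.9, "warm")
cold_threshold = threshold_temperature(1.0, "cold")
```

With these defaults, a later press on the warming ramp means a higher warm detection threshold, and a later press on the cooling ramp means detection at a lower temperature.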

2.3.5. Electrocutaneous detection and pain thresholds

A non-noxious method of electrocutaneous stimulation was used in a method of limits procedure using the Neurometer device (Neurotron, Baltimore, USA). Sites tested were those innervated by C5/6 (anterior shoulder, inferior to shoulder joint line), C7 (distal phalanx of index finger), C8 (distal phalanx of 5th digit) and tibialis anterior as a remote site. Three different sinusoidal frequencies (2000 Hz, 250 Hz and 5 Hz) were applied to each site in order to evoke a response from a different subpopulation of sensory fibre (Katims et al., 1986; Katims et al., 1987). The subjects reported when they first perceived the sensation (perception threshold) and again at the intensity at which they could no longer feel the sensation (disappearance threshold). The mean of these two values was calculated and recorded three times for analysis. The same sites used to determine current detection thresholds were used to determine pain threshold but only a frequency of 250 Hz was used. As the stimulus intensity increased, the subject released a button when they first perceived the stimulus as painful. The procedure was repeated three times with the mean score recorded as the electrical pain threshold. Ratios were obtained by dividing the electrocutaneous pain threshold by the electrocutaneous detection threshold. Low intensity electrical stimulation activates large A-beta nerve fibres. Current evoked pain at or close to detection threshold (ratio of less than 2:1) has been suggested to be a substrate of A-beta fibre allodynia (Sang et al., 2003).
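The pain over detection ratio and the 2:1 criterion can be expressed as a short calculation. The sketch below is illustrative only: the function names and the example values (in arbitrary device units) are hypothetical, and it assumes each detection entry is already the per-trial mean of the perception and disappearance readings.

```python
from statistics import mean


def pain_detection_ratio(pain_trials, detection_trials):
    """Mean electrical pain threshold divided by mean detection threshold."""
    detection = mean(detection_trials)
    if detection <= 0:
        raise ValueError("detection threshold must be positive")
    return mean(pain_trials) / detection


def below_two_to_one(ratio, cutoff=2.0):
    """True when pain is evoked at less than `cutoff` times detection level."""
    return ratio < cutoff


# Hypothetical triplicate readings at one site, arbitrary device units.
low_ratio = pain_detection_ratio([150, 160, 170], [100, 100, 100])
high_ratio = pain_detection_ratio([300, 310, 320], [100, 100, 100])
flags = (below_two_to_one(low_ratio), below_two_to_one(high_ratio))
```

A ratio is used rather than a difference so that sites with very different absolute detection thresholds, such as the finger and the shoulder, can be compared on the same scale.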

2.4. Sympathetic vasoconstrictor reflex (SVR)

A laser Doppler (Moor Instruments, Devon, UK) was used to assess SNS function (Schurmann et al., 1999). Electrodes were attached to the thenar eminence of both hands. The test was performed with subjects in a comfortable supine position, arms resting at heart level. After a period of acclimatization and normal breathing, participants were asked to take a sudden deep breath. This provocation manoeuvre (inspiratory gasp) is known to cause a short sympathetic reaction and cutaneous vasoconstriction (Schurmann et al., 1999) and has been used in previous investigation of whiplash (Sterling et al., 2005). The procedure was repeated three times. Two quotients (SRF and QI) which describe vasomotor reflexes following the inspiratory gasp were calculated. The SRF value represents the relative drop of the curve after the manoeuvre, with the QI parameter also being influenced by the duration of perfusion decrease (Schurmann et al., 1999).

2.5. Questionnaires

All participants completed the Neck Disability Index (NDI) (Vernon and Mior, 1991) and the Symptom Check List 90-R (SCL-90-R). The NDI was used to assess the extent of perceived functional disability. The SCL-90-R assessed the psychological well-being of participants.

2.6. Procedure

Once informed written consent was obtained, testing was performed in the following order: SVR, BPPT, PPT (tibialis anterior, median nerve, C5/6), TDTs, TPTs, VTs, electrocutaneous detection (2000 Hz, 250 Hz, 5 Hz) and pain thresholds (250 Hz). The SVR testing was performed in a temperature-controlled laboratory. The temperature was set at 20 °C, lights were dimmed and ambient noise was kept low. The rest of the testing was completed in a standard air-conditioned laboratory. For all the measures, the left side was tested first followed by the right side.

2.7. Statistical analysis

The SPSS 12.0 statistical package for Windows was used for analyses. Two-sample t-tests determined within-subject side-to-side differences for all measures. A multivariate analysis of covariance (MANCOVA) was used to compare differences between the chronic whiplash group and controls. SCL-90-R scores were entered as covariates in the analysis. Receiver Operating Characteristic (ROC) analysis was performed to examine the ability of each variable to discriminate between the groups. Variables with


a greater predictive capacity based on the significance level (p < 0.01) were entered in a logistic regression analysis to determine the best combination to predict group membership. The regression analysis was then subjected to cross-validation analysis (leave one out) to examine its reliability and generalisability. To determine differences in sensory measures between whiplash participants with or without arm pain, the Mann–Whitney U test was used. The presence of arm pain was defined as any pain (spontaneous or evoked) distal to the shoulder reported by the participants. For all analyses significance was set at p < 0.05.
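For readers unfamiliar with ROC analysis, the area under the curve for a single variable equals the probability that a randomly chosen member of one group scores higher than a randomly chosen member of the other, which is the Mann–Whitney U statistic divided by the product of the two group sizes. The sketch below computes it by direct pairwise comparison. It is not the SPSS procedure used in the study, and the function name and data values are hypothetical.

```python
def roc_auc(group_a, group_b):
    """Area under the ROC curve for separating group_a (expected higher)
    from group_b, computed as U / (n_a * n_b) with ties counted as 0.5."""
    wins = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(group_a) * len(group_b))


# Hypothetical heat detection thresholds (degrees C) for four subjects per group.
whiplash = [34.9, 36.1, 33.0, 35.2]
control = [32.3, 33.0, 31.8, 32.9]
auc = roc_auc(whiplash, control)
```

An area of 0.5 indicates no discrimination and 1.0 perfect separation. For measures where the patient group is expected to score lower, such as pressure pain thresholds, the arguments can be swapped so that the area is reported on the same scale.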

Table 1
Mean and standard deviation values for each variable.

Measures      Site         Whiplash Mean   Whiplash SD   Control Mean   Control SD
PPT           Cx*          180.35          64.77         313.86         62.49
PPT           Med*         212.67          99.17         300.97         61.26
PPT           Tib Ant*     394.07          188.55        592.04         170.74
CPT 2000 Hz   Elb*         106.9           26.64         88.82          22.33
CPT 2000 Hz   Ind*         254.44          55.84         180            45.08
CPT 2000 Hz   Lit*         193.53          40.96         145.46         31.88
CPT 2000 Hz   Tib Ant      186.92          78.15         151.52         56.24
CPT 250 Hz    Elb          41.84           34.1          32.61          8.68
CPT 250 Hz    Ind*         84.79           32.23         62.16          25.88
CPT 250 Hz    Lit*         83.65           40.31         60.5           21.89
CPT 250 Hz    Tib Ant      37.26           14.64         41.94          14.45
CPT 5 Hz      Elb          22              9.15          22.16          10.15
CPT 5 Hz      Ind*         46.35           20.49         35.23          16.36
CPT 5 Hz      Lit          42.53           25.79         34.84          14.02
CPT 5 Hz      Tib Ant      27.89           17.38         23.11          10.03
CPT Pain      Elb*         0.33            0.19          0.21           0.1
CPT Pain      Ind*         0.55            0.17          0.34           0.14
CPT Pain      Lit*         0.53            0.16          0.35           0.15
CPT Pain      Tib Ant*     0.37            0.14          0.25           0.12
Vibration     Dor 5th*     0.48            0.4           0.29           0.12
Vibration     Dor 2nd*     0.4             0.27          0.26           0.09
Vibration     Palm 2nd*    0.46            0.31          0.28           0.16
Vibration     Palm 1st*    0.79            0.62          0.41           0.25
Heat Det      Ind*         34.91           2.29          32.35          1.43
Heat Det      Lit*         34.43           2.2           32.32          1.12
Cold Det      Ind          28.99           1.55          29.58          0.85
Cold Det      Lit*         28.62           2.05          29.56          0.82
Heat pain     Cx           44.67           3.12          45.71          2.6
Heat pain     Hand         45.82           3.23          44.72          2.86
Cold pain     Cx*          15.4            8.45          8.03           3.21
Cold pain     Hand*        14.78           7.97          9              3.05

PPT = pressure pain threshold, Cx = cervical spine, Med = median nerve, Tib Ant = tibialis anterior; CPT = current perception threshold, Elb = elbow, Ind = index finger, Lit = little finger; Dor 5th and Dor 2nd = dorsum surface of the 5th and 2nd metacarpal, Palm 1st and Palm 2nd = palmar surface of the 1st and 2nd metacarpal; Heat Det and Cold Det = heat and cold detection thresholds. *p < 0.05 on MANOVA of group difference between WAD and controls.

3. Results

3.1. Demographic details

For the whiplash group, the mean (SD) symptom duration post injury was 16 ± 11 months. Twenty-four patients were involved in ongoing compensation claims; four had settled their claims and three had no compensation involved. The mean (SD) NDI score was 45.9% ± 18.8%, a moderate level of disability (Vernon and Mior, 1991). Forty-five percent of whiplash patients reported arm pain at the time of testing and 66% experienced headache.

3.2. Side to side differences

There were no side to side differences for any variable in both groups (all p > 0.05). The mean of left and right sides were calculated and used for further analysis.

3.3. BPPT

The whiplash group demonstrated less elbow extension at pain threshold (22.3° ± 27.4°) (p = 0.05) and higher VAS scores (2.4 ± 2.3) compared to the control group (elbow extension: 11.0° ± 5.9°; VAS: 0.7 ± 1.1) (p = 0.05).

3.4. QST

3.4.1. Pain thresholds

The whiplash group demonstrated lower PPTs at all test sites compared to controls (p < 0.05) (Table 1). There was no significant difference between the two groups for heat pain thresholds (p > 0.1), while cold pain thresholds were significantly reduced (pain at a higher temperature) at both sites in the whiplash group (p < 0.01) (Table 1).

3.4.2. VT

Fig. 1 shows the average parameters for VT (mean and SD data shown in Table 1). The whiplash group

demonstrated elevated detection thresholds for all sites compared to the control group (p < 0.05).

3.4.3. TDTs

Heat detection thresholds were higher in the whiplash group for all test sites compared to the control group (p < 0.01). Cold detection thresholds were reduced (detection at a lower temperature) in the whiplash group at the 5th metacarpal site (p < 0.05) but no different from the controls at the 2nd metacarpal area (p > 0.1) (Fig. 2).


3.4.4. Electrocutaneous stimulation thresholds

At 2000 Hz, the whiplash group demonstrated elevated electrical detection thresholds at the shoulder, index and little finger sites (p < 0.01). At 250 Hz the whiplash group demonstrated elevated electrical detection thresholds at the index and little finger sites (p < 0.01) and at 5 Hz the same group showed elevated detection thresholds at the index finger site (p < 0.05) (Fig. 3). There was no difference between the groups for electrical detection thresholds measured at tibialis anterior (p = 0.83). At 250 Hz, the whiplash group demonstrated lowered pain thresholds at all sites (p < 0.05). At tibialis anterior a 37% decrease was found, while at all other sites, the whiplash group demonstrated a 20% decrease in pain thresholds. For the electrocutaneous pain over detection threshold ratios, the whiplash group showed differences at all sites when compared to controls (p < 0.01) (Fig. 4). The index and little finger sites were found to have a pain over detection threshold ratio of less than two (Table 1).

Fig. 1. Mean (SE) VTs in the whiplash group and control groups. The stimulus was applied over areas of the hand innervated by C6 (Palm 1st), C7 (Palm 2nd, Dor 2nd) and C8 dermatomes (Dor 5th). *p < 0.05 Significantly different from the control group.

3.5. SVR

The whiplash group demonstrated higher QI (76.00 ± 12.54) (p = 0.05) and lower SRF (0.55 ± 0.17) (p = 0.05) indicating reduced vasoconstriction when compared to the control group (QI: 68.17 ± 7.58; SRF: 0.65 ± 0.08).

3.6. ROC analysis

Areas under the curve for the ROC analysis of all of the sensory tests are summarised in Table 2. Logistic regression showed that C5/6 and median nerve PPT, 2nd metacarpal heat detection threshold and index finger 2000 Hz detection and pain over detection ratio were the strongest variables and predicted group membership at 96.77%. Cross-validation demonstrated that the four variables combined revealed high sensitivity and specificity to predict group membership (90.32%).
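The two classification rates are consistent with 60 and 56 correctly classified participants out of 62, although the paper does not report those counts directly. The sketch below shows that arithmetic together with a generic leave-one-out loop. The one-variable midpoint classifier is a deliberately simple stand-in for the authors' logistic regression, and all names and data are hypothetical.

```python
from statistics import mean


def classification_rate(correct, total):
    """Percentage of correctly classified cases, rounded to two decimals."""
    return round(100 * correct / total, 2)


def fit_midpoint(values, labels):
    """Toy classifier: cut-off halfway between the two group means."""
    low = mean(v for v, y in zip(values, labels) if y == 0)
    high = mean(v for v, y in zip(values, labels) if y == 1)
    return (low + high) / 2


def predict_midpoint(cutoff, value):
    return 1 if value >= cutoff else 0


def leave_one_out_accuracy(values, labels):
    """Refit on all cases but one, test on the held-out case, and repeat."""
    hits = 0
    for i in range(len(values)):
        train_values = values[:i] + values[i + 1:]
        train_labels = labels[:i] + labels[i + 1:]
        cutoff = fit_midpoint(train_values, train_labels)
        hits += predict_midpoint(cutoff, values[i]) == labels[i]
    return hits / len(values)


# Hypothetical single-variable data: 0 = control, 1 = whiplash.
toy_values = [1.0, 1.2, 0.9, 2.0, 2.2, 1.9]
toy_labels = [0, 0, 0, 1, 1, 1]
toy_accuracy = leave_one_out_accuracy(toy_values, toy_labels)
full_sample_rate = classification_rate(60, 62)
cross_validated_rate = classification_rate(56, 62)
```

Because each case is predicted by a model that never saw it, the cross-validated rate is normally the lower and more realistic of the two figures.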

Fig. 2. Thermal (warmth and cold) detection thresholds (mean ± SE) in the whiplash group and control groups. The stimulus was applied over the dorsum aspect of the hand corresponding to the C7 (Ind: dorsum over the 2nd metacarpal) and C8 (Lit: dorsum of the 5th metacarpal) dermatomes. **p < 0.01, *p < 0.05 Significantly different from the control group, respectively.
[Figure: two panels, Heat Detection Threshold and Cold Detection Threshold; y-axis Temperature (°C), x-axis Site (Ind, Lit); whiplash and control groups.]

Fig. 3. Electrocutaneous detection thresholds (mean ± SE). The figure illustrates the sites and frequencies demonstrating significant difference between the whiplash and control groups. **p < 0.01, *p < 0.05 Significantly different from the control group, respectively. (Sh2000, anterior shoulder 2000 Hz (C5/6); Ind2000, index finger 2000 Hz (C7); Lit2000, little finger 2000 Hz (C8); Ind250, index finger 250 Hz (C7); Lit250, little finger 250 Hz (C8); Ind5, index finger 5 Hz (C8)).
[Figure: Electrocutaneous Threshold; y-axis mA, x-axis Site and Frequency (Hz); whiplash and control groups.]


Fig. 4. Ratio of electrocutaneous detection threshold/electrocutaneous pain thresholds for the whiplash and control groups (mean ± SE). The whiplash group showed statistically significant difference for all sites when compared to controls (**p < 0.01).
[Figure: Electrocutaneous Pain over Detection Ratio; y-axis Ratio, x-axis Site (Shoulder, Index, Little, TibAnt); whiplash and control groups.]

3.7. Psychological distress (SCL-90-R)

The whiplash subjects showed elevated distress, in particular the subscales of somatization (72 ± 4 vs 41 ± 5), depression (73 ± 6 vs 43 ± 5) and general severity index (66 ± 5 vs 46 ± 5) compared to controls (p = 0.01) (Table 3). Comparing the means of the general severity index, 21 out of the 31 whiplash participants (68%) demonstrated elevated scores above the population norms (Derogatis, 1977). However, when SCL-90-R scores were entered into the analysis as a covariate, group differences remained significant for all measures (p < 0.05) and the effect size on the sensory measures was small (s2 ranged from 0.031 to 0.157).

3.8. Arm pain vs no arm pain

There was no difference between patients with reported arm pain (n = 13) and those without for age, gender and all QST measures (p > 0.05). There was no difference between the groups for NDI scores (arm pain 42.8 ± 15.0; no arm pain 44.8 ± 20.9) (p = 1.27).

4. Discussion

The results of this study confirm the presence of generalised sensory hypersensitivity in chronic whiplash. Consistent findings of widespread decreased pain thresholds to a variety of sensory stimuli (pressure, thermal, electrical) likely reflect augmented central pain processes as a contributing factor to whiplash pain (Curatolo et al., 2001; Moog et al., 2002). For the first time, the results of this study demonstrate the additional presence of elevated detection thresholds or hypoaesthesia. Hypoaesthesia was found for vibration, electrical and thermal stimulation. VTs were elevated by an average of 40% and present across areas of the hands innervated by the lower cervical nerve roots. This is consistent

Table 2
ROC curves, area under the curve and its significance to discriminate between the whiplash and control groups, for all variables.

Variable                                      Area    Standard error   Significance
C5/6 PPT                                      0.92    0.05             0.00*
Index finger, 2000 Hz detection threshold     0.9     0.04             0.00*
2nd metacarpal, heat detection threshold      0.89    0.05             0.00*
5th metacarpal, heat detection threshold      0.86    0.05             0.00*
Little finger, 2000 Hz detection threshold    0.86    0.05             0.00*
Index finger, pain over detection ratio       0.84    0.06             0.00*
Tib Ant PPT                                   0.8     0.06             0.00*
Little finger, pain over detection ratio      0.79    0.06             0.00*
Median nerve PPT                              0.78    0.07             0.00*
Tib Ant, pain over detection ratio            0.76    0.07             0.00*
QI value                                      0.73    0.07             0.01
Little finger, 250 Hz detection threshold     0.72    0.07             0.01
SVR value                                     0.71    0.08             0.01
Cx cold pain threshold                        0.71    0.08             0.01
Hand cold pain threshold                      0.71    0.08             0.01
Index finger, 250 Hz detection threshold      0.71    0.07             0.01
Shoulder, pain over detection ratio           0.7     0.07             0.01
Index finger, 5 Hz detection threshold        0.69    0.07             0.02
5th metacarpal, cold detection threshold      0.69    0.08             0.02
Shoulder, 2000 Hz detection threshold         0.68    0.07             0.03
Dorsum 5th metacarpal, VDT                    0.68    0.08             0.03
Palmar 1st metacarpal, VDT                    0.68    0.08             0.02
Palmar 2nd metacarpal, VDT                    0.67    0.08             0.03
Tib Ant, 2000 Hz detection threshold          0.65    0.08             0.06
Dorsum 2nd metacarpal, VDT                    0.63    0.08             0.10
Hand, heat pain threshold                     0.61    0.08             0.18
Tib Ant, 250 Hz detection threshold           0.61    0.08             0.18
Little finger, 5 Hz detection threshold       0.61    0.08             0.17
2nd metacarpal, cold detection threshold      0.6     0.08             0.22
Tib Ant, 5 Hz detection threshold             0.57    0.09             0.41
Shoulder, 250 Hz detection threshold          0.56    0.08             0.50
Cx heat pain threshold                        0.56    0.08             0.45
Shoulder, 5 Hz detection threshold            0.53    0.08             0.74

*p < 0.05.

with dysfunction of large myelinated or A-beta sensory ﬁbres (Lang et al., 1995; Greening and Lynn, 1998). Altered vibration detection sense is thought to be an early indicator of neural pathology (Greening et al., 2003). Electrical stimulation at detection threshold levels


Table 3
SCL-90-R psychological subscales (mean ± SD) for whiplash and control groups.

SCL-90-R subscale           Whiplash      Control       p Value
Somatization                72 (68–76)    41 (38–48)    <0.01
Obsessive compulsive        57 (53–61)    42 (37–44)    0.04
Interpersonal sensitivity   51 (48–54)    42 (40–48)    0.05
Depression                  73 (67–79)    43 (33–53)    <0.01
Anxiety                     54 (51–57)    41 (38–48)    0.04
Hostility                   51 (48–54)    41 (30–46)    0.03
Phobic anxiety              50 (45–55)    45 (43–52)    0.06
Paranoid ideation           48 (46–50)    42 (39–47)    0.03
Psychoticism                52 (48–59)    42 (40–48)    0.02
General severity index      66 (61–71)    46 (38–48)    <0.01

bypasses receptors and directly stimulates A-beta ﬁbre aﬀerents (Eliav et al., 2003). Thus the elevation of electrical detection thresholds, across the innervation zones of the three upper limb peripheral nerve trunks is also consistent with A-beta ﬁbre dysfunction. Our ﬁndings of elevated electrical detection thresholds with three stimulus frequencies may indicate the presence of both large and small sensory ﬁbre dysfunction. Whilst it has never been reliably shown in humans, it has been suggested that electrical detection threshold testing may be able to discriminate large and small ﬁbre function based on the frequency of current utilised. Large myelinated ﬁbres (A-beta ﬁbres), small myelinated ﬁbres (A-delta ﬁbres) and unmyelinated ﬁbres (C-ﬁbres) may be selectively activated by 2000 Hz, 250 Hz and 5 Hz frequencies, respectively (Rendell et al., 1989). Whilst the prevalence of elevated electrical detection thresholds was higher with a frequency of 2000 Hz, similar changes were also found at frequencies of 250 Hz and 5 Hz indicating potential involvement across nerve ﬁbre types. Results from TDT testing support this, where elevated warmth detection thresholds and to a lesser extent cold detection threshold were demonstrated suggesting disturbances in both C and A-delta ﬁbre function, respectively. The whiplash injured participants also demonstrated reduced SNS vasoconstriction indicating further neural dysfunction in terms of sympathetic ﬁbre impairment and is consistent with previous investigations of whiplash (Sterling et al., 2003a). The sensory changes occurred bilaterally and there was no diﬀerence between participants with or without arm pain. This could be perceived as unusual where some arm pain or other symptoms would be expected in the presence of a neuropathy. 
However, animal studies have shown that peripheral neural pathology in one area can cause widespread eﬀects, including eﬀects in apparently uninvolved limbs (Koltzenburg et al., 1999; Kleinschnitz et al., 2005). Whilst most studies have demonstrated these eﬀects to be positive symptoms such as allodynia and hyperalgesia (Hubbard and Winkelstein,

2005; Kleinschnitz et al., 2005), negative symptoms (sensation loss) have also been described (Oaklander and Brown, 2004). Greening et al. (2003) have shown that asymptomatic oﬃce workers manifest similar sensory dysfunction to patients with non-speciﬁc arm pain indicating a possible subclinical presentation, and a similar mechanism may be present in our whiplash cohort without arm symptoms. The hypoaesthetic changes were widespread across dermatomes and as such may be another manifestation of disordered central pain processing rather than an indication of peripheral nerve dysfunction (Tucker et al., 2007). One factor negating this is that electrical detection thresholds at the remote site (tibialis anterior) were no diﬀerent from control data whereas electrical and PPTs were lower at this site in the whiplash group. This suggests diﬀerent underlying mechanisms for the hypoesthesia and hyperalgesia seen in our whiplash cohort. Interestingly ﬁbromyalgia (a condition thought to reﬂect central nervous system hyperexcitability) seems to manifest pain hypersensitivity but normal perception thresholds (Arroyo and Cohen, 1993; Gracely et al., 2003) indicating diﬀerent underlying mechanisms between these two conditions. It is possible that the whiplash condition includes both central and peripheral mechanisms. Further investigation utilising relatively objective tests such as nerve conduction studies, electromyography and evoked potentials may provide further information of speciﬁc processes underlying whiplash. Nonetheless some of our ﬁndings provide further new evidence for the presence of central pain processing changes in WAD. Reduced electrocutaneous pain/ detection threshold ratios occurred at all sites with a ratio of less than two in ﬁnger sites. Although yet to be validated, Sang et al. (2003) proposed that A-beta ﬁbre mediated allodynia can be identiﬁed when the electrical current evokes pain at or close to detection threshold. 
From investigation of healthy control subjects, these authors propose that a ratio of less than two is abnormal and indicates altered central nervous system processing of A-beta input. Our results suggest that such processes are likely involved in whiplash pain and reinforce the consistent findings of central hyperexcitability in this group. A combination of pain sensitivity (mechanical hyperalgesia in the neck and upper limbs; electrical pain/detection ratio) and detection thresholds (heat, electrical detection in the C6 innervated area) best predicted membership to either the whiplash or control group with a high classification rate (90.32% after cross-validation). This suggests that a combination of mechanisms contributes to persistent whiplash pain. The high classification rate of these measures suggests that the use of QST in the assessment of chronic WAD may be necessary. However, due to the costly nature of the equipment and the lengthy time required for testing, further evaluation


of these measures is required before they could be efficiently used in the clinical situation. The QST conducted in this study was not carried out by a blind assessor. This is a shortcoming of the study and the results should be viewed cautiously until their replication is established. Our whiplash patient group exhibited psychological distress consistent with previous findings (Kessels et al., 1998; Curatolo et al., 2001; Moog et al., 2002; Sterling et al., 2003b) and it is well documented that psychological distress may influence pain threshold measures (Rhudy and Meagher, 2000). For this reason we included SCL-90 scores as a covariate in the group analyses of sensory data. Group differences for all sensory measures remained unchanged and the effect sizes were very small. Whilst it is acknowledged that other psychological constructs were not measured in this study, our previous data would support the current findings that the sensory disturbances of whiplash cannot be fully explained by psychological factors alone and likely reflect physiological changes or a complex interplay between these substrates (Sterling et al., 2003a; Sterling and Kenardy, 2006).

In summary, the findings of this study confirm the existence of sensory hypersensitivity in chronic WAD. Moreover, patients with chronic whiplash also demonstrated the presence of hypoaesthesia, particularly in the lower cervical dermatomes. These findings may indicate the additional presence of peripheral afferent nerve fibre involvement in the whiplash condition but could be a further manifestation of disordered central pain processing. A combination of pain threshold and detection measures discriminated whiplash and control subjects. These findings suggest that assessment of whiplash injured patients may need to include more detailed sensory testing using QST and this could have implications for the management of this condition.

References

Adriaensen H, Gybels J, Handwerker HO, Van Hees J. Response properties of thin myelinated (A-delta) fibers in human skin nerves. Journal of Neurophysiology 1983;49:111–22.
Alpar EK, Onuoha G, Killampalli VV, Waters R. Management of chronic pain in whiplash injury. Journal of Bone and Joint Surgery, British Volume 2002;84B:807–11.
Arroyo JF, Cohen ML. Abnormal responses to electrocutaneous stimulation in fibromyalgia. Journal of Rheumatology 1993;20:1925–31.
Balster SM, Jull GA. Upper trapezius muscle activity during the brachial plexus tension test in asymptomatic subjects. Manual Therapy 1997;2:144–9.
Barnsley L, Lord S, Bogduk N. The pathophysiology of whiplash. Spine: State of the Art Reviews 1998;12:209–42.
Chaudhry V, Corse AM, Freimer ML, Glass JD, Mellits ED, Kuncl RW, et al. Inter- and intraexaminer reliability of nerve conduction measurements in patients with diabetic neuropathy. Neurology 1994;44:1459–62.


Curatolo M, Petersen-Felix S, Arendt-Nielsen L, Giani C, Zbinden AM, Radanov BP. Central hypersensitivity in chronic pain after whiplash injury. Clinical Journal of Pain 2001;17: 306e15. Cusick JF, Pintar FA, Yoganandan N. Whiplash syndrome: kinematic factors inﬂuencing pain patterns. Spine 2001;26:1252e8. Derogatis LR. SCL-90-R administration, scoring and practice manual. Clinical Psychiatric Research; 1977. Edwards RR, Sarlani E, Wesselmann U, Fillingim RB. Quantitative assessment of experimental pain perception: multiple domains of clinical relevance. Pain 2005;114:315e9. Eliav E, Teich S, Nitzan D, El Raziq DA, Nahlieli O, Tal M, et al. Facial arthralgia and myalgia: can they be diﬀerentiated by trigeminal sensory assessment? Pain 2003;104:481e90. Fowler CJ, Sitzoglou K, Ali Z, Halonen P. The conduction velocities of peripheral nerve ﬁbres conveying sensations of warming and cooling. Journal of Neurology, Neurosurgery and Psychiatry 1988;51:1164e70. Gracely RH, Grant MA, Giesecke T. Evoked pain measures in ﬁbromyalgia. Best Practice and Research. Clinical Rheumatology 2003;17:593e609. Greening J, Dilley A, Lynn B. In vivo study of nerve movement and mechanosensitivity of the median nerve in whiplash and nonspeciﬁc arm pain patients. Pain 2005;115:248e53. Greening J, Lynn B. Vibration sense in the upper limb in patients with repetitive strain injury and a group of at-risk oﬃce workers. International Archives of Occupational and Environmental Health 1998;71:29e34. Greening J, Lynn B, Leary R. Sensory and autonomic function in the hands of patients with non-speciﬁc arm pain (NSAP) and asymptomatic oﬃce workers. Pain 2003;104:275e81. Hallin RG, Torebjork HE, Wiesenfeld Z. Nociceptors and warm receptors innervated by C ﬁbres in human skin. Journal of Neurology, Neurosurgery and Psychiatry 1982;45:313e9. Hubbard RD, Winkelstein BA. 
Transient cervical nerve root compression in the rat induces bilateral forepaw allodynia and spinal glial activation: mechanical factors in painful neck injuries. Spine 2005;30:1924e32. Hurtig I, Raak R, Kendall S, Gerdle B, Wahren L. Quantitative sensory testing in ﬁbromylagia patients and in healthy subjects: identiﬁcation of subgroups. Clinical Journal of Pain 2001;17: 316e22. Ide M, Ide J, Yamaga M, Takagi K. Symptoms and signs of irritation of the brachial plexus in whiplash injuries. Journal of Bone and Joint Surgery. British Volume 2001;83:226e9. Kasch H, Qerama E, Bach F, Jensen T. Reduced cold pressor pain tolerance in non-recovered whiplash patients: a 1 year prospective study. European Journal of Pain 2005;9:561e9. Katims JJ, Long DM, Ng LK. Transcutaneous nerve stimulation. Frequency and waveform speciﬁcity in humans. Applied Neurophysiology 1986;49:86e91. Katims JJ, Naviasky EH, Rendell MS, Ng LK, Bleecker ML. Constant current sine wave transcutaneous nerve stimulation for the evaluation of peripheral neuropathy. Archives of Physical Medicine and Rehabilitation 1987;68:210e3. Kessels RP, Keyser A, Verhagen WI, van Luijtelaar EL. The whiplash syndrome: a psychophysiological and neuropsychological study towards attention. Acta Neurologica Scandinavica 1998;97:188e93. Kleinschnitz C, Brinkhoﬀ J, Sommer C, Stoll G. Contralateral cytokine gene induction after peripheral nerve lesions: dependence on the mode of injury and NMDA receptor signaling. Brain Research. Molecular Brain Research 2005;136:23e8. Koltzenburg M, Wall PD, McMahon SB. Does the right side know what the left is doing? Trends in Neurosciences 1999;22:122e7. Lang E, Claus D, Neundorfer B, Handwerker HO. Parameters of thick and thin nerveeﬁber functions as predictors of pain in carpal tunnel syndrome. Pain 1995;60:295e302.


A. Chien et al. / Manual Therapy 14 (2009) 138e146

Moog M, Quintner J, Hall T, Zusman M. The late whiplash syndrome: a psychophysical study. European Journal of Pain London 2002;6: 283e94. Oaklander AL, Brown JM. Unilateral nerve injury produces bilateral loss of distal innervation. Annals of Neurology 2004;55:639e44. Ortengren T, Hansson HA, Lovsund P, Svensson MY, Suneson A, Saljo A. Membrane leakage in spinal ganglion nerve cells induced by experimental whiplash extension motion: a study in pigs. Journal of Neurotrauma 1996;13:171e80. Rendell MS, Katims JJ, Richter R, Rowland F. A comparison of nerve conduction velocities and current perception thresholds as correlates of clinical severity of diabetic sensory neuropathy. Journal of Neurology, Neurosurgery and Psychiatry 1989;52:502e11. Rhudy JL, Meagher MW. Fear and anxiety: divergent eﬀects on human pain thresholds. Pain 2000;84:65e75. Rolke R, Magerl W, Campbell KA, Schalber C, Caspari S, Birklein F, et al. Quantitative sensory testing: a comprehensive protocol for clinical trials. European Journal of Pain 2006;10:77e88. Sang CN, Max MB, Gracely RH. Stability and reliability of detection thresholds for human A-beta and A-Delta sensory aﬀerents determined by cutaneous electrical stimulation. Journal of Pain and Symptom Management 2003;25:64e73. Schurmann M, Gradl G, Andress HJ, Furst H, Schildberg FW. Assessment of peripheral sympathetic nervous function for diagnosing early post-traumatic complex regional pain syndrome type I. Pain 1999;80:149e59. Spitzer WO, Skovron ML, Salmi LR, Cassidy JD, Duranceau J, Suissa S, et al. Scientiﬁc monograph of the Quebec Task-Force on whiplash-associated disorders e redeﬁning whiplash and its management. Spine 1995;20:S1e73.

Sterling M, Kenardy J. The relationship between sensory and sympathetic nervous system changes and posttraumatic stress reaction following whiplash injury e a prospective study. Journal of Psychosomatic Research 2006;60:387e93. Sterling M, Treleaven J, Edwards SL, Jull G. Pressure pain thresholds in chronic whiplash associated disorder: further evidence of altered central pain processing. Journal of Musculoskeletal Pain 2002a;10: 69e81. Sterling M, Treleaven J, Jull G. Responses to a clinical test of mechanical provocation of nerve tissue in whiplash associated disorder. Manual Therapy 2002b;7:89e94. Sterling M, Jull G, Vicenzino B, Kenardy J. Sensory hypersensitivity occurs soon after whiplash injury and is associated with poor recovery. Pain 2003a;104:509e17. Sterling M, Kenardy J, Jull G, Vicenzino B. The development of psychological changes following whiplash injury. Pain 2003b;106: 481e9. Sterling M, Jull G, Vicenzino B, Kenardy J, Darnell R. Physical and psychological factors predict outcome following whiplash injury. Pain 2005;114:141e8. Taylor J, Taylor M. Cervical spinal injuries: an autopsy study of 109 blunt injuries. Journal of Musculoskeletal Pain 1996;29: 335e9. Tucker A, White P, Kosek E, Pearson R, Henderson M, Coldrick A, Cooke E, Kidd B. Comparison of vibration perception thresholds in individuals with diﬀuse upper limb pain and carpal tunnel syndrome. Pain 2007;127:263e9. Vernon H, Mior S. The neck disability index e a study of reliability and validity. Journal of Manipulative and Physiological Therapeutics 1991;14:409e15.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 147–151 www.elsevier.com/math

Original Article

Iliotibial band tightness and patellofemoral pain syndrome: A case-control study

Zoe Hudson a,*, Emma Darthuy b,1

a Centre for Sports and Exercise Medicine, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, Mile End Hospital, Bancroft Road, London E1 4DG, UK
b London Bridge Hospital, UK

Received 20 April 2007; received in revised form 13 November 2007; accepted 3 December 2007

Abstract

Tight lateral structures have been implicated in subjects presenting with patellofemoral pain syndrome (PFPS). It has been proposed that a tight iliotibial band (ITB) through its attachment of the lateral retinaculum into the patella could cause lateral patella tracking, patella tilt and compression. Twelve subjects presenting with PFPS were compared with 12 matched control subjects. Hip adduction was measured using the Ober test in each subject as an indirect measure of ITB length. The mean values for hip adduction in the control group were 21.4 (4.9) and 20.3 (3.8) degrees in the left and right legs, respectively, and in the PFPS group, 17.3 (6.1) and 14.9 (4.2) degrees in the non-painful leg and painful leg, respectively. One way analysis of variance (ANOVA) revealed a highly significant difference between groups (F = 4.485, p = 0.008) and post-hoc analysis showed a significant difference between the painful leg in the PFPS group and the left and right legs in the control group, p = 0.002 and 0.009, respectively. The results from this study show that subjects presenting with PFPS do have a tighter ITB. Future work should investigate this observation prospectively in order to determine whether a tight ITB is the cause or effect of PFPS.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Patellofemoral pain; Case-control study; ITB; Ober test

* Corresponding author. Tel.: +44 208 223 8839; fax: +44 208 223 8930.
E-mail addresses: [email protected] (Z. Hudson), [email protected] (E. Darthuy).
1 The Wells Physiotherapy and Sports Clinic, 4 Clanricarde Gardens, Tunbridge Wells, Kent TN1 2PE, UK. Tel.: +44 1892 525065; fax: +44 1892 618413.
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2007.12.009

1. Introduction

Patellofemoral pain syndrome (PFPS) is generally recognised as being multifactorial in origin. Many factors associated with this condition have been described, including abnormal lower limb biomechanics and altered motor control and recruitment (Fredericson et al., 2000; Earl et al., 2005). It is hypothesised that these changes result in abnormal patellar tracking or malalignment, with subsequent symptoms. Treatment paradigms include mobilising tight lateral structures, motor control and re-education of local and proximal muscles, patella taping and biomechanical correction with the use of orthotics (Bizzini et al., 2003; Aminaka and Gribble, 2005).

The iliotibial band (ITB) has both a dynamic and passive role at the patellofemoral joint. Proximally the ITB attaches to the tensor fascia lata whilst distally, fibres from the ITB interdigitate with vastus lateralis (Terry et al., 1986). Most of the lateral retinaculum (superficial oblique and deep transverse portion) arises from the ITB (Standring, 2005), therefore the ITB indirectly provides lateral stabilisation and acts as a passive restraint to medial patella glide. A tight ITB could theoretically lead to lateral patella tracking, lateral patella tilt and lateral patella compression.

Clinical assessment of ITB length remains under debate. However, clinicians have traditionally used the Ober test to evaluate hip adduction as an indirect measure of ITB length (Puniello, 1993; Magee, 1997; Herrington et al., 2006; Wang et al., 2006). This clinical test has been shown to have excellent intra-tester (ICC = 0.94) and inter-tester (ICC = 0.91) reliability (Melchione and Sullivan, 1993; Reese and Bandy, 2003). Previous studies have attempted to quantify ITB length as measured by the Ober test in normal subjects (Gajdosik et al., 2003; Reese and Bandy, 2003) and those with PFPS (Melchione and Sullivan, 1993).

It has been suggested by several authors that tightness of the ITB could be a contributory factor for PFPS (McConnell, 1986; Gerrard, 1989; Puniello, 1993; Bizzini et al., 2003). Whilst ITB tightness appears to be associated with PFPS, to the authors' knowledge, this clinical observation remains to be tested in a case-control study. The aim of this study was to investigate whether subjects with PFPS had a tighter ITB measured using the Ober test, compared to a group of subjects with no pain.

Table 1. Inclusion and exclusion criteria for both study groups.

PFPS group

  Inclusion criteria
    1. Insidious onset of pain around patella that worsens with two or more of the following:
       - ascending or descending stairs
       - squatting activities
       - running
       - sitting still for 20 min or more
    2. Objectively demonstrate two or more of the following:
       - full knee ROM restricted by pain
       - VMO weakness
       - Glut Med insufficiency performing a one leg squat
    3. Either gender aged 18–40
    4. Completed consent form

  Exclusion criteria
    1. Current back, hip or ankle pain
    2. Previous surgery or traumatic injury to back, hip, knee or ankle
    3. Any pain in back, hip or ankle in the last year requiring attention from a healthcare professional

Control group

  Inclusion criteria
    1. Age and gender matched to PFPS group
    2. Completed consent form

  Exclusion criteria
    As above

2. Methods

2.1. Subjects

Normative data from Reese and Bandy (2003) were used to determine the number of subjects required, assuming a difference of 6° to be clinically relevant. A sample size calculation revealed a minimum of 10 subjects would be required in each group to reveal a significant difference in hip adduction with the adoption of alpha of 0.05 and a power of 0.8. A target of 12 subjects for each group was identified to accommodate a 20% drop out rate.

A sample of convenience was used and 12 consecutive subjects referred to an outpatient physiotherapy department, who satisfied the inclusion criteria, were recruited for the study (PFPS group) (Table 1). Gluteus medius insufficiency during a one leg squat was subjectively assessed looking for internal rotation and adduction of the femur between 30° and 60° of knee flexion, whilst VMO weakness was assessed using an isometric manually resisted contraction. Twelve healthy volunteers (control group) with no history of knee pain were recruited from hospital staff. The control group was age, sex, height, weight and activity matched to those subjects in the PFPS group. Leg dominance was determined by which leg they would kick a ball with. All subjects were screened by one physiotherapist with 5 years musculoskeletal experience.

The study was granted ethical approval from Guy's Hospital and London Bridge Hospital Research Ethics Committee and informed written consent was obtained from all participants prior to entering the study.

2.2. Test procedure

The Ober test was conducted on both legs of each subject by one of the authors (ED) according to a standard operating procedure (Reese and Bandy, 2003) (Fig. 1). In barefoot standing a spirit level was placed horizontally on the level of the PSIS and secured with tape (Fig. 2). In side lying, the lower leg was flexed to 45° to maintain a neutral lumbar lordosis. The spirit level was checked to ensure no lateral or AP tilting of the pelvis occurred throughout the measurement procedure. Additionally, the tester stabilised the pelvis with the hand as necessary. The knee was flexed to 90° and the upper leg was passively brought into abduction and extension. A bubble inclinometer (Baseline, Fabrication Enterprises Inc., New York) was placed on the lateral thigh, just proximal to the lateral femoral condyle. The tester lowered the leg into adduction, attempting to control for any visually observed unwanted hip rotation. The end point (angle of adduction) was deemed to be reached when no further adduction occurred, and the reading was taken from the inclinometer. If the limb was horizontal, it was considered to be at 0°; if below horizontal (adducted), the angle was recorded as a positive number; and if above horizontal (abducted), the angle was recorded as a negative number. The leg tested first was randomised by the toss of a coin. The tester was not blinded to the group


allocation of the subject, but was blinded to the readings from the inclinometer at the time of testing. An independent observer recorded all these results. Each leg was measured on one occasion only.

2.3. Data analysis

The Shapiro–Wilk test was applied to all data sets to test for normality of distribution. A one way analysis of variance (ANOVA) was applied to investigate group differences and a post-hoc Least Significant Difference test was employed to evaluate any relevant interactions. An alpha level of 0.05 was set for all statistical tests. All statistical analyses were conducted using Statistical Package for Social Scientists (SPSS 13.0, Chicago, Illinois).

Fig. 1. Standard position for measurement of the Ober test (adapted from Reese and Bandy, 2003).

Fig. 2. Three-way spirit level taped to the PSIS in standing.

Table 2. Demographic and anthropometric data of the control and PFPS groups (mean (SD)).

Group          Number  Gender          Age (years)  Height (cm)   Weight (kg)
Control group  12      8 men, 4 women  30.6 (5.9)   173.8 (8.9)   71.0 (10.4)
PFPS group     12      8 men, 4 women  32.9 (5.1)   171.5 (11.5)  75.3 (12.3)

3. Results

Group demographics and anthropometric measures are shown in Table 2. Independent t-tests showed no significant difference (p > 0.05) in age, height or weight between the two groups. Participation in recreational physical activity was comparable between the groups, as assessed by a custom made questionnaire. One subject in each group participated in no physical activity per week; four subjects in the PFPS group and three in the control group ran at least once a week. Eleven subjects in each group participated in other recreational sports at least once a week. These included rugby, football, gym, kick-boxing, badminton, circuit training, cycling, spinning, boxercise and tennis in the PFPS group and dancing, squash, golf, gym, rugby, football, netball, hockey and cycling in the control group.

In the PFPS group 50% of subjects reported previous injuries for which they had received treatment but not within the year prior to data collection for this study. Previous injuries included ankle sprain (1), achilles pain (1), lower back pain (2), patella tendinopathy on the opposite knee (1) and groin injury (1). No injuries requiring treatment from a healthcare professional were reported in the control group. The majority of subjects had experienced symptoms for at least 6 months (minimum 2 months, maximum 20 years).

All data sets obtained for hip adduction angle were normally distributed according to the Shapiro–Wilk test, therefore parametric analysis was conducted. The mean values for hip adduction (dependent variable) in the control group were 21.4 (4.9) and 20.3 (3.8) degrees in the left and right legs, respectively, and in the PFPS group, 17.3 (6.1) and 14.9 (4.2) degrees in the non-painful leg and painful leg, respectively (Fig. 3). One way ANOVA revealed a highly significant difference between groups (F = 4.485, p = 0.008).
Least Significant Difference post-hoc analysis revealed no significant difference between the left and right legs in the control group, or between the painful and non-painful legs in the PFPS group. There was a significant difference between the painful leg in the PFPS group and the left and right legs in the control group, p = 0.002 and 0.009, respectively. The non-painful leg in the PFPS group differed significantly from the left leg in the control group (p = 0.04) but not from the right leg in the control group (p > 0.1).

Fig. 3. Mean values for hip adduction as measured by the Ober test for both groups (control group-left, control group-right, PFPS group-no pain, PFPS group-pain). Vertical axis: range of hip adduction (degrees). Error bars represent 95% confidence intervals.
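As a rough check on the statistics reported above, a one-way ANOVA F ratio and the Least Significant Difference t statistics can be recomputed from the published means and standard deviations alone. The sketch below is not the authors' SPSS analysis: it assumes four leg groups of 12 legs each treated as independent samples, and the function and variable names are illustrative. Under those assumptions the recomputed F is about 4.47, close to the reported 4.485; the small gap is plausibly due to rounding of the published summary values.

```python
from math import sqrt

# Published summary statistics: (mean, SD) of hip adduction in degrees.
# Assumption: 12 legs per group, groups treated as independent samples.
GROUPS = {
    "control_left": (21.4, 4.9),
    "control_right": (20.3, 3.8),
    "pfps_non_painful": (17.3, 6.1),
    "pfps_painful": (14.9, 4.2),
}
N_PER_GROUP = 12


def anova_from_summary(groups, n):
    """Return (F, df_between, df_within, MS_within) for equal-sized groups."""
    means = [mean for mean, _ in groups.values()]
    sds = [sd for _, sd in groups.values()]
    k = len(means)
    grand_mean = sum(means) / k
    # Between-group mean square: spread of group means around the grand mean.
    ms_between = n * sum((m - grand_mean) ** 2 for m in means) / (k - 1)
    # Within-group mean square: average variance when group sizes are equal.
    ms_within = sum(sd ** 2 for sd in sds) / k
    return ms_between / ms_within, k - 1, k * (n - 1), ms_within


def lsd_t(group_a, group_b, groups, ms_within, n):
    """Least Significant Difference t statistic for one pairwise comparison."""
    difference = groups[group_a][0] - groups[group_b][0]
    return difference / sqrt(2 * ms_within / n)


f_stat, df_between, df_within, ms_within = anova_from_summary(GROUPS, N_PER_GROUP)
t_painful_vs_left = lsd_t("control_left", "pfps_painful", GROUPS, ms_within, N_PER_GROUP)

print(f"F({df_between}, {df_within}) = {f_stat:.3f}")
print(f"LSD t, control left vs PFPS painful = {t_painful_vs_left:.3f}")
```

Converting these statistics to p values needs the F and t distributions, which the standard library does not provide; a statistics package would be required for that step.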

4. Discussion

The combined data for the control group provide comparable findings to those previously reported. Reese and Bandy (2003) reported a mean of 18.9° (7.6°) hip adduction in 61 healthy subjects compared to 20.9° (4.3°) in the present study. The only study investigating the Ober test in subjects with PFPS reported the reliability values of the test and not the actual values for hip adduction (Melchione and Sullivan, 1993). Subjects presenting with PFPS in the present study had a tighter ITB on the side with the painful knee, and this was shown to be highly significant compared to both knees in the control group. These data support clinical observations of ITB tightness in subjects presenting with PFPS (Hudson and Darthuy, 2006). However, the limitations of a case-control study mean that these results do not provide evidence of causality.

Additionally, the non-painful knee in the PFPS subjects showed a trend to be tighter than both knees in the control group. These results could be interpreted in several ways. PFPS commonly occurs bilaterally and, if we accept the causative model in which ITB tightness is the cause of PFPS, perhaps the ITB had not become sufficiently tight for the subjects to develop symptoms on the contralateral side. Alternatively, if altered biomechanics are the underlying cause for PFPS, then proximally, poor control of medial hip rotation via gluteus medius could place an existing tight ITB on to a stretch, whereby it is more likely to cause lateral tracking of the patella during dynamic weight bearing. Results from previous studies have suggested that strengthening muscles proximally and improving dynamic alignment can improve the symptoms in subjects with PFPS (Mascal et al., 2003; Cibulka and Threlkeld-Watkins, 2005). Distally, excessive or uncontrolled pronation would also increase lower extremity internal rotation, which would have a similar effect on the ITB length. This may explain why orthoses may have a role to play in some subjects with PFPS (Gross and Foxworth, 2003).

Treatment paradigms for PFPS have included mobilising tight lateral structures, and bracing and taping to provide a sustained stretch on these structures. The latter has been shown to provide short-term pain reduction in these subjects (Herrington, 2004). Some studies have shown short-term deformation of the ITB with stretching. This has been shown directly using ultrasonography (Wang et al., 2006) and indirectly using kinematic and kinetic analysis (Fredericson et al., 2002). However, to date there is no study that has investigated the long-term effects of ITB stretching and mobilisation on ITB length. Whether a tight ITB is causative or a result of PFPS, it is useful to have a valid measure of ITB length.

There are some study limitations that should be acknowledged. The test procedure for this study was adopted from Reese and Bandy (2003) who reported good reliability for the Ober test. However, for pragmatic reasons, no tester reliability was conducted in this study. The tester attempted to control for hip rotation during the Ober test as most clinicians would in the usual patient setting. Whilst this is difficult to standardise, it should be reasonably consistent for the same tester. Alternatively, markers could have been placed on the patella and distal femur in an attempt to more accurately control this variable. A customised questionnaire was used to evaluate the type and frequency of physical activity and sport. Whilst this was not validated, it was used to ensure there was no specific physical activity that dominated either group and could have potentially been a confounding factor.
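The combined control value quoted at the start of the Discussion (20.9 (4.3) degrees) can be related to the separate left and right leg values reported in the Results. The sketch below pools two groups from their means and SDs; it assumes 12 legs per side and treats the 24 legs as one sample, which is an assumption about how the combined figure was obtained rather than something the paper states. The function name is illustrative.

```python
from math import sqrt


def pool_two_groups(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Combine two groups' summary statistics into one mean and sample SD."""
    n_total = n_a + n_b
    pooled_mean = (n_a * mean_a + n_b * mean_b) / n_total
    # Total sum of squares = within-group part + between-group part.
    ss_within = (n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2
    ss_between = (n_a * (mean_a - pooled_mean) ** 2
                  + n_b * (mean_b - pooled_mean) ** 2)
    pooled_sd = sqrt((ss_within + ss_between) / (n_total - 1))
    return pooled_mean, pooled_sd


# Control group, left and right legs (degrees), assuming 12 legs per side.
combined_mean, combined_sd = pool_two_groups(21.4, 4.9, 12, 20.3, 3.8, 12)
print(f"combined control: {combined_mean:.2f} ({combined_sd:.2f}) degrees")
```

Under these assumptions the pooled values come out near 20.85 and 4.32 degrees, in line with the 20.9 (4.3) quoted in the text.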

5. Conclusion

This study has shown that subjects with PFPS have a significantly tighter ITB on the symptomatic side compared to a matched control group of healthy subjects. In order to inform whether this observation is the cause or effect in subjects presenting with PFPS, a study would need to be conducted whereby an asymptomatic group was evaluated for ITB length and followed prospectively to see which subjects developed PFPS.

References

Aminaka N, Gribble PA. A systematic review of the effects of therapeutic taping on patellofemoral pain syndrome. Journal of Athletic Training 2005;40(4):341–51.
Bizzini M, Childs JD, Piva SR. Systematic review of the quality of randomized controlled trials for patellofemoral pain syndrome. Journal of Orthopaedic and Sports Physical Therapy 2003;33(1):4–20.
Cibulka MT, Threlkeld-Watkins J. Patellofemoral pain and asymmetrical hip rotation. Physical Therapy 2005;85(11):1201–7.
Earl JE, Hertel J, Denegar CR. Patterns of dynamic malalignment, muscle activation, joint motion, and patellofemoral-pain syndrome. Journal of Sport Rehabilitation 2005;14(3):215–33.
Fredericson M, Cookingham CL, Chaudhari AM, Dowdell BC, Oestreicher N, Sahrmann SA. Hip abductor weakness in distance runners with iliotibial band syndrome. Clinical Journal of Sport Medicine 2000;10(3):169–75.
Fredericson M, White JJ, MacMahon JM, Andriacchi TP. Quantitative analysis of the relative effectiveness of 3 iliotibial band stretches. Archives of Physical Medicine and Rehabilitation 2002;83(5):589–92.
Gajdosik RL, Sandler MM, Marr HL. Influence of knee positions and gender on the Ober test for length of the iliotibial band. Clinical Biomechanics 2003;18(1):77–9.
Gerrard B. The patello-femoral pain syndrome: a clinical trial of the McConnell programme. The Australian Journal of Physiotherapy 1989;35(2):71–80.
Gross MT, Foxworth JL. The role of foot orthoses as an intervention for patellofemoral pain. Journal of Orthopaedic and Sports Physical Therapy 2003;33(11):661–70.
Hudson ZL, Darthuy E. Iliotibial band tightness and patellofemoral pain syndrome: a case-control study. Conference proceedings. Physical Therapy in Sport 2006;7(4):173.
Herrington L. The effect of patella taping on quadriceps strength and functional performance in normal subjects. Physical Therapy in Sport 2004;5(1):33–6.
Herrington L, Rivett N, Munroa S. The relationship between patella position and length of the iliotibial band as assessed using Ober's test. Manual Therapy 2006;11(3):182–6.
Magee DJ. Orthopaedic physical assessment. 3rd ed. Philadelphia: WB Saunders; 1997. p. 483 [chapter 11].
Mascal CL, Landel R, Powers C. Management of patellofemoral pain targeting hip, pelvis, and trunk muscle function: 2 case reports. Journal of Orthopaedic and Sports Physical Therapy 2003;33(11):647–60.
McConnell J. The management of chondromalacia patellae: a long term solution. The Australian Journal of Physiotherapy 1986;32(4):215–23.
Melchione WE, Sullivan MS. Reliability of measurements obtained by use of an instrument designed to indirectly measure iliotibial band length. Journal of Orthopaedic and Sports Physical Therapy 1993;18(3):511–5.
Puniello MS. Iliotibial band tightness and medial patellar glide in patients with patellofemoral dysfunction. Journal of Orthopaedic and Sports Physical Therapy 1993;17(3):144–8.
Reese NB, Bandy WD. Use of an inclinometer to measure flexibility of the iliotibial band using the Ober test and the modified Ober test: differences in magnitude and reliability of measurements. Journal of Orthopaedic and Sports Physical Therapy 2003;33(6):326–30.
Standring S. Gray's anatomy. 39th ed. Edinburgh: Elsevier Churchill Livingstone; 2005. p. 1479 [chapter 113].
Terry GC, Hughston JC, Norwood LA. The anatomy of the iliopatellar band and iliotibial tract. American Journal of Sports Medicine 1986;14(1):39–45.
Wang TG, Jan MH, Lin KH, Wang HK. Assessment of stretching of the iliotibial tract with Ober and modified Ober tests: an ultrasonographic study. Archives of Physical Medicine and Rehabilitation 2006;87(10):1407–11.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 152–159 www.elsevier.com/math

Original Article

Interobserver reliability of physical examination of shoulder girdle

Jettie G. Nomden a,b,*, Anton J. Slagers a,b, Gert J.D. Bergman c, Jan C. Winters c, Thomas J.B. Kropmans d, Pieter U. Dijkstra a,b,e

a Department of Rehabilitation, University Medical Center Groningen, University of Groningen, P.O. Box 30.001, 9700 RB Groningen, The Netherlands
b Share, Graduate School for Health Research, University Medical Center Groningen, University of Groningen, The Netherlands
c Department of General Practice, University Medical Center Groningen, University of Groningen, The Netherlands
d Department of Medical Informatics & Medical Education, University of Ireland, Galway, Ireland
e Department of Oral and Maxillofacial Surgery, University Medical Center Groningen, University of Groningen, The Netherlands

Received 2 March 2007; received in revised form 20 December 2007; accepted 6 January 2008

Abstract

The objective of this study was to assess interobserver reliability in 23 tests concerning physical examination of the shoulder girdle. A physical therapist and a physical therapist/manual therapist independently performed a physical examination of the shoulder girdle in 91 patients with shoulder complaints of varying severity and duration. The observers assessed 23 items in total: active and passive abduction, passive external rotation, hand in neck (HIN) test, hand in back (HIB) test, impingement test according to Neer, springing test of the first rib and joint play test of the acromioclavicular joint. The interobserver reliability was evaluated by means of Cohen's Kappa, the weighted Kappa and the intraclass correlation (ICC). Criteria for acceptable reliability were a Kappa value of at least 0.60, an ICC of at least 0.75 or an absolute agreement of at least 80%. The results showed that Kappa values varied from 0.09 (springing test first rib, stiffness) to 0.66 (springing test first rib, pain), weighted Kappa varied from 0.35 (pain during HIB) to 0.73 (range of motion HIB) and ICC varied from 0.54 (abduction passive starting point painful arc) to 0.96 (active and passive ranges of motion in abduction). In total 11 (48%) items fulfilled the criteria of acceptable reliability. In conclusion, there appears to be a great deal of variation in the reliability of the tests used in the physical examination of the shoulder girdle. Over 50% of the tests did not meet the statistical criteria for acceptable reliability.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Reliability; Observer; Shoulder girdle; Physical examination
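To make the reliability criteria in the abstract concrete, the sketch below computes absolute agreement and an unweighted Cohen's Kappa for two observers rating the same patients on a dichotomous item. The ratings are hypothetical and are not data from this study, and the function names are illustrative. The thresholds are read here as "at least 0.60" for Kappa and "at least 80%" for agreement, which is an assumption because the comparison symbols were lost in the extracted text.

```python
from collections import Counter


def absolute_agreement(rater_a, rater_b):
    """Proportion of patients on which both observers gave the same rating."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    return sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's Kappa: agreement corrected for chance agreement."""
    n = len(rater_a)
    observed = absolute_agreement(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement from each observer's marginal rating frequencies.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical ratings for 10 patients: 1 = painful, 0 = not painful.
observer_1 = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
observer_2 = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

agreement = absolute_agreement(observer_1, observer_2)
kappa = cohens_kappa(observer_1, observer_2)

# Assumed reading of the thresholds: Kappa >= 0.60, agreement >= 80%.
print(f"agreement = {agreement:.0%}, meets 80% criterion: {agreement >= 0.80}")
print(f"kappa = {kappa:.2f}, meets 0.60 criterion: {kappa >= 0.60}")
```

In this invented example the observers agree on 80% of patients yet Kappa stays below 0.60, which illustrates why the two criteria can classify the same test differently.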

* Corresponding author. +31 50 3613651.
E-mail address: [email protected] (J.G. Nomden).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.005

1. Introduction

Shoulder complaints are common in the locomotor system. The yearly prevalence of shoulder complaints ranges from 100 to 160 per 1000 patients in the general population (Winters et al., 1999). The diagnosis in patients with shoulder complaints is difficult because currently no uniformity exists as to how shoulder complaints should be labelled or defined (Green et al., 1998a). Diagnostic criteria for defining shoulder disorders are neither consistently nor reliably applied (Green et al., 2003). According to the Guidelines for Shoulder Complaints of the Dutch College of General Practitioners (Winters et al., 1999) most shoulder complaints are elicited by shoulder disorders, probably resulting from strain, aseptic inflammation or degeneration of soft tissues of the glenohumeral joint or of structures in the immediate surroundings. In most cases it cannot be determined accurately which structure is affected.

J.G. Nomden et al. / Manual Therapy 14 (2009) 152e159

Hence, the term ‘shoulder complaints’ is used as a working as well as a ﬁnal diagnosis (Winters et al., 1999). Shoulder complaints may result in considerable disability (Green et al., 2003). Shoulder pain often impairs the ability to sleep, and restricted and/or painful range of motion of the shoulder inﬂuences performance of activities of daily living (Green et al., 2003). Treatment of shoulder complaints is aimed at reducing symptoms such as pain and restricted range of motion, increasing functional activities and re-starting participation in work and social activities. In order to focus treatment and to evaluate eﬀectiveness of treatment, reliable tests are an important prerequisite. Reliability of assessment of shoulder complaints and function of the shoulder diﬀers per study, ranging from low to moderate (Green et al., 1998b; de Winter, 1999; Hoving et al., 2002; Terwee et al., 2005). Recently, movement tests of the shoulder and shoulder girdle, as recommended in the Guidelines for Shoulder Complaints of the Dutch College of General Practitioners (Winters et al., 1999), together with additional functional tests were used as outcome measures in a randomised controlled trial (Bergman et al., 2002). Thus these tests were used for evaluation of treatment eﬃcacy. To interpret the outcomes of this study it is important to evaluate the reliability of the tests used. Diﬀerences found in the trial within or between groups may be caused by diﬀerences in treatment eﬀects but also by diﬀerences between observers. The aim of the present study is to determine the interobserver reliability of the physical examination of the shoulder girdle as performed in the above-mentioned randomised controlled trial.

2. Methods

Consecutive patients eligible for participation in the randomised controlled trial were invited to participate in this reliability study. Inclusion criteria for patients in that trial were presence of shoulder complaints, not being treated for these complaints in the past 3 months and aged over 18 yrs. Shoulder complaints were defined as pain at rest or provoked or aggravated by movement in the area between neck and elbow. Informed consent was obtained from all patients. Extension of the pain to the region between the scapulae, to the cervical spine or to the lower part of the arm was not an exclusion criterion. Exclusion criteria for patients were presence of specific rheumatic disorders, shoulder complaints caused by acute severe trauma or previous surgery, signs of cervical nerve root compression, or shoulder complaints related to general internal pathologic conditions of thoracic and abdominal organs (Bergman et al., 2002). Most patients included in the randomised controlled trial also participated in this reliability study.


Physical examinations were performed independently by a physical therapist and a physical therapist/manual therapist (JGN and AJS, 27 and 12 yrs practice experience, respectively). Before the study (clinical trial and reliability study) all tests were standardised and the observers received training in the application of the tests. The diagnosis was unknown to the observers. The order of examination by the two observers varied. Each observer examined about half of the patients as first observer followed by the second observer who performed the same examination a few minutes later. Patients were sitting upright during all examinations. All tests were performed in the morning. During the study the two physical therapists did not exchange information concerning the outcome of the assessments. Patients were instructed not to give any comment about the previous examination.

2.1. Examination of shoulder girdle

The examination of the shoulder girdle was based upon the Guidelines for Shoulder Complaints of the Dutch College of General Practitioners (Winters et al., 1999). The examination was focused on range of motion of the shoulder (visually assessed to the nearest 5°), on pain experienced (four point ordinal scale: no pain, little pain, much pain, and excruciating pain) and on occurrence of pain during movement. The following movements were examined:

2.1.1. Functional tests: hand in neck (HIN) test and hand in back (HIB) test

Both tests were slightly modified from the tests described by Solem-Bertoft et al. (1996) (Appendix 1). The HIN and HIB were graded into a score (range 0–7) based upon the end point reached. Additionally, during the HIN and HIB pain was assessed on a four point ordinal scale: no pain, little pain, much pain, and excruciating pain.

2.1.2. Active abduction

The starting position of the patient was arm stretched alongside the body, held in external rotation and thumb directed sideward. The patient lifted his extended arm sideways and upwards in the frontal plane until it was beside his head. The range of motion and pain were assessed.

2.1.3. Painful arc during active abduction

Presence of a painful arc was assessed and, if present, its starting point and end point were visually estimated.

2.1.4. Passive abduction

The starting position of the patient was arm stretched alongside the body, held in external rotation and thumb directed sideward. The patient was asked to keep the shoulder arm muscles relaxed. The observer lifted the extended arm sideways and upwards in the frontal plane until it was beside the patient's head. The range of motion and pain were assessed.

2.1.5. Painful arc during passive abduction

Presence of a painful arc was assessed and, if present, its starting point and end point were visually estimated.

2.1.6. Passive external rotation

The starting position of the upper arm was 0° elevation, elbow held in 90° and forearm in neutral position. The patient was asked to keep the shoulder arm muscles relaxed. The observer supported the arm at the wrist, locked the elbow, held the arm bent at 90° and rotated it outwards in the transverse plane. Range of motion and pain were assessed.

2.1.7. Impingement test

The impingement test was only performed if no glenohumeral restrictions were found. The starting position was similar to passive abduction. During the test scapular rotation was prevented with one hand by the observer, while the other hand of the observer raised the patient's arm in abduction, causing the greater tuberosity to impinge against the acromion. The results of the tests were interpreted as positive or negative (Neer, 1983).

2.1.8. Springing test of the first rib

The observer exerted force with the second metacarpophalangeal joint on the first rib of the patient, assessing range of motion (normal or restricted), pain (present or absent), and joint stiffness (present or absent) (Jirout, 1986).

2.1.9. Acromioclavicular joint assessment

Visual assessment of swelling (present or absent) and joint play test of the acromioclavicular joint. The observer manipulated the joint in the sagittal plane assessing presence of pain (present or absent).

According to the protocol in the randomised clinical trial each observer assessed the active and passive movements in one or two movements. No verbal encouragements were given by the observers during active tests.

2.2. Statistical analysis

Data analyses were performed in SPSS (version 12). Percentage of absolute agreement (calculated as the number of observations in which both observers agreed with each other divided by the total number of observations), Cohen's Kappa and weighted Cohen's Kappa were calculated to quantify the interobserver agreement for dichotomous data and ordinal data. Regarding range of motion of the shoulder, t-tests for related samples were performed and intraclass correlations (ICCs) were calculated. Additionally Bland and Altman (1986) plots were made for range of motion of the shoulder to analyse if the differences between observers were consistent across the range of measurements. Criteria for acceptable reliability were a Kappa value ≥0.60 and an ICC ≥0.75 (Landis and Koch, 1977; Brouwer et al., 2003). A poor Kappa value can be present although absolute agreement is very high, probably related to lack of variation in cell filling. Therefore, an absolute agreement of ≥80% was also a criterion for an acceptable agreement.

This study was approved by the Medical Ethics Committee of the University Medical Center Groningen, University of Groningen, The Netherlands.
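The agreement statistics named in Section 2.2 can be illustrated with a minimal sketch. It is not the SPSS procedure used in the study: the function names, the linear weighting scheme and the example ratings are all assumptions made for illustration. The hypothetical data show how a sign that is almost always absent can give very high absolute agreement together with a Kappa close to zero, which is the situation the second criterion (absolute agreement) is meant to cover.

```python
from collections import Counter


def absolute_agreement(r1, r2):
    """Share of observations on which both observers gave the same rating."""
    return sum(a == b for a, b in zip(r1, r2)) / len(r1)


def cohen_kappa(r1, r2, categories, weighted=False):
    """Cohen's kappa for two observers.

    categories must list at least two categories in their natural order.
    With weighted=True, linear disagreement weights |i - j| / (k - 1) are
    used (an assumption; the paper does not state which weights were applied).
    Returns None when chance-expected disagreement is zero (kappa undefined).
    """
    n = len(r1)
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    table = Counter((index[a], index[b]) for a, b in zip(r1, r2))
    row = [sum(table[(i, j)] for j in range(k)) / n for i in range(k)]
    col = [sum(table[(i, j)] for i in range(k)) / n for j in range(k)]

    def weight(i, j):
        if weighted:
            return abs(i - j) / (k - 1)
        return 0.0 if i == j else 1.0

    observed = sum(weight(i, j) * table[(i, j)] / n
                   for i in range(k) for j in range(k))
    expected = sum(weight(i, j) * row[i] * col[j]
                   for i in range(k) for j in range(k))
    if expected == 0:
        return None
    return 1 - observed / expected


# Hypothetical data: 100 patients, a sign that is almost always absent.
obs1 = ["present"] + ["absent"] * 99
obs2 = ["absent", "present"] + ["absent"] * 98

agreement = absolute_agreement(obs1, obs2)
kappa = cohen_kappa(obs1, obs2, ["present", "absent"])
print(f"agreement = {agreement:.2f}, kappa = {kappa:.3f}")
```

In this made-up example the observers agree on 98 of 100 patients, yet Kappa is slightly below zero because nearly all of that agreement is expected by chance.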

3. Results

A total of 91 participants were included in the study. Table 1 shows baseline characteristics of the patients. Generally, the duration of shoulder complaints ranged between 3 and 5 weeks. Many patients had had previous periods of shoulder complaints. In total 76 participants were assessed 6 weeks after inclusion in the trial and 15 participants were assessed 12 weeks after inclusion in the trial.

Table 2 shows Cohen's Kappa and absolute agreement for dichotomous data. For one test ('acromioclavicular swelling') Cohen's Kappa could not be calculated because of incomplete filling of the 2 × 2 tables. For two tests ('acromioclavicular swelling' and 'springing test first rib pain') acceptable reliability (absolute agreement > 80%) was found.

Table 3 shows the results in absolute agreement for ordinal data. In two functional tests ('pain HIN' and 'pain HIB') the absolute agreement was less than 80%. In the other seven tests the reliability was acceptable.

Data of the differences between observers, results of t-tests for differences in mean range of motion between observers, and the corresponding ICC are shown in Table 4. For the tests 'abduction passive starting point of painful arc' and 'passive external rotation' the difference between the observers was statistically significant. For these outcome variables no plots were made because systematic differences between the observers exist (Bland and Altman, 1986). In Figs. 1 and 2 Bland and Altman plots are shown for 'abduction range of motion active' and 'abduction active starting point of painful arc' to illustrate the magnitude and direction of differences across the range of measurements. No funnel shape was observed in the plots. Similar results are found in Bland and Altman plots for 'abduction passive range of motion', 'abduction active end point of painful arc' and for 'abduction passive end point of painful arc'. Thus differences between observers were consistent across the range of measurements for these tests. In two tests (range of motion in active and passive abductions) an ICC of >0.75 was observed. For these tests the interobserver reliability was acceptable. In summary, 11 of the 23 tests (48%) had an acceptable interobserver reliability.

Table 1 Baseline characteristics of the participating patients.

Variables                                              N = 91
Age in years (mean (SD))                               48.5 (11.8)
Male                                                   43 (47.3%)
Female                                                 48 (52.7%)
Duration complaints
  0–2 weeks                                            9 (9.9%)
  3–5 weeks                                            28 (30.8%)
  6–8 weeks                                            13 (14.3%)
  9–11 weeks                                           11 (12.1%)
  12–26 weeks                                          12 (13.2%)
  >26 weeks                                            18 (19.8%)
Previous periods of shoulder complaints
  No                                                   31 (34.1%)
  Yes, left shoulder                                   23 (25.3%)
  Yes, right shoulder                                  28 (30.8%)
  Yes, both shoulders                                  9 (9.9%)
Previous neck complaints (minimally 1 week)
  No                                                   36 (39.6%)
  Yes                                                  55 (60.4%)
Development of complaints
  Rapid/acute                                          28 (31%)
  Gradual                                              63 (69%)
Shoulder pain (range 0–10)                             3.4 (2.2)
Shoulder restrictions (range 0–10)                     4.5 (2.8)

Table 2 Cohen's Kappa and absolute agreement for dichotomous data.

Variables                                                     Kappa   Absolute agreement (%)
Active painful arc (present, absent)                          0.46    74
Passive painful arc (present, absent)                         0.52    76
Impingement (present, absent)                                 0.47    74 b
Acromioclavicular swelling (present, absent)                  –       99 a
Springing test first rib range of motion (normal, restricted) 0.26    66
Springing test first rib stiff (present, absent)              0.09    68
Springing test first rib pain (present, absent)               0.66    82 a

–: Cohen's Kappa could not be calculated because of incomplete filling of the 2 × 2 tables.
a Tests fulfilling criteria for acceptable reliability.
b Test only performed if no restrictions in glenohumeral range of motion were found.

Table 3 Weighted Kappa and absolute agreement for ordinal data.

Variables                        Kappa   Absolute agreement (%)
Range of motion
  HIN                            0.52    85 a
  HIB                            0.73    94 a
Pain
  HIN                            0.52    79
  HIB                            0.35    73
  Abduction active               0.65    90 a
  Abduction passive              0.69    91 a
  External rotation passive      0.50    82 a
  Impingement                    0.62    91 a
  Acromioclavicular joint        0.51    90 a

a Tests fulfilling criteria for acceptable reliability.
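As a rough illustration of the interval-level analysis reported in Table 4, the sketch below computes a paired t statistic from the published mean difference and standard deviation, and a one-way random-effects ICC from raw paired ratings. The function names are hypothetical, n = 91 is an assumption (the number of patients actually measured per test is not stated), and the raw ratings are invented, since the individual measurements are not published.

```python
import math
from statistics import mean


def paired_t_from_summary(mean_diff, sd_diff, n):
    """t statistic of a paired t-test from the mean and SD of the differences."""
    return mean_diff / (sd_diff / math.sqrt(n))


def icc_one_way(ratings):
    """One-way random-effects ICC for single ratings.

    ratings: list of per-patient tuples, one value per observer.
    """
    n = len(ratings)
    k = len(ratings[0])
    grand = mean(v for subject in ratings for v in subject)
    subject_means = [mean(subject) for subject in ratings]
    ms_between = k * sum((m - grand) ** 2 for m in subject_means) / (n - 1)
    ms_within = sum((v - m) ** 2
                    for subject, m in zip(ratings, subject_means)
                    for v in subject) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Summary values from Table 4; n = 91 is an assumption (all patients measured).
t_external_rotation = paired_t_from_summary(7.7, 14.2, 91)
t_passive_abduction = paired_t_from_summary(1.0, 10.0, 91)

# Hypothetical raw data (degrees): (observer 1, observer 2) per patient.
example = [(160, 160), (90, 95), (170, 165), (120, 125)]
icc = icc_one_way(example)
print(round(t_external_rotation, 2), round(t_passive_abduction, 2), round(icc, 2))
```

Under these assumptions the t statistic for passive external rotation is a little above 5, in line with the reported p < 0.001, whereas for passive abduction it stays below 1, in line with the non-significant p value.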

4. Discussion

Substantial variation in the interobserver reliability, ranging from poor to good reliability in the tests of physical examination of the shoulder girdle, was found in this study. Of the 23 tests performed, 11 (48%) fulfilled the criteria of an acceptable reliability. For the tests on dichotomous data two out of seven tests showed acceptable reliability, for tests on ordinal data seven out of nine tests showed acceptable reliability and for tests on interval data two out of seven tests showed acceptable reliability (Tables 2–4). Thus, tests on ordinal data showed a higher reliability than tests on dichotomous or interval data.

One might consider several explanations for the overall moderate reliability reported in this study. These explanations are related to the data level of the physical examination, training effects within patients, difference between observers and changes of the outcome as a result of the first physical examination.

4.1. Data level

An explanation for better reliability results of tests at ordinal data level could be that patients prefer more response options. Answering on a more gradual, ordinal, scale (no pain, little pain, much pain, and excruciating pain) might be easier than answering on a dichotomous scale: pain absent or present. On a gradual scale patients can indicate more precisely how they experience the pain during the test.

The tests producing interval data were all tests based on visual estimation by the observer of active/passive range of motion and starting/end point of a painful arc. Two movements at most were performed during which the examiner had to do his assessment because this was the trial protocol. For the movements active and passive abductions a good reliability was found despite the large standard deviations of the mean difference between the observers. For the observer it may be more difficult (i.e. less reliable) to assess range of motion during the movement, as for instance the starting point or end point of a painful arc, than in an end position of active and passive abductions. A significant difference between the assessments of the two observers was found in 'abduction passive starting point of painful arc' and 'passive external rotation'.

The standard deviations of the mean difference between the observers provide an indication of the range of differences found between these observers. These differences are illustrated in the Bland and Altman plots (Figs. 1 and 2). The standard deviation of mean difference between the observers for 'abduction active' (11.1°) indicates that if two observers measure the same patients a difference of 2 × 11.1° is to be expected in 95% of the number of patients. For the standard deviation of the 'abduction passive end point of painful arc' a difference of 2 × 19.5° is to be expected in 95% of the number of patients. These differences are considerable in the light of the total range measured.

Table 4 Differences between observer 1 and observer 2, results of t-test for related samples and ICCs.

Variable                                    Observer 1     Observer 2     Difference     p value     ICC (one way
                                            mean (SD)      mean (SD)      mean (SD)                  random)
Abduction range of motion
  Active                                    160.2 (40.0)   160.2 (38.8)   0.0 (11.1)     1.000       0.96 a
  Passive                                   165.9 (33.0)   165.0 (34.3)   1.0 (10.0)     0.346       0.96 a
Abduction active
  Starting point of painful arc             104.8 (39.2)   110.7 (37.2)   5.9 (28.5)     0.180       0.72
  End point of painful arc                  158.0 (26.4)   153.0 (31.4)   5.0 (26.7)     0.226       0.57
Abduction passive
  Starting point of painful arc             114.7 (35.2)   126.9 (36.3)   12.2 (33.1)    0.032 b     0.54
  End point of painful arc                  162.6 (24.8)   160.9 (26.5)   1.6 (19.5)     0.617       0.72
External rotation range of motion passive   55.5 (19.4)    63.2 (21.5)    7.7 (14.2)     <0.001 b    0.70

a Tests fulfilling criteria for acceptable reliability between observers.
b Tests showing significant differences between observers.

4.2. Training effects

It is remarkable that the tests on an ordinal scale 'pain HIN' and 'pain HIB' did not show an acceptable reliability. It is possible that a training effect occurs within the patient during the physical examinations because 'pain HIN' and 'pain HIB' tests were the first tests in the examination. Patients may find it difficult, initially, to indicate the experienced pain level (no pain, little pain, much pain, and excruciating pain) during the test.

4.3. Observer differences

Examinations were carried out by two experienced physical therapists, who had been trained extensively in performing the tests. However, one of them was also a manual therapist. Manual therapy is a postgraduate course undertaken following a physical therapy course. Manual therapists are specialised in diagnosing and treatment of dysfunction of the musculoskeletal system. Therefore, it is possible that the physical signs and symptoms were interpreted differently by the two observers.

Fig. 1. Bland and Altman plot of the mean (of the two observers) active range of motion abduction plotted against the diﬀerence in active range of motion abduction between observers. Note that some data points represent more than one observation.

Fig. 2. Bland and Altman plot of the mean (of the two observers) starting point of painful arc abduction active plotted against the diﬀerence between observers of starting point of painful arc abduction active. Note that some data points represent more than one observation.
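The interpretation of the standard deviation of the differences given in Section 4.1 (a difference of 2 × SD expected in 95% of patients) corresponds to the Bland and Altman limits of agreement. The following is a minimal sketch under stated assumptions: the function names are hypothetical, the multiplier of 2 follows the wording of the text (1.96 is also commonly used), and only the published summary values from Table 4 are used.

```python
from statistics import mean, stdev


def limits_from_summary(mean_diff, sd_diff, multiplier=2.0):
    """Approximate 95% limits of agreement: mean difference +/- multiplier * SD."""
    half_width = multiplier * sd_diff
    return mean_diff - half_width, mean_diff + half_width


def limits_from_pairs(pairs, multiplier=2.0):
    """Same limits computed from raw (observer 1, observer 2) measurements."""
    diffs = [a - b for a, b in pairs]
    return limits_from_summary(mean(diffs), stdev(diffs), multiplier)


# Mean difference and SD (degrees) taken from Table 4.
active_abduction = limits_from_summary(0.0, 11.1)
passive_arc_end = limits_from_summary(1.6, 19.5)
print(active_abduction, passive_arc_end)
```

For active abduction this gives limits of roughly 22° either side of zero, which is the magnitude the text describes as considerable in the light of the total range measured.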


Practical issues dictated which of the two observers performed the first or the second examination. In a post-hoc analysis the influence of observer sequence was analysed for differences in active and passive abductions, for passive external rotation and for start and end of painful arc, active and passive. Only for two movements, passive external rotation and start of the painful arc (passive), did the sequence have a significant influence on the differences between the observers. It is not clear why this phenomenon occurred only in these two movements. For all other movements the observer sequence had no effect on the differences between observers.

4.4. Systematic changes of the outcome as a result of the first examination

It is possible that the first examination induces a change in magnitude or presence of an outcome measure and as a consequence the results of the second examination will differ from those of the first. For instance, pain provoked during the first examination of active abduction may increase pain perception during the second examination or may even influence the outcome of the assessment of the range of motion.

4.5. Random changes of outcome as a result of the first examination

Finally it is possible that the differences between the first and the second examinations are based on random changes within the outcome variables assessed. An explanation for these differences cannot be given. Theoretically it might be possible that current neck pain influenced reliability of physical examination. This influence would only be possible if the influence of neck pain were different for the two observers, thereby inducing a difference in outcomes of the observers. This differential influence of neck pain on reliability results was not analysed in this study.

4.6. Other considerations

The tests analysed in the reliability study are all tests commonly used in physical therapy practice and in clinical medical practice. The choice to include a test in this reliability study was pragmatic. Retrospectively it might have been more interesting or clinically more relevant if other tests focussing on functional limitations or pathophysiology had been investigated. For the tests in this study no technical instruments were used, which makes these tests suitable for use in daily practice. Some reliability studies on shoulder movement have been performed using instruments (Riddle et al., 1987; Green et al., 1998b; Hoving et al., 2002), but it has not been established beyond dispute that using instruments results in higher reliability.

In Tables 5 and 6 an overview of the results of studies similar to the current one is presented. Comparing the present results with those of other studies is difficult because of differences in research methodology, for instance differences in diagnostic tests applied, joints assessed, active and passive motions, testing positions, and the profession of the observers (Riddle et al., 1987; Croft et al., 1994; Green et al., 1998b; de Winter, 1999; Hoving et al., 2002; Terwee et al., 2005). Within these studies and in the current study a similar variability was found concerning interobserver reliability (Tables 5 and 6).

In the studies by Green et al. (1998b) and Hoving et al. (2002) the same design was used for a similar patient group. The physiotherapists achieved overall better results for interobserver reliability than the rheumatologists. Perhaps the training of physical therapists in physical examination during these studies was more extensive than that of the rheumatologists. In Terwee's study (Terwee et al., 2005) five movements of the shoulder were estimated visually. Three tests and test conditions were similar to those in the current study. Active and passive abductions showed acceptable reliability in the current study as well as in Terwee's study. The mean difference and the standard deviation for active and passive abductions were higher in Terwee's study than in the current study. In de Winter's (1999) study interobserver agreement of the examination of the shoulder joint was assessed and Kappa values and absolute agreement were calculated. Five tests in that study were similar to the tests in the current study and similar reliability results were found (Table 6).

In the current study two observers were used for logistical reasons. Because of the use of two observers we felt obligated to investigate interobserver differences. Within the time limits of this trial it was not possible to assess additionally the intraobserver reliability. In daily practice it is possible that two colleagues may temporarily take over each other's duties. In that case, interobserver reliability as assessed in this study is important. Differences in assessment results may be caused by improvements of the complaints but they may also reflect interobserver differences.

The strength of the current study is the substantial number of patients (n = 91) that participated. All patients who were asked to participate in this reliability study actually participated. However, not all patients participating in the trial of Bergman et al. (2002) could be recruited because of logistical reasons. The authors have no reason to believe that the selection of the patients for the reliability study may have influenced the results.

Interobserver reliability of physical tests was moderate in this study as well as in other studies. Differences in assessments performed by two observers on the same subject do not automatically indicate actual change in


Table 5 ICC reliability in similar shoulder movements in diﬀerent studies.

Observers: n Patients: n Year Profession Professional experience Method Standardization Time interval Movements/ comparable movements: n Flexion, act. Abduction, act. Abduction, pass.

Riddle (Riddle et al., 1987)

Croft (Croft et al., 1994) study 1

Croft (Croft et al., 1994) study 2

Green (Green et al., 1998b)

Hoving (Hoving et al., 2002)

Terwee Nomden (Terwee current et al., 2005)

16 50 1987 PT 6.3 yrs (mean)

6 6 1994 PT e

6 6 1994 PT e

6 6 1998 PT/MT Experienced

6 6 2002 Rheumatologists Experienced

Goniometer, large, small e e 7/2

Visual

Visual

Inclinometer Inclinometer

2 201 2005 PT 3 and 10 yrs Visual

2 91 current PT/MT 27 and 15 yrs Visual

Yes 15 min 2/2

Yes e 2 (4 pos)/2

Yes 1 hr 8/2

Yes 1 hr 8/2

Yes <1 hr 5/3

Yes <5 min 5

0.72 0.77a

0.72 0.49

0.88a 0.87a

0.96a 0.96a

4.7 (20.1) 4.1 (22.7)

0.0 (11.1) 1.0 (10.0)

0.88a (lying)

0.29 0.73

0.70

11.2 (12.0)

7.7 (14.2)

0.87a (large), 0.84a (small)

External rotation, act. External 0.88a (large), 0.90a (small) rotation, pass. Hand behind back

0.95a

0.99a

0.43

0.37 0.80a

0.73

Terwee (Terwee et al., 2005) mean diﬀerence (SD)

Nomden current mean diﬀerence (SD)

94a (abs. agr.)

e: not reported. a Acceptable reliability.

the outcome measures of that subject. Determining improvement or deterioration is not easy. It is still not clear which (combination of) tests should be used in diagnosing shoulder disorders and evaluation of shoulder treatment. It is recommended that more interobserver reliability studies should be carried out on tests producing ordinal data in order to analyse sources of measurement variation.

5. Conclusion

A great variability in reliability exists in physical tests of the shoulder girdle. Despite the use of a standardised protocol to assess physical examination of the shoulder girdle, acceptable interobserver reliability was hard to achieve. In this study overall reliability was moderate. The most reliable tests in the study were tests at ordinal data level. In other reliability studies substantial variability has also been found in interobserver reliability. Unfortunately, it is difficult to compare these studies. Further investigations have to be carried out to find out which (combination of) tests is most suitable to assess shoulder complaints. Clinicians and researchers should interpret outcomes of physical examination of the shoulder girdle cautiously because outcomes might be biased by observer differences, but also by other sources of variation.

Table 6 Kappa and absolute agreement in shoulder tests.

                                   Kappa                                Abs. agreement
                                   de Winter (1999)   Nomden (current)  de Winter (1999)   Nomden (current)
Patients (n)                       201                91                201                91
Abduction active, pain             0.73 a             0.65 a            95% a              90% a
Abduction passive, pain            0.44               0.69 a            89% a              91% a
External rotation passive, pain    0.45               0.50              80% a              82% a
Presence painful arc active        0.67 a             0.46              88% a              74%
Presence painful arc passive       0.59               0.52              89% a              76%

a Acceptable reliability.


Appendix 1 HIN and HIB as assessed in the randomised controlled trial concerning the effectiveness of manual therapy of the shoulder girdle

HIN: an external rotation movement pattern. HIB: an internal rotation movement pattern.

Score 1
  HIN: From hand on thigh up to and including HIN on affected side, underarm in sagittal plane (90° flexed elbow fixed against hip)
  HIB: From hand on thigh till lateral side thigh-bone with palm of the hand

Score 2
  HIN: From HIN at affected side and underarm in sagittal plane just to touching with fingertips processus spinosi C7 and underarm (about) in sagittal plane
  HIB: From palm of the hand on lateral side of thigh-bone till back of the hand on homolateral buttock

Score 3
  HIN: From fingertips on processus spinosi C7 with underarm (about) in sagittal plane just to elbow in frontal plane
  HIB: From back of the hand on homolateral buttock till back of the hand on lumbosacral crossing (the height of processus spinosus L5)

Score 4
  HIN: From fingertips on processus spinosi C7 and underarm in frontal plane just to fingertips at heterolateral angulus superior scapulae with underarm in sagittal plane
  HIB: From back of the hand on lumbosacral crossing till fist on waist (the height of processus spinosi L3)

Score 5
  HIN: From fingertips on heterolateral angulus superior scapulae with underarm in sagittal plane just to elbow in frontal plane
  HIB: From fist on waist till back of the hand on thoracolumbal crossing (the height of processus spinosi Th 12)

Score 6
  HIN: From fingertips on heterolateral angulus superior scapulae with elbow in frontal plane just to (almost) full abduction/elevation, but painful terminal passive abduction/elevation
  HIB: From back of the hand on thoracolumbal crossing to fingertips on heterolateral angulus inferior scapulae

Score 7
  HIN: Active full abduction/elevation and (almost) painless terminal abduction/elevation
  HIB: From fingertips on heterolateral angulus inferior scapulae till back of the hand between scapulae (the height of processus spinosi Th 7)

HIN and HIB slightly modified from Solem-Bertoft et al. (1996).

References

Bergman GJ, Winters JC, van der Heijden GJ, Postema K, Meyboom-de Jong B. Groningen manipulation study. The effect of manipulation of the structures of the shoulder girdle as additional treatment for symptom relief and for prevention of chronicity or recurrence of shoulder symptoms. Design of a randomized controlled trial within a comprehensive prognostic cohort study. Journal of Manipulative and Physiological Therapeutics 2002;25(9):543–9.

Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;1(8476):307–10.

Brouwer S, Reneman MF, Dijkstra PU, Groothoff JW, Schellekens JM, Goeken LN. Test–retest reliability of the Isernhagen work systems functional capacity evaluation in patients with chronic low back pain. Journal of Occupational Rehabilitation 2003;13(4):207–18.

Croft P, Pope D, Boswell R, Rigby A, Silman A. Observer variability in measuring elevation and external rotation of the shoulder. Primary Care Rheumatology Society Shoulder Study Group. British Journal of Rheumatology 1994;33(10):942–6.

Green S, Buchbinder R, Glazier R, Forbes A. Systematic review of randomised controlled trials of interventions for painful shoulder: selection criteria, outcome assessment, and efficacy. BMJ 1998a;316(7128):354–60.

Green S, Buchbinder R, Forbes A, Bellamy N. A standardized protocol for measurement of range of movement of the shoulder using the Plurimeter-V inclinometer and assessment of its intrarater and interrater reliability. Arthritis Care and Research 1998b;11(1):43–52.

Green S, Buchbinder R, Hetrick S. Physiotherapy interventions for shoulder pain. Cochrane Database of Systematic Reviews 2003;2:CD004258.

Hoving JL, Buchbinder R, Green S, Forbes A, Bellamy N, Brand C, et al. How reliably do rheumatologists measure shoulder movement? Annals of the Rheumatic Diseases 2002;61(7):612–6.

Jirout J. X-ray studies on the dynamics of the first rib. Manual Medicine 1986;2:59–61.

Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977;33(1):159–74.

Neer CS. Impingement lesions. Clinical Orthopaedics and Related Research 1983;173:70–7.

Riddle DL, Rothstein JM, Lamb RL. Goniometric reliability in a clinical setting. Shoulder measurements. Physical Therapy 1987;67(5):668–73.

Solem-Bertoft E, Lundh I, Westerberg CE. Pain is a major determinant of impaired performance in standardized active motor tests. A study in patients with fracture of the proximal humerus. Scandinavian Journal of Rehabilitation Medicine 1996;28(2):71–8.

Terwee CB, de Winter AF, Scholten RJ, Jans MP, Deville W, van Schaardenburg D, et al. Interobserver reproducibility of the visual estimation of range of motion of the shoulder. Archives of Physical Medicine and Rehabilitation 2005;86(7):1356–61.

de Winter AF. Diagnosis and classification of shoulder complaints. Vrije Universiteit; 1999. p. 23–37.

Winters JC, Sobel JS, van der Windt DAWM, Jonquiere M, de Winter AF, van der Heijden GJ, et al. NHG Standaard Schouderklachten (versie 1999) (Guidelines for Shoulder Complaints of the Dutch College of General Practitioners (version 1999)). Huisarts en Wetenschap 1999;42:222–31.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 160–166 www.elsevier.com/math

Original Article

Displacement of the head of humerus while performing "mobilization with movements" in glenohumeral joint: A cadaver study

Kai-Yu Ho a, Ar-Tyan Hsu b,c,*

a Musculoskeletal Biomechanics Research Laboratory, Division of Biokinesiology and Physical Therapy, University of Southern California, Los Angeles, CA, USA
b Department of Physical Therapy, College of Medicine, National Cheng Kung University, 1 University Road, Tainan, Taiwan
c Institute of Allied Health Sciences, College of Medicine, National Cheng Kung University, 1 University Road, Tainan, Taiwan

Received 12 September 2006; received in revised form 17 December 2007; accepted 6 January 2008

Abstract

The purpose of this study was to compare the displacement of the center of the humeral head (CHH), in a cadaveric glenohumeral joint model, during an experimental abduction simulation with and without the application of a mobilization with movement (MWM) maneuver in an anteroposterior direction. Ten physiotherapists performed passive abduction and a posteriorly directed MWM technique on a fresh cadaveric shoulder joint. The applied forces and joint angles were monitored and displacement of the CHH was calculated. In the abduction only trial, displacement of the humeral head was less than 0.9 mm in posterior, inferior, and lateral directions. During the MWM trial there were significant increases in the displacement of the humeral head posteriorly (7.7 mm), inferiorly (2.7 mm), and laterally (0.5 mm) below 52° of abduction. We suggest that the MWM technique may be effective in changing the joint kinematic characteristics during glenohumeral abduction. This hypothesis, however, would need to be tested in vivo with abduction performed actively.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Manual therapy; Shoulder joint; Mobilization with movement; Cadaver study

* Corresponding author. Department of Physical Therapy, College of Medicine, National Cheng Kung University, 1 University Road, Tainan, Taiwan. Tel.: +886 6 2353535x5931; fax: +886 6 2370411. E-mail address: [email protected] (A.-T. Hsu).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.008

1. Introduction

“Mobilization with movement” (MWM) is a therapeutic technique developed by a New Zealand physiotherapist, Brian Mulligan. The hypothesis behind the technique is that MWM corrects the positional fault and returns the joint to normal alignment and pain free function (Mulligan, 1999). When treating patients with a painful arc of movement the physiotherapist commonly applies a posteriorly directed force while the patient actively elevates the arm (Mulligan, 1999). If the patient is pain free throughout the range of elevation, the positional fault is said to be corrected. MWM techniques were reported to be effective in reducing pain (Abbott et al., 2001; Backstrom, 2002; Konstantinou et al., 2002; Paungmali et al., 2003), increasing active range of motion (Backstrom, 2002; Konstantinou et al., 2002) and promoting strength (Abbott et al., 2001; Paungmali et al., 2003). Although there is evidence that the application of MWM eases symptoms and enhances functional ability, the mechanism underlying its efficacy remains unclear.

The presence of a midrange painful arc (60–120°) during shoulder elevation in the scapular plane is sometimes associated with subacromial impingement syndrome (SIS) (Calis et al., 2000; Park et al., 2005). In patients with SIS, increased displacement of the center of the humeral head (CHH) in the anterior and superior directions is found during glenohumeral abduction (Deutsch et al., 1996; Paletta et al., 1997; Ludewig and Cook, 2000; Ludewig and Cook, 2002). We hypothesize that the positional faults described by Mulligan may represent these characteristics and that the application of a posteriorly directed force with a MWM technique may effectively alter joint arthrokinematics. The translation of the CHH during a MWM technique has not yet been reported. Therefore, the purpose of this study was to compare the displacement of the CHH, in a cadaveric glenohumeral joint model, during an experimental abduction simulation with and without the application of a MWM maneuver in an anteroposterior direction.

2. Methods

2.1. Subjects and specimen preparation

Ten physiotherapists with orthopedic and manual therapy experience ranging from 3 to 24 years (mean 5.7 ± 6.5 years) participated in this study. To standardize the technique used in the present study, statements describing the abduction only and the MWM procedures were given to the subjects before testing. The protocol of this study was approved by the Institutional Review Board of the National Cheng Kung University Hospital.

A fresh cadaver right shoulder specimen from a 57-year-old male was used for testing. The procedures used for the preparation of the specimen were essentially the same as those of Hsu et al. (2002a,b,c). The differences were that the specimen was set up in an upright position and forces were applied to simulate muscle tone in the rotator cuff. As the rotator cuff provides dynamic stability to the glenohumeral joint (Blasier et al., 1997), deficiency of the whole rotator cuff leads to a significant superior or anterior translation of the humeral head during elevation (Sharkey and Marder, 1995). To simulate muscle tone, a tensile force of 22.2 N was equally distributed to the tendons of supraspinatus, subscapularis, and the infraspinatus–teres minor complex (Fig. 1) (Panjabi, 1979). The lines of action for each of these muscles in relation to the spine of the scapula were 51° for the infraspinatus–teres minor complex, 58° for the subscapularis, and 8° for the supraspinatus (Apreleva et al., 1998). In our experiment the glenohumeral abduction/adduction movements occurred in the plane of the scapula.

2.2. Instrumentation

A 6-axis load cell (MC3A, Advanced Mechanical Technology, Inc. (AMTI), Massachusetts, U.S.A.) with maximum capacities of 445 N (Fz) and 11 Nm (Mx, My) was used to measure forces applied to the scapulohumeral complex. The abduction only and the MWM maneuvers were simulated with the cadaver specimen in the upright orientation. The scapular block was rigidly fixed on the top plate of the load cell with the plane of the scapula perpendicular to the top plate and the anterior aspect of the scapula facing anteriorly. The medial border of the scapula was oriented parallel to the z-axis of the load cell. Force data registered by the AMTI load cell were recorded by an instruNet data acquisition system. A triad with three retroreflective markers was drilled into the distal end of the humerus and defined the humerus local coordinate system (Fig. 1). The 6-camera VICON 370 Motion Analysis System (Vicon Motion Systems Limited, Oxford, UK) was utilized to record the kinematic data throughout the experimental procedures. A biaxial material testing system (MTS) (858 Mini Bionix, MTS System Corp., Eden Prairie, Minnesota, U.S.A.) equipped with an x–y table was used to test the mobility of the specimen before and after the abduction only and the MWM procedures.

Fig. 1. Experimental setup employed. (A) The triad of retroreflective markers, (B) the glenohumeral specimen, (C) the top plate, (D) the AMTI 6-axis load cell, (E) the L plate, (F) the applied weight on supraspinatus, (G) the applied weight on the infraspinatus–teres minor, (H) the applied weight on the subscapularis, (I) the instruNet data acquisition box, and (J) the laptop computer.

2.3. Method of measuring the CHH

The method used to calculate CHH incorporated the least squares method developed by Gamage and Lasenby (2002). To locate the reference CHH, the humeral head was pressed into the glenoid socket, and its position was derived from the trajectories of the triad markers as the humerus was moved from the neutral position to flexion/extension, abduction/adduction, and circumduction in various planes of elevation through ranges of motion (ROMs) of less than 45°. Smaller errors (less than 0.05 mm) and more stable joint centers were reported with this method (Silaghi et al., 1998; Halvorsen et al., 1999; Gamage and Lasenby, 2002).
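The idea behind locating a joint center from marker trajectories can be illustrated with a much simpler stand-in than the method cited above: if the humerus pivots about a fixed center, each marker stays on a sphere around that center, so a least-squares sphere fit recovers it. The sketch below is a plain algebraic sphere fit for a single marker, not the Gamage and Lasenby (2002) solution itself. The function names, the made-up center and radius, and the noise-free synthetic marker positions are all assumptions for illustration.

```python
import math


def solve_linear(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - known) / aug[r][r]
    return x


def fit_sphere(points):
    """Least-squares sphere through 3D points; returns (centre, radius).

    Uses |p|^2 = 2 p.c + (r^2 - |c|^2), which is linear in the centre c
    and in d = r^2 - |c|^2, and solves the 4x4 normal equations.
    """
    n = len(points)
    # Shift to the centroid first; this keeps the linear system well conditioned.
    mean = [sum(p[k] for p in points) / n for k in range(3)]
    shifted = [(x - mean[0], y - mean[1], z - mean[2]) for x, y, z in points]
    rows = [[2.0 * x, 2.0 * y, 2.0 * z, 1.0] for x, y, z in shifted]
    rhs = [x * x + y * y + z * z for x, y, z in shifted]
    ata = [[sum(row[i] * row[j] for row in rows) for j in range(4)] for i in range(4)]
    atb = [sum(row[i] * value for row, value in zip(rows, rhs)) for i in range(4)]
    cx, cy, cz, d = solve_linear(ata, atb)
    radius = math.sqrt(d + cx * cx + cy * cy + cz * cz)
    return (cx + mean[0], cy + mean[1], cz + mean[2]), radius


def demo_marker_positions(centre, radius):
    """Hypothetical noise-free marker positions on small arcs (< 45 degrees)."""
    points = []
    for tilt_deg in range(5, 45, 5):         # elevation away from neutral
        for plane_deg in range(0, 360, 45):  # plane of elevation
            tilt, plane = math.radians(tilt_deg), math.radians(plane_deg)
            points.append((
                centre[0] + radius * math.sin(tilt) * math.cos(plane),
                centre[1] + radius * math.sin(tilt) * math.sin(plane),
                centre[2] - radius * math.cos(tilt),
            ))
    return points


true_centre = (10.0, -5.0, 20.0)  # made-up centre, in mm
marker_points = demo_marker_positions(true_centre, radius=250.0)
estimated_centre, estimated_radius = fit_sphere(marker_points)
print("estimated centre (mm):", [round(v, 3) for v in estimated_centre])
print("estimated radius (mm):", round(estimated_radius, 3))
```

With real motion-capture data the marker positions carry noise, and a fit over arcs shorter than 45° is then more sensitive to that noise than the clean example suggests.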


2.4. Experimental procedures

Small arc motions of the glenohumeral joint described previously were used to define the CHH. Once the CHH was measured the following procedures were conducted:

1. Five repetitions of abduction only procedures were performed by the therapist.
2. The therapist performed five repetitions of anteroposterior glide on the head of the humerus throughout the range of available abduction. The therapist simultaneously passively abducted the arm.

Flexion and rotation were not restrained when the therapist was applying manual abduction in both the abduction only and the MWM trials. The abduction only and the MWM trials were repeated 30 min later in order to calculate the intersession intraclass correlation coefficient, ICC(2,1). The peak joint angles and the maximal applied forces registered in five successive repetitions were used to calculate the intrasession reliability. Average values of joint angles and forces between the two sessions were used to test the intersession reliability.

Since only one glenohumeral joint specimen was used in the present study, we believed that it was necessary to monitor changes in the mobility of the glenohumeral joint before and after all the experimental maneuvers were completed. The MTS system rather than a therapist was employed for mobility testing, as the study conducted by Hsu et al. (2002c) showed poor therapist test–retest reliability. The MTS system setup was designed according to the studies presented by Hsu et al. (2002a,c). Marks were made on the clamp, the scapular block and the base plate of the MTS to make certain that the positions and the orientations of the scapula and the humerus were exactly the same for the pre- and the post-tests. We removed the simulated tensile forces on the rotator cuff tendons during the MTS testing procedures.

In our study, the MTS testing procedures were conducted in the following sequence: (1) anterior–posterior (AP) gliding, (2) posterior–anterior (PA) gliding, (3) internal rotation (IR) and external rotation (ER), (4) inferior gliding, and (5) abduction. These procedures were performed once each in the neutral position, the resting position (40° of abduction) and the end-range glenohumeral abduction. At the end of the experiment, we dissected the shoulder specimen to rule out pathological changes such as osteoarthritis, capsular abnormalities, and rotator cuff tear in the specimen. The specimen showed none of these pathological changes.

2.5. Data analysis

Outcome measures obtained from the abduction only and the MWM procedures were the applied forces, ROMs (abduction/adduction, flexion/extension, and internal/external rotation), and displacements of CHH (AP, superior–inferior, and medial–lateral directions). Mean values of the five cycles were used for calculation of the results. We compared the differences of spatial and temporal parameters between the abduction only trial and the MWM trial. The spatial parameter was the peak displacement of CHH during abduction, while the temporal parameter was the time spent in the abduction cycle where peak displacements occurred. Because the number of physiotherapists who participated in our study was relatively small, nonparametric statistical analysis was used. The null hypothesis (H0) was that there were no differences between the MWM trial and the abduction only trial in the displacement magnitude and the displacement pattern of CHH. The spatial parameters, temporal parameters, and ROMs in the abduction only trial and the MWM trial were tested by the Wilcoxon signed ranks test. The α level was set at 0.05. ICC(2,1) values were used to compute the intrasession and the intersession reliability. The Statistical Package for the Social Science (SPSS for Windows release 11.0, SPSS Inc., Chicago, U.S.A.) was used for all statistical analyses.
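The paired comparison described in the data analysis can be sketched as a Wilcoxon signed-rank test on ten paired observations, one per therapist. The sketch below computes the rank statistic and an exact two-sided p-value by enumerating all sign patterns, which is feasible for n = 10. Whether SPSS 11.0 reported an exact or an asymptotic p-value is not stated in the paper, so the exact enumeration is an assumption, and the per-therapist displacement values are invented purely for illustration.

```python
from itertools import product


def wilcoxon_signed_rank(first, second):
    """Return (W, exact two-sided p) for paired samples.

    W is the smaller of the positive and negative rank sums.
    Zero differences are dropped; tied absolute differences share
    their average rank.
    """
    diffs = [b - a for a, b in zip(first, second) if b != a]
    n = len(diffs)
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    w = min(w_plus, w_minus)
    # Under H0 every sign pattern is equally likely: count the patterns
    # whose statistic is at least as extreme as the observed one.
    total = sum(ranks)
    extreme = 0
    for signs in product((0, 1), repeat=n):
        s = sum(r for r, keep in zip(ranks, signs) if keep)
        if min(s, total - s) <= w:
            extreme += 1
    return w, extreme / 2 ** n


# Hypothetical peak posterior displacements (mm), one pair per therapist.
abduction_only_mm = [0.6, 0.9, 1.4, 0.5, 1.1, 0.8, 1.3, 0.7, 1.0, 1.2]
mwm_mm = [6.1, 9.8, 12.0, 5.2, 8.3, 7.7, 11.5, 4.9, 10.4, 9.0]

w_stat, p_value = wilcoxon_signed_rank(abduction_only_mm, mwm_mm)
print("W =", w_stat, " exact two-sided p =", p_value)
print("reject H0 at alpha = 0.05:", p_value < 0.05)
```

When all ten differences share the same sign, as in this invented data set, only two of the 1024 sign patterns are as extreme as the observed one, which is the smallest p-value the test can return for n = 10.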

3. Results

3.1. Reliability

The standard deviations of the differences in coordinates between the estimated reference CHH and the CHH computed in each frame during small arc movement were 0.56 mm in the AP direction (x-axis), 0.19 mm in the medial–lateral direction (y-axis), and 0.30 mm in the superior–inferior direction (z-axis), respectively. We believe that the deviations are small and the reference CHH is defined with excellent precision in our study. The values of ICC for the intra- and intersession reliability of the abduction only trial and the MWM trial are all excellent (Table 1).

Table 1. The ICCs of intra- and intersession reliability for the abduction only trial and the MWM trial.

                    Abduction only trial                    MWM trial
                    Intrasession ICC    Intersession ICC    Intrasession ICC    Intersession ICC
Abduction angle     0.99                0.86                0.98                0.94
Flexion angle       0.96                0.81                0.96                0.94
ER angle            0.97                0.79                0.95                0.78
Force x, y, z       0.86, 0.88, 0.92    0.76, 0.84, 0.85    0.95, 0.98, 0.88    0.97, 0.93, 0.88

3.2. Translation of the CHH

The maximum abduction ROM was 85.22 ± 7.25° in the abduction only trial, and was accompanied by 12.98 ± 5.52° of flexion and 32.84 ± 5.89° of ER. In the MWM trial, the maximum abduction ROM was 83.71 ± 9.62°, accompanied by 13.51 ± 7.59° of flexion and 37.91 ± 8.93° of ER. The abduction and flexion ROMs did not differ between the two trials, while the ER ROM increased significantly (p < 0.05). In the MWM trial, the therapists applied an average of 37.2 ± 15.3 N of posteriorly directed force.

Fig. 2 shows the trajectory of the CHH during 0–70° of abduction angle measured from the abduction only and the MWM trials. During the first 20° of abduction the CHH migrated minimally (0.03 mm); thereafter the CHH migrated about 0.12 mm posteriorly, 0.12 mm laterally, and 0.04 mm inferiorly per 10° of abduction angle. In the MWM maneuvers, the initial posterior force displaced the CHH 2.70 ± 2.21 mm, 0.08 ± 0.98 mm and 0.58 ± 0.97 mm in the posterior, lateral and inferior directions, respectively. The CHH continued to move in the same directions until maximal displacement of 8.57 ± 2.89 mm, 1.24 ± 1.08 mm, and 3.44 ± 1.15 mm was achieved posteriorly, laterally, and inferiorly at 51.54 ± 16.94°, 44.51 ± 26.70°, and 40.56 ± 13.77° of abduction, respectively. After that the CHH migrated anteriorly, medially and superiorly until 70° of abduction, where the CHH was located at 6.87 ± 2.58 mm posteriorly, 0.57 ± 1.10 mm laterally and 1.55 ± 1.26 mm inferiorly to the glenoid center.

The magnitudes of CHH displacement during the abduction only trial and the MWM trial were significantly different in the AP and superior–inferior directions throughout the range tested (p < 0.05). In the medial–lateral direction, the MWM trial was also different from the abduction only trial (p < 0.05) except at 0°, 60°, and 70° of abduction.

The peak displacements of the CHH in the posterior, lateral and inferior directions, and their corresponding angles of abduction, are presented in Table 2. The magnitudes of peak translation of the MWM trials were significantly different from those of the abduction only trials in the posterior, inferior and lateral directions (p < 0.05). In the MWM trials, peak displacements occurred at smaller angles in the posterior and lateral directions (p < 0.05). There was no difference in the abduction position where peak displacement occurred between the abduction only trial and the MWM trial in the inferior direction (p = 0.059).

Fig. 2. The trajectory of the center of humeral head during 0–70° of abduction angle measured from the abduction only trial and the MWM trial (error bars indicate 1 standard deviation). (a) AP direction, (b) medial–lateral direction, and (c) superior–inferior direction. There is a difference between the abduction only trial and the MWM trial with p < 0.05 except for 0°, 60°, and 70° in the medial–lateral direction. (Horizontal axis: abduction angle in degrees; vertical axis: center of humeral head position in mm.)

Table 2. The mean peak displacement values of the head of humerus (CHH) and their corresponding angles of abduction during the abduction only and the MWM trials (N = 10).

                                                                      Abduction only trial    MWM trial
Peak posterior translation (mm)                                       0.93 ± 0.40             8.57 ± 2.89*
Abduction angle where peak posterior translation occurred (degree)    77.47 ± 9.35            51.54 ± 16.94*
Peak lateral translation (mm)                                         0.65 ± 0.84             1.24 ± 1.08*
Abduction angle where peak lateral translation occurred (degree)      57.56 ± 30.05           44.51 ± 26.70*
Peak inferior translation (mm)                                        0.66 ± 0.59             3.44 ± 1.15*
Abduction angle where peak inferior translation occurred (degree)     54.21 ± 30.16           40.56 ± 13.77

Values are presented as mean ± SD. *Significant differences are obtained for the variables between abduction only trial and MWM trial, p < 0.05.

3.3. The MTS data

The mobility data measured from the MTS procedures are presented in Table 3. Except for the AP + PA gliding at the resting position, all other parameters increased in the second measurement. Considerable changes were found in the inferior direction at the neutral position (6.6 mm or 95.7% increase) and the resting position (3 mm or 34.9% increase). The maximal values of mobility changes in the AP direction and the joint angle differences were less than 11.5% (1.5 mm increase at the end-range position and 8.4° increase of abduction angle at the resting position, respectively).
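The percentage changes quoted for the MTS data follow from the before and after values reported in Table 3. As a quick arithmetic check, the sketch below recomputes them; the helper name `percent_change` and the dictionary labels are illustrative choices, while the numbers are the Table 3 values.

```python
def percent_change(before, after):
    """Relative change from the first to the second measurement, in percent."""
    return (after - before) / before * 100.0


# (before, after) pairs taken from Table 3.
mts_pairs = {
    "inferior gliding, neutral position (mm)": (6.9, 13.5),
    "inferior gliding, resting position (mm)": (8.6, 11.6),
    "AP + PA gliding, end position (mm)": (16.2, 17.7),
    "abduction angle, resting position (degree)": (72.9, 81.3),
}

for label, (before, after) in mts_pairs.items():
    change = percent_change(before, after)
    print(f"{label}: {after - before:+.1f} ({change:+.1f}%)")
```

The abduction angle change at the resting position comes out at roughly 11.5%, so the "less than 11.5%" in the text appears to be a rounded bound.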

Table 3. The MTS data measured before and after the abduction only and the MWM procedures.

                              Neutral position      Resting position (40° abduction)    End position
                              Before    After       Before    After                     Before    After
AP + PA gliding (mm)          27.6      28.9        36.9      35.0                      16.2      17.7
IR + ER angle (degree)        85.7      85.8        113.8     115.9                     83.1      86.9
Inferior gliding (mm)         6.9       13.5        8.6       11.6                      2.5       2.5
Abduction angle (a) (degree)  74.3      81.4        72.9      81.3                      77.9      82.9

AP, anterior–posterior; PA, posterior–anterior; IR, internal rotation; ER, external rotation. (a) The maximal abduction angles were measured after the mobility tests in each position.

4. Discussion

The results of the present study indicate that the magnitudes and the patterns of displacement of the CHH were different between the MWM trials and the abduction only trials. Therefore, the null hypothesis (H0) in our study was rejected. In a fresh cadaver specimen with rigid immobilization of the scapula and fixation of the triplet markers to the humerus we procured an accurate and reliable full range trajectory of the migration of the CHH throughout the abduction only and the MWM maneuvers. The present study is the first to describe quantitatively the kinematic and kinetic characteristics of the MWM techniques in a fresh cadaveric specimen.

Compared with the abduction only trials, there were increases in displacements of the CHH in the posterior, inferior, and lateral directions in the MWM trial. The midrange painful arc usually occurs between 60° and 120° of arm abduction, which corresponds to between 40° and 80° of glenohumeral abduction, as the reported patterns of scapulohumeral rhythm exhibit approximately a 2:1 ratio between the ranges of the humeral and the scapular rotation (Freedman and Munro, 1966; Doody et al., 1970; Poppen and Walker, 1976). During the MWM procedures the peak posterior, inferior and lateral displacements of the CHH occurred at 51.5°, 40.6°, and 44.5° of glenohumeral abduction, respectively. Therefore, the MWM techniques may be able to prevent the humeral head from excessive translations in the anterior, superior, and medial directions within the range where the midrange painful arc frequently occurs.

In previous in vivo studies investigators reported that the CHH translated superiorly 1.5–2 mm during 30–60° of abduction (Graichen et al., 2000; Ludewig and Cook, 2002). After 60° of abduction, the CHH migrated inferiorly. An anterior translation of 1–3 mm was found during 30–60° of abduction. Thereafter the CHH translated posteriorly. These prior in vivo studies demonstrated patterns of CHH translation that were inconsistent with that of the abduction only trial obtained in the current study.
Such variations might have resulted from different levels and patterns of muscle activation between the current simulated cadaveric model and the in vivo conditions (Kronberg et al., 1990). Comparisons of the CHH translation of the abduction only trial in the current study and those of the previous in vitro studies are difficult, as migrations of the CHH during glenohumeral abduction in previous cadaveric studies were rather controversial. Thompson et al. (1996) reported that the magnitude of the CHH translation in three principal directions was less than 2 mm, whereas Apreleva et al. (1998) found that there was a 2 mm posterior translation from 0–40° of abduction and an anterior translation after 40° of abduction. Wuelker et al. (1994) showed that within 30–90° of abduction the CHH translated an average of 0.95 mm superiorly and 0.55 mm anteriorly per 10° of elevation. The inconsistency between the findings of the present study and those of previous research might have resulted from differences in the simulated cadaveric models employed in various studies, especially regarding the presence of the simulated rotator cuff forces and their magnitudes, directions and patterns of application (Wuelker et al., 1994; Thompson et al., 1996; Apreleva et al., 1998).

Results of the MTS procedures suggest that the mobility of the glenohumeral specimen was altered after the abduction only and the MWM procedures. There were substantial changes in the inferior gliding at the neutral and the resting positions. We speculate that the posterior capsule, especially the posterior portion of the inferior glenohumeral ligament, might be specifically lengthened after the MWM procedures. The posterior portion of the inferior glenohumeral ligament not only serves as the primary constraint at both the neutral and the end-range positions to the posteriorly directed load (Debski et al., 1999; Brenneke et al., 2000), but also acts as the main constraint at the resting and the neutral positions during inferior gliding (Brenneke et al., 2000). Therefore, when a posterior load was applied during the MWM maneuver, these primary restraints were stretched, leading to an increase in the laxity of the posterior portion of the inferior glenohumeral ligament and the increase of the inferior mobility at the neutral and the resting positions seen in the present study.

There were several major limitations to the present study. Firstly, since only one cadaver specimen was used for all the experimental maneuvers and there were significant changes in the mobility of the joint capsule at the end of the experimental procedures, it is conceivable that the therapists might have responded to the changes in the mobility of the specimen by altering the force applied during the MWM maneuvers, thereby adding to the overall variability of the outcome measurements. Secondly, the glenohumeral MWM techniques applied to patients include two distinct components: a posterior mobilization force applied by the therapist and an active arm elevation performed by the patient.
In our study, the active elevation was substituted with a passive elevation by the therapist, which might have influenced the magnitude and pattern of the posteriorly directed manual force, the CHH translation, and the maximum abduction range achieved during the MWM maneuvers. During active abduction of the shoulder the reported values of translation of the CHH were rather small, usually in the order of 1–2 mm (Graichen et al., 2000; Ludewig and Cook, 2002). The capability of a surface marker based motion analysis system in estimating the CHH in vivo with adequate accuracy and reliability without invasive procedures (i.e. surgical fixation of bone pins to the scapula and the humerus) is so far uncertain. Therefore, we opted to substitute an in vivo active abduction procedure with the in vitro passive abduction paradigm so that the scapula could be rigidly fixed and the CHH accurately and reliably estimated. In addition, even though the alternative hypothesis in the present study has been accepted, caution should be taken when generalizing the current results to individuals with SIS. Much more research work is needed to clarify how the MWM maneuver affects the CHH translation in shoulder specimens with simulated SIS, and how the MWM maneuver works in vivo with and without SIS.
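The conversion used in the discussion, from the painful arc in arm abduction to the corresponding range of glenohumeral abduction, can be written out explicitly. The block below is a worked restatement under the approximate 2:1 glenohumeral-to-scapular ratio cited in the text; the symbols θ_arm and θ_GH are introduced here for illustration only.

```latex
% Assumed split of arm abduction: about 2 parts glenohumeral to 1 part scapular.
\[
\theta_{\mathrm{GH}} \approx \tfrac{2}{3}\,\theta_{\mathrm{arm}},
\qquad
\theta_{\mathrm{arm}} \in [60^\circ,\,120^\circ]
\;\Longrightarrow\;
\theta_{\mathrm{GH}} \in [40^\circ,\,80^\circ]
\]
% Reported abduction angles at peak CHH displacement in the MWM trial:
\[
40^\circ \le 40.6^\circ\ (\text{inferior})
< 44.5^\circ\ (\text{lateral})
< 51.5^\circ\ (\text{posterior})
\le 80^\circ
\]
```

All three reported peak angles fall inside the 40–80° interval, which is the basis for the statement that the peaks occur within the range of the midrange painful arc.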

5. Conclusion

Compared with the abduction only trials, the MWM of the glenohumeral joint resulted in greater migration of the CHH posteriorly, inferiorly, and laterally. The peak displacements of the CHH in these directions occur within the range where the midrange painful arc frequently takes place. We have shown that an anteroposterior MWM technique during passive abduction was effective in changing the kinematic characteristics of the glenohumeral joint in a cadaver. This hypothesis, however, would need to be tested in vivo with abduction performed actively.

Acknowledgements

The authors thank Dr. Jia-Hao Chang for his invaluable technical assistance and MTS system operation, and Ms. Jing-Fang Chiu for her assistance in experimental preparation and data collection. Part of the results was reported at the 49th Annual Meeting of the Physical Therapy Association of ROC (Taiwan), September 19, 2004, Taipei, Taiwan.

References

Abbott JH, Patla CE, Jensen RH. The initial effects of an elbow mobilization with movement technique on grip strength in subjects with lateral epicondylalgia. Manual Therapy 2001;6(3):163–9.
Apreleva M, Hasselman CT, Debski RE, Fu FH, Woo SL, Warner JJ. A dynamic analysis of glenohumeral motion after simulated capsulolabral injury. Journal of Bone and Joint Surgery 1998;80A(4):474–80.
Backstrom KM. Mobilization with movement as an adjunct intervention in a patient with complicated De Quervain’s tenosynovitis: a case report. Journal of Orthopaedic and Sports Physical Therapy 2002;32(3):86–94.
Blasier RB, Soslowsky LJ, Malicky DM, Palmer ML, Arbor A. Posterior glenohumeral subluxation: active and passive stabilization in a biomechanical model. Journal of Bone and Joint Surgery 1997;79A(3):433–40.
Brenneke SL, Reid J, Ching RP, Wheeler DL. Glenohumeral kinematics and capsule–ligamentous strain resulting from laxity exams. Clinical Biomechanics 2000;15(10):735–42.
Calis M, Akgun K, Birtane M, Karacan I, Calis H, Tuzun F. Diagnostic values of clinical diagnostic tests in subacromial impingement syndrome. Annals of the Rheumatic Diseases 2000;59(1):44–7.
Debski RE, Wong EK, Woo SL, Sakane M, Fu FH, Warner JJ. In situ force distribution in the glenohumeral joint capsule during anterior–posterior loading. Journal of Orthopaedic Research 1999;17(5):769–75.
Deutsch A, Altchek DW, Shwartz E, Otis JC, Warren RF. Radiological measurement of superior displacement of the humeral head in the impingement syndrome. Journal of Shoulder and Elbow Surgery 1996;5(3):186–93.
Doody SG, Freedman L, Waterland JC. Shoulder movements during abduction in the scapular plane. Archives of Physical Medicine and Rehabilitation 1970;51:595–604.
Freedman L, Munro RR. Abduction of the arm in the scapular plane: scapular and glenohumeral movements. A roentgenographic study. Journal of Bone and Joint Surgery 1966;48A:1503–10.
Gamage SS, Lasenby J. New least squares solutions for estimating the average centre of rotation and the axis of rotation. Journal of Biomechanics 2002;35(1):87–93.
Graichen H, Stammberger T, Bonel H, Englmeier Karl-Hans, Reiser M, Eckstein F. Glenohumeral translation during active and passive elevation of the shoulder – a 3D open-MRI study. Journal of Biomechanics 2000;33(5):609–13.
Halvorsen K, Lesser M, Lundberg A. A new method for estimating the axis of rotation and the center of rotation. Journal of Biomechanics 1999;32(11):1221–7.
Hsu AT, Chang JH, Chang CH. Determining the resting position of the glenohumeral joint: a cadaver study. Journal of Orthopaedic and Sports Physical Therapy 2002a;32(12):605–12.
Hsu AT, Hedman T, Chang JH, Vo C, Ho L, Ho S, Chang GL. Changes in abduction and rotation range of motion in response to simulated dorsal and ventral translational mobilization of the glenohumeral joint. Physical Therapy 2002b;82(6):544–56.
Hsu AT, Ho L, Chang JH, Chang GL, Hedman T. Characterization of tissue resistance during a dorsally directed translational mobilization of the glenohumeral joint. Archives of Physical Medicine and Rehabilitation 2002c;83(3):360–6.
Konstantinou K, Foster N, Rushton A, Baxter D. The use and reported effects of mobilization with movement techniques in low back pain management; a cross-sectional descriptive survey of physiotherapists in Britain. Manual Therapy 2002;7(4):206–14.
Kronberg M, Nemeth G, Brostrom LA. Muscle activity and coordination in the normal shoulder. Clinical Orthopaedics and Related Research 1990;257:76–85.
Ludewig PM, Cook TM. Translation of the humerus in persons with shoulder impingement syndromes. Journal of Orthopaedic and Sports Physical Therapy 2002;32(6):248–59.
Ludewig PM, Cook TM. Alterations in shoulder kinematics and associated muscle activity in people with symptoms of shoulder impingement. Physical Therapy 2000;80(3):276–91.
Mulligan BR. Manual therapy: “Nags”, “Snags”, “MWMS” etc. 4th ed. Wellington, New Zealand: Plane View Services Ltd; 1999. p. 87–103.
Paletta Jr GA, Warner JJ, Warren RF, Deutsch A, Altchek DW. Shoulder kinematics with two-plane X-ray evaluation in patients with anterior instability or rotator cuff tearing. Journal of Shoulder and Elbow Surgery 1997;6(6):516–27.
Panjabi MM. Center and angles of rotation of body joints: a study of errors and optimization. Journal of Biomechanics 1979;12(12):911–20.
Park HB, Yokota A, Gill H, El Rassi G, McFarland E. Diagnostic accuracy of clinical tests for the different degrees of subacromial impingement syndrome. Journal of Bone and Joint Surgery 2005;87(7):1446–55.
Paungmali A, O’Leary S, Souvlis T, Vicenzino B. Hypoalgesic and sympathoexcitatory effects of mobilization with movement for lateral epicondylalgia. Physical Therapy 2003;83(4):374–83.
Poppen NK, Walker PS. Normal and abnormal motion of the shoulder. Journal of Bone and Joint Surgery 1976;58A:195–201.
Sharkey NA, Marder RA. The rotator cuff opposes superior translation of the humeral head. American Journal of Sports Medicine 1995;23(3):270–5.
Silaghi M, Plaenkers R, Boulic R, Fua P, Thalmann D. Local and global skeleton fitting techniques for optical motion capture. In: Magnenat-Thalmann N, Thalmann D, editors. Modeling and motion capture techniques for virtual environments, lecture notes in artificial intelligence, vol. 1537. Berlin: Springer; 1998. p. 26–40.
Thompson WO, Debski RE, Boardman ND, Taskiran E, Warner JJ, Fu FH, Woo SL. A biomechanical analysis of rotator cuff deficiency in a cadaveric model. American Journal of Sports Medicine 1996;24(3):286–93.
Wuelker N, Schmotzer H, Thren K, Korell M. Translation of the glenohumeral joint with simulated active elevation. Clinical Orthopaedics and Related Research 1994;309:193–200.


Manual Therapy 14 (2009) 167–172 www.elsevier.com/math

Original Article

Transient effects of stretching exercises on gait parameters of elderly women

André L.F. Rodacki a,*, Ricardo M. Souza a, Carlos Ugrinowitsch b,1, Fabiano Cristopoliski a, Neil E. Fowler c,2

a Universidade Federal do Paraná, Setor de Ciências Biológicas, Departamento de Educação Física, Centro de Estudos do Comportamento Motor. R. Coração de Maria, 92, BR116, Km 95, Jardim Botânico, Curitiba, Paraná, Brazil
b Universidade de São Paulo, Escola de Educação Física e Esportes, Av. Mello Moraes, 65, Cidade Universitária, Butantã, São Paulo, São Paulo, Brazil
c The Manchester Metropolitan University, Department of Exercise and Sport Sciences, Centre for Biophysical and Clinical Research into Human Movement, Hassall Road, Alsager, Stoke-on-Trent, England ST7 2HL, United Kingdom

Received 2 May 2007; received in revised form 18 December 2007; accepted 6 January 2008

Abstract

This study aimed to analyse the effects of a single stretching exercise session on a number of gait parameters in elderly participants in an attempt to determine whether these exercises can influence the risk of fall. Fifteen healthy women living in the community volunteered to participate in the study. A kinematic gait analysis was performed immediately before and after a session of static stretching exercises applied on hip flexor/extensor muscles. Results showed a significant influence of stretching exercises on a number of gait parameters, which have previously been proposed as fall predictors. Participants showed increased gait velocity, greater step length and reduced double support time during stance after performing stretching exercises, suggesting improved stability and mobility. Changes around the pelvis (increased anterior–posterior tilt and rotation range of motion) resulting from the stretching exercises were suggested to influence the gait parameters (velocity, step length and double support time). Therefore, stretching exercises were shown to be a promising strategy to facilitate changes in gait parameters related to the risk of fall. Some other gait variables related to the risk of fall remained unaltered (e.g., toe clearance). The stable pattern of segmental angular velocities was proposed to explain the stability of these unchanged gait variables. The results indicate that stretching exercises, performed on a regular (daily) basis, result in gait adaptations which can be considered as indicative of reduced fall risk. Other studies to determine whether regular stretching routines are an effective strategy to reduce the risk of fall are required.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Risk of fall; Gait; Stretching exercises

* Corresponding author. Tel.: +55 41 3360 4333; fax: +55 41 3360 4336. E-mail address: [email protected] (A.L.F. Rodacki).
1 Tel.: +55 11 3091 2143.
2 Tel.: +44 0161 247 5491; fax: +44 0161 247 6375.
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.006

1. Introduction

Trauma resulting from falls in the elderly is one of the most significant causes of injury and death (Blake et al., 1988; Cameron and Quine, 1994), with annual costs estimated at $10 billion (Campbell et al., 1989). Although fewer than 2% of falls among the elderly result in a hip fracture, more than 90% of hip fractures occur as a consequence of a fall. In addition, fall injuries in the elderly usually demand longer hospitalization periods and may lead to seriously impaired mobility and a marked decline in functional ability after recovery (Cameron and Quine, 1994), which may result in social isolation, loss of independence and need of

A.L.F. Rodacki et al. / Manual Therapy 14 (2009) 167–172

care assistance (Andersson and Schultz, 1979; Cameron and Quine, 1994). The largest number of hip fractures results from a fall occurring during locomotion, in which extrinsic and intrinsic factors play a role. Extrinsic factors are associated with environmental hazards such as slippery surfaces, while intrinsic factors are individual-related. Intrinsic factors have been identified as the best fall predictors among elders (Honeycutt and Ramsey, 2002) and include physiological and medical problems, medication and alcohol use. Muscle weakness, as a result of a natural decrease in muscle mass with aging, has been considered a major cause of falls (Cummings et al., 1990). In general, falls have been strongly associated with decreased physical activity and impaired mobility measurements (body sway and gait). Reduced range of motion, as a consequence of stiffness of the muscle-tendon unit and surrounding connective tissue, has been reported to be positively related to fall incidence (Guimarães and Farinatti, 2005). Other investigations have indicated that reduced range of motion, specifically about the hip and the ankle joints, constitutes one of the main causes of fall due to the influence that hip rigidity has over the lower limb dynamics during walking (Rose and Gamble, 2006). Kerrigan et al. (2001) proposed that a reduction in hip joint mobility is one of the most important age-related factors influencing the walking pattern. It has been shown that peak hip extension during walking is consistently lower in both elderly fallers and non-fallers than in young adults, irrespective of the walking speed. As peak hip extension is influenced by the tightness of the antagonistic muscles, specific hip flexor stretching exercises may be an attractive possibility to improve walking performance in the elderly and reduce the risk of falls (Kerrigan et al., 2001). Kerrigan et al. (2001) showed that fallers are characterized by exaggerated hip tightness. Kerrigan et al. (2003) showed a non-significant increase in peak hip extension during gait performance as a result of a 10 week unsupervised exercise programme. The failure to find a significant effect may be related to poor adherence to, and control of, the exercise practice. Indeed, King et al. (2002) have shown that controlled, centre-based exercises are more effective than exercises practised at home (unsupervised). It remains to be seen whether correctly performed stretching exercise affects gait parameters. If one considers that the outcome of long-term exercises is a cumulative response of successive training sessions, analyzing the transient effect produced by a single session may constitute an interesting alternative to understand the long lasting effects of stretching training programs. Indeed, stretching exercises have been shown to produce acute changes in joint range of motion (Taylor et al., 1990; McHugh et al., 1992; Halbertsma and Goeken, 1994; Willy et al., 2001). This acute effect

may provide additional amplitude at the hip joint and reverse some of the changes in gait pattern that characterise aging (e.g., by increasing step length). Therefore, the present study aimed to analyse the immediate effects of a session of static stretching exercises for the hip flexor muscle group on gait and on a number of parameters that have been related to the risk of fall in elderly participants.

2. Methods

Fifteen healthy women (age: 64.5 ± 3.2 years; height: 1.59 ± 0.09 m; body mass: 77.3 ± 8.2 kg) living in the community volunteered to participate. The study was approved by the University's ethics committee and all participants were informed of the inherent risks and benefits before signing an informed consent form. Participants with problems that could affect their ability to walk (e.g., lower limb surgery, low back pain, previous fractures, arthritis, etc.) were not included in the study. Male participants were also not included due to viscoelastic differences between genders (Kubo et al., 2002). An interview revealed that participants were able to perform their regular daily activities with no assistance but had not been involved in systematic physical activity programs during the six months that preceded the present study. Participants reported no fall history during the 12 months that preceded the experiment.

To determine the effect of one stretching session on the gait pattern, participants performed one experimental session. A kinematic gait analysis was conducted before (PRE) and immediately after (POS) one set of specific static stretching exercises for the hip flexor muscle group on both limbs. During the exercise, participants remained lying on their back with both lower limbs hanging from the edge of a padded surface. Stretching was applied by one of the experimenters by flexing one thigh towards the trunk at approximately 45° with respect to the horizontal, while a second experimenter moved the contralateral thigh downwards causing hip hyperextension. Then, the knee of the stretched leg was taken into flexion. The experimenter sustained, for 60 s, the position in which participants reported the first symptoms of muscle discomfort. Exercises were repeated four times on each leg, alternating between legs (240 s on each limb). Static exercises were applied because they have been shown to provide satisfactory results in groups of elderly individuals (Feland et al., 2001a; Ferber et al., 2002). Fig. 1 represents the stretching procedures.

Immediately after the stretching procedures, participants were requested to walk in the laboratory area to have their gait filmed. Participants were allowed to walk along the walkway a few times (4–6 trials) in an attempt to familiarise them with the protocol used in the experiment. The interval between the end of the

Fig. 1. Schematic representation of the stretching exercise. The non-stretched thigh was flexed towards the trunk segment at approximately 45° with respect to the horizontal (A) while the other limb (stretched limb) was passively forced downwards before the knee was passively forced into flexion (B).

stretching procedures and the initiation of the gait was less than 30 s. Walking was performed barefoot on level ground at the participant's freely chosen speed and filmed by three camcorders (JVC GR-AX 25; two placed on the right side and one on the left side of the participants) sampling at a frequency of 30 Hz. Images were recorded on VHS tape and transferred onto a personal computer for analysis (Pinnacle, LINX). Recorded images were processed and digitized using specific software (SIMI MOTION, 6.1). A common LED was set in the field of view of all cameras to synchronize them. Fig. 2 shows the setup of the data collection area. A number of markers (25 mm in diameter) were placed over the skin and clothes to represent the following landmarks on both sides of the body: (1) anterior superior iliac crest (ASIC), (2) the most prominent protuberance of the greater trochanter (TROC), (3) lateral femoral epicondyle (KNEE), (4) lateral malleolus (MALL) and (5) the fifth metatarsal joint (META). Although markers were placed on both sides of the body, only the right side was analysed. The connection between these points defined four rigid body segments, which are represented in Fig. 3. A three dimensional movement reconstruction was performed, from which two separate two dimensional analyses (sagittal and frontal planes) of the pelvis and of the lower limbs were performed. Unilateral analysis has been used in other studies (Kerrigan et al., 2001, 2003; Evans et al., 2003) and a symmetrical profile between segments in healthy individuals has been reported (Sadeghi et al., 2000). The angular convention is shown in Fig. 3.

Ten gait cycles were filmed for each participant in both experimental conditions (PRE and POS), from which the first three valid trials (i.e., trials in which all markers were visible) were selected for further analysis. Special attention was given to not interfering with the freely chosen walking velocity. The gait cycle was defined as the period between two consecutive heel contacts of the right foot. The three selected cycles were time-normalised with respect to the gait cycle (the first heel contact corresponded to 0% and the second heel contact to 100%) and ensemble averaged to represent each individual's movement pattern. Angular variables in the sagittal plane were normalised by subtracting the participant's angles obtained in their normal standing

Fig. 2. Data collection area schematic representation.

Fig. 3. Body landmarks and angular displacement conventions. Representation of the anatomical landmarks, body segments, joints and movement convention. (ASIC: anterior superior iliac crest (L = left; R = right); TROC: the most prominent protuberance of the greater trochanter; KNEE: lateral femoral epicondyle; MALL: lateral malleolus; META: the fifth metatarsal joint).

posture. The variables used to describe the gait in the present study are presented in Table 1.

It was not possible to include a control condition to estimate gait parameter variability. However, a group with equivalent physical characteristics (65.4 ± 2.9 years old, 1.61 ± 0.07 m and 74.3 ± 6.6 kg), which served as a control group in another experiment performed by our laboratory using the same gait assessment and a similar measurement protocol (unpublished data), showed no significant differences (p > 0.05) between gait parameters measured two months apart (within subjects, between testing days). Variability within subjects (between trials) also showed similar (p > 0.05) values for the control group and the pre-test of the experimental group (see Table 1, last column). On average, variability of the selected parameters between groups (control and PRE–POS) was similar to that observed in the experimental conditions. A more detailed analysis revealed that the variability found in the control group (mean variability = 11.8%) was comparable to that observed in the experimental group in both conditions (PRE ≈ 12.6% and POS = 12.3%). A one way ANOVA revealed no significant differences in terms of variability (p > 0.05).

Data normality was confirmed using the Kolmogorov–Smirnov test, which allowed a number of t-tests for dependent samples to be applied to the gait variables to determine significant differences between the two experimental conditions (PRE and POS). Statistical tests were performed in the Statistica® software package, version 5.5, and the significance level was set at p < 0.05. Bonferroni's correction was performed to adjust the significance level.
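The time normalisation and ensemble averaging described in the Methods can be illustrated with a minimal sketch. It is not the authors' SIMI MOTION processing: the function names, the linear interpolation and the 101-point grid are assumptions made here for illustration only. At 30 Hz a cycle of about 1.1 s contains roughly 33 frames, so each cycle is resampled onto a common 0–100% axis before averaging, and the standing-posture angle is then subtracted.

```python
from typing import List, Sequence


def normalise_cycle(samples: Sequence[float], n_points: int = 101) -> List[float]:
    """Resample one gait cycle (heel contact to next heel contact) onto 0-100 %.

    Linear interpolation is an assumption of this sketch; the paper does not
    state which interpolation method was used.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples per cycle")
    last = len(samples) - 1
    out = []
    for i in range(n_points):
        pos = i * last / (n_points - 1)  # fractional index into the raw samples
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def ensemble_average(cycles: Sequence[Sequence[float]],
                     standing_angle: float = 0.0,
                     n_points: int = 101) -> List[float]:
    """Average several time-normalised cycles point by point, then subtract
    the angle measured in the participant's normal standing posture."""
    normalised = [normalise_cycle(c, n_points) for c in cycles]
    return [sum(col) / len(col) - standing_angle for col in zip(*normalised)]


# Hypothetical two-sample "cycles", only to show the call pattern.
print(ensemble_average([[0.0, 10.0], [2.0, 12.0]], standing_angle=1.0, n_points=3))
```

Resampling every cycle to the same number of points is what makes point-by-point averaging meaningful when cycles differ slightly in duration.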

3. Results

The findings of the study are summarized in Table 1. These show a number of significant differences between the gait parameters before and after stretching. After stretching, participants achieved a 6.6% greater walking velocity through greater step length, with no change in cadence. The increase in step length was mainly achieved by virtue of greater motion about the pelvis, with increases in both anterior tilt and rotation in the transverse plane. The gait pattern also showed changes in the temporal pattern. The increased gait velocity after stretching was accompanied by a reduction in the stance time, a lower proportion of time in double support and a longer swing duration. These temporal changes are indicative of improved balance.

Table 1
Mean gait variables (± standard deviation) before (PRE) and after (POST) stretching exercises, the mean difference and variability within subjects' trials (Vwt).

Variable (unit)    PRE             POST            Difference (%)   Vwt
CYD (s)            1.10 ± 0.09     1.09 ± 0.09     −0.6             0.02
STD (%)            62.0 ± 2.1      60.1 ± 2.4      −3.1 (*)         0.03
SWD (%)            38.0 ± 2.1      39.9 ± 2.4      +5.1 (*)         0.12
DSD (s)            0.18 ± 0.01     0.17 ± 0.005    −5.6 (*)         0.06
CAD (step/min)     55.3 ± 4.8      55.7 ± 4.3      +0.7             0.02
SLE (m)            0.51 ± 0.07     0.54 ± 0.06     +5.8 (*)         0.16
CLE (m)            0.014 ± 0.01    0.017 ± 0.01    +16.2            0.49
SPE (m s⁻¹)        0.96 ± 0.16     1.02 ± 0.15     +6.6 (*)         0.03
HSV (m s⁻¹)        1.09 ± 0.3      1.17 ± 0.2      +6.9             0.18
PAM (°)            12.9 ± 2.6      15.7 ± 4.6      +21.9 (*)        0.12
PRO (°)            5.0 ± 1.5       6.5 ± 1.7       +28.4 (*)        0.23
HAM (°)            24.4 ± 3.1      25.8 ± 3.7      +5.7             0.15
KAM (°)            49.9 ± 3.3      50.2 ± 5.5      +0.6             0.05
AAM (°)            23.2 ± 3.6      24.9 ± 5.7      +7.3             0.06

Significant differences (p < 0.05) are marked (*). CYD: cycle duration; STD: stance phase duration; SWD: swing phase duration; DSD: double support phase duration; CAD: cadence; SLE: step length; CLE: toe clearance; SPE: gait speed; HSV: heel velocity at foot strike; PAM: pelvic anterior/posterior tilt amplitude; PRO: pelvic rotation; HAM: hip flexion/extension amplitude; KAM: knee flexion/extension amplitude; AAM: ankle dorsiflexion/extension amplitude; Vwt: variability within subjects' trials.
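The PRE versus POS comparisons reported in Table 1 rest on t-tests for dependent samples with a Bonferroni-adjusted significance level. The sketch below shows the arithmetic only. The data are hypothetical, the function names are invented for illustration, and the number of tests used for the correction (14, one per variable in Table 1) is an assumption, since the paper does not state it. Converting t to a p-value needs a t-distribution routine from a statistics package and is left out.

```python
import math
from statistics import mean, stdev
from typing import Sequence, Tuple


def paired_t(pre: Sequence[float], post: Sequence[float]) -> Tuple[float, int]:
    """Return the paired (dependent-samples) t statistic and its degrees of freedom."""
    if len(pre) != len(post) or len(pre) < 2:
        raise ValueError("pre and post must have the same length (at least 2)")
    diffs = [b - a for a, b in zip(pre, post)]
    sd = stdev(diffs)  # sample standard deviation of the differences
    if sd == 0:
        raise ValueError("all differences are identical; t is undefined")
    t = mean(diffs) / (sd / math.sqrt(len(diffs)))
    return t, len(diffs) - 1


def bonferroni_alpha(alpha: float, n_tests: int) -> float:
    """Per-test significance level after Bonferroni correction."""
    return alpha / n_tests


# Hypothetical step lengths (m) for four participants, NOT data from the study.
pre = [0.48, 0.52, 0.50, 0.55]
post = [0.51, 0.55, 0.52, 0.59]
t_stat, df = paired_t(pre, post)
print(f"t({df}) = {t_stat:.2f}, per-test alpha = {bonferroni_alpha(0.05, 14):.4f}")
```

Dividing the significance level by the number of comparisons makes each individual test stricter, which is the price of testing many gait variables on the same fifteen participants.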

4. Discussion

This study aimed to analyse the acute effects of stretching the hip flexor muscles on walking gait. It was hypothesized that the transient effect of a single bout of static stretching exercises would acutely increase joint range of motion and change gait pattern. These changes are expected to reduce the risk of falls in the elderly (Kerrigan et al., 1998, 2001, 2003; Evans et al., 2003).

The gait pattern exhibited immediately before stretching showed dynamic temporal and spatial features similar to those reported in other studies (Murray et al., 1969; Winter, 1991; Prince et al., 1997; Kerrigan et al., 1998; Mills and Barrett, 2001). This indicated that the sample used in the present study was adequate to represent the general healthy elderly population living independently in the community. Aging-related conditions (e.g., balance problems, osteoarthritis) may produce changes in gait pattern that could influence our results.

The stretching protocol used in this study was similar to several others, which have shown significant gains in range of motion (Murray et al., 1969; Taylor et al., 1990; Bandy et al., 1997; Prince et al., 1997; Feland et al., 2001a,b). Although the acute effects of stretching were not recorded during the experimental session to determine whether they were still present during the gait assessment, the short interval imposed (less than 30 s) was considered sufficient to preserve most exercise effects. Spernoga et al. (2001) analysed the muscle-tendon elastic properties over a much longer period and found that significant effects were still present 6 min after stretching.

Gait speed has been suggested as the best independent fall-related predictor (Dargent-Molina et al., 1996). Guimarães and Isaacs (1980) and Woo et al. (1995) have demonstrated that fallers tend to have a lower gait velocity in comparison to non-fallers. Therefore, the greater walking speed found after stretching suggests that these exercises successfully counteracted some important functional effects of aging and resulted in improved mobility. Thus, stretching exercises may represent an important strategy to reduce the risk of falls during walking.

Walking speed is ultimately determined by step length and cadence (Zakas et al., 2005). The greater walking speed found in the present study cannot be explained by cadence, which remained unaltered. Rather, increased step length as a result of increased pelvic rotation and tilting range of motion can be considered the key to the greater walking speed after stretching. The greater range of motion around the pelvis may have allowed the heel of the swinging leg to strike further in front of the body (Rose and Gamble, 2006). Increased pelvic rotation is believed to have an important effect on gait dynamics by flattening the summit of the centre of mass path, which produces a smoother displacement of the body (Rose and Gamble, 2006). It is also described as causing a smoother change in the centre of mass that allows the elderly to attenuate the impact forces with the ground. Thus, it can be speculated that reducing the impact forces at heel strike may help to reduce head acceleration during progression, facilitating stabilization of the visual platform (Yack and Berger, 1993) and causing fewer disturbances to the vestibular apparatus.

Increased double support time in the elderly (Kemoun et al., 2002) is another well known predictor of falls. The longer duration of double support can be seen as a necessity to increase stability during progression for the next step (Viel, 2001). Therefore, a shorter double support time may indicate better stability during gait, which may also represent a measure of mobility. This reinforces the idea that stretching exercises can be an effective way to improve gait performance in the elderly.

The anterior-posterior heel contact velocity and the toe clearance have been related to the risk of fall (Winter, 1991). The anterior-posterior heel contact velocity was similar to that described in other studies (1.15 m s⁻¹; Sadeghi et al., 2000). This variable is considered to be largely determined by the segmental angular velocities of the thigh, shank and foot of the swinging leg. The stability of the segmental angular velocities found in the present study can explain the unchanged anterior-posterior heel contact velocity and clearance.
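The relation between walking speed, step length and cadence discussed above can be checked against the group means in Table 1 with a short worked example. Two assumptions are made here and are not stated in the paper: steps are treated as symmetric (stride length = 2 × step length), and the cadence values are read as gait cycles (strides) per minute, because a cycle duration of 1.10 s corresponds to about 54.5 cycles per minute, close to the reported 55.3. The function name is illustrative.

```python
def stride_speed(step_length_m: float, strides_per_min: float) -> float:
    """Walking speed (m/s) from step length and stride frequency.

    Assumes symmetric steps, so one stride covers two step lengths.
    """
    stride_length_m = 2.0 * step_length_m
    return stride_length_m * strides_per_min / 60.0


# Group means from Table 1 (PRE and POST); cadence read as strides per minute.
pre_speed = stride_speed(0.51, 55.3)
post_speed = stride_speed(0.54, 55.7)
print(f"PRE  estimate: {pre_speed:.2f} m/s (reported 0.96)")
print(f"POST estimate: {post_speed:.2f} m/s (reported 1.02)")
```

Under these assumptions the estimates (about 0.94 and 1.00 m/s) fall close to the reported speeds, and almost all of the increase comes from the longer step, in line with the argument that cadence did not drive the change. Products of group means will not exactly reproduce a mean of individual speeds, so small gaps are expected.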

5. Conclusion

Stretching exercises resulted in important modifications in gait characteristics that allowed the elderly to present a movement pattern more similar to that observed in healthy adults. These results suggest that these exercises constitute an attractive strategy to reduce the negative influence of aging on a number of functional characteristics related to fall risk during gait. It is important to bear in mind that stretching exercises are an important component of physical fitness programs and should be viewed as one of the factors that influence gait performance.

Studies analyzing the long-term effects of stretching exercises performed under supervision are required to observe whether the transient effects shown in the present study occur as a result of a systematic training program. In addition, longitudinal studies relating stretching and the risk of fall are necessary to confirm these suppositions experimentally.

Conflicts of interest

Authors have exclusive academic interest in this manuscript and there are no conflicts of interest in the present submission.

References

Andersson BVG, Schultz AB. Transmission of moments across the elbow joint and the lumbar spine. Journal of Biomechanics 1979;12:745–55.
Bandy WD, Irion JM, Briggler M. The effect of time and frequency of static stretching on flexibility of the hamstring muscles. Physical Therapy 1997;77(7):1090–6.
Blake A, Morgan K, Bendall M. Falls by elderly people at home: prevalence and associated factors. Age & Ageing 1988;17:365–72.
Cameron I, Quine S. External hip protectors: likely non-compliance among high risk elderly living in the community. Archives of Gerontology and Geriatrics 1994;19:273–81.
Campbell AJ, Borrie MJ, Spears GF. Risk factors for falls in a community-based prospective study of people 70 years and older. Journal of Gerontology 1989;44:112–7.
Cummings SR, Black DM, Nevitt MC, Browner WS, Cauley JA, Genant HK, et al. Appendicular bone density and age predict hip fracture in women. JAMA 1990;263:665–8.
Dargent-Molina P, Favier F, Grandjean H, Baudoin C, Schott AM, Hausherr E, et al. Fall-related factors and risk of hip fractures: the EPIDOS prospective study. The Lancet 1996;348:145–9.
Evans JM, Zavarei K, Lelas JJ, Riley PO, Kerrigan DC. Reduce hip extension in the elderly: dynamic or postural? Archives of Physical Medicine and Rehabilitation 2003;84:A15.
Feland JB, Myrer JW, Merrill RM. Acute changes in hamstring flexibility: PNF versus static stretch in senior athletes. Physical Therapy in Sport 2001a;2:186–93.
Feland JB, Myrer JW, Schulthies SS, Fellingham GW, Measom GW. The effect of duration of stretching of the hamstring muscle group for increasing range of motion in people aged 65 years or older. Physical Therapy 2001b;81(5):1110–7.
Ferber R, Osternig LR, Gravelle DC. Effect of PNF stretch techniques on knee flexor muscle EMG activity in older adults. Journal of Electromyography and Kinesiology 2002;12:391–7.
Guimarães JMN, Farinatti PTV. Análise descritiva de variáveis teoricamente associadas ao risco de quedas em mulheres idosas. Rev. Bras. Med. Esporte 2005;11(5):299–305.
Guimarães RM, Isaacs B. Characteristics of the gait of old people who fall. International Rehabilitation Medicine 1980;2:177–80.
Halbertsma J, Goeken L. Stretching exercises: effect on passive extensibility and stiffness in short hamstrings of healthy subjects. Archives of Physical Medicine and Rehabilitation 1994;74:976–81.
Honeycutt PH, Ramsey P. Factor contributing to falls in elderly men living in the community. Geriatric Nursing 2002;23(5):250–5.
Kemoun G, Thoumie P, Boisson D, Guieu JD. Ankle dorsiflexion delay can predict falls in the elderly. Journal of Rehabilitation Medicine 2002;34:278–83.
Kerrigan DC, Lee LW, Collins JJ, Riley PO, Lipsitz LA. Reduce hip extension during walking: healthy elderly and fallers versus young adults. Archives of Physical Medicine and Rehabilitation 2001;82:26–30.
Kerrigan DC, Todd MK, Della Croce U, Lipsitz LA, Collins JJ. Biomechanical gait alterations independent of speed in the healthy elderly: evidence for specific limiting impairments. Archives of Physical Medicine and Rehabilitation 1998;79:317–22.
Kerrigan DC, Xenopoulos-Oddsson A, Sullivan MJ, Lelas JJ, Riley PO. Effect of a hip flexor-stretching program on gait in the elderly. Archives of Physical Medicine and Rehabilitation 2003;84:1–6.
King MB, Whipple RH, Gruman CA, Judge JO, Schmidt JA, Wolfson LI. The performance enhancement project: improving physical performance in older persons. Archives of Physical Medicine and Rehabilitation 2002;83:1060–9.
Kubo K, Kanehisa H, Fukunaga T. Effect of stretching training on the viscoelastic properties of human tendon structures in vivo. Journal of Applied Physiology 2002;92:595–601.
McHugh M, Magnusson S, Gleim G, Nicholas J. Viscoelastic stress relaxation in human skeletal muscle. Medicine & Science in Sports & Exercise 1992;24(12):1375–82.
Mills PM, Barrett RS. Swing phase mechanics of healthy young and elderly men. Human Movement Science 2001;20:427–46.
Murray MP, Kory RC, Clarkson BH. Walking patterns in healthy old men. Journal of Gerontology 1969;24:169–78.
Prince F, Corriveau H, Hébert R, Winter DA. Gait in elderly. Gait and Posture 1997;5:128–35.
Rose J, Gamble JG. Human walking. 3rd ed. Baltimore: Lippincott Williams & Wilkins; 2006.
Sadeghi H, Allard P, Prince F, Labelle H. Symmetry and limb dominance in able-bodied gait: a review. Gait and Posture 2000;12:34–45.
Spernoga SG, Uhl TL, Arnold BL, Gansneder BM. Duration of maintained hamstring flexibility after one-time, modified hold-relax stretching protocol. Journal of Athletic Training 2001;36(1):44–8.
Taylor DC, Dalton JD, Seaber AV, Garrett WE. Viscoelastic properties of muscle-tendon units: the biomechanical effects of stretching. The American Journal of Sports Medicine 1990;18(3):300–9.
Viel E. A marcha humana, a corrida e o salto: biomecânica, investigações, normas e disfunções. Manole; 2001.
Willy R, Kyle B, Moore S, Chleboun G. Effect of cessation and resumption of static hamstring muscle stretching on joint range of motion. Journal of Orthopaedic and Sports Physical Therapy 2001;31(3):138–44.
Winter DA. The biomechanics and motor control of human gait: normal, elderly and pathological. 2nd ed. Waterloo: University of Waterloo Press; 1991.
Woo J, Ho SC, Lau J, Chan SG, Yuen YK. Age-associated gait-changes in the elderly: pathological or physiological? Neuroepidemiology 1995;14:65–71.
Yack HJ, Berger RC. Dynamic stability in the elderly: identifying a possible measure. Journal of Gerontology 1993;48:225–30.
Zakas A, Balaska P, Grammatikopoulou MG, Zakas N, Vergou A. Acute effects of stretching duration of range of motion of elderly woman. Journal of Bodywork and Movement Therapies 2005;9:270–6.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 173–179 www.elsevier.com/math

Original Article

A neuropathic pain component is common in acute whiplash and associated with a more complex clinical presentation

Michele Sterling a,b,*, Ashley Pedler c

a Centre of National Research on Disability and Rehabilitation Medicine (CONROD), The University of Queensland, Mayne Medical School, Herston Road, Herston, QLD 4066, Australia
b Division of Physiotherapy, The University of Queensland, QLD 4006, Australia
c Division of Physiotherapy, The University of Queensland, QLD 4072, Australia

Received 21 September 2007; received in revised form 6 January 2008; accepted 21 January 2008

Abstract

Whiplash is a heterogeneous condition with some individuals showing features suggestive of neuropathic pain. This study investigated the presence of a neuropathic pain component in acute whiplash using the Self-reported Leeds Assessment of Neuropathic Symptoms and Signs scale (S-LANSS) and evaluated relationships among S-LANSS responses, pain/disability, sensory characteristics (mechanical and thermal pain thresholds, and Brachial plexus provocation test (BPPT) responses) and psychological distress (General Health Questionnaire-28 (GHQ-28)). Participants were 85 people with acute whiplash (<4 weeks) (54 females, age 36.27 ± 12.69 years). Thirty-four percent demonstrated a predominantly neuropathic pain component (S-LANSS ≥ 12). This group showed higher pain/disability, cold hyperalgesia, cervical mechanical hyperalgesia, and less elbow extension with the BPPT (p < 0.03) when compared to the group with non-neuropathic pain (S-LANSS < 12). Pressure pain thresholds (PPTs) at distant sites and psychological distress (GHQ-28) were not different between the groups (p > 0.09). None of the S-LANSS items could discriminate those with cold hyperalgesia (p = 0.06). A predominantly neuropathic pain component is related to a complex presentation of higher pain/disability and sensory hypersensitivity. The S-LANSS may be a useful tool and the BPPT a useful clinical test in the early assessment of whiplash.

© 2008 Published by Elsevier Ltd.

Keywords: Whiplash; Neuropathic pain; Sensory hypersensitivity; Acute pain

* Corresponding author. Centre of National Research on Disability and Rehabilitation Medicine (CONROD), The University of Queensland, Mayne Medical School, Herston Road, Herston, QLD 4066, Australia. Tel.: +61 7 3365 5344; fax: +61 7 3346 4603. E-mail address: [email protected] (M. Sterling).
1356-689X/$ - see front matter © 2008 Published by Elsevier Ltd. doi:10.1016/j.math.2008.01.009

1. Introduction

Whiplash associated disorders (WADs) are heterogeneous and costly musculoskeletal conditions. A proportion (approx. 20–30%) of whiplash-injured people demonstrate a complex presentation manifested by higher levels of pain and disability, cold and mechanical hyperalgesia and sympathetic nervous system dysfunction (Sterling et al., 2003a; Kasch et al., 2005). Hypoaesthesia to mechanical and thermal stimulation is also present in the chronic stage of the condition (Chien et al., 2009), as is spinal cord hyperexcitability identified using nociceptor withdrawal reflexes (Banic et al., 2004). The presence of such phenomena suggests the existence of a neuropathic pain condition in some individuals with whiplash. The definition of neuropathic pain is controversial and without consensus (Bennett, 2003). For the purpose of this study, we have utilised the broad definition of The International Association for the Study of Pain (IASP): neuropathic pain as being

M. Sterling, A. Pedler / Manual Therapy 14 (2009) 173–179

caused by a lesion or dysfunction of the nervous system. Sensory hypersensitivity fits with the 'dysfunction' aspect of this definition (Bennett, 2003). More importantly, it has been shown that the early presence of some neuropathic features is associated with poor functional recovery at both short and long term follow-ups (Kasch et al., 2005; Sterling et al., 2005, 2006). In particular, cold and generalized mechanical hyperalgesia occur within a few weeks of injury in those with eventual poor recovery, and persist virtually unchanged to the chronic stage of the condition (Sterling et al., 2003a). Additionally, patients with chronic WAD and the presence of both cold and mechanical hyperalgesia demonstrate recalcitrance to physical rehabilitation (Jull et al., 2007). The identification of neuropathic pain has direct implications for treatment, as it has frequently been argued that treatments should be directed toward particular pain mechanisms (Gallagher, 2006). In the case of the whiplash sub-group showing early neuropathic features, such an approach may have the capacity to reduce the transition to chronicity.

In the acute phase following whiplash injury, patients are frequently assessed by musculoskeletal clinicians. At this crucial stage of the condition it is important that practitioners can identify those at risk of poor recovery. Initial high levels of pain and/or disability are the most consistent prognostic factors for whiplash (Scholten-Peeters et al., 2003) and can be easily measured using validated questionnaires (Stewart et al., 2007). However, the clinical determination of sensory hypersensitivity is more difficult and time consuming, and is usually not undertaken by clinicians. Whilst mechanical hyperalgesia can be measured in the clinic using a commercial pressure algometer (Ylinen, 2007), there are no clinical devices available to quantify cold pain threshold.

In recent times various screening tools have been developed to identify neuropathic pain (Bennett et al., 2007). Whilst most have been used in the investigation of more easily recognized neuropathic pain conditions such as diabetic neuropathies, some have identified neuropathic pain in musculoskeletal conditions including low back pain (Freynhagen et al., 2006). Whilst most of the tools require a physical assessment component, the Self-reported Leeds Assessment of Neuropathic Symptoms and Signs scale (S-LANSS) is particularly attractive for use in primary care as it is a self-report tool only (Bennett et al., 2005). The usefulness of such tools in the evaluation of whiplash is not known.

The aims of this study were: (1) to evaluate the presence of a neuropathic pain component in an acute whiplash cohort, (2) to investigate relationships between S-LANSS scores and pain and disability, psychological distress and sensory features of acute whiplash and (3) to determine relationships of S-LANSS items and cold pain threshold in acute whiplash.

2. Methods

2.1. Participants

Eighty-five individuals (54 females, mean (standard deviation, SD) age: 36.27 ± 12.69 years, mean (SD) symptom duration: 2.6 ± 1.2 weeks) reporting neck pain as a result of a motor vehicle crash participated in the study. The whiplash subjects were recruited via hospital accident and emergency departments, primary care practices and advertisement. They were eligible if they met the Quebec Task Force Classification of WAD I, II or III (Spitzer et al., 1995). Subjects were excluded if they were WAD IV, experienced concussion, loss of consciousness or head injury as a result of the accident, or reported a previous history of whiplash, neck pain or headaches that required treatment. Ethical clearance was gained from the Medical Research Ethics Committee of the institution involved.

2.2. Questionnaires

2.2.1. Neck Disability Index (NDI)

The NDI consists of 10 items addressing functional activities such as personal care, lifting, reading, work, driving, sleeping and recreational activities as well as pain intensity, concentration and headache (Vernon and Mior, 1991). There are six potential responses for each item, ranging from no disability (0) to total disability (5). The overall score (out of 100) is calculated by totaling the responses to the individual items and multiplying by two. A higher score indicates greater pain and disability (Vernon and Mior, 1991). The NDI is a valid, reliable and responsive measure of neck pain and disability (Pietrobon et al., 2002) and has been frequently used in research of whiplash (Sterling et al., 2006; Stewart et al., 2007).

2.2.2. S-LANSS

The S-LANSS is a validated self-report version of the Leeds Assessment of Neuropathic Symptoms and Signs pain scale (Bennett et al., 2005). It consists of seven items, two of which are self-examination items. A score of 12 or greater identifies patients with pain of a predominantly neuropathic nature (Bennett et al., 2007) (see Appendix 1).

2.2.3. General Health Questionnaire-28 (GHQ-28)

The GHQ-28 is a 28-item measure of emotional distress in medical settings (Goldberg, 1978), divided into four sub-scales: somatic symptoms, anxiety/insomnia, social dysfunction, and severe depression. The total score can be used as a measure of psychological distress. The GHQ-28 has been used in previous research of whiplash (Gargan et al., 1997; Sterling et al., 2003b).
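The scoring rules for the NDI and the S-LANSS described above can be sketched in a few lines. This is a minimal illustration rather than the authors' procedure: the function names are hypothetical, the S-LANSS item weights are read from Appendix 1, and the NDI item range of 0 to 5 is an assumption (it is what six response options and a 100-point maximum after doubling imply).

```python
from typing import Sequence

# Weight given to a positive (option b) answer on each S-LANSS item, as listed in Appendix 1
SLANSS_WEIGHTS = (5, 5, 3, 2, 1, 5, 3)
# A total of 12 or greater is taken to indicate predominantly neuropathic pain
SLANSS_CUTOFF = 12


def ndi_score(item_responses: Sequence[int]) -> int:
    """Sum the ten item responses (assumed 0-5 each) and double them: score out of 100."""
    if len(item_responses) != 10:
        raise ValueError("the NDI has 10 items")
    if any(r < 0 or r > 5 for r in item_responses):
        raise ValueError("each response is assumed to lie between 0 and 5")
    return 2 * sum(item_responses)


def slanss_score(positive_items: Sequence[bool]) -> int:
    """Add up the weights of the items answered positively (maximum 24)."""
    if len(positive_items) != len(SLANSS_WEIGHTS):
        raise ValueError("the S-LANSS has 7 items")
    return sum(w for w, yes in zip(SLANSS_WEIGHTS, positive_items) if yes)


def is_predominantly_neuropathic(total: int) -> bool:
    """Apply the cut-off of 12 or greater."""
    return total >= SLANSS_CUTOFF
```

For example, a respondent answering positively to Items 1, 3 and 6 only would score 5 + 3 + 5 = 13 and fall in the predominantly neuropathic group.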


M. Sterling, A. Pedler / Manual Therapy 14 (2009) 173–179

2.3. Quantitative sensory tests

2.3.1. Pressure pain thresholds (PPTs)

PPTs were measured using a pressure algometer with a probe size of 1 cm² and an application rate of 40 kPa/s (Somedic AB, Farsta, Sweden). PPTs were measured bilaterally over the spinous processes of C2 and C5, over the median nerve trunk at the anterior elbow and at a remote site (tibialis anterior). These sites have been previously used in investigation of WAD (Sterling et al., 2003a). Triplicate recordings were taken at each site and the mean values used for analysis.

2.3.2. Cold pain thresholds

Cold pain thresholds were measured bilaterally over the mid to lower cervical spine using the Thermotest system (Somedic AB, Farsta, Sweden) (Sterling et al., 2003a). Triplicate recordings were taken at each site and the mean values used for analysis.

2.3.3. Brachial plexus provocation test (BPPT)

The BPPT was performed as described previously (Sterling et al., 2003a). The range of elbow extension was measured at the subject's pain threshold using a standard goniometer. If the subject did not experience pain, the test was continued until the end of available range. At the completion of this test, the subjects were asked to record the pain perceived during the test on a 10 cm visual analogue scale (VAS).

2.4. Procedure

Participants first completed all questionnaires. Quantitative sensory testing was then performed in the following order: PPT, cold pain threshold and BPPT. The examiner performing these tests was blind to participant responses on the questionnaires. For all tests no verbal feedback was given to participants on their performance. PPT was performed in the following order: C2, C5, median nerve, and tibialis anterior. For PPT, cold pain threshold and BPPT, testing was performed on the left side first.

2.5. Data analysis

SPSS 14.0 for Windows was used for all analyses. Paired t-tests indicated no difference between sides (p > 0.05) for PPT, cold pain threshold or responses (elbow extension and VAS pain scores) to the BPPT, so the mean of left and right sides was used in further analysis. The participants were classified into two groups based on S-LANSS scores: (1) pain of predominantly neuropathic nature, defined as a score of ≥12 on the S-LANSS (Bennett et al., 2007; Smith et al., 2007), and (2) non-neuropathic pain, defined as an S-LANSS score of <12. A Multivariate Analysis of Variance (MANOVA) was used to determine group differences in questionnaire data and results of quantitative sensory tests. Receiver Operating Characteristic (ROC) analysis was used to examine the ability of each item and the total score of the S-LANSS to discriminate between those with and without cold hyperalgesia. For this analysis, cold pain threshold data were dichotomised as follows: 15 °C was the cut-off for the presence of cold hyperalgesia based on 95% confidence intervals of previous data from healthy controls (Sterling et al., 2003a) and other suggestions in the literature (Bennett, 2006). For all analyses alpha was set at 0.05.
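The dichotomisation and ROC step described in the data analysis can be illustrated with a small sketch. It is not the SPSS procedure used in the study: the function names and the sample values are hypothetical, and the direction of the 15 °C comparison (thresholds at or above the cut-off counted as cold hyperalgesia) is an assumption, since the text gives the cut-off value but not the comparison. The area under the curve is computed by pairwise comparison, which is equivalent to the usual rank-based estimate.

```python
from typing import Sequence

COLD_CUTOFF_C = 15.0  # cut-off value from the text; the direction of the comparison is assumed


def has_cold_hyperalgesia(cold_pain_threshold_c: float, cutoff_c: float = COLD_CUTOFF_C) -> bool:
    """Assumed rule: cold felt as painful at or above the cut-off temperature counts as hyperalgesia."""
    return cold_pain_threshold_c >= cutoff_c


def roc_auc(scores: Sequence[float], positives: Sequence[bool]) -> float:
    """Area under the ROC curve: share of (positive, negative) pairs ranked correctly, ties as half."""
    pos = [s for s, p in zip(scores, positives) if p]
    neg = [s for s, p in zip(scores, positives) if not p]
    if not pos or not neg:
        raise ValueError("need at least one case in each group")
    wins = 0.0
    for sp in pos:
        for sn in neg:
            if sp > sn:
                wins += 1.0
            elif sp == sn:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical illustration data, not values from the study
slanss_totals = [14, 8, 19, 5, 12, 3]
cold_thresholds_c = [18.2, 9.5, 16.0, 11.3, 12.8, 20.1]
labels = [has_cold_hyperalgesia(t) for t in cold_thresholds_c]
auc = roc_auc(slanss_totals, labels)
```

An area of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups, which is the scale on which the reported areas under the curve should be read.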

3. Results

The sample comprised 54 (62.35%) females, with mean (SD) age 36.27 ± 12.69 years and mean (SD) symptom duration 2.6 ± 1.2 weeks. Thirty-four percent (29/85) of the participants with acute whiplash injury reported a score of ≥12 on the S-LANSS, indicating a predominantly neuropathic component to the pain reported by these participants (Bennett et al., 2007). Within this group, Item 7 (numbness or tenderness in the painful area and allodynia) was the most frequently reported response (89.5%) and Item 2 (painful area changes colour – mottled or more red, autonomic) was the least reported at 36.8% (Table 1). MANOVA revealed that this group (pain of predominantly neuropathic nature) demonstrated significantly higher levels of pain and disability (NDI scores) (p = 0.005), lowered cold pain thresholds (cold hyperalgesia) (p = 0.036), lowered PPTs over both C2 and C5 spinous processes (p < 0.03) and less elbow extension at pain threshold with the BPPT (p = 0.03) (Table 2). There were no differences between the groups for PPTs at upper and lower limb sites (all p > 0.28), pain (VAS) reported with the BPPT (p = 0.48) or GHQ-28 scores (p = 0.09). Results of ROC analysis indicated that none of the S-LANSS items could significantly discriminate between the group with cold hyperalgesia and those without (all p > 0.06). Areas under the curve ranged from 0.453 (Item 6) to 0.653 (Item 1).

Table 1
Response frequency to S-LANSS items in the acute whiplash group with a predominantly neuropathic pain component, S-LANSS ≥12.

S-LANSS item             % Positive S-LANSS score
Item 1 (dysesthesia)     73.7
Item 2 (autonomic)       36.8
Item 3 (evoked pain)     84.2
Item 4 (paroxysmal)      52.6
Item 5 (thermal)         63.2
Item 6 (allodynia)       84.2
Item 7 (tender/numb)     89.5

n = 29/85, 34% of cohort.

Table 2
Mean (SD) values for sensory and questionnaire data for each group.

Group 1: pain with predominantly neuropathic component (S-LANSS ≥12), N = 29
Group 2: non-neuropathic pain (S-LANSS <12), N = 56

Variable                               Group 1           Group 2           p-Value
Pain and disability (NDI)              42.97 (19.5)      27.1 (17.6)       0.005
Cold pain threshold (°C)               16.38 (6.2)       12.3 (6.0)        0.036
PPT – C2 (kPa)                         144.29 (101.7)    207.21 (104.4)    0.047
PPT – C5 (kPa)                         146.8 (83.7)      242.68 (110)      0.003
PPT – median nerve (kPa)               222.23 (95)       230.8 (110.4)     0.79
PPT – tibialis anterior (kPa)          401.8 (183.3)     483.4 (235.17)    0.22
BPPT – elbow extension (from 180°)     56.5 (28)         35.3 (19)         0.003
BPPT – VAS (/10)                       1.7 (3.5)         1.1 (2.8)         0.82
GHQ-28                                 39.2 (15.6)       31.5 (14.6)       0.09

4. Discussion

Whiplash is a heterogeneous condition with some individuals displaying a more complex clinical presentation that includes moderate to high levels of pain and disability, generalized hyperalgesia and hyperexcitable motor responses suggestive of a neuropathic pain condition (Moog et al., 2002; Banic et al., 2004). The results of this study support this proposal, with 34% of an acute whiplash cohort demonstrating a predominantly neuropathic nature to their pain as assessed by the S-LANSS instrument.

There is much current debate as to whether or not conditions such as whiplash, which demonstrate neuropathic-type features but with no obvious injury to the nervous system, do in fact represent neuropathic pain (Fishbain et al., 2008). Recent investigation has shown that chronic low back pain (Freynhagen et al., 2006), fibromyalgia and complex regional pain syndrome 1 (Fishbain et al., 2008) may have a neuropathic component. Our data indicate that this may also be the case for acute whiplash. It has been argued that painful conditions should not be classified into two mutually exclusive groups, that is, either nociceptive or neuropathic in origin (Attal and Bouhissera, 2004; Bennett et al., 2006). These authors advocate a more flexible model of classification where the aim is to identify pain of predominantly neuropathic origin rather than an all or nothing phenomenon (Bennett et al., 2006), and the S-LANSS instrument was developed along these lines (Bennett et al., 2005). Therefore, whilst we cannot say that 34% of our cohort have definitive neuropathic pain, our findings indicate that a significant proportion of individuals with acute whiplash injury demonstrate pain that is predominantly neuropathic in nature.

The whiplash group with S-LANSS scores of 12 or greater demonstrated a clinical presentation that would support a neuropathic pain model. This group showed NDI scores indicating moderate to severe pain and disability, cold hyperalgesia, local mechanical hyperalgesia and heightened responses to the BPPT. Cold and mechanical hyperalgesia are common features of neuropathic pain (Bennett, 2006; Wasner et al., in press) and we have previously argued that generalized and heightened responses to the BPPT are likely to be an indication of central nervous system hyperexcitability (Sterling et al., 2002). Interestingly, there was no difference in PPTs at the upper or lower limb sites between the two whiplash groups. This may be considered unusual since widespread mechanical hyperalgesia is also considered to be a feature of neuropathic pain (Koelbaek-Johansen et al., 1999). However, we have recently shown that psychological factors such as distress and catastrophisation may play a role in sensory hypersensitivity at more distant sites (Sterling et al., 2008). As there was no difference in levels of distress between the two whiplash groups of the study, this may explain the lack of difference found in PPTs at distant sites. It also suggests that measurement of cold hyperalgesia, PPTs over the cervical spine and responses to the BPPT could provide a clearer clinical picture of possible neuropathic pain in whiplash.

The findings of this study are relevant to clinical practice. Many individuals will consult a musculoskeletal clinician in the early acute stage post whiplash injury. It is clear that this is a crucial stage of the whiplash condition as it has been shown that there is limited recovery after two to three months post accident (Rebbeck et al., 2006). It is imperative that primary care practitioners consider the presence of adverse prognostic indicators in their assessment of patients with whiplash. Whilst some prognostic indicators (for example pain and disability levels, Scholten-Peeters et al., 2003) are relatively straightforward to measure in the clinic, others such as cold hyperalgesia (Sterling et al., 2006) are more difficult and require laboratory equipment to quantify. For this reason we explored the ability of individual S-LANSS items and the total score to discriminate the group with cold hyperalgesia from those without, cold hyperalgesia being defined as 15 °C (Bennett, 2006). None of the S-LANSS items or the total score discriminated the two groups, which indicates that additional (physical) measures of cold hyperalgesia may be required for adequate assessment. In addition to its predictive capacity, cold hyperalgesia may also be an


indicator of non-responsiveness to physical rehabilitation, at least in chronic WAD (Jull et al., 2007). It is not known whether whiplash-injured patients identified as having pain with a predominantly neuropathic component may also show recalcitrance to standard interventions and this requires further investigation.

There was no difference in psychological distress (GHQ-28 scores) between the predominantly neuropathic pain group and the non-neuropathic pain group. However, both whiplash groups were well above the threshold scores of 24/25 for the GHQ-28 (Goldberg, 1978), indicating that whiplash injury and its associated neck pain are distressing irrespective of symptom level. This would support previous findings where elevated levels of distress were found in the majority of an acute whiplash cohort but decreased in those who eventually recovered, closely paralleling decreasing pain and disability levels (Sterling et al., 2003b). Whether or not psychological distress decreases over time in the non-neuropathic group of our study remains to be seen.

It has been argued that assessment for the presence of a neuropathic pain component should comprise not only questionnaires but also physical examination (Hansson, 2007). Our finding of a lack of relationship between S-LANSS items and cold pain threshold would concur with this suggestion. The question for the assessment of whiplash is which physical examination tests should be included. At the present time, cold pain threshold is difficult to measure in the clinic but options may include the use of thermorollers set at predetermined temperatures (Jensen and Baron, 2003). Pressure algometry has been suggested as a useful clinical tool but our results indicate that measurement of PPTs at distant sites may not provide information on neuropathic components of whiplash pain. Instead the BPPT may be a useful clinical tool, given the differences in elbow extension (bilaterally) between the predominantly neuropathic and non-neuropathic groups of our study. Bilaterally limited elbow extension with the BPPT may indicate central hyperexcitability (Sterling et al., 2003a) and our results would support this proposal. It should be noted that in these studies, elbow extension was measured at pain threshold only and that if the BPPT is an indication of augmented central pain processing, then care will be required with its use in order to avoid potential symptom exacerbation.

5. Conclusion

A predominantly neuropathic component to acute whiplash pain was present in 34% of this cohort and was associated with a more complex presentation of higher pain and disability levels, cold hyperalgesia, local cervical hyperalgesia and less bilateral elbow extension with the BPPT. The S-LANSS may be a useful tool to include in the early assessment of whiplash injury. However, there was no relationship between S-LANSS items and cold pain threshold, indicating that physical measures of sensory hypersensitivity may also need to be included in the assessment of acute whiplash.

Appendix 1

Leeds Assessment of Neuropathic Symptoms and Signs (S-LANSS)

Think about how your pain that you showed in the diagram has felt over the last week. Please tick the descriptions that best match your pain. These descriptions may, or may not, match your pain no matter how severe it feels.

1. In the area where you have pain, do you also have 'pins and needles', tingling or prickling sensations?
   a. NO – I don't get these sensations (0)
   b. YES – I do get these sensations (5)

2. Does the painful area change colour (perhaps looks mottled or more red) when the pain is particularly bad?
   a. NO – The pain does not affect the colour of my skin (0)
   b. YES – I have noticed that the pain does make my skin different from normal (5)

3. Does your pain make the affected skin abnormally sensitive to touch? Getting unpleasant sensations or pain when lightly stroking the skin might describe this.
   a. NO – The pain does not make my skin in that area abnormally sensitive to touch (0)
   b. YES – My skin in that area is particularly sensitive to touch (3)

4. Does your pain come on suddenly and in bursts for no apparent reason when you are completely still? Words like 'electric shocks', jumping and bursting might describe this.
   a. NO – My pain doesn't really feel like this (0)
   b. YES – I get these sensations often (2)

5. In the area where you have pain, does your skin feel unusually hot like a burning pain?
   a. NO – I don't have burning pain (0)
   b. YES – I get these sensations often (1)

6. Gently rub the painful area with your index finger and then rub a non-painful area (for example, an area of skin further away or on the opposite side from the painful area). How does this rubbing feel in the painful area?
   a. The pain area feels no different from the non-painful area. (0)
   b. I feel discomfort, like pins and needles, tingling or burning in the painful area that is different from the non-painful area. (5)

7. Gently press on the painful area with your finger then gently press in the same way onto a non-painful area (the same non-painful area that you chose in the last question). How does this feel in the painful area?
   a. The pain area feels no different from the non-painful area. (0)
   b. I feel numbness or tenderness in the painful area that is different from the non-painful area. (3)

References

Attal N, Bouhissera D. Can pain be more or less neuropathic? Pain 2004:110.
Banic B, Petersen-Felix S, Andersen O, Radanov B, Villiger P, Arendt-Nielsen L, et al. Evidence for spinal cord hypersensitivity in chronic pain after whiplash injury and in fibromyalgia. Pain 2004;107:7–15.
Bennett G. Neuropathic pain: a crisis of definition. Anesthesia and Analgesia 2003;97:619.
Bennett G. Can we distinguish between inflammatory and neuropathic pain? Pain Research and Management 2006;11:11–5.
Bennett M, Attal N, Backonja M, Baron R, Bouhassira D, Freynhagen R, et al. Using screening tools to identify neuropathic pain. Pain 2007;127:199–203.
Bennett M, Smith B, Torrance N, Lee A. Can pain be more or less neuropathic? Comparison of symptom assessment tools with ratings of certainty by clinicians. Pain 2006;122:289–94.
Bennett M, Smith B, Torrance N, Potter J. The S-LANSS score for identifying pain of predominantly neuropathic origin: validation for use in clinical and postal research. The Journal of Pain 2005;6:149–58.
Chien A, Eliav E, Sterling M. Hypoaesthesia occurs with sensory hypersensitivity in chronic whiplash: indication of a minor peripheral neuropathy? Manual Therapy 2009;14:137–45.
Fishbain D, Lewis J, Cutler R, Cole B, Rosomoff H, Rosomoff R. Can the neuropathic Pain Scale discriminate between non-neuropathic and neuropathic pain. Pain Medicine 2008;9:149–60.
Freynhagen R, Baron R, Gockel U, Tolle T. painDETECT: a new screening questionnaire to identify neuropathic components in patients with low back pain. Current Medical Research and Opinion 2006;22:1911–20.

Gallagher R. Management of neuropathic pain. Clinical Journal of Pain 2006;22:S2–8.
Gargan M, Bannister G, Main C, Hollis S. The behavioural response to whiplash injury. The Journal of Bone and Joint Surgery 1997;79-B:523–6.
Goldberg D. Manual of the general health questionnaire. Windsor: NFER-Nelson; 1978.
Hansson P. Diagnostic work up of neuropathic pain: computing, using questionnaires or examining the patient? European Journal of Pain 2007;11:367–9.
Jensen T, Baron R. Translation of symptoms and signs into mechanisms in neuropathic pain. Pain 2003;102:1–8.
Jull G, Sterling M, Kenardy J, Beller E. Does the presence of sensory hypersensitivity influence outcomes of physical rehabilitation for chronic whiplash? – A preliminary RCT. Pain 2007;129:28–34.
Kasch H, Qerama E, Bach F, Jensen T. Reduced cold pressor pain tolerance in non-recovered whiplash patients: a 1 year prospective study. European Journal of Pain 2005;9:561–9.
Koelbaek-Johansen M, Graven-Nielsen T, Schou-Olesen A, Arendt-Nielsen L. Muscular hyperalgesia and referred pain in chronic whiplash syndrome. Pain 1999;83:229–34.
Moog M, Quintner J, Hall T, Zusman M. The late whiplash syndrome: a psychophysical study. European Journal of Pain 2002;6:283–94.
Pietrobon R, Coevtaux R, Carey T, Richardson W, De Vellis R. Standard scales for measurement of functional outcome for cervical pain or dysfunction: a systematic review. Spine 2002;27:515–22.
Rebbeck T, Sindhausen D, Cameron I. A prospective cohort study of health outcomes following whiplash associated disorders in an Australian population. Injury Prevention 2006;12:86–93.
Scholten-Peeters G, Verhagen A, Bekkering G, van der Windt D, Barnsley L, Oostendorp R, et al. Prognostic factors of whiplash associated disorders: a systematic review of prospective cohort studies. Pain 2003;104:303–22.
Smith B, Torrance N, Bennett M, Lee A. Health and quality of life associated with chronic pain of predominantly neuropathic origin in the community. Clinical Journal of Pain 2007;23:143–9.
Spitzer W, Skovron M, Salmi L, Cassidy J, Duranceau J, Suissa S, et al. Scientific monograph of Quebec task force on whiplash associated disorders: redefining "Whiplash" and its management. Spine 1995;20:1–73.
Sterling M, Jull G, Kenardy J. Physical and psychological predictors of outcome following whiplash injury maintain predictive capacity at long term follow-up. Pain 2006;122:102–8.
Sterling M, Jull G, Vicenzino B, Kenardy J. Sensory hypersensitivity occurs soon after whiplash injury and is associated with poor recovery. Pain 2003a;104:509–17.
Sterling M, Kenardy J, Jull G, Vicenzino B. The development of psychological changes following whiplash injury. Pain 2003b;106:481–9.
Sterling M, Jull G, Vicenzino B, Kenardy J, Darnell R. Physical and psychological factors predict outcome following whiplash injury. Pain 2005;114:141–8.
Sterling M, Pettiford C, Hodkinson E, Curatolo M. Psychological factors are related to some sensory pain thresholds but not nociceptive flexion reflex threshold in chronic whiplash. Clinical Journal of Pain 2008;24:124–30.
Sterling M, Treleaven J, Jull G. Responses to a clinical test of mechanical provocation of nerve tissue in whiplash associated disorders. Manual Therapy 2002;7:89–94.
Stewart M, Maher C, Refshauge K, Bogduk N, Nicholas M. Responsiveness of pain and disability measures for chronic whiplash. Spine 2007;32:580–5.
Vernon H, Mior S. The neck disability index: a study of reliability and validity. Journal of Manipulative and Physiological Therapeutics 1991;14:409–15.
Wasner G, Naleschinski D, Binder A, Schattschneider J, McLachlan E, Baron R. The effect of menthol on cold allodynia in patients with neuropathic pain. Pain Medicine; in press. doi:10.1111/j.15264637.2007.00290.x.
Ylinen J. Clinimetrics: pressure algometry. The Australian Journal of Physiotherapy 2007;53:207.


Manual Therapy 14 (2009) 180–188 www.elsevier.com/math

Original Article

Effect of motor control and strengthening exercises on shoulder function in persons with impingement syndrome: A single-subject study design*

Jean-Sébastien Roy a,*, Hélène Moffet a,b, Luc J. Hébert c,d, Richard Lirette e

a Centre for Interdisciplinary Research in Rehabilitation and Social Integration, Canada
b Department of Rehabilitation, Faculty of Medicine, Laval University, Canada
c Department of Radiology, Faculty of Medicine, Laval University, Canada
d National Defence of Canada, Canada
e Club Entrain Medical Center, Canada

Received 4 July 2007; received in revised form 14 January 2008; accepted 21 January 2008

Abstract

The aim of the study was to evaluate the effect of an intervention including shoulder control and strengthening exercises on function in persons with shoulder impingement. Eight subjects with shoulder impingement were evaluated weekly during the nine weeks of this single-subject design study. The study was divided into three phases (A1–B–A2) and involved repeated measures of shoulder pain and function (Shoulder Pain And Disability Index (SPADI) questionnaire), painful arc of motion, peak torque and 3-dimensional scapular attitudes. During the intervention phase, each subject participated in 12 exercise sessions supervised by a physiotherapist. Measures taken during the intervention and post-intervention phases were compared to pre-intervention values. All subjects showed significant improvement in the SPADI at the end of the study. A disappearance of a painful arc of motion in flexion and abduction (n = 6), an increase in isometric peak torque in lateral rotation (n = 3) and abduction (n = 2), and changes in the scapular kinematics, mainly in the sagittal plane, were also observed. The present results provide preliminary evidence to support the use of shoulder control exercises to reduce pain and improve function of persons with shoulder impingement.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Rehabilitation; Kinematics; Exercise

* Institution to which the work should be attributed is Centre for Interdisciplinary Research in Rehabilitation and Social Integration, Quebec Rehabilitation Institute, 525, Boulevard Hamel, Quebec City (QC), Canada G1M 2S8.
* Corresponding author. Centre interdisciplinaire de recherche en réadaptation et en intégration sociale, Institut de réadaptation en déficience physique de Québec, Local H-1602, 525, Boulevard Wilfrid-Hamel, Québec (QC), Canada G1M 2S8. Tel.: +1 418 529 9141x6559; fax: +1 418 529 3548. E-mail address: [email protected] (J.-S. Roy).

1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.010

1. Introduction

More than a third of painful shoulder diagnoses are related to disorders of the rotator cuff that are often associated with a clinical entity called shoulder impingement syndrome (SIS) (Matsen and Arntz, 1990). SIS has been described as a repeated mechanical compression of the subacromial structures under the coracoacromial arch during arm elevation (Matsen and Arntz, 1990). In a systematic review, Michener et al. (2004) concluded from limited evidence that exercises and joint mobilization are efficacious for people with SIS. Many other studies have also reported a positive effect of exercises, such as strengthening, stretching, and motor control exercises, on shoulder function (Brox et al., 1993; Bang and Deyle, 2000; Ludewig and Borstad, 2003; McClure et al., 2004; Walther et al., 2004; Ginn and Cohen, 2005). However, the duration of the proposed exercise programs (three weeks–six months), as well as their intensity and the level of subject supervision, varied widely across studies.

Several studies have identified impairments associated with SIS. They have reported that people with SIS present weakness of scapulohumeral muscles (Warner et al., 1990; Leroux et al., 1994) and improper control of the glenohumeral (G/H) and scapulothoracic (S/T) movements during arm elevation. Improper control is characterized by changes in muscle activation levels. More specifically, lower activity of the serratus anterior, higher activity of the upper and lower trapezius (Ludewig and Cook, 2000), and lack of coordination between the different parts of the trapezius have been observed (Wadsworth and Bullock-Saxton, 1997; Cools et al., 2003). This inadequate muscle control is believed to contribute to a reduction of amplitude in posterior tilting and lateral rotation of the scapula during arm elevation (Ludewig and Cook, 2000; Borstad and Ludewig, 2002). Lower activity of the infraspinatus and subscapularis (Reddy et al., 2000) as well as inadequate coactivation of the scapulohumeral muscles (Myers et al., 2003) have also been reported. This abnormal muscle control is most likely associated with a reduction of the subacromial space (Graichen et al., 1999; Hébert et al., 2003b) leading to impingement.

The hypothesis that can be derived from these studies is that improper control of G/H and S/T joint movements and strength deficits in the scapulohumeral and scapulothoracic muscles seem to be partly responsible for the level of shoulder disability in patients with SIS.
Hence, this highlights the importance of assessing a comprehensive rehabilitation program that combines two types of exercises used in rehabilitation for SIS: supervised motor control exercises to correct the abnormal G/H and S/T movements and strengthening exercises. As a ﬁrst step to assess the potential beneﬁt of such a program, the individual responses to this type of rehabilitation intervention must be evaluated. The aim of this study was to evaluate, using a single-subject design, the eﬀects of a 4-week supervised rehabilitation intervention based on a combination of shoulder control and strengthening exercises on shoulder function in persons with SIS.

2. Methods

2.1. Subject selection

Eight subjects with unilateral SIS, diagnosed by an orthopaedic surgeon, were recruited (Table 1). The subjects were included if they had at least one positive finding in each of these categories (Hébert et al., 2003b): (1) painful arc of movement during flexion or abduction, (2) positive Neer or Kennedy–Hawkins impingement signs, and (3) pain on resisted lateral rotation, abduction or Jobe test. Exclusion criteria were type III acromion, calcification or fracture; shoulder instability; previous shoulder surgery; and cervicobrachialgia or shoulder pain during neck movement. All subjects signed an informed consent form. This study was approved by the Ethics Committee of the Quebec Rehabilitation Institute.

Table 1
Subjects' characteristics at the initial evaluation.

Subject  Age (years)  Gender            Dominant side    Impaired side    Weight (kg)   Height (m)    Duration (month)b  Mean SPADIc
S1       53           F                 Right            Right            57            1.61          12                 69.6
S2       40           M                 Left             Right            96            1.80          26                 27.0
S3       32           F                 Right            Right            85            1.74          3                  28.1
S4       49           F                 Right            Left             93            1.51          16                 67.9
S5       60           F                 Right            Right            69            1.62          3                  47.1
S6       29           F                 Left             Right            62            1.61          5                  38.2
S7       56           F                 Right            Left             79            1.69          8                  42.0
S8       50           F                 Left             Left             63            1.70          48                 26.5
Total    46a (11)     1 male, 7 female  5 right, 3 left  5 right, 3 left  75.5a (14.9)  1.66a (0.09)  15.1a (15.4)       43.3a (17.4)

a Mean (1 standard deviation).
b Time between the appearance of the symptoms and the initial evaluation.
c Mean of the three SPADI scores during phase A1.

2.2. Study design

An A1–B–A2 single-subject design was used (Backman et al., 1997). The study was divided into three phases over a 9-week period. Within the first two weeks, three evaluations of the outcome measures were performed (phase A1). During the following four weeks (phase B), each subject participated in 12 supervised exercise sessions and the immediate effect of the intervention was assessed at the end of each week. The last three weeks consisted of the post-intervention phase (phase A2) during which the short-term effects of the intervention were assessed once a week. The subjects were assessed and treated by the same physiotherapist.

2.3. Outcome measures

The main outcome was the pain and disability level, which was evaluated at the beginning of the study and each week thereafter using the Shoulder Pain And Disability Index (SPADI). The SPADI is a valid and reliable self-administered questionnaire (Roach et al., 1991). Scores range from 0 to 100, with higher scores indicating a greater level of pain and disability. Secondary outcomes were the presence of a painful arc of motion, assessed at the same time points as the SPADI, and the isometric peak torque, the pain intensity during strength tests and the 3-dimensional scapular attitudes (3DSA), assessed at the beginning of the study and at the end of the A1, B and A2 phases.

In a seated position, the presence of a painful arc of shoulder motion during flexion and abduction was evaluated. If pain was present during one of the two trials performed in each plane of movement, the subject was considered to have a painful arc of motion in that plane. In a supine position, the maximal isometric strength of shoulder abductors (shoulder at 10° of abduction; elbow at 0°) and lateral rotators (shoulder at 0° of abduction; elbow at 90°) was assessed with a dynamometer (Chatillon CSD 300, Greensboro, NC). The mean torque (n = 2) in Newton-meters was calculated. The intensity of pain during these tests was measured with a visual analogue scale (VAS). The VAS scores for each muscle group were averaged (n = 2) to calculate the final outcome (0–100).

The 3DSA were calculated at two shoulder positions, 90° of abduction and 70° of flexion, with the Optotrak Probing System (Northern Digital Inc., Waterloo, Ontario, Canada) (Hébert et al., 2000; Roy et al., 2007). These positions were chosen because it has been shown that a reduced posterior tilting at those two positions along with five other variables could explain 91% of the variance of the pain and disability level experienced by subjects with SIS (Hébert et al., 2003a). Two trials were recorded at each position and the mean (n = 2) was used for the analysis. For each trial, six body landmarks were digitized: three on the scapula (acromial angle, inferior angle, root of the spine), and three on the trunk (C7 spinous process, right and left posterosuperior iliac spines). The position of the scapula was calculated relative to the trunk. The three scapular rotations used to describe the 3DSA were lateral/medial rotation, anterior/posterior tilting, and protraction/retraction (Fig. 1). The coordinate system and Euler angle sequence of rotations were defined in accordance with ISB recommendations (Wu et al., 2005).
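The Euler-angle step of the 3DSA calculation can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. It assumes that a 3×3 rotation matrix of the scapula relative to the trunk has already been built from the digitized landmarks, and that the matrix is composed as R = R_Y(α) R_X(β) R_Z(γ), in line with the Y, X, Z sequence given in the Fig. 1 caption. The function names, the column-vector convention and the sample angles are hypothetical.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def rotation_yxz(alpha: float, beta: float, gamma: float) -> Matrix:
    """Build R = Ry(alpha) * Rx(beta) * Rz(gamma); all angles in radians."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    cc, sc = math.cos(gamma), math.sin(gamma)
    return [
        [ca * cc + sa * sb * sc, -ca * sc + sa * sb * cc, sa * cb],
        [cb * sc, cb * cc, -sb],
        [-sa * cc + ca * sb * sc, sa * sc + ca * sb * cc, ca * cb],
    ]


def euler_yxz(r: Matrix) -> Tuple[float, float, float]:
    """Recover the angles about Y, X and Z from R; valid away from beta = +/-90 degrees."""
    # Clamp guards against rounding pushing the sine slightly outside [-1, 1]
    beta = math.asin(max(-1.0, min(1.0, -r[1][2])))
    alpha = math.atan2(r[0][2], r[2][2])
    gamma = math.atan2(r[1][0], r[1][1])
    return alpha, beta, gamma


# Hypothetical attitude: 30 deg about Y, -10 deg about X, 15 deg about Z
angles_in = tuple(math.radians(a) for a in (30.0, -10.0, 15.0))
angles_out = euler_yxz(rotation_yxz(*angles_in))
```

Averaging the two trials recorded at each shoulder position would then be done on the recovered angles, one axis at a time.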

2.4. Phase A1: pre-intervention

At the first evaluation visit, all outcomes were evaluated and the participants were taught a standardized home exercise program. This program, performed daily, comprised submaximal isometric contraction exercises in abduction and in lateral and medial rotation against a wall. It was prescribed for ethical reasons, since it was not possible to leave the subjects without any intervention for two weeks. Participants were evaluated at the end of each week during this 2-week phase. At the last evaluation, a standardized physical examination was performed (shoulder range of motion [ROM], evaluation of scapular movements during arm elevation). The results of this examination were used to determine the intensity of the exercises performed in phase B.

2.5. Phase B: intervention

Before developing the intervention program, a review of the literature on SIS and motor learning principles was conducted and a focus group of physiotherapists was held. The aims of the intervention were then determined. The first was to promote proper scapular kinematics during arm elevation against gravity, and the second was to strengthen the scapulohumeral and scapulothoracic muscles with an external resistance. Strengthening exercises with an external resistance were introduced only once proper shoulder control had been observed, to ensure a gradual loading of the muscle-tendon-bone units without any setback in the pain level. As a result, more emphasis was placed on shoulder control during the intervention. The subjects participated in three exercise sessions per week. Exercises of increasing difficulty in terms of movement plane, ROM, number of repetitions, speed and resistance were performed. Two indicators were used to determine the level of difficulty of the exercises: quality

Fig. 1. Representation of the scapular rotations around the Y, X and Z axes. The scapular rotations are defined in accordance with the ISB recommendations. The sequence of rotations used is YsXsZs.


of shoulder motion and perceived intensity of pain. The intervention started with shoulder control exercises during arm elevation in the frontal, sagittal and scapular planes. These exercises were progressed following a 6-phase retraining program and began under the close supervision of the physiotherapist, who directed the retraining with feedback (Table 2). The retraining phases were graded according to: (1) the level of resistance applied on the shoulder during arm elevation (no resistance/passive movement; active assisted; active with or without external resistance); and (2) the use or non-use of feedback during the movement. The phases ranged from no resistance with feedback to active movement with external resistance without feedback. In each retraining phase, the ROM was gradually increased as shoulder control improved until proper control was achieved for the full ROM in each vertical plane. When the subject was able to perform a series of 10 repetitions with proper control, further series were added until three were reached. The subject then moved up to the next phase. At the end of each session, exercises in diagonal planes were performed. Subjects had to touch targets in a determined sequence, which took into account the maximum ROM they were able to reach in each vertical plane. Once abduction up to a range of 90° was properly controlled, humeral lateral rotation at 90° of abduction was performed. When proper control was achieved with supervision, the exercise was practiced alone as a home exercise. The criterion to start strengthening exercises was the ability to perform pain-free arm elevations with a resistance of 0.45 kg. The strengthening exercises were humeral medial and lateral rotation at 0° of shoulder abduction using Thera-Bands (red to blue level), push-ups with a progression from a vertical wall to standard horizontal push-ups, and horizontal arm abduction in supine performed with a dumbbell (starting with 0.45 kg). The number of repetitions was increased from one to three series of 10. When three series were easily performed, resistance was progressively increased.

2.6. Phase A2: post-intervention

At the end of phase B, an individualized home exercise program was given. The content of this program was determined according to the level of shoulder control and strength reached at the end of phase B and was reviewed at the two subsequent visits.

2.7. Data analysis

Outcome values obtained in phases B and A2 were compared to the pre-intervention values using two standard deviations above and below the pre-intervention mean (A1-interval). For the outcomes measured on a weekly basis (SPADI; painful arc of motion), two consecutive SPADI scores outside the A1-interval or an absence of a painful arc of motion for two consecutive evaluations were necessary to conclude that a significant change had occurred in the corresponding B and A2 phases. For the outcomes measured less frequently (peak torque; pain during peak torque), one measurement outside the A1-interval was necessary to conclude that a significant change had occurred in phases B and A2. Finally, the differences between the 3DSA of phase A1 and the 3DSA of phases B and A2 were calculated and illustrated graphically to describe the direction of changes during the study.
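The decision rule described for the weekly outcomes can be sketched in a few lines of Python. This is an illustrative reading of the rule rather than the authors' analysis code: it assumes the sample standard deviation of the phase A1 values, the function names are hypothetical, and the scores below are invented for the example and are not study data.

```python
from statistics import mean, stdev

def a1_interval(baseline, k=2.0):
    """Band of k standard deviations around the pre-intervention mean."""
    m = mean(baseline)
    s = stdev(baseline)  # sample SD; an assumption of this sketch
    return m - k * s, m + k * s

def first_significant_change(scores, interval, run=2):
    """Index at which `run` consecutive scores first fall outside the band.

    Returns None when no such run exists. With run=1 this corresponds
    to the rule stated for the less frequently measured outcomes.
    """
    low, high = interval
    streak = 0
    for i, score in enumerate(scores):
        if score < low or score > high:
            streak += 1
            if streak >= run:
                return i - run + 1
        else:
            streak = 0
    return None

# Invented SPADI-like scores, for illustration only.
baseline_a1 = [50, 54, 52]          # three phase A1 evaluations
later_scores = [49, 47, 50, 45, 44]  # weekly evaluations afterwards

band = a1_interval(baseline_a1)
print(band, first_significant_change(later_scores, band))
```

Requiring two consecutive points outside the band guards against declaring a change on the basis of a single unusual weekly score.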

Table 2 Phases for retraining of shoulder control and manual feedback given according to scapular dyskinesis.

Steps for retraining of shoulder control

| Phase | Step 1 | Step 2 | Step 3 | Step 4 |
|---|---|---|---|---|
| 1a | Passive elevation | Final position actively kept for 5 sec | Active return with manual feedback if needed | Verbal feedback |
| 2a | Active assisted elevationb | Final position actively kept for 5 sec | Active return with manual feedback if needed | Verbal feedback |
| 3a | Active elevation with manual feedback if needed | Final position actively kept for 5 sec | Active return with manual feedback if needed | Verbal feedback |
| 4a | Phase 3, but without manual feedback | | | |
| 5 | Phase 4, but without visual feedback | | | |
| 6 | Phase 5, but with the elevation performed faster, and then with a load | | | |

| Type of dyskinesis | Description of the scapular dyskinesis | Manual feedback |
|---|---|---|
| 1 | Decrease of the scapular lateral rotation | Guidance of lateral rotation with a lateral pressure on the inferior angle of the scapula |
| 2 | Tilt of the scapular inferior angle | Restriction of the tilt with an anterior pressure on the inferior angle of the scapula |
| 3 | Elevation of the superior border of the scapula | Restriction of the scapular elevation with an inferior pressure on the acromion |
| 4 | Tilt of the medial scapular border | Restriction of the tilt with an anterior pressure on the medial border of the scapula |

a In front of a mirror.
b Movement assisted by the physiotherapist to reduce the load on the shoulder.


3. Results

Seven of the eight subjects showed significant improvement in the SPADI during phase B. For five subjects, the improvement started following the first intervention week, whereas for two subjects, the improvement started following the second week. All eight subjects showed significant improvement during phase A2 (Fig. 2).

In flexion, one subject (subject 5 [S5]) did not experience a painful arc of motion, while the seven other subjects presented a painful arc of motion during phase A1. During phase B, only one subject (S8) presented significant improvement, with disappearance of pain during flexion for two consecutive evaluations. In phase A2, five subjects (S1, S2, S3, S4, S8) presented significant improvement. At the last evaluation, only one subject (S7) had a painful arc of motion.

In abduction, all eight subjects presented a painful arc during phase A1. Two subjects showed significant improvement during phase B (S2, S5) and six during phase A2 (S2, S3, S4, S5, S7, S8). Two subjects (S6, S7) still presented a painful arc in abduction at the last evaluation.

A significant increase in isometric abduction peak torque was seen at the end of phases B and A2 for one subject and at the end of phase B for two subjects (Fig. 3). In lateral rotation, a significant increase in peak torque was found in only one subject following phase B, and in three subjects following phase A2 (Fig. 3). Six of the eight subjects (S1, S3, S4, S5, S6 and S8) exhibited significant

Fig. 2. Profile of the SPADI scores. Profiles of the SPADI scores over the three phases of the study (pre-intervention [A1], intervention [B] and post-intervention [A2]). The grey band represents two standard deviations above and below the pre-intervention mean and the line in the middle of this band indicates the mean (n = 3) value during phase A1. The * indicates significant changes in the SPADI during phases B and A2.


Fig. 3. Isometric peak torque in abduction and lateral rotation. The * indicates significant changes in the peak torque during phases B and A2.

reduction of pain intensity during strength testing following phase A2 in abduction and lateral rotation.

For the 3DSA in abduction, posterior tilting was increased for seven subjects following phase B and was still increased for five subjects at the end of phase A2; lateral rotation was increased for five subjects following phase B and for six subjects following phase A2; finally, protraction was increased for seven subjects following phases B and A2 (Fig. 4). In flexion, posterior tilting was increased for five subjects following phase B and for six subjects following phase A2; lateral rotation was increased for four subjects at the end of phases B and A2; finally, protraction was increased for four subjects following phases B and A2 (Fig. 5).

3.1. Compliance with the intervention

All eight subjects participated in the 12 supervised sessions and performed both shoulder control and strengthening exercises (Table 3). The shoulder control exercises were progressed for all subjects from exercises in the vertical and diagonal planes to exercises in lateral rotation at 90° of abduction. Strengthening exercises were begun between the third and seventh session with strengthening in medial and lateral rotations. Two subjects had to stop these exercises after three days because of an increased level of pain. S4 was the only subject who did not perform push-ups, because of pain during their execution. Finally, four subjects performed the horizontal abduction

Fig. 4. Three-dimensional scapular attitude at 90° of abduction. Phases B (white diamonds) and A2 (black triangles) 3DSA differences established in comparison with the mean pre-intervention (A1) phase 3DSA.

strengthening exercise. The four other subjects did not perform horizontal abduction since they had pain during its execution with a dumbbell of 0.45 kg.

4. Discussion

The present results suggest that a rehabilitation program based on motor control and strengthening exercises can effectively reduce shoulder pain and


Fig. 5. Three-dimensional scapular attitude at 70° of flexion. Phases B (white diamonds) and A2 (black triangles) 3DSA differences established in comparison with the mean pre-intervention (A1) phase 3DSA.

promote better function in persons with SIS. These improvements were accompanied, for most subjects, by a reduction in pain during maximal contractions and disappearance of the painful arc of motion. Interestingly, the improvement persisted after the end of the supervised intervention, suggesting that home exercises were sufficient to maintain or even enhance the benefits of the intervention. Our results support the findings of other studies that have shown the positive effects of rehabilitation in persons with SIS (Brox et al., 1999; Bang and Deyle, 2000; Ludewig and Borstad, 2003; McClure et al., 2004; Walther et al., 2004; Ginn and Cohen, 2005).

The main contribution of this study is to propose a 4-week exercise program, based mainly on motor control principles, that provides a fast improvement in shoulder pain and function. In comparison to previous studies in which exercises have been used to improve shoulder control in individuals with SIS, our results seem promising. Indeed, Conroy and Hayes (1998) reported no difference in pain following a supervised exercise program of similar duration (three weeks) but composed of other types of exercises (stretching and isometric strengthening). The addition of joint mobilization to their program led, however, to a better functional outcome. As in the present study, Ludewig and Borstad (2003) also observed a significant improvement in shoulder function following home exercises. However, their home exercise program was more than twice as long (10 weeks) as ours. Finally, improvement in shoulder function has also been demonstrated by Brox et al. (1999) following a much longer supervised exercise program of three to six months.

The intervention proposed in this study includes shoulder control exercises targeting the specific impairments described in patients with SIS (Ludewig and Cook, 2000; Borstad and Ludewig, 2002). More specifically, the exercises were designed, in part, to promote a larger amplitude of posterior tilting and lateral rotation of the scapula during arm elevation. Such changes in scapular rotations were not consistently found among subjects. Variability in the response to the intervention in a relatively small sample of subjects may explain this result. One can also argue that the measure used to quantify scapular rotations was not sensitive enough to capture changes that are relevant to function. When looking at individual data, changes of small magnitude were observed following the intervention for some subjects. They were mostly found in the sagittal plane, with larger posterior tilting amplitude. It is known that posterior tilting elevates the anterior part of the acromion and that the acromiohumeral distance in people with SIS is decreased by only 1.2–1.3 mm around 90° and 110° of arm elevation (Hébert et al., 2003b). Therefore, such small increases in posterior tilting could have resulted in less compression of the subacromial structures (Ludewig and Cook, 2000), which may have had an impact on overall shoulder pain and function.

Only small changes were observed in the isometric peak torques following the intervention. In the present study, more emphasis was put on exercises promoting better shoulder control in the first weeks of the intervention. Strengthening exercises were only introduced when proper shoulder control was achieved. Once started, strengthening exercises were progressed in order to gradually load the muscle-tendon-bone units without any


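Table 3 Description of the exercises performed during phase B.

| Subject | Vertical planes, From a | Vertical planes, For | Diagonal planes, From a | Diagonal planes, For | Lateral rotation at 90° of abduction, From a | Lateral rotation at 90° of abduction, For | Medial and lateral rotation with Thera-Band, From | Medial and lateral rotation with Thera-Band, For | Push-ups, From | Push-ups, For | Horizontal abduction in supine, From | Horizontal abduction in supine, For |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| S1 | 1 (4) | 12 | 2 | 11 | 7 | 3 | 4 | 9 | 6 | 7 | 9 | 4 |
| S2 | 1 (3) | 12 | 2 | 8 | 5 | 5 | 3 | 10 | 4 | 4 | 12 | 1 |
| S3 | 1 (4) | 12 | 4 | 7 | 6 | 3 | 4 | 9 | 5 | 8 | 8 | 5 |
| S4 | 1 (6) | 12 | 4 | 7 | 7 | 2 | 7 | 2b | NP | NP | NP | NP |
| S5 | 1 (4) | 12 | 2 | 11 | 2 | 11 | 4 | 9 | 6 | 7 | NP | NP |
| S6 | 1 (5) | 12 | 3 | 8 | 4 | 9 | 5 | 8 | 8 | 5 | NP | NP |
| S7 | 1 (4) | 12 | 2 | 8 | 2 | 11 | 7 | 3b | 8 | 5 | NP | NP |
| S8 | 1 (3) | 12 | 1 | 11 | 2 | 10 | 3 | 10 | 4 | 9 | 7 | 6 |

Bracketed values for the other two shoulder control exercises, in printed order: diagonal planes (7) (4) (8) (11) (8) (9) (8); lateral rotation at 90° of abduction (7) (11) (7) (7) (5). Their alignment with individual subjects is not preserved in this copy.

Abbreviations: From, session where the exercise was first performed; For, the total number of sessions where the exercise was performed; NP, exercise not performed.
a The number in brackets represents the session where the exercise was first performed with a dumbbell.
b The exercise had to be stopped because of an increased level of pain.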

setback in the pain level. In some subjects who experienced pain during strengthening exercises, these exercises had to be stopped or progressed more slowly than expected. One can hypothesize that tension or compression of the degenerated rotator cuff tendons may have been responsible for the increase in shoulder pain. Hence, the number of weeks during which they were performed was probably not large enough to bring about changes in shoulder strength. In comparison, McClure et al. (2004) observed significant gains in the isometric strength of the rotators and abductors of the shoulder following a 6-week program composed of more intense strengthening exercises. Undoubtedly, strengthening exercises help improve function in subjects with SIS. They should, however, be introduced at a proper stage during recovery to avoid pain recurrence and performed at a sufficient intensity to promote functional changes.

Although all the subjects showed improvement in shoulder pain and function, they did not reach a normal level at the last evaluation. A longer follow-up evaluation could have provided more information on the long-term outcomes and guided us on the need for some subjects to have a longer duration of supervised intervention. The home exercises performed during the pre-intervention phase may have introduced an additional source of variability in the measurements, potentially leading to larger 95% confidence intervals and a reduction in our capacity to detect changes. The use of a single-study design limits the generalizability of the results and, by performing repeated measurements of outcomes, bias may have been introduced. Finally, not having an independent evaluator may have reduced the strength of our conclusions. However, the use of a self-administered questionnaire as the primary outcome, together with standardized measurement procedures and valid outcomes, enhances confidence in our results.

This study has brought a deeper understanding of the mechanisms that led to the changes observed following the proposed program. However, a randomised controlled trial is needed to confirm the present findings.

5. Conclusions

Results of this study suggest that a 4-week program including motor control and strengthening exercises reduces shoulder pain and improves function in persons with SIS. To better understand how shoulder control is modified, further studies need to evaluate changes in muscle and interjoint coordination using electromyography and motion analysis systems. Nonetheless, this study provides preliminary evidence to support the use of shoulder control exercises to promote better function in people with SIS.

References

Backman CL, Harris SR, Chisholm JA, Monette AD. Single-subject research in rehabilitation: a review of studies using AB, withdrawal, multiple baseline, and alternating treatments designs. Archives of Physical Medicine and Rehabilitation 1997;78:1145–53.
Bang MD, Deyle GD. Comparison of supervised exercise with and without manual physical therapy for patients with shoulder impingement syndrome. Journal of Orthopaedic & Sports Physical Therapy 2000;30:126–37.
Borstad JD, Ludewig PM. Comparison of scapular kinematics between elevation and lowering of the arm in the scapular plane. Clinical Biomechanics (Bristol, Avon) 2002;17:650–9.
Brox JI, Gjengedal E, Uppheim G, Bohmer AS, Brevik JI, Ljunggren AE, et al. Arthroscopic surgery versus supervised exercises in patients with rotator cuff disease (stage II impingement syndrome): a prospective, randomized, controlled study in 125 patients with a 2 1/2-year follow-up. Journal of Shoulder and Elbow Surgery 1999;8:102–11.
Brox JI, Staff PH, Ljunggren AE, Brevik JI. Arthroscopic surgery compared with supervised exercises in patients with rotator cuff disease (stage II impingement syndrome). British Medical Journal 1993;307:899–903.
Conroy DE, Hayes KW. The effect of joint mobilization as a component of comprehensive treatment for primary shoulder impingement syndrome. Journal of Orthopaedic & Sports Physical Therapy 1998;28:3–14.
Cools AM, Witvrouw EE, Declercq GA, Danneels LA, Cambier DC. Scapular muscle recruitment patterns: trapezius muscle latency with and without impingement symptoms. American Journal of Sports Medicine 2003;31:542–9.
Ginn KA, Cohen ML. Exercise therapy for shoulder pain aimed at restoring neuromuscular control: a randomized comparative clinical trial. Journal of Rehabilitation Medicine 2005;37:115–22.
Graichen H, Bonel H, Stammberger T, Haubner M, Rohrer H, Englmeier KH, et al. Three-dimensional analysis of the width of the subacromial space in healthy subjects and patients with impingement syndrome. American Journal of Roentgenology 1999;172:1081–6.
Hébert LJ, Moffet H, McFadyen BJ, St-Vincent G. A method of measuring three-dimensional scapular attitudes using the Optotrak probing system. Clinical Biomechanics (Bristol, Avon) 2000;15:1–8.
Hébert LJ, Moffet H, Dionne CE, McFadyen BJ, Dufour M, Lirette R. Shoulder impingement syndrome: clinical indicators and short-term predictors of disability. Archives of Physical Medicine and Rehabilitation 2003a;84:A7.
Hébert LJ, Moffet H, Dufour M, Moisan C. Acromiohumeral distance in a seated position in persons with impingement syndrome. Journal of Magnetic Resonance Imaging 2003b;18:72–9.
Leroux JL, Codine P, Thomas E, Pocholle M, Mailhe D, Blotman F. Isokinetic evaluation of rotational strength in normal shoulders and shoulders with impingement syndrome. Clinical Orthopaedics and Related Research 1994:108–15.
Ludewig PM, Borstad JD. Effects of a home exercise programme on shoulder pain and functional status in construction workers. Occupational Environmental Medicine 2003;60:841–9.
Ludewig PM, Cook TM. Alterations in shoulder kinematics and associated muscle activity in people with symptoms of shoulder impingement. Physical Therapy 2000;80:276–91.
Matsen FA, Arntz CT. Subacromial impingement. In: Rockwood CA, Matsen FA, editors. The Shoulder. 9th ed. Philadelphia: WA Saunders Co; 1990. p. 623–46.
McClure PW, Bialker J, Neff N, Williams G, Karduna A. Shoulder function and 3-dimensional kinematics in people with shoulder impingement syndrome before and after a 6-week exercise program. Physical Therapy 2004;84:832–48.
Michener LA, Walsworth MK, Burnet EN. Effectiveness of rehabilitation for patients with subacromial impingement syndrome: a systematic review. Journal of Hand Therapy 2004;17:152–64.
Myers JB, Hwang JH, Pasquale MR, Rodosky MW, Ju YY, Lephart SM. Shoulder muscle coactivation alterations in patients with subacromial impingement. Medicine & Science in Sports & Exercise 2003;35(5):S346.
Reddy AS, Mohr KJ, Pink MM, Jobe FW. Electromyographic analysis of the deltoid and rotator cuff muscles in persons with subacromial impingement. Journal of Shoulder and Elbow Surgery 2000;9:519–23.
Roach KE, Budiman-Mak E, Songsiridej N, Lertratanakul Y. Development of a shoulder pain and disability index. Arthritis Care & Research 1991;4:143–9.
Roy JS, Moffet H, Hebert LJ, St-Vincent G, McFadyen BJ. The reliability of three-dimensional scapular attitudes in healthy people and people with shoulder impingement syndrome. BMC Musculoskeletal Disorders 2007;8:49.
Wadsworth DJ, Bullock-Saxton JE. Recruitment patterns of the scapular rotator muscles in freestyle swimmers with subacromial impingement. International Journal of Sports Medicine 1997;18:618–24.
Walther M, Werner A, Stahlschmidt T, Woelfel R, Gohlke F. The subacromial impingement syndrome of the shoulder treated by conventional physiotherapy, self-training, and a shoulder brace: results of a prospective, randomized study. Journal of Shoulder and Elbow Surgery 2004;13:417–23.
Warner JJ, Micheli LJ, Arslanian LE, Kennedy J, Kennedy R. Patterns of flexibility, laxity, and strength in normal shoulders and shoulders with instability and impingement. American Journal of Sports Medicine 1990;18:366–75.
Wu G, van der Helm FC, Veeger HE, Makhsous M, van Roy P, Anglin C, et al. ISB recommendation on definitions of joint coordinate systems of various joints for the reporting of human joint motion – part II: shoulder, elbow, wrist and hand. Journal of Biomechanics 2005;38:981–92.


Manual Therapy 14 (2009) 189–196 www.elsevier.com/math

Original Article

Physiotherapists’ use of advice and exercise for the management of chronic low back pain: A national survey

S. Dianne Liddle a,*, G. David Baxter b, Jacqueline H. Gracey a

a Health and Rehabilitation Sciences Research Institute, University of Ulster, Shore Road, Newtownabbey, Northern Ireland
b Centre for Physiotherapy Research, School of Physiotherapy, University of Otago, New Zealand

Received 31 January 2007; received in revised form 25 January 2008; accepted 30 January 2008

Abstract

The objective of the study was to establish the specific use of advice and exercise by physiotherapists for the management of chronic low back pain (LBP). A questionnaire was mailed to a random sample of 600 members of the Irish Society of Chartered Physiotherapists. Open and closed questions were used to obtain information on treatments provided to chronic LBP patients. Respondents’ treatment goals were also investigated, along with the typical methods used to assess treatment outcome. Four hundred and nineteen of the sample returned the questionnaire; 280/419 (67%) indicated that they currently treated LBP, of whom 76% (n = 214) were senior grade therapists. Advice and exercise, respectively, were the treatments most frequently used for chronic LBP: advice was most commonly delivered as part of an exercise programme, with strengthening (including core stability) the most frequently used exercise type. Supervision of exercise and follow-up advice were underutilised with respect to the recommendations of relevant clinical guidelines. Pain relief was an important treatment goal. Emphasis on exercise programme supervision, incorporating reassurance that it is safe to stay active and that ‘hurt does not mean harm’, must be more effectively disseminated and promoted in practice. The influence of follow-up advice on exercise adherence warrants further investigation. © 2008 Elsevier Ltd. All rights reserved.

Keywords: Advice; Exercise; Chronic low back pain; Adherence

* Corresponding author. Room 1F114, Health and Rehabilitation Sciences Research Institute, University of Ulster, Shore Road, Newtownabbey, Co. Antrim BT37 OQB, Northern Ireland. Tel.: +44 02890 366423; fax: +44 02890 368068. E-mail address: [email protected] (S.D. Liddle).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.012

1. Introduction

The intractability of chronic low back pain (LBP; i.e. symptoms > 12 weeks or 3+ recurrent episodes within 12 months) has led to the adoption of a wide variety of treatment approaches by healthcare professionals (Cherkin, 1998; Foster et al., 1999; Gracey et al., 2002; Armstrong et al., 2003; Snook, 2004), with variable results (Cottingham and Maitland, 1997; Carpenter and Nelson, 1999; Miller and Timson, 2004). The resulting socioeconomic implications have been identified by various authors (Maniadakis and Gray, 2000; Bartley and Coffey, 2001; Ehrlich, 2003; Speed, 2004; Waddell, 2004). Current research evidence supports the use of advice and exercise for the management of chronic LBP (Hilde et al., 2002; Liddle et al., 2004; van Tulder et al., 2004; Hayden et al., 2005; Liddle et al., 2007a), and previous surveys investigating the physiotherapeutic management of LBP throughout the UK and Ireland have highlighted the popularity of these treatments (Foster et al., 1999; Gracey et al., 2002; Byrne et al., 2006). The study by Kerssens et al. (1999) is one of the few to have investigated the advice given to LBP patients; their report, compiled from a database of private practice


S.D. Liddle et al. / Manual Therapy 14 (2009) 189–196

consultations within the Netherlands, concluded that physiotherapists’ advice for LBP management is often dependent on the individual therapist, with many differences between therapists in the amount of information provided during treatment, and in the provision of follow-up support after treatment. These findings, along with the conclusions from a recently published systematic review of the use of exercise for chronic LBP (Hayden et al., 2005), have underlined the benefits of individually tailored exercise programmes, and the influence of supervision during treatment on adherence. Similarly, when compared to a general exercise programme, one that was individually tailored to the needs and capabilities of the patient was shown to be more effective in reducing the disability and pain experienced by subacute and chronic LBP patients (Descarreaux et al., 2002). The maintenance of exercise-induced gains is often the most challenging aspect of exercise prescription, being intricately related to the successful integration of exercise science with behavioural techniques, in order to promote adherence and individual goal achievement (ACSM, 2000, p. 140).

Kerssens et al. (1999) concluded that the majority of advice or information that was being given to LBP patients was specifically related to home exercises and back care instructions. There is now strong evidence from randomised controlled trials (RCTs) using advice for the management of chronic LBP to support the use of advice to remain active, in addition to specific advice relating to the most appropriate exercise and/or functional activities for each individual patient (Liddle et al., 2007a). European guidelines for the management of chronic LBP (Airaksinen et al., 2004) support the above; however, there is evidence to suggest that such guidelines and recommendations are frequently not applied in practice (Armstrong et al., 2003; Grol and Buchan, 2006).

No previous LBP surveys have specifically investigated the use of both advice and exercise for the management of chronic LBP in current practice, in particular the type and frequency of advice and exercise being offered. Therefore, the aim of this survey was to establish the relative importance of advice and exercise for the management of chronic LBP amongst physiotherapists practising throughout Ireland, with a specific focus on how these treatment approaches would typically be provided. In addition, given the inherent association between therapists’ treatment goals and their choice of treatment, respondents’ treatment goals and typical methods of assessing the outcome of treatment were also investigated.

2. Methods 2.1. Survey design A cross-sectional (self-administered) postal questionnaire was developed to investigate physiotherapists’ use

of advice and exercise for the management of chronic LBP: the speciﬁc information requested within this study precluded the use of any previously validated questionnaire for this purpose. For the purposes of this survey, chronic LBP was deﬁned as representing patients with symptoms of greater than 12 weeks duration, or those with 3 or more recurrent episodes within the previous 12 months (Liddle et al., 2004). Questions were based on the ﬁndings of two systematic reviews carried out by the authors: the ﬁrst of which investigated the type and frequency of advice provided for LBP, and the other the type and quality of exercise for chronic LBP patients (Liddle et al., 2004). A sample questionnaire is included in the Appendix (published online). The main survey was conducted between March and June of 2004. 2.2. Sampling frame No ethics framework existed in the Republic of Ireland at the time this study was undertaken. The study protocol and questionnaire were reviewed and approved by the executive board of the Irish Society of Chartered Physiotherapists (ISCP), following which a random sample (n ¼ 600) of ISCP members was provided for the survey: a stratiﬁed systematic sampling procedure was employed. This sample size was agreed with a statistician, based on the power calculation (from the results of a pilot study), and an anticipated 50e60% response rate. The power calculation indicated that between n ¼ 216 and n ¼ 385 respondents were required for 80% power, and 95% conﬁdence that the response ratio to a given question was not due to chance. As it was not possible to identify each therapist’s practicing area of expertise, in order to identify the most relevant subgroups, information on identiﬁed clinical interests of therapists was obtained from the database. 
From a total membership of approximately n = 1600 therapists, the random sample was subsequently drawn from n = 1000 therapists with at least one of the following clinical interests: acupuncture, manipulative therapy, sports medicine, women’s health, the workplace, community care, education, private practice. This procedure was adopted in an attempt to reduce the number of nonresponders, and respondents who were not employed in a setting that included the treatment of LBP patients.

2.3. Questionnaire

A pilot survey was undertaken to ensure the relevance of the content, and clarity of questions in the questionnaire. The final questionnaire contained a combination of 23 open and closed questions, divided into four sections: therapist’s background and LBP experience; information on the therapist’s management strategies for chronic LBP patients, and factors influencing their

S.D. Liddle et al. / Manual Therapy 14 (2009) 189–196

prescription of exercise and advice; the specific type, frequency and mode of delivery of advice and exercise offered to chronic LBP patients; therapists’ use of outcome measures and treatment goals with chronic LBP patients. Closed-ended questions predominated, with the use of a ranked or descriptive answer format: this was intended to enhance question relevance, to allow direct comparisons between respondents, to facilitate objective analysis, and to optimise the level of data analysis where possible (Hicks, 1999, p. 20). The anonymity and confidentiality of the survey design were intended to minimise the influence of social desirability on responses (Metcalfe et al., 2001).

The FORMIC software package (Formic limited, London) was used to simplify questionnaire layout, and to allow automated raw data entry into the Statistical Package for the Social Sciences for Windows, Version 11 (SPSS Inc., Cary, NJ; 1989–2001). In order to accurately monitor response rate, each questionnaire was allocated a unique identification number. Those respondents not employed in a setting that included the treatment of LBP were given the option (rather than failing to respond) of answering the first four questions, which provided information on therapist background and experience. Each questionnaire package contained a hand-signed covering letter explaining the study, and a postage-paid, preprinted return envelope (Edwards et al., 2002). Five weeks after the initial distribution of questionnaires, a reminder questionnaire package was sent to nonresponders (Edwards et al., 2002). Postage-paid preprinted postcards with simple ‘tick and return’ response options were then mailed to remaining non-responders (n = 181) after a further six weeks to investigate reasons for non-response, with a further four weeks allocated for replies.

2.4. Data analysis

Automated computer scanning was used to input answers to closed questions; responses to open questions were inputted manually. Apart from descriptive statistics (presented for all responders; n = 419), the remaining statistical analyses were completed on data from responders currently treating LBP (n = 280). As data were not normally distributed and measured on ordinal scales, non-parametric statistics were used. Where appropriate, the level of significance was set at p < .05. Chi-square (χ²) analysis was used to explore the relationship between two variables. The Friedman Test (the non-parametric equivalent of the one-way analysis of variance (ANOVA)) was used to determine if a significant difference existed between mean ranks: if a significant difference was found, then the Wilcoxon signed rank test was used to compare each rank to establish where the differences occurred. The Kruskal Wallis Test (the non-parametric equivalent of a one-way between groups ANOVA) was used to explore whether the number of treatment sessions provided to chronic LBP patients changed in relation to the therapists’ clinical grade. All statistical tests were carried out using Statistical Package for the Social Sciences for Windows, Version 11 (SPSS Inc., Cary, NJ; 1989–2001).

3. Results

3.1. Respondents

There was a 70% response rate to the survey (n = 419); 67% of respondents (n = 280/419) indicated that they currently treated LBP patients, and therefore completed the entire questionnaire: 43% (n = 119/280) were employed in a public hospital, and 41% (n = 115/280) in private practice. The remaining respondents (n = 139) did not treat LBP: 93 were employed in a public or private hospital, 34 in community care and learning disabilities, 5 in private practice, and 7 in third level education. Twenty-seven percent (n = 49) of nonresponders returned the pre-paid postcard. The most common reason given for not responding was ‘do not treat LBP’; ‘other’ reasons, such as being on (study) leave, or a career break, were also given. Since 73% (n = 132) of non-responders did not return the prepaid postcard, these results represent a very limited description of non-responders. The clinical profile of those respondents currently treating LBP (n = 280) is presented in Table 1: 78% (n = 217) were experienced clinicians with at least six years of experience treating LBP, and 44% (n = 122) had been qualified for more than 10 years. Fig. 1 details the professional development spinal courses completed by respondents.
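The rank-based comparison described in Section 2.4 can be illustrated with a small sketch. This is not the authors’ SPSS analysis: it is a pure-Python illustration of how mean ranks and the Friedman chi-square statistic are obtained, using four invented respondents who each rank three treatments. The coding (1 = most frequently used), the data and the function names are all assumptions made for the example, and no correction for ties is applied to the statistic.

```python
def average_ranks(values):
    """Rank values from 1 (smallest) upwards; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def friedman(rows):
    """Return (mean rank per column, Friedman chi-square without tie correction).

    rows: one list per respondent, one score per treatment.
    """
    n, k = len(rows), len(rows[0])
    ranked = [average_ranks(row) for row in rows]
    mean_ranks = [sum(r[col] for r in ranked) / n for col in range(k)]
    expected = (k + 1) / 2  # mean rank of every column under the null hypothesis
    q = 12 * n / (k * (k + 1)) * sum((m - expected) ** 2 for m in mean_ranks)
    return mean_ranks, q


# Hypothetical data: columns are advice, active exercise, mobilisation.
rows = [
    [1, 2, 3],
    [1, 2, 3],
    [2, 1, 3],
    [1, 3, 2],
]
mean_ranks, q = friedman(rows)
print("mean ranks:", mean_ranks)
print("Friedman chi-square:", q, "with df =", len(rows[0]) - 1)
```

The statistic is referred to a chi-square distribution with k − 1 degrees of freedom; a significant result would then be followed by pairwise Wilcoxon signed rank tests, as the text describes.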

3.2. Chronic LBP management

NB: The following analyses concentrate only on those respondents currently treating LBP patients, i.e. n = 280.

Table 1
Clinical profile of respondents currently treating LBP (n = 280).

Clinical grade         Place of work                                                            Total (%)
                       Public hospital   Private hospital   Private practice   Community care
Basic                  47                5                  1                  1                54 (19)
Senior                 66                11                 13                 19               109 (39)
Clinical specialist    2                 –                  1                  –                3 (1)
Manager                4                 2                  –                  3                9 (3)
Private practitioner   –                 5                  100                –                105 (38)
Total                  119               23                 115                23               280 (100)

Fig. 1. Percentage of respondents having completed each professional development spinal course (n = 280). Bar chart; x-axis: name of professional development spinal course; y-axis: % of respondents. Key: Mb/P, muscle imbalance/pilates; NG Pain, neurogenic pain management; MACP, Manipulation Association of Chartered Physiotherapists Manual Therapy Course; MSc MT, Masters level in Manual Therapy.

Respondents were asked to rank the type(s) of treatment they used most frequently with chronic LBP patients; higher ranks indicating more frequent use. Advice, active exercise(s), and mobilisation techniques were ranked first, second and third, respectively (see Table 2). The differences in rank (based upon averaged individual rankings) between these three most popular treatments were all statistically significant (Wilcoxon Signed Ranks Test p < .001). Exercise accounted for the greatest amount of total treatment time respondents gave to chronic LBP patients (median = 40%); however, advice and ‘other treatments’ each accounted for a median of 30% of treatment time. Acupuncture, manipulation, and traction were typically not ranked by respondents. The number of treatment sessions most often provided ranged between 6 and 10 (64% of respondents, n = 179), with a further 27% providing a maximum of 5 sessions. Only 9% of respondents indicated that they would provide greater than 10 sessions. There were no significant differences in the number of treatment sessions provided in relation to the therapist’s clinical experience (Kruskal Wallis Test df = 4, p = .238).

Table 2
Treatments provided for chronic LBP patients listed by rank frequency.

Name of treatment provided                    Mean rank   % As first rank
Advice                                        2.06        61.8
Active exercises                              2.56        51.1
Mobilisation techniques                       3.50        25.4
McKenzie                                      5.44        6.8
Electrotherapy (including ice or heat)        5.65        8.6
Neurogenic pain techniques/neural tension     6.24        3.2
Massage                                       6.84        5.0
Traction                                      7.59        1.8
Manipulation (grade v)                        8.28        1.1
Acupuncture                                   8.85        2.9
Other                                         9.00        3.6

3.3. Duration of LBP

The chronic LBP subgroup represented the largest proportion of respondents’ LBP caseloads (Friedman’s Chi square = 28.690, df = 2, p < .001). Chi-square analysis revealed a significant association (χ² = 126.343, p < .001) between the place of work and the proportion of chronic LBP patients included in each respondent’s LBP caseload: of those working in public hospitals, 50% (n = 139) indicated that chronic LBP formed the largest proportion of their LBP caseload, compared to 25% (n = 70) of private practitioners.

3.4. Type and frequency of advice

Table 3 shows the mean rank given by respondents to each type of advice. There were no significant differences in the type of advice given in relation to the place of work, or the clinical experience of the therapist. Respondents typically gave advice during the treatment session along with some form of supplementary information, e.g. booklet or exercise sheet. Follow-up advice after the last treatment session was not typically provided. Respondents considered the patient’s age as having the most influence on the advice they provided to chronic LBP patients; cognitive/behavioural factors and current clinical guidelines were also considered highly influential.

Table 3
Type of advice provided for chronic LBP patients listed by rank frequency.

Type of advice                                         Mean rank   % As first rank
Advice as an adjunct to exercise                       2.33        44.3
Advice as part of a functional restoration approach    2.41        42.9
Advice to stay active                                  3.20        28.6
Advice with another intervention                       3.26        28.6
Advice as part of a back school approach               3.80        16.8

3.5. Type and frequency of exercise

Ninety-eight percent (n = 273) of respondents frequently used exercise to manage chronic LBP, and expected patients to carry out home exercises. However, only 56% (n = 156) provided a supervised exercise programme. Strengthening exercise (including core stability) was most frequently used by respondents, regardless of their clinical experience, with flexibility exercise ranked second, and aerobic exercise ranked third: there were statistically significant differences between all three ranks (Wilcoxon Signed Ranks Test, p < .001). The most influential factor for respondents when prescribing exercise to chronic LBP patients was the patient’s current pain intensity; cognitive factors were also considered important. Clinical guidelines were considered by respondents to have little influence on the exercise they prescribed (only n = 16 respondents indicated that clinical guidelines influenced their prescription of exercise to chronic LBP patients).

3.6. Respondents’ treatment goals and assessment of treatment outcome

Respondents’ treatment goals are ranked in order of importance in Table 4. There was no significant difference in the rank importance that respondents gave to improved function when compared to pain relief (ranked second) (Wilcoxon Signed Ranks Test, p = .206). The importance given by respondents to pain relief is reflected in the use of pain-related outcome measures: pain intensity was used by 70% (n = 195) of respondents to assess treatment outcome. An assessment of the patient’s satisfaction with treatment was also considered important by 60% (n = 168) of respondents. In contrast, back-specific functional outcomes, such as the Roland Morris or Oswestry Disability Questionnaires, were used by only 16% (n = 44) and 12% (n = 33) of respondents, respectively; ‘other’ outcome measures were used by 14% (n = 40) of respondents.

Table 4
Respondents’ treatment goals listed by rank importance.

Treatment goal                                  Mean rank   % As first rank
Improved function                               2.34        47.9
Pain relief                                     2.56        55.7
Return to work/usual activities                 2.84        35.0
Change patient perceptions about chronic LBP    4.29        19.3
Prevent recurrence                              4.36        10.4
Increased spinal range of movement              4.82        5.4
Others                                          6.79        0.7

4. Discussion

The principal findings of this national survey indicate that the most frequently used treatments adopted for chronic LBP, within the Irish health system (public or private sectors), are advice and exercise respectively. However, despite current recommendations that it is safe for this patient subgroup to remain active, that ‘hurt does not mean harm’, and respondents’ recognition of the primary importance of functional improvement, it appears that pain relief continues to be a major treatment priority for physiotherapists. Whilst respondents regularly prescribed home exercises, they did not appear to be routinely providing supervised exercise classes. This is despite the potential role of supervision in enhancing exercise adherence and thus treatment outcomes (Sluijs et al., 1993; ACSM, 2000, p. 162; Liddle et al., 2004). In addition, follow-up advice after the last scheduled face-to-face treatment, to provide support and promote long-term self management, does not appear to be a common treatment strategy. Perhaps if therapists devoted more time to incorporating supervision and follow-up, the maintenance of exercise gains in the longer term could be facilitated, helping to reduce the socioeconomic burden of chronic LBP. More investigation into why supervision and follow-up are not commonly provided is necessary to tackle this apparent shortfall in practice.

4.1. Chronic LBP management

While advice and exercise were clearly the most frequently used treatments for chronic LBP (62% and 51% of respondents, respectively), mobilisation techniques were also popular (ranked third); 25% (n = 71) of respondents indicated that they used mobilisation techniques most frequently. Respondents indicated that they typically included ‘other treatments’ with advice and exercise, and spent similar proportions of time on each. However, this trend is typical in practice, having been reported by previous authors investigating LBP management (Foster et al., 1999; Kerssens et al., 1999; Li and Bombardier, 2001; Gracey et al., 2002). The variations in treatment approach did not appear to influence the number of treatment sessions provided, with 6–10 sessions being the norm. The evidence from randomised controlled trials of chronic LBP and exercise (Liddle et al., 2004), and guidelines for exercise prescription and behaviour change (ACSM, 2000, p. 154), would suggest that it is unlikely that 6–10 sessions represents an adequate time frame for individuals with longstanding pain to develop adequate ‘self-management’ strategies, and functionally-related goals that are the necessary pre-requisites for effective long-term symptom management.

4.2. Type and frequency of advice

The frequent use of advice reported in this survey suggests that physiotherapists, regardless of their place of work or clinical experience, are aware of the need to encourage individuals during treatment to increase their activity, and learn to incorporate such changes into their daily lifestyle. Respondents also appear to be aware of the need to tailor advice according to the patient’s age, previous treatment experiences and beliefs, and to reflect current guidelines. It was reported that provision of advice throughout the treatment programme commonly included supplementary written

information as an additional guide for patients. It has been suggested that refresher programmes (following a course of treatment) may help to maintain the positive results of treatment for chronic LBP patients (Harkapaa et al., 1989; Bendix et al., 1998); however, the provision of follow-up advice is rare even in randomised controlled trials (Liddle et al., 2007a). Given that a large proportion of respondents reported frequently treating chronic LBP, perhaps the provision of follow-up advice after the last treatment session could help to decrease barriers to exercise (Middleton, 2004), reinforce the advice given during treatment, and ensure continued adherence to exercise and activity programmes: this hypothesis warrants further investigation.

4.3. Type and frequency of exercise

The majority of respondents currently treating LBP patients indicated that they frequently used exercise for the management of chronic LBP (n = 273/280). Core stabilisation has been identified as an important component of exercise programmes for chronic LBP patients, and it was clearly valued by those physiotherapists responding to this survey; of those respondents who indicated what type of active exercise they actually used, core stabilisation was by far the most popular. This finding is in keeping with those of a smaller scale survey of physiotherapists working in the acute hospital sector in Ireland (Byrne et al., 2006). It is unclear from this survey why supervision is not a common component of treatment, given its recognised value in exercise prescription (ACSM, 2000, p. 162), and support for it within clinical guidelines (Airaksinen et al., 2004); however, the fact that only n = 16 respondents indicated that clinical guidelines influenced their prescription of exercise to chronic LBP patients would suggest that perhaps current expectations about the impact of clinical guidelines are unrealistic, and therefore future guideline development should focus on the end-user, providing clear statements and educational materials (Grol and Buchan, 2006). The findings from a recent qualitative study underscore the importance that chronic LBP patients place on the provision of exercise programme supervision, not only for enhancing adherence but also for general reassurance (Liddle et al., 2007b).

4.4. Outcomes and goals of treatment

Pain relief, whilst important, is not widely considered a primary goal in chronic LBP management; rather, recent authors and groups have emphasised the importance of improving functional activities despite pain (Rainville et al., 1997; Davey and Broadbent, 1998; Frost et al., 2000; Deyo and Weinstein, 2001; Cohen and Rainville, 2002; Lively, 2002). This is also reflected in recommendations for outcome assessment in chronic LBP, where improved function and return to work are two of the five proposed ‘core’ categories of outcome measure recommended for use with such patients (Deyo et al., 1998; Bombardier, 2000; Bombardier et al., 2001). This notwithstanding, the results of this survey highlight the emphasis still being placed by therapists on pain relief (as previously reported by Foster et al., 1999), and the apparent lack of use of clinically relevant outcome measures. However, this is a problem that is not confined to clinical practice, as this has also been highlighted as a weakness within clinical trials in this area (Liddle et al., 2004; Liddle et al., 2007a). Interestingly, pain intensity had a much greater influence on exercise prescription than level of function (66% versus 7% of respondents, respectively). The underlying reasons for this are unclear; however, it may represent an attempt by physiotherapists to incorporate patients’ treatment expectations more directly into their management of chronic LBP (Liddle et al., 2007b), and/or the influence that therapists’ attitudes to back pain may have on their treatment decisions (Pincus et al., 2005). Respondents clearly realise the value of improved function as a goal of treatment (ranked first), but the assessment of functional improvement appears to rely on subjective report and opinion: 33% of respondents who reported using ‘other’ categories of outcome measure used unvalidated subjective measures of functional improvement. This may be the result of limited time, availability and/or a lack of emphasis being placed on the clinical relevance of specific categories of outcome measure.

4.5. Limitations of the study

The principal limitation of this study was the sample size of respondents currently treating LBP (n = 280). It is important to note that the overall response rate was 70% (n = 419); however, only 47% of the surveyed sample (n = 280/600) treated LBP patients, and therefore completed the whole questionnaire.
This factor limits the generalisability of the results, and underlines the need to make comparisons with current practice in other countries and healthcare settings (Byrne et al., 2006). It must also be borne in mind that those who take the time to respond to a questionnaire may be diﬀerent from those who do not, therefore the results of a survey cannot necessarily be generalised beyond those who have responded (Domholdt and Malone, 1985). In addition, the inﬂuence of social desirability bias on responses cannot be excluded. Finally the authors acknowledge that the closed response format, predominantly used throughout this survey, may have led to diﬀering interpretations of questions (Metcalfe et al., 2001); however, given the multifaceted nature of chronic LBP and its management, it was considered necessary to have some means of quantifying the data for statistical analysis.
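The response figures quoted in this section and in Section 3.1 use different denominators, which the following short check makes explicit. All counts are taken from the text (600 questionnaires mailed, 419 returned, 280 respondents treating LBP, 181 non-responders, 49 postcards returned); only the helper name `pct` is introduced here for illustration.

```python
def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to a whole number."""
    return round(100 * part / whole)


mailed = 600          # questionnaires sent to the random sample
returned = 419        # questionnaires returned
treating_lbp = 280    # respondents who treat LBP and completed every section
postcards = 49        # non-responders who returned the follow-up postcard

non_responders = mailed - returned

figures = {
    "response rate (returned / mailed)": pct(returned, mailed),
    "treating LBP, share of respondents": pct(treating_lbp, returned),
    "treating LBP, share of mailed sample": pct(treating_lbp, mailed),
    "postcards returned by non-responders": pct(postcards, non_responders),
}

for label, value in figures.items():
    print(f"{label}: {value}%")
```

The same 280 therapists therefore correspond to 67% of respondents but 47% of the sample originally surveyed.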

5. Conclusion

The findings of this survey demonstrate that respondents working in the public or private sector throughout Ireland recognise the value of advice and exercise for the management of chronic LBP. There is also evidence that a variety of treatments are being used alongside advice and exercise. Exercise programme supervision and follow-up advice, both considered important in facilitating the maintenance of advice and exercise-induced treatment gains, are not widely used by therapists responding to this survey. The potential benefit of follow-up advice (provided after the last treatment session), as a means of reinforcing the advice given during treatment and ensuring continued adherence to exercise and activity programmes, warrants further investigation. The importance of treatments designed to improve chronic LBP patients’ function, using individually tailored and supervised exercise programmes, must be more strongly emphasised in clinical guidelines that focus on the end-user and provide clear statements and educational materials (Grol and Buchan, 2006), in order to reap the rewards of these treatments in the longer term.

Acknowledgements

The authors gratefully acknowledge the physiotherapists who took part in this study, and the support of the Department of Employment and Learning (Northern Ireland). There are no conflicts of interest.

Appendix A. Supplementary data

Supplementary data associated with this article can be found, in the online version, at doi:10.1016/j.math.2008.01.012.

References

American College of Sports Medicine (ACSM). ACSM’s guidelines for exercise testing and prescription. 6th ed. Philadelphia: Lippincott, Williams and Wilkins; 2000.
Armstrong MP, McDonough S, Baxter GD. Clinical guidelines versus clinical practice in the management of low back pain. International Journal of Clinical Practice 2003;57:9–13.
Airaksinen O, Brox JL, Cedraschi C, Hildebrandt J, Klaber-Moffett J, Kovacs F, et al. European guidelines for the management of chronic non-specific low back pain. COST B13 Working Group, www.backpaineurope.org; 2004.
Bartley R, Coffey P. Management of low back pain in primary care. Oxford: Butterworth Heinemann; 2001. p. 19.
Bendix AF, Bendix T, Haestrup C, Busch E. A prospective, randomised 5-year follow-up study of functional restoration in chronic low back pain patients. European Spine Journal 1998;7:111–9.
Bombardier C. Outcome assessments in the evaluation of treatment of spinal disorders. Spine 2000;25:3100–3.
Bombardier C, Hayden J, Beaton DE. Minimal clinically important difference. Low back pain: outcome measures. Journal of Rheumatology 2001;28:431–8.
Byrne K, Doody C, Hurley DA. Exercise therapy for low back pain: a small-scale exploratory survey of current physiotherapy practice in the Republic of Ireland acute hospital setting. Manual Therapy 2006;11:272–8.
Carpenter DM, Nelson B. Low back strengthening for the prevention and treatment of low back pain. Medicine and Science in Sports and Exercise 1999;31:18–24.
Cherkin DC. Primary care research on low back pain: the state of the science. Spine 1998;23:1997–2002.
Cohen I, Rainville J. Aggressive exercise as treatment for chronic low back pain. Sports Medicine 2002;32:75–82.
Cottingham JT, Maitland J. A three-paradigm treatment model using soft tissue mobilisation and guided movement-awareness techniques for a patient with chronic low back pain: a case study. Journal of Orthopaedic and Sports Physical Therapy 1997;26:155–67.
Davey R, Broadbent H. Group rehabilitation for chronic back pain: a pilot study. British Journal of Therapy and Rehabilitation 1998;5:636–42.
Descarreaux M, Normand M, Laurencelle L, Dugas C. Evaluation of a specific home exercise programme for low back pain. Journal of Manipulative and Physiological Therapeutics 2002;25:497–503.
Deyo RA, Weinstein J. Low back pain. The New England Journal of Medicine 2001;344:363–70.
Deyo RA, Battie M, Beurskens AJHM, Bombardier C, Croft P, Koes B, et al. Outcome measures for low back pain research: a proposal for standardised use. Spine 1998;23:2003–13.
Domholdt EA, Malone TR. Evaluating research literature: the educated clinician. Physical Therapy 1985;65:487–91.
Edwards P, Roberts I, Clarke M, DiGuiseppi C, Pratap S, Wentz R, et al. Methods to influence response to postal questionnaires (Cochrane methodology review). The Cochrane Library 2002:4.
Ehrlich G. Back pain. The Journal of Rheumatology 2003;30 (Supplement 67):26–31.
Foster N, Thompson K, Baxter GD, Allen JM. Management of nonspecific low back pain by physiotherapists in Britain and Ireland: a descriptive questionnaire of current clinical practice. Spine 1999;24:1332–42.
Frost H, Lamb SE, Shackleton CH. A functional restoration programme for chronic low back pain: a prospective outcome study. Physiotherapy 2000;86:285–93.
Gracey JH, McDonough S, Baxter GD. Physiotherapy management of low back pain: a survey of current practice in Northern Ireland. Spine 2002;27:406–11.
Grol R, Buchan H. Clinical guidelines: what can we do to increase their use? Strategies to close the gap between development and implementation of guidelines. Medical Journal of Australia 2006;185:301–2.
Harkapaa K, Jarvikoski A, Mellin G, Hurri H. A controlled study on the outcome of inpatient and outpatient treatment of low back pain: Part 1. Pain, disability, compliance, and reported treatment benefits three months after treatment. Scandinavian Journal of Rehabilitation Medicine 1989;21:81–9.
Hayden JA, van Tulder MW, Tomlinson G. Systematic review: strategies for using exercise therapy to improve outcomes in chronic low back pain. Annals of Internal Medicine 2005;142:776–85.
Hicks CM. Research methods for clinical therapists: applied project design and analysis. 3rd ed. Edinburgh: Churchill Livingstone; 1999.
Hilde G, Hagen KB, Jamtvedt G, Winnem M. Advice to stay active as a single treatment for low back pain and sciatica (Cochrane Review). The Cochrane Library 2002:3.
Kerssens JJ, Sluijs EM, Verhaak PFM, Knibbe HJJ, Hermans IMJ. Back care instructions in physical therapy: a trend analysis of individualized back care programs. Physical Therapy 1999;79:286–95.
Li LC, Bombardier C. Physical therapy management of low back pain: an exploratory survey of therapist approaches. Physical Therapy 2001;81:1018–28.
Liddle SD, Baxter GD, Gracey JH. Exercise and chronic low back pain: what works? Pain 2004;107:176–90.
Liddle SD, Gracey JH, Baxter GD. Advice for the management of low back pain: a systematic review of randomised controlled trials. Manual Therapy 2007a;12:310–27.
Liddle SD, Baxter GD, Gracey JH. Chronic low back pain: patients’ experiences, opinions and expectations for clinical management. Disability and Rehabilitation 2007b;29:1899–909.
Lively M. Sports medicine approach to low back pain. Southern Medical Journal 2002;95:642–6.
Maniadakis N, Gray A. The economic burden of back pain in the UK. Pain 2000;84:95–103.
Metcalfe C, Lewin R, Wisher S, Perry S, Bannigan K, Klaber Moffett J. Barriers to implementing the evidence base in four NHS therapies. Physiotherapy 2001;87:433–41.
Middleton A. Chronic low back pain: patient compliance with physiotherapy advice and exercise, perceived barriers and motivation. Physical Therapy Reviews 2004;9:153–60.
Miller J, Timson D. Exploring the experience of partners who live with a chronic low back pain sufferer. Health and Social Care in the Community 2004;12:34–42.
Pincus T, Vogel S, Santos R, Breen AC, Foster NE, Underwood M. The attitudes to back pain scale in musculoskeletal practitioners (ABS-MP); the development and testing of a new questionnaire. British Journal of Bone and Joint Surgery 2005;87-B(Suppl. II):207.
Rainville J, Sobel J, Hartigan C, Monlux G, Bean J. Decreasing disability in chronic back pain through aggressive spine rehabilitation. Journal of Rehabilitation Research and Development 1997;34:383–93.
Sluijs EM, Kok GJ, van der Zee J. Correlates of exercise compliance in physical therapy. Physical Therapy 1993;73:771–86.
Snook SH. Self-care guidelines for the management of non-specific low back pain. Journal of Occupational Rehabilitation 2004;14:243–53.
Speed K. ABC of Rheumatology: low back pain. British Medical Journal 2004;328:1119–21.
van Tulder M, Malmivaara A, Esmail R, Koes B. Exercise therapy for low back pain (Cochrane Review). The Cochrane Library 2004:4.
Waddell G. The back pain revolution. 2nd ed. London: Churchill Livingstone; 2004. p. 75.

Manual Therapy 14 (2009) 197–205 www.elsevier.com/math

Original Article

Reliability of assisted indentation in measuring lumbar spinal stiffness

Tasha R. Stanton 1, Gregory N. Kawchuk*

University of Alberta, Department of Physical Therapy, Faculty of Rehabilitation Medicine, 3-44 Corbett Hall, Common Spinal Disorders Lab, Edmonton, Alberta T6G 2G4, Canada

Received 27 June 2007; received in revised form 22 January 2008; accepted 30 January 2008

Abstract

The reliability of manual methods to assess spinal stiffness is modest at best. In response, instrumentation has been developed which may be reliable, but is often difficult to use in clinical settings. The purpose of this study was to determine the intra-rater reliability of assisted indentation (AI), a smaller, less automated technique of measuring spinal stiffness in vivo. Twenty-three asymptomatic subjects were included in the study. The AI device was placed over the 4th lumbar spinous process in each prone, resting subject. Ten indentations were performed at approximately 2-min intervals while load and displacement data were collected simultaneously. From these data, two outcome variables were calculated: Global Stiffness (GS; slope of the force–displacement data) and Mean Maximal Stiffness (MMS; peak force/peak displacement). Intra-class correlation coefficient values for 10 consecutive measures of GS and MMS were 0.93 and 0.91, respectively. A repeated measures analysis of variance (ANOVA) did not demonstrate significant differences between any indentation trials from the same subject. Measurement of spinal stiffness using AI demonstrated excellent intra-rater reliability. These data, in addition to specific features of AI (small, transportable, relatively low cost, ease of operation) suggest that AI may be of benefit within clinical environments. © 2008 Elsevier Ltd. All rights reserved.

Keywords: Indentation; Assisted indention; Reliability; Spinal stiffness; Posteroanterior compression

* Corresponding author. Tel.: +1 780 492 6891; fax: +1 780 492 4492. E-mail address: [email protected] (G.N. Kawchuk).
1 University of Sydney, School of Physiotherapy, Faculty of Health Sciences, East Street, Lidcombe, NSW 2141, Australia.
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.011

1. Background and purpose

The manual assessment of low back stiffness remains a key tenet for many professionals who diagnose and treat low back pain. Most often, the clinical assessment of spinal stiffness involves a manual pressure test where a clinician uses their hands to apply pressure in a posteroanterior (PA) direction to the spinous process of interest. During the application of PA pressure, the clinician appreciates the resulting tissue response and forms a subjective impression of spinal stiffness. The resulting impression formed by the clinician during the pressure test is then used to judge if the spine is too compliant (hypermobility), too stiff (hypomobility), or within normal limits (Maitland et al., 2001). These judgments often provide a basis for individual treatment programs and have also been shown to be important in predicting therapeutic success when stabilization exercise programs are prescribed (Hicks et al., 2005). Unfortunately, the PA pressure test is based on human performance, interpretation and communication. As a result, the PA pressure test is highly variable in many respects including the magnitude of applied peak force (Latimer et al., 1998), the direction of force application (Caling and Lee, 2001) and in the

198

T.R. Stanton, G.N. Kawchuk / Manual Therapy 14 (2009) 197e205

identiﬁcation of a speciﬁc spinous process (Harlick et al., 2007) as a PA pressure target. In addition, the level of human sensitivity in detecting alterations in stiﬀness is limited. It has been estimated that the discrimination threshold for stiﬀness is of the order of 11% when using a pisiform grip to evaluate stiﬀness in the range of 6e11 N/mm (Maher and Adams, 1995). As a result, clinicians may be unable to perceive signiﬁcant changes in spinal stiﬀness that occur below this threshold. Given the above, it is not surprising that stiﬀness values obtained from manual assessment of spinal stiﬀness vary considerably between clinicians (Snodgrass et al., 2006). Speciﬁcally, studies of between-clinician agreement have shown that the reliability of stiﬀness assessment remains poor with intra-class correlation coeﬃcients (ICC) ranging between 0.03 and 0.55 (Fleiss, 1986; Maher and Adams, 1994; Binkley et al., 1995). In response to the poor reliability (Fleiss, 1986; Maher and Adams, 1994; Binkley et al., 1995), large variability (Snodgrass et al., 2006), and limits of human perception (Maher and Adams, 1995) associated with manual assessment of spinal stiﬀness, mechanical instruments have been designed to measure the applied loads and resulting tissue deformations that occur during manual PA testing. These devices include the Spinal Physiotherapy Simulator (SPS) (Lee and Svensson, 1990), Lee and Evans’ stiﬀness assessment device (Lee and Evans, 1992), Stiﬀness Assessment Machine (SAM) (Latimer et al., 1996aec), Spinal Posteroanterior Mobilizer (SPAM) (Edmondston et al., 1998), and the Rigid Frame Indentor (Kawchuk and Herzog, 1996). While the reliability of the majority of these instruments is high, these devices are designed primarily for research applications. As a result, many features of these devices such as their size, expense, and complex operation preclude their use in clinical settings. 
To exploit the increased performance of mechanical devices in assessing spinal stiffness yet avoid the limitations common to these research-based devices, a new stiffness assessment technique is proposed. This technique, assisted indentation (AI), uses manual load application with the addition of instrumentation designed to assist the operator and improve reliability and accuracy. Given recent findings that indicate stiffness may be a variable which helps predict outcome success (Childs et al., 2004; Hicks et al., 2005), there may be a future clinical need for a device which can measure stiffness accurately and reliably. Although the accuracy of AI has been shown to be excellent (absolute maximal difference of 0.22 mm compared to gold standard) (Kawchuk et al., 2006), the reliability of AI has yet to be determined. Therefore, the purpose of this study was to measure the in vivo, within-operator reliability of AI measurements of spinal stiffness. It was hypothesized that AI reliability would be excellent (ICC greater than 0.75) (Fleiss, 1986).

2. Methods
2.1. Subjects
Following approval from the University of Alberta Health Research Ethics Board, 23 consenting subjects were recruited from the University of Alberta and surrounding area over a 1-month period. This sample size was calculated a priori using a power of 80% and a level of significance of p = 0.05.

2.1.1. Inclusion criteria
Study subjects included asymptomatic males and females between the ages of 18 and 30 with no history of low back pain within the last year as well as no current low back pain.

2.1.2. Exclusion criteria
Subjects were excluded from this study if they reported back pain and/or medical conditions that could affect the safety of measurement of spinal stiffness using AI and/or intolerance to screening procedures designed to identify those persons sensitive to direct spinous process loading. Please refer to Table 1 for a detailed list of exclusion criteria.

2.2. Research design
This study quantified (1) single operator reliability of AI measures in a sample human population and (2) repeatability of AI measures generated by a single operator within single subjects.

2.3. Instrumentation
A description of the device used to perform AI has been published previously (Fig. 1) (Kawchuk et al., 2006). In brief, the AI equipment is made up of an outer frame that is supported by an external support arm (Tenet Medical Engineering, Calgary, Alberta, Canada). The use of this rigid arm creates a stationary reference point. These structures suspend an inner probe that is moved manually to apply an external force to the anatomical target of interest. By using a ceramic air-bearing to hold the indenting probe, near frictionless movement of the inner probe with respect to the outer frame can be achieved, thereby reducing artifacts due to movement of the frame during indentation loading. To measure applied force, a compressive-tension load cell (Entran, Fairfield, NJ) is connected in series with the probe. The displacement of the probe is measured by a linear variable differential transformer (LVDT) (Honeywell International Inc., Morristown, NJ) attached between the probe and the outer housing. Because the displacement of the indenter is initiated by a manual process, but restricted by mechanical boundaries, this form of indentation is called "Assisted Indentation". Signals from the load cell and the LVDT were conditioned appropriately and collected by customized LabVIEW software (National Instruments, Austin, TX) at a collection rate of 200 Hz.

Fig. 1. Assisted Indentor.

Table 1 Exclusion criteria.
Injury related: Current low back pain; Low back pain within the last year; Previous back surgery; Lower extremity injury within the last year
Disease processes (Maitland, 1986): Osteoporosis; Ankylosing Spondylitis; Known malignancy; Known spondylolisthesis; Multiple sclerosis; Severe scoliosis; Osteoarthritis; Rheumatoid arthritis
Subject factors: Pregnancy (unsure or confirmed); Medications affecting muscle function (e.g. steroids); Medications affecting pain recognition (e.g. pain medications); Unable to tolerate indentation

2.4. Calibration
Calibration of the assisted indentation device was achieved using masses of known magnitude applied to the load cell and spacers of known dimensions applied to the LVDT. After each application of increasing calibration mass or dimension, force and displacement signals were collected then plotted against the known mass or dimension. These data were then modeled with a linear data fitting technique. In each case, the r² value of the line of best fit was greater than 0.90. The resulting equation of the line of best fit was then used to convert the output voltage of each transducer into units of force or displacement. Calibration was completed prior to subject testing.

2.5. Spinal stiffness measurement

In each prone subject, the AI device was placed perpendicular to the L4 spinous process with a contact load of less than 1 N (Fig. 2). The subject was then instructed to breathe out comfortably then to hold his/her breath for the duration of the indentation (approximately 5 s) (Kawchuk and Fauvel, 2001). During indentation, the indentation probe was advanced manually (approximately 2 mm/s) into the spine until a force threshold of 100 N was read from a visual indicator. This level of force application was considered to be safe as forces up to 200 N have been used within an asymptomatic human population (Latimer et al., 1998) and forces up to 105 N within a symptomatic human population (Latimer et al., 1996b) without any adverse effects reported. When the 100 N threshold was reached, the indentor position was maintained at this load for approximately 1 s, after which the indentor was removed from contacting the subject. To decrease variability in the rate of indentation loading, the equipment operator viewed a computerized bar graph which increased in size at a rate of 2 mm/s. Next to this graph, a second bar graph displayed the actual displacement of the AI device. With these two displays, the operator could continually adjust their performance to match the desired indentation rate.

Fig. 2. Placement of the Assisted Indentor on a subject.
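The force threshold and displacement readout described here both rest on the linear calibration of Section 2.4. A minimal sketch of such a calibration is given below, assuming an ordinary least-squares line fitted to paired readings. The voltage and force values are hypothetical illustration data, not measurements from this study, and the helper names (`fit_line`, `volts_to_newtons`) are invented for the example.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept; also returns r^2."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = (sxy * sxy) / (sxx * syy)
    return slope, intercept, r_squared


# Hypothetical load-cell readings (V) recorded under known applied forces (N).
volts = [0.00, 0.51, 0.99, 1.52, 2.01]
newtons = [0.0, 25.0, 50.0, 75.0, 100.0]

# Assumption: the known quantity is regressed on the voltage so that the
# fitted line converts a raw reading directly into physical units.
slope, intercept, r2 = fit_line(volts, newtons)


def volts_to_newtons(v):
    """Convert a raw load-cell voltage to force using the fitted line."""
    return slope * v + intercept


print(f"force = {slope:.2f} * V + {intercept:.2f}, r^2 = {r2:.4f}")
```

The same function could be applied to the LVDT, with spacer thickness (mm) taking the place of force. Fitting in the other direction (voltage as a function of the known quantity) and inverting the line would be an equally reasonable reading of the text.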

2.6. Study procedure
Once informed consent was obtained from the subjects, a verbal history questionnaire was completed to ensure that subjects met the inclusion criteria and did not possess any factors that would cause exclusion from the study. Following the questionnaire, each subject's height and weight were recorded, and Body Mass Index (BMI; kg/m²) was calculated (Astrand et al., 2003). With the subject lying prone on a plinth, the subject's spine was palpated by the researcher and the L4 spinous process identified. Although identification of spinous processes in the lumbar spine has demonstrated moderate accuracy with use of preferred palpation procedures (47% were on the level intended) (Harlick et al., 2007), a standardized procedure was utilized in this study to reduce this error. Specifically, the horizontal line between the iliac crests was used to identify the L4/5 interspace and the vertebra above was determined to be L4 (if this line between the iliac crests gave a spinous process, this was identified as L4) (Grieves, 1984). The L4 vertebra was chosen as the site of indentation as this has been shown to be a commonly symptomatic area in patients with low back pain (Maitland et al., 2001). The skin over the presumed L4 spinous process was then marked using a pen to provide a visual guide for placing the indenter. The indenter was then placed over the ink marking and a series of five consecutive indentations were provided to familiarize subjects with the indentation process. Once the familiarization indentations were completed, 10 consecutive spinal stiffness measurements (indentations) were collected, each separated by a time period of approximately 2 min. During times between indentations, subjects were instructed to remain in a resting prone position and to remain stationary and relaxed. Each subject was examined at one time period. Indentations were performed by one researcher (T.S.) who had logged approximately 100 h of using the indentation device prior to data collection for this experiment. During the indentation process, all subjects held an analog trigger to indicate if their level of discomfort during indentation increased from baseline. If the subject wanted indentation to cease for any reason, they were instructed to squeeze the trigger fully, which produced an audible alarm alerting the researcher to remove the indentor. If this situation occurred, the researcher re-positioned the indentor and indentation was attempted again. Re-positioning of the indentor was allowed a maximum of two times, after which further indications of painful indentation excluded the subject from further participation.

2.7. Analysis of spinal stiffness measurement

Indentation data (force and displacement) were used to calculate the spinal stiffness at the indentation site. Stiffness was quantified in two ways: (1) Global Stiffness (GS); and (2) Mean Maximal Stiffness (MMS). GS, calculated as the slope of the force–displacement curve between 30 N and maximal force, represents the stiffness of the underlying tissues during the indentation itself. It is assumed that the relationship between force and displacement is linear between 30 and 100 N given previous work (Latimer et al., 1996b). MMS, the second variable representing stiffness, was computed by taking the average stiffness value (N/mm) over the time period where the maximal indentation force has been held for a period of approximately 1 s. The MMS variable is therefore a ratio between the applied maximal force and the resultant maximal displacement of the underlying tissues (Fig. 3).

2.8. Statistical analysis
For data analysis purposes, all five of the familiarization trials were discarded (Latimer et al., 1996b,c). In addition, the first trial (stiffness measurement during rest) of the 10 experimental indentations was discarded as this trial has been shown to be highly variable (Latimer et al., 1996b,c) while stiffness measurements from subsequent trials (after the first trial) have demonstrated stability (Latimer et al., 1996b,c). To assess intra-rater reliability of the researcher/instrument in measuring spinal stiffness, the intra-class correlation coefficient (3,1) was calculated (Shrout and Fleiss, 1979). To describe repeatability, inter-trial inconsistency values for stiffness variables were calculated by taking the difference between two consecutive indentations expressed as a percentage of the average of the same two indentations.
Finally, to further explore repeatability and investigate the possibility that a gradual change in stiﬀness values may occur with successive indentations, a condition that may not be reﬂected in ICC values, a repeated measures analysis of variance (ANOVA) with a Bonferroni correction was performed.
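The two descriptive statistics named in this section can be sketched in a few lines. The example below assumes the standard two-way ANOVA formulation of ICC(3,1) from Shrout and Fleiss (1979), ICC = (BMS − EMS) / (BMS + (k − 1)·EMS), and assumes that the inter-trial inconsistency is taken as an absolute difference and averaged over consecutive pairs; the paper does not state either detail explicitly. All stiffness values are hypothetical.

```python
def icc_3_1(scores):
    """ICC(3,1) for scores[i][j] = measurement of subject i on trial j."""
    n = len(scores)          # number of subjects
    k = len(scores[0])       # number of trials per subject
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(scores[i][j] for i in range(n)) / n for j in range(k)]

    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_subjects = k * sum((m - grand) ** 2 for m in row_means)
    ss_trials = n * sum((m - grand) ** 2 for m in col_means)
    ss_error = ss_total - ss_subjects - ss_trials

    bms = ss_subjects / (n - 1)             # between-subjects mean square
    ems = ss_error / ((n - 1) * (k - 1))    # residual mean square
    return (bms - ems) / (bms + (k - 1) * ems)


def inter_trial_inconsistency(trials):
    """Mean of |difference| between consecutive trials, as % of their average."""
    pct = [abs(b - a) / ((a + b) / 2.0) * 100.0
           for a, b in zip(trials, trials[1:])]
    return sum(pct) / len(pct)


# Hypothetical stiffness values (N/mm): 4 subjects x 3 trials.
example_scores = [
    [8.0, 8.2, 8.1],
    [10.1, 9.9, 10.0],
    [12.0, 12.3, 12.1],
    [9.0, 9.1, 8.9],
]
example_icc = icc_3_1(example_scores)
example_inconsistency = inter_trial_inconsistency([10.0, 11.0, 10.0])

print(f"ICC(3,1) = {example_icc:.3f}")
print(f"inter-trial inconsistency = {example_inconsistency:.2f}%")
```

Because ICC(3,1) treats trials as a fixed effect, a systematic drift across trials does not lower it, which is one reason a repeated measures ANOVA is a useful complement.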


Fig. 3. Graphical example of stiffness measurement output. The top graph represents the load–displacement curve of a single AI (force on y-axis, displacement on x-axis) with vertical white bars representing the section over which slope was taken (GS). The bottom graph shows the indentation profile (numerical scale for force and displacement on y-axis, time on x-axis). In the bottom graph, the upper trace is the applied force while the bottom trace is the resultant displacement. The vertical white bars represent the section over which force was divided by displacement (MMS).
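The two stiffness variables illustrated in Fig. 3 can be sketched as follows. This is an illustrative reading of the definitions in Section 2.7, not the authors' software: GS is taken as the least-squares slope of the loading samples between 30 N and peak force, and MMS as the mean of force divided by displacement over the hold period. Detecting the hold period as "samples within 2% of peak force" is an assumption made for this example, as are the function names and the synthetic indentation data.

```python
def least_squares_slope(x, y):
    """Slope of the ordinary least-squares line through (x, y)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / sxx


def global_stiffness(disp_mm, force_n, lower_n=30.0):
    """GS (N/mm): slope of the loading curve from lower_n up to peak force."""
    peak = max(range(len(force_n)), key=lambda i: force_n[i])  # first peak sample
    pairs = [(d, f) for d, f in zip(disp_mm[:peak + 1], force_n[:peak + 1])
             if f >= lower_n]
    return least_squares_slope([d for d, _ in pairs], [f for _, f in pairs])


def mean_maximal_stiffness(disp_mm, force_n, hold_fraction=0.98):
    """MMS (N/mm): mean force/displacement over samples near peak force."""
    peak_force = max(force_n)
    ratios = [f / d for d, f in zip(disp_mm, force_n)
              if f >= hold_fraction * peak_force and d > 0]
    return sum(ratios) / len(ratios)


# Synthetic indentation: a soft toe region (4 N/mm up to 5 mm), then a
# linear region (10 N/mm) up to 100 N at 13 mm, then a short hold at peak.
disp = [i * 0.5 for i in range(27)]
force = [4.0 * d if d <= 5.0 else 20.0 + 10.0 * (d - 5.0) for d in disp]
disp += [13.0] * 5
force += [100.0] * 5

gs = global_stiffness(disp, force)
mms = mean_maximal_stiffness(disp, force)
print(f"GS = {gs:.2f} N/mm, MMS = {mms:.2f} N/mm")
```

In this synthetic case GS reflects only the linear region, whereas MMS is lower because the peak displacement includes the softer toe region. The two variables therefore need not agree numerically even for the same indentation.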

3. Results
A total of 30 subjects were recruited to participate in this project with three excluded due to previous back or lower extremity injury within the last year, two excluded for exceeding the age limit, and two excluded prior to formal testing (did not pass the indentation screening procedure in that they reported discomfort with indentation even after the indentor was re-positioned twice). This resulted in 12 male and 11 female subjects who participated in this study (n = 23) (see Table 2 for subject demographics). In this experiment, the reliability of the stiffness measures was described by the ICC which was calculated to be 0.91 for GS and 0.93 for MMS. Additionally, an estimate of the consistency in stiffness measures was obtained by calculating the inter-trial inconsistency value which was 6.23% (4.52%) for the GS and 7.71% (5.33%) for MMS (see Figs. 4 and 5 for individual subject representation of inter-trial inconsistency values). The repeated measures ANOVA did not reveal significant differences between any indentation trials for either GS or MMS (p = 0.09–1.00 and p = 1.00 for all comparisons, respectively). See Figs. 6 and 7 for the graphical representation of the change in stiffness values over time.

4. Discussion
Data from this study support the hypothesis that AI has excellent reliability (ICC > 0.75) (Fleiss, 1986). Specifically, AI exhibited excellent intra-rater reliability for all outcome variables used to quantify L4 stiffness. Furthermore, the average inter-trial inconsistency remained below 8% for all stiffness variables. Compared to the manual testing of stiffness, ICC values found for the AI technique were much higher (Table 3). Overall, reliability values for the evaluation of spinal stiffness using the manual PA pressure test have been found to be poor (Matyas and Bach, 1985; Maher and Adams, 1994; Binkley et al., 1995). Matyas and Bach (1985) first found poor reliability of manual PA stiffness assessment when they reported Pearson's r ranging from 0.09 to 0.46. Unfortunately, these reliability results using Pearson's r cannot be compared directly to the current study. Later studies also noted poor reliability with ICC (1,1) values ranging from 0.03 to 0.37 (Maher and Adams, 1994; Binkley et al., 1995). With improvements to the testing protocol and delineation of stiffness into ranges, reliability increased to a fair level (Fleiss, 1986) with an ICC value reported to be 0.55 (range 0.50–0.62) (Maher et al., 1998). The ICC value of the PA pressure test increased further when an 11-point stiffness rating scale was employed and more rigorously controlled testing protocols were used (ICC = 0.77) (Maher et al., 1998). Although improvements in the reliability of the manual assessment of spinal stiffness have been demonstrated, these improvements occur only under standardized and artificial conditions that are not typically employed in the clinical environment. It may be argued that any form of instrumented stiffness assessment, such as AI, is also not typical of the clinical procedures (i.e. PA testing) due to increased

Table 2 Mean (standard deviation) of subject demographic characteristics.

                 Male (n = 12)     Female (n = 11)
Age (years)      26.17 (3.10)      24.45 (3.21)
Height (m)       1.79 (0.065)      1.63 (0.052)
Weight (kg)      76.23 (9.64)      58.41 (8.28)
BMI (kg/m²)      23.85 (2.39)      21.59 (2.21)


Fig. 4. Inter-trial inconsistency values (mean ± standard deviation) for GS estimates of L4 stiffness values.

size of the instrumentation and necessary operator training. However, if the desire is to objectively quantify stiffness in a reproducible way, then changing clinical practice to involve use of scales to delineate stiffness levels or involve use of an instrument becomes important. If changes to clinical practice are mandated and/or desirable, using a method of stiffness assessment that combines high reliability values with minimal clinical invasiveness is paramount. With this in mind, AI may become a viable option for clinical stiffness testing due to its excellent reliability values as well as a design that allows for ease of use by a single operator in a small-footprint, low-cost device (approximately $10,000 Canadian) that does not require advanced mechanization such as motors, pulleys or pistons. The observation that AI exhibits greater reliability than manual assessment of spinal stiffness was expected for three reasons. First, AI measures several variables in an objective manner, increasing the reliability of spinal stiffness assessment. Specifically, use of technology to quantify force and displacement data (load cell and

an LVDT, respectively), in addition to customized computer programming, allows consistency of force application and real-time visualization of results. Second, AI reduces variability in factors shown to alter spinal stiffness measures including visual occlusion (Maher and Adams, 1996), peak force (Latimer et al., 1998), frequency of PA loading (Lee and Svensson, 1990; Lee and Liversidge, 1994), direction of force application (Caling and Lee, 2001), and force angulation (Kawchuk and Herzog, 1996). Finally, we elected to employ stiffness variables which considered regions of data that were larger than those used in previous studies. This approach was chosen because the most clinically important region of a load–displacement graph remains unknown. While there is some evidence to suggest that stiffness may play a role in predicting outcomes of specific treatments (Childs et al., 2004; Hicks et al., 2005), an understanding of the physiologic basis of spinal stiffness, or its alteration due to pathology or treatment, remains elusive. With respect to other studies, the reliability values for AI, although slightly lower, are comparable to those

Fig. 5. Inter-trial inconsistency values (mean ± standard deviation) for MMS estimates of L4 stiffness values.


Fig. 6. Change in GS values over time for all subjects. GS values (mean ± standard deviation) normalized to indentation trial 2.

found for mechanical indentation devices (Table 3). Intra-class correlation coefficient values have been reported to be over 0.90 for almost all mechanical indentation instruments. Specifically, the SPAM was found to have an ICC value of 0.979 at L5 (Edmondston et al., 1998), Lee and Evans' stiffness assessment device had an ICC value of 0.99 for L3/4 and 0.95 for L4/5 (Lee and Evans, 1992), SAM had an ICC value of 0.96 for lumbar vertebrae (Latimer et al., 1996a–c), and Rigid Frame Indentation had ICC values of 0.99–1.00 for varying experimental conditions (Kawchuk and Herzog, 1996). Interestingly, the reliability of AI was higher than that of the SPS, for which an ICC value of 0.88 was found at L3 (Lee and Svensson, 1990). That mechanical indentation devices have higher reliability values (overall) than AI is expected. While the rate of indentation during the AI procedure is standardized using a visual cue (graphic display of force data), slight variations in the rate of indentation were likely to occur. These variations may alter the resulting measures as (1) the target tissues are viscoelastic and may exhibit rate-dependent behaviors (White and Panjabi, 1990) and (2) variations in the data may influence stiffness analysis techniques such as GS which is based on a linear approximation of shape.

While these variations were not of suﬃcient magnitude to create poor reliability, they may account for the slightly lower reliability values that occur with AI compared to other automated techniques. Further support for the comparability of AI to mechanical techniques is suggested by our repeated measures ANOVA results; no signiﬁcant diﬀerences were present between any of the indentation trials for both GS and MMS measures. This observation suggests that all indentations using the AI found similar stiﬀness values regardless of the time at which the stiﬀness measure was taken. This ﬁnding strengthens the excellent reliability values by demonstrating consistency over time with the stiﬀness measurements. Further, these ANOVA data suggest that repeated AI trials do not aﬀect viscoelasticity of the target tissue. This is likely due to tissues reaching a steady state of viscoelastic change following suﬃcient familiarization trials and experimental indentations and/or adequate time between all indentations such that between-trial tissue recovery was complete. It should be noted that large diﬀerences in individual subject inter-trial inconsistency values were exhibited with some single subjects having inter-trial inconsistency values approaching 30% (1 SD). This suggests that the

Fig. 7. Change in MMS values over time for all subjects. MMS values (mean ± standard deviation) normalized to indentation trial 2.


Table 3 Intra-class correlation values for three methods of stiffness assessment.

Method of stiffness assessment    ICC value
Manual                            0.03–0.77
Mechanical                        0.88–1.00
Assisted indentation              0.91–0.93

consistency of stiffness results obtained by AI may be specific to the individual and may be influenced by other factors not defined in this study. Possible factors that could explain measurement inconsistency with these few subjects may include inconsistent localization of the indentation contact point between trials or failure to control subject-specific factors which influence stiffness (e.g. intra-abdominal pressure, muscle contraction, subject movement, etc.) (Kawchuk and Fauvel, 2001). In this situation, changes in measured spinal stiffness may occur as the indentation test may involve different anatomy. In addition, the subject's baseline stiffness could also be a confounding factor. Although a formal analysis was not performed, it was observed that those subjects with high baseline stiffness values for GS and MMS (stiff back) often had large changes in their stiffness values over time. Finally, variables such as plinth padding (Maher et al., 1999), subject positioning (Edmondston et al., 1998), adipose tissue (Viner et al., 1997), and breathing (Beaumont et al., 1991) must be controlled within a single subject if stiffness measures are to be compared within the same subject over time. Several limitations of this study are noted. First, only intra-operator reliability was measured in asymptomatic subjects. As a result, we cannot comment on intra-operator reliability in a symptomatic population nor inter-operator reliability. Second, our results apply specifically to a sample of subjects with an average age of 25 years and average BMI of 22 kg/m²; generalization to those outside this group is unwarranted.

5. Conclusion
Measurement of spinal stiffness using AI demonstrated excellent intra-rater reliability. Due to the smaller and less cumbersome nature of AI compared to other mechanical instruments, AI may be a viable technology for clinical use; however, further research is needed to quantify inter-rater reliability and to investigate the responsiveness of this instrument (sensitivity and specificity) to alterations in stiffness values.

Acknowledgements
Funding for this project and for Tasha Stanton was provided by the Province of Alberta Graduate Scholarship, the Strathcona Physiotherapy Foundation and NSERC. Support for Greg Kawchuk was supplied by the Canada Research Chairs program. We would like to express our sincere appreciation to Gian Jhangri and Dr. Trish Manns for their statistical assistance, Al Fleming and Sam Graziano for their technical support, and the members of the Common Spinals Disorder Lab at the University of Alberta for their feedback and support.

References
Astrand PO, Rodahl K, Dahl HA, Stromme SB. Textbook of work physiology: physiological bases of exercise. 4th ed. Champaign, IL: Human Kinetics; 2003.
Beaumont A, McCrumb C, Lee M. Differences in the posteroanterior stiffness of the lumbar spine during tidal breathing and breath holding. In: Proceedings of the seventh biennial conference of the manipulative physiotherapists association of Australia, Sydney, New South Wales, Australia 1991, p. 244–51.
Binkley JM, Stratford PW, Gill C. Intrarater reliability of lumbar accessory motion mobility testing. Phys Ther 1995;75:786–92.
Caling B, Lee M. Effect of direction of applied mobilization force on the posteroanterior response in the lumbar spine. J Manipulative Physiol Ther 2001;24:71–8.
Childs JD, Fritz JM, Flynn TW, Irrgang JJ, Johnson KK, Majkowski GR, et al. A clinical prediction rule to identify patients with low back pain most likely to benefit from spinal manipulation: a validation study. Ann Intern Med 2004;141:920–8.
Edmondston SJ, Allison GT, Gregg CD, Purden SM, Svansson GR, Watson AE. Effect of position on the posteroanterior stiffness of the lumbar spine. Man Ther 1998;3:21–6.
Fleiss JL. The design and analysis of clinical experiments. 1st ed. New York: Wiley; 1986.
Grieves G. Mobilisation of the spine: notes on examination, assessment, and clinical method. 4th ed. Edinburgh; New York: Churchill Livingstone; 1984.
Harlick JC, Milosavljevic S, Milburn PD. Palpation identification of spinous processes in the lumbar spine. Man Ther 2007;12:56–62.
Hicks GE, Fritz JM, Delitto A, McGill SM. Preliminary development of a clinical prediction rule for determining which patients with low back pain will respond to a stabilization program. Arch Phys Med Rehabil 2005;86:1753–62.
Kawchuk G, Herzog W. A new technique of tissue stiffness (compliance) assessment: its reliability, accuracy and comparison with an existing method. J Manipulative Physiol Ther 1996;19:13–8.
Kawchuk GN, Fauvel OR. Sources of variation in spinal indentation testing: indentation site relocation, intraabdominal pressure, subject movement, muscular response, and stiffness estimate. J Manipulative Physiol Ther 2001;24:84–91.
Kawchuk G, Liddle T, Fauvel R. The accuracy of ultrasonic indentation: a comparison of three techniques. J Manipulative Physiol Ther 2006;29:126–33.
Latimer J, Lee M, Adams R. The effects of high and low loading forces on measured values of lumbar stiffness. J Manipulative Physiol Ther 1998;21:157–63.
Latimer J, Lee M, Adams R, Moran C. An investigation of the relationship between low back pain and lumbar posteroanterior stiffness. J Manipulative Physiol Ther 1996a;19:587–91.
Latimer J, Lee M, Goodsell M, Maher C, Wilkinson B, Adams R. Instrumented measurement of spinal stiffness. Man Ther 1996b;1:204–9.
Latimer J, Goodsell MM, Lee M, Maher CG, Wilkinson BN, Moran CC. Evaluation of a new device for measuring responses to posteroanterior forces in a patient population, Part I: Reliability testing. Phys Ther 1996c;76:158–65.

Lee R, Evans J. Load–displacement–time characteristics of the spine under posteroanterior mobilisation. Aust J Physiother 1992;38:115–23.
Lee M, Liversidge K. Posteroanterior stiffness at three locations in the lumbar spine. J Manipulative Physiol Ther 1994;17:511–6.
Lee M, Svensson NL. Measurement of stiffness during simulated spinal physiotherapy. Clin Phys Physiol Meas 1990;11:201–7.
Maher CG, Adams R. Reliability of pain and stiffness assessments in clinical manual lumbar spine examination. Phys Ther 1994;74:801–9.
Maher C, Adams R. A psychophysical evaluation of manual stiffness discrimination. Aust J Physiother 1995;41:161–7.
Maher CG, Adams RD. Stiffness judgments are affected by visual occlusion. J Manipulative Physiol Ther 1996;19:250–6.
Maher CG, Latimer J, Adams R. An investigation of the reliability and validity of posteroanterior spinal stiffness judgments using a reference-based protocol. Phys Ther 1998;78:829–37.
Maher CG, Latimer J, Holland MJ. Plinth padding confounds measures of posteroanterior stiffness. Man Ther 1999;4:145–50.
Maitland GD. Vertebral manipulation. 5th ed. London: Butterworth-Heinemann; 1986.
Maitland GD, Hengeveld E, Banks K, English K. Maitland's vertebral manipulation. 6th ed. London: Butterworth-Heinemann; 2001.
Matyas T, Bach TM. The reliability of selected techniques in clinical arthrometrics. Aust J Physiother 1985;31:175–99.
Shrout PE, Fleiss JL. Intraclass correlation: uses in assessing rater reliability. Psychol Bull 1979;86:420–8.
Snodgrass SJ, Rivett DA, Robertson VJ. Manual forces applied during posterior-to-anterior spinal mobilization: a review of the evidence. J Manipulative Physiol Ther 2006;29:316–29.
Viner A, Lee M, Adams R. Posteroanterior stiffness in the lumbosacral spine: the correlation between adjacent vertebral levels. Spine 1997;22:2724–9 [discussion 2729–30].
White AA, Panjabi MM. Clinical biomechanics of the spine. 2nd ed. Philadelphia: Lippincott; 1990.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 206–212 www.elsevier.com/math

Original Article

Reliability, validity and responsiveness of the French version of the questionnaire Quick Disability of the Arm, Shoulder and Hand in shoulder disorders

Fouad Fayad a,*, Marie-Martine Lefevre-Colau b,e, Vincent Gautheron c,e, Yann Macé a, Jacques Fermanian d, Anne Mayoux-Benhamou a, Alexandra Roren a, François Rannou a, Agnès Roby-Brami e, Michel Revel a,e, Serge Poiraudeau a,e

a Department of Rehabilitation, Assistance Publique-Hôpitaux de Paris (AP-HP), Cochin Hospital, Paris Descartes University, 27 Rue du Faubourg Saint Jacques, 75679 Paris Cedex 14, France
b Department of Rehabilitation, AP-HP, Corentin-Celton Hospital, Paris Descartes University, Issy-les-Moulineaux, France
c Department of Rehabilitation, Bellevue Hospital, Jean Monnet University, Saint-Etienne, France
d Department of Biostatistics, AP-HP, Necker Hospital, Paris Descartes University, Paris, France
e Institut Fédératif de Recherche sur le Handicap (IFR 25), Institut National de la Santé et de la Recherche Médicale (INSERM), Paris, France

Received 27 July 2007; received in revised form 24 January 2008; accepted 30 January 2008

Abstract

We assessed the reliability, validity and responsiveness of the French short version of the scale Disability of the Arm, Shoulder and Hand-Disability/Symptom (F-QuickDASH-D/S) in patients with shoulder disorders. We extracted QuickDASH item responses from the responses to the full-length DASH questionnaire completed by 153 patients. In addition to collecting demographic and clinical data, subjective assessment of activities of daily living (ADL), active range of motion (ROM), and measurement of abduction strength (strength) were recorded by use of the Constant scale. Cronbach’s alpha coefficient was 0.89. The intraclass correlation coefficient was 0.94, which suggested excellent test–retest reliability. Correlation of the F-QuickDASH-D/S score with scores for F-DASH-D/S (r = 0.96), handicap (r = 0.79), ADL (r = 0.73), pain during activities (r = 0.63), strength (r = 0.58), pain at rest (r = 0.57) and ROM (r = 0.51) indicated good construct validity. Factor analysis identified 2 factors accounting for 59.1% of the variance. The responsiveness of F-QuickDASH-D/S was excellent, with standardized response mean and effect size values of 1.09 and 1.23, respectively. The F-QuickDASH-D/S has good reliability, construct validity and responsiveness. The strong correlation of its score with the full-length DASH-D/S scale score suggests that the QuickDASH-D/S could be the preferred scale because it is easier to use. © 2008 Elsevier Ltd. All rights reserved.

Keywords: Shoulder; Disability; QuickDASH questionnaire; Outcome measure

* Corresponding author. Tel.: +33 1 58 41 25 41; fax: +33 1 58 41 25 45. E-mail address: [email protected] (F. Fayad).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.01.013

1. Introduction

Symptomatic shoulder disorders constitute the third most common musculoskeletal reason, after back and neck pain, for consultation in medical practice (Rekola et al., 1993; Linsell et al., 2006; Feleus et al., 2008). Patients’ subjective perception of their disease status is decisive for both diagnostic work-up and subsequent therapeutic management. In addition, patient-reported outcome measures have become an important part of the assessment used in clinical studies. Numerous shoulder outcome-measure instruments are available (Fayad et al., 2005). The Disability of the Arm, Shoulder, and Hand scale (DASH) (Hudak et al., 1996) is among the self-administered questionnaires rated best for clinimetric properties (Bot et al., 2004; Gabel et al., 2006).

From the original 30-item DASH questionnaire, a shorter version of 11 items, the QuickDASH, was recently developed (Beaton et al., 2005). The psychometric properties of the QuickDASH are similar to those of the original questionnaire, and the QuickDASH may be preferred because it takes less time to complete and carries less administrative burden. Furthermore, the QuickDASH has been selected by the American Medical Association’s Guides to the Evaluation of Permanent Impairment as the functional assessment measure of the upper extremity (Matheson et al., 2006).

Cross-cultural adaptation of validated outcome instruments has been advocated to facilitate their use in international multicenter clinical trials (Ware et al., 1995), which would also reduce the need for developing new instruments with the same purpose. The full-length version of the DASH has been validated or translated into several languages (Atroshi et al., 2000; Dubert et al., 2001; Offenbaecher et al., 2002; Padua et al., 2003; Lee et al., 2005). These references are examples only; many more language versions exist. However, to date, only English, Swedish and Japanese versions of the QuickDASH have been validated (Beaton et al., 2005; Gummesson et al., 2006; Imaeda et al., 2006).

We aimed to assess the reliability, validity and responsiveness of the French version of the Disability/Symptom scale of the QuickDASH (F-QuickDASH-D/S) in patients with common shoulder disorders.

2. Patients and method

2.1. Patients

Patients with common shoulder conditions (rotator cuff tendinopathies, frozen shoulder, osteoarthritis and proximal humeral fractures after bone healing) referred to a tertiary care rehabilitation unit were considered for inclusion in this study. Exclusion criteria were age less than 18 years; symptom duration of less than 2 months; shoulder pain originating from neurological or vascular disorders or neoplasms; referred pain from internal organs; systemic rheumatic conditions; and inability to complete questionnaires because of cognitive impairment or language difficulties. French bioethics legislation does not require consent from the Hospital Ethics Committee for this type of study. The study was conducted in compliance with the protocol, Good Clinical Practices and the Declaration of Helsinki principles, and all patients provided informed consent.


2.2. Patient self-administered questionnaire

The full-length French DASH-D/S questionnaire (Dubert et al., 2001) was completed by 153 consecutive eligible patients. The QuickDASH item responses were extracted from the subjects’ responses to the full-length scale. The 11 items of the QuickDASH ask about the degree of difficulty in performing various physical activities because of arm, shoulder or hand problems (6 items); the severity of pain and tingling (2 items); as well as the problem’s effect on social activities, work, and sleep (3 items). Each item has 5 response options, ranging from 1, ‘‘no difficulty or no symptom,’’ to 5, ‘‘unable to perform activity or very severe symptom.’’ If at least 10 of the 11 items are completed, a score ranging from 0 (no disability) to 100 (most severe disability) can be calculated as $[(\text{sum of } n \text{ responses}/n) - 1] \times 25$ (Beaton et al., 2005). Data for patients with more than 1 unanswered item on the questionnaire were excluded. The 2 optional scales of the QuickDASH (sport/music and work) were not part of the study. Indeed, we chose to include patients with shoulder disorders without any restriction in age or activities. This led to the inclusion of many patients without professional activity or sport/music activities.

2.3. Statistical methods

2.3.1. Variables recorded other than the QuickDASH score

Demographic and clinical data were collected at the first visit (baseline) by a physician (FF). Parameters recorded were age, sex, body mass index, disease duration, pain scores at rest and during activities (on a visual analog scale [VAS], 0–100 mm), and perceived handicap (on a VAS, 0–100 mm). The following Constant subscale scores (Constant and Murley, 1987) were used: Twenty points were allocated to subjective assessment of activities of daily living (ADL), 40 to active range of motion (ROM), and 25 to abduction strength (strength). The Constant subscale for pain was not used in this study.

2.3.2. Statistics

Data analysis involved use of Systat 9 Delta Soft for Windows (Systat Software, Point Richmond, CA). Quantitative variables (age, disease duration, body mass index, pain scores at rest and during activities, perceived handicap, ADL, ROM, strength, and F-DASH-D/S and F-QuickDASH-D/S scores) are described with medians and ranges. The qualitative variable (sex) is described with percentages. The chi-square test was applied to test for a normal distribution of the variables: in the whole sample of patients, for all variables we could not determine their normal distribution; by contrast, in 2 subgroups of patients (see below), the variables of interest showed a normal distribution.
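The QuickDASH scoring rule described in Section 2.2 can be sketched in a few lines of Python. This is a minimal illustration, not the authors' software: the function name `quickdash_score` and the use of `None` for an unanswered item are assumptions made here for the example. The arithmetic follows the formula quoted from Beaton et al. (2005), and the rule of at least 10 answered items out of 11 is the one stated in the text.

```python
from typing import Optional, Sequence


def quickdash_score(responses: Sequence[Optional[int]]) -> Optional[float]:
    """Return the QuickDASH disability/symptom score (0-100), or None if unscorable.

    `responses` holds the 11 item answers, each 1-5; None marks an unanswered
    item (a convention assumed for this sketch).
    """
    if len(responses) != 11:
        raise ValueError("the QuickDASH disability/symptom scale has 11 items")
    answered = [r for r in responses if r is not None]
    if any(r not in (1, 2, 3, 4, 5) for r in answered):
        raise ValueError("each answered item must be an integer from 1 to 5")
    # More than 1 unanswered item: no score is calculated.
    if len(answered) < 10:
        return None
    # [(sum of n responses / n) - 1] * 25
    return (sum(answered) / len(answered) - 1) * 25


print(quickdash_score([1] * 11))           # 0.0   (no disability)
print(quickdash_score([5] * 11))           # 100.0 (most severe disability)
print(quickdash_score([3] * 10 + [None]))  # 50.0  (one item missing, still scorable)
```

Dividing by the number of answered items rather than by 11 is what allows a single missing item without biasing the score downward.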

2.3.3. Reliability

Test–retest reliability was analyzed in a subgroup of 42 patients, selected at random by use of random numbers generated by computer. Each patient completed the questionnaire twice within a mean interval of 3.3 (range 2–9) days. The self-administered questionnaire was given at the second visit by a physical therapist (AR) just before the beginning of the rehabilitation program. No specific treatment for the shoulder was given between the 2 evaluations, and all these patients reported no change in functional status at the second visit. All the variables in this subgroup of patients can be considered, after a Kolmogorov–Smirnov test, to be normally distributed. Test–retest reliability was assessed with both the intraclass correlation coefficient (ICC), with a 2-way random-effects model (Shrout and Fleiss, 1979), and the Bland and Altman (1986) method. Internal consistency of the F-QuickDASH-D/S scale was assessed with the Cronbach’s alpha coefficient.
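The three reliability statistics named above can be sketched with the Python standard library. Several things here are assumptions rather than details given in the text: the ICC is computed in its single-measure form, ICC(2,1) in the Shrout and Fleiss notation, because the paper specifies only a 2-way random-effects model; differences are taken as first visit minus second visit; and all function names are illustrative. The sample numbers are invented for the demonstration and are not study data.

```python
from statistics import mean, stdev, variance
from typing import Sequence, Tuple


def icc_2_1(scores: Sequence[Sequence[float]]) -> float:
    """ICC(2,1): rows are subjects, columns are measurement occasions."""
    n, k = len(scores), len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    bms = ss_rows / (n - 1)  # between-subjects mean square
    jms = ss_cols / (k - 1)  # between-occasions mean square
    ems = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))  # residual
    return (bms - ems) / (bms + (k - 1) * ems + k * (jms - ems) / n)


def bland_altman(first: Sequence[float],
                 second: Sequence[float]) -> Tuple[float, float, float]:
    """Return (bias, lower limit, upper limit) with limits at bias +/- 1.96 SD."""
    diffs = [a - b for a, b in zip(first, second)]
    bias, sd = mean(diffs), stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def cronbach_alpha(items: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha: rows are respondents, columns are scale items."""
    k = len(items[0])
    item_vars = [variance([row[j] for row in items]) for j in range(k)]
    total_var = variance([sum(row) for row in items])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical test and retest scores for four patients (not study data).
test_scores = [50.0, 40.0, 60.0, 45.0]
retest_scores = [48.0, 35.0, 58.0, 39.0]
print(icc_2_1(list(zip(test_scores, retest_scores))))
print(bland_altman(test_scores, retest_scores))
```

ICC(2,1) is an absolute-agreement coefficient, so a constant shift between visits lowers it slightly even when patients keep exactly the same rank order. The Bland and Altman bias reports that shift directly.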

2.3.4. Construct validity

Construct validity of the F-QuickDASH-D/S was investigated on the whole sample of patients. Convergent construct validity was assessed by correlating the questionnaire scores with scores on variables supposedly assessing similar dimensions or concepts (Poiraudeau et al., 2001; Lefevre-Colau et al., 2003; Fermanian, 2005). We hypothesized that the F-QuickDASH-D/S score would have (1) strongest association with ADL score and perceived handicap; (2) moderate association with ROM, strength, pain at rest, and pain during activities; and (3) weakest association with age, disease duration and body mass index. Because a normal distribution could not be demonstrated for all parameters studied, the nonparametric Spearman rank coefficient (r) was used to assess the correlation between 2 quantitative variables. Spearman’s correlation was interpreted as excellent (>0.91), good (0.90–0.71), moderate (0.70–0.51), fair (0.50–0.31), or little or no correlation (<0.31) (Fermanian, 1984). Principal component analysis was used to extract factors. Then, independent factors were obtained by use of the varimax rotation method, an orthogonal rotation method applied to the initial factorial solution, to minimize the number of variables with high loading in each factor. Retained factors had eigenvalues > 1. Eigenvalues are obtained by matrix algebra and represent the part of the whole variation of the data that can be attributed to each factor.

2.3.5. Responsiveness

Responsiveness was analyzed in a subgroup of patients treated with a corticosteroid injection followed by a supervised 5-session program of physical therapy and a self-management program of rehabilitation at home. At this stage, data for 26 patients were analyzed. These patients had rotator cuff tendinopathies with subacromial bursitis (n = 8), frozen shoulder (n = 8), or osteoarthritis (n = 10). All the variables in this subgroup of patients can be considered, after a Kolmogorov–Smirnov test, to be normally distributed. Distribution-based responsiveness statistics were computed by use of the standardized response mean (SRM) and the effect size (ES) (Fortin et al., 1995). Values <0.50, 0.50–0.80, and >0.80 were considered to represent small, moderate, and large degrees of responsiveness, respectively (Husted et al., 2000). The relation between the change in the F-QuickDASH score and change in patient’s perceived handicap (reflecting the overall perceived patient improvement) was studied by use of Pearson correlation to establish the longitudinal construct validity of the F-QuickDASH.

3. Results

3.1. Demographic and clinical data

Demographic and clinical characteristics are shown in Table 1. Sixty-five patients had rotator cuff tendinopathies, 32 frozen shoulder, 25 osteoarthritis and 31 fractures of the humeral head. The fracture group was a distinct group: its onset period was much shorter than that of the others, and its pain and disability scores were lower. Data for 2 patients were excluded because of more than 1 unanswered item on the questionnaire. Four patients did not respond to item 6, related to recreational activities, and 2 to item 2, related to heavy household chores. No item had a floor or ceiling effect. No patient recorded the minimum disability score of 0 on the F-QuickDASH-D/S scale, which would represent the maximum health status score (ceiling), and none recorded the maximum disability score of 100, which would represent the minimum health status score (floor).

Table 1. Demographic and clinical characteristics of 153 patients and patients by disorder, for whom the F-QuickDASH-D/S was validated.

| Variables | Whole group (n = 153) | Rotator cuff tendinopathies (n = 65) | Frozen shoulder (n = 32) | Osteoarthritis (n = 25) | Fractures of the humeral head (n = 31) |
| --- | --- | --- | --- | --- | --- |
| Age (years) | 57.0 (23–89) | 57.0 (27–85) | 49.0 (30–69) | 70.0 (56–86) | 57.0 (23–89) |
| Sex, F (%) | 100 (65.4) | 40 (61.5) | 22 (68.7) | 18 (72.0) | 20 (64.5) |
| Body mass index (kg/m2) | 24.6 (16.4–42.1) | 25.3 (16.4–33.2) | 23.5 (17.1–42.1) | 26.8 (19.0–34.7) | 23.4 (18.6–37.6) |
| Disease duration (months) | 9.0 (2–180) | 10.0 (2–120) | 11.5 (2–34) | 36.0 (3–180) | 3.0 (2–12) |
| Pain score at rest (VAS, 0–100 mm) | 9.0 (0–82) | 18.5 (0–82) | 8.0 (0–72) | 23.0 (0–63) | 0 (0–16) |
| Pain score during activities (VAS, 0–100 mm) | 54.0 (0–100) | 60.0 (0–100) | 58.5 (0–92) | 70.0 (11–100) | 29.0 (0–72) |
| Perceived handicap (VAS, 0–100 mm) | 50.0 (0–90) | 50.0 (12–90) | 50.0 (20–80) | 50.0 (10–85) | 20.0 (0–80) |
| F-DASH-D/S score (range 0–100) | 48.3 (5.0–87.5) | 52.5 (5.8–85.8) | 50.8 (20.0–87.5) | 45.8 (27.5–80) | 20.7 (5.0–77.5) |
| F-QuickDASH-D/S score (range 0–100) | 50.0 (2.3–88.6) | 54.5 (2.3–81.8) | 51.1 (18.2–88.6) | 50.0 (20.5–79.5) | 21.6 (6.8–70.5) |
| ADL (range 0–20) | 12 (3–20) | 11 (4–18) | 10 (3–19) | 11 (3–17) | 17 (6–20) |
| ROM (range 0–40) | 28 (0–40) | 28 (10–40) | 17 (8–34) | 22 (0–38) | 32 (0–40) |
| Strength (range 0–25) | 5 (0–17) | 4 (0–16) | 5 (2–17) | 3 (0–16) | 9 (0–14) |

Values are median (min–max), unless indicated. VAS: visual analog scale; F-DASH-D/S: French version of the Disability of the Arm, Shoulder and Hand questionnaire Disability/Symptom scale (30 items); F-QuickDASH-D/S: short version of F-DASH-D/S (11 items); ADL: subjective assessment of activities of daily living; ROM: active range of motion; and strength: measurement of abduction strength.

3.2. Reliability

Internal consistency was high, with a Cronbach’s alpha coefficient of 0.89. Test–retest reliability was analyzed for 42 of the patients (62% women) with mean age 59 ± 13.9 years (range 25–85 years). The F-QuickDASH-D/S scores at the first and the second visit were 48.3 ± 18.1 and 44.9 ± 20.1 (p = 0.001), respectively. Test–retest reliability analysis gave an ICC of 0.94 (95% confidence interval, 0.87–0.97), indicating excellent reliability. Bland and Altman analysis revealed test–retest results not strictly centered (mean 3.4 ± 6.0), but no systematic trend was observed (r = 0.29). The limits of agreement were −8.4 to 15.2 (Fig. 1).
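As a quick check on these figures, the limits of agreement can be recomputed from the reported bias and standard deviation. The sketch assumes the usual Bland and Altman definition (mean difference ± 1.96 SD) and reads the reported "3.4" and "6.0" as the mean and SD of the differences between the first and second visits:

```latex
\[
\bar{d} \pm 1.96\,s_d \;=\; 3.4 \pm 1.96 \times 6.0 \;=\; 3.4 \pm 11.76
\quad\Longrightarrow\quad [-8.36,\ 15.16] \approx [-8.4,\ 15.2]
\]
```

The lower limit is therefore negative, which agrees with the −1.96 SD line lying below zero in Fig. 1.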

3.3. Construct validity

The scale had good convergent validity with perceived handicap and ADL scores; moderate correlation with scores for ROM, strength, pain at rest, and pain during activities; fair correlation with disease duration; and little correlation with age and body mass index. Furthermore, the F-QuickDASH-D/S scale had excellent correlation with the full-length F-DASH-D/S score (r = 0.96) (Table 2). Principal component analysis extracted 2 factors, explaining 59.1% of the variance. Varimax rotation showed that the first factor comprises 7 items of ADL and the second factor comprises 4 items, 3 related to pain. The loading of each item after varimax rotation is given in Table 3.
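The correlation analysis behind these results can be illustrated with a short Python sketch of the Spearman rank coefficient together with the interpretation bands quoted from Fermanian (1984). The function names are illustrative. The handling of band edges is an assumption: coefficients are rounded to two decimals and compared in absolute value, because the published bands leave small gaps (for example between 0.90 and 0.91) and do not mention negative coefficients.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of the 1-based ranks start+1..end+1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman coefficient: the Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def interpret(rho: float) -> str:
    """Map a coefficient to the bands used in the text (edge handling assumed)."""
    r = round(abs(rho), 2)
    if r >= 0.91:
        return "excellent"
    if r >= 0.71:
        return "good"
    if r >= 0.51:
        return "moderate"
    if r >= 0.31:
        return "fair"
    return "little or no correlation"


# Coefficients reported in Table 2 for a few of the variables.
for name, r in [("F-DASH-D/S", 0.96), ("perceived handicap", 0.79),
                ("ROM", 0.51), ("disease duration", 0.38), ("age", 0.22)]:
    print(name, interpret(r))
```

Applied to the Table 2 coefficients, the bands reproduce the wording used in the text: excellent for the full-length score, good for perceived handicap and ADL, moderate for ROM, strength and pain, fair for disease duration, and little correlation for age and body mass index.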

[Fig. 1. Bland and Altman plot of test–retest scores in analysis of the F-QuickDASH-D/S for shoulder disorders. x-axis: mean of the 2 scores (0 to 100); y-axis: differences between scores (−20 to 20); horizontal lines at +1.96 SD and −1.96 SD.]

Table 2. Correlation of F-QuickDASH-D/S with other variables (n = 153).

| Variable | Spearman correlation coefficient (r) |
| --- | --- |
| F-DASH-D/S score (range 0–100) | 0.96 |
| Perceived handicap score (VAS, 0–100 mm) | 0.79 |
| ADL score (range 0–20) | 0.73 |
| Pain score during activities (VAS, 0–100 mm) | 0.63 |
| Strength score (range 0–25) | 0.58 |
| Pain score at rest (VAS, 0–100 mm) | 0.57 |
| ROM score (range 0–40) | 0.51 |
| Disease duration | 0.38 |
| Age | 0.22 |
| Body mass index | 0.15 |

QuickDASH-D/S: short version of the Disability of the Arm, Shoulder and Hand questionnaire (11 items); VAS: visual analog scale; ADL: activities of daily living; and ROM: range of motion.

Table 3. Factor loading of principal components of the F-QuickDASH-D/S (the highest loading of each item is marked with an asterisk).

| Item | Factor 1 | Factor 2 |
| --- | --- | --- |
| 1 | 0.766* | 0.086 |
| 2 | 0.834* | 0.215 |
| 3 | 0.823* | 0.228 |
| 4 | 0.562* | 0.359 |
| 5 | 0.579* | 0.338 |
| 6 | 0.745* | 0.239 |
| 7 | 0.347 | 0.663* |
| 8 | 0.671* | 0.366 |
| 9 | 0.339 | 0.772* |
| 10 | 0.065 | 0.690* |
| 11 | 0.239 | 0.727* |

F-QuickDASH-D/S: French short version of the Disability of the Arm, Shoulder and Hand questionnaire.

3.4. Responsiveness

Responsiveness was analyzed in a subgroup of 26 patients (20 women, mean age of 58.2 ± 11.0 years) treated with a corticosteroid injection followed by physiotherapy (Table 4). All patients were evaluated twice, at baseline and at a mean of 7.8 ± 3.9 weeks after treatment. Twenty-four patients reported improvement, 1 reported unchanged clinical status and another reported deteriorated clinical status after treatment. The mean F-QuickDASH-D/S score decreased significantly (48.4 ± 15.8 vs 29.0 ± 16.4, paired t-test, p < 0.0001), with SRM and ES values of 1.09 and 1.23, respectively, which indicates a large degree of sensitivity of the instrument to the clinical improvement (Husted et al., 2000). The mean patient perceived handicap score decreased significantly (51.1 ± 19.4 vs 27.7 ± 23.0, paired t-test, p < 0.0001). The correlation between change in patient’s perceived handicap and change in QuickDASH-D/S score was moderate (r = 0.57).
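The two responsiveness statistics can be sketched as follows. The definitions used are the conventional ones and are assumed here, since the text cites them without spelling them out: SRM is the mean change divided by the standard deviation of the change, and ES is the mean change divided by the standard deviation at baseline. Change is taken as baseline minus follow-up so that improvement on a disability scale is positive. The function names are illustrative.

```python
from statistics import mean, stdev
from typing import Sequence


def standardized_response_mean(baseline: Sequence[float],
                               follow_up: Sequence[float]) -> float:
    """SRM: mean change divided by the SD of the individual changes."""
    changes = [b - f for b, f in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


def effect_size(baseline: Sequence[float], follow_up: Sequence[float]) -> float:
    """ES: mean change divided by the SD of the baseline scores."""
    changes = [b - f for b, f in zip(baseline, follow_up)]
    return mean(changes) / stdev(baseline)


def effect_size_from_summary(mean_baseline: float, mean_follow_up: float,
                             sd_baseline: float) -> float:
    """ES from published summary statistics instead of patient-level data."""
    return (mean_baseline - mean_follow_up) / sd_baseline


# Summary values reported for the 26 patients: 48.4 (SD 15.8) vs 29.0.
print(round(effect_size_from_summary(48.4, 29.0, 15.8), 2))  # 1.23
```

Under these definitions the reported means and baseline SD reproduce the published ES of 1.23. The SRM cannot be recomputed in the same way because the SD of the individual change scores is not reported.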

Table 4. Demographic and clinical characteristics of 26 patients included in the study of responsiveness of the F-QuickDASH-D/S.

| Variables | Value |
| --- | --- |
| Age (years) | 58.2 (11.0) |
| Sex, F (%) | 20 (76.9) |
| Disease duration (months) | 25.7 (39.4) |
| Pain score during activities (VAS, 0–100 mm) | 57.8 (22.0) |
| Perceived handicap (VAS, 0–100 mm) | 51.1 (19.4) |
| F-QuickDASH-D/S score (range 0–100) | 48.4 (15.8) |

Values are mean (SD), unless indicated. F-QuickDASH-D/S: French short version of the Disability of the Arm, Shoulder and Hand questionnaire.

4. Discussion

Our results strongly suggest that the F-QuickDASH-D/S scale can be used for evaluating shoulder conditions. The reliability and internal consistency of the F-QuickDASH-D/S scale were similar to those of the original (English) version (ICC 0.94 and Cronbach’s alpha 0.89 vs 0.94 and 0.94, respectively; Beaton et al., 2005). Although our ICC for test–retest can be considered excellent, graphic representation of the test–retest scores by the Bland and Altman method revealed that, despite a marginal number of outliers (2.4%), the scores were not centered, although no systematic trend was observed. The Bland and Altman method revealed the following:

(A) The test–retest results were significantly different. This can be explained by the fact that the 2 observations (test and retest) were not independent, probably because knowledge of the first measurement affected the second measurement. Thus, we have a bias between the 2 series of measures. This phenomenon is frequently observed in test–retest studies (Nunnally and Bernstein, 1994). We note that this bias (mean 3.4), although significantly different from 0, is small. In the current study, we therefore measured, according to the terminology of Bland and Altman, not the repeatability of the test–retest results, which supposes independent observations, but, rather, their agreement, which is possible to evaluate in the presence of our small bias (Bland and Altman, 1999).

(B) The limits of agreement were −8.4 and 15.2. Thus, 95% of the differences in test–retest results could be expected to fall within this range, which could be considered clinically unimportant. To our knowledge, the minimal important difference for the QuickDASH score has not been published. However, the parameter computed for the DASH score was 12.6 (Schmitt and Di Fabio, 2004). We found a difference between measurements of greater than 12.6 for only 2 patients (4.8%) in our subgroup (n = 42). Thus, our results for differences between test and retest were in a clinically acceptable range.

Because no criterion standard exists to assess functional disability (Guyatt et al., 1993), we assessed construct validity. The F-QuickDASH-D/S scale has good correlation with ADL score and patient perceived handicap, which reflects its ability to measure shoulder disability. The F-QuickDASH-D/S scale has shown similar construct validity to the Japanese and English short versions (Beaton et al., 2005; Imaeda et al., 2006) as well as to the original full-length version (Beaton et al., 2001). In addition, the F-QuickDASH-D/S showed an excellent correlation with the full F-DASH-D/S (r = 0.96), as observed in the study by Beaton et al. (2005). These findings suggest that the F-QuickDASH-D/S scale should give a view of disability that is relatively similar to that provided by the full-length DASH.

We performed principal component factor analysis for our sample of 153 patients. No consensus exists on the minimum number of subjects needed to perform principal component analysis. A minimum of 100–300 subjects has been proposed as necessary (Comrey, 1973; Kline, 1993), or 5–10 times the number of variables (Nunnally and Bernstein, 1994; Streiner, 1994). Principal component analysis of the F-QuickDASH-D/S scale revealed 1 major factor accounting for 48.4% of the total variance, which was consistent with results of the Japanese version (Imaeda et al., 2006). The current study is the first to provide results of factor analysis with varimax rotation of the QuickDASH-D/S and revealed 2 independent factors explaining 59.1% of the total variance. All items retained in each factor have a high loading (>0.5) in 1 factor and weak loading in the other. The assignment of item 7 is problematic because this item, representing social activities, would be more clinically relevant in factor 1, whereas it showed high loading in factor 2. That factors could be easily identified after varimax rotation reinforces clinically the robustness of the factorial structure of the scale.

We used exploratory and not confirmatory analysis to assess the factorial structure of the F-QuickDASH-D/S scale. Confirmatory analysis is considered more appropriate if the aim of a study is to confirm the existing second-order single-factor structure of the F-QuickDASH-D/S scale (de Vet et al., 2005). However, exploratory analysis is considered appropriate if the aim of the study is to examine the factor structure of the scale in a population or language in which the QuickDASH has not yet been evaluated (de Vet et al., 2005). Because the factorial structure of the QuickDASH-D/S scale in shoulder disorders is unknown in the French population, we considered that exploratory analysis was relevant. Imaeda et al. (2006) performed principal component analysis without varimax rotation. The authors also found 2 factors; the first had an eigenvalue of 5.12, which explained 47% of the total variance of the QuickDASH-D/S. The second factor had an eigenvalue of 1.74. The authors stated that ‘‘The unidimensionality was found to be strong as a result of a substantial difference between the first and the second factors’’.

The responsiveness of the F-QuickDASH-D/S shows that the scale has excellent ability to detect clinically meaningful changes in disability over time in patients with degenerative shoulder disorders after corticosteroid injection and physical treatment. As far as we know, this is the first study of the responsiveness of the QuickDASH-D/S scale in medical shoulder disorders. The sensitivity statistics of the F-QuickDASH-D/S scale are similar to those of the original version (Beaton et al., 2005), with higher SRM than was found for the full-length scale after arthroscopic acromioplasty in 25 patients (Gummesson et al., 2003). Although SRM and ES are useful indicators of the amount of change, these sensitivity statistics lack discriminative power (Fortin et al., 1995). Thus, relevant changes must be assessed by comparing the change score with an external indicator of change such as self-perceived handicap. In the current study, the correlation between change in patient’s perceived handicap and change in QuickDASH-D/S score was moderate.

We studied the psychometric properties of the F-QuickDASH scale in patients with various shoulder pathologies: rotator cuff tendinopathies, frozen shoulder, osteoarthritis and proximal humeral fracture. As far as we know, the current study was the first to evaluate this self-administered questionnaire in non-operative proximal humerus fracture, although this condition is common and causes prolonged disability. The fracture group was characterized by a shorter duration of symptoms and lower pain and disability scores than other groups. This particularity may be explained by the fact that patients in this group were seen after bone healing. Nevertheless, our study shows the applicability of the F-QuickDASH in patients with medical and traumatic shoulder disorders. The validation of this short version of the DASH outcomes tool may help clinicians and physical therapists by facilitating the monitoring of disability and dependence of their patients.

Our study is limited by the fact that we had to extract the QuickDASH item responses from the full-length DASH questionnaire for the psychometric testing of the scale. This use of data may constitute a bias: patients’ responses to the 11 items might have been different if only the QuickDASH had been administered. Thus, our results could lead to an overestimation of the similarity between the short and full-length scales (Haavardsholm et al., 2000). This problem is inherent to many studies validating the short versions of scales (Gummesson et al., 2006; Imaeda et al., 2006; Baron et al., 2007). Two other limitations warrant acknowledgment. First, because all patients were recruited in a tertiary care centre, the results may not be generalizable to a primary care setting. Second, the results of the current study are limited to shoulder disorders and cannot be generalized to other upper extremity disorders.

5. Conclusion

The F-QuickDASH-D/S scale is a reliable, valid and responsive instrument for assessing disability in common shoulder disorders. Its psychometric properties are comparable to those of the full-length version of this scale. Therefore, the QuickDASH-D/S could be the preferred scale because it is easier and quicker to use.

Acknowledgement

The authors thank the patients who participated in the study and the technical staff of the Department of Rehabilitation Medicine, Cochin Hospital, Paris, France, for their help with data collection.

References

Atroshi I, Gummesson C, Andersson B, Dahlgren E, Johansson A. The disabilities of the arm, shoulder and hand (DASH) outcome questionnaire: reliability and validity of the Swedish version evaluated in 176 patients. Acta Orthopaedica Scandinavica 2000;71:613–8.
Baron G, Tubach F, Ravaud P, Logeart I, Dougados M. Validation of a short form of the Western Ontario and McMaster Universities Osteoarthritis Index function subscale in hip and knee osteoarthritis. Arthritis and Rheumatism 2007;57:633–8.
Beaton DE, Katz JN, Fossel AH, Wright JG, Tarasuk V, Bombardier C. Measuring the whole or the parts? Validity, reliability, and responsiveness of the disabilities of the arm, shoulder and hand outcome measure in different regions of the upper extremity. Journal of Hand Therapy 2001;14:128–46.
Beaton DE, Wright JG, Katz JN. Upper Extremity Collaborative Group. Development of the QuickDASH: comparison of three item-reduction approaches. The Journal of Bone and Joint Surgery American Volume 2005;87:1038–46.
Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. The Lancet 1986;1:307–10.
Bland JM, Altman DG. Measuring agreement in method comparison studies. Statistical Methods in Medical Research 1999;8:135–60.
Bot SD, Terwee CB, van der Windt DA, Bouter LM, Dekker J, de Vet HC. Clinimetric evaluation of shoulder disability questionnaires: a systematic review of the literature. Annals of the Rheumatic Diseases 2004;63:335–41.
Comrey AL. A first course in factor analysis. New York: New York Academic Press; 1973.
Constant CR, Murley AHG. A clinical method for the functional assessment of the shoulder. Clinical Orthopaedics and Related Research 1987;214:160–4.
Dubert T, Voche P, Dumontier C, Dinh A. The DASH questionnaire. French translation of a trans-cultural adaptation. Chirurgie de la Main 2001;20:294–302.
Fayad F, Mace Y, Lefevre-Colau MM. Shoulder disability questionnaires: a systematic review. Annales de Réadaptation et de Médecine Physique 2005;48:298–306.
Feleus A, Bierma-Zeinstra SM, Miedema HS, Bernsen RM, Verhaar JA, Koes BW. Incidence of non-traumatic complaints of arm, neck and shoulder in general practice. Manual Therapy 2008;13:426–33.
Fermanian J. Measuring agreement between 2 observers: a quantitative case. Revue Epidémiologique et Santé Publique 1984;32:408–13.
Fermanian J. Validation of assessment scales in physical medicine and rehabilitation: how are psychometric properties determined? Annales de Réadaptation et de Médecine Physique 2005;48:281–7.
Fortin PR, Stucki G, Katz JN. Measuring relevant change: an emerging challenge in rheumatologic clinical trials. Arthritis and Rheumatism 1995;38:1027–30.
Gabel CP, Michener LA, Burkett B, Neller A. The upper limb functional index: development and determination of reliability, validity, and responsiveness. Journal of Hand Therapy 2006;19:328–48.
Gummesson C, Atroshi I, Ekdahl C. The disabilities of the arm, shoulder and hand (DASH) outcome questionnaire: longitudinal construct validity and measuring self-rated health change after surgery. BMC Musculoskeletal Disorders 2003;4:11.
Gummesson C, Ward MM, Atroshi I. The shortened disabilities of the arm, shoulder and hand questionnaire (QuickDASH): validity and reliability based on responses within the full-length DASH. BMC Musculoskeletal Disorders 2006;7:44.
Guyatt GH, Feeny DH, Patrick DL. Measuring health-related quality of life. Annals of Internal Medicine 1993;118:622–9.
Haavardsholm EA, Kvien TK, Uhlig T, Smedstad LM, Guillemin F. A comparison of agreement and sensitivity to change between AIMS2 and a short form of AIMS2 (AIMS2-SF) in more than 1,000 rheumatoid arthritis patients. Journal of Rheumatology 2000;27:2810–6.
Hudak PL, Amadio PC, Bombardier C. Development of an upper extremity outcome measure: the DASH (disabilities of the arm, shoulder and hand). American Journal of Industrial Medicine 1996;29:602–8.
Husted JA, Cook RJ, Farewell VT, Gladman DD. Methods for assessing responsiveness: a critical review and recommendations. Journal of Clinical Epidemiology 2000;53:459–68.
Imaeda T, Toh S, Wada T, Uchiyama S, Okinaga S, Kusunose K, et al. Validation of the Japanese society for surgery of the hand version of the quick disability of the arm, shoulder, and hand (QuickDASH-JSSH) questionnaire. Journal of Orthopaedic Science 2006;11:248–53.
Kline P. The handbook of psychological testing. London/New York, NY: Routledge; 1993.
Lee EW, Chung MM, Li AP, Lo SK. Construct validity of the Chinese version of the disabilities of the arm, shoulder and hand questionnaire (DASH-HKPWH). Journal of Hand Surgery (European Volume) 2005;30:29–34.
Lefevre-Colau MM, Poiraudeau S, Oberlin C, Demaille S, Fermanian J, Rannou F, et al. Reliability, validity, and responsiveness of the modified Kapandji index for assessment of functional mobility of the rheumatoid hand. Archives of Physical Medicine and Rehabilitation 2003;84:1032–8.
Linsell L, Dawson J, Zondervan K, Rose P, Randall T, Fitzpatrick R, et al. Prevalence and incidence of adults consulting for shoulder conditions in UK primary care; patterns of diagnosis and referral. Rheumatology 2006;45:215–21.
Matheson LN, Melhorn JM, Mayer TG, Theodore BR, Gatchel RJ. Reliability of a visual analog version of the QuickDASH. The Journal of Bone and Joint Surgery American Volume 2006;88:1782–7.
Nunnally JC, Bernstein IH. Psychometric theory. 3rd ed. New York: McGraw-Hill; 1994.
Offenbaecher M, Ewert T, Sangha O, Stucki G. Validation of a German version of the disabilities of arm, shoulder, and hand questionnaire (DASH-G). The Journal of Rheumatology 2002;29:401–2.
Padua R, Padua L, Ceccarelli E, Romanini E, Zanoli G, Amadio PC, et al. Italian version of the disability of the arm, shoulder and hand (DASH) questionnaire. Cross-cultural adaptation and validation. Journal of Hand Surgery (European Volume) 2003;28:179–86.
Poiraudeau S, Chevalier X, Conrozier T, Flippo RM, Liote F, Noel E, et al. Reliability, validity, and sensitivity to change of the Cochin hand functional disability scale in hand osteoarthritis. Osteoarthritis and Cartilage 2001;9:570–7.
Rekola KE, Keinanen-Kiukaanniemi S, Takala J. Use of primary health services in sparsely populated country districts by patients with musculoskeletal symptoms: consultations with a physician. Journal of Epidemiology and Community Health 1993;47:153–7.
Schmitt JS, Di Fabio RP. Reliable change and minimum important difference (MID) proportions facilitated group responsiveness comparisons using individual threshold criteria. Journal of Clinical Epidemiology 2004;57:1008–18.
Shrout PE, Fleiss JL. Intraclass coefficients: uses in assessing rater reliability. Psychological Bulletin 1979;86:420–8.
Streiner DL. Figuring out factors: the use and misuse of factor analysis. Canadian Journal of Psychiatry 1994;39:135–40.
de Vet HC, Ader HJ, Terwee BC, Pouwer F. Are factor analytical techniques used appropriately in the validation of health status questionnaires? A systematic review on the quality of factor analysis of the SF-36. Quality of Life Research 2005;14:1203–18.
Ware Jr JE, Keller SD, Gandek B, Brazier JE, Sullivan M. Evaluating translations of health status questionnaires. Methods from the IQOLA project. International Quality of Life Assessment. International Journal of Technology Assessment in Health Care 1995;11:525–51.


Manual Therapy 14 (2009) 213–221 www.elsevier.com/math

Original Article

Inter- and intra-examiner reliability of single and composites of selected motion palpation and pain provocation tests for sacroiliac joint*

Amir Massoud Arab a,*, Iraj Abdollahi a, Mohammad Taghi Joghataei b, Zahra Golafshani c, Anoshirvan Kazemnejad d

a Department of Physical Therapy, University of Social Welfare and Rehabilitation Sciences, Evin, Koodakyar Avenue, P.O. Box 19834, Tehran, Iran
b Iran University of Medical Sciences, Tehran, Iran
c University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
d Department of Biostatistics, School of Medical Sciences, Tarbiat Modarres University, Tehran, Iran

Received 14 March 2007; received in revised form 2 February 2008; accepted 7 February 2008

Abstract

The sacroiliac joint (SIJ) has been implicated as a potential source of low back and buttock pain. Several types of motion palpation and provocation tests are used to examine the SIJ. It has been suggested that a cluster of motion palpation or provocation tests is a more acceptable method of assessing the SIJ than a single test. This study examined the inter- and intra-examiner reliability of single tests and of composites of motion palpation and provocation tests together. Twenty-five patients between the ages of 20 and 65 years participated. Four motion palpation and three provocation tests were performed three times on both sides (left, right) by two examiners. The kappa coefficient and the prevalence-adjusted and bias-adjusted kappa (PABAK) were calculated to evaluate reliability. PABAK for intra- and inter-examiner reliability of individual tests ranged from 0.36 to 0.84 (95% CI: −0.22 to 1.12) and 0.52 to 0.84 (95% CI: −0.18 to 1.08), respectively, which is considered fair to substantial. PABAK for intra- and inter-examiner reliability of clusters of motion palpation or provocation tests ranged from 0.44 to 0.92 (95% CI: −0.36 to 1.2), which is considered moderate to excellent reliability. PABAK for intra- and inter-examiner reliability of composites of motion palpation and provocation tests ranged from 0.44 to 1.00 (95% CI: −0.22 to 1.12) and 0.52 to 0.92 (95% CI: −0.02 to 1.32), respectively, which is considered substantial to excellent. It seems that composites of motion palpation and provocation tests together have reliability sufficiently high for use in clinical assessment of the SIJ.

© 2008 Elsevier Ltd. All rights reserved.

Keywords: Sacroiliac joint; Reliability; Low back pain; Test

* This research was reviewed and approved by the Human Subject Committee at the University of Social Welfare and Rehabilitation Sciences.
* Corresponding author. Tel./fax: +98 21 22418746 (Office). E-mail addresses: [email protected], amarab@uswr.ac.ir (A.M. Arab).

1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.02.004

1. Introduction

Low back pain (LBP) is one of the most frequent musculoskeletal complaints in today's societies. Epidemiologic studies have indicated a lifetime prevalence of 70–80% in the western population (Ehrlich, 2003). Several factors have been associated with the development of LBP. The sacroiliac joint (SIJ) has been implicated as a potential source of pain in the low back and buttock with or without lower extremity symptoms (Fortin et al., 1994a,b; Schwarzer et al., 1995; Slipman et al., 2001). Schwarzer et al. (1995) reported a 13–30% prevalence of SIJ pain in LBP patients. A wide variety of diagnostic tests are used to evaluate the SIJ in patients with LBP. These tests are classified into three categories: (1) motion palpation tests to assess movement; (2) pain provocation tests to stress SIJ structures; and (3) tests designed to assess the location and relative symmetry of SIJ landmarks. Pain provocation tests attempt to assess whether or not the structure being stressed is a source of pain, while motion palpation tests may be used to assess SIJ dysfunction. In clinical practice, treatment should be based on an accurate diagnosis. Many SIJ tests could be influenced by various structures in the low back, hip and other tissues, so the tests might lose their precision (Maigne et al., 1996). For diagnostic tests to yield meaningful results in clinical practice, they should be both valid and reproducible. Previous studies of individual SIJ tests indicate that inter-examiner reliability is poor for motion palpation tests and ranges from poor to excellent for provocation tests (Potter and Rothstein, 1985; Laslett and Williams, 1994; Vincent-Smith and Gibbons, 1999). Current evidence suggests that a single test is not reliable enough to be used for diagnosing SIJ pain or dysfunction, whereas the use of a cluster of tests (combining the results of a number of tests) is a more acceptable method (Cibulka et al., 1988; Haas, 1991; Kokmeyer et al., 2002; Laslett et al., 2005; Robinson et al., 2007). Review of the literature revealed some limitations in previous studies of the reliability of SIJ tests. Potter and Rothstein (1985) reported only percent agreement and did not calculate kappa; therefore, their results were not corrected for chance agreement. Some studies were unclear as to when the performed tests were positive (Dreyfuss et al., 1996).
In some studies tests were classified as positive or negative without reference to a particular side (Cibulka and Koldehoff, 1999). In most previous studies motion palpation and provocation tests have been evaluated separately, and test clusters are combinations of either motion palpation or provocation tests. Many clinicians use combinations of motion palpation and provocation tests, yet the studies evaluating reliability and validity deal with either single tests or only one of the two types (Bernard, 1997; Mooney, 1997). Utilization of different types of motion palpation and provocation tests has been reported in the literature. Some investigators have published systematic methodological reviews of studies of SIJ tests, in which articles were scored against a criteria list including study population, test procedure and results (van der Wurff et al., 2000a,b; Stuber, 2007). Among the several types of provocation tests that have been described to assess SIJ pain, we selected the Patrick–FABER test, the thigh thrust or posterior shear test and the resisted abduction test, which have acceptable levels of sensitivity and specificity, high method scores and valid authors' conclusions in systematic methodological reviews (van der Wurff et al., 2000a,b; Stuber, 2007). The inter-examiner reliability of the single thigh thrust and Patrick's tests has been reported to range from poor to excellent (Laslett and Williams, 1994; Dreyfuss et al., 1996; Strender et al., 1997), but no data were found regarding intra-examiner reliability. We also found no study that has evaluated the reliability of the resisted abduction test. Of the motion palpation procedures, we included the standing flexion test, Gillet test, sitting flexion test and prone knee flexion test (Potter and Rothstein, 1985; Cibulka and Koldehoff, 1999; Meijne et al., 1999; Vincent-Smith and Gibbons, 1999; Riddle and Freburger, 2002), which are widely used in clinics. Despite several published studies regarding the reliability of single tests and clusters of motion palpation or provocation tests, we found no study that has examined reliability for composites of motion palpation and provocation tests. With the exception of one study (Kokmeyer et al., 2002), the paradoxical effects of bias and prevalence on the kappa coefficient have not been considered in previous studies. This is important since the magnitude of kappa can be influenced by prevalence and the bias index (BI) (Byrt et al., 1993; Sim and Wright, 2005). These effects therefore need to be considered when calculating kappa so that the results can be interpreted properly. The current study examined selected pain provocation and motion palpation tests and evaluated the inter- and intra-examiner reliability of single tests and composites of tests, taking the tested side into account and including bias and prevalence effects on kappa.

2. Methods

2.1. Subjects

A total of 25 subjects aged 20–65 years presenting to the Orthopaedic and Physical Therapy Departments with LBP were selected for inclusion in the study. All subjects signed an informed consent form approved by the human subjects committee at the University of Social Welfare and Rehabilitation Sciences before participating in the study. Patients were included in the study if their reported pain was below L5, over the posterior aspect of the SIJ around the posterior superior iliac spine (PSIS) and buttock, with or without leg pain. Patients were excluded if they had only midline or symmetrical pain above the level of L5 or radicular pain with neurological signs (sensory or motor deficit) (Laslett et al., 2003, 2005; Young et al., 2003). Subjects with a history of spinal surgery, fracture of the spine, pelvis or lower extremities, hospitalization for severe trauma or car accident, leg length difference, hip/knee dysfunction, pregnancy, any systemic disease, or liver and/or kidney failure were also excluded. Two physical therapists with 1 year of experience, blinded to all patient information, tested the subjects. Both examiners were given a written description of the test procedures and instructed to practice on each other prior to examining patients.

2.2. Procedures

The patients were assigned to one of two rooms by an independent observer. The first examiner completed the tests with a subject, and the second examiner repeated the examination after a 15 min rest period. The second examiner carried out the test procedures in a random order, different from the first examination sequence. This process was repeated three times in a random order with a break of 30 min between repetitions. The first and second examiners were randomly selected and their results were blinded from each other. All seven tests were applied three times to both sides on all subjects by both examiners. Overall, 2100 measurements were taken. The procedure for each of the tests was as follows.

2.3. Patrick–FABER test

The patient lies supine on the table, and the examiner stands next to him/her. The examiner brings the ipsilateral hip into flexion, abduction and external rotation and the knee into flexion so that the heel rests on the contralateral knee. The examiner then fixates the contralateral anterior superior iliac spine (ASIS) and applies pressure to the subject's flexed knee. The test is positive when similar buttock or groin pain below L5 is reproduced (Maigne et al., 1996; Kokmeyer et al., 2002; Robinson et al., 2007).

2.4. Thigh thrust or posterior shear test

The subject lies supine on the table. The examiner flexes the hip and knee so that the hip is in approximately 90° of flexion and slight adduction and the thigh is at a right angle to the table while the knee remains relaxed. One of the examiner's hands cups the sacrum and the other arm and hand wrap around the flexed knee. The axial pressure is directed through the long axis of the femur, which causes an anterior to posterior shear to the SIJ.
The test is considered positive when familiar pain is provoked over the posterior aspect of the SIJ below L5 (Kokmeyer et al., 2002; Laslett et al., 2003, 2005).

2.5. Resisted abduction test

The subject lies supine with the leg fully extended and abducted to 30°. The examiner holds the ankle and pushes medially while the subject pushes laterally. The test is positive when similar pain is produced over the SIJ below L5 (Broadhurst and Bond, 1998).

2.6. Standing flexion test

The standing flexion test is performed by palpating the PSISs while the subject bends forward from a standing position. The test is negative if the PSISs appear to move equally and symmetrically, and positive on the side on which the PSIS moves cranially more than on the other side. A positive result in the standing flexion test indicates limited movement of the ilium on the sacrum, and therefore limited SIJ motion on the side of the superior PSIS (Potter and Rothstein, 1985; Cibulka and Koldehoff, 1999; Riddle and Freburger, 2002).

2.7. Sitting flexion test

The procedure is similar to the standing flexion test except that it is performed with the patient sitting on a level surface. The test is positive on the side on which the PSIS moves cranially more than on the other side, and negative if the PSISs move equally. A positive result in this test indicates limited movement of the sacrum on the ilium, and limited SIJ motion on the side of the superior PSIS (Potter and Rothstein, 1985; Cibulka and Koldehoff, 1999; Riddle and Freburger, 2002).

2.8. Gillet test

While the subject stands, the examiner palpates the PSISs. The subject is asked to stand on one leg while pulling the opposite knee up to the chest. The test is repeated with the other leg. On the normal side, the PSIS moves inferiorly. If the PSIS on the side on which the knee is flexed and pulled to the chest remains at the level of the other PSIS, moves down minimally or even paradoxically moves superiorly, the test is positive (Potter and Rothstein, 1985; Dreyfuss et al., 1996; Meijne et al., 1999).

2.9. Prone knee flexion test

The subject lies prone. While the examiner holds both heels, the patient's knees are passively flexed to 90°. The leg lengths are compared by examining the left and right soles of the heels in the prone and prone knees flexed positions. The test is negative if no relative change in leg length occurs between the two positions. If one leg appears shorter than the other in the prone knee extended position, apparent lengthening of the short leg in the prone knees flexed position implies a hypothesized posterior innominate rotation (Potter and Rothstein, 1985; Cibulka and Koldehoff, 1999; Riddle and Freburger, 2002).

k = kappa coefficient, SE = standard error, 95% CI = 95% confidence interval, kmax = maximum kappa coefficient, PI = prevalence index, BI = bias index, and PABAK = prevalence-adjusted and bias-adjusted kappa.

0.78 (0.14) 0.49e1.07 0.78 0.52 (0.08) 0.84 0.5 (0.26) 0.02 to 1.03 0.84 0.72 (0.04) 0.76 0.78 0.52 (0.08) 0.68 1 0.76 (0) 0.84 0.56 (0.19) 0.17e0.95 0.62 (0.25) 0.11e1.12 0.48 (0.2) 0.07e0.88 0.5 (0.22) 0.06e0.95 R L Resisted abduction

0.79 0.48 (0.04) 0.6 0.75 0.6 (0.08) 0.68

1 0.44 (0) 0.68 0.80 0.44 (0.08) 0.52 0.6 (0.18) 0.24e0.96 0.4 (0.21) 0.00e0.82 0.62 0.48 (0.12) 0.6 0.51 0.6 (0.16) 0.68 0.49 (0.2) 0.09e0.89 0.51 (0.22) 0.08e0.95 0.44 (0.19) 0.06e0.83 0.4 (0.21) 0.00e0.82 R L Thigh thrust

1 0.36 (0) 0.52 0.80 0.44 (0.08) 0.52

1 0.36 (0) 0.52 0.70 0.48 (0.12) 0.6 0.44 (0.19) 0.06e0.83 0.49 (0.2) 0.09e0.89 0.31 (0.2) 0.08 to 0.70 0.74 0.28 (0.08) 0.36 0.4 (0.21) 0.03 to 0.82 0.80 0.44 (0.08) 0.52 0.75 0.24 (0.12) 0.44 0.91 0.24 (0.04) 0.44 0.41 (0.18) 0.07e0.78 0.40 (0.19) 0.03e0.78 R L FABER

0.72 0.4 (0.12) 0.44 0.89 0.48 (0.04) 0.6 0.41 (0.18) 0.07e0.78 0.75 0.24 (0.12) 0.44 0.27 (0.25) 0.22 to 0.78 0.52 0.6 (0.16) 0.52 R L Prone knee ﬂexion

0.34 (0.21) 0.06 to 0.7 0.48 (0.2) 0.07e0.88

0.58 (0.16) 0.25e0.91 0.75 0.24 (0.12) 0.6 0.33 (0.26) 0.18 to 0.85 0.61 0.64 (0.12) 0.6

0.75 0.6 (0.08) 0.84 0.64 0.36 (0.16) 0.68 0.75 (0.16) 0.42e1.08 0.64 (0.16) 0.32e0.96 0.65 0.56 (0.12) 0.76 0.73 0.32 (0.12) 0.6 0.91 0.32 (0.04) 0.76 0.82 0.28 (0.08) 0.68 0.73 (0.14) 0.45e1.01 0.65 (0.15) 0.34e0.96 Sitting ﬂexion R L

0.65 (0.18) 0.29e1.02 0.56 (0.17) 0.21e0.9

0.51 0.6 (0.16) 0.68 0.73 0.32 (0.04) 0.6 0.51 (0.22) 0.08e0.95 0.55 (0.17) 0.2e0.9 0.6 0.64 (0.12) 0.76 0.51 0.6 (0.16) 0.68 0.6 (0.21) 0.18e1.02 0.51 (0.22) 0.08e0.95 0.89 0.48 (0.04) 0.76 0.75 0.44 (0.16) 0.68 0.68 (0.16) 0.35e1.01 0.61 (0.17) 0.27e0.96 R L Standing ﬂexion

PABAK kmax PI (BI) 95% CI

0.41 (23) 0.03e0.87 0.34 (0.21) 0.06 to 0.7

PABAK k (SE)

0.25 0.6 (0) 0.52 0.61 0.44 (0.16) 0.36

kmax PI (BI) 95% CI

0.25 (0.26) 0.2 to 0.77 0.23 (0.22) 0.2 to 0.67

PABAK k (SE) kmax PI (BI) 95% CI k (SE)

0.42 (0.22) 0.01 to 0.87 0.44 0.56 (0.12) 0.6 0.49 (0.2) 0.09e0.89 0.70 0.48 (0.12) 0.6

Inter-tester Tester 2 Side Tester 1

Twenty-five subjects (15 males and 10 females) between the ages of 20 and 65 with a mean age of 43 ± 10 years participated in the study. The subjects' mean height was 168 ± 7 cm and mean weight was 68 ± 10 kg. Table 1 presents the intra- and inter-examiner reliability estimates for each single motion palpation and provocation test used in the study. For intra- and inter-examiner reliability of individual provocation tests, PABAK ranged from 0.36 to 0.84 and 0.52 to 0.84 and kappa from 0.31 to 0.62 and 0.44 to 0.78 (95% CI: −0.08 to 1.12 and 0.06 to 1.07), respectively. For intra- and inter-examiner reliability of individual motion

Tests

3. Results

Table 1 Intra- and inter-examiner reliability of the single motion palpation and pain provocation test.

MedCalc® statistical software was used for data analysis. The kappa coefficient (k), which discounts the proportion of agreement that is expected by chance, was calculated with its 95% confidence interval to assess reliability. Although the kappa coefficient is widely used to assess reliability, there are two main paradoxes that can influence the magnitude of kappa. Thus, alongside the obtained value of kappa, it is necessary to consider the paradoxical effects of prevalence and bias on kappa for better interpretation. When raters classify cases as either positive or negative with respect to a clinical sign, a prevalence effect exists when the proportion of agreements on the positive classification differs from that on the negative classification. This can be expressed by the prevalence index (PI). When the PI is high, i.e. approaches 1.0, chance agreement is also high and kappa is reduced accordingly. Bias is the extent to which the raters disagree on the proportion of positive (or negative) cases and can be expressed by the bias index (BI). For example, in the 2 × 2 contingency table shown in Table S1, cells a and d indicate, respectively, the numbers of subjects for whom both examiners agree on negative and positive, and cells b and c indicate the numbers of subjects on whom the examiners disagree. PI is |a − d|/n, where |a − d| is the absolute value of the difference between cells a and d and n is the total number of subjects. BI is |b − c|/n. Some statisticians have devised adjustments to kappa that take account of prevalence and bias by calculating the prevalence-adjusted and bias-adjusted kappa (PABAK). Adjusting for bias is equivalent to replacing cells b and c by their average ([b + c]/2), while adjusting for prevalence is equivalent to replacing cells a and d by their average ([a + d]/2), and then calculating kappa in the usual fashion. We included bias and prevalence effects on the kappa coefficient by calculating BI, PI and PABAK values as suggested by others (Byrt et al., 1993; Hoehler, 2000; Sim and Wright, 2005). Kappa maximum (kmax) was also calculated.
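The indices defined in the data analysis can be computed directly from the four cells of a 2 × 2 agreement table. The following is a minimal sketch in Python, not the MedCalc procedure used in the study; the function name `agreement_stats` and the dictionary keys are illustrative assumptions. Cells follow the convention of the text: a and d are agreements (both negative, both positive), b is examiner 1 positive with examiner 2 negative, and c is the reverse. The example counts are those the article reports for one composite of tests (a = 23, b = 2, c = 0, d = 0).

```python
def agreement_stats(a, b, c, d):
    """Agreement statistics for a 2 x 2 table of two examiners.

    a: both negative, d: both positive (agreements)
    b: examiner 1 positive / examiner 2 negative, c: the reverse (disagreements)
    Function and key names are illustrative, not taken from the paper.
    """
    n = a + b + c + d
    if n == 0:
        raise ValueError("empty table")

    po = (a + d) / n                       # observed agreement
    # Marginal proportions of negative and positive ratings per examiner.
    neg1, pos1 = (a + c) / n, (b + d) / n
    neg2, pos2 = (a + b) / n, (c + d) / n
    pe = neg1 * neg2 + pos1 * pos2         # agreement expected by chance

    # Kappa is undefined when chance agreement is already perfect.
    kappa = (po - pe) / (1 - pe) if pe < 1 else float("nan")
    po_max = min(neg1, neg2) + min(pos1, pos2)   # best agreement the marginals allow
    kappa_max = (po_max - pe) / (1 - pe) if pe < 1 else float("nan")

    return {
        "kappa": kappa,
        "kappa_max": kappa_max,
        "PI": abs(a - d) / n,              # prevalence index
        "BI": abs(b - c) / n,              # bias index
        "PABAK": 2 * po - 1,               # prevalence- and bias-adjusted kappa
    }


# Example: 23 agreements on negative, none on positive, 2 disagreements.
stats = agreement_stats(a=23, b=2, c=0, d=0)
print({key: round(value, 2) for key, value in stats.items()})
```

With these counts the sketch gives k = 0.00, PI = 0.92, BI = 0.08 and PABAK = 0.84: observed agreement is 92%, yet kappa falls to zero because almost all ratings are negative.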

R L

2.10. Data analysis

0.88 0.56 (0.04) 0.6 0.72 0.4 (0.12) 0.44


Gillet


0.88 0.56 (0.04) 0.92 0.83 0.72 (0.04) 0.92 0.88 (0.11) 0.66e1.10 0.83 (0.16) 0.51e1.15 0.88 0.56 (0.04) 0.76 0.76 (0) 0.84 1 1 1

0.6 (0) 0.68 (0)

0.84 0.84 0.75 (0.17) 0.41e1.08 0.7 (0.2) 0.3e1.09 3 of 3 provocation R L

0.65 (0.18) 0.28e1.02 0.62 (0.25) 0.11e1.12

0.89 0.48 (0.04) 0.76 0.75 0.6 (0.08) 0.68 0.68 (0.16) 0.35e1.01 0.5 (0.22) 0.06e0.95 0.78 0.52 (0.08) 0.68 0.51 0.6 (0.16) 0.68 0.63 (0.16) 0.30e0.96 1 0.36 (0) 0.68 0.41 (0.23) 0.03 to 0.87 0.88 0.56 (0.04) 0.6 2 of 3 provocation R L

0.56 (0.19) 0.17e0.95 0.51 (0.22) 0.08e0.95

0.92 0.08 (0.04) 0.6 0.83 0.2 (0.16) 0.52 0.59 (0.16) 0.28e0.91 0.51 (0.17) 0.17e0.85 0.75 0.24 (0.12) 0.44 0.73 0.32 (0.04) 0.6 0.41 (0.18) 0.07e0.78 0.55 (0.17) 0.2e0.9 0.91 0.16 (0.04) 0.44 0.91 0.16 (0.04) 0.44 0.42 (0.18) 0.06e0.78 0.42 (0.18) 0.06e0.78 1 of 3 provocation R L

0.8 (0.04) 0.92 0.8 (0.04) 0.76 0.77 (0.21) 0.35e1.2 0.77 0.33 (0.35) 0.36 to 1.04 0.78 R L 4 of 4 motion palpation

0.77 (0.21) 0.35e1.2 0.77 0.8 (0.04) 0.92 0.46 (0.36) 0.23 to 1.17 0.46 0.84 (0.08) 0.84

0.84 (0) 0.52 (0)

0.84 0.84 0.45 (0.36) 0.26 to 1.17 1 0.78 (0.14) 0.48e1.07 1

0.6 0.64 (0.12) 0.76 0.92 0 (0.04) 0.44 0.6 (0.21) 0.18e1.02 0.44 (0.17) 0.08e0.79 R L 3 of 4 motion palpation

0.6 (0.21) 0.18e1.02 0.68 (0.14) 0.39e0.96

0.6 0.64 (0.12) 0.76 0.68 0.12 (0.16) 0.68

0.41 (0.27) 0.11 to 0.94 0.71 0.68 (0.08) 0.68 0.42 (0.18) 0.07e0.79 0.91 0.16 (0.12) 0.44

0.78 0.52 (0.08) 0.84 0.81 0.36 (0.08) 0.84 0.78 (0.14) 0.49e1.07 0.81 (0.12) 0.57e1.06 0.78 0.52 (0.08) 0.68 1 0.28 (0) 0.68 0.56 (0.19) 0.17e0.95 0.65 (0.15) 0.34e0.96 1 0.44 (0) 0.84 0.91 0.32 (0.04) 0.76 0.8 (0.13) 0.53e1.06 0.73 (0.14) 0.45e1.01 R L 2 of 4 motion palpation

PABAK

0.76 0.08 (0.12) 0.6 1 0.52 (0) 0.68

kmax PI (BI) 95% CI

0.6 (0.15) 0.29e0.91 0.56 (0.2) 0.16e0.95

PABAK k (SE)

0.69 0.12 (0.16) 0.52 0.89 0.48 (0.04) 0.76

kmax PI (BI) 95% CI

0.52 (0.16) 0.19e0.85 0.68 (0.16) 0.35e1.01

PABAK k (SE) kmax PI (BI)

0.91 0.16 (0.04) 0.44 0.89 0.48 (0.04) 0.76

Inter-tester Tester2

95% CI k (SE)

0.42 (0.18) 0.06e0.78 0.68 (0.16) 0.35e1.01 R L

The kappa values for composites of motion palpation and provocation tests together showed that reliability lies along a continuum from no agreement (k = 0) to excellent (k = 1). Although the magnitude of kappa is widely used to test reliability, its interpretation is not straightforward, as other factors can influence the magnitude of the coefficient. The main factors are prevalence and bias (Hoehler, 2000; Sim and Wright, 2005). This issue is explained in the data analysis section. As discussed, in the 2 × 2 contingency table of data from two examiners (Table S1), cells a and d indicate,

Side Tester1

4.1. Kappa and PABAK

Tests clusters

To interpret kappa values, the guidelines proposed by Landis and Koch (1977) were used. Based on the kappa values, the results of this study mostly demonstrate fair to moderate reliability for the single motion palpation and provocation tests and moderate to substantial reliability for clusters of provocation or motion palpation tests (Tables 1–3).
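The Landis and Koch (1977) benchmarks can be written as a simple lookup. This is a minimal sketch; the function name `landis_koch_label` is an illustrative assumption, and the labels are those of the original benchmark scale, so the wording used in this article (for example "excellent") may not map onto these bands exactly.

```python
def landis_koch_label(value):
    """Return the Landis and Koch (1977) strength-of-agreement label for a kappa-type value."""
    if not -1.0 <= value <= 1.0:
        raise ValueError("kappa-type coefficients lie between -1 and 1")
    if value < 0.0:
        return "poor"
    # Upper bound of each band, checked in ascending order.
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if value <= upper:
            return label


for coefficient in (0.36, 0.52, 0.68, 0.92):
    print(coefficient, landis_koch_label(coefficient))
```

The same lookup can be applied to kappa or to PABAK, as the article does for both coefficients.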

Table 2 Intra- and inter-examiner reliability for cluster of motion palpation or provocation tests.

4. Discussion

1 of 4 motion palpation

palpation tests, PABAK ranged from 0.44 to 0.76 and from 0.60 to 0.84 and kappa from 0.23 to 0.73 and from 0.33 to 0.75 (95% CI: −0.2 to 1.01 and −0.18 to 1.08), respectively. The ranges for kappa and PABAK run from the test with the lowest score to the test with the highest score. The results of the intra- and inter-examiner reliability for clusters of provocation and motion palpation tests separately are presented in Table 2. For intra- and inter-examiner reliability of clusters of provocation tests, PABAK ranged from 0.44 to 0.84 and 0.52 to 0.92 and kappa from 0.41 to 0.75 and 0.50 to 0.88 (95% CI: −0.03 to 1.08 and 0.06 to 1.1), respectively. For clusters of motion palpation tests, PABAK ranged from 0.44 to 0.92 and 0.44 to 0.84 for intra- and inter-examiner reliability, and kappa ranged from 0.41 to 0.80 and 0.33 to 0.81 (95% CI: −0.11 to 1.06 and −0.36 to 1.06), respectively. The ranges for kappa and PABAK run from the cluster with the lowest value to the cluster with the highest value. Table 3 presents the intra- and inter-examiner reliability for composites of motion palpation and provocation tests together. For intra-examiner reliability, kappa and PABAK ranged from 0.00 to 1.00 and 0.44 to 1.00 (95% CI: −1.92 to 1), and for inter-examiner reliability from 0.00 to 0.77 and 0.52 to 0.92 (95% CI: −1.32 to 1), respectively. The ranges for kappa and PABAK run from the composite with the lowest value to the composite with the highest value.

k = kappa coefficient, SE = standard error, 95% CI = 95% confidence interval, kmax = maximum kappa coefficient, PI = prevalence index, BI = bias index, and PABAK = prevalence-adjusted and bias-adjusted kappa.


Table 3 Reliability for the composites of the tests.

Tests clusters Side Tester1 k (SE)

Tester2 95% CI

R L

0.34 (0.23) 0.11 to 0.8 0.45 (0.19) 0.06e0.83

1 mp/2p

R L

1 mp/3p

PABAK k (SE)

1 0.52 (0) 0.81 0.36 (0)

0.52 0.52

Inter-tester 95% CI

kmax PI (BI)

95% CI

kmax PI

PABAK

0.68 0.6

0.33 (0.26) 0.18 to 0.85 0.61 0.64 (0.12) 0.6 0.47 (0.18) 0.11e0.84 1 0.28 (0) 0.52

0.41 (27) 0.11 to 0.94 0.71 0.68 (0.08) 0.68 0.2 (0.25) 0.03 to 0.70 0.66 0.56 (0.12) 0.44

0.33 (0.35) 0.36 to 1.04 0.78 0.8 (0.04) 0.76 0.43 (0.26) 0.07 to 0.94 0.43 0.68 (0.16) 0.68

0.41 (0.27) 0.11 to 0.94 0.71 0.68 (0.08) 0.68 0.33 (0.35) 0.36 to 1.04 0.78 0.8 (0.04) 0.76

R L

0.5 (0.26) 0.02 to 1.03 0.84 0.72 (0.04) 0.76 0.33 (0.35) 0.36 to 1.04 0.78 0.8 (0.04) 0.76

0.33 (0.35) 0.36 to 1.04 0.78 0.8 (04) 0.45 (0.36) 0.26 to 1.17 1 0.84 (0)

0.62 (0.25) 0.13e1.12 0.62 0.76 (0.08) 0.84 0.33 (0.35) 0.36 to 1.04 0.78 0.8 (0.04) 0.76

2 mp/1p

R L

0.4 (0.27) 0.13 to 0.93 1 0.63 (0.16) 0.3e0.96 1

0.00 (0.4) 0.78 to 0.78 0.00 0.5 (0.22) 0.06e0.95 0.75

2 mp/2p

R L

0.62 (0.25) 0.11e1.12 0.5 (0.22) 0.06e0.95

1 0.76 (0) 0.84 0.75 0.6 (0.08) 0.68

0.62 (0.25) 0.13e1.12 0.51 (0.26) 0.00e1.03

2 mp/3p

R L

0.62 (0.25) 0.11e1.12 0.77 (0.21) 0.35e1.2

1 0.76 (0) 0.84 0.77 0.8 (0.04) 0.92

0.77 (0.21) 0.35e1.2 0.77 0.8 (0.04) 0.92 0.45 (0.36) 0.26 to 1.17 1 0.84 (0) 0.84

0.77 (0.21) 0.35e1.2 0.77 0.8 (0.04) 0.92 0.45 (0.36) 0.26 to 1.17 1 0.84 (0) 0.84

3 mp/1p

R L

0.77 (0.21) 0.35e1.2 0.49 (0.2) 0.09e0.89

0.77 0.8 (0.04) 0.92 0.70 0.48 (0.12) 0.6

0.62 (0.25) 0.13e1.12 0.51 (0.26) 0.00e1.03

0.62 (0.25) 0.11e1.12 0.49 (0.2) 0.09e0.89

3 mp/2p

R L

1.0 (0.0) 1.0e1.0 0.62 (0.25) 0.13e1.12

1 0.84 (0) 1.0 0.62 0.76 (0.08) 0.84

0.77 0.8 (0.04) 0.92 0.77 (0.21) 0.35e1.2 0.45 (0.36) 0.26 to 1.17 1 0.84 (0) 0.84

0.77 (0.21) 0.35e1.2 0.77 0.8 (0.04) 0.92 0.64 (0.34) 0.02 to 1.32 0.64 0.88 (0.04) 0.92

3 mp/3p

R L

1.0 (.0) 1.0e1.0 1 0.45 (0.36) 0.26 to 1.17 1

1.0 0.84

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.64 (0.34) 0.02 to 1.32 0.64 0.88 (0.04) 0.92

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.45 (0.36) 0.26 to 1.17 1 0.84 (0) 0.84

4 mp/1p

R L

1.0 (0.0) 1.0e1.0 1 0.84 (0) 1.0 0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.00 (0.98) 1.92 to 1.92 0.00 0.96 (0.04) 0.92

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.64 (0.34) 0.02 to 1.32 0.64 0.88 (0.04) 0.92

4 mp/2p

R L

1.0 (0.0) 1.0e1.0 1 0.84 (0) 1.0 0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.00 (0.98) 1.92 to 1.92 0.00 0.96 (0.04) 0.92

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.64 (0.34) 0.02 to 1.32 0.64 0.88 (0.04) 0.92

4 mp/3p

R L

1.0 (0.0) 1.0e1.0 1.0 0.84 (0) 1.0 0.00 (0.98) 1.92 to 1.92 0.00 0.96 (0.04) 0.92

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.00 (0.98) 1.92 to 1.92 0.00 0.96 (0.04) 0.92

0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84 0.00 (0.67) 1.32 to 1.32 0.00 0.92 (0.08) 0.84

mp = motion palpation and p = provocation.

0.68 (0) 0.36 (0)

0.84 (0) 0.84 (0)

0.68 0.68

0.56 (0.2) 0.16 to 0.95 1 0.5 (0.19) 0.11e0.89 0.5

PABAK k (SE)

0.52 (0) 0.48 (0.2)

0.76 0.84

0.8 (0.2) 0.6 0.6 (0.08) 0.68

0.59 (0.22) 0.16e1.02 0.46 (0.19) 0.09e0.83

0.86 0.64 (0.04) 0.76 0.64 0.36 (0.16) 0.52

0.62 0.76 (0.08) 0.84 0.51 0.72 (0.12) 0.76

0.77 (0.21) 0.35e1.2 0.62 (0.25) 0.13e1.12

0.77 0.8 (0.04) 0.92 0.62 0.76 (0.08) 0.84

0.62 0.76 (0) 0.84 0.51 0.72 (0.12) 0.76

0.62 0.76 (0) 0.84 0.70 0.48 (0.12) 0.6


1 mp/1p

kmax PI (BI)


respectively, the numbers of subjects for whom both examiners agree on negative or positive, and cells b and c indicate the numbers of subjects on whom the examiners disagree (cell b: examiner 1 positive while examiner 2 negative; cell c: examiner 1 negative while examiner 2 positive) (Table S1). Considering our data, PI in examining the reliability of individual tests is not very high and the kappa and PABAK values are similar (Table 1). For clusters of provocation and motion palpation tests, however, and especially for composites of motion palpation and provocation tests, PI values are close to 1 (Tables 2 and 3), indicating that kappa is affected by prevalence. For further explanation, the raw data of the 2 × 2 contingency tables for composites of motion palpation and provocation tests are displayed in Table S2, which is available in the electronic version only. As an example, for inter-examiner reliability of the composite of four motion palpation and two provocation tests on the right side, the examiners' agreement on negative results is high (23 of 25 patients) but agreement on positive results is 0 (a = 23, d = 0, b = 2, c = 0). The PI, therefore, is high (0.92) and PABAK is 0.84 while k = 0.00 (Table 3). Table S2 presents the raw data of the 2 × 2 tables of two examinations for the other composites of tests for better interpretation. Thus PABAK was used to interpret the results, especially for test clusters. The standards proposed by Landis and Koch (1977) were also used to interpret the magnitude of PABAK (Hoehler, 2000; Kokmeyer et al., 2002; Sim and Wright, 2005).

4.2. Reliability of the individual tests

Using PABAK, therefore, our data indicate fair to substantial reliability for the individual tests (Table 1).
Some authors have suggested that motion palpation tests are reliable (Herzog et al., 1989; Cibulka and Koldehoﬀ, 1999), while some other studies have demonstrated low reliability for individual motion palpation tests and poor to substantial for single pain provocation tests (Potter and Rothstein, 1985; Laslett and Williams, 1994; Strender et al., 1997; Meijne et al., 1999). However, they did not use exactly the same tests. We attempted to select tests with acceptable level validity, sensitivity and speciﬁcity (van der Wurﬀ et al., 2000a,b). Reliability can be inﬂuenced by several factors such as the participants, therapists and clinical tests. In former studies, some researchers have used asymptomatic subjects. In this study the participants were recruited from LBP patients with clinical signs suggestive of SIJ and patients with symptoms suggesting other sources of LBP were excluded. For the pain provocation tests, concordant pain response is one in which there is reproduction of a pain that is similar to or exactly the same as the complaint, and discordant pain is provocation of a pain that is


atypical of the complaint. The tests in this study were classified as positive or negative with reference to a particular side, as has been recommended. In some clinical situations, performing a test on one side may produce pain on the opposite side, and the test may then be improperly considered positive. We considered tests positive if a concordant pain was reproduced on the same side. Laslett (1998) believes that insufficient pressure when applying provocation tests may generate many false negatives and affect reliability. It has been assumed that variability in the applied force and in the duration of force application could affect the results of provocation tests (Levin et al., 2001; Levin and Stenstrom, 2003). O’Haire and Gibbons (2000) attributed the poor reliability of SIJ motion palpation tests to the lack of reliability of SIJ landmark palpation and location. The moderate and substantial reliability for single provocation and motion palpation tests in the present study could be explained by our addressing these factors. Unlike that of the other tests, the reliability of the resisted abduction test has not been reported previously. Broadhurst and Bond (1998) reported 87% sensitivity and 100% specificity for it. Our data indicate substantial reliability for it as a single test (Table S1). It has been supposed that in this test the leg is used as a lever with the fulcrum at the inferior border of the SIJ, thereby stressing the cephalic aspect of the SIJ.

4.3. Reliability for the cluster of motion palpation or provocation tests

Considering PABAK, the results of this study showed moderate to excellent reliability for the cluster of motion palpation or provocation tests (Table 2). Comparing the reliability data provided in Table S1 and Table 1 shows that test clusters achieved better reliability than individually performed tests. Among the different clusters of motion palpation tests, reliability for the cluster of three positive out of four tests was substantial and higher than for the other types. For clusters of provocation tests, reliability of three positive out of three tests was substantial and better than for the other types (Table S1, Table 1). In the multi-test regimens recommended for evaluation of the SIJ, only clusters of motion palpation or provocation tests, regardless of the involved side, have been used. Robinson et al. (2007) found good reliability for clusters of provocation tests and poor reliability for the single tests on the left and right sides.

4.4. Reliability for composites of motion palpation and provocation tests

Our findings indicate substantial to excellent reliability for composites of motion palpation and provocation tests (Table 3). Because pain provocation and motion palpation tests assess SIJ pain and dysfunction, respectively, composites of motion palpation and provocation tests are generally used to assess and


A.M. Arab et al. / Manual Therapy 14 (2009) 213–221

diagnose SIJ disorders in common clinical practice. We attempted to examine whether the combination of motion palpation and pain provocation tests is reliable. Among composites of motion palpation and provocation tests, the reliability of a composite of three or more motion palpation tests together with two or more provocation tests was found to be excellent and better than that of other composites (Table 3). Cibulka and Koldehoff (1999) used only four palpation tests and categorized results as positive or negative, regardless of the side of SIJ dysfunction. Riddle and Freburger (2002) examined the degree of agreement between therapists for the same tests by taking into account the side and type of the presumed dysfunction and found poor reliability for the composite results of four tests. The problem with this study is that internal consistency between test results was not assessed. Thus they did not account for either type or side of dysfunction. Kokmeyer et al. (2002) showed good reliability for a multi-test regimen of five provocation tests. Robinson et al. (2007) assessed the reliability of two clusters of three or five provocation tests with regard to the side of pain and showed good reliability of the clusters. As noted above, those studies examined only clusters of palpation or provocation tests. Based on the results of our study, we advocate composites of three or more positive motion palpation tests and two or more positive pain provocation tests for clinical use. One of the limitations of this study is the use of testers with only 1 year of experience. We examined the reliability of the tests using two therapists with 1 year of experience in order to determine whether single tests or composites of tests are reliable when used clinically by testers with little experience.

5. Conclusion

This study showed fair to substantial reliability for the individual motion palpation or pain provocation tests. Our data demonstrated moderate to substantial intra- and inter-examiner reliability for clusters of motion palpation or pain provocation tests. Given the excellent reliability found in this study for composites of motion palpation together with pain provocation tests, such composites could be used as a reliable method for SIJ assessment in clinical practice. Kappa is affected by the paradoxical effects of prevalence and BI, and it seems better to calculate PABAK in order to interpret reliability appropriately in such studies.

Appendix A. Supplementary data

Supplementary data associated with this article can be found in the online version at doi:10.1016/j.math.2008.02.004.

References

Bernard TN. The role of the sacroiliac joints in low back pain: basic aspects of pathophysiology, and management. In: Vleeming A, Mooney V, Dorman T, Snijders C, Stoeckart R, editors. Movement, stability & low back pain. The essential role of the pelvis. 2nd ed. Edinburgh: Churchill Livingstone; 1997. p. 73–88.
Broadhurst NA, Bond MJ. Pain provocation tests for the assessment of sacroiliac joint dysfunction. Journal of Spinal Disorders 1998;11(4):341–5.
Byrt T, Bishop J, Carlin JB. Bias, prevalence and kappa. Journal of Clinical Epidemiology 1993;46:423–9.
Cibulka MT, Delitto A, Koldehoff RM. Changes in innominate tilt after manipulation of the sacroiliac joint in patients with low back pain: an experimental study. Physical Therapy 1988;68:1359–63.
Cibulka MT, Koldehoff R. Clinical usefulness of a cluster of sacroiliac joint tests in patients with and without low back pain. Journal of Orthopaedic and Sports Physical Therapy 1999;29(2):83–9.
Dreyfuss P, Michaelsen M, Pauza K, McLarty J, Bogduk N. The value of medical history and physical examination in diagnosing sacroiliac joint pain. Spine 1996;21(22):2594–602.
Ehrlich GE. Low back pain. Bulletin of the World Health Organization 2003;81(9):671–2.
Fortin J, Dwyer A, West S, Pier J. Sacroiliac joint: pain referral maps upon applying a new injection/arthrography technique. Part I: asymptomatic volunteers. Spine 1994a;19:1475–82.
Fortin J, Aprill C, Ponthieux B, Pier J. Sacroiliac joint: pain referral maps upon applying a new injection/arthrography technique. Part II: clinical evaluation. Spine 1994b;19:1483–9.
Haas M. Interexaminer reliability for multiple diagnostic test regimens. Journal of Manipulative and Physiological Therapeutics 1991;14(2):95–103.
Herzog W, Read LJ, Conway PJ, Shaw LD, McEwen MC. Reliability of motion palpation procedures to detect sacroiliac joint fixations. Journal of Manipulative and Physiological Therapeutics 1989;12(2):86–92.
Hoehler FK. Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. Journal of Clinical Epidemiology 2000;53:499–503.
Kokmeyer DJ, van der Wurff P, Aufdemkampe G, Fickenscher TC. The reliability of multitest regimens with sacroiliac pain provocation tests. Journal of Manipulative and Physiological Therapeutics 2002;25(1):42–8.
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977;33:159–74.
Laslett M. The value of the physical examination in diagnosis of painful sacroiliac joint pathologies. Spine 1998;23:962–4.
Laslett M, Williams M. The reliability of selected pain provocation tests for sacroiliac joint pathology. Spine 1994;19(11):1243–9.
Laslett M, Young S, Aprill C, McDonald B. Diagnosing painful sacroiliac joints: a validity study of a McKenzie evaluation and sacroiliac provocation tests. The Australian Journal of Physiotherapy 2003;49(2):89–97.
Laslett M, Aprill C, McDonald B, Young S. Diagnosis of sacroiliac joint pain: validity of individual provocation tests and composites of tests. Manual Therapy 2005;10(3):207–18.
Levin U, Nilsson-Wikmar L, Harms-Ringdahl, Stenstrom CH. Variability of forces applied by experienced physiotherapists during provocation of the sacroiliac joint. Clinical Biomechanics 2001;16:300–6.
Levin U, Stenstrom CH. Force and time recording for validating the sacroiliac distraction test. Clinical Biomechanics 2003;18:821–6.
Maigne JY, Aivaliklis A, Pfefer F. Results of sacroiliac joint double block and value of sacroiliac pain provocation tests in 54 patients with low back pain. Spine 1996;21(16):1889–92.
MedCalc statistical software. Broekstraat 52, B-9030 Mariakerke, Belgium.
Meijne W, van Neerbos K, Aufdemkampe G, van der Wurff P. Intraexaminer and interexaminer reliability of the Gillet test. Journal of Manipulative and Physiological Therapeutics 1999;22(1):4–9.
Mooney V. Sacroiliac joint dysfunction. In: Vleeming A, Mooney V, Dorman T, Snijders C, Stoeckart R, editors. Movement, stability & low back pain. The essential role of the pelvis. 2nd ed. Edinburgh: Churchill Livingstone; 1997. p. 37–52.
O’Haire C, Gibbons P. Inter-examiner and intra-examiner agreement for assessing sacroiliac anatomical landmarks using palpation and observation: pilot study. Manual Therapy 2000;5(1):13–20.
Potter NA, Rothstein JM. Intertester reliability for selected clinical tests of the sacroiliac joint. Physical Therapy 1985;65(11):1671–5.
Riddle DL, Freburger JK. Evaluation of the presence of sacroiliac joint region dysfunction using a combination of tests: a multicenter intertester reliability study. Physical Therapy 2002;82(8):772–81.
Robinson HS, Brox JI, Robinson R, Bjelland E, Solem S, Telje T. The reliability of selected motion- and pain provocation tests for the sacroiliac joint. Manual Therapy 2007;12(1):72–9.
Schwarzer AC, Aprill CN, Bogduk N. The sacroiliac joint in chronic low back pain. Spine 1995;20(1):31–7.
Sim J, Wright CC. The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Physical Therapy 2005;85:257–68.
Slipman CW, Whyte 2nd WS, Chow DW, Chou L, Lenrow D, Ellen M. Sacroiliac joint syndrome. Pain Physician 2001;4(2):143–52.
Strender LE, Sjoblom A, Sundell K, Ludwig R, Taube A. Interexaminer reliability in physical examination of patients with low back pain. Spine 1997;22(7):814–20.
Stuber KJ. Specificity, sensitivity and predictive values of clinical tests of the sacroiliac joint: a systematic review of the literature. Journal of the Canadian Chiropractic Association 2007;51(1):30–41.
Vincent-Smith B, Gibbons P. Inter-examiner and intra-examiner reliability of the standing flexion test. Manual Therapy 1999;4(2):87–93.
van der Wurff P, Hagmeijer RH, Meyne W. Clinical tests of the sacroiliac joint. A systematic methodological review. Part 1: reliability. Manual Therapy 2000a;5(1):30–6.
van der Wurff P, Meyne W, Hagmeijer RH. Clinical tests of the sacroiliac joint. A systematic methodological review. Part 2: validity. Manual Therapy 2000b;5(2):89–96.
Young S, Aprill C, Laslett M. Correlation of clinical examination characteristics with three sources of chronic low back pain. The Spine Journal 2003;3:460–5.


Manual Therapy 14 (2009) 222–230 www.elsevier.com/math

Professional Issue

Classification of low back-related leg pain – A proposed patho-mechanism-based approach

Axel Schäfer a,b,*, Toby Hall b,c,1, Kathy Briffa b,2

a Rückenzentrum am Michel, Ludwig Erhart Straße 18, 20459 Hamburg, Germany
b School of Physiotherapy, Curtin University of Technology, GPO Box U1987, Perth, WA 6845, Australia
c Manual Concepts, P.O. Box 1236, Booragoon, WA 6954, Australia

Received 13 July 2006; received in revised form 30 September 2007; accepted 4 October 2007

Abstract

Leg pain is a frequent accompaniment to low back pain, arising from disorders of neural or musculoskeletal structures of the lumbar spine. Differentiating between different sources of radiating leg pain is important to make an appropriate diagnosis and identify the underlying pathology. It is proposed that low back-related leg pain be divided into four subgroups according to the predominating pathomechanisms involved. The first subgroup features central sensitization with mainly positive symptoms such as hyperalgesia, the second subgroup involves denervation with significant axonal damage showing predominantly negative sensory symptoms and possibly motor loss, and the third subgroup involves peripheral nerve sensitization with enhanced nerve trunk mechanosensitization. The fourth subgroup features somatic referred pain from musculoskeletal structures, such as the intervertebral disc or facet joints. Accordingly, four groups of patients with leg pain associated with structures in the lower back can be identified:

1. Central sensitization.
2. Denervation.
3. Peripheral nerve sensitization.
4. Musculoskeletal.

Each group presents with a distinct pattern of symptoms and signs. Although there may be considerable overlap between the classifications, the authors propose the existence of an overriding mechanism. The importance of distinguishing low back-related leg pain into these four groups is to facilitate diagnosis and provide a more effective, appropriate treatment.

© 2007 Published by Elsevier Ltd.

Keywords: Neuropathic pain; Low back pain; Leg pain; Sciatica; Classification; Diagnosis

1. Introduction

* Corresponding author. Rückenzentrum am Michel, Ludwig Erhart Straße 18, 20459 Hamburg, Germany. Tel.: +49 40 43280274. E-mail addresses: [email protected] (A. Schäfer), info@manualconcepts.com (T. Hall), k.briff[email protected] (K. Briffa). 1 Tel./fax: +61 8 93164080. 2 Tel.: +61 8 9266 3666; fax: +61 8 9266 3699. 1356-689X/$ - see front matter © 2007 Published by Elsevier Ltd. doi:10.1016/j.math.2007.10.003

Low back pain (LBP) is one of the major health problems in western industrial societies with a lifetime prevalence of 84% (Taylor et al., 2000). The high economic cost this imposes on society is comparable to other disorders such as heart disease, depression or diabetes (Maetzel and Li, 2002). Accompanying leg


pain is present in approximately 25–57% of all LBP cases (Heliövaara et al., 1987; Cavanaugh and Weinstein, 1994; Selim et al., 1998), but these cases account for a disproportionately large share of the costs of medical care and disability compensation caused by LBP (Ren et al., 1999). Furthermore, accompanying leg pain is an important predictor of chronicity of LBP and an indicator of the severity of the disorder (Selim et al., 1998). The primary pathology causing referred leg pain is often indistinct, as many structures are capable of evoking a similar pattern of pain (Adams et al., 2002; Bogduk and McGuirk, 2002). Failure to distinguish different forms of referred pain in the assessment of LBP is reported to be a common error leading to inappropriate investigations and treatment (Bogduk and McGuirk, 2002).

Low back-related leg pain may be due to damage or dysfunction of neural or musculoskeletal structures. Events causing damage to neural structures may be mechanical, such as intervertebral disc protrusion, or biochemical, caused by cytokines or other inflammatory mediators. The resulting perturbation of neural structures may lead to a variety of clinical manifestations, ranging from negative symptoms such as motor disturbances and loss of sensation to positive symptoms such as paraesthesias or hyperalgesia associated with central sensitization (Table 1). Furthermore, it is well known that nerve injury alone is not always painful (Boden et al., 1990; Beattie et al., 2001) and that patients exhibiting severe symptoms may not necessarily have evidence of nerve root compression on imaging studies (Boos et al., 1995; Ohnmeiss et al., 1997). Consequently, pain is not a necessary event following neural compromise, and diagnosis based on pathology may not be the most relevant. A focus on pathomechanisms may be more appropriate.

Although, as yet, it is not possible to diagnose pain mechanisms by clinical evaluation, a thorough examination protocol consisting of neurological examination, screening for central sensitization and assessment of nerve tissue mechanosensitization may help to elucidate some of the mechanisms currently considered responsible for the signs and symptoms seen in low back-related leg pain. Depending on the assumed predominance of pathomechanisms, differentiation of low back-related leg pain into four distinct subgroups is proposed. These categories are: central sensitization, comprising major features of central nervous system sensitization; denervation, arising from significant axonal compromise without evidence of central nervous system changes; peripheral nerve sensitization, arising from nerve trunk inflammation without clinical evidence of significant denervation; and musculoskeletal pain, referred from non-neural structures such as the disc or facet joints. The purpose of this article is to present the rationale for the proposed classification system and the corresponding signs and symptoms for each of the groups. Background pathoanatomy and pathomechanisms will be reviewed and an algorithm for clinical classification presented.

2. Pathoanatomy and pathomechanisms

2.1. Peripheral events

2.1.1. Inflammation

The lumbar intervertebral disc (IVD) plays a central role in the development of low back-related leg pain and radiculopathy (Yoshizawa et al., 1995). The pathomechanisms involved are internal disc disruption, fissure formation and nucleus pulposus (NP) prolapse or sequestration, leading to inflammation of the nerve root and subsequent pain of nerve origin, even without mechanical compression. Inflammation caused by biochemical substances from the NP plays a significant role in the development of low back-related leg pain (Olmarker et al., 1993; Olmarker, 1997; Brisby, 2003). A suggested cause of this kind of inflammation is endplate fracture of the vertebrae, whereby NP material may become exposed to osteochondral blood and susceptible to progressive degradation of the NP matrix (Bogduk, 2005). Degenerative changes of the IVD, associated with internal disc disruption, commonly lead to fissures in the annulus, which allow inflammatory mediators to disperse through the disc and contact the innervated outer third of the annulus (Videman and Nurminen, 2004; Peng et al., 2005). These chemicals may cause excitation of nociceptive afferents and thereby discogenic pain, which may then refer into the lower limb (O’Neill et al., 2002). In case of a full annular rupture, NP material and inflammatory mediators

Table 1
Positive and negative symptoms and signs in neuropathic pain.

Positive symptoms
  Sensory: pain, paroxysm, dysaesthesia, paraesthesia
  Motor: spasm
Positive signs
  Sensory: hyperalgesia (thermal and mechanical), allodynia (light touch, pin-prick), wind-up
  Motor: hyperreflexia, clonus, Babinski
Negative symptoms
  Sensory: hypoaesthesia, hypopathia
  Motor: palsy, weakness
Negative signs
  Sensory: hypoaesthesia
  Motor: muscle weakness, hyporeflexia

may leak into the spinal canal, contact nerve tissues such as transiting or exiting nerve roots and lead to inflammation of these structures (Videman and Nurminen, 2004). A number of pro-inflammatory cytokines are associated with inflammation, such as interleukin 6 (IL-6), which is up-regulated after macrophage infiltration (Takada et al., 2004; Mulleman et al., 2006). Increased levels of IL-6 could be one of the major causes of neurological signs and symptoms, especially neurogenic pain (Takada et al., 2004). Other inflammatory mediators may be expressed on the surface of NP cells (Kayama et al., 1998), such as the pro-inflammatory cytokine tumour necrosis factor α and nitric oxide (NO), which may enhance neuropathic pain states (Olmarker and Larsson, 1998; Brisby et al., 2000; Olmarker and Rydevik, 2001). Additionally, inflammatory changes may cause an increase in sodium channel density/conductance in the nerve root and dorsal root ganglion, which in turn may contribute to increased ectopic discharges and nerve trunk mechanosensitivity (Devor and Rappaport, 1990; Chen et al., 2004; Devor, 2006). This concept of chemically induced nerve root pain is supported by animal experimental evidence indicating that locally induced inflammatory processes in the vicinity of nerves may lead to marked pain behaviour with increased allodynic and hyperalgesic responses even in the absence of axonal damage (Eliav et al., 1999, 2001). Other studies have demonstrated that locally induced neuritis of a peroneal or sciatic nerve caused pressure and stretch mechanosensitivity of the nerve trunk (Bove et al., 2003; Dilley et al., 2005). This is consistent with the findings of Olmarker and Myers (1998), who found lowered mechanical and heat pain thresholds, with only minor evidence of axonal damage, after application of NP material to nerve roots in rats. It is possible that these processes could account for some types of movement-dependent referred pain (Bove et al., 2003). For example, Greening et al. (2005) demonstrated altered median nerve movement and elevated nerve trunk mechanosensitivity to pressure and stretch in whiplash patients and patients with non-specific arm pain. These findings suggest that injury-related inflammation may cause widespread changes to nerve fibres, leading to increased nerve trunk mechanosensitivity and dysfunction at the peripheral terminals (Greening, 2004).

2.2. Compression

Mechanical nerve root compression can be caused by prolapsed IVD tissue, osteophytes, facet joint hypertrophy or ligamentum flavum hypertrophy (Taylor and Twomey, 1986; Kobayashi et al., 2005). The putative effects of nerve root compression include impaired intraradicular blood flow, increased endoneural fluid pressure and nerve fibre deformation (Rydevik et al., 1984,

1991; Olmarker et al., 1989). This combination of increased endoneural fluid pressure and decreased blood flow may result in neuronal ischaemia, leading to breakdown of axonal myelin sheaths and alteration of the blood–nerve barrier (Cornefjord et al., 1997; Kobayashi et al., 2004; Igarashi et al., 2005). Such structural nerve damage may be the cause of sensory and motor dysfunction and radiating pain. On the other hand, it is well known that compression of nerves does not always cause pain (McNab, 1972; Wiesel et al., 1984; Kjaer et al., 2005), although the reason for this is unclear. One factor may be the rate of nerve compression. Rapid-onset neural compromise is likely to be associated with inflammatory change and development of neural irritation according to the process described above (Kobayashi et al., 2005). Another common situation is chronic, gradual-onset nerve compression (Olmarker et al., 1990), but the effect of chronic compression has been less thoroughly studied than that of acute compression. Although the extent of nerve injury from chronic or acute compression cannot be easily compared in animal experiments, it seems that acute nerve injury causes more severe changes (Olmarker et al., 1990; Yoshizawa et al., 1995; Cornefjord et al., 1997; Igarashi et al., 2005). A typical example of chronic nerve root compression of gradual onset is spinal stenosis, where inflammation is usually not well developed. In this example, pain usually occurs after sustained extension loading such as in standing or walking (Takahashi et al., 1995a, b), which causes reduced foraminal and spinal canal volume, vascular compromise and nerve root anoxia (Blau and Logue, 1978). In pure spinal stenosis, there is usually an absence of nerve trunk mechanosensitivity (Arbit and Pannullo, 2001).

2.3. Central events

Continued noxious input from the peripheral nervous system as a result of inflammation or compression of nerve structures may lead to an augmented response of signalling neurons in the central nervous system, a process commonly referred to as central sensitization (Campbell and Meyer, 2006). Most of the changes leading to central sensitization take place in the dorsal horn of the spinal cord, where intense and sustained nociceptor activation, especially C fibre activation, leads to phosphorylation of N-methyl-D-aspartate receptors in nociceptive-specific dorsal horn neurons. This may cause longer-lasting changes in their excitability, so that previously subthreshold C fibre inputs can now drive the postsynaptic neuron (Costigan and Woolf, 2000). One subtype of dorsal horn neuron is the wide dynamic range (WDR) neuron, on which tactile Aβ and nociceptive C fibres converge. Therefore, involvement of WDR neurons may lead to enhanced synaptic efficacy of tactile Aβ fibres, and consequently innocuous Aβ signals are coded as pain (Simone


et al., 1991; Woolf and Doubell, 1994). Co-existent with the above-mentioned changes, nerve injury may alter the properties of Aβ fibres, such that they begin to act like nociceptive fibres expressing neuropeptides, which enables these fibres to evoke central sensitization (Woolf and Salter, 2000). This mechanism is called cell phenotypic shift. Similarly, Aβ fibres may sprout onto lamina II in the dorsal horn, an area which normally receives only nociceptor information (Mannion et al., 1996). Sensitization of WDR neurons, phenotypic shift and Aβ fibre sprouting lead to enhanced pain in response to normally innocuous signals (allodynia), and this again may drive central sensitization. In addition to enhanced pain input, spontaneous activity in central nociceptive neurons may be caused by loss of sensory input due to damage of primary afferent axons (deafferentation) in the dorsal nerve root (Baumgärtner et al., 2002). Furthermore, diminished inhibitory mechanisms may contribute to enhanced pain processing, including cell death of inhibitory interneurons in the dorsal horn (Woolf and Mannion, 1999) as well as altered descending modulatory mechanisms from the brain stem (Ren and Dubner, 1996, 2002; Gardell et al., 2003). Finally, secondary changes in cortical and subcortical brain regions, triggered by cognitions, emotions and attention, may further enhance central sensitization and the development of spontaneous activity and pain (Tracey et al., 2002; Zusman, 2002; Apkarian et al., 2005). In summary, the main mechanisms responsible for enhanced pain processing associated with central sensitization are sensitization of nociceptive-specific dorsal horn neurons, especially WDR neurons, disinhibition, deafferentation, phenotypic switch and sprouting of Aβ fibres, as well as changes in cortical and subcortical brain regions.

2.4. Referred leg pain from musculoskeletal structures

A large proportion of low back-related leg pain is accounted for by disorders of musculoskeletal structures (Bogduk and McGuirk, 2002). It has been shown that intervertebral discs (Ohnmeiss et al., 1997; O’Neill et al., 2002), facet joints (Mooney and Robertson, 1976; Schwarzer et al., 1994), sacroiliac joints (Fortin et al., 1994) and muscles (Travell and Simons, 1983) may refer pain into the lower limb. The convergence theory explains this phenomenon. According to this theory, afferent impulses from different regions converge upon the same viscerosomatotopic neurons in the central nervous system, causing a mental projection of pain to the region corresponding with the spinal nerve through which the afferent nerve fibres enter the spinal cord (Jinkins, 2004). For example, a projection neuron for the fifth lumbar nerve may receive input from the hip, thigh,


leg, and foot. Noxious input of sufficient strength from an injured facet joint or intervertebral disc can activate this projection neuron so that, via second-order neurons, the contralateral somatosensory cortex receives information that the nociceptive input arises from the lumbar structure and the extremity (Gillette et al., 1993).

2.5. Proposed classification and their corresponding signs and symptoms

Based on the mechanisms described above, the following classification of low back-related leg pain into four categories is proposed (Table 2).

2.5.1. Central sensitization

Some patients report primarily positive symptoms such as paraesthesias, dysaesthesias, hyperalgesia, dynamic mechanical allodynia and stimulus-independent pain driven by central processes (Woolf and Mannion, 1999; Baumgärtner et al., 2002). Clinical features of central sensitization (Table 2) are revealed by pain descriptors such as shooting, lancinating or burning. The patient may report paroxysms, or complain of mechanical or thermal allodynia. The neurological examination may reveal light touch allodynia or altered pin prick thresholds (Bennett, 2001). A number of studies have investigated the influence of central sensitization on sensory changes. There is good evidence demonstrating mechanical pressure and thermal hyperalgesia attributed to central sensitization in acute and chronic whiplash patients (Moog et al., 2002; Sterling et al., 2003, 2005; Scott et al., 2005). Mechanical hyperalgesia is also a feature of complex regional pain syndrome, which is now widely acknowledged to be enhanced by central sensitization linked to neuroimmune activation (Rommel et al., 2001; Alexander et al., 2005). Although there are no published data so far, it seems reasonable to extrapolate that central sensitization following nerve root injury in the lumbar spine may also be associated with mechanical and/or thermal hyperalgesia.

2.5.2. Denervation

Denervation can be caused by structural nerve damage with primarily negative symptoms such as sensory or motor deficits (Baumgärtner et al., 2002). For example, radiculopathies, defined as "objective loss of sensory and/or motor function as a result of conduction block in axons of a spinal nerve or its roots" (Merskey and Bogduk, 1994), are seen as a common cause of neuropathic pain (Dworkin et al., 2003; Baron and Binder, 2004; Jensen et al., 2004). Clinical examination of neurological function (muscle power, reflexes and skin sensitivity to light touch and


Table 2
Diagnostic group and related features.

Central sensitization
  Classification: neuropathic
  Symptomatic structure: neural
  Mechanisms: sensitization of WDR neurons; disinhibition; forebrain-mediated CS
  Effect: enhanced processing of peripheral input
  Symptoms and signs: distal pain; hyperaesthesia; hyperalgesia; paraesthesia; allodynia; LANSS score ≥12; may have features of the diagnostic groups denervation and peripheral sensitization

Denervation
  Classification: neuropathic
  Symptomatic structure: neural
  Mechanisms: Wallerian degeneration; demyelination
  Effect: conduction block; deafferentation
  Symptoms: segmentally distributed distal pain; hypoesthesia; weakness; palsy
  Signs: diminished light touch and pinprick; diminished or absent reflexes; muscle weakness; minimal features of peripheral sensitization; LANSS score <12

Peripheral nerve sensitization
  Classification: neuropathic or nociceptive
  Symptomatic structure: neural
  Mechanisms: inflammation; increased Na channel and mechanosensitive channel expression and conductance
  Effect: enhanced nerve trunk mechanosensitivity
  Symptoms: pain anywhere in the leg; pain associated with movements that elongate the nerve trunk
  Signs: nerve is sensitive to elongation and pressure; reduced active movements corresponding to nerve mechanosensitivity; LANSS score <12

Musculoskeletal
  Classification: nociceptive
  Symptomatic structure: musculoskeletal
  Mechanisms: convergence
  Effect: mental projection of pain to the limb
  Symptoms: referred leg pain; pain tends to be worse proximally
  Signs: normal neurological function; none of the signs listed for the other groups; LANSS score <12

LANSS: Leeds Assessment of Neuropathic Symptoms and Signs (Bennett, 2001).

pinprick) should reveal major deﬁcits in terms of negative sensory and motor symptoms (Table 2). 2.5.3. Peripheral nerve sensitization Peripheral nerve sensitization is caused by nerve root or nerve trunk inﬂammation leading to adverse response to mechanical provocation of nerve tissue (Elvey, 1997). This occurs even in the absence of gross neurological deﬁcits indicating subtle changes in sensory nerve function (Greening et al., 2005). Clinically important ﬁndings of peripheral nerve sensitization are referred leg pain associated with neural tissue movement. If the disorder is severe enough, the patient may present with an antalgic posture to protect mechanosensitive nerve tissue. Neural tissue provocation tests such as nerve palpation and the straight leg raise test (SLR) will be provocative, but the neurological examination should not show any signs of signiﬁcant neurological dysfunction (Hall and Elvey, 2004). 2.5.4. Musculoskeletal In disorders involving musculoskeletal structures, pain may be referred into the lower limb, and may extend distal to the knee or even the foot (O’Neill et al., 2002), with a dull, deep ache or pressure like quality (Feinstein et al., 1954). The neurological examination, as well as neural tissue provocation tests, should be normal. Physical procedures such as sacroiliac joint pain provocation tests or tests for centralization/peripheralization of limb pain speciﬁcally stressing musculoskeletal structures in

the lower back should reveal a pain-generating structure other than the nerve root (Donelson et al., 1997; Young et al., 2003; Laslett et al., 2005). There should be no evidence of small nerve ﬁbre dysfunction, although local pressure pain thresholds may be elevated (Giesbrecht and Battie, 2005). 2.5.5. Mixed pathologies A distinction has been made between four separate groups of patients with low back-related leg pain, but, in reality, there may be considerable overlap between these four groups. Peripheral sensitization of nerve tissue can trigger central sensitization, and inﬂammatory products released during denervation may also alter the properties of intact nerve ﬁbres. The possibility of mixed pathologies is not refuted, as many radicular disorders undoubtedly are a mixture of nociceptive and neuropathic pain (Baron and Binder, 2004). However, the existence of a predominant mechanism is proposed, that is primarily responsible for the patient’s complaints. This dominant mechanism may be identiﬁed with help of a thorough physical examination protocol.

3. Examination protocol
The examination protocol outlined in Fig. 1 has been developed to identify the presence of positive signs and symptoms, neurological deficit and the presence of signs indicative of nerve trunk sensitization. Reflection on the information derived should allow the clinician to classify patients into one of the four groups previously described according to the current consensus regarding the possible patho-mechanisms for low back-related leg pain.

[Fig. 1. Classification algorithm (flowchart) for low back-related leg pain. LANSS pain scale > 12? Yes: central sensitization. No: neurological deficit? Yes: denervation. No: nerve trunk mechanosensitivity? Yes: peripheral nerve sensitization. No: musculoskeletal. LANSS: Leeds Assessment of Neuropathic Symptoms and Signs (Bennett, 2001).]

The examination protocol includes a comprehensive assessment of the patient’s subjective complaint and physical examination. Although this process as a whole does not have established validity and reliability, there are published validity and reliability data for a number of individual components of the examination (Strender et al., 1997; Vroomen et al., 1999; Hunt et al., 2001).

The first part of the examination protocol is the subjective assessment, which incorporates the Leeds Assessment of Neuropathic Symptoms and Signs (LANSS) scale (Bennett, 2001). The LANSS scale is an interview-based questionnaire designed to screen for neuropathic symptoms and signs indicative of central processes enhancing the sensitivity of the sensory system. It is a valid and reliable tool to assess neuropathic pain in clinical and research settings (Yucel et al., 2004; Bennett et al., 2005, 2006).

The second part is the physical examination including neurological examination, assessment of active movements, and neural tissue provocation tests. A neurological examination is carried out including assessment of muscular strength, reflexes and altered sensitivity for pinprick (Aδ) and light touch (Aβ fibres). This is followed by an investigation of signs indicative of peripheral sensitization of the nerve trunk with enhanced mechanosensitivity (Elvey and Hall, 1997; Hall and Elvey, 1999, 2004). These signs include restricted active range of movement and hyperalgesia on neural tissue provocation tests (e.g. slump test or SLR, and nerve trunk palpation) correlating with suspected nerve trunk mechanosensitivity. Positive neural tissue provocation tests, in the absence of positive symptoms and neurological deficit, are indicative of peripheral sensitization of the nerve trunk with enhanced mechanosensitivity. The procedure and rationale for this examination protocol are described in more detail elsewhere (Hall and Elvey, 2004).
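The decision sequence of Fig. 1 can be written down as a short sketch. This is only an illustration of the order of the three questions, not a validated clinical tool. The names `ExamFindings`, `classify_leg_pain` and `LANSS_CUTOFF` are hypothetical, and the cut-off of 12 or more is an assumption taken from Table 2 (Fig. 1 prints the same node as "> 12").

```python
from dataclasses import dataclass

# Assumption: cut-off follows Table 2 (LANSS score of 12 or more).
# Fig. 1 prints this decision node as "LANSS pain scale > 12?".
LANSS_CUTOFF = 12


@dataclass
class ExamFindings:
    """Hypothetical container for the findings the protocol collects."""
    lanss_score: int                    # total score on the LANSS pain scale
    neurological_deficit: bool          # loss of power, reflexes or sensation
    nerve_trunk_mechanosensitive: bool  # positive SLR/slump or nerve palpation


def classify_leg_pain(findings: ExamFindings) -> str:
    """Walk the three decision nodes of Fig. 1 in their published order."""
    if findings.lanss_score >= LANSS_CUTOFF:
        return "central sensitization"
    if findings.neurological_deficit:
        return "denervation"
    if findings.nerve_trunk_mechanosensitive:
        return "peripheral nerve sensitization"
    return "musculoskeletal"


if __name__ == "__main__":
    # Invented example patient, not data from the article.
    patient = ExamFindings(lanss_score=8,
                           neurological_deficit=False,
                           nerve_trunk_mechanosensitive=True)
    print(classify_leg_pain(patient))
```

Because the questions are asked in a fixed order, a patient with a high LANSS score is assigned to central sensitization even when a neurological deficit or nerve trunk mechanosensitivity is also present, which matches the idea of identifying one predominant mechanism.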

4. Discussion
A number of classification systems for LBP from a variety of different perspectives have been proposed, some incorporating related leg pain. Dimensions for classifying LBP have included patho-anatomical, signs and symptoms, and psycho-social dimensions (Waddell, 2002). Examples of classification based on signs and symptoms include the one developed by McKenzie. The McKenzie model (McKenzie, 1981) evaluates the behaviour of pain to guide treatment decisions and has been extensively investigated with mostly encouraging results (Werneke and Hart, 2001; Kilpikoski et al., 2002; Wetzel and Donelson, 2003). Another classification system, described by Petersen et al. (2003), is pathoanatomically oriented, relying on the behaviour and location of symptoms. In their system, low back-related leg pain is classified into nerve related (adherent nerve root, nerve root entrapment, nerve root compression, spinal stenosis, and adverse neural tension) or musculoskeletal. This classification is based on aetiology rather than on underlying mechanisms.

In the present article, a clinical classification of low back-related leg pain based on mechanisms is proposed. Undoubtedly, there is significant overlap and causal interrelation between the different mechanisms (Baron and Binder, 2004). It is feasible, however, that there is a predominant mechanism primarily responsible for the patient’s complaints that may be identified with the help of the physical examination protocol. The proposed classification system is rudimentary in the face of the complexity of the many pathomechanisms occurring with nerve injury as demonstrated in the laboratory; however, it may serve as a useful guideline for treatment decisions in clinical practice.

5. Conclusion
Classification of pain using a mechanism-based approach can be applied to all parts of the body, and such classification provides a rational guideline for clinical decision making (Woolf et al., 1998). In the present article, we have applied this principle to low back-related leg pain and outlined an algorithm for clinical classification. Several studies are currently in progress to evaluate the proposed classification system.

Acknowledgements
We are grateful to Dr. Gerd Müller and Dr. Joachim Mallwitz at the Rückenzentrum am Michel, Hamburg and Dr. Roman Rolke at the University of Mainz for manuscript review and feedback.

References
Adams MA, Bogduk N, Burton K, Patricia D. Biology of spinal tissues. The biomechanics of back pain. Edinburgh: Churchill Livingstone; 2002. p. 49–71.
Alexander GM, van Rijn MA, van Hilten JJ, Perreault MJ, Schwartzman RJ. Changes in cerebrospinal fluid levels of proinflammatory cytokines in CRPS. Pain 2005;116(3):213–9.
Apkarian AV, Bushnell MC, Treede RD, Zubieta JK. Human brain mechanisms of pain perception and regulation in health and disease. European Journal of Pain 2005;9(4):463–84.
Arbit E, Pannullo S. Lumbar stenosis: a clinical review. Clinical Orthopaedics and Related Research 2001;(384):137–43.
Baron R, Binder A. Wie neuropathisch ist die Lumboischialgie. Das Mixed-pain-Konzept. Orthopade 2004;33(5):568–75.
Baumgärtner U, Magerl W, Klein T, Hopf HC, Treede RD. Neurogenic hyperalgesia versus painful hypoalgesia: two distinct mechanisms of neuropathic pain. Pain 2002;96(1–2):141–51.
Beattie PF, Meyers SP, Stratford P, Millard RW, Hollenberg GM. Associations between patient report of symptoms and anatomic impairment visible on lumbar magnetic resonance imaging. Spine 2001;25(7):819–28.
Bennett M. The LANSS Pain Scale: the Leeds assessment of neuropathic symptoms and signs. Pain 2001;92(1–2):147–57.
Bennett MI, Smith BH, Torrance N, Potter J. The S-LANSS score for identifying pain of predominantly neuropathic origin: validation for use in clinical and postal research. Journal of Pain 2005;6(3):149–58.
Bennett MI, Smith BH, Torrance N, Lee AJ. Can pain can be more or less neuropathic? Comparison of symptom assessment tools with ratings of certainty by clinicians. Pain 2006;122(3):289–94.
Blau JN, Logue V. The natural history of intermittent claudication of the cauda equina. A long term follow-up study. Brain 1978;101(2):211–22.
Boden SD, Davis DO, Dina TS, Patronas NJ, Wiesel SW. Abnormal magnetic-resonance scans of the lumbar spine in asymptomatic subjects. A prospective investigation. Journal of Bone and Joint Surgery - American Volume 1990;72(3):403–8.

Bogduk N. Low back pain, clinical anatomy of the lumbar spine and sacrum. 4th ed. Edinburgh: Elsevier, Churchill Livingstone; 2005. p. 183–216.
Bogduk N, McGuirk B. Causes and sources of chronic low back pain, Medical management of acute and chronic low back pain. An evidence based approach. Amsterdam: Elsevier; 2002. p. 115–25.
Boos N, Rieder R, Schade V, Spratt KF, Semmer N, Aebi M. Volvo award in clinical sciences. The diagnostic accuracy of magnetic resonance imaging, work perception, and psychosocial factors in identifying symptomatic disc herniations. Spine 1995;20(24):2613–25.
Bove GM, Ransil BJ, Lin HC, Leem JG. Inflammation induces ectopic mechanical sensitivity in axons of nociceptors innervating deep tissues. Journal of Neurophysiology 2003;90(3):1949–55.
Brisby H. Nerve root injuries in patients with chronic low back pain. Orthopedic Clinics of North America 2003;34(2):221–30.
Brisby H, Byrod G, Olmarker K, Miller VM, Aoki Y, Rydevik B. Nitric oxide as a mediator of nucleus pulposus-induced effects on spinal nerve roots. Journal of Orthopaedic Research 2000;18(5):815–20.
Campbell JN, Meyer RA. Mechanisms of neuropathic pain. Neuron 2006;52(1):77–92.
Cavanaugh JM, Weinstein JN. Low back pain: epidemiology, anatomy and neurophysiology. In: Wall PD, Melzack R, editors. The textbook of pain. 3rd ed. Edinburgh; New York: Churchill Livingstone; 1994. p. 441–55.
Chen C, Cavanaugh JM, Song Z, Takebayashi T, Kallakuri S, Wooley PH. Effects of nucleus pulposus on nerve root neural activity, mechanosensitivity, axonal morphology, and sodium channel expression. Spine 2004;29(1):17–25.
Cornefjord M, Sato K, Olmarker K, Rydevik B, Nordborg C. A model for chronic nerve root compression studies: presentation of a porcine model for controlled, slow-onset compression with analyses of anatomic aspects, compression onset rate, and morphologic and neurophysiologic effects. Spine 1997;22(9):946–57.
Costigan M, Woolf CJ. Pain: molecular mechanisms. Journal of Pain 2000;1(3):35–44.
Devor M. Sodium channels and mechanisms of neuropathic pain. Journal of Pain 2006;7(Suppl. 1):S3–12.
Devor M, Rappaport HZ. Pain and the pathophysiology of damaged nerve. In: Fields HL, editor. Pain syndromes in neurology. Oxford: Butterworth Heinemann; 1990. p. 47–83.
Dilley A, Lynn B, Pang SJ. Pressure and stretch mechanosensitivity of peripheral nerve fibres following local inflammation of the nerve trunk. Pain 2005;117(3):462–72.
Donelson R, Aprill C, Medcalf R, Grant W. A prospective study of centralization of lumbar and referred pain. A predictor of symptomatic discs and anular competence. Spine 1997;22(10):1115–22.
Dworkin RH, Backonja M, Rowbotham MC, Allen RR, Argoff CR, Bennett GJ, et al. Advances in neuropathic pain: diagnosis, mechanisms, and treatment recommendations. Archives of Neurology 2003;60(11):1524–34.
Eliav E, Herzberg U, Ruda MA, Bennett GJ. Neuropathic pain from an experimental neuritis of the rat sciatic nerve. Pain 1999;83(2):169–82.
Eliav E, Benoliel R, Tal M. Inflammation with no axonal damage of the rat saphenous nerve trunk induces ectopic discharge and mechanosensitivity in myelinated axons. Neuroscience Letters 2001;(311):49–52.
Elvey RL. Physical evaluation of the peripheral nervous system in disorders of pain and dysfunction. Journal of Hand Therapy 1997;10(2):122–9.
Elvey RL, Hall TM. Neural tissue evaluation and treatment. In: Donatelli R, editor. Physical therapy of the shoulder. 3rd ed. New York, Philadelphia: Churchill Livingstone; 1997. p. 131–52.
Feinstein B, Langton JN, Jameson RM, Schiller F. Experiments on pain referred from deep somatic tissues. Journal of Bone and Joint Surgery - American Volume 1954;36-A(5):981–97.

Fortin JD, Aprill CN, Ponthieux B, Pier J. Sacroiliac joint: pain referral maps upon applying a new injection/arthrography technique. Part II: Clinical evaluation. Spine 1994;19(13):1483–9.
Gardell LR, Vanderah TW, Gardell SE, Wang R, Ossipov MH, Lai J, et al. Enhanced evoked excitatory transmitter release in experimental neuropathy requires descending facilitation. Journal of Neuroscience 2003;23(23):8370–9.
Giesbrecht RJ, Battie MC. A comparison of pressure pain detection thresholds in people with chronic low back pain and volunteers without pain. Physical Therapy 2005;85(10):1085–92.
Gillette R, Kramis R, Roberts W. Characterization of spinal somatosensory neurons having receptive fields in lumbar tissues of cats. Pain 1993;54:85–98.
Greening J. How inflammation and minor nerve injury contribute to pain in nerve root and peripheral neuropathies. In: Boyling JD, Jull G, editors. Grieve’s modern manual therapy: the vertebral column. 2nd ed. Edinburgh, New York: Churchill Livingstone; 2004. p. 205–14.
Greening J, Dilley A, Lynn B. In vivo study of nerve movement and mechanosensitivity of the median nerve in whiplash and nonspecific arm pain patients. Pain 2005;115(3):248–53.
Hall TM, Elvey RL. Nerve trunk pain: physical diagnosis and treatment. Manual Therapy 1999;4(2):63–73.
Hall TM, Elvey RL. Management of mechanosensitivity of the nervous system in spinal pain syndromes. In: Boyling JD, Jull G, editors. Grieve’s modern manual therapy. 3rd ed. Edinburgh: Churchill Livingstone; 2004. p. 413–33.
Heliövaara M, Impivaara O, Sievers K. Lumbar disc syndrome in Finland. Journal of Epidemiology and Community Health 1987;(41):251–8.
Hunt DG, Zuberbier OA, Kozlowski AJ, Robinson J, Berkowitz J, Schultz IZ, et al. Reliability of the lumbar flexion, lumbar extension, and passive straight leg raise test in normal populations embedded within a complete physical examination. Spine 2001;26(24):2714–8.
Igarashi T, Yabuki S, Kikuchi S, Myers RR. Effect of acute nerve root compression on endoneurial fluid pressure and blood flow in rat dorsal root ganglia. Journal of Orthopaedic Research 2005;23(2):420–4.
Jensen MP, Nielson WR, Turner JA, Romano JM, Hill ML. Changes in readiness to self-manage pain are associated with improvement in multidisciplinary pain treatment and pain coping. Pain 2004;111(1–2):84–95.
Jinkins JR. The anatomic and physiologic basis of local, referred and radiating lumbosacral pain syndromes related to disease of the spine. Journal of Neuroradiology 2004;31(3):163–80.
Kayama S, Olmarker K, Larsson K, Sjogren-Jansson E, Lindahl A, Rydevik B. Cultured, autologous nucleus pulposus cells induce functional changes in spinal nerve roots. Spine 1998;23(20):2155–8.
Kilpikoski S, Airaksinen O, Kankaanpaa M, Leminen P, Videman T, Alen M. Interexaminer reliability of low back pain assessment using the McKenzie method. Spine 2002;27(8):E207–14.
Kjaer P, Leboeuf-Yde C, Korsholm L, Sorensen JS, Bendix T. Magnetic resonance imaging and low back pain in adults: a diagnostic imaging study of 40-year-old men and women. Spine 2005;30(10):1173–80.
Kobayashi S, Yoshizawa H, Yamada S. Pathology of lumbar nerve root compression. Part 1: Intraradicular inflammatory changes induced by mechanical compression. Journal of Orthopaedic Research 2004;22(1):170–9.
Kobayashi S, Baba H, Uchida K, Kokubo Y, Kubota C, Yamada S, et al. Effect of mechanical compression on the lumbar nerve root: localization and changes of intraradicular inflammatory cytokines, nitric oxide, and cyclooxygenase. Spine 2005;30(15):1699–705.
Laslett M, Oberg B, Aprill CN, McDonald B. Centralization as a predictor of provocation discography results in chronic low back pain, and the influence of disability and distress on diagnostic power. Spine Journal 2005;5(4):370–80.
Maetzel A, Li L. The economic burden of low back pain: a review of studies published between 1996 and 2001. Best Practice and Research in Clinical Rheumatology 2002;16(1):23–30.
Mannion RJ, Doubell TP, Coggeshall RE, Woolf CJ. Collateral sprouting of uninjured primary afferent A-fibers into the superficial dorsal horn of the adult rat spinal cord after topical capsaicin treatment to the sciatic nerve. Journal of Neuroscience 1996;16(16):5189–95.
McKenzie RA. The lumbar spine: mechanical diagnosis and therapy. Waikanae, NZ: Spinal Pub; 1981. p. 164.
McNab I. The mechanism of spondylogenic pain. In: Hirsch C, Zotterman Y, editors. Cervical pain. New York: Pergamon Press; 1972. p. 89–95.
Merskey H, Bogduk N. Classification of chronic pain. 2nd ed. Seattle: IASP Press; 1994.
Moog M, Quintner J, Hall T, Zusman M. The late whiplash syndrome: a psychophysical study. European Journal of Pain 2002;6(4):283–94.
Mooney V, Robertson J. The facet syndrome. Clinical Orthopedics and Related Research 1976;(115):149–56.
Mulleman D, Mammou S, Griffoul I, Watier H, Goupille P. Pathophysiology of disk-related sciatica. I. Evidence supporting a chemical component. Joint Bone Spine 2006;73:151–8.
O’Neill CW, Kurgansky ME, Derby R, Ryan DP. Disc stimulation and patterns of referred pain. Spine 2002;27(24):2776–81.
Ohnmeiss DD, Vanharanta H, Ekholm J. Degree of disc disruption and lower extremity pain. Spine 1997;22(14):1600–5.
Olmarker K. Anatomy and physiology of spinal nerve roots and the results of compression and irritation. In: Giles LGF, Singer KP, editors. Clinical anatomy and management of low back pain. Oxford: Butterworth-Heinemann; 1997. p. 243–54.
Olmarker K, Holm S, Rydevik B. Importance of compression onset rate for the degree of impairment of impulse propagation in experimental compression injury of the porcine cauda equina. Spine 1990;15(5):416–9.
Olmarker K, Larsson K. Tumor necrosis factor alpha and nucleus pulposus-induced nerve root injury. Spine 1998;23(23):2538–44.
Olmarker K, Myers RR. Pathogenesis of sciatic pain: role of herniated nucleus pulposus and deformation of spinal nerve root and dorsal root ganglion. Pain 1998;78(2):99–105.
Olmarker K, Rydevik B. Selective inhibition of tumor necrosis factor-alpha prevents nucleus pulposus-induced thrombus formation, intraneural edema, and reduction of nerve conduction velocity: possible implications for future pharmacologic treatment strategies of sciatica. Spine 2001;26(8):863–9.
Olmarker K, Rydevik B, Holm S, Bagge U. Effects of experimental graded compression on blood flow in spinal nerve roots. A vital microscopic study on the porcine cauda equina. Journal of Orthopaedic Research 1989;7(6):817–23.
Olmarker K, Rydevik B, Nordborg C. Autologous nucleus pulposus induces neurophysiologic and histologic changes in porcine cauda equina nerve roots. Spine 1993;18(11):1425–32.
Peng B, Wu W, Hou S, Li P, Zhang C, Yang Y. The pathogenesis of discogenic low back pain. Journal of Bone and Joint Surgery - British Volume 2005;87(1):62–7.
Petersen T, Laslett M, Thorsen H, Manniche C, Ekdahl C, Jacobson S. Diagnostic classification of non-specific low back pain. A new system integrating pathoanatomic and clinical categories. Physiotherapy Theory and Practice 2003;(19):213–37.
Ren K, Dubner R. Enhanced descending modulation of nociception in rats with persistent hindpaw inflammation. Journal of Neurophysiology 1996;76(5):3025–37.
Ren K, Dubner R. Descending modulation in persistent pain: an update. Pain 2002;100(1–2):1–6.
Ren XS, Selim AJ, Fincke G, Deyo RA, Linzer M, Lee A, et al. Assessment of functional status, low back disability, and use of diagnostic imaging in patients with low back pain and radiating leg pain. Journal of Clinical Epidemiology 1999;52(11):1063–71.
Rommel O, Malin JP, Zenz M, Janig W. Quantitative sensory testing, neurophysiological and psychological examination in patients with complex regional pain syndrome and hemisensory deficits. Pain 2001;93(3):279–93.
Rydevik B, Brown MD, Lundborg G. Pathoanatomy and pathophysiology of nerve root compression. Spine 1984;9(1):7–15.
Rydevik BL, Pedowitz RA, Hargens AR, Swenson MR, Myers RR, Garfin SR. Effects of acute, graded compression on spinal nerve root function and structure. An experimental study of the pig cauda equina. Spine 1991;16(5):487–93.
Schwarzer AC, Aprill CN, Derby R, Fortin J, Kine G, Bogduk N. The relative contributions of the disc and zygapophyseal joint in chronic low back pain. Spine 1994;19(7):801–6.
Scott D, Jull G, Sterling M. Widespread sensory hypersensitivity is a feature of chronic whiplash-associated disorder but not chronic idiopathic neck pain. Clinical Journal of Pain 2005;21(2):175–81.
Selim AJ, Ren XS, Fincke G, Deyo RA, Rogers W, Miller D, et al. The importance of radiating leg pain in assessing health outcomes among patients with low back pain. Results from the Veterans Health Study. Spine 1998;23(4):470–4.
Simone DA, Sorkin LS, Oh U, Chung JM, Owens C, LaMotte RH, et al. Neurogenic hyperalgesia: central neural correlates in responses of spinothalamic tract neurons. Journal of Neurophysiology 1991;66(1):228–46.
Sterling M, Jull G, Vicenzino B, Kenardy J. Sensory hypersensitivity occurs soon after whiplash injury and is associated with poor recovery. Pain 2003;104(3):509–17.
Sterling M, Jull G, Vicenzino B, Kenardy J, Darnell R. Physical and psychological factors predict outcome following whiplash injury. Pain 2005;114(1–2):141–8.
Strender LE, Sjoblom A, Sundell K, Ludwig R, Taube A. Interexaminer reliability in physical examination of patients with low back pain. Spine 1997;22(7):814–20.
Takada T, Nishida K, Doita M, Miyamoto H, Kurosaka M. Interleukin-6 production is upregulated by interaction between disc tissue and macrophages. Spine 2004;29(10):1089–92 (discussion 1093).
Takahashi K, Kagechika K, Takino T, Matsui T, Miyazaki T, Shima I. Changes in epidural pressure during walking in patients with lumbar spinal stenosis. Spine 1995;20(24):2746–9.
Takahashi K, Miyazaki T, Takino T, Matsui T, Tomita K. Epidural pressure measurements. Relationship between epidural pressure and posture in patients with lumbar spinal stenosis. Spine 1995;20(6):650–3.
Taylor J, Twomey L, Levander B. Contrasts between cervical and lumbar motion segments. Critical Reviews in Physical and Rehabilitation Medicine 2000;12:345–71.

Taylor JR, Twomey LT. Age changes in lumbar zygapophyseal joints. Observations on structure and function. Spine 1986;11(7):739–45.
Tracey I, Ploghaus A, Gati JS, Clare S, Smith S, Menon RS, et al. Imaging attentional modulation of pain in the periaqueductal gray in humans. Journal of Neuroscience 2002;22(7):2748–52.
Travell JG, Simons DG. The lower extremities, vol. 2. Philadelphia: Lippincott Williams & Wilkins; 1983.
Videman T, Nurminen M. The occurrence of anular tears and their relation to lifetime back pain history: a cadaveric study using barium sulfate discography. Spine 2004;29(23):2668–76.
Vroomen PC, de Krom MC, Knottnerus JA. Diagnostic value of history and physical examination in patients suspected of sciatica due to disc herniation: a systematic review. Journal of Neurology 1999;246(10):899–906.
Waddell G. Recent developments in low back pain. In: Giamberardino MA, editor. Pain 2002 - an updated review: refresher course syllabus. Seattle: IASP Press; 2002. p. 259–66.
Werneke M, Hart DL. Centralization phenomenon as a prognostic factor for chronic low back pain and disability. Spine 2001;26(7):758–64 (discussion 765).
Wetzel FT, Donelson R. The role of repeated end-range/pain response assessment in the management of symptomatic lumbar discs. Spine Journal 2003;3(2):146–54.
Wiesel SW, Tsourmas N, Feffer HL, Citrin CM, Patronas N. A study of computer-assisted tomography. I. The incidence of positive CAT scans in an asymptomatic group of patients. Spine 1984;9(6):549–51.
Woolf CJ, Bennett GJ, Doherty M, Dubner R, Kidd B, Koltzenburg M, et al. Towards a mechanism-based classification of pain? Pain 1998;77(3):227–9.
Woolf CJ, Doubell TP. The pathophysiology of chronic pain-increased sensitivity to low threshold A beta-fibre inputs. Current Opinion in Neurobiology 1994;4(4):525–34.
Woolf CJ, Mannion RJ. Neuropathic pain: aetiology, symptoms, mechanisms, and management. The Lancet 1999;353(9168):1959–64.
Woolf CJ, Salter MW. Neuronal plasticity: increasing the gain in pain. Science 2000;288:1765–8.
Yoshizawa H, Kobayashi S, Morita T. Chronic nerve root compression. Pathophysiologic mechanism of nerve root dysfunction. Spine 1995;20(4):397–407.
Young S, Aprill C, Laslett M. Correlation of clinical examination characteristics with three sources of chronic low back pain. Spine Journal 2003;3(6):460–5.
Yucel A, Senocak M, Kocasoy Orhan E, Cimen A, Ertas M. Results of the Leeds assessment of neuropathic symptoms and signs pain scale in Turkey: a validation study. Journal of Pain 2004;5(8):427–32.
Zusman M. Forebrain-mediated sensitization of central pain pathways: ‘non-specific’ pain and a new image for MT. Manual Therapy 2002;7(2):80–8.

Available online at www.sciencedirect.com

Manual Therapy 14 (2009) 231–239 www.elsevier.com/math

Technical and Measurement Report

Intra- and interexaminer reliability of four manual shoulder maneuvers used to identify subacromial pain

Kajsa Johansson a,*, Sören Ivarson b

a Senior lecturer, Division of Physical Therapy, Department of Medical and Health Sciences, Linköping University, S-581 83 Linköping, Sweden
b Specialist in Orthopedic Medicine, Feelgood AB, Linköping, Sweden

Received 26 July 2007; received in revised form 1 February 2008; accepted 1 March 2008

Abstract
Shoulder pain is a diagnostic challenge and the physical clinical examination of the shoulder is crucial. It is important that the diagnostic tests used are valid as well as reliable. The objective of the study was to assess intra- and interexaminer reliability for four manual shoulder maneuvers: the Neer impingement sign, the Hawkins–Kennedy impingement test, the Patte maneuver and the Jobe supraspinatus test. These maneuvers are frequently used in clinical practice to examine patients with shoulder complaints in which subacromial pain is highly suspected. Thirty-three participants with shoulder pain were included consecutively. Within a week from inclusion, the four maneuvers were performed by a physiotherapist. The procedure was standardized in order to increase reproducibility. After a week, the maneuvers were performed again by the same physical therapist (test–retest) and by another physical therapist (test for interexaminer reliability). All four maneuvers have an almost perfect agreement (Kappa coefficients 0.91–1.00), if performed with suggested standardizations. The Neer impingement sign, Hawkins–Kennedy impingement test, Patte maneuver and Jobe supraspinatus test are highly reproducible and therefore reliable to use in clinical practice to identify patients with subacromial pain with an impingement phenomenon, but the maneuvers are limited as structural discriminators.
© 2008 Elsevier Ltd. All rights reserved.

Keywords: Shoulder impingement syndrome; Physical examination; Diagnostic tests; Reliability

* Corresponding author. Tel.: +46 13 22 74 89. E-mail address: [email protected] (K. Johansson).
1356-689X/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.math.2008.03.003

K. Johansson, S. Ivarson / Manual Therapy 14 (2009) 231–239

1. Introduction
Patients with shoulder pain, especially subacromial pain with impingement phenomenon, are commonly seen in clinical practice and present a diagnostic challenge (Van der Windt et al., 1996). The physical clinical examination is crucial and it is important that the diagnostic tests used are valid as well as reliable (Krebs, 1987; Fritz and Wainner, 2001). Several tests or maneuvers are used in clinical practice to examine patients whose shoulder pain is thought to have a subacromial origin. The rationale behind several of these is to stress the tissues thought to be involved in the pain-generating mechanism, for example the Neer impingement sign (Neer, 1972, 1983), the Hawkins–Kennedy impingement test (Hawkins and Kennedy, 1980), the Patte maneuver (Leroux et al., 1995) and the Jobe supraspinatus test (Jobe and Moynes, 1982). Their different abilities to produce pain by provoking subacromial structures have been validated in several earlier studies (Sigholm and Styf, 1988; Leroux et al., 1995; Çaliş et al., 2000; MacDonald et al., 2000; Valadie et al., 2000; Holtby and Razmjou, 2004; Park et al., 2005); appropriate sensitivity was reported, but lower specificity, which affects their structural discriminating ability.

However, knowledge about their reliability is limited. Earlier studies have reported an unclear picture for the Hawkins impingement test. De Wilde et al. (2003) reported high intra- and interexaminer reliability on shoulders in healthy subjects. Only acceptable interexaminer reliability was reported by Nørregaard et al. (2002) in patients with longstanding shoulder pain. For the Neer impingement sign, good interexaminer reliability was reported when patients with hemiplegic shoulder pain were evaluated (Dromerick et al., 2006). No studies were found evaluating the Patte maneuver or Jobe supraspinatus test.

The objective of this study was to assess intra- and interexaminer reliability for four manual shoulder maneuvers: Neer impingement sign, Hawkins–Kennedy impingement test, Patte maneuver and Jobe supraspinatus test.

2. Methods 2.1. Subjects Patients with shoulder pain, attending primary health care in the Swedish city of Linko¨ping during August 2004eMarch 2005, were oﬀered participation. The family physicians and physical therapists (PT) had received information about the study and recruited participants with a probable subacromial impingement syndrome. Those who gave their informed consent to participate were referred to the research PT. Participants were consecutively recruited according to the following inclusion criteria: age 18e50 and duration of symptoms for less than 16 weeks. Those with a known rheumatic- or neurological disease were excluded, as well as those with neck problems or former surgery in the neck- and/or shoulder region. This study was performed after approval of the regional Ethics Committee at the Faculty of Health Sciences, Linko¨ping University, Sweden (no. M177-04). 2.2. Procedure Within a week from inclusion, each participant was examined by the research PT using the four maneuvers: Neers impingement sign (Neer, 1972,1983), Hawkinse Kennedy impingement test (Hawkins and Kennedy, 1980), Patte maneuver (Leroux et al., 1995) and Jobe supraspinatus test (Jobe and Moynes, 1982). The testing procedure was standardized in order to increase reproducibility. After a week, the four maneuvers were performed again by the same PT (testeretest) and by another PT (test for interexaminer reliability). Before starting the study, ﬁve subjects were pilot-tested in order to standardize the test procedure of the four maneuvers as well as the order: (1) the Neer impingement sign, (2) the HawkinseKennedy impingement test, (3) the Patte maneuver and (4) the Jobe supraspinatus test. A detailed description of all maneuvers is presented in the electronic version with matching pictures.

The two PTs performing the maneuvers differed in post-graduate education. One had a level III certificate in orthopedic manual therapy (OMT) and 18 years of experience working in the field of musculoskeletal disorders. The other had five years' experience and a level I OMT certificate. The participants were not informed about the response noted by the PTs until all maneuvers at the second occasion had been performed. Further, each participant was instructed not to give the research PT any information about their complaint except for the response to each maneuver, expressed in terms of reproduction of their shoulder pain or not. A positive response to a maneuver was defined as reproduction of the shoulder pain familiar to the patient, in its familiar location: around the shoulder and especially in the lateral aspect of the upper arm (the C5 dermatome). At the second test occasion, the participant rested for 1 h between the two test sessions: the retest for intraexaminer reliability and the test of interexaminer reliability. The order in which the two PTs performed their test sessions was randomized. Each maneuver was repeated twice to secure a consistent response. Stability of the current shoulder complaint and of pain levels, both between test occasions and between the two sessions at the second occasion (where pain could be affected by the maneuvers), was monitored with a Visual Analogue Scale (VAS) before each test session.

2.3. Statistical analyses

Descriptive statistics were used to present characteristics of participants. For analysis of intra- and interexaminer reliability (two examiners and a two-category nominal scale), Kappa statistics were used (Streiner and Norman, 1998). All analyses were undertaken using the Statistical Package for the Social Sciences (SPSS, version 10.1 for Windows). The Kappa coefficients (κ) were derived from 2 × 2 contingency tables (SPSS crosstabs). To interpret levels of agreement, κ > 0.81 was considered almost perfect, 0.61–0.80 substantial, 0.41–0.60 moderate, 0.21–0.40 fair and 0.0–0.20 slight agreement (Landis and Koch, 1977).
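The κ calculation and its interpretation described above can be sketched in a few lines of Python. This is only an illustration of Cohen's kappa for a 2 × 2 table, not the SPSS crosstabs procedure used in the study. The function names `cohen_kappa_2x2` and `landis_koch_label` and the example counts are hypothetical, and the label "poor" for negative κ comes from Landis and Koch (1977) rather than from the text.

```python
def cohen_kappa_2x2(a, b, c, d):
    """Cohen's kappa for two raters and a dichotomous (positive/negative) rating.

    a: both raters positive        b: rater 1 positive, rater 2 negative
    c: rater 1 negative, rater 2 positive        d: both raters negative
    """
    n = a + b + c + d
    if n == 0:
        raise ValueError("the contingency table is empty")
    p_observed = (a + d) / n
    # Agreement expected by chance, from the row and column totals.
    p_expected = ((a + b) * (a + c) + (c + d) * (b + d)) / n**2
    if p_expected == 1.0:
        # Both raters used one category for every patient: kappa is 0/0.
        raise ValueError("kappa is undefined when only one category is used")
    return (p_observed - p_expected) / (1 - p_expected)


def landis_koch_label(kappa):
    """Map kappa to the agreement bands listed in the text (Landis and Koch, 1977)."""
    if kappa > 0.80:
        return "almost perfect"
    if kappa > 0.60:
        return "substantial"
    if kappa > 0.40:
        return "moderate"
    if kappa > 0.20:
        return "fair"
    if kappa >= 0.0:
        return "slight"
    return "poor"  # assumption: band for negative kappa, not given in the text


if __name__ == "__main__":
    # Hypothetical counts, chosen only to illustrate the arithmetic.
    kappa = cohen_kappa_2x2(a=30, b=10, c=5, d=55)
    print(round(kappa, 2), landis_koch_label(kappa))
```

The undefined case is raised as an error rather than returned as a number, since a table in which both raters only ever use one category carries no information about chance-corrected agreement.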

3. Results

Thirty-three participants with shoulder pain were included and all completed their participation in the study. The mean age was 32 years (SD 10), ranging from 18 to 50. The mean duration of symptoms was 7.5 weeks (SD 4.0), ranging from 2 to 14 weeks. The affected arm was equally distributed between the left and right side among the participants, and only four of them reported pain at rest. However, no one reported a VAS above 3.0. On average, their experienced shoulder disability was 2.2 on the VAS (SD 2.0).

There was perfect agreement between the two test occasions when the same examiner repeated the maneuvers (intraexaminer reliability): every maneuver had a κ of 1.0. For interexaminer reliability, when both PTs examined the same patient independently of each other, there was perfect agreement for the Neer impingement sign and the Patte maneuver (κ = 1.0), and almost perfect agreement for the Hawkins–Kennedy impingement test and the Jobe supraspinatus test (κ = 0.91 and 0.94, respectively). These results are based on the figures in Table 1, and each test response is plotted in Figs. 1–8. None of the maneuvers generated prolonged elevated pain. In case of a positive response to a maneuver, the provoked pain returned to pre-test level within the hour.

K. Johansson, S. Ivarson / Manual Therapy 14 (2009) 231–239

Table 1
The number of positive and negative responses for the respective examiner in relation to each maneuver (n = 33).

                                  Examiner A                                  Examiner B
                                  First test occasion   Second test occasion  Second test occasion
                                                        (re-test)
Maneuver                          Positive   Negative   Positive   Negative   Positive   Negative
Neer impingement sign             26         7          26         7          26         7
Hawkins–Kennedy impingement test  25         8          25         8          26         7
Patte maneuver                    31         2          31         2          31         2
Jobe supraspinatus test           15         18         15         18         16         17
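The reported interexaminer coefficients can be checked against Table 1 with a short worked calculation. Table 1 gives only each examiner's totals, not the cross-tabulation, so the following assumes that the difference in totals for the Hawkins–Kennedy impingement test reflects a single discordant patient (negative for examiner A, positive for examiner B), with 25 patients positive for both examiners and 7 negative for both. Here $p_o$ is the observed and $p_e$ the chance-expected proportion of agreement.

```latex
\begin{align*}
p_o &= \frac{25 + 7}{33} = \frac{32}{33} \approx 0.970, \\
p_e &= \frac{25 \cdot 26 + 8 \cdot 7}{33^2} = \frac{706}{1089} \approx 0.648, \\
\kappa &= \frac{p_o - p_e}{1 - p_e} = \frac{1056 - 706}{1089 - 706} = \frac{350}{383} \approx 0.91 .
\end{align*}
```

Under the same assumption, the Jobe supraspinatus test (15 positive for both examiners, 17 negative for both, one discordant patient) gives κ = 510/543 ≈ 0.94. Both values agree with the coefficients reported above.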

4. Discussion

The impingement phenomenon is a clinical syndrome. A pathological process in the subacromial structures is indicated when pain is reproduced by maneuvers that change the distance between the roof and the floor of the subacromial space and/or demand rotator cuff activation in a position where the space is narrowed. The objective of this study was to assess intra- and interexaminer reliability for four of these manual shoulder maneuvers: the Neer impingement sign, the Hawkins–Kennedy impingement test, the Patte maneuver, and the Jobe supraspinatus test. All of them had a high level of agreement, both for intra- and interexaminer reliability.

To our knowledge, there are only a limited number of reliability studies in the field of diagnosing subacromial pain. De Wilde et al. (2003) reported high levels of intra- and interexaminer reliability (ICC 0.93–0.97) of a modified Hawkins–Kennedy impingement test. Five examiners performed three measurements each on five shoulders in healthy subjects and used a supine position. In the study by Nørregaard et al. (2002), impingement provocations referred to as the Neer impingement sign and the Hawkins–Kennedy impingement test were evaluated. The first resulted in poor interexaminer agreement and the latter reached moderate interexaminer agreement (κ > 0.4) when an orthopedic surgeon and a rheumatologist performed the test in

Fig. 1. Neer impingement sign. Patient (n = 33) responses at test–retest. (Plot of positive and negative responses for patients 1–33; series: test examiner A, re-test examiner A.)

Fig. 2. Neer impingement sign. Patients' (n = 33) responses from examiners A and B. (Plot of positive and negative responses for patients 1–33; series: examiner A, examiner B.)

consecutive patients with shoulder problems. Dromerick et al. (2006) evaluated the Neer impingement sign, among other tests, in patients recruited from an academic inpatient stroke rehabilitation service. They reported good interexaminer reliability (κ = 0.78) using two examiners who evaluated patients with hemiplegic shoulder pain. These researchers present various conclusions about reliability for some of the maneuvers in focus. Their results are based on more or less divergent materials, which makes comparisons difficult and could explain the variation in levels of agreement.

When interpreting the perfect to almost perfect levels of agreement in the current study, some methodological aspects must be taken into consideration. Only two examiners were included in this study, which limited the sources of variation and influenced the levels of agreement. This is partly compensated by the reasonable number of participating patients, but should be accounted for when extrapolating the results. Further, the evaluated maneuvers had a dichotomous response, a positive or negative finding. This also limits the possible measurement variation and consequently affects the levels of agreement, both for intra- and interexaminer reliability. However, a dichotomous response is the reality of how these tests are used in clinical practice. At the second test occasion (re-test), there was a risk of patients remembering their responses from the first test occasion and a possibility that the patients tried to be helpful when responding. On the other hand, clinical experience is that these responses are distinctly expressed both verbally and in body language, supporting a true test response. Agreement levels could further be biased by the fact that the examiner remembered the test response from the first test occasion

Fig. 3. Hawkins–Kennedy impingement test. Patient (n = 33) responses at test–retest. (Plot of positive and negative responses for patients 1–33; series: test examiner A, re-test examiner A.)

Fig. 4. Hawkins–Kennedy impingement test. Patients' (n = 33) responses from examiners A and B. (Plot of positive and negative responses for patients 1–33; series: examiner A, examiner B.)

that influenced the interpretation of the second. A preselected group of participants with suspected subacromial pain was enrolled in this study. This limits the number of negative responses, but in the actual clinical encounter these maneuvers are chosen especially when subacromial soft tissue involvement is suspected. The results of this study include both negative and positive responses (Table 1), but positive responses were most prevalent for three of the four maneuvers. Only the Jobe supraspinatus test had a more even distribution (Table 1). The patients also reported limited duration of symptoms and no one reported extensive pain or disability. Patients with higher pain ratings or more disabled shoulders, probably due to increased involvement of surrounding tissues, could make the test response more difficult to interpret and thereby increase variability. But since these tests have been reported as highly sensitive (Park et al., 2005), inclusion of patients with more disabled shoulders would probably increase the number of positive tests and not diminish reliability. In summary, all these aspects could influence the κ-coefficients (Sim and Wright, 2005).

The standardization used (Appendix) emphasizes the importance of locking the thoraco-scapular movement. This is crucial in order to provoke the subacromial structures as well as to obtain this high degree of reproducibility. This is supported in the study by De Wilde et al. (2003). The Jobe supraspinatus test was performed unilaterally, a conscious choice in order to secure a correct performance. Jobe and Moynes (1982) recommended the Jobe supraspinatus test as useful both when examining

Fig. 5. Patte maneuver. Patient (n = 33) responses at test–retest. (Plot of positive and negative responses for patients 1–33; series: test examiner A, re-test examiner A.)

Fig. 6. Patte maneuver. Patients' (n = 33) responses from examiners A and B. (Plot of positive and negative responses for patients 1–33; series: examiner A, examiner B.)

strength in the supraspinatus muscle and when strengthening it. This test, as well as the Patte maneuver (Leroux et al., 1995), was in the current study interpreted in relation to pain provocation and not only muscle force, which differs from the original description. When these maneuvers are performed by, for example, a PT in patients with suspected subacromial impingement, the main response is either no pain or pain combined with a varying degree of muscular weakness. The muscular weakness could be a result of musculotendinous changes and/or neuromuscular inhibition in the presence of pain (Farina et al., 2004). Accordingly, pain or no pain as test response seems more relevant, since muscle force is hard to evaluate in the presence of pain.

Intra- and interexaminer reliability is affected by different sources of variation that could influence reproducibility. The examiners' experience of the maneuvers used could be of importance, but the results of the current study, where experience differed, indicate that equal experience is not necessary to reach almost perfect intra- and interexaminer reliability. In this study, variation was limited by standardizing the maneuvers. Further, the within-subject variation was monitored. The stability of the current shoulder complaint was assessed by VAS for pain at rest as well as VAS for functional disability, before each test occasion. Further, the duration of pain in case of a positive response was monitored by using VAS. The pain always returned to pre-test level before the start of the test by the second examiner. Since these factors were stable, the examiner(s) seem to be the main source of variation. Altogether, these procedures can often be controlled in the actual clinical encounter to support reliability when using these maneuvers in daily practice.
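The discussion notes that the predominance of positive responses and the dichotomous scale could influence the κ-coefficients. One way to make this visible is to report the prevalence index, the bias index and the prevalence-adjusted bias-adjusted kappa (PABAK) alongside κ. The sketch below uses the usual definitions of these indices. It is not part of the study's analysis, and the function name `agreement_indices` is hypothetical. The Patte maneuver cell counts follow from Table 1 together with the reported κ of 1.0, which implies no discordant patients.

```python
def agreement_indices(a, b, c, d):
    """Prevalence index, bias index and PABAK for a 2 x 2 agreement table.

    a: both examiners positive       b: examiner A positive, B negative
    c: examiner A negative, B positive       d: both examiners negative
    """
    n = a + b + c + d
    if n == 0:
        raise ValueError("the contingency table is empty")
    p_observed = (a + d) / n
    return {
        # Imbalance between concordant positives and concordant negatives.
        "prevalence_index": abs(a - d) / n,
        # Imbalance between the two kinds of disagreement.
        "bias_index": abs(b - c) / n,
        # Kappa recomputed as if prevalence and bias were balanced.
        "pabak": 2 * p_observed - 1,
    }


if __name__ == "__main__":
    # Patte maneuver, examiners A and B: 31 concordant positive, 2 concordant negative.
    for name, value in agreement_indices(31, 0, 0, 2).items():
        print(f"{name}: {value:.3f}")
```

A prevalence index close to 1 means that nearly all concordant ratings fall in one category, which is the situation in which a single disagreement moves κ the most.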

Fig. 7. Jobe supraspinatus test. Patient (n = 33) responses at test–retest. (Plot of positive and negative responses for patients 1–33; series: test examiner A, re-test examiner A.)

Fig. 8. Jobe supraspinatus test. Patients' (n = 33) responses from examiners A and B. (Plot of positive and negative responses for patients 1–33; series: examiner A, examiner B.)

5. Conclusion

The Neer impingement sign, the Hawkins–Kennedy impingement test, the Patte maneuver and the Jobe supraspinatus test are all highly reliable. In combination with earlier research about their validity, these four maneuvers seem suitable for use in clinical practice to identify patients with subacromial pain with impingement phenomena. However, their ability to discriminate between structures in the area is limited. Their high level of intra- and interexaminer reliability, together with validity aspects, makes them a tool for the clinician in the diagnostic procedure. A homogeneous diagnostic classification is a prerequisite for relevant choice of treatment and necessary when implementing research results into clinical practice.

Acknowledgements

We wish to thank participating patients, involved family physicians and physical therapists, especially Sofi Tagesson, for making this study possible as well as Henrik Magnusson and Elisabeth Wilhelm for cooperation in the statistical area. Financial support: Linköping University and Hemborgs Memorial.

Appendix

All maneuvers were performed with the patient in a seated position.

The Neer impingement sign

The patient's arm was forward flexed combined with medial rotation in the gleno-humeral joint. The examiner prevented thoraco-scapular movement by fixating the acromion with a depressive force.

Illustrates the Neer impingement sign.

The Hawkins–Kennedy impingement test

The patient's arm was positioned in 90° flexion in the gleno-humeral joint as well as in the elbow. Then the gleno-humeral joint was forcibly rotated medially by lowering the forearm while supporting the elbow. The examiner prevented thoraco-scapular movement by fixating the acromion with a depressive force.

Illustrates the Hawkins–Kennedy impingement test.

The Patte maneuver

The patient's arm was positioned in 90° flexion in the gleno-humeral joint with the elbow in 90° flexion and then medially rotated by lowering the forearm. The patient was then instructed to activate lateral rotation against the examiner's resistance. The examiner prevented thoraco-scapular movement by fixating the acromion with a depressive force.

Illustrates the Patte maneuver.

The Jobe supraspinatus test

The patient's arm was extended and medially rotated and elevated to 90° abduction in the scapular plane (90° abduction and then 30° horizontal adduction). The examiner instructed the patient to maintain the position and resist a downward pressure.

Illustrates the Jobe supraspinatus test.

References

Çalış M, Akgün K, Birtane M, Karacan I, Çalış H, Tüzün F. Diagnostic value of clinical diagnostic tests in subacromial impingement syndrome. Annals of the Rheumatic Diseases 2000;59:44–7.
De Wilde L, Plasschaert F, Berghs B, Van Hoecke M, Verstaete K, Verdonk R. Quantified measurement of subacromial impingement. Journal of Shoulder and Elbow Surgery 2003;12:346–9.
Dromerick AW, Kumar A, Volshteyn Edwards DF. Hemiplegic shoulder pain syndrome: interrater reliability of physical diagnosis signs. Archives of Physical Medicine and Rehabilitation 2006;87:294–5.
Farina D, Arendt-Nielsen L, Merletti R, Graven-Nielsen T. Effect of experimental muscle pain on motor unit firing rate and conduction velocity. Journal of Neurophysiology 2004;91:1250–9.
Fritz JM, Wainner RS. Examining diagnostic tests: an evidence-based perspective. Physical Therapy 2001;81:1546–64.
Hawkins RJ, Kennedy JC. Impingement syndrome in athletes. American Journal of Sports Medicine 1980;8:151–8.
Holtby R, Razmjou H. Validity of the supraspinatus test as a single clinical test in diagnosing patients with rotator cuff pathology. Journal of Orthopaedic and Sports Physical Therapy 2004;34:194–200.
Jobe FW, Moynes DR. Delineation of diagnostic criteria and a rehabilitation program for rotator cuff injuries. American Journal of Sports Medicine 1982;10:336–9.
Krebs DE. Measurement theory. Physical Therapy 1987;67:1834–9.
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977;33:159–74.
Leroux J-L, Thomas E, Bonnel F, Blotman F. Diagnostic value of clinical tests for shoulder impingement syndrome. Revue du Rhumatisme 1995;62:423–8 (Engl. ed.).
MacDonald P, Clark P, Sutherland K. An analysis of the diagnostic accuracy of the Hawkins and Neer subacromial impingement signs. The Journal of Shoulder and Elbow Surgery 2000;9:299–301.
Neer CS. Anterior acromioplasty for the chronic impingement syndrome in the shoulder. The Journal of Bone and Joint Surgery 1972;54-A:41–50.
Neer CS. Impingement lesions. Clinical Orthopaedics and Related Research 1983;173:70–7.
Nørregaard J, Krogsgaard MR, Lorenzen T, Jensen EM. Diagnosing patients with longstanding shoulder joint pain. Annals of the Rheumatic Diseases 2002;61:646–9.
Park HB, Yokota A, Gill HS, El Rassi G, McFarland G. Diagnostic accuracy of clinical tests for the different degrees of subacromial impingement syndrome. The Journal of Bone and Joint Surgery 2005;87-A:1446–55.
Sigholm G, Styf J. Subacromial pressure during diagnostic shoulder tests. Clinical Biomechanics 1988;3:187–9.
Sim J, Wright CC. The Kappa statistic in reliability studies: use, interpretation, and sample size requirements. Physical Therapy 2005;85:257–68.
Streiner DL, Norman GR. Health measurement scales: a practical guide to their development and use. 2nd ed. New York: Oxford University Press Inc.; 1998. p. 104–27 [chapter 8].
Valadie III A, Jobe C, Pink M, Ekman EF, Jobe FW. Anatomy of provocative tests for impingement syndrome of the shoulder. The Journal of Shoulder and Elbow Surgery 2000;9:36–46.
Van der Windt DAWM, Koes BW, Boeke AJP, Devillé W, De Jong BA, Bouter LM. Shoulder disorders in general practice: prognostic indicators of outcome. British Journal of General Practice 1996;46:519–23.


Manual Therapy 14 (2009) 240
www.elsevier.com/math

Diary of events

Back and beyond
Theme: The lumbar spine and pelvis
Dates: Sat 28–29th March 2009
Venue: East Midlands Conference Centre, Nottingham
For more details visit www.physiofirst.org.uk

NZMPA biennial scientific conference, Heritage Hotel, Rotorua, New Zealand
28, 29 & 30 August 2009. The theme is 'Striving for Excellence in OMT' & also celebrating 40 years of Manual Therapy in New Zealand.
The conference co-coordinator is Vicki Reid, Phone 0800 646 000 or 09 476 5353, Fax 09 476 5354, e-mail: [email protected]
Website: www.nzmpa.org.nz

NOI International conference UK and Ireland
Nottingham UK, April 15–17, 2010
Dublin IRELAND, April 21–23, 2010
For further details www.noi2010.com, Fax +3906 51882443

Janet G. Travell, MD Seminar Series, Bethesda, USA
For information, contact: Myopain Seminars, 7830 Old Georgetown Road, Suite C-15, Bethesda, MD 20814-2432, USA. Tel.: +1 301 656 0220; Fax: +1 301 654 0333; website: www.painpoints.com/seminars.htm; E-mail: [email protected]

If you wish to advertise a course/conference, please contact: Karen Beeton, Associate Head of School (Professional Development), School of Health and Emergency Professions, University of Hertfordshire, College Lane, Hatfield, Herts AL10 9AB, UK. E-mail: [email protected] There is no charge for this service.

1356-689X/$ - see front matter doi:10.1016/S1356-689X(09)00012-5