scholarly journals Psychometric properties of the PHQ-9 depression scale in people with multiple sclerosis: a systematic review

2018 ◽  
Author(s):  
Sarah Patrick ◽  
Peter Connick

AbstractBackgroundDepression affects approximately 25% of people with MS (pwMS) at any given time. It is however under recognised in clinical practice, in part due to a lack of uptake for brief assessment tools and uncertainty about their psychometric properties. The 9-item Patient Health Questionnaire (PHQ-9) is an attractive candidate for this role.ObjectiveTo synthesise published findings on the psychometric properties of the 9-item Patient Health Questionnaire (PHQ-9) when applied to people with multiple sclerosis (pwMS).Data sourcesPubMed, Medline and ISI Web of Science databases, supplemented by hand-searching of references from all eligible sources.Study eligibility criteriaPrimary literature written in English and published following peer-review with a primary aim to evaluate the performance of the PHQ-9 in pwMS.Outcome measuresPsychometric performance with respect to appropriateness, reliability, validity, responsiveness, precision, interpretability, acceptability, and feasibility.ResultsSeven relevant studies were identified, these were of high quality and included 5080 participants from all MS disease-course groups. Strong evidence was found supporting the validity of the PHQ-9 as a unidimensional measure of depression. Used as a screening tool for major depressive disorder (MDD) with a cut-point of 11, sensitivity was 95% sensitivity and specificity 88.3% (PPV 51.4%, NPV 48.6%). Alternative scoring systems that may address the issue of overlap between somatic features of depression and features of MS per se are being developed, although their utility remains unclear. However data on reliability was limited, and no specific evidence was available on test-retest reliability, responsiveness, acceptability, or feasibility.ConclusionsThe PHQ-9 represents a suitable tool to screen for MDD in pwMS. However use as a diagnostic tool cannot currently be recommended, and the potential value for monitoring depressive symptoms cannot be established without further evidence on test-retest reliability, responsiveness, acceptability, and feasibility.PROSPERO register ID: CRD42017067814

2021 ◽  
Author(s):  
Tomer Yona ◽  
Asaf Weisman ◽  
Uri Gottlieb ◽  
Eshed Lin ◽  
Youssef Masharawi

Objective: To assess the psychometric properties of the Hebrew version of the Patient Health Questionnaire (PHQ-9) in the general population. Methods: Using an online survey, we assessed test-retest reliability with a two-week time interval. A total of 118 participants enrolled in the study, of whom 103 completed the survey twice. Each participant filled out the PHQ-9 and the 12-Item Short Form Survey (SF-12). Our statistical analysis includes Cronbachs alpha, Intraclass Correlation Coefficient (ICC2,1), Spearmans rank correlation coefficient, Standard Error of Measurement (SEM), and Minimal Detectable Change (MDC). Results: Internal consistency of the Hebrew version of the PHQ-9 ranged from alpha=0.79-0.83. The test-retest reliability of the questionnaire is good (ICC2,1= 0.81), and it is moderately and negatively correlated to the mental component of the SF-12 (Spearman rho;= -0.57, p< .05). The SEM of the PHQ-9 is 1.83 points, and the MDC was found to be 5 points. Conclusion: The Hebrew version of the PHQ-9 is valid and reliable for screening self-reported depressive symptoms online in the general Hebrew-speaking population.


2018 ◽  
Vol 67 (4) ◽  
pp. 255-263 ◽  
Author(s):  
Maren de Moraes e Silva ◽  
Pilar Bueno Siqueira Mercer ◽  
Maria Carolina Zavagna Witt ◽  
Renata Guedes Ramina Pessoa ◽  
Camila Poletto Viveiros ◽  
...  

RESUMO Objetivo Realizar uma revisão sistemática com metanálise visando evidenciar as ferramentas de rastreio psiquiátrico mais adequadas na abordagem de pacientes adultos com esclerose múltipla avaliadas por estudos de acurácia diagnóstica. Métodos As bases de dados Medline, SciELO, PubMed e Lilacs foram utilizadas para pesquisa de artigos referentes ao tema proposto. Para essa busca, foram utilizados os termos “ multiple sclerosis psychiatric symptoms ” e “ multiple sclerosis psychiatric screening ”, sendo incluídos artigos na língua portuguesa e inglesa publicados entre 2007 e 2017. A qualidade dos estudos incluídos foi avaliada utilizando o método QUADAS. Uma metanálise foi conduzida com o auxílio do programa RevMan 5.3. Resultados Sete artigos foram selecionados para análise. A concordância dos revisores foi calculada com um kappa de 0,95. A maioria dos estudos selecionados realizou avaliação de ferramentas de rastreio para depressão, tendo apenas dois deles abordado a busca de transtornos de ansiedade. O rastreio de transtorno depressivo foi realizado a partir da avaliação de 11 instrumentos diferentes, enquanto o de ansiedade, por apenas dois. A maior parte dos testes analisados apresentou boa acurácia, e a ferramenta BAI foi a única com desempenho regular na análise da curva ROC (0,77 de área sob a curva). Os demais testes apresentaram desempenho bom ou excelente, cursando com mais de 83% na avaliação de área sob a curva. Conclusões As ferramentas HADS ( Hospital Anxiety and Depression Scale ), CESD ( Center for Epidemiological Scale – Depression ) e PHQ-9 ( Patient Health Questionnaire-9 ) parecem ser as mais indicadas para o rastreio psiquiátrico de pacientes com esclerose múltipla. Registro PROSPERO: CRD42017082741.


2021 ◽  
pp. 025371762110221
Author(s):  
Aseem Mehra ◽  
Arun Agarwal ◽  
Mohammad Bashar ◽  
Sandeep Grover

Background: A limited number of studies have evaluated the psychometric properties of rating scales used to assess depression in the older adults. The present study aimed to assess the validity of the Hindi Geriatric Depression Scale (GDS, 30, 15, 10, 5, 4, and 1 item version) and Hindi Patient Health Questionnaire (nine and two items version) in a group of older adults residing in a rural community. Methods: The psychometric properties of these scales were assessed against the diagnosis of depression a qualified psychiatrist made by using a semistructured interview. Results: Total 125 older adults were recruited from a rural community, with a mean age of 65.5 (SD: 6.4) years. The prevalence of depression was 36.8% as per the evaluation by the psychiatrist. When the agreement of different scales with the clinicians’ diagnosis was evaluated, it was seen that sensitivity, specificity, and Cohen Kappa value of GDS-30 and 15 were better than the other scales used to assess depression. When the sensitivity and specificity were evaluated using newer cutoffs, the specificity and sensitivity of GDS-30 were more than that of other scales. Conclusion: Hindi version of GDS-30 with a cutoff of 13 has excellent psychometric properties.


Author(s):  
Hala Darwish ◽  
Pia Zeinoun ◽  
Natali Farran ◽  
Husam Ghusn ◽  
Bassem Yamout ◽  
...  

Abstract Objective: Multiple sclerosis (MS) is often associated with cognitive deficits. Accurate evaluation of the MS patients’ cognitive performance is essential for diagnosis and treatment recommendation. The Brief International Cognitive Assessment in Multiple Sclerosis (BICAMS), widely used cognitive testing battery, examines processing speed, verbal and visuospatial learning, and memory. Our study aims to examine the psychometric properties of an Arabic version of the BICAMS and to provide normative values in a Lebanese sample. Method: The BICAMS, comprised of the Symbol Digit Modalities Test (SDMT), Brief Visuospatial Memory Test-Revised (BVMT-R), and a newly developed verbal learning/memory test, the Verbal Memory Arabic Test (VMAT), were administered on healthy subjects and MS patients. The sample consisted of 180 healthy individuals, of whom 63 were retested after 2–3 weeks. Forty-three MS patients matched with 43 healthy subjects based on age, sex, and years of education were assessed. A sample of 10 MS patients was also examined on two occasions. Test–retest reliability and criterion-related validity were examined, and regression-based norms were derived. Results: The test–retest correlations showed good evidence of reliability with coefficients ranging between 0.64 and 0.73 in the healthy sample, and between 0.43 and 0.92 in the MS sample. The BICAMS was able to discriminate between MS patients and matched healthy participants on the SDMT and BVMT-R. Normative data were comparable to other studies. Conclusions: This new Arabic version of the BICAMS shows initial good psychometric properties. While good evidence of VMAT’s reliability was shown in the healthy participants, less test–retest reliability in this tool was seen in the MS group, and partial criterion-related validity was evident. This renders further examination of the VMAT. We provide regression-based norms for a Lebanese sample and encourage the use of this battery in both research and clinical settings.


BMJ Open ◽  
2018 ◽  
Vol 8 (10) ◽  
pp. e021734 ◽  
Author(s):  
Alison Griffiths ◽  
Rachel Toovey ◽  
Prue E Morgan ◽  
Alicia J Spittle

ObjectiveGross motor assessment tools have a critical role in identifying, diagnosing and evaluating motor difficulties in childhood. The objective of this review was to systematically evaluate the psychometric properties and clinical utility of gross motor assessment tools for children aged 2–12 years.MethodA systematic search of MEDLINE, Embase, CINAHL and AMED was performed between May and July 2017. Methodological quality was assessed with the COnsensus-based Standards for the selection of health status Measurement INstruments checklist and an outcome measures rating form was used to evaluate reliability, validity and clinical utility of assessment tools.ResultsSeven assessment tools from 37 studies/manuals met the inclusion criteria: Bayley Scale of Infant and Toddler Development-III (Bayley-III), Bruininks-Oseretsky Test of Motor Proficiency-2 (BOT-2), Movement Assessment Battery for Children-2 (MABC-2), McCarron Assessment of Neuromuscular Development (MAND), Neurological Sensory Motor Developmental Assessment (NSMDA), Peabody Developmental Motor Scales-2 (PDMS-2) and Test of Gross Motor Development-2 (TGMD-2). Methodological quality varied from poor to excellent. Validity and internal consistency varied from fair to excellent (α=0.5–0.99). The Bayley-III, NSMDA and MABC-2 have evidence of predictive validity. Test–retest reliability is excellent in the BOT-2 (intraclass correlation coefficient (ICC)=0.80–0.99), PDMS-2 (ICC=0.97), MABC-2 (ICC=0.83–0.96) and TGMD-2 (ICC=0.81–0.92). TGMD-2 has the highest inter-rater (ICC=0.88–0.93) and intrarater reliability (ICC=0.92–0.99).ConclusionsThe majority of gross motor assessments for children have good-excellent validity. Test–retest reliability is highest in the BOT-2, MABC-2, PDMS-2 and TGMD-2. The Bayley-III has the best predictive validity at 2 years of age for later motor outcome. None of the assessment tools demonstrate good evaluative validity. Further research on evaluative gross motor assessment tools are urgently needed.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Stefanie Bühn ◽  
Peggy Ober ◽  
Tim Mathes ◽  
Uta Wegewitz ◽  
Anja Jacobs ◽  
...  

Abstract Background Systematic Reviews (SRs) can build the groundwork for evidence-based health care decision-making. A sound methodological quality of SRs is crucial. AMSTAR (A Measurement Tool to Assess Systematic Reviews) is a widely used tool developed to assess the methodological quality of SRs of randomized controlled trials (RCTs). Research shows that AMSTAR seems to be valid and reliable in terms of interrater reliability (IRR), but the test retest reliability (TRR) of AMSTAR has never been investigated. In our study we investigated the TRR of AMSTAR to evaluate the importance of its measurement and contribute to the discussion of the measurement properties of AMSTAR and other quality assessment tools. Methods Seven raters at three institutions independently assessed the methodological quality of SRs in the field of occupational health with AMSTAR. Between the first and second ratings was a timespan of approximately two years. Answers were dichotomized, and we calculated the TRR of all raters and AMSTAR items using Gwet’s AC1 coefficient. To investigate the impact of variation in the ratings over time, we obtained summary scores for each review. Results AMSTAR item 4 (Was the status of publication used as an inclusion criterion?) provided the lowest median TRR of 0.53 (moderate agreement). Perfect agreement of all reviewers was detected for AMSTAR-item 1 with a Gwet’s AC1 of 1, which represented perfect agreement. The median TRR of the single raters varied between 0.69 (substantial agreement) and 0.89 (almost perfect agreement). Variation of two or more points in yes-scored AMSTAR items was observed in 65% (73/112) of all assessments. Conclusions The high variation between the first and second AMSTAR ratings suggests that consideration of the TRR is important when evaluating the psychometric properties of AMSTAR.. However, more evidence is needed to investigate this neglected issue of measurement properties. Our results may initiate discussion of the importance of considering the TRR of assessment tools. A further examination of the TRR of AMSTAR, as well as other recently established rating tools such as AMSTAR 2 and ROBIS (Risk Of Bias In Systematic reviews), would be useful.


Sign in / Sign up

Export Citation Format

Share Document