The reporting quality of studies investigating the diagnostic accuracy of anti-CCP antibody in rheumatoid arthritis and its impact on diagnostic estimates
© Zintzaras et al.; licensee BioMed Central Ltd. 2012
Received: 19 July 2011
Accepted: 8 June 2012
Published: 25 June 2012
Recently anti-CCP testing has become popular in the diagnosis of rheumatoid arthritis (RA). However, the inadequate reporting of the relevant diagnostic studies may overestimate and bias the results, directing scientists into making false decisions. The aim of the present study was to evaluate the reporting quality of studies used anti-CCP2 for the diagnosis of RA and to explore the impact of reporting quality on pooled estimates of diagnostic measures.
PubMed was searched for clinical studies investigated the diagnostic accuracy of anti-CCP. The studies were evaluated for their reporting quality according to STARD statement. The overall reporting quality and the differences between high and low quality studies were explored. The effect of reporting quality on pooled estimates of diagnostic accuracy was also examined.
The overall reporting quality was relatively good but there are some essential methodological aspects of the studies that are seldom reported making the assessment of study validity difficult. Comparing the quality of reporting in high versus low quality articles, significant differences were seen in a relatively large number of methodological items. Overall, the STARD score (high/low) has no effect on the pooled sensitivities and specificities. However, the reporting of specific STARD items (e.g. reporting sufficiently the methods used in calculating the measures of diagnostic accuracy and reporting of demographic and clinical characteristics/features of the study population) has an effect on sensitivity and specificity.
The reporting quality of the diagnostic studies needs further improvement since the study quality may bias the estimates of diagnostic accuracy.
KeywordsRheumatoid arthritis Anti-cyclic citrullinated peptide 2 Anti-CCP2 Quality, Sensitivity Specificity, Meta-analysis
Rheumatoid arthritis (RA) is a chronic, systemic inflammatory disorder that affects many tissues and organs, mainly synovial joints . The disease leads progressively to the destruction of articular cartilage and ankylosis of the joints . Although the cause of RA is unknown, autoimmunity plays a pivotal role in both its chronicity and progression . RA affects females more frequently than males and it is diagnosed mainly in age 40–60 years .
The diagnosis of RA is based on clinical criteria and laboratory tests. Regarding the later tests, the presence of the rheumatoid factor (RF), an autoantibody, consists one of the American College of Rheumatology (ACR) criteria for presence and severity of RA . However, RF has a limited specificity since it can be detected in other autoimmune or infectious diseases, and in the healthy elderly. Anti-cyclic citrullinated protein antibodies (anti-CCP) are other autoantibodies that may be detected in RA patients. Recently anti-CCP testing has become substantial part of ACR-EULAR classification criteria for RA . There is evidence that CCP-assays provide comparable performance with that of RF . However, analysis of the association between anti-CCP antibody titre and RA activity produced contradictory results [8, 9]. Anti-CCP2 assay is the most popular because of its high diagnostic specificity and its predictive and prognostic value in RA [10–12].
Currently, diagnostic studies on anti-CCP assays are publishing with a high rate . However, overestimated and biased results from poorly designed and reported studies may direct scientists into making false decisions [14–16]. The reporting information on design and conduct of diagnostic studies is crucial, though, its absence has already been noticed [17, 18]. Nevertheless, appropriate reporting may allow researchers to detect potential bias in studies’ internal validity, to assess generalizability and applicability of their results . A survey of published studies of diagnostic accuracy showed that the methodological quality was not optimal. In addition, information on issues like study design, conduct and data analysis was often not reported [20, 21].
Inadequate reporting of the published diagnostic accuracy studies may restrict the generalizability, applicability and credibility of studies’ results. A number of guidelines and statements have been developed to improve the quality of a variety of study designs , including the diagnostic accuracy studies . In particular, in order to improve the reporting of diagnostic accuracy studies, the Standards for Reporting of Diagnostic Accuracy (STARD) statement has been proposed (http://www.stard-statement.org/ ). The STARD statement is a checklist of 25 criteria that diagnostic accuracy studies should conform to in order to make their conclusions easier to assess, interpret and generalize, and lead as a result to better decisions in diagnosis. However, STARD does not assess the actual quality of the research study but the reporting quality, two issues which are not necessarily correlated. In addition to STARD, another tooled has been proposed, called QUADAS, for assessing the methodological quality of diagnostic accuracy studies . Recently, QUADAS was used to evaluate the quality of anti-CCP RA studies in a meta-analysis .
The aim of the present study was twofold: first, to evaluate the reporting quality of studies used anti-CCP2 for the diagnosis of RA, according to the STARD statement, and second, to investigate whether quality of reporting is associated with the effect size of diagnostic metrics using meta-analytic techniques (data synthesis). The analysis was focused on the reporting of methods and results sections of the STARD statement. The effect of quality on diagnostic accuracy was focused on studies scored as “high quality” and “low quality”, and for specific items of STARD.
PubMed was searched for clinical studies, published from January 1987 (date of imposing the revised ACR criteria  to September 2010 that assessed the utility of anti-CCP2 assay in the diagnosis of RA. The search used the following strategy: (("diagnosis" or "diagnostic" or "sensitivity" or "specificity") and ("rheumatoid arthritis" or "RA") and ("anti-cyclic citrullinated peptide antibodies" or "anti-CCP" or "antiCCP" or "anti-CCP2" or "antiCCP2")).
The authors independently reviewed the abstracts to determine the eligibility of each article to potentially meet the search strategy. The references of the retrieved articles were also searched. Only articles in English language, published as full papers or short reports were considered in our study. Reviews, editorials, letters and comments were excluded. The agreement level was reported using Kappa statistics.
We included studies that evaluated the utility of anti-CCP2 antibody for diagnosis of RA with more than 10 participants enrolled that provided data sufficient to estimate both sensitivity and specificity. As controls were defined participants free of RA (i.e. diseased with other conditions or healthy). Disagreements were resolved by discussing the full articles.
The data were abstracted from each study by two authors (AP and DZ) independently. Data were extracted by using a standardized form that included study setting and technical details of the assay, demographic characteristics of the patients and 2×2 contingency tables (disease status and test outcome) needed to calculate at least the sensitivity and specificity.
When articles reported more than one set of 2×2 data (such as assays data from different manufacturers and/or different cut-offs), then each data set was considered as a different study. Also, articles reported data separately for multiple control groups (diseased, healthy) were considered as separate studies. In overlapping studies, the most recent and/or the largest study was recorded. The agreement level was also reported using Kappa statistics.
Study quality assessment with STARD
Proportion of reporting of the items in the STARD statement, overall and in a total of 103 diagnostic studies involving rheumatoid arthritis by STARD score group
Overall % of reporting item n = 103
% of reporting item
Lower quality articles (score < 9) n = 50
Higher quality articles (score ≥ 9) n = 53
1. The study population: The inclusion and exclusion criteria, setting and locations where the data were collected.
2. Participant recruitment: Was recruitment based on presenting symptoms, results from previous tests, or the fact that the participants had received the index tests or the reference standard?
3. Participant sampling: Was the study population a consecutive series of participants defined by the selection criteria in item 3 and 4? If not, specify how participants were further selected.
4. Data collection: Was data collection planned before the index test and reference standard were performed (prospective study) or after (retrospective study)?
5. The reference standard and its rationale.
6. Technical specifications of material and methods involved including how and when measurements were taken, and/or cite references for index tests and reference standard.
7. Definition of and rationale for the units, cut-offs and/or categories of the results of the index tests and the reference standard.
8. The number, training and expertise of the persons executing and reading the index tests and the reference standard.
9. Whether or not the readers of the index tests and reference standard were blind (masked) to the results of the other test and describe any other clinical information available to the readers.
10. Methods for calculating or comparing measures of diagnostic accuracy, and the statistical methods used to quantify uncertainty (e.g. 95% confidence intervals).
11. Methods for calculating test reproducibility, if done. #
12. When study was done, including beginning and ending dates of recruitment.
13. Clinical and demographic characteristics of the study population (e.g. age, sex, spectrum of presenting symptoms, comorbidity, current treatments, recruitment centers).
14. The number of participants satisfying the criteria for inclusion that did or did not undergo the index tests and/or the reference standard; describe why participants failed to receive either test (a flow diagram is strongly recommended).
15. Time interval from the index tests to the reference standard, and any treatment administered between.
16. Distribution of severity of disease (define criteria) in those with the target condition; other diagnoses in participants without the target condition.
17. A cross tabulation of the results of the index tests (including indeterminate and missing results) by the results of the reference standard; for continuous results, the distribution of the test results by the results of the reference standard.
18. Any adverse events from performing the index tests or the reference standard.
19. Estimates of diagnostic accuracy and measures of statistical uncertainty (e.g. 95% confidence intervals).
20. How indeterminate results, missing responses and outliers of the index tests were handled.
21. Estimates of variability of diagnostic accuracy between subgroups of participants, readers or centers, if done. #
22. Estimates of test reproducibility, typically imprecision (as CV) at 2 or 3 concentrations, if done. #
Estimation of diagnostic accuracy
The estimation of the diagnostic accuracy was based on the sensitivity (Se) and specificity (Sp). Se and Sp were calculated from contingency tables abstracted from each study.
Data synthesis and analysis
For each study the diagnostic metrics (Se, Sp, positive and negative likelihood ratio) were calculated. A bivariate model [24, 25] was used to estimate summary sensitivity and specificity, with 95% confidence and prediction regions around the summary points. Hierarchical SROC analysis that allows for between-study heterogeneity was also applied to four or more studies . Heterogeneity was evaluated visually by using the SROC curve and numerically by using the variance of the logit-transformed sensitivity and specificity. A smaller value of variance indicates low between study heterogeneity. The statistical analysis was performed using Stata v.10 (metandi and metandiplot commands ) (StataCorp, College Station, Texas) and SPSS, version 13.0 (SPSS Inc., Chicago).
Effect of study quality
In addition, to the overall percentages of reporting the STARD statement items, the quality of reporting in high versus low quality articles was explored. Studies were classified as high quality of reporting when quality score ≥ 9 and as lower quality when quality score < 9. The choice of quality score = 9 as cut-off was the median of the overall quality scores of studies. The overall quality score for each article was calculated by summing the weighted score of reported items. A unit weight was applied for each of the item 2, 5, 7, 10, 13, 16 and 19 (considered subjectively more “important”), whereas, a weight of 0.5 for each of the other items. The effect of study quality on diagnostic accuracy was evaluated based on the level of quality (high/low) and on the reporting results of the above “important” STARD items. Then, the estimates of pooled sensitivities and specificities were compared with a z-score test.
Results of Meta-analysis
Sensitivity (95% ci),%
Specificity (95% ci),%
Effect of STARD score
High quality score
Low quality score
Effect of STARD item 2
Effect of STARD item 5
Effect of STARD item 7
Effect of STARD item 10
Effect of STARD item 13
Effect of STARD item 16
Effect of STARD item 19
Table 1 shows the overall proportion of reporting of the 22 items in the methods and results sections of the STARD statement and the corresponding proportions for high and low quality articles.
Overall, 10 items (six and four items in methods and results sections, respectively) were reported by 85% or more of the studies (Table 1). In methods, the items include the reporting of 1) study population (inclusion/exclusion criteria, setting, location), 2) participants recruitment (eg. based on symptoms, previous testing), 3) participant sampling, 4) data collection (prospective or retrospective study), 5) methods for calculating or comparing measures of diagnostic accuracy and statistical methods used to quantify uncertainty and 6) methods for calculating reproducibility, if done. In results, the items include the reporting of 1) clinical and demographic characteristics of the study population (age, sex, presenting symptoms, comorbidity, current treatment), 2) the cross tabulation or the distribution of the test results by the results of the reference standard, 3) estimates of variability of diagnostic accuracy between subgroups of participants, centers, if done and 4) estimates of test reproducibility, if done.
Furthermore, 13 items (including the ten items already mentioned above) were reported by 70% or more of the studies. The 3 additional items were the reporting of 1) reference standard and its rationale of, 2) definition of and rationale for the units, cut-offs and/or categories of tests results and 3) estimates of diagnostic accuracy and measures of statistical uncertainty.
In contrast, some items were reported only by a small fraction of articles. For example, 20% of articles provided the number, training and expertise of persons executing the tests, 18% reported the blinding status, 13% provided information on recruitment, 12% reported adverse events and finally, 8% provided details about handling of missing responses and outliers.
Effect of study quality
In comparing the quality of reporting in high quality (quality score ≥ 9) versus lower quality (quality score < 9) articles, significant differences were seen in 11 items (P < 0.05) (6 items in methods: study population, data collection, reference standard, definition of units/cut-offs, number/training/expertise of persons executing the tests, methods for calculating diagnostic measures and 5 in results: dates of recruitment, clinical/demographic characteristics, information on recruitment, time interval between tests, estimates of diagnostic accuracy). In all these items high quality articles showed better performance. An item-by-item comparison is presented in Table 1.
Impact of study quality on diagnostic estimates
Table 2 shows the meta-analysis’ overall results (pooled sensitivities and specificities), the results according to STARD score (high/low quality) and the results for specific STARD items (comparison of outcome “yes” vs. “no”).
In comparing specific items (“yes” vs. “not”), the estimates of pooled sensitivities were statistically significant for items 10 and 13 [p = 0.03 and p = 0.06 (marginal), respectively]. In addition, the estimates of pooled specificities were statistically significant for items 13 and 16 (p = 0.01 and p = 0.01, respectively).
The present study investigated the quality of reporting of studies using the anti-CCP2 assay in RA patients according to the STARD statement. The differences between high and low quality studies were explored. The effect of reporting quality on pooled estimates of diagnostic metrics was also examined. Our analysis focused on the reporting of methodological items (items in method and results’ sections). In total, the 103 articles (corresponding to 132 studies) covered a publication period of 23 years. Almost the articles used in our analysis were published after the introduction of STARD statement (only 4 of them were published during 2003, year of STARD appearance).
Although the overall reporting quality was relatively good (13 items were reported by 70% or more of the studies) there are some essential methodological aspects of the studies (such as number/training/expertise of persons executing the tests, readers’ blinding to results, information on recruitment, adverse events from performing the tests, handling of missing responses and outliers) that are seldom reported making it difficult for the reader to assess explicitly the validity of a study. Comparing the quality of reporting in high versus low quality articles, significant differences were seen in a relatively large number of methodological items (11 items referred to: study population, data collection, reference standard, definition of units/cut-offs, number/training/expertise of persons executing and reading the tests, methods for calculating diagnostic measures, dates of recruitment, clinical/demographic characteristics, information on recruitment, time interval between tests, estimates of diagnostic accuracy).
Overall, the STARD quality score (high/low) has no effect on pooled sensitivity and pooled specificity. However, the meta-analysis showed an effect for specific STARD items. Studies not reporting sufficiently the methods used in calculating the measures of diagnostic accuracy (item 1), may have overestimated the sensitivity. In addition, the reporting of demographic and clinical characteristics/features of the study population (items 13 and 16) has affected the effect size of specificity, i.e. they have overestimated it, indicating also a spectrum bias .
However, the findings of the present synthesis (sensitivity of anti-CCP2, 71% and specificity, 96%) are compatible with those of earlier reviews (Nishimura et al. : sensitivity, 67% and specificity, 95%, Whiting et al. : sensitivity, 67%, specificity, 96%). An overestimation of our overall sensitivity might be resulted because of the lack of stratification by study design or disease duration in the analysis.
In a recent review, Whiting et al.  compared the accuracy of ACPA with that of RF in diagnosing RA in patients with early symptoms of the disease. They also assessed their studies for methodological quality by using a modification of the QUADAS criteria (items related to reporting quality, were removed). However, the impact of quality effect in diagnostic accuracy was not evaluated further. Nevertheless, the primary aim of the present study was to evaluate the effect of quality of reporting (according to STARD) in diagnostic accuracy rather than evaluating the effect of methodological quality (according to QUADAS); though, both tools can be useful for assessing the quality of diagnostic studies in a different perspective .
Applications of the STARD statement guidelines for assessing the quality of reporting in diagnostic accuracy studies, have been conducted in various medical fields such as in the field of diagnostic endoscopy , of juvenile idiopathic arthritis in peripheral joints , of diabetic retinopathy screening , of glucose monitor studies , of optical coherence tomography in glaucoma , of ultrasonography for the diagnosis of developmental dysplasia of the hip  and in the field of screening ultrasonography for trauma .
A limitation of the present study is that the literature search was restricted to PubMed. In addition, some studies may have been missed since we included only studies that provided data to estimate both sensitivity and specificity. However, the number of articles used is relatively large and an overview of reporting quality of studies may be obtained and the reached conclusions are unlikely to be affected by omitted studies. We would like to stress that lack of reporting of a STARD item does not necessarily implies that this item was not performed. Thus, a badly performed but well reported study will necessarily receive full credit. Finally, the published studies have had different design settings, and involved different stages of rheumatoid arthritis (study design, disease duration) which may question the synthesis of information, and therefore, the generalizability of results.
In conclusion, our attempt to assess the reporting quality of diagnostic accuracy studies in RA highlights the need for further improvement. Implementation of the quality reporting statements (e.g. CONSORT) have already improved the quality of reporting in other fields of medical research . Thus, guidelines on the reporting of diagnostic accuracy studies are expected to improve the quality of reports of diagnostic studies as well. Finally, the study quality has no effect on the pooled estimates of diagnostic accuracy.
The project was supported by Research Committee of the University of Athens and the Biometrical, Epidemiological and Clinical Research Organization (BECRO).
- Zintzaras E, Voulgarelis M, Moutsopoulos HM: The risk of lymphoma development in autoimmune diseases: a meta-analysis. Arch Intern Med. 2005, 165: 2337-2344. 10.1001/archinte.165.20.2337.View ArticlePubMedGoogle Scholar
- Lee DM, Weinblatt ME: Rheumatoid arthritis. Lancet. 2001, 358: 903-911. 10.1016/S0140-6736(01)06075-5.View ArticlePubMedGoogle Scholar
- Zintzaras E, Dahabreh IJ, Giannouli S, Voulgarelis M, Moutsopoulos HM: Infliximab and methotrexate in the treatment of rheumatoid arthritis: a systematic review and meta-analysis of dosage regimens. Clin Ther. 2008, 30: 1939-1955. 10.1016/j.clinthera.2008.11.007.View ArticlePubMedGoogle Scholar
- MacGregor AJ, Silman AJ: Rheumatoid arthritis-classification and epidemiology. Rheumatology. 2000, Mosby, London, 2Google Scholar
- Arnett FC, Edworthy SM, Bloch DA, McShane DJ, Fries JF, Cooper NS, Healey LA, Kaplan SR, Liang MH, Luthra HS, et al.: The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum. 1988, 31: 315-324. 10.1002/art.1780310302.View ArticlePubMedGoogle Scholar
- Aletaha D, Neogi T, Silman AJ, Funovits J, Felson DT, Bingham CO, Birnbaum NS, Burmester GR, Bykerk VP, Cohen MD, et al.: 2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative. Ann Rheum Dis. 2010, 69: 1580-1588. 10.1136/ard.2010.138461.View ArticlePubMedGoogle Scholar
- Coenen D, Verschueren P, Westhovens R, Bossuyt X: Technical and diagnostic performance of 6 assays for the measurement of citrullinated protein/peptide antibodies in the diagnosis of rheumatoid arthritis. Clin Chem. 2007, 53: 498-504. 10.1373/clinchem.2006.078063.View ArticlePubMedGoogle Scholar
- Greiner A, Plischke H, Kellner H, Gruber R: Association of anti-cyclic citrullinated peptide antibodies, anti-citrullin antibodies, and IgM and IgA rheumatoid factors with serological parameters of disease activity in rheumatoid arthritis. Ann N Y Acad Sci. 2005, 1050: 295-303. 10.1196/annals.1313.031.View ArticlePubMedGoogle Scholar
- van Gaalen F, Ioan-Facsinay A, Huizinga TW, Toes RE: The devil in the details: the emerging role of anticitrulline autoimmunity in rheumatoid arthritis. J Immunol. 2005, 175: 5575-5580.View ArticlePubMedGoogle Scholar
- De Rycke L, Peene I, Hoffman IE, Kruithof E, Union A, Meheus L, Lebeer K, Wyns B, Vincent C, Mielants H, et al.: Rheumatoid factor and anticitrullinated protein antibodies in rheumatoid arthritis: diagnostic value, associations with radiological progression rate, and extra-articular manifestations. Ann Rheum Dis. 2004, 63: 1587-1593. 10.1136/ard.2003.017574.View ArticlePubMedPubMed CentralGoogle Scholar
- Kudo-Tanaka E, Ohshima S, Ishii M, Mima T, Matsushita M, Azuma N, Harada Y, Katada Y, Ikeue H, Umeshita-Sasai M, et al.: Autoantibodies to cyclic citrullinated peptide 2 (CCP2) are superior to other potential diagnostic biomarkers for predicting rheumatoid arthritis in early undifferentiated arthritis. Clin Rheumatol. 2007, 26: 1627-1633. 10.1007/s10067-007-0558-5.View ArticlePubMedGoogle Scholar
- Liu X, Jia R, Zhao J, Li Z: The role of anti-mutated citrullinated vimentin antibodies in the diagnosis of early rheumatoid arthritis. J Rheumatol. 2009, 36: 1136-1142. 10.3899/jrheum.080796.View ArticlePubMedGoogle Scholar
- Whiting PF, Smidt N, Sterne JA, Harbord R, Burton A, Burke M, Beynon R, Ben-Shlomo Y, Axford J, Dieppe P: Systematic review: accuracy of anti-citrullinated Peptide antibodies for diagnosing rheumatoid arthritis. Ann Intern Med. 2010, 152: 456-464. W155-466View ArticlePubMedGoogle Scholar
- Guyatt GH, Tugwell PX, Feeny DH, Haynes RB, Drummond M: A framework for clinical evaluation of diagnostic technologies. CMAJ. 1986, 134: 587-594.PubMedPubMed CentralGoogle Scholar
- Fryback DG, Thornbury JR: The efficacy of diagnostic imaging. Med Decis Making. 1991, 11: 88-94. 10.1177/0272989X9101100203.View ArticlePubMedGoogle Scholar
- Kent DL, Larson EB: Disease, level of impact, and quality of research methods. Three dimensions of clinical efficacy assessment applied to magnetic resonance imaging. Invest Radiol. 1992, 27: 245-254. 10.1097/00004424-199203000-00014.View ArticlePubMedGoogle Scholar
- Nelemans PJ, Leiner T, de Vet HC, van Engelshoven JM: Peripheral arterial disease: meta-analysis of the diagnostic performance of MR angiography. Radiology. 2000, 217: 105-114.View ArticlePubMedGoogle Scholar
- De Vries E, Breedveld FC: An approach to therapeutic trials in rheumatology: the foundation for applied rheumatology research. Br J Rheumatol. 1996, 35: 1000-1001. 10.1093/rheumatology/35.10.1000.View ArticlePubMedGoogle Scholar
- Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, Lijmer JG, Moher D, Rennie D, de Vet HC: Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD Initiative. Ann Intern Med. 2003, 138: 40-44.View ArticlePubMedGoogle Scholar
- Reid MC, Lachs MS, Feinstein AR: Use of methodological standards in diagnostic test research. Getting better but still not good. JAMA. 1995, 274: 645-651. 10.1001/jama.1995.03530080061042.View ArticlePubMedGoogle Scholar
- Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, Moher D, Rennie D, de Vet HC, Lijmer JG: The STARD statement for reporting studies of diagnostic accuracy: explanation and elaboration. Ann Intern Med. 2003, 138: W1-W12.View ArticlePubMedGoogle Scholar
- Vandenbroucke JP: STREGA, STROBE, STARD, SQUIRE, MOOSE, PRISMA, GNOSIS, TREND, ORION, COREQ, QUOROM, REMARK… and CONSORT: for whom does the guideline toll?. J Clin Epidemiol. 2009, 62: 594-596. 10.1016/j.jclinepi.2008.12.003.View ArticlePubMedGoogle Scholar
- Whiting P, Rutjes AW, Reitsma JB, Bossuyt PM, Kleijnen J: The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Med Res Meth. 2003, 3: 25-10.1186/1471-2288-3-25.View ArticleGoogle Scholar
- Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH: Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. 2005, 58: 982-990. 10.1016/j.jclinepi.2005.02.022.View ArticlePubMedGoogle Scholar
- Harbord RM, Deeks JJ, Egger M, Whiting P, Sterne JA: A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics. 2007, 8: 239-251.View ArticlePubMedGoogle Scholar
- Harbord R, Whiting P: Metandi: Meta-analysis of diagnostic accuracy using hierarchical logistic regression. Stata J. 2009, 9: 211-229.Google Scholar
- Nishimura K, Sugiyama D, Kogata Y, Tsuji G, Nakazawa T, Kawano S, Saigo K, Morinobu A, Koshiba M, Kuntz KM, et al.: Meta-analysis: diagnostic accuracy of anti-cyclic citrullinated peptide antibody and rheumatoid factor for rheumatoid arthritis. Ann Intern Med. 2007, 146: 797-808.View ArticlePubMedGoogle Scholar
- Fontela PS, Pant Pai N, Schiller I, Dendukuri N, Ramsay A, Pai M: Quality and reporting of diagnostic accuracy studies in TB, HIV and malaria: evaluation using QUADAS and STARD standards. PLoS One. 2009, 4: e7753-10.1371/journal.pone.0007753.View ArticlePubMedPubMed CentralGoogle Scholar
- Areia M, Soares M, Dinis-Ribeiro M: Quality reporting of endoscopic diagnostic studies in gastrointestinal journals: where do we stand on the use of the STARD and CONSORT statements?. Endoscopy. 2010, 42: 138-147. 10.1055/s-0029-1243846.View ArticlePubMedGoogle Scholar
- Miller E, Roposch A, Uleryk E, Doria AS: Juvenile idiopathic arthritis of peripheral joints: quality of reporting of diagnostic accuracy of conventional MRI. Acad Radiol. 2009, 16: 739-757. 10.1016/j.acra.2009.01.012.View ArticlePubMedGoogle Scholar
- Zafar A, Khan GI, Siddiqui MA: The quality of reporting of diagnostic accuracy studies in diabetic retinopathy screening: a systematic review. Clin Exp Ophthalmol. 2008, 36: 537-542. 10.1111/j.1442-9071.2008.01826.x.View ArticleGoogle Scholar
- Mahoney J, Ellison J: Assessing the quality of glucose monitor studies: a critical evaluation of published reports. Clin Chem. 2007, 53: 1122-1128. 10.1373/clinchem.2006.083493.View ArticlePubMedGoogle Scholar
- Johnson ZK, Siddiqui MA, Azuara-Blanco A: The quality of reporting of diagnostic accuracy studies of optical coherence tomography in glaucoma. Ophthalmology. 2007, 114: 1607-1612. 10.1016/j.ophtha.2006.11.036.View ArticlePubMedGoogle Scholar
- Roposch A, Moreau NM, Uleryk E, Doria AS: Developmental dysplasia of the hip: quality of reporting of diagnostic accuracy for US. Radiology. 2006, 241: 854-860. 10.1148/radiol.2413051358.View ArticlePubMedGoogle Scholar
- Stengel D, Bauwens K, Rademacher G, Mutze S, Ekkernkamp A: Association between compliance with methodological standards of diagnostic research and reported test accuracy: meta-analysis of focused assessment of US for trauma. Radiology. 2005, 236: 102-111. 10.1148/radiol.2361040791.View ArticlePubMedGoogle Scholar
- Ziogas DC, Zintzaras E: Analysis of the quality of reporting of randomized controlled trials in acute and chronic myeloid leukemia, and myelodysplastic syndromes as governed by the CONSORT statement. Ann Epidemiol. 2009, 19: 494-500. 10.1016/j.annepidem.2009.03.018.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2474/13/113/prepub