Explanation of Medical Literature Ratings

(Ratings “1a” through “11c” noted under summary of each study)

 

Back to ODG - TWC Index

 

Ranking by Type of Evidence:

(click on links to go to explanation)

            STUDIES

1. Systematic Review/Meta-Analysis

2. Controlled Trial – Randomized (RCT) or Controlled

3. Cohort Study - Prospective or Retrospective

4. Case Series

5. Unstructured Review

            OTHER:

6. Nationally Recognized Treatment Guideline (from guidelines.gov)

7. State Treatment Guideline

8. Other Treatment Guideline

9. Textbook

10. Conference Proceedings/Presentation Slides

11. Case Reports and Descriptions

 

Ranking by Quality within Type of Evidence:

(click on links to go to explanation)

a. High Quality

b. Medium Quality

c. Low Quality

 

Ranking by Type of Evidence

 

1. Systematic Review/Meta-Analysis

Systematic Reviews:  Written by reviewers who use explicit and rigorous methods to identify, critically appraise, and synthesize relevant studies from the published medical research.  They use the process of systematically locating, appraising and synthesizing evidence from scientific studies in order to obtain a reliable overview.  The function of a systematic review is: 1) to summarize the literature and 2) to provide new information that may not be readily apparent from individual studies where the effects are small, but become apparent in when the data from many studies are pooled together.  Example: Cochrane Database of Systematic Reviews.

Meta-analysis:  A type of systematic review that is an overview and also uses quantitative methods to summarize the results. A quantitative method of combining the results of independent studies (usually drawn from the published literature) and synthesizing summaries and conclusions which may be used to evaluate therapeutic effectiveness, plan new studies, etc., with application chiefly in the areas of research and medicine.  Any study with the Level 1 ranking in ODG must have been accepted for publication in a peer reviewed journal, and that journal must be one of the journals accepted for inclusion in MEDLINE® by the National Library of Medicine.  For this Journal Selection Criteria, see www.nlm.nih.gov/pubs/factsheets/jsel.html.  Unpublished studies, or studies in magazines that do not publish original research, would not receive this ranking.

 

2. Controlled Trial – Randomized (RCT) or Controlled

These are analytical experimental studies, where variables can be better controlled on a prospective basis. In a RCT (Randomized Controlled Clinical Trial), a group of patients is randomized into an experimental group and a control group. These groups are followed up for the variables/outcomes of interest.  Advantages: Unbiased distribution of confounders; Blinding more likely; Randomization facilitates statistical analysis. Disadvantages: Expensive: time and money; Volunteer selection bias; Ethically problematic at times.  Any study with the Level 2 ranking in ODG must have been accepted for publication in a peer reviewed journal, and that journal must be one of the journals accepted for inclusion in MEDLINE® by the National Library of Medicine.  Unpublished studies, or studies in magazines that do not publish original research, would not receive this ranking.

 

3. Cohort Study - Prospective or Retrospective

Analytical observational studies involving identification of two groups (cohorts) of patients, one which did receive the exposure of interest, and one which did not, and following these cohorts forward for the outcome of interest.   Advantages: Ethically safe; Subjects can be matched; Can establish timing and direction of events; Eligibility criteria and outcome assessments can be standardized; Administratively easier and cheaper than RCT.  Disadvantages: Controls may be difficult to identify; Exposure may be linked to a hidden confounder; Blinding is difficult; Randomization not present; For rare disease, large sample sizes or long follow-up necessary.  Any study with the Level 3 ranking in ODG must have been accepted for publication in a peer reviewed journal, and that journal must be one of the journals accepted for inclusion in MEDLINE® by the National Library of Medicine.

 

4. Case Series

Analytical observational studies involving identifying groups of patients who have the outcome or treatment of interest (cases) and quantifying the results.  Ideally, control patients without the same outcome are also tracked, looking back to see if they had the exposure of interest.  (The use of controls would influence the quality rating of a Case Series.)  Generally, since the minimum ODG quality rating for studies (“c”) requires at least 10 cases, there must be 10 or more cases for a study to be classified as a Case Series, and otherwise the article would be classified in ODG as Case Reports and Descriptions.  Advantages of Case Series: Quick and cheap; Only feasible method for very rare disorders or those with long lag between exposure and outcome; Fewer subjects needed than cross-sectional studies.  Disadvantages: Reliance on recall or records to determine exposure status; Confounders; Selection of control groups is difficult; Potential bias: recall, selection.  Any study with the Level 4 ranking in ODG must have been accepted for publication in a peer reviewed journal, and that journal must be one of the journals accepted for inclusion in MEDLINE® by the National Library of Medicine.

 

5. Unstructured Review

Descriptive (versus analytical) and observational (versus experimental) studies, written by reviewers who describe current practice as well as relevant studies from the published medical research, with no attempt to pool the results analytically.  Compared to Systematic Reviews, an Unstructured Review makes little attempt to quantify outcomes based on the body of evidence described.  Any study with the Level 5 ranking in ODG must have been accepted for publication in a peer reviewed journal, and that journal must be one of the journals accepted for inclusion in MEDLINE® by the National Library of Medicine.

 

6. Nationally Recognized Treatment Guideline (from guidelines.gov)

 

Accepted for inclusion in the National Guideline Clearinghouse by the Federal Agency for Healthcare Research & Quality (AHRQ), which requires that the guideline recommendations be based on a systematic literature search and review of scientific studies published in peer reviewed journals, and revised on a regular basis to maintain currency with new studies.

 

7. State Treatment Guideline

 

Treatment guidelines created for use in a specific state in the U.S., or for use in a province in Canada, or for use by another governmental entity, and they have the backing of the respective jurisdictional or governmental authority.

 

8. Other Treatment Guideline

 

Other treatment guidelines.  These are typically national treatment guidelines not accepted in the National Guideline Clearinghouse, in many cases because the guideline publishers have chosen not to apply for inclusion (for example, commercial guidelines such as Milliman, McKesson, InterQual, etc.), or because they are private guidelines created for use under the terms of a specific health insurance policy (for example, Blue Cross, Medicare, Aetna, Cigna, United Healthcare, etc.).  Since studies by healthcare insurers are generally given a rating of Level 8, they are not characterized in ODG as among the highest quality references when there are numerous other studies available.  However, when there are limited studies available with the high quality ratings, it may be necessary to identify other studies that could provide guidance on a subject.  In fact, many of the healthcare insurance provider structured reviews are very high quality, they represent a thorough analysis and quantitative weighting of all available evidence on a subject, including unpublished studies that the insurer may have conducted, and these healthcare insurance reviews might even rank as Level 1 if they were published in the peer-reviewed literature and available in MEDLINE®.  Furthermore, the fact that a particular treatment is either covered or not covered by healthcare insurance should be relevant to coverage decisions in workers’ compensation.

 

9. Textbook

 

Medical reference texts, which may represent standards of practice, but which in and of themselves, are not necessarily evidence based versus consensus based or based primarily on the personal experiences of the authors.

 

10. Conference Proceedings/Presentation Slides

 

These are studies that have not been published in peer reviewed journals.

 

11. Case Reports and Descriptions

 

Descriptive articles published in the peer reviewed journals covering individual cases, and lacking any comparisons to controls.  Generally, since the minimum ODG quality rating for studies (“c”) requires at least 10 cases, there must be 10 or more cases for a study to be classified as a Case Series, and otherwise the article would be classified in ODG as Case Reports and Descriptions.  These articles were not included in the evidence base for any treatment guidelines except for the Council on Chiropractic Guidelines for Practice Parameters (CCGPP) chiropractic practice guidelines.

 

 

Ranking by Quality within Type of Evidence:

 

In evaluating clinical trials ODG has adopted the standards from the "Cochrane Handbook for Systematic Reviews of Interventions," as updated in September 2006.  (Higgins, 2006)  Specific additional criteria used by ODG include the following:

 

a. High Quality

Sample size: Generally over 300, but at least 100, depending on other factors below.

Conflict of interest: Authors and researchers had no financial interest in the product or service being studied.

Study design: Ideally, blinded.  No identifiable bias, including recall bias, confounding factors, selection bias, compliance bias, non-response bias, or measurement bias. If a case series, should be a case control series.

Statistical significance: 99% Confidence level that the outcomes likelihood ratio will not cross 1.0 (i.e., the p value is .01).

 

b. Medium Quality

Sample size: From 20-50 up to 100-300, depending on other factors below.

Conflict of interest: Authors and researchers had no financial interest in the product or service being studied.

Study design: No significant bias, including recall bias, confounding factors, selection bias, compliance bias, non-response bias, or measurement bias.  If a case series, should be a case control series.

Statistical significance: 95% Confidence level that the likelihood ratio will not cross 1.0 (i.e., the p value is .05).

 

c. Low Quality

Sample size:  Generally under 20-50, depending on other factors below, but no less than 10.

Conflict of interest: Authors and researchers may have had some financial interest in the product or service being studied, even if the sample size was large.

Study design: Some obvious bias, including recall bias, confounding factors, selection bias, compliance bias, non-response bias, or measurement bias.

Statistical significance: Does not meet the 95% Confidence level that the likelihood ratio will not cross 1.0 (i.e., the p value is .05).

 

 

Link between evidence and recommendations

 

ODG Treatment is being updated quarterly on the Web.  The Contents page indicates the last date updated for each chapter.  The hard copy version is published once a year, but this is not recommended since it does not link into the actual studies, and it is not as current as the Web version.

 

The heart of each chapter in ODG Treatment is the "Procedure Summary", which provides a concise synopsis of effectiveness, if any, based on existing medical evidence, hyper-linked directly into the studies on which they are based, in abstract form, which have been ranked, highlighted and indexed.  The "Treatment Planning" section identifies the ideal treatment plans that may be followed after illness or injury, based on the "Procedure Summary".  "Codes for Automated-Approval" maps procedure codes to ICD-9 diagnosis codes based on the ideal treatment protocol, with a field for “maximum occurrences”, for auto-approval of charges that meet the guideline.

 

For example, in the Low Back chapter, under Fusion, it says, "Not recommended in the absence of fracture, dislocation, or instability", so the Treatment Protocol does not include fusion.  Same for IDET, facet injections, etc., etc.  Under Epidural injections, it says, "Recommended as an option prior to surgery when there are radicular signs… and the number of injections should be limited to two...", so the Treatment Protocol for "With Radiculopathy" includes 2 ESI's, and the Codes for Auto Approval includes CPT code 62311 (Epidural steroid injection) 2 times for ICD9 722.x (Intervertebral disc disorders).

 

This effort to translate the evidence into specific auto-authorization protocols is unique, for pre-approval of treatment plans and triage of claims management.  Of course, most cases will not meet this ideal protocol, and that is where the many other listings in the Procedure Summary come into play.

 

In a recent pilot use of these Codes for Auto Approval reduced medical costs by 64%, cut lost days by 69%, minimized treatment delays for injured workers, and drew considerable praise from providers.  (Ohio ODG Pilot, Comp Management, 2005)

 

##########################################

 

Higgins JPT, Green S, editors. Cochrane Handbook for Systematic Reviews of Interventions 4.2.5. In: The Cochrane Library, Issue 3, 2005. Chichester, UK: John Wiley & Sons, Ltd. September 2006.

 

6. ASSESSMENT OF STUDY QUALITY

6.0 Quality assessment of studies

Quality assessment of individual studies that are summarized in systematic reviews is necessary to limit bias in conducting the systematic review, gain insight into potential comparisons, and guide interpretation of findings. Factors that warrant assessment are those related to applicability of findings, validity of individual studies, and certain design characteristics that affect interpretation of results. Applicability, which is also called external validity or generalize-ability by some, is related to the definition of the key components of well-formulated questions outlined in section 4. Specifically, whether a review's findings are applicable to a particular population, intervention strategy or outcome is dependent upon the studies selected for review, and on how the people, interventions and outcomes of interest were defined by these studies and the authors (reviewers).

6.1 Validity

In the context of a systematic review, the validity of a study is the extent to which its design and conduct are likely to prevent systematic errors, or bias. An important issue that should not be confused with validity is precision. Precision is a measure of the likelihood of chance effects leading to random errors. It is reflected in the confidence interval around the estimate of effect from each study and the weight given to the results of each study when an overall estimate of effect or weighted average is derived. More precise results are given more weight.

6.2 Sources of bias in trials of healthcare interventions

There are four sources of systematic bias in trials of the effects of healthcare: selection bias, performance bias, attrition bias and detection bias.

6.3 Selection bias

Participants and those who recruit should remain unaware of next assignment in sequence. Empirical research has shown that lack of allocation concealment is associated with bias. For that reason trials should use approaches such as allocation by a central office unaware of subject characteristics, pre-numbered or coded identical containers which are administered serially to participants, or an on-site computer system combined with allocations kept in an unreadable file that can be accessed only after the characteristics of enrolled participants have been entered.

6.4 Performance bias

This refers to systematic differences in the care provided to the participants in the comparison groups other than the intervention under investigation. To protect against unintended differences in care and placebo effects, those providing and receiving care can be "blinded" so that they did not know the group to which the recipients of care have been allocated.

6.5 Attrition bias

This refers to systematic differences between comparison groups in the loss of participants from the study. The study should consider how losses of participants (withdrawals, dropouts and protocol deviations) are handled.

6.6 Detection bias

This refers to systematic differences between the comparison groups in outcome assessment.

 

Rating: 1a