Header and Logo

Welcome Guest
Logo
Glossary



Absolute risk reduction
See risk difference.
Allocation concealment
See concealment of allocation.
Applicability
(synonyms: external validity, generalisability, relevance, transferability)
The degree to which the results of an observation, study or review hold true in other settings.
Attrition bias
Systematic differences between comparison groups in withdrawals or exclusions of participants from the results of a study. For example, patients may drop out of a study because of side effects of the intervention. Excluding these patients from the analysis could result in an overestimate of the effectiveness of the intervention.
Bias
A systematic error or deviation in results or inferences. In studies of the effects of healthcare, bias can arise from systematic differences in the groups that are compared (selection bias), the care that is provided, or exposure to other factors apart from the intervention of interest (performance bias), withdrawals or exclusions of people entered into the study (attrition bias) or how outcomes are assessed (detection bias). Bias does not necessarily imply a prejudice, such as the investigators' desire for particular results. This differs from conventional use of the word in which bias refers to a partisan point of view. See also methodological quality, validity.
Blinding
(synonym: masking) Keeping secret group assignment (e.g. to treatment or control) from the study participants or investigators. Blinding is used to protect against the possibility that knowledge of assignment may affect patient response to treatment, provider behaviours (performance bias) or outcome assessment (detection bias). Blinding is not always practical (e.g. when comparing surgery to drug treatment). The importance of blinding depends on how objective the outcome measure is; blinding is more important for less objective outcome measures such as pain or quality of life. See also single blind, double blind and triple blind.
Case series
An uncontrolled observational study involving an intervention and outcome for more than one person.
Case study
(synonyms: anecdote, case history, single case report) An uncontrolled observational study involving an intervention and outcome for a single person (or other unit).
CCTR
See Cochrane Controlled Trials Register.
CDSR
See Cochrane Database of Systematic Reviews.
CENTRAL
The Cochrane Collaboration's register of studies which may be relevant for inclusion in Cochrane Reviews. Its development is guided by the CENTRAL/CCTR Advisory Group. See also Cochrane Controlled Trials Register.
Chi-square test
Any statistical test based on comparison of a test statistic to a chi-square distribution.
CI
See Confidence interval
CINAHL
(Cumulative Index of Nursing and Allied Health Literature) Electronic database covering the major journals in nursing and allied health. Years of coverage: 1983 - present.
Clinical guideline
A systematically developed statement for practitioners and patients about appropriate health care for specific clinical circumstances. (e.g. Scottish Intercollegiate or SIGN guidelines, Royal College of Physicians guidelines etc.)
Clinical trial
(synonyms: therapeutic trial, intervention study) A trial that tests out a drug or other intervention to assess its effectiveness and safety. This general term encompasses randomised controlled trials and controlled clinical trials.
Cochrane Collaboration
An international organisation that aims to help people make well informed decisions about health by preparing, maintaining and ensuring the accessibility of systematic reviews of the benefits and risks of healthcare interventions.
Cochrane Controlled Trials Register
(CCTR) A database of references to controlled trials in health care. Cochrane groups and other organisations have been invited to contribute their specialised registers, and these registers, together with references to clinical trials identified on MEDLINE and other sources, form the CENTRAL register of studies. Records from CENTRAL, following quality control to try to ensure that only reports of definite randomised controlled trials or controlled clinical trials are included, make up The Cochrane Controlled Trials Register (CCTR). Trials included in the STEP process are derived from the Cochrane Stroke Group.
Cochrane Database of Systematic Reviews
(CDSR) It brings together all the currently available Cochrane Reviews and is updated quarterly.
Cochrane Review
A Cochrane Review is a systematic, up-to-date summary of reliable evidence of the benefits and risks of healthcare. Cochrane Reviews are intended to help people make practical decisions.
Cointervention
In a randomised controlled trial, the application of additional diagnostic or therapeutic procedures to members of either or both the experimental and the control groups.
Concealment of allocation
The process used to prevent foreknowledge of group assignment in a randomised controlled trial, which should be seen as distinct from blinding. The allocation process should be impervious to any influence by the individual making the allocation by having the randomisation process administered by someone who is not responsible for recruiting participants; for example, a hospital pharmacy, or a central office. Using methods of assignment such as date of birth and case record numbers (see quasi random allocation) are open to manipulation. Adequate methods of allocation concealment include: centralized randomisation schemes; randomisation schemes controlled by a pharmacy; numbered or coded containers in which capsules from identical-looking, numbered bottles are administered sequentially; on-site computer systems, where allocations are in a locked unreadable file; and sequentially numbered opaque, sealed envelopes.
Confidence interval
(CI) The range within which the "true" value (e.g. size of effect of an intervention) is expected to lie with a given degree of certainty (e.g. 95% or 99%). Note: Confidence intervals represent the probability of random errors, but not systematic errors (bias).
Conflict of interest declaration
[or Competing interests declaration] A statement by a contributor to a report or review of personal financial or other interests that could have influenced the findings or their interpretation.
Confounding
A situation in which a measure of the effect of an intervention or exposure is distorted because of the association of exposure with other factor(s) that influence the outcome under investigation.
Consumer
(healthcare consumer) Someone who uses, is affected by, or who is entitled or compelled to use a health related service.
Contamination
In clinical trials, the unintentional application of the intervention being evaluated to people in the control group or inadvertent failure to apply the intervention to people assigned to the intervention group.
Context
The conditions and circumstances that are relevant to the application of an intervention, for example the setting [in hospital, at home, in the air], the time [working day, holiday, night-time], type of practice [primary, secondary, tertiary care; private practice, insurance practice, charity], whether routine or emergency.
Control
In clinical trials comparing two or more interventions, a control is a person in the comparison group that receives a placebo, no intervention, usual care or another form of care.
Controlled clinical trial
Refers to a study that compares one or more intervention groups to one or more comparison (control) groups. Whilst not all controlled studies are randomised, all randomised trials are controlled.
Critical appraisal
The process of assessing and interpreting evidence by systematically considering its validity, results and relevance.
Cross-over trial
A type of clinical trial comparing two or more interventions in which the participants, upon completion of the course of one treatment are switched to another. For example, for a comparison of treatments A and B, half the participants are randomly allocated to receive them in the order A, B and half to receive them in the order B, A. A problem with this design is that the effects of the first treatment may carry over into the period when the second is given.
DARE
See Database of Abstracts of Reviews of Effectiveness.
Database
A collection of organised information, usually held on a computer. In some ways a database is similar to a filing system, but with important advantages: the information can be revised and kept up to date easily, and the computer can retrieve information from it very quickly. Electronic databases such as MEDLINE, EMBASE and the CDSR can be distributed on disk, CD-ROM or via the Internet.
Database of Abstracts of Reviews of Effectiveness
(DARE) A collection of structured abstracts and bibliographic references of systematic reviews of the effects of healthcare.
Detection bias
(synonym: ascertainment bias) Systematic differences between comparison groups in how outcomes are ascertained, diagnosed or verified.
Dichotomous data
(synonym: binary data) Observations with two possible categories such as dead/alive, smoker/non-smoker, present/not present.
Double blind
(synonym: double masked) Neither the participants in a trial nor the investigators (outcome assessors) are aware of which intervention the participants are given. The purpose of blinding the participants (recipients and providers of care) is to prevent performance bias. The purpose of blinding the investigators (outcome assessors, who might also be the care providers) is to protect against detection bias. See also blinding, single blind, triple blind, concealment of allocation.
Effect size
1. A generic term for the estimate of effect for a study. 2. A dimensionless measure of effect that is typically used for continuous data when different scales (e.g. for measuring pain) are used to measure an outcome and is usually defined as the difference in means between the intervention and control groups divided by the standard deviation of the control or both groups. See standardised mean difference.
Effectiveness
The extent to which a specific intervention, when used under ordinary circumstances, does what it is intended to do.
Efficacy
The extent to which an intervention produces a beneficial result under ideal conditions. Clinical trials that assess efficacy are sometimes called explanatory trials and are restricted to participants who fully co-operate.
EMBASE
(Excerpta Medica database) A European-based electronic database of pharmacological and biomedical literature covering 3,500 journals from 110 countries. Years of coverage - 1974 to present.
Empirical
Empirical results are based on experience (or observation) rather than on reasoning alone.
Epidemiology
The study of the distribution and determinants of health-related states or events in specified populations.
Estimate of effect
(synonym: treatment effect) In studies of the effects of healthcare, the observed relationship between an intervention and an outcome expressed as, for example, a number needed to treat, odds ratio, risk difference, relative risk, standardised mean difference, or weighted mean difference.
Event rate
The proportion of participants in a group in whom an event is observed. Thus, if out of 100 patients the event (e.g. a stroke) is observed in 32, the event rate is 0.32.
External peer reviewer
A person with relevant content, methodological or user expertise who critically examines reviews in her/his area of expertise.
External validity
(synonyms: external validity, generalisability, relevance, transferability) The degree to which the results of an observation hold true in other settings. See also validity .
Factorial design
Most trials only consider a single factor, where an intervention is compared with one or more alternatives, or a placebo. In a trial using a 2x2 factorial design, participants are allocated to one of four possible combinations. For example in a 2x2 factorial, RCT of nicotine replacement and counselling, participants would be allocated to: nicotine replacement alone, counselling alone, both, or neither. In this way it is possible to test the independent effect of each intervention on smoking cessation and the combined effect of (interaction between) the two interventions.
Follow-up
The ascertainment of outcomes of an intervention at one or more stated times after the intervention has ended.
Generalisability
(synonyms: applicability, external validity, relevance, transferability) Generalisability is the degree to which the results of a study or systematic review can be extrapolated to other circumstances, in particular to routine health care situations.
Gold standard
The method, procedure or measurement that is widely accepted as being the best available against which new interventions should be compared.
Handsearching
Handsearching refers to the planned searching of a journal page by page (i.e. by hand), including editorials, letters, etc., to identify all reports of randomised controlled trials and controlled clinical trials.
Heterogeneity
In systematic reviews heterogeneity refers to variability or differences between studies in the estimates of effects. A distinction is sometimes made between "statistical heterogeneity" (differences in the reported effects), "methodological heterogeneity" (differences in study design) and "clinical heterogeneity" (differences between studies in key characteristics of the participants, interventions or outcome measures). Statistical tests of heterogeneity are used to assess whether the observed variability in study results (effect sizes) is greater than that expected to occur by chance. However, these tests have low statistical power. See also homogeneity.
Historical control
Person or group for whom data were collected earlier than for the group being studied. Because of changes over time in risks, prognosis, healthcare, etc. there is a large risk of bias (in studies that use historical controls) due to systematic differences between the comparison groups.
Homogeneity
In systematic reviews homogeneity refers to the degree to which the results of studies included in a review are similar. "Clinical homogeneity" means that, in trials included in a review, the participants, interventions and outcome measures are similar or comparable. Studies are considered "statistically homogeneous" if their results vary no more than might be expected by the play of chance. See heterogeneity.
Incidence
The number of new cases of a disease or event in a population during a specific period of time.
Index Medicus Catalogue of the United States National Library of Medicine
(NLM), and a periodical index to the medical literature. Available in printed form, or electronically as MEDLINE.
Intention-to-treat
An intention-to-treat analysis is one in which all the participants in a trial are analysed according to the intervention to which they were allocated, whether they received it or not. Intention-to-treat analyses are favoured in assessments of effectiveness as they mirror the noncompliance and treatment changes that are likely to occur when the intervention is used in practice, and because of the risk of attrition bias when participants are excluded from the analysis.
Internal validity
See validity.
Internet
Network of millions of computers worldwide. Computers on the Internet use compatible communication standards and share the ability to contact each other and share data. Users of the Internet communicate via electronic mail (e-mail), via Telnet (a process which allows a person to log in to a remote host), and via FTP. See also World Wide Web.
Inter-rater reliability
The degree of stability exhibited when a measurement is repeated under identical conditions by different raters. Reliability refers to the degree to which the results obtained by a measurement procedure can be replicated. Lack of inter-rater reliability may arise from divergences between observers or instability of the attribute being measured. See also Intra-rater reliability.
Intervention study
See Clinical trial.
Intra-rater reliability
The degree of stability exhibited when a measurement is repeated under identical conditions by the same rater. Reliability refers to the degree to which the results obtained by a measurement procedure can be replicated. Lack of intra-rater reliability may arise from divergences between instruments of measurement or instability of the attribute being measured. See also Inter-rater reliability.
Masking
See blinding.
Mean
(synonyms: arithmetic mean, average) The average value, calculated by adding all the observations and dividing by the number of observations.
MEDLINE
(MEDlars onLINE) An electronic database produced by the United States National Library of Medicine. It indexes millions of articles in selected (about 3,700) journals. It is available through most medical libraries, and can be accessed on CD-ROM, the Internet and by other means. Years of coverage - 1966 to present.
MeSH headings
(Medical Subject Headings) Terms used by the United States National Library of Medicine to index articles in Index Medicus and MEDLINE. Designed to reduce problems that arise from, for example, differences in British and American spelling. The MeSH system has a tree structure in which broad subject terms branch into a series of progressively narrower subject terms.
Meta-analysis
The use of statistical techniques in a systematic review to integrate the results of included studies. Sometimes used as a synonym for systematic reviews, where the review includes meta-analysis.
Meta-regression
Multivariate meta-analytic techniques, such as logistic regression, used to explore the relationship between study characteristics (e.g. allocation concealment, baseline risk, timing of the intervention) and study results (the magnitude of effect observed in each study) in a systematic review.
Methodological quality
(synonyms: validity, internal validity) The extent to which the design and conduct of a study are likely to have prevented systematic errors (bias). Variation in quality can explain variation in the results of studies included in a systematic review. More rigorously designed (better 'quality') trials are more likely to yield results that are closer to the 'truth'. See also external validity, validity.
Minimisation
A method of allocation used to provide comparison groups that are closely similar for several variables. It can be done with or without a component of randomisation.
Negative study
A term often used to refer to a study that does not have "statistically significant" (positive) results indicating a beneficial effect of the intervention being studied. The term can generate confusion because it refers to both statistical significance and the direction of effect. Studies often have multiple outcomes, the criteria for classifying studies as "negative" are not always clear and, in the case of studies of risk or undesirable effects, "negative" studies are ones that do not show a harmful effect.
Null hypothesis
The statistical hypothesis that one variable (e.g. whether or not a study participant was allocated to receive an intervention) has no association with another variable or set of variables (e.g. whether or not a study participant died), or that two or more population distributions do not differ from one another. In simplest terms, the null hypothesis states that the results observed in a study are no different from what might have occurred as a result of the play of chance.
Number needed to treat
(NNT) The number of patients who need to be treated to prevent one bad outcome. It is the inverse of the risk difference.
Odds ratio
(OR) The ratio of the odds of an event in the experimental (intervention) group to the odds of an event in the control group. Odds are the ratio of the number of people in a group with an event to the number without an event. Thus, if a group of 100 people had an event rate of 0.20, 20 people had the event and 80 did not, and the odds would be 20/80 or 0.25. An odds ratio of one indicates no difference between comparison groups. For undesirable outcomes an OR that is less than one indicates that the intervention was effective in reducing the risk of that outcome. When the event rate is small, odds ratios are very similar to relative risks.
Open label design
A study design in which the investigator is aware which intervention is being given to which participant (i.e. not double blind). Some studies with an open label design are randomised trials, but some do not include a comparison group and, therefore, cannot be randomised. See also open clinical trial.
Ordinal data
Data that are classified into more than two categories where there is a natural order to the categories; for example, non-smokers, ex-smokers, light smokers and heavy smokers. Ordinal data are often reduced to two categories to simplify analysis and presentation, which may result in a considerable loss of information.
Outcomes
Components of patients' clinical and functional status after an intervention has been applied.
Paired design
A study in which participants or groups of participants are matched (e.g. based on prognostic factors) and one member of each pair is allocated to the experimental (intervention) group and the other to the control group.
Parallel group trial
(synonym: independent group design) A trial that compares two groups of people, one of which receives the intervention of interest and one of which is a control group. Some parallel trials have more than two comparison groups and some compare different interventions without including a non-intervention control group.
Peer review
A refereeing process whose aim is to check the quality and importance of reports of research. An article submitted for publication in a peer reviewed journal is reviewed by other experts in the area. See also external peer reviewer (of a Cochrane Review).
Performance bias
Systematic differences in care provided apart from the intervention being evaluated. For example, if patients know they are in the control group they may be more likely to use other forms of care, patients who know they are in the experimental (intervention) group may experience placebo effects, and care providers may treat patients differently according to what group they are in. Blinding of study participants (both the recipients and providers of care) is used to protect against performance bias.
Phase I studies
The first stage in testing a new drug in humans. Usually performed on healthy volunteers without a comparison group.
Phase II studies
Second stage in testing a new drug in humans. These are sometimes randomised controlled trials.
Phase III studies
Studies that are a full-scale evaluation of treatment. After a drug has been shown to be reasonably effective, it is essential to compare it to the current standard treatments for the same condition. Phase III studies are often randomised controlled trials.
Phase IV studies
Studies that are concerned with post-marketing surveillance. They are often promotional exercises aimed at bringing a new drug to the attention of a large number of clinicians, and may be of limited scientific value.
Placebo
An inactive substance or procedure administered to a patient, usually to compare its effects with those of a real drug or other intervention, but sometimes for the psychological benefit to the patient through a belief that s/he is receiving treatment. Placebos are used in clinical trials to blind people to their treatment allocation. Placebos should be indistinguishable from the active intervention to ensure adequate blinding. Placebo effect A favourable response to an intervention, regardless of whether it is the real thing or a placebo, attributable to the expectation of an effect, i.e. the power of suggestion. The effects of many healthcare interventions are attributable to a combination of both placebo and "active" (non-placebo) effects.
Positive study
A term used to refer to a study with results indicating a beneficial effect of the intervention being studied. The term can generate confusion because it can refer to both statistical significance and the direction of effect, studies often have multiple outcomes, the criteria for classifying studies as negative or positive are not always clear and, in the case of studies of risk or undesirable effects, "positive" studies are ones that show a harmful effect.
Precision
1. A measure of the likelihood of random errors in the results of a study, meta-analysis or measurement. Confidence intervals around the estimate of effect from each study are a measure of precision, and the weight given to the results of each study in a meta-analysis (typically the inverse of the variance of the estimate of effect) is a measure of precision (i.e. the degree to which a study influences the overall estimate of effect in a meta-analysis is determinedby the precision of its estimate of effect). 2. The proportion of relevant citations located using a specific search strategy, i.e. the number of relevant studies meeting the inclusion criteria for a trials register or a review) divided by the total number of citations retrieved.
Prevalence
The number of existing cases of a particular disease or condition in a given population at a designated time.
Prevalence study
See cross-sectional study.
Primary study
(synonyms: included study, original study) "Original research" in which data are first collected. The term primary research is sometimes used to distinguish it from "secondary research" (reanalysis of previously collected data), meta-analysis, and other ways of combining studies (such as economic analysis and decision analysis). However, because systematic reviews can provide answers not possible from individual studies they can also be considered to be primary research.
Probability distribution
The function that gives the probabilities that a variable equals each of a sequence of possible values. Examples include the bionomial, chi square, normal and Poisson distributions.
Proportional hazards model
(synomym: Cox model) A statistical model in survival analysis that asserts that the effect of the study factors (e.g. the intervention of interest) on the hazard rate (the risk of occurrence of an event, such as death, at a point in time) in the study population is multiplicative and does not change over time.
Protocol
The plan or set of steps to be followed in a study. A protocol for a systematic review should describe the rationale for the review; the objectives; and the methods that will be used to locate, select and critically appraise studies, and to collect and analyse data from the included studies.
Publication bias
A bias in the published literature where the publication of research depends on the nature and direction of the study results. Studies in which an intervention is not found to be effective are sometimes not published. Because of this, systematic reviews that fail to include unpublished studies may overestimate the true effect of an intervention.
P-value
The probability (ranging from zero to one) that the results observed in a study (or results more extreme) could have occurred by chance. In a meta-analysis the P-value for the overall effect assesses the overall statistical significance of the difference between the intervention groups, whilst the P-value for the heterogeneity statistic assesses the statistical significance of differences between the effects observed in each study.
Quality
See methodological quality.
Quasi-random allocation
A method of allocating participants to different forms of care that is not truly random; for example, allocation by date of birth, day of the week, medical record number, month of the year, or the order in which participants are included in the study (e.g. alternation).
Quasi-randomised trial
A trial using a quasi-random method of allocating participants to different forms of care. There is a greater risk of selection bias in quasi-random trials where allocation is not adequately concealed compared with randomised controlled trials with adequate allocation concealment.
Random
Governed by chance. See randomisation.
Random allocation
A method that uses the play of chance to assign participants to comparison groups in a trial, e.g. by using a random numbers table or a computer-generated random sequence. Random allocation implies that each individual or unit being entered into a trial has the same chance of receiving each of the possible interventions. It also implies that the probability that an individual will receive a particular intervention is independent of the probability that any other individual will receive the same intervention. See also concealment of allocation, quasi-random allocation, randomisation.
Random error
(synonym: sampling error) Error due to the play of chance. Confidence intervals and P-values represent the probability of random errors, but not systematic errors (bias).
Random permuted blocks
A method of randomisation that ensures that, at any point in a trial, roughly equal numbers of participants have been allocated to all the comparison groups. Permuted blocks are often used in combination with stratified randomisation.
Random selection
(synonym: random sampling) A method of obtaining a representative, unbiased group of people from a larger population. Random selection that is not related to the allocation of participants to comparison groups is frequently used in cross-sectional and cohort studies. It is rarely used in randomised controlled trials. However, in older trial reports, the term is occasionally used instead of random allocation or randomisation.
Randomisation
(spelled randomization in US English) Method used to generate a random allocation sequence, such as using tables of random numbers or computer-generated random sequences. The method of randomisation should be distinguished from concealment of allocation because of the risk of selection bias despite the use of randomisation, if there is not adequate allocation concealment. For instance, a list of random numbers may be used to randomise participants, but if the list is open to the individuals responsible for recruiting and allocating participants, those individuals can influence the allocation process, either knowingly or unknowingly.
Randomisation blinding
See concealment of allocation.
Randomised controlled trial
(RCT) (Synomym: randomised clinical trial) An experiment in which investigators randomly allocate eligible people into intervention groups to receive or not to receive one or more interventions that are being compared. The results are assessed by comparing outcomes in the treatment and control groups.
RCT
See randomised controlled trial.
Register of trials
See trials register.
Relative Risk
(RR) (synonym: risk ratio) The ratio of risk in the intervention group to the risk in the control group. The risk (proportion, probability or rate) is the ratio of people with an event in a group to the total in the group. A relative risk of one indicates no difference between comparison groups. For undesirable outcomes an RR that is less than one indicates that the intervention was effective in reducing the risk of that outcome.
Reliability
Refers to the degree to which results obtained by a measurement procedure can be replicated. Lack of reliability can arise from divergences between observers or measurement instruments, or instability in the attribute being measured.
Retrospective study
A study in which the outcomes have occurred to the participants before the study commenced. Case control studies are always retrospective, cohort studies sometimes are, randomised controlled trials never are. See prospective study.
Review
1. A systematic review. 2. A review article in the medical literature which summarises a number of different studies and may draw conclusions about a particular intervention. Review articles are often not systematic. Review articles are also sometimes called overviews. 3. To referee a paper. See referee, referee process, external peer reviewer.
Reviewer
Somebody responsible for preparing and, in the case of Cochrane Reviews, keeping up-to-date a systematic review. The term “reviewer” is also sometimes used to refer to an external peer reviewer, or referee.
Risk difference
(RD) (synonym: absolute risk reduction) The absolute difference in the event rate between two comparison groups. A risk difference of zero indicates no difference between comparison groups. A RD that is less than zero indicates that the intervention was effective in reducing the risk of that outcome.
Risk factor
An aspect of a person's condition, lifestyle or environment that increases the probability of occurrence of a disease. For example, cigarette smoking is a risk factor for lung cancer.
Run-in period
A period before a trial is commenced when no treatment is given. The data from this stage of a trial are only occasionally of value but can serve a valuable role in screening out ineligible or non-compliant participants, in ensuring that participants are in a stable condition, and in providing baseline observations. A run-in period is sometimes called a washout period if treatments that participants were using before entering the trial are discontinued.
Sampling error
See random error.
Search strategy
1. The methods used by a Collaborative Review Group (CRG) to identify trials within the Group's scope. This includes handsearching relevant journals, searching electronic databases, contacting drug companies, other forms of personal contact and checking reference lists. CRGs must describe their search strategy in detail in the Group's module. Reviewers can refer to the Group's search strategy when preparing a Cochrane Review, and if necessary supplement this with a description of their own additional searches. 2. The methods used by a reviewer to locate relevant studies, including the use of a CRG's trials register. 3. The combination of terms used to identify studies in an electronic database such as MEDLINE.
Selection bias
1. In assessments of the validity of studies of healthcare interventions, selection bias refers to systematic differences between comparison groups in prognosis or responsiveness to treatment. Random allocation with adequate concealment of allocation protects against selection bias. Other means of selecting who receives the intervention of interest, particularly leaving it up to the providers and recipients of care, are more prone to bias because decisions about care can be related to prognosis and responsiveness to treatment. 2. Selection bias is sometimes used to describe a systematic error in reviews due to how studies are selected for inclusion. Publication bias is an example of this type of selection bias. 3. Selection bias, confusingly, is also sometimes used to describe a systematic difference in characteristics between those who are selected for study and those who are not. This affects the generalisability (external validity) of a study but not its (internal) validity.
Sensitivity analysis
An analysis used to determine how sensitive the results of a study or systematic review are to changes in how it was done. Sensitivity analyses are used to assess how robust the results are to uncertain decisions or assumptions about the data and the methods that were used.
Single blind
(synonym: single masked) The investigator is aware of the treatment/intervention the participant is getting, but the participant is unaware. See also blinding, double blind, triple blind.
Specialised register
See Register of trials.
Standardised mean difference
The difference between two means divided by an estimate of the within-group standard deviation. When an outcome (such as pain) is measured in a variety of ways across studies (using different scales) it may not be possible directly to compare or combine study results in a systematic review. By expressing the effects as a standardised value the results can be combined since they have no units. Standardised mean differences are sometimes referred to as a d index.
Statistical power
The probability that the null hypothesis will be rejected if it is indeed false. In studies of the effectiveness of healthcare interventions, power is a measure of the certainty of avoiding a false negative conclusion that an intervention is not effective when in truth it is effective. The power of a study is determined by how large it is (the number of participants), the number of events (e.g. strokes) or the degree of variation in a continuous outcome (such as weight), how small an effect one believes is important (i.e. the smallest difference in outcomes between the intervention and the control groups that is considered to be important), and how certain one wants to be of avoiding a false positive conclusion (i.e. the cut-off that is used for statistical significance).
Statistical significance
An estimate of the probability of an association (effect) as large or larger than what is observed in a study occurring by chance, usually expressed as a P-value. For example, a P-value of 0.049 for a risk difference of 10% means that there is less than a one in 20 (0.05) chance of an association that is as large or larger having occurred by chance and it could be said that the results are "statistically significant" at P = 0.05). The cut-off for statistical significance is usually taken at 0.05, but sometimes at 0.01 or 0.10. These cut-offs are arbitrary and have no specific importance. Although it is often done, it is inappropriate to interpret the results of a study differently according to whether the P-value is, say, 0.055 or 0.045 (which are quite similar values, not diametrically opposed ones).
Stratified randomisation
In any randomised trial it is desirable that the comparison groups should be as similar as possible as regards participant characteristics that might influence the response to the intervention. Stratified randomisation is used to ensure that equal numbers of participants with a characteristic thought to affect prognosis or response to the intervention will be allocated to each comparison group. For example, in a trial of women with breast cancer, it may be important to have similar numbers of pre-menopausal and post-menopausal women in each comparison group. Stratified randomisation could be used to allocate equal numbers of pre- and post-menopausal women to each treatment group. Stratified randomisation is performed either by performing separate randomisation (often using random permuted blocks) for each strata, or by using minimisation.
Study validity
See validity.
Surrogate endpoints
(synonym: intermediary outcomes; surrogate outcomes) Outcome measures that are not of direct practical importance but are believed to reflect outcomes that are important; for example, blood pressure is not directly important to patients but it is often used as an outcome in clinical trials because it is a risk factor for stroke and heart attacks. Surrogate endpoints are often physiological or biochemical markers that can be relatively quickly and easily measured, and that are taken as being predictive of important clinical outcomes. They are often used when observation of clinical outcomes requires long follow-up.
Systematic error
See bias.
Systematic review
(synonym: systematic overview) A review of a clearly formulated question that uses systematic and explicit methods to identify, select and critically appraise relevant research, and to collect and analyse data from the studies that are included in the review. Statistical methods (meta-analysis) may or may not be used to analyse and summarise the results of the included studies. See also Cochrane Review.
t-distribution, t-test
(synonym: Student t-test) The t-distribution is the distribution of a quotient of independent random variables, the numerator of which is a standardised normal random variable and the denominator of which is the positive square root of the quotient of a chi-square distributed random variable and its number of degrees of freedom. The t-test uses the t-distribution to test whether two means differ significantly or to test linear regression or correlation coefficients.
Test of association
See statistical significance.
Therapeutic trial
See clinical trial.
Trend
Used loosely to refer to an association or possible effect that is not statistically significant. A consistent movement across ordered categories, e.g. a change in the effect observed in studies grouped according to, for instance, intensity of treatment.
Triple blind
(synonym: triple masked) An expression that is sometimes used to indicate that knowledge of which study participants are in which comparison group is kept secret from the statistician doing the analysis as well as from the study participants and investigators (outcome assessors). See also blinding, single blind, double blind.
Unit of allocation
The unit that is assigned to the alternative interventions being investigated in a trial. Most commonly, the unit will be an individual person but, in some trials, people will be assigned in groups to one or other of the interventions. This is done to avoid contamination or for convenience and the units might be, for example, hospitals or communities. In other trials, different parts of a person (such as the left or right eye) might be assigned to receive different interventions. See unit of analysis error.
Unit of analysis error
In some studies people are allocated in groups instead of individually (e.g. by practice, by hospital or by community). Often when this is done the unit of allocation is different from the unit of analysis, i.e. people are allocated by groups and analysed as though they had been allocated individually. This is sometimes called a unit of analysis error. Effectively, using individuals as the unit of analysis when groups of people are allocated increases the power of the studies by increasing the degrees of freedom. This can result in overly narrow confidence intervals and false positive conclusions that the intervention had an effect when in truth there is greater uncertainty than what is reflected by the P-value. In the context of a review, it can result in studies having narrower confidence intervals and receiving more weight than is appropriate.
Users of reviews
Patients or healthcare professionals or policy makers using a review to make practical decisions about healthcare, and researchers conducting or considering further research.
Validity
(synonym: internal validity) Validity is the degree to which a result (of a measurement or study) is likely to be true and free of bias (systematic errors). Validity has several other meanings, usually accompanied by a qualifying word or phrase; for example, in the context of measurement, expressions such as "construct validity", "content validity" and "criterion validity" are used. The expression "internal validity" is sometimes used to distinguish validity (the extent to which the observed effects are true for the people in a study) from external validity or generalisability (the extent to which the effects observed in a study truly reflect what can be expected in a target population beyond the people included in the study). See also methodological quality, random error.
Variable
Any quantity that varies. A factor that can have different values.
Variance
A measure of the variation shown by a set of observations, defined by the sum of the squares of deviations from the mean, divided by the number of degrees of freedom in the set of observations.
Washout period
The stage in a cross-over trial when treatment is withdrawn before the second treatment is given. Washout periods are usually necessary because of the possibility that the intervention administered first can affect the outcome variable for some time after treatment ceases. A run-in period before a trial starts is sometimes called a washout period if treatments that participants were using before entering the trial are discontinued.
Weighted mean difference
(in meta-analysis) A method of meta-analysis used to combine measures on continuous scales (such as weight), where the mean, standard deviation and sample size in each group are known. The weight given to each study (e.g. how much influence each study has on the overall results of the meta-analysis) is determined by the precision of its estimate of effect and, in the statistical software in RevMan and CDSR, is equal to the inverse of the variance. This method assumes that all of the trials have measured the outcome on the same scale. See also standardised mean difference.
WMD
See weighted mean difference
World Wide Web
(WWW) A part of the Internet with a graphical interface. "Web pages" or "home pages" are HyperText Markup Language (HTML) documents on the WWW. Hypertext allows users to jump from one place in a document to another, from one document to another, and from one computer on the WWW to another. A connection through a cable or over the telephone and a Web browser (software program), such as Netscape, are needed to access and view WWW documents.