Wellcome Unit for the History of Medicine |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Comparing like with like:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| References: |
|
|
1.
|
Kaptchuk TJ. Intentional ignorance: a history of blind assessment and placebo controls in medicine. Bulletin of the History of Medicine 1998;72:389-433. |
|
2.
|
Trhler U.To improve the Evidence of Medicine: the 18th century British origins of a critical approach. Edinburgh: Royal College of Physicians, 2000. |
|
3.
|
Chalmers I. Comparing like with like: some historical milestones in the evolution of methods to create unbiased comparison groups in therapeutic experiments. International Journal of Epidemiology 2001;30:1170-78. |
|
4.
|
Milne I, Chalmers I. Tackling bias in assessing the effects of health care interventions: early contributions from James Lind, Alexander Lesassier Hamilton and T. Graham Balfour. Proc Roy Soc Coll Phys Edin 2001;31 Suppl 9:46-48. |
|
5.
|
Lind J. A treatise of the scurvy. In three parts. Containing an inquiry into the nature, causes and cure, of that disease. Together with a critical and chronological view of what has been published on the subject. Edinburgh: Printed by Sands, Murray and Cochran for A Kincaid and A Donaldson, 1753. |
|
6.
|
Hamilton AL. Dissertatio Medica Inauguralis De Synocho Castrensi. Edinburgh: J Ballantyne, 1816. |
|
7.
|
Balfour TG. Quoted in West C. Lectures on the Diseases of Infancy and Childhood. London: Longman, Brown, Green and Longmans, 1854, p 600. |
|
8.
|
Fisher RA. The design of experiments. London: Oliver and Boyd, 1935. |
|
9.
|
Armitage P. Quoted in: Chalmers I. Comparing like with like: some historical milestones in the evolution of methods to create unbiased comparison groups in therapeutic experiments. International Journal of Epidemiology 2001;30:1170-78. |
|
10.
|
Amberson JB, McMahon BT, Pinner M. A clinical trial of sanocrysin in pulmonary tuberculosis. The American Review of Tuberculosis 1931;24:401-35. |
|
11.
|
Theobald GW. Effect of calcium and vitamin A and D on incidence of pregnancy toxaemia. Lancet 1937;2:1397-1399. |
|
12.
|
Medical Research Council. Streptomycin treatment of pulmonary tuberculosis: a Medical Research Council investigation. BMJ 1948;2:769-782. |
|
13.
|
D'Arcy Hart P. A change in scientific approach: from alternation to randomised allocation in clinical trials in the 1940s. BMJ 1999; 319: 572-573. |
|
14.
|
Chalmers I. Why transition from alternation to randomisation
in clinical trials was made. BMJ 1999;319:1372. |
In the 1920s R.A. Fisher presented randomization as an essential ingredient of his approach to the design and analysis of experiments, validating significance tests. In its absence the experimenter had to rely on his judgement that the effects of biases could be discounted. Twenty years later, A. Bradford Hill promulgated the random assignment of treatments in clinical trials as the only means of avoiding systematic bias between the characteristics of patients assigned to different treatments. The two approaches were complementary, Fisher appealing to statistical theory, Hill to practical needs. The two men remained on good terms throughout most of their careers.
| Selected bibliography: | |
| Armitage P. The search for optimality in clinical
trials. Int Stat Rev 1985;53:15-24. |
|
| Atkins WRB, Fisher RA. The therapeutic use of
vitamin C. J Roy Army
Med Corps 1943;83:251-252. |
|
| Box JF. R.A. Fisher: The Life of a Scientist.
New York: Wiley, 1978. |
|
| Doll R, Hill AB. Smoking and carcinoma of the
lung. BMJ 1950;2:739-748. |
|
| Fisher RA. The arrangement of field experiments.
J Ministr Agric Gr Brit 1926;33:503-513. |
|
| Fisher RA.The Design of Experiments. Edinburgh:
Oliver and Boyd, 1935. |
|
| Fisher RA. Development of the theory of experimental
design. Proceedings of the International Statistical Conferences
1947;3:434-439. |
|
| Fisher RA, Bartlett S. Pasteurised and raw milk.
Nature 1931;127:591-592. |
|
|
Hill AB. Principles of Medical Statistics. London: Lancet,
1937. |
|
|
Hill AB.í¹,í° The clinical trial. New Eng J Med 1952;247:113-119. |
|
| Hill AB. Memories of the British streptomycin
trial in tuberculosis: the first randomized clinical trial.
Contr Clin Trials 1990;11:77-79. |
|
| Marks HM. The Progress of Experiment:
Science and Therapeutic Reform in the United States, 1900-1990.
Cambridge, England: Cambridge University Press, 1997. |
Fishers book on statistical methods for research workers came to my notice when a medical student and led to my appreciation of the chi-share test and consequently to my first medical publication,1 in which I showed that the observations that one of my teachers had cited as vindication of a new treatment could easily have arisen by chance (p= 0.6). Fishers book did not, however, lead me to use randomisation rather than alternation in the first two trials I undertook (or sought to undertake) after qualification (in 1937). A few years later Bradford Hills example did, when I came to test the value of many treatments advocated for patients with gastric ulcers. From him, also, I learnt (i) the central importance of ethical considerations in the conduct of trials,2 as exemplified in the first large randomised clinical trial (of streptomycin for pulmonary tuberculosis); and (ii) the logical approach, adapted for each case, that could lead to the determination of whether an epidemiological association should be taken to imply causation.3
References:
|
1.
|
Doll R. Medical statistics. St Thomass Hospital Gazette 1936:294-297. |
|
2.
|
Hill AB. Medical ethics and controlled trials. BMJ 1963;1:1043. |
|
3.
|
Hill AB.
The environment and disease: association or causation?
Proc Roy Soc Med 1965;58:295-300. |
Two epistemological claims underwrite the randomized clinical trial. The first, associated with A. Bradford Hill, asserts that randomization prevents biased estimates of the value of new therapies. The second, associated with R.A. Fisher, maintains that randomization is necessary for the valid interpretation of statistical significance.
This paper places Fishers views on randomization in the larger context of his views on statistical inference and the nature of science. Fishers life-long interest in the problem of induction, how to draw valid empirical conclusions about the world, led him to emphasize the provisional character of knowledge. The statistical methods he developed - randomization, likelihood - were aimed at producing what Fisher termed rigorously specified uncertainty. Fishers highly technical arguments about the nature of probability and likelihood are rooted in his more general concerns about the evolutionary and political importance of intellectual autonomy - concerns rooted in his early eugenic views but strongly reinforced by his ideological critique of socialist science during the Cold War.
Notwithstanding the esoteric character of some of his arguments, Fishers ideas have great relevance for contemporary debates within evidence-based medicine. How can we best distinguish what is known from what is not known? How can we emphasize the uncertainty of existing knowledge; and how can we best capture that uncertainty? All these Fisherian concerns are crucial in a time where the passage from the journal article to the press conference is increasingly swift.
Selected bibliography:
Fisher Box J. R.A. Fisher. The Life of A Scientist. New York: John Wiley, 1978.
Fisher RA. Collected Papers, ed. JH Bennett. University of Adelaide, 1971.
Fisher RA. Statistical Inference and Analysis.Selected Correspondence of R.A. Fisher, ed. JH Bennett. Oxford: Oxford University Press, 1990.
Jones G. Science, Politics and the Cold War. London: Routledge, 1988.
Marks HM. The Progress of Experiment. Science and Therapeutic Reform in the United States, 1900-1990. Cambridge: Cambridge University Press, 1997, 2000.
Mazumdar P. Eugenics, Human Genetics and Human Failings. The Eugenics Society, Its Sources and Critics in Britain. London: Routledge, 1992.
Perkin H. The Rise of Professional Society. England Since 1880. London: Routledge, 1990.
Savage LJ. On Rereading R.A. Fisher.Annals of Statistics 1976;4:441-445.
Inert palliatives, called
placebos, were traditionally used as psychological props in clinical
practice in the centuries before their widespread adoption in clinical
trials as sham treatments designed to reduce observer bias. Currently,
the ethics of dummy controls are under scrutiny, as is the clinical
relevance of therapeutic efficacy defined merely as the group difference
in outcomes between a treatment being tested and dummy. At the same
time, the so-called placebo response is suspected of
being a dustbin category for psychosocial variables neglected by
biomedicine, or even an artefact of research that forgets the natural
history of disease and regression to the mean.
Historical perspectives
cannot solve our placebo problems, but can help in their analysis.
In this paper I give an overview of important early experiments
involving masking with dummy treatments, drawing particularly on
the work of Ted Kaptchuk, and my own systematic review of early
prospective trials of homeopathy. I make four main points, and one
recommendation:
The emergence of dummy control treatments was not a clear-cut and one-sided application of scientific scrutiny to unorthodox treatments. Experiment and clinical practice using dummies within mesmerism and homeopathy appears to have given the lead, while external evaluations were often more rhetorical than scientific.
Dummy controls penetrated allopathic and biomedical self-evaluation much less quickly than homeopathy and mesmerism, perhaps because they threatened to expose the mistaken nature of much institutionalised therapeutic practice.
Many of the most decisive early clinical trials did not use dummy controls, but involved pragmatic comparisons of competing treatments or untreated controls.
Important aspects of the placebo effect, including therapist empathy and expectations, milieu, and negative responses to dummy treatments (nocebos) were identified at a very early date, but have not been the subject of research until recently.
A refinement of technical vocabulary is needed, since the single term placebo no longer seems adequate to contain (1) paternalistic patient management; (2) dummy controls; and (3) context effects.
Selected bibliography (* = Historical reviews containing references to primary sources mentioned in the paper):
Dean ME. A homeopathic origin for placebo controls: An invaluable gift of God. Alternative Therapies in Health and Medicine 2000;6:5866.*
Dean ME. The trials of homeopathy: a critical-historical account of the origins, structure and development of Hahnemann's scientific therapeutics, and two systematic reviews of homeopathic clinical trials, 18211953 and 19401998 (PhD thesis). University of York, 2001.*
de Craen AJM, Kaptchuk TJ, Tijssen JGP, Kleijnen J. Placebos and placebo effects in medicine: historical overview. J Roy Soc Med 1999;92:511515.*
Di Blasi Z, Harkness E, Ernst E, Georgiou A, Kleijnen J. Influence of context effects on health outcomes: a systematic review. Lancet 2001;357:757-62.
Donner F. Zwlf Vorlesungen ber Homopathie. Berlin:Haug, 1948.*
Ellenberger HF. The discovery of the unconscious. The history and evolution of dynamic psychiatry. London: Allen Lane, 1970.*
Glaser EM. Volunteers, controls, placebos, and questionaries in clinical trials. In: Witts L J, ed. Medical surveys and clinical trials: some methods and applications of group research in medicine. London: Oxford University Press, 1959:105117.
G tzsche PC. The logic of the placebo concept. Is there any? Lancet 1994;344:925927.
Haas H, Fink H, Hrtfelder G. Das Placeboproblem. Fortschritte der Arzneimittelforschung / Progress in Drug Research. 1959;1:279454.*
Hahn RA. The nocebo phenomenon: concept, evidence and implications for public health. Preventive Medicine 1997;26:607611.
Harley D. Rhetoric and the social construction of sickness and healing. Soc Hist Med 1999;12:407435.
Hrobjartsson A, G tzsche PC. Is the placebo powerless? An analysis of clinical trials comparing placebo with no treatment. New England Journal of Medicine 2001;344:1594-602.
Jobst K. A matter of mind or matter? Network 1997;64:68.
Kaptchuk TJ. Intentional ignorance: a history of blind assessment and placebo controls in medicine. Bulletin of the History of Medicine 1998;72:389-433.*
Kaptchuk TJ. Powerful placebo: the dark side of the randomised controlled trial. Lancet 1998;351:17221725.*
Kaptchuk TJ, Dean ME. Debate over the history of placebos in medicine. Alternative Therapies in Health and Medicine 2000;6:1820.*
Kienle GS, Kiene H. The placebo effect: a scientific critique. Comp Ther Med 1998;6:1424.
Reubi FC. Armand Trousseau (1801-1867) et leffet placebo. Schweiz Med Woch 1986;116:27-28.*
Shelley JH, Baur MP. Paul Martini: the first clinical pharmacologist? Lancet 1999; 353:18701873.*
Suttie I. The origins of love and hate. Harmondsworth: Penguin, 1952 [1936].
Walach H, Maidhof C. Is the placebo effect dependent on time? A meta-analysis. In: Kirsch I., ed. How expectancies shape experience. Washington, DC, American Psychological Association, 1999:321332.
The history of research on side effects of drugs is not older than about half a century. It parallels the history of modern chronic disease epidemiology. Several ideas in side effect research have been influenced by concepts from chronic disease epidemiology, and vice versa. The historical record can improve our insight into the essential conditions that make side effect research by observational means possible, and in this way it can also enlighten a broader discussion about the relative place of observational vs. experimental research in clinical medicine. A key question is: Under what conditions in observational studies of side effects it is possible to assume that treatment allocation was unbiased (that is, approaching randomisation) in respect of particular prognostic factors? In this presentation, I will explore the origin and development of these ideas in theoretical papers on side effect research.
Bibliography of key references to the older literature:
Finney DJ. The design and logic of a monitor of drug use. J Chron Dis 1965;18:77-98. [an update was published in 1971, and reprinted with postscript as: Finney DJ. Statistical logic in the monitoring of reactions to therapeutic drugs. In: Inman WHW, Ed. Monitoring drug safety. Philadelphia, Lippincot: 1980]
Jick H, Vessey MP. Case-control studies in the evaluation of drug induced illness. Am J Epidemiol 1978;107:1-7.
Jick H, Walker AM, Spriet-Pourra CI. Postmarketing follow-up. JAMA 1979;242:2310-4.
Skegg DCG. Medical record linkage. In: Inman WHW, Ed. Monitoring drug safety. Philadephia, Lippincot: 1980:337-48.
Many excellent notions or experiments are, by sober and modest men, suppressed
Robert Boyle, 1661
Publication bias is a tendency to prepare, submit and publish research findings, based on the nature and direction of the results. If the literature is more likely to contain reports showing positive results than negative or null findings, estimates of the effects of treatments, for example, are likely to be biased. Although publication bias has been recognized for centuries,1 only recently have the size of the problem and its sources been the subject of research. There is now a large body of evidence confirming the existence of publication bias.
Early cross sectional studies by Sterling2 and Smart3 showed that the majority of published articles in psychology and the social sciences had statistically significant findings. Others, including those examining medical journals, have confirmed these findings.4, 5, 6 Surveys of peer reviewers and investigator authors have also shown that studies with positive results are more likely to be submitted or recommended for publication.7, 8, 9, 10, 11 Experimental studies, in which test manuscripts with positive and negative findings were submitted to journals or referees, have shown that positive results are associated with acceptance.12, 13, 14
The most direct evidence of publication bias in the medical field comes from six follow-up studies of the research projects identified at the time of funding or ethical approval.15, 16, 17, 18, 19 These found that positive findings was the main factor associated with subsequent publication.20 Investigators stated that their main reason for not reporting projects was that they had never written them up or submitted a manuscript for publication because they were not interested. Editorial rejection by journals was a rare cause of failure to publish.
There is mixed evidence about whether full publication of studies originally published in abstract form is associated with positive findings, depending on how positive findings is defined.21 These and other investigators have examined the time taken to publish research findings and have found mixed results: in some cases negative findings take longer to reach publication22 and in others they do not.23, 24
Despite the evidence identifying investigators as the main cause of publication bias, they continue to claim that editorial bias is the main reason that negative or null results are not published, and that this is why they do not submit negative findings. A recent study examining editorial review of reports of controlled trials at JAMA found little evidence of editorial bias in that journal.25
If trials were registered at inception, reviewers and others would at least know when relevant studies had been done, and could pursue the responsible investigators to obtain study results. Under-reporting of research is unethical and constitutes scientific misconduct.26 Unless this problem is addressed, estimates of the effects of treatments based on published evidence may be biased.
References:
1. Ferriar J. Medical
histories and reflexions. Vol 1. Cadell and Davies, London 1792.
2. Sterling TD. Publication decisions and their possible effects on inferences
drawn from tests of significance - or vice versa. Journal of the
American Statistical Association 1959;54:30-4./p>
3. Smart RG. The importance of negative results in psychological research. Canadian Psychologist 1964;5:225-32
4. Sterling TD, Rosenbaum WL, Weinkam JJ. Publication decisions revisited: The effect of the outcome of statistical tests on the decision to publish. Am Statist 1995;49:108-112.
5. Moscati R, Jehle D, Ellis D, Fiorello A, Landi M. Positive-outcome bias: comparison of emergency medicine and general medicine literatures. Acad Emerg Med 1994;1:267-71.
6. Vickers A, Goyal N, Harland R, Rees R. Do certain countries produce only positive results? A systematic review of controlled trials. Control Clin Trials 1998;19:159-66.
7. Consequences of prejudice against the null hypothesis. Psychol Bull 1975;82:1-20.
8. Coursol A, Wagner EE. Effect of positive findings on submission and acceptance rates: A note on meta-analysis bias. Professional Psychology: Research and Practice 1986;17:136-7.
9. Shadish WR, Doherty M, Montgomery LM. How many studies are in the file drawer? An estimate from the family/marital psychotherapy literature. Clin Psychol Rev 1989;9:589-603.
10. Sommer B. The file drawer effect and publication rates in menstrual cycle research. Psychology of Women Quarterly 1987;11:233-42.
11. Dickersin K, Chan S, Chalmers TC, Sacks HS, Smith H Jr. Publication bias and clinical trials. Control Clin Trials 1987;8:343-53.
12. Mahoney MJ. Publication prejudices: An experimental study of confirmatory bias in the peer review system. Cognitive Therapy and Research 1977;1:161-75.
13. Peters D, Ceci S. Peer review practice of psychologic journals: The fate of published articles submitted again. Behav Brain Sci 1982;5:187-95.
14. Epstein WM. Confirmational response bias among social work journals. Science, Technology, & Human Values 1990;15:9-37.
15. Easterbrook PJ, Berlin JA, Gopalan R, Matthews DR. Publication bias in clinical research. Lancet 1991;337:867-72.
16. Dickersin K, Min YI, Meinert CL. Factors influencing publication of research results. Follow-up of applications submitted to two institutional review boards. JAMA 1992;267:374-8.
17. Dickersin K, Min YI. NIH clinical trials and publication bias. Online J Curr Clin Trials 1993; Doc No 50:[4967 words; 53 paragraphs].
18. Stern JM, Simes RJ. Publication bias: evidence of delayed publication in a cohort study of clinical research projects. BMJ 1997;315:640-5.
19. Ioannidis JP. Effect of the statistical significance of results on the time to completion and publication of randomized efficacy trials. JAMA 1998;279:281-6.
20. Dickersin K. How important is publication bias? A synthesis of available data. AIDS Educ Prev 1997;9(1 Suppl):15-21.
21. Scherer RW, Langenberg P. Full publication of results initially presented in abstracts (Cochrane Methodology Review). In: The Cochrane Library, Issue 3, 2002. Oxford: Update Software.
22. Hopewell S, Clarke M, Stewart L, Tierney J. Time to publication for results of clinical trials (Cochrane Methodology Review). In: The Cochrane Library, Issue 3, 2002. Oxford: Update Software.
23. Tierney J, Stewart L. Looking for the evidence: is there bias in the publication of individual patient data meta-analyses? Presented at 5th Cochrane Colloquium. Amsterdam, 1997.
24. Dickersin K, Olson CM, Rennie D, Cook D, Flanagin A, Zhu Q, Reiling J, Pace B. Association between time interval to publication and statistical significance. JAMA. 2002;287:2829-31.
25. Olson CM, Rennie D, Cook D, Dickersin K, Flanagin A, Hogan J, Zhu Q, Reiling J, Pace B. Publication bias in editorial decision making. JAMA. 2002;287:2825-8.
26. Chalmers I. Under-reporting research is scientific misconduct. JAMA 1990;263:1405-1408.
Controlling bias in studies designed to investigate the efficacy, effectiveness and safety of therapeutic measures plays a critical role in the design and assessment of formal clinical studies. For organisations such as the Committee on Safety of Medicines and the National Institute for Clinical Excellence, with broad responsibilities to promote public health, controlling for these sources of scientific bias are pivotal. Nevertheless, such bodies also have to exercise judgement; and controlling for judgemental bias is equally important if their decisions are to yield appropriate health gains and retain broad professional and public support.
The necessary judgements that have to be made by public bodies fall into two types. Some forms of judgement are essentially scientific and rely on a skilful evaluation of the evidence. They apply particularly to circumstances where extrapolation, beyond the studies immediate findings, has to be made. Examples include the relevance of intermediate endpoints (or surrogate markers) to predict longer-term outcomes; or judgements about the generalisability of data, from the often relatively artificial environments of randomised controlled trials, to a heterogeneous population; and judgements about assumption made in constructing models.
The second form of judgement often involves balancing conflicting issues, where a decision must be based on intuition rather than analysis. For the Committee on Safety of Medicines it is the interplay between efficacy and safety; whilst for National Institute for Clinical Excellence it is primarily the tension between clinical benefit and cost.
In exercising such judgements on behalf of the public, national organisations must both act, and be seen to act, in the best interests of patients and the public as a whole. In particular, they must avoid inappropriate influences (sometimes known as 'capture') from any of a wide group of stakeholders. The stakeholder group most frequently cited as protagonists in these circumstances is the pharmaceutical industry. But there are others including politicians, patients (individually or collectively), the clinical professions, and the health service itself. Stakeholders' agendas are often similar, and they may form strategic alliances.
Modern capture theory 1 offers important insights into the judgemental biases associated with evaluating risk-benefit and cost-effectiveness by national bodies. It provides a coherent basis for controlling and avoiding them, and needs to be more widely understood by both "regulators" and the "regulated".
Reference:
1. Hancher H. Regulating for Competition: Government, Law, and
Return to Events Index
Visit the BMJ editorial page http://bmj.com/cgi/reprint/325/7377/1372