A person can experience an effect on the occurrence of an outcome in a defined follow-up period without experiencing an effect on the risk of that outcome over the same period. Sufficient causes are sometimes used to deepen potential-outcome explanations of this phenomenon. In doing so, care should be taken to avoid tipping the balance between simplification and realism too far toward simplification. Death and other competing risks should not be assumed away. The time scale should be explicit, with specific times for the occurrence of specified component causes and for the completion of each sufficient cause. Component causes that affect risk should occur no later than the start of the risk period. Sufficient causes should be allowed to have component causes in common. When individuals experience all components of two or more sufficient causes, the outcome must be recurrent. In addition to effects on rates and risks, effects on incidence time itself should be considered.
Austin Bradford Hill's landmark 1965 paper contains several important lessons for the current conduct of epidemiology. Unfortunately, it is almost exclusively cited as the source of the "Bradford-Hill criteria" for inferring causation when association is observed, despite Hill's explicit statement that cause-effect decisions cannot be based on a set of rules. Overlooked are Hill's important lessons about how to make decisions based on epidemiologic evidence. He advised epidemiologists to avoid over-emphasizing statistical significance testing, given the observation that systematic error is often greater than random error. His compelling and intuitive examples point out the need to consider costs and benefits when making decisions about health-promoting interventions. These lessons, which offer ways to dramatically increase the contribution of health science to decision making, are as needed today as they were when Hill presented them.
Background:
The WINPEPI (PEPI-for-Windows) computer programs for epidemiologists are designed for use in practice and research in the health field and as learning or teaching aids. They aim to complement other statistics packages. The programs are free, and can be downloaded from the Internet.ImplementationThere are at present four WINPEPI programs: DESCRIBE, for use in descriptive epidemiology, COMPARE2, for use in comparisons of two independent groups or samples, PAIRSetc, for use in comparisons of paired and other matched observations, and WHATIS, a "ready reckoner" utility program. The programs contain 75 modules, each of which provides a number, sometimes a large number, of statistical procedures. The manuals explain the uses, limitations and applicability of specific procedures, and furnish formulae and references.
Conclusions:
WINPEPI provides a wide variety of statistical routines commonly used by epidemiologists, and is a handy resource for many procedures that are not very commonly used or easily found. The programs are in general user-friendly, although some users may be confused by the large numbers of options and results provided. The main limitations are the inability to read data files and the fact that only one of the programs presents graphic results. WINPEPI has a considerable potential as a learning and teaching aid.
Sufficient causes of disease are redundant when an individual acquires the components of two or more sufficient causes. In this circumstance, the individual still would have become diseased even if one of the sufficient causes had not been acquired. In the context of a study, when any individuals acquire components of more than one sufficient cause over the observation period, the etiologic effect of the exposure (defined as the absolute or relative difference between the proportion of the exposed who develop the disease by the end of the study period and the proportion of those individuals who would have developed the disease at the moment they did even in the absence of the exposure) may be underestimated. Even in the absence of confounding and bias, the observed effect estimate represents only a subset of the etiologic effect. This underestimation occurs regardless of the measure of effect used.To some extent, redundancy of sufficient causes is always present, and under some circumstances, it may make a true cause of disease appear to be not causal. This problem is particularly relevant when the researcher's goal is to characterize the universe of sufficient causes of the disease, identify risk factors for targeted interventions, or construct causal diagrams. In this paper, we use the sufficient component cause model and the disease response type framework to show how redundant causation arises and the factors that determine the extent of its impact on epidemiologic effect measures.
This paper describes how to fit an additive Poisson model using standard software. It is illustrated with SAS code, but can be similarly used for other software packages.
This analysis presents a detailed defense of my epidemiologic research in the May 17, 2003 British Medical Journal that found no significant relationship between environmental tobacco smoke (ETS) and tobacco-related mortality. In order to defend the honesty and scientific integrity of my research, I have identified and addressed in a detailed manner several unethical and erroneous attacks on this research. Specifically, I have demonstrated that this research is not "fatally flawed," that I have not made "inappropriate use" of the underlying database, and that my findings agree with other United States results on this relationship. My research suggests, contrary to popular claims, that there is not a causal relationship between ETS and mortality in the U.S. responsible for 50,000 excess annual deaths, but rather there is a weak and inconsistent relationship. The popular claims tend to damage the credibility of epidemiology.In addition, I address the omission of my research from the 2006 Surgeon General's Report on Involuntary Smoking and the inclusion of it in a massive U.S. Department of Justice racketeering lawsuit. I refute erroneous statements made by powerful U.S. epidemiologists and activists about me and my research and I defend the funding used to conduct this research. Finally, I compare many aspect of ETS epidemiology in the U.S. with pseudoscience in the Soviet Union during the period of Trofim Denisovich Lysenko. Overall, this paper is intended to defend legitimate research against illegitimate criticism by those who have attempted to suppress and discredit it because it does not support their ideological and political agendas. Hopefully, this defense will help other scientists defend their legitimate research and combat "Lysenko pseudoscience."
When carefully planned and analysed, the case-cohort design is a powerful choice for follow-up studies with multiple event types of interest. While the literature is rich with analysis methods for case-cohort data, little is written about the designing of a case-cohort study. Our experiences in designing, coordinating and analysing the MORGAM case-cohort study are potentially useful for other studies with similar characteristics. The motivation for using the case-cohort design in the MORGAM genetic study is discussed and issues relevant to its planning and analysis are studied. We propose solutions for appending the earlier case-cohort selection after an extension of the follow-up period and for achieving maximum overlap between earlier designs and the case-cohort design. Approaches for statistical analysis are studied in a simulation example based on the MORGAM data.
Background:
Schistosomiasis infection, contracted through contact with contaminated water, is a global public health concern. In this paper we analyze data from a retrospective study reporting water contact and schistosomiasis infection status among 1011 individuals in rural China. We present semi-parametric methods for identifying risk factors through a comparison of three analysis approaches: a prediction-focused machine learning algorithm, a simple main-effects multivariable regression, and a semi-parametric variable importance (VI) estimate inspired by a causal population intervention parameter.
Results:
The multivariable regression found only tool washing to be associated with the outcome, with a relative risk of 1.03 and a 95% confidence interval (CI) of 1.01-1.05. Three types of water contact were found to be associated with the outcome in the semi-parametric VI analysis: July water contact (VI estimate 0.16, 95% CI 0.11-0.22), water contact from tool washing (VI estimate 0.88, 95% CI 0.80-0.97), and water contact from rice planting (VI estimate 0.71, 95% CI 0.53-0.96). The July VI result, in particular, indicated a strong association with infection status - its causal interpretation implies that eliminating water contact in July would reduce the prevalence of schistosomiasis in our study population by 84%, or from 0.3 to 0.05 (95% CI 78%-89%).
Conclusions:
The July VI estimate suggests possible within-season variability in schistosomiasis infection risk, an association not detected by the regression analysis. Though there are many limitations to this study that temper the potential for causal interpretations, if a high-risk time period could be detected in something close to real time, new prevention options would be opened. Most importantly, we emphasize that traditional regression approaches are usually based on arbitrary pre-specified models, making their parameters difficult to interpret in the context of real-world applications. Our results support the practical application of analysis approaches that, in contrast, do not require arbitrary model pre-specification, estimate parameters that have simple public health interpretations, and apply inference that considers model selection as a source of variation.
ObjectiveThe purpose of this paper was to compare two mathematical procedures to estimate the annual attributable number of deaths (the Allison et al procedure and the Mokdad et al procedure), and derive a new procedure that combines the best aspects of both procedures. The new procedure calculates attributable number of deaths along a continuum (i.e. for each unit of exposure), and allows for one or more neutral (neither exposed nor nonexposed) exposure categories.
Methods:
Mathematical derivations and real datasets were used to demonstrate the theoretical relationship and practical differences between the two procedures. Results of the comparison were used to develop a new procedure that combines the best features of both.FindingsThe Allison procedure is complex because it directly estimates the number of attributable deaths. This necessitates calculation of probabilities of death. The Mokdad procedure is simpler because it estimates the number of attributable deaths indirectly through population attributable fractions. The probabilities of death cancel out in the numerator and denominator of the fractions. However, the Mokdad procedure is not applicable when a neutral exposure category exists.
Conclusion:
By combining the innovation of the Allison procedure (allowing for a neutral category) and the simplicity of the Mokdad procedure (using population attributable fractions), this paper proposes a new procedure to calculate attributable numbers of death.