Surveillance research is of great importance for effective and efficient epidemiological monitoring of case counts and disease prevalence. Taking specific motivation from ongoing efforts to identify recurrent cases based on the Georgia Cancer Registry, we extend recently proposed “anchor stream” sampling design and estimation methodology. Our approach offers a more efficient and defensible alternative to traditional capture-recapture (CRC) methods by leveraging a relatively small random sample of participants whose recurrence status is obtained through a principled application of medical records abstraction. This sample is combined with one or more existing signaling data streams, which may yield data based on arbitrarily non-representative subsets of the full registry population. The key extension developed here accounts for the common problem of false positive or negative diagnostic signals from the existing data stream(s). In particular, we show that the design only requires documentation of positive signals in these non-anchor surveillance streams, and permits valid estimation of the true case count based on an estimable positive predictive value (PPV) parameter. We borrow ideas from the multiple imputation paradigm to provide accompanying standard errors, and develop an adapted Bayesian credible interval approach that yields favorable frequentist coverage properties. We demonstrate the benefits of the proposed methods through simulation studies, and provide a data example targeting estimation of the breast cancer recurrence case count among Metro Atlanta area patients from the Georgia Cancer Registry-based Cancer Recurrence Information and Surveillance Program (CRISP) database.
Pooling biological specimens prior to performing expensive laboratory assays has been shown to be a cost effective approach for estimating parameters of interest. In addition to requiring specialized statistical techniques, however, the pooling of samples can introduce assay errors due to processing, possibly in addition to measurement error that may be present when the assay is applied to individual samples. Failure to account for these sources of error can result in biased parameter estimates and ultimately faulty inference. Prior research addressing biomarker mean and variance estimation advocates hybrid designs consisting of individual as well as pooled samples to account for measurement and processing (or pooling) error. We consider adapting this approach to the problem of estimating a covariate-adjusted odds ratio (OR) relating a binary outcome to a continuous exposure or biomarker level assessed in pools. In particular, we explore the applicability of a discriminant function-based analysis that assumes normal residual, processing, and measurement errors. A potential advantage of this method is that maximum likelihood estimation of the desired adjusted log OR is straightforward and computationally convenient. Moreover, in the absence of measurement and processing error, the method yields an efficient unbiased estimator for the parameter of interest assuming normal residual errors. We illustrate the approach using real data from an ancillary study of the Collaborative Perinatal Project, and we use simulations to demonstrate the ability of the proposed estimators to alleviate bias due to measurement and processing error.
Background: Family carepartner management and support can improve stroke survivor recovery, yet research has placed little emphasis on how to integrate families into the rehabilitation process without increasing negative carepartner outcomes. Our group has developed creative approaches for engaging family carepartners in rehabilitation activities to improve physical and psychosocial health for both the carepartner and stroke survivor. The purpose of this study is to explore a novel, web-based intervention (Carepartner and Constraint-Induced Therapy; CARE-CITE) designed to facilitate positive carepartner involvement during a home-based application of constraint-induced movement therapy (CIMT) for the upper extremity.
Methods: The primary aim of the study is to determine feasibility of CARE-CITE for both stroke survivors and their carepartners. Carepartner mental health, family conflict surrounding stroke recovery, and stroke survivor upper extremity function will be evaluated using an evaluator blinded, two-group experimental design (blocked randomization protocol according to a 2:1 randomization schema) with 32 intervention dyads and 16 control dyads (who will receive CIMT without structured carepartner involvement). CARE-CITE consists of online education modules for the carepartner to review in parallel to the 30-h CIMT that the stroke survivor receives. The intent of CARE-CITE is to enhance the home-based intervention of CIMT, by helping the carepartner support the therapy and create a therapeutic home environment encouraging practice of the weaker arm in functional tasks.
Discussion: The CARE-CITE study is testing the feasibility of a family-integrated rehabilitation approach applied in the home environment, and results will provide the foundation for larger clinical studies. The overall significance of this research plan is to increase the understanding and further development of interventions that may serve as models to promote family involvement in the rehabilitation process.
Planned interventions and/or natural conditions often effect change on an ordinal categorical outcome (e.g., symptom severity). In such scenarios, it is sometimes desirable to assign a priori scores to observed changes in status, typically giving higher weight to changes of greater magnitude. We define change indices for such data based upon a multinomial model for each row of a c × c table, where the rows represent the baseline status categories. We distinguish an index designed to assess conditional changes within each baseline category from two others designed to capture overall change. One of these overall indices measures expected change across a target population. The other is scaled to capture the proportion of total possible change in the direction indicated by the data, so that it ranges from -1 (when all subjects finish in the least favorable category) to +1 (when all finish in the most favorable category). The conditional assessment of change can be informative regardless of how subjects are sampled into the baseline categories. In contrast, the overall indices become relevant when subjects are randomly sampled at baseline from the target population of interest, or when the investigator is able to make certain assumptions about the baseline status distribution in that population. We use a Dirichlet-multinomial model to obtain Bayesian credible intervals for the conditional change index that exhibit favorable small-sample frequentist properties. Simulation studies illustrate the methods, and we apply them to examples involving changes in ordinal responses for studies of sleep deprivation and activities of daily living.
Background: Little is known about the effect of drought on all-cause mortality, especially in higher income countries such as the United States. As the frequency and severity of droughts are likely to increase, understanding the connections between drought and mortality becomes increasingly important. Methods: Our exposure variable was an annual cumulative drought severity score based on the 1-month, county-level Standardized Precipitation Evapotranspiration Index. The outcome variables of demographic subgroup-specific all-cause mortality count data per year were obtained from the National Vital Statistics System. Any counts below 10 deaths were censored in that demographic group per county. We modeled county-stratum-year mortality using interval-censored negative binomial regression with county-level random intercepts, for each combined age-race-sex stratum either with or without further stratification by climate regions. Fixed effects meta-regression was used to test the associations between age, race, sex, and region with the drought-mortality regression coefficients. Predictive margins were then calculated from the meta-regression model to estimate larger subgroup (e.g., 'race' or 'sex') associations of drought with mortality. Results: Most of the results were null for associations between drought severity and mortality, across joint strata of race, age, sex and region, but incidence rate ratios (IRRs) for 17 subgroups were significant after accounting for the multiple testing; ten were < 1 indicating a possible protective effect of drought on mortality for that particular subpopulation. The meta-regression indicated heterogeneity in the association of drought with mortality according to race, climate region, and age, but not by sex. Marginal means of the estimated log-incidence rate ratios differed significantly from zero for age groups 25-34, 35-44, 45-54 and 55-64; for the white race group; and for the South, West and Southwest regions, in the analysis that included wet county-years. The margin of the meta-regression model suggested a slightly negative, but not statistically significant, association of drought with same-year mortality in the overall population. Conclusions: There were significant, heterogeneous-direction associations in subpopulation-stratified models, after controlling for multiple comparisons, suggesting that the impacts of drought on mortality may not be monolithic across the United States. Meta-regression identified systematic differences in the associations of drought severity with all-cause mortality according to climate region, race, and age. These findings suggest there may be important contextual differences in the effects of drought severity on mortality, motivating further work focused on local mechanisms. We speculate that some of the estimated negative associations of drought severity with same-year mortality could be consistent with either a protective effect of drought on total mortality in the same year, or with a delayed health effect of drought beyond the same year. Further research is needed to clarify associations of drought with more specific causes of death and with sublethal health outcomes, for specific subpopulations, and considering lagged effects occurring beyond the same year as the drought.
by
Hui-Len Lue;
Xiaojian Yang;
Ruoxiang Wang;
Wei Ping Qian;
Roy Z.H. Xu;
Robert Lyles;
Adeboye Osunkoya;
Binhua P. Zhou;
Robert L. Vessella;
Majd Zayzafoon;
Zhi-Ren Liu;
Haiyen E. Zhau;
Leland W.K. Chung
LIV-1, a zinc transporter, is an effector molecule downstream from soluble growth factors. This protein has been shown to promote epithelial-to-mesenchymal transition (EMT) in human pancreatic, breast, and prostate cancer cells. Despite the implication of LIV-1 in cancer growth and metastasis, there has been no study to determine the role of LIV-1 in prostate cancer progression. Moreover, there was no clear delineation of the molecular mechanism underlying LIV-1 function in cancer cells. In the present communication, we found increased LIV-1 expression in benign, PIN, primary and bone metastatic human prostate cancer. We characterized the mechanism by which LIV-1 drives human prostate cancer EMT in an androgen-refractory prostate cancer cells (ARCaP) prostate cancer bone metastasis model. LIV-1, when overexpressed in ARCaP E (derivative cells of ARCaP with epithelial phenotype) cells, promoted EMT irreversibly. LIV-1 overexpressed ARCaP E cells had elevated levels of HB-EGF and matrix metalloproteinase (MMP) 2 and MMP 9 proteolytic enzyme activities, without affecting intracellular zinc concentration. The activation of MMPs resulted in the shedding of heparin binding-epidermal growth factor (HB-EGF) from ARCaP E cells that elicited constitutive epidermal growth factor receptor (EGFR) phosphorylation and its downstream extracellular signal regulated kinase (ERK) signaling. These results suggest that LIV-1 is involved in prostate cancer progression as an intracellular target of growth factor receptor signaling which promoted EMT and cancer metastasis. LIV-1 could be an attractive therapeutic target for the eradication of pre-existing human prostate cancer and bone and soft tissue metastases.
Patients with cystic fibrosis (CF) suffer from chronic lung infection and inflammation leading to respiratory failure. Vitamin D deficiency is common in patients with CF, and correction of vitamin D deficiency may improve innate immunity and reduce inflammation in patients with CF. We conducted a double-blinded, placebo-controlled, randomized clinical trial of high-dose vitamin D to assess the impact of vitamin D therapy on antimicrobial peptide concentrations and markers of inflammation. We randomized 30 adults with CF hospitalized with a pulmonary exacerbation to 250 000 IU of cholecalciferol or placebo, and evaluated changes in plasma concentrations of inflammatory markers and the antimicrobial peptide LL-37 at baseline and 12 weeks post intervention. In the vitamin D group, there was a 50.4% reduction in tumor necrosis factor-α (TNF-α) at 12 weeks (P<0.01), and there was a trend for a 64.5% reduction in interleukin-6 (IL-6) (P = 0.09). There were no significant changes in IL-1β, IL-8, IL-10, IL-18BP and NGAL (neutrophil gelatinase-associated lipocalin). We conclude that a large bolus dose of vitamin D is associated with reductions in two inflammatory cytokines, IL-6 and TNF-α. This study supports the concept that vitamin D may help regulate inflammation in CF, and that further research is needed to elucidate the potential mechanisms involved and the impact on clinical outcomes.
Assuming a binary outcome, logistic regression is the most common approach to estimating a crude or adjusted odds ratio corresponding to a continuous predictor. We revisit a method termed the discriminant function approach, which leads to closed-form estimators and corresponding standard errors. In its most appealing application, we show that the approach suggests a multiple linear regression of the continuous predictor of interest on the outcome and other covariates, in place of the traditional logistic regression model. If standard diagnostics support the assumptions (including normality of errors) accompanying this linear regression model, the resulting estimator has demonstrable advantages over the usual maximum likelihood estimator via logistic regression. These include improvements in terms of bias and efficiency based on a minimum variance unbiased estimator of the log odds ratio, as well as the availability of an estimate when logistic regression fails to converge due to a separation of data points. Use of the discriminant function approach as described here for multivariable analysis requires less stringent assumptions than those for which it was historically criticized, and is worth considering when the adjusted odds ratio associated with a particular continuous predictor is of primary interest. Simulation and case studies illustrate these points.
Misclassification of binary outcome variables is a known source of potentially serious bias when estimating adjusted odds ratios. Although researchers have described frequentist and Bayesian methods for dealing with the problem, these methods have seldom fully bridged the gap between statistical research and epidemiologic practice. In particular, there have been few real-world applications of readily grasped and computationally accessible methods that make direct use of internal validation data to adjust for differential outcome misclassification in logistic regression. In this paper, we illustrate likelihood-based methods for this purpose that can be implemented using standard statistical software. Using main study and internal validation data from the HIV Epidemiology Research Study, we demonstrate how misclassification rates can depend on the values of subject-specific covariates, and illustrate the importance of accounting for this dependence. Simulation studies confirm the effectiveness of the maximum likelihood approach. We emphasize clear exposition of the likelihood function itself, to permit the reader to easily assimilate appended computer code that facilitates sensitivity analyses as well as the efficient handling of main/external and main/internal validation-study data. These methods are readily applicable under random cross-sectional sampling, and we discuss the extent to which the main/internal analysis remains appropriate under outcome-dependent (case-control) sampling.
Ratio estimators of effect are ordinarily obtained by exponentiating maximum-likelihood estimators (MLEs) of log-linear or logistic regression coefficients. These estimators can display marked positive finite-sample bias, however. We propose a simple correction that removes a substantial portion of the bias due to exponentiation. By combining this correction with bias correction on the log scale, we demonstrate that one achieves complete removal of second-order bias in odds ratio estimators in important special cases. We show how this approach extends to address bias in odds or risk ratio estimators in many common regression settings. We also propose a class of estimators that provide reduced mean bias and squared error, while allowing the investigator to control the risk of underestimating the true ratio parameter. We present simulation studies in which the proposed estimators are shown to exhibit considerable reduction in bias, variance, and mean squared error compared to MLEs. Bootstrapping provides further improvement, including narrower confidence intervals without sacrificing coverage.