Emerging areas of research in environmental health sciences and the ascendancy of new interdisciplinary approaches have led to the development of new hypotheses and technologies to collect massive complex data that present great data analysis challenges. Substantially motivated by analytic needs arising from a major environmental health research center at this University of Michigan (U-M), the primary goal is to develop a set of novel statistical models and efficient algorithms to evaluate, interpret, and predict impacts of prenatal and/or postnatal exposures to environmental risk factors for adverse child health and developmental outcomes.
Funded by an NIH methodology R01 grant, the lab aims to (i) develop semiparametric models and algorithms to evaluate the influence of prenatal and/or postnatal exposures to toxic mixtures on delayed somatic growth and sexual maturation during the adolescent period; and (ii) develop semiparametric stochastic models to evaluate the functional rate of growth changes during the 0-5 year age-period and its potential alterations driven by exposure mixtures. Stochastic differential equations are utilized to model both growth velocity and acceleration as functions of anthropometric characteristics and toxicant mixtures, and the resulting model helps investigators to study various child growth milestones such as timing of velocity peak and adiposity (BMI) rebound.
ReferencesWith the support from an NIH methodology R01 grant, the lab is devoted to the development of novel statistical methods that enables to assess the validity of merging longitudinal cohort data and to perform joint analysis of merged data. Merging longitudinal data sets from multiple cohorts helps yield a desirable age spectrum in growth analyses but complicated by the underlying heterogeneity across different age cohorts. An efficient fused LASSO algorithm is proposed to detect potential inter-cohort heterogeneities that are subsequently accounted for to reach a valid joint analysis of merged data.
ReferencesAn evolving strategy, known as Kidney Paired Donation (KPD), provides an approach to overcome the barriers faced by many patients with kidney failure who present with willing, but immunologically or blood type incompatible living donors. KPD programs use a computerized algorithm to match one incompatible donor/recipient pair to another pair with a complementary incompatibility, such that the donor of the first pair gives to the recipient of the second, and vice versa. More complex exchanges of organs involving three or more pairs are also considered, as are altruistic or non-directed donors (NDD) who donate a kidney voluntarily and thereby have the potential to create a chain of kidney transplants. Such chains have become increasingly important in KPD programs.
Funded by an NIH grant (renewed), the lab has developed important methods based on sets with fallback options, including extensions to incorporate various features of current KPD pools including partially directed donors, candidates with multiple incompatible donors, compatible donor-candidate pairs, deceased donor initiated chains, and donors from pools with differing genetic makeups. In addition, the lab is currently developing efficient algorithms to enumerate subsets of interest in a KPD pool and to evaluate the expected utility that such subsets would attain if selected.
ReferencesData collected from networks are pervasive in practice. A network refers to a set of nodes or vertices joined in pairs by edges. An important feature of a network is that between-node distance may not be defined precisely in a numeric metric.
Supported an NSF funding, the lab pursuits the research with the following overarching: to develop quasi-likelihood theory and methods for regression analysis of multi-dimensional response variables on covariates that are collected from networks. Because data from a network are correlated across nodes, in order to achieve desirable efficiency of statistical inference we commit to address relevant analytic challenges pertinent to the need of incorporating appropriate dependence structures in estimation and inference for regression parameters.
ReferencesThe lab focuses on developing simultaneous statistical inference for Big Data. The current literature mostly provides statistical inference on a single variable at one time, which is not ideal in many practical settings where several parameters need to be examined simultaneously.
ReferencesThis page was last modified on: 06/10/2018
Questions or comments with the site? Contact the maintainer (Mathieu Bray).