Skip to Content

Department of Statistics

  • Student working at a desk


Our department excels at collaborative research between faculty, students and other universities. 

Below lists recent papers (accepted/in press) authored by our tenured/tenure track faculty (shown in bold).

  1. Mou, X. and Wang, D. (2024). Additive partially linear model for pooled biomonitoring data. Computational Statistics and Data Analysis 190, 107862.
  2. Zhang W., Ma Z., Wang L., Fan D., and Ho Y. Genome-wide search algorithms for identifying dynamic gene co-expression via Bayesian variable selection. Statistics in Medicine, in Press.
  3. Zhang W., Ma Z., Ho Y., Yang S., Habiger J.D, Huang H.-H., Huang Y. Multi-omics integrative analysis for incomplete data using weighted p-value adjustment approach. Journal of Agricultural, Biological, and Environmental Statistics, in Press.
  4. Liu, Q. and Huang X. Parametric modal regression with error in covariates. Biometrical Journal, in press.
  5. Kwok, W.C., Ma, T., and Tam, T.C.C. (2023). Predicting the prognosis and survival in early-stage lung cancer after curative surgery. Journal of Oncology Research and Treatment, in press.
  6. Ma, T., Cai, Y., Shi, P., and Zhu, J. Hierarchical dependence modeling for the analysis of large insurance claims data. The Annals of Applied Statistics, in press.
  7. Ma, T., Mandujano Reyes, J.F., and Zhu, J. M-estimators for models with a mix of discrete and continuous parameters. Sankhya A, in press.
  8. Mo, C., Ma, T., and McPherson, B. ABO blood group and cochlear function-evidence from a large sample size study. International Journal of Audiology, in press.
  9. Zhong, S. and Hitchcock, D. (2023). Functional clustering of fictional narratives using Vonnegut curves. Advances in Data Analysis and Classification, in press.
  10. Hitchcock, D. (2023). Lessons from a discussion-based course on the history of statistics. The American Statistician, in press.
  11. Li, S., Hu, T., Wang, L., McMahan, C., and Tebbs, J. (2024). Regression analysis of group-tested current status data. Biometrika, in press.
  12. Shin, M., Wang, S., and Liu, J.S. Generative multi-purpose sampler for weighted M-estimation. Journal of Computational and Graphical Statistics, in press.
  1. Bai, R., Boland, M. R., and Chen, Y. (2023). Scalable high-dimensional Bayesian varying coefficient models with unknown within-subject covariance. Journal of Machine Learning Research 24, 1-49.
  2. Wang, D., Mou, X., and Liu, Y. (2023). Varying-coefficient regression analysis for pooled biomarker data. Biometrics 78, 1328-1341.
  3. Withana Gamage, P., McMahan, C., Wang, L. (2023). "A flexible parametric approach for analyzing arbitrarily censored data that are potentially subject to left truncation under the proportional hazards model". Lifetime Data Analysis 29, 188-212.
  4. Zhang, W., Ma, Z., Wang, L., Fan, D. and Ho Y. (2023). "Genome-wide search algorithms for identifying dynamic gene co-expression via Bayesian variable selection ". Statistics in Medicine 42, 5616-5629.
  5. Chakrabarti, M., Jiao, K., Potts, J.D., Wang, L., Branch, S., Harrelson, S., Khan, S., Mohamad Azhar (2023). "Hippo signaling mediates TGFß-dependent transcriptional inputs in cardiac cushion mesenchymal cells to regulate extracellular matrix remodeling". Journal Cardiovascular Development and Disease 10, 483.
  6. Zhang, H., Huang, X, and Arshad, H. (2023) Comparing dependent undirected Gaussian networks. Bayesian Analysis 18, 1341-1366.
  7. Chan, K.P.F., Ma, T., Sridhar, S., Lam, D.C.L., Ip, M.S.M. & Ho, P.L. (2023). Changes in etiology and clinical outcomes of pleural empyema during the COVID-19 pandemic. Microorganisms 11, 303.
  8. Kwok, W.C., Ho, J.C.M., Ma, T., Lam, D.C.L., Chan, J.W.M., Ip, M.S.M & Tam, T.C.C. (2023). Risk of hospitalized bronchiectasis exacerbation based on blood eosinophil counts among Chinese patients. The International Journal of Tuberculosis and Lung Disease 27, 61-65.
  9. Kwok, W.C., Ma, T., Ho, J., Lam, D. C. L., Sit, K. Y., Ip, M.S.M., Au, T.W.K. & Tam, T.C.C. (2023). Prediction model on disease recurrence for low-risk resected stage I lung adenocarcinoma. Respirology 28, 669-676.
  10. Ma, T., Wang, F., and Zhu, J. (2023). On generalized latent factor modeling and inference for high-dimensional binomial data. Biometrics 79, 2311-2320.
  11. Ma, T., Wang, F., Zhu, J., Ives, R.I. & Lewińska, K.E. (2023). Scalable semiparametric spatio-temporal regression for large data analysis. Journal of Agricultural, Biological and Environmental Statistics 28, 279-298.
  12. Mo, C., McPherson, B., and Ma, T. (2023). Cochlear function in individuals with and without spontaneous otoacoustic emissions. Audiology Research 13, 686-699.
  13. Oh, G., Aravamuthan, S., Ma, T., McGahan, I., Zhu, J., Ballmann, A., Russell, R. and Walsh, D. (2023). Model-based surveillance system design under practical constraints with application to white nose syndrome. Environmental and Ecological Statistics 30, 649-667.
  14. Liu, Z., Hitchcock, D., Singapogu, R. (2023). Cannulation skill assessment using functional data analysis. IEEE Journal of Biomedical and Health Informatics 27, 4512-4523.
  15. Dryden, I. (2023). Comments on: Shape-based functional data analysis. TEST.
  16. Sun, N., Bursac, Z., Dryden, I., Lucchini, R., Dabo‑Niang, S. and Ibrahimou, B. (2023). Bayesian spatiotemporal modelling for disease mapping: an application to preeclampsia and gestational diabetes in Florida, United States. Environmental Science and Pollution Research 30, 109283-109298.
  17. Mitchell, E., Dryden, I., Fallaize, C.J., Andersen, R., Bradley, A.V., Large, D.J., and Sowter, A. (2023). Model code for object oriented data analysis of surface motion time series in peatland landscapes. NERC EDS Environmental Information Data Centre.
  18. Warasi, M., Tebbs, J., McMahan, C., and Bilder, C. (2023). Estimating the prevalence of two or more diseases using outcomes from multiplex group testing. Biometrical Journal 65, 2200270.
  19. Bilder, C., Hitt, B., Biggerstaff, B., Tebbs, J., and McMahan (2023). bigGroup2: An R package for group testing. R Journal 15, 21-36.
  20. Qiang, B. and Pena, E. (2023). Robust simultaneous estimation of location parameters. Statistics and Probability Letters 193, 109730.
  21. Gel, Y., Pena, E. and Wang, H. (2023). Conversations with Gabor Szekely. Statistical Science 38, 355-367.
  22. Weaver, D. (2023). The mortality experience of disabled persons in the United States during the COVID-19 pandemic. Health Affairs Scholar, in press.
  23. Allotey, P. and Harel, O. (2023). Modeling geostatistical incomplete spatial correlated survival data with applications to COVID-19 mortality in Ghana. Spatial Statistics 54, 100730.
  24. Allotey, P. and Harel, O. (2023). Bayesian spatial modeling of incomplete data with application to HIV prevalence in Ghana. Sankhya B 85, 307-329.
  1. Bai, R., Moran, G., Antonelli, J., Chen, Y., and Boland, M. (2022). Spike-and-slab group lassos for grouped regression and sparse generalized additive models. Journal of the American Statistical Association 117, 184-197.
  2. Meeker, J., Burris, H., Bai, R., Levine, L., and Boland, M. (2022). Neighborhood deprivation increases the risk of post-induction cesarean delivery. Journal of the American Medical Informatics Association 29, 329-334.
  3. Cao, X., Gregory, K., and Wang, D. (2022). Inference for sparse linear regression based on the leave-one-covariate-out solution path. Communications in Statistics: Theory and Methods 52, 6640-6657.
  4. Wang, D., Mou, X., and Liu, Y. (2022). Varying coefficient regression analysis for pooled biomonitoring data. Biometrics 78, 1328-1341.
  5. Sun, L., Li, S., Wang, L., Song, X., and Sui, M. (2022). Simultaneous variable selection for joint models of multivariate interval-censored data. Biometrics 78, 1402-1413.
  6. Sherlock, P., DiStefano, C., and Habing, B. (2022). Effects of mixing weights and predictor distributions. Structural Equation Modeling 29, 70-85.
  7. Petitbon, A. and Hitchcock, D. (2022). What Kind of Music Do You Like? A Statistical Analysis of Music Genre Popularity Over Time.  Journal of Data Science 20, 168-187.
  8. Yang, Z., Ho, Y. (2022). Modeling dynamic correlation in zero-inflated bivariate count data with applications to single-cell RNA sequencing data. Biometrics 78, 766-776.
  9. Zhou, H. and Huang, X. (2022). Bayesian beta regression for bounded responses with unknown supports. Computational Statistics and Data Analysis 167, 107345.
  10. Wang, C. and Lin, X. (2022). Bayesian semiparametric regression analysis of multivariate panel count data. Stats 5, 477-493.
  11. Park, J., Jeon, Y., Shin, M., Jeon, M., and Jin, I. (2022). Bayesian shrinkage for functional network models with intractable normalizing constants. Journal of Computational and Graphical Statistics 31, 360-377.
  12. Kwok, W. C., Cheung, K. S., Ho, J. C. M., Li, B., Ma, T. , and Leung, W. K. (2022). High-dose proton pump inhibitors are associated with hospitalization for bronchiectasis exacerbation. The International Journal of Tuberculosis and Lung Disease 26, 917-921.
  13. Mo, Y., Habing, B., and Sedransk, N. (2022). Tree-based methods: A tool for modeling nonlinear complex  relationships and generating new insights from data. Journal of Data Science 3, 359-379.
  14. Sherlock, P., DiStefano, C. and Habing, B. (2022).  Effects of mixing weights and predictor distributions on regression mixture models.  Structural Equation Modeling: A Multidisciplinary Journal 29, 70-85.
  15. Shin, M. and Liu, J. (2022). Neuronized priors for Bayesian sparse linear regression. Journal of the American Statistical Association 17, 1695-1710.
  1. Bai, R. and Ghosh, M. (2021). On the beta prime prior for scale parameters in high-dimensional Bayesian regression models. Statistica Sinica 31, 843-865.
  2. Bai, R., Rockova, V., and George, E. (2021). Spike-and-slab meets LASSO: A review of the spike- and-slab LASSO. In Tadesse, M. and Vannucci, M. (Eds.), Handbook of Bayesian Variable Selection (pp 81-108). Chapman & Hall/CRC Press.
  3. Meeker, J., Canelon, S., Bai, R., Levine, L., and Boland, M. (2021). Individual- and neighborhood- level risk factors for severe maternal morbidity. Obstetrics & Gynecology 137, 847-854.
  4. Boland, M., Liu, J., Balocchi, C., Meeker, J., Bai, R., Mowery, D., and Herman, D. (2021). A method to link neighborhood-level covariates to COVID-19 infection patterns in Philadelphia using spatial regression. AMIA Annual Symposium Proceedings 2021, 545-554.
  5. Shin, M., Cho, H., Min, H., and Lim, S. (2021). Neural bootstrapper. Advances in Neural Information Processing Systems 34, NeurIPS 2021 Proceedings.
  6. Gregory, K., Mammen, E., and Wahl, M. (2021). Statistical inference in sparse high-dimensional additive models. Annals of Statistics, 49(3), 1514-1536.
  7. Peterson, L., Oram, M., Flavin, M., Seabloom, D., Smith, W., O’Sullivan, M., Vevang, K., Upadhyaya, P., Stornetta, A., Floeder, A., Ho, Y., and others (2021). Co-exposure to inhaled aldehydes or carbon dioxide enhances the carcinogenic properties of the tobacco specific nitrosamine 4- methylanitrosamino-1-(3-pyridyl)-1-butanone (NNK) in the A/J mouse lung. Chemical Research in Toxicology 34, 723-732.
  8. Lieberman, B., Kusi, M., Hung, C., Chou, C., He, N., Ho, Y., and others (2021). Toward uncharted territory of cellular heterogeneity: Advances and applications of single-cell RNA-seq. Journal of Translational Genetics and Genomics 5, 1-21.
  9. Wang, D. and Tang, C. (2021). Testing against uniform stochastic ordering with paired observations. Bernoulli 27, 2556-2563.
  10. Tang, C., Wang, D., El Barmi, H., and Tebbs, J. (2021). Testing for positive quadrant dependence. American Statistician 75, 23-30.
  11. Kim, C., Lin, X., and Nelson, K. (2021). Measuring rater bias in diagnostic tests with ordinal ratings. Statistics in Medicine 40, 4014-4033.
  12. Wang, L. and Wang, L. (2021). Regression analysis of arbitrarily censored survival data under the proportional odds model. Statistics in Medicine 40, 3724-3739.
  13. Sun, L., Li, S., Wang, L., and Song, X. (2021). A semiparametric mixture model approach for regression analysis of partly interval-censored data with a cured subgroup. Statistical Methods in Medical Research 30, 1890-1903.
  14. Pittman, R., Hitchcock, D., and Grego, J. (2021). Concurrent functional regression to reconstruct river stage data during flood events. Environmental and Ecological Statistics 28, 219-237.
  15. Zhong, S. and Hitchcock, D. (2021). S&P 500 stock price prediction using technical, fundamental and text data. Statistics, Optimization & Information Computing 9, 769-788.
  16. Zhang, H., Huang, X., Han, S., Rezwan, F., Karmaus, W., Arshad, H., and Holloway, J. (2021). Gaussian Bayesian network comparisons with graph ordering unknown. Computational Statistics and Data Analysis: 107156.
  17. Huang, X. and Zhang, H. (2021). Corrected score methods for estimating Bayesian networks with error-prone nodes. Statistics in Medicine 40, 2692-2712.
  18. Huang, X. and Zhang, H. (2021). Tests for Gaussian Bayesian networks via quadratic inference functions. Computational Statistics and Data Analysis: 107209.
  19. Kim, T., Lieberman, B., Luta, G., and Peña, E. (2021). Prediction regions for Poisson-based regression models. Wiley Interdisciplinary Reviews: Computational Statistics, e1568.
  20. Kim, T., Lieberman, B., Luta, G., and Peña, E. (2021+). Prediction regions for Poisson and over- dispersed Poisson regression models with applications in forecasting number of deaths during the covid-19 pandemic. Open Statistics 2, 81-112.
  21. Watson, S., Cooper, P., Liu, N., Gharraee, L., Du, L., Han, E., Peña, E., and others (2021). Diet alters age-related remodeling of aortic collagen in mice susceptible to atherosclerosis. American Journal of Physiology 320: H52-H65.
  22. Bilder, C., Tebbs, J., and McMahan, C. (2021). Informative array testing with multiplex assays. Statistics in Medicine 40, 3021-3034.
  23. Liu, Y., McMahan, C., Tebbs, J., Gallagher, C., and Bilder, C. (2021). Generalized additive regression for group testing data. Biostatistics 22, 873-889.
  24. Bilder, C., Tebbs, J., and McMahan, C. (2021). Discussion on “Is group testing ready for prime-time in disease identification?” Statistics in Medicine 40, 3881-3886.
  25. Mokalled, S., McMahan, C., Tebbs, J., Brown, D., and Bilder, C. (2021). Incorporating the dilution effect in group testing regression. Statistics in Medicine 40, 2540-2555.
  1. Joyner, C., McMahan, C., Tebbs, J., and Bilder, C. (2020). From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data. Biometrics 76, 913-923.
  2. Hou, P., Tebbs, J., Wang, D., McMahan, C., and Bilder, C. (2020). Array testing with multiplex assays. Biostatistics 21, 417-431.
  3. Wang, D., Tang, C., and Tebbs, J. (2020). More powerful goodness-of-fit tests for uniform stochastic ordering. Computational Statistics and Data Analysis 144, 106898.
  4. Bilder, C., Iwen, P., Abdalhamid, B., Tebbs, J., and McMahan, C. (2020). Tests in short supply? Try group testing. Significance 17, 15-16.
  5. Chakrabarti, M., Al-Sammarraie, N., Gebere, M., Bhattacharya, S., Johnson, J., Peña, E., and others (2020). Transforming growth factor Beta3 is required for cardiovascular development. Journal of Cardiovascular Development and Disease 7, 19, doi: 10.3390/jcdd7020019.
  6. Huang, X. and Zhou, H. (2020). Conditional density estimation with covariate measurement error. Electronic Journal of Statistics 14, 970-1023.
  7. Zhou, H. and Huang, X. (2020). Parametric mode regression for bounded responses. Biometrical Journal 61, 1791-1809.
  8. Wang, D., Mou, X., Li, X., and Huang, X. (2020). Local polynomial regression for pooled response data. Journal of Nonparametric Statistics 32, 814-837.
  9. Liu, H., Hitchcock, D., Samadi, S. (2020). Spatio-temporal analysis of flood data from South Carolina. Journal of Statistical Distributions and Applications 7, 11. 
  10. Samadi, S., Pourreza‐Bilondi, M., Wilson, C., and Hitchcock, D. (2020). Bayesian model averaging with fixed and flexible priors: Theory, concepts, and calibration experiments for rainfall‐runoff modeling. Journal of Advances in Modeling Earth Systems, 12, e2019MS001924.
  11. Liu, Q., Hodge, J., Wang, J., Wang, Y., Wang, L., and others (2020). Emodin reduces breast cancer lung metastasis by suppressing macrophage-induced breast cancer cell epithelial mesenchymal transition and cancer stem cell formation. Theranostics 10, 8365-8381.
  12. Mohammadi, E., Gregory, K., Thelwall, M., and Barahmand, N. (2020). Which health and biomedical topics generate the most Facebook interest and the strongest citation relationships? Information Processing and Management 57, 102230.
  13. Baek S., Ho, Y., and Ma, Y. (2020). Using sufficient direction factor model to analyze latent activities associated with breast cancer survival. Biometrics 76, 1340-1350. 
  14. Ma Z., Hanson T., Ho, Y. (2020). Flexible bivariate count data regressions. Statistics in Medicine 39, 3476-3490.
  15. Krizek, B., Blakley, I., Freese, N., Ho, Y., and Loraine A. (2020). The Arabidopsis transcription factor AINTEGUMENTA orchestrates patterning genes and auxin signaling in the establishment of flora growth and form. Plant Journal 103, 752-768.
  16. Yun, J., Shin, M., Jin, I., and Liang, F. (2020). Stochastic approximation Hamiltonian Monte Carlo. Journal of Statistical Computation and Simulation 90, 3135-3156.
  17. Shin, M., Bhattacharya, A., and Johnson, V. (2020). Functional horseshoe prior for subspace shrinkage. Journal of the American Statistical Association 115, 1784-1797.

Challenge the conventional. Create the exceptional. No Limits.