Vita

 

U of L Office     

Department of Bioinformatics & Biostatistics       

School of Public Health and Information Sciences 

University of Louisville       

Louisville, KY 40202                                 

(502) 852 6376 (phone)       

(502) 852 3294 (fax) 

 

Home Office

1518 Crosstimbers Drive

Louisville, KY 40245         

(502) 245 3504 (phone/fax)

   

PERSONAL

Born  1962, Calcutta (now Kolkata), India; Citizen, USA;

Married to Susmita; one child Anisha.

 

EDUCATION

•       Ph. D.  (1988),  Statistics and Probability, Michigan State University, East Lansing.

•       M. Stat. (1985),  Mathematical Statistics and Probability, Indian Statistical Institute, Calcutta.

•       B. Stat. (1983),  Statistics, Indian Statistical Institute, Calcutta.

 

ACADEMIC POSITIONS HELD

•       2005 (Summer) – present: Professor (tenured), Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, USA.

•       1998 (Fall) – 2005(Spring):  Professor, Department of Statistics, University of Georgia,  Athens, GA, USA.

•       1993 (Fall) - 1998 (Summer):  Associate Professor (tenured), Department of Statistics, University of  Georgia,  Athens, GA, USA.

•       1988 (Fall) - 1993 (Summer):  Assistant Professor, Department of Statistics, University of Georgia, Athens, GA, USA.

 

ADMINISTRATIVE POSITIONS HELD

•       2009 (May) – present: Vice Chair, Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, USA.

•       2008 (Fall) – present: Biostatistics PhD Program Director, Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, USA.

•       2005 (Fall) - 2008 (Summer): Biostatistics Graduate Coordinator, Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, USA.

 

SHORT ACADEMIC VISITS

•       School of Public Health, University of Tampere, Finland, August 2008, May 2009, June 2011.

•       Department of Statistics and OR, University of Vigo, Vigo, Spain, September 2010.

•       Department of Mathematics for Science and Technology, University of Minho, Guimarães, Portugal, September 2010.

•       Department of Statistics, Southwest Jiaotong University, Chengdu, China, May-June, 2010.

•       Department of Medical Statistics and Bioinformatics, Leiden University Medical Center,  The Netherlands, May 2009.

•       Department of Statistics and Applied Probability, National University of Singapore, Singapore, December 2004.

•       Math Stat Division, Indian Statistical Institute, Calcutta, India, July 1999.

 

RESEARCH

 

Ph. D. Dissertation Title:  "Asymptotically Optimal Bayes Compound and Empirical Bayes Estimators in Exponential Families with Compact Parameter Space" (Professor James F. Hannan, Ph. D. dissertation advisor).

 

Research Interest 

 

Methodological: Biostatistics, Bioinformatics, Bootstrap Methods, Compound Decision, Analysis of Clustered Data, Clustering and Classification, Empirical Bayes, Genomics, Multistate Models, Nonparametrics, Proteomics, Rank Tests, Survival Analysis, Time Series Analysis.

 

Clinical: Autism, Dental Research, Spinal Cord Injury.

 

PUBLICATIONS

 

105. Chakraborty, S., Datta, S. and Datta, S. Surrogate variable analysis using partial least squares (SVA-PLS) in gene expression studies. Preprint (2011).

 

104. Datta, S., Nevalainen, J. and Oja, H. A general class of signed rank tests for clustered data when the cluster size is potentially informative. Preprint (2011).

 

103. Datta, S. and van Houwelingen, H. C. Statistics in biological and medical sciences, Editorial. Statistics & Probability Letters, 81,  715-716 (2011).

 

102. Datta, S., Lorenz, D. J., Harkema,  S. J.  A dynamic longitudinal evaluation of the utility of the Berg Balance Scale in patients with motor incomplete spinal cord injury.  Preprint (2011).

 

101. Lorenz, D. J., Datta,  S., Harkema, S. J.  Longitudinal patterns of functional recovery in patients with incomplete spinal cord injury receiving activity-based rehabilitation.  Preprint (2011).

 

100. Mostajabi, F., Datta, S., and Datta, S. Predicting patient survival from proteomic profile using MALDI-TOF mass spectrometry data in non-small cell lung cancer patients. Preprint (2011).

 

99. Forrest, G. F., Lorenz, D. J., Hutchinson, K., Van Hiel, L., Basso, D. M., Datta,  S., Sisto, S. A., Harkema,  S. J.  Relationships between balance and walking measures at baseline and after locomotor training in incomplete SCI: impact of functional recovery.  Archives of Physical Medicine and Rehabilitation, to appear (2011).

 

98. Ferguson, A. N., Datta, S., Brock, G. msSurv, an R package for nonparametric estimation of multistate models. Preprint (2011).

 

97. Lorenz, D. J., Datta, S. and Harkema, S. J. Marginal association measures for clustered data. Statistics in Medicine, to appear (2011).

 

96. Lorenz, D. J. and Datta, S. A log-rank test for waiting times in a multi-stage model. Preprint (2011).

 

95.  Fan, J. and Datta, S. Fitting accelerated failure time models to clustered survival data with potentially informative cluster size. Computational Statistics & Data Analysis, to appear (2011).

 

94. Nevalainen, J., Datta, S. and Oja, H. An overview of informative cluster size problems. Preprint (2011).

 

93. Fan, J. and Datta, S. Mann-Whitney tests for comparing sojourn time distributions when the transition times are right censored. Preprint (2011).

 

92. Habtzghi, D. and Datta, S. Goodness of fit tests for hazard function under shape restrictions. Preprint (2011).

 

91. Pihur, V., Datta, S. and Datta, S. Meta analysis of chronic fatigue syndrome through integration of clinical, gene expression, SNP and proteomic data, Bioinformation, 6, 120-124 (2011).

 

90. Datta, S. and Ferguson, A. N. Nonparametric estimation of marginal temporal functionals in a multistate model. In (Ilia Frenkel  and Anatoly Lisnianski, Eds.) Recent Advances in System Reliability: Signatures, Multi-state Systems and Statistical Inference, Springer, New York, to appear (2011).

 

89. Datta, S., Datta, S., Kim, S., Chakraborty, S. and Gill, R. S. Statistical Analyses of Next Generation Sequence Data: A Partial Overview, Journal of Proteomics & Bioinformatics, 3: 183-190 (2010).

 

88. Datta, S, Pihur, V. and Datta, S. An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data,  BMC Bioinformatics, 11:427 (2010).

 

87. Gill, R., Datta, S. and Datta, S. A statistical framework for differential network analysis from microarray data using partial least squares, BMC Bioinformatics, 11:95 (2010).

 

86. Lan, L. and Datta, S.  Comparison of state occupation, entry, exit and waiting times in two or more groups based on current status data in a multistate model. Statistics in Medicine,  29, 906 - 914 (2010).

 

85. Datta, S., Bandyopadhyay, D. and Satten, G. A. Inverse probability of censoring weighted U-statistics for right censored data with applications. Scandinavian Journal of Statistics, 37, 680–700 (2010).

 

84. Wang, M., Kong, M. and Datta, S. Inference for marginal linear models with clustered longitudinal data with potentially informative cluster sizes. Statistical Methods in Medical Research, 

doi: 10.1177/0962280209347043  (2010).

 

83. Lan, L. and Datta, S.  Nonparametric estimation of state occupation, entry and exit times with multistate current status data. Statistical Methods in Medical Research, 19, 147-165 (2010).

 

82. Pihur, V., Datta, S. and Datta, S. RankAggreg, an R package for weighted rank aggregation. BMC Bioinformatics, 10, 62 (2009).

 

81. Datta, S. and Datta, S. Computational biology touches all bases. Genome Biology, 10, 303 (2009).

 

80. Datta, S., Lan, L. and Sundaram, R. Nonparametric estimation of waiting time distributions in a Markov model based on current status data, Journal of Statistical Planning and Inference, 139, 2885-2897 (2009).

 

79. Datta, S., Lorenz, D. J., Morrison, S., Ardolino, E., Harkema, S. J. A multivariate examination of temporal changes in Berg variables for patients with AIS C and D spinal cord injuries. Archives of Physical Medicine and Rehabilitation, 90, 1208-1217 (2009).

 

78. Pihur, V., Brock, G., Datta, S. and Datta, S. Cluster validation for microarray data: An appraisal. In Multivariate Statistical Methods, (A. SenGupta, ed), ISI Platinum Jubilee series, World Scientific Press, Ch 5, 79-94 (2009).

 

77.  Pihur, V., Datta, S. and Datta, S. Finding cancer genes through meta-analysis of microarray experiments: Rank aggregation via the cross entropy algorithm. Genomics, 92, 400-403 (2008).

 

76. Pihur, V., Datta, S. and Datta, S. Reconstruction of genetic association networks from microarray data: A partial least squares approach. Bioinformatics, 24, 561-568 (2008). 

 

75. Datta, S. Classification of breast cancer versus normal samples from mass spectrometry profiles using linear discriminant analysis of important features selected by Random Forest, Statistical Applications in Genetics and Molecular Biology, 7 (2), Article 7 (2008).

 

74. Brock, G., Pihur, V., Datta, S. and Datta, S. clValid , an R package for cluster validation. Journal of Statistical Software, 25, 4 (2008).

 

73. Datta, S., Datta, S., Parrish, R. S.  and Thompson, C. M.  Microarray data analysis, In Computational Methods in Biomedical Research, (R. Khatree and D. Naik, eds.), Chapman & Hall/CRC Biostatistics Series, Volume 24, 1-43 (2008).

 

72.  Datta, S. and Satten, G. A. A signed-rank test for clustered data. Biometrics, 64, 501-507 (2008).

 

71. Bandyopadhyay, D. and Datta, S.  A novel approach to testing equality of survival distributions when the population marks are missing. Journal of Statistical Planning and Inference, 138, 1722-1732 (2008).

 

70. Pihur, V., Datta, S. and Datta, S. Understanding Chronic Fatigue Syndrome (CFS) from CAMDA data: A systems biology approach.  In CAMDA 2007 Proceedings,  online @ http://camda.bioinfo.cipf.es/camda07/agenda/detailed.html (2007).

 

69. Johnson, S. B., Datta, S., Hornung, C. A., Casanova, M. F. Mathematical models of epigenetic influences in Autism: a new perspective based on neuropathological findings.  In Progress in Autism Research, (Paul C. Carlisle, ed), Nova Science Publishers, Inc., 101-114,  New York: New York (2007).

 

68. Pihur, V., Datta, S. and Datta, S. Weighted rank aggregation of cluster validation measures: A Monte Carlo cross-entropy approach.  Bioinformatics, 23, 1607-1615 (2007).

 

67. Boratyn, G. M., Datta, S. and Datta, S. Incorporation of biological knowledge into distance for clustering genes, Bioinformation, 1, 396-405 (2007).

 

66.  Datta, S., Le-Rademacher, J. and Datta, S. Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO. Biometrics,  63, 259-271 (2007).

 

65. Zheng, H., Basawa, I. V. and Datta, S.  First order random coefficient autoregressive processes, Journal of Statistical Planning and Inference, 173, 212 – 229 (2007).

 

64. Datta, S., and Datta, S. Combining functional information in selecting clustering algorithms. In Proceedings of Interface 2005, on CD-ROM (2006).

 

63. Datta, S.  and Datta, S. Evaluation of clustering algorithms for gene expression data, BMC Bioinformatics,  7 (Suppl 4): S17, (2006).

 

62. Datta, S.  and Datta, S. Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes, BMC Bioinformatics, 7:397 (2006). 

 

61. Boratyn, G. M., Datta, S. and Datta, S. Biologically supervised hierarchical clustering algorithms for gene expression data, In Proceedings of the 28th IEEE  EMBS Annual International Conference, New York City, USA, 5515-5518 (2006).

 

60. Zheng, H., Basawa, I. V. and Datta, S.  The p-th order random coefficient autoregressive processes, Journal of Time Series Analysis, 27, 411-440 (2006).

 

59. Datta, S.  and Sundaram, R. Nonparametric marginal estimation in a multistage  model using current status data, Biometrics, 62, 829–837 (2006).

 

58. Datta, S.  and Datta, S.  Validation measures for clustering algorithms incorporating biological information, In IEEE Proceedings of International Multi-Symposiums on Computer and Computional Sciences (IMSCCS|06), (J. Ni, J. Dongarra, Y. Zheng, G. Gu, G. Wolfgang and H. Jin, eds.), 1, 131-135 (2006). http://doi.ieeecomputersociety.org/10.1109/IMSCCS.2006.139

 

57. Datta, S.  Estimating the mean life time using right censored data. Statistical  Methodology, 2, 65-69 (2005).

 

56.  Datta, S.  and Datta, S.  Empirical Bayes screening (EBS) of many p-values with applications to microarray studies, Bioinformatics, 21,1987-1994 (2005).

 

55. Datta, S. and Satten, G. A. Rank-sum tests for clustered data, Journal of the American Statistical Association, 100, 908-915 (2005).

 

54. Datta, S. Bootstrapping,  In Encyclopedia of Statistical Sciences, Second edition, Wiley, (2005).

 

53. Datta, S. Empirical Bayes methods, In Encyclopedia of Statistical Sciences, Second edition, Wiley, (2005).

 

52. Satten, G. A., Datta, S., Moura, H., Woolfitt, A., Carvalho, G., De, B. K,  Pavlopoulos, A., Carlone, G. M., and Barr, J. Standardization and denoising algorithms for mass spectra to classify whole-organism bacterial specimens,  Bioinformatics, 20, 3128-3136 (2004). 

 

51. Datta, S.  and Datta, S. An empirical Bayes adjustment to multiple p-values for the detection of differentially expressed genes in microarray experiments. In APBC 2004, (Y-P. P. Chen, ed.), 29, 155-159 (2004).

50. Datta, S.,  Satten, G. A., Benos, D. J., Xia, J.,  Heslin, M., and Datta, S. An empirical Bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments, Bioinformatics,  20, 235-242 (2004).

 

49. Satten, G. A. and Datta, S. Marginal Analyses of Multistage Data. In Handbook of Statistics (N. Balakrishnan and C. R. Rao, eds.), 23,  559-574, Elsevier-North Holland (2004).

 

48.  Chakraborty, S. and Datta, S. How will plant pathogens adapt to host plant resistance at elevated CO2 under a changing climate? New Phytologist, 159, 733-742 (2003).

 

47. Datta, S. and Datta, S. Comparisons and validation of statistical clustering techniques for microarray gene expression data.  Bioinformatics, 19,  459-466 (2003).

 

46. Williamson, J., Datta, S., and Satten, G. A. Marginal analyses of clustered data when cluster size is informative. Biometrics, 59, 36-42 (2003).

 

45. Datta, S. and Satten, G. A. Estimation of integrated transition hazards and stage occupation probabilities for non-Markov systems under stage dependent censoring. Biometrics, 58, 792-802 (2002).

 

44. Satten, G. A. and Datta, S.  Marginal estimation for Multistage models: waiting time distributions and competing risk analyses. Statistics in Medicine, 21, 3-19 (2002).

 

43. Datta, S. and Satten, G. A. Validity of the Aalen-Johansen estimators of stage occupation probabilities and integrated transition hazards for non-Markov models.  Statistics and Probability Letters, 55, 403-411 (2001).

 

42. Satten, G. A., Datta, S. and Robins, J. M. An estimator for the survival function when data are subject to dependent censoring.  Statistics and Probability Letters, 54, 397-403 (2001).

 

41. Satten, G. A. and Datta, S. The Kaplan-Meier Estimator as an inverse-probability-of-censoring weighted average. American Statistician, 55, 207-210 (2001).

 

40. Williamson, J. M., Satten, G. A., Hanson, J. A., Weinstock, H., and Datta, S. Analysis of dynamic cohort data. American Journal of Epidemiology, 154, 366-372 (2001).

 

39. Li, G. and Datta, S.  A bootstrap approach to nonparametric regression for right censored data. Annals of the Institute of Statistical Mathematics, 53, 708-729 (2001).

 

38. Datta, S. and Satten, G. A.  Estimating future stage entry and occupation probabilities in a multistage model based on randomly right-censored data. Statistics and Probability Letters, 50, 89-95 (2000).

 

37. Satten, G. A. and Datta, S.  A simulate-update algorithm for missing data problems. Computational Statistics, 15, 243-277 (2000).

 

36. Datta, S.  Empirical Bayes estimation with non-identical components.  Journal of Nonparametric Statistics, 12, 709-725 (2000).

 

35. Datta, S., Satten, G. A. and Datta, S. Nonparametric estimation for the three-stage irreversible illness-death model. Biometrics, 56, 841-847 (2000).

 

34. Datta, S.,  Satten, G. A. and  Williamson, J. M.  Consistency and asymptotic normality of estimators in a regression model with interval censoring and left truncation.  Annals of the Institute of Statistical Mathematics, 52, 160-172 (2000).

 

33. Datta, S., Satten, G. A. and Datta, S.  Estimation of stage occupation probabilities in multistage models. In Advances on Theoretical and Methodological Aspects of Probability and Statistics, (N. Balakrishnan, ed.), 493-506 (2000), Gordon and Breach.

 

32. Satten, G. A., Janssen, R., Busch, M. P., and Datta, S. Validating marker-based incidence estimates in repeatedly screened population. Biometrics, 55, 1224-1227 (1999).

 

31. Allen, M. R. and Datta, S.  Estimation of the index parameter for autoregressive data using the estimated innovations.  Statistics and Probability Letters, 41, 315-324 (1999).

 

30. Satten, G. A. and Datta, S.  Kaplan-Meier representation of competing risk estimates. Statistics and Probability Letters, 42, 299-304 (1999).

 

29. Allen, M. and Datta, S. A note on bootstrapping M-estimators in ARMA models. Journal of Time Series Analysis, 20, 365-380 (1999).

 

28. Bagui S. C. and Datta, S.  Some useful properties of the Bayes risk in classification. Calcutta Statistical Association Bulletin, 48, 83-91 (1998).

 

27. Datta, S., Mathew G. and McCormick, W. P. Nonlinear autoregression with positive innovations. Australian & New Zealand Journal of Statistics, 40, 229-239 (1998).

 

26. Satten, G. A. and Datta, S. and Williamson, J. M. A semiparametric approach to the proportional hazards model for interval censored data. Journal of the American Statistical Association, 93, 318-327 (1998).

 

25. Datta, S. and McCormick, W. P. Inference for the tail parameters of a linear process with heavy tailed innovations.  Annals of the Institute of Statistical Mathematics,  50, 337-359 (1998).

 

24. Datta, S. Making the bootstrap work.  In Frontiers in Probability and Statistics, (S. P. Mukherjee, S. K.Basu and B. K. Sinha, eds), Nasora Publishing, 119-129 (1998), Narosa, New Delhi.

 

23. Datta, S. and Hannan, J. F. A uniform L1 law of large numbers for functions on a totally bounded metric space. Sankhya A,  59, 167-174 (1997).

 

22.  Datta, S.  L1 density estimation for linear processes.  Journal of Time Series Analysis,  18, 375-383 (1997).

 

21. Datta, S. and Sriram, T. N. A modified bootstrap for autoregression without stationarity.  Journal of Statistical Planning and Inference,  59, 19-30 (1997).

 

20. Datta, S. On asymptotic properties of bootstrap for AR(1) processes. Journal of Statistical Planning and Inference,  53, 361-374 (1996).

 

19. Datta, S. and McCormick, W. P. Bootstrap inference for a first order autoregression with positive innovations. Journal of American Statistical Association, 90, 1289-1300 (1995).

 

18. Datta, S.  Limit theory and bootstrap for explosive and partially explosive autoregression. Stochastic Processes and Their Applications,  57, 285-304 (1995).

 

17. Datta, S. and Sriram, T. N. A modified bootstrap for branching processes with immigration. Stochastic Processes and Their Applications,  56, 275-294 (1995).

 

16. Datta, S. On a modified bootstrap for certain asymptotically non-normal statistics. Statistics and Probability Letters, 24, 91-98 (1995).

 

15. Datta, S. A minimax optimal estimator for continuous monotone densities. Journal of Statistical Planning and Inference, 46, 181-193 (1995).

 

14. Datta, S. Consistency of the mle for a general sequential design problem. Sankhya A, 57, 88-99 (1995).

 

13. Datta, S. and McCormick, W. P. Some continuous Edgeworth expansions for Markov chains with applications to bootstrap. Journal of Multivariate Analysis, 52, 83-106 (1995).

 

12. Datta, S.  Empirical Bayes estimation in a threshold model. Sankhya A, 54, 106-117  (1994).

 

11. Basawa, I. V. and Datta, S.  Large sample estimation for nested models. Journal of the Indian Society of Probability and Statistics, 1, 19-42 (1994).

 

10. Datta, S.  A solution to the set compound problem with certain non regular components. Statistics and Decisions, 11, 343-355 (1993).

 

9. Datta, S. and McCormick, W. P. Regeneration based bootstrap for Markov chains. Canadian Journal of Statistics, 21, 181-193 (1993).

 

8. Datta, S. and McCormick, W. P. On first order Edgeworth expansions for a Markov chain. Journal of Multivariate Analysis, 44, 345-359 (1993).

 

7. Datta, S.  Some non asymptotic bounds for L1 density estimation using kernels. Annals of Statistics, 20, 1658-1667 (1992).

 

6. Bhat, B. R. and Datta, S.  On the completeness of a family of conditional distributions. Statistics and Probability Letters, 15, 27-30 (1992).

 

5. Datta, S.  A note on continuous Edgeworth expansions and the bootstrap. Sankhya A,  54, 171-182 (1992).

 

4. Datta, S.  and McCormick, W. P. Bootstrap for a finite state Markov chain based on i.i.d. resampling. In Exploring the Limits of Bootstrap, (L. LePage and L. Billard, eds), 77-97 (1992), Wiley, New York.

 

3. Datta, S.  Nonparametric empirical Bayes estimation with O(n-1/2) rate of a truncation parameter. Statistics and Decisions, 9, 45-61 (1991).

 

2. Datta, S.  Asymptotic optimality of Bayes compound estimators in compact exponential families. Annals of Statistics, 19, 354-365 (1991).

 

1. Datta, S.  On the consistency of posterior mixtures and its application. Annals of Statistics, 19, 338-353 (1991).

 

EXTERNAL RESEARCH FUNDING (since 1995)

 

•       National Security Agency, "Nonparametric Regression of State Occupation Probabilities, State Entry, Exit and Waiting Time Distributions in a Multistate Model", Mathematical Sciences Grant, January 2011-December 2012, Role: Principal Investigator, 8.5%.

 

•       National Science Foundation, Statistics Program (DMS), "Theory and Applications of U-statistics for Multistate Models under Censoring", Standard Grant, July 2007-June 2011, Role: Principal Investigator, 8.5%.

 

•       National Institute of Health, R01 Grant, "Gross morphological correlates to the minicolumnopathy of autism", PI: M. Casanova, September 2009- August 2011, Role: Co-Investigator, 10%.

 

•       National Institute of Health, R01 Grant, "Plasticity of Human Spinal Neural Networks After Injury", PI: S. Harkema, January 2007-2009, Role: Co-Investigator, 10%.

 

•       Christopher Reeve Foundation, "Development of Neural Recovery Rehabilation and Research Centers", PI: S. Harkema, August 2006- February 2012, Role: Senior Statistician, 10%-40%.

 

•       National Security Agency, Mathematical Science Grant, "Nonparametric inference in censored data problems", Jan 2005- Dec 2006, Role: Principal Investigator, 8.5%.

 

•       National Institute of Health,  P01 Grant, "Neuromuscular plasticity after spinal cord injury", PI: Edgerton,  July 2006-April 2008, Role: Co-Investigator, 5%-10% .

 

•       National Institute of Health, R34, "Outcomes of Teacher Training on Autism", PI: L. Ruble, 2005-2008, Role: Co-Investigator, 3%-5%.

 

•       Centers for Disease Control and Prevention, Division of Molecular Biology, IPA, "Problems in Genetic Epidemiology", June 2001-May 2005, Role: Principal Investigator, 25%.

 

•       National Institute of Health, R15, PI: S. Subramanian, 2004-2007, Role: Consultant. 5%.

 

•       National Security Agency, "Large Sample Theory of Inverse Probability of Censoring Weighted Estimation in Multistage and Mixed Linear Models",  Mathematical Science Grant, Dec. 2002-Nov 2004, Role: Principal Investigator, 8.5%.

•       Centers for Disease Control and Prevention, Division of HIV/AIDS Prevention: Surveillance and Epidemiology, IPA, "Analysis of Complex Survival Data", Sept 1996-August 2000, Role: Principal Investigator, 25%.

 

•       National Security Agency, Mathematical Science Grant, Dec 1996-Dec 1999, Role: Principal Investigator, 8.5%.

 

•       National Science Foundation, Statistics Program (DMS), "Mathematical Sciences Computing Research Environments", Standard Grant, August 1995- July 1996, Role: Co-Principal Investigator (awarded jointly with L. Billard and T. N. Sriram).

 

AWARDS/HONORS

 

•       2011: CDC ATSDR 2011 Statistical  Science Award: Best Theoretical Paper, "Inverse Probability of Censoring Weighted U-statistics for Right-Censored Data with an Application to Testing Hypotheses", Datta, Somnath, Bandyopadhyay, Dipankar and Satten, Glen A.,Scandinavian Journal of Statistics, 37, 680-700 (2010).

 

•       2010: Elected Fellow, Institute of Mathematical Statistics.

 

•       2009: Elected member, International Statistical Institute.

 

•       2007: First Place, American Spinal Injury Association, Best Poster Award, 33rd Annual Scientific Meeting, for the poster " A Multivariate Examination of Temporal Change in BERG Balance Scale Variables for Patients with ASIA C AND D Spinal Cord Injuries" by S. Datta, D. Lorenz, M. Schmidt-Read, E. Ardolino, S. Morrison, and S.J. Harkema.

 

•       2007: Listed in Who’s Who in America, 61st  Edition.

 

•       2006: Elected Fellow,  American Statistical Association.

 

•       2005: CDC ATSDR 2005 Statistical  Science Award: Best Application Paper, “Standardization and denoising algorithms for mass spectra to classify whole-organism bacterial specimens”,  Satten, G. A., Datta, S., Moura, H., Woolfitt, A., Carvalho, G., De, B. K,  Pavlopoulos, A., Carlone, G. M., and Barr, J. : Bioinformatics, 20, 3128-3136 (2004).

 

•       2004: CDC ATSDR 2004 Statistical  Science Award: Best Theoretical Paper, “Marginal analyses of clustered data when cluster size is informative”, Williamson, J. M., Datta, S.  and Satten, G. A, Biometrics, 59, 36-42 (2003).

 

•       2003: Snedecor Award nomination for the paper “Estimation of integrated transition hazards and stage occupation probabilities for non-Markov systems under stage dependent censoring” by Datta, S. and Satten, G. A.. Biometrics, 58, 792-802 (2002). 

 

•       2001: CDC ATSDR 2001 Statistical  Science Award: Best Theoretical Paper, “A simulate-update algorithm for missing data problems”, Satten, G. A. and Datta, S.  Computational Statistics, 15, 243-277 (2000).

 

•       1999: CDC ATSDR 1999 Statistical  Science Award: Best Theoretical Paper, “A semiparametric approach to the proportional hazards model for interval censored data”, Satten, G. A. and Datta, S. and Williamson, J. M., Journal of the American Statistical Association, 93, 318-327 (1998).

 

•       1985-1988: Intermittent fellowships for merit in addition to teaching assistantship throughout in the Ph. D. program at Michigan State University;  GPA 4.0.

 

•       1986: Pass with distinction on the Ph. D. prelims at Michigan State University.

 

•       1980-1985: First class honors with distinction in B. Stat. and M. Stat. and many cash awards throughout these programs.

 

PROFESSIONAL ACTIVITIES

 

•       Editor-in-Chief (co with H. Koul), Statistics & Probability Letters, 2007-2011.

 

•       Guest Editor (co with H. van  Houwelingen),  Special Issue on Statistics in Biological and Medical Sciences. Statistics & Probability Letters, 2010-2011.

 

•       Associate Editor, The American Statistician, 2006-2011.

 

•       Associate Editor, BMC Bioinformatics, 2010-current.

 

•       Associate Editor, Communications in Statistics, 2002-current.

 

•       Co-Editor, Sankhya, 2001-2007.

 

•       Referee for  Journal of American Statistical Association, Annals of Statistics, Sankhya, Biometrics, Biometrika, Bioinformatics, BMC Bioinformatics, Statistics in Medicine, Journal of Multivariate Analysis, Journal of  Statistical Planning and Inference, Mathematical Methods in Statistics, Statistics & Decisions, Statistics and Probability Letters, Journal of Nonparametric Statistics, Scandinavian Journal of Statistics, Communications in Statistics,  Indian Journal of Statistics  and numerous other journals.

 

•       Reviewer: Mathematical Reviews; National Science Foundation; Springer; Portugese Science Foundation etc.

 

•       External evaluator for numorous promotion and tenure cases.

 

•       External evaluator for overseas PhD dissertations.

 

•       Panel member: Integrative Cancer Biology and Tumor Microenvironment, National Institute of Health, 2010.

 

•       Panel member: National Science Foundation, Statistics, 2008.

 

•       Panel member: Integrative Cancer Biology, National Institute of Health, 2004.

 

•       Organizer: Invited session on Interval Censored Multistate Models, IBC 2010, Florianópolis, Brazil.

 

•       Organizer: Invited session on Proteomics, ENAR 2010, New Orleans.

 

•       Organizer: Invited session on Proteomics, IBC 2008, Dublin, Ireland.

 

•       Organizer: Invited session on Multistate Models, JSM 2007.

 

•       Chair: Invited session on Statistics in Genomics, JSM 2004.

 

•       Member: American Statistical Association, Institute of Mathematical Statistics, International Statistical Institute, International Biometric Society (ENAR), International Indian Statistical Association, International Society for Computational Biology (past), International Society for Clinical Biostatistics, Forum for Interdisciplinary Mathematics.

 

•       Vice-president: Forum for Interdisciplinary Mathematics, 2011-2013.

•       Vice-president: Forum for Interdisciplinary Mathematics, 2009-2011.

 

TEACHING

At University of Georgia (1988—2005):

 

•       STA 2000: Elementary Statistics.  Large lecture format (150--250 students). 

•       STA 8530: Advanced Statistical Inference 1.  Ph. D. core course. 

•       STA 8540: Advanced Statistical Inference 2. Ph. D. core course. 

•       STA 8550: Asymptotic Inference. Ph. D. level. Books used  Asymptotic Statistics by van der Vaart and Approximation Theorems of Mathematical Statistics by Serfling.

•       STA 8570: Statistical Decision Theory. Books used Statistical Decision Theory by Berger and Mathematical Statistics: A Decision Theoretic Approach by Ferguson.

•       STA 8650: Bootstrapping Techniques. Books used The Jackknife, the Bootstrap and Other Resampling Plans by Efron and The Bootstrap and Edgeworth Expansion by Hall.

•       STA 9270/80: Supervised Statistical Consulting. Students get real life experience in Statistical Consulting.

•       STA 3330:  Advanced Applications and Computing. Book used  Modern Applied Statistics with S, 4th Edn., by W. N. Venables and B. D. Ripley.

•       STA 8990: Special Topics in Statistics. A course in advanced survival analysis offered to the Ph.D. students. Book used Statistical Models Based on Counting Processes by Andersen, Gill, Borgan and Keiding.

•       STA 4/6380: Survival Analysis. An introductory course in Survival Analysis.

•       STA  4/6240: Sampling and Survey Methods. An introductory course in sampling.

 

At University of Louisville (2005 - current):

 

•       PHST 762: Advanced Statistical Inference. PhD (Biostatistics concentration) core course.

•       PHST 783: Advanced Survival Analysis. PhD (Biostatistics concentration) core course.

•       PHST 780: Advanced Nonparametrics. PhD (Biostatistics concentration) elective course.

•       Numerous Independent Study courses.

 

GRADUATE STUDENTS (Major Professor)

 

PhD

 

•       Michael R. Allen, Inference and Bootstrap for Some Linear Time Series Models.  Completed: Summer 1997. Currently at Department of Mathematics, Tennessee Technological University, Cookeville, TN 38505.

 

•       S. Kim (jointly with I. V. Basawa), Inference for Nonlinear Time Series Models via Estimating Functions. Completed: Spring 1998. Currently at Department of Applied Statistics, Chung-Ang University. Seoul,156-756, Korea.

 

•       HaiTao Zheng (jointly with I. V. Basawa), Inference for Time Series Models for Count Data. Completed: Summer 2005. Currently at Department of Statistics, Southwest Jiaotong University, Chengdu, China.

 

•       Dipankar Bandopadhyay,  Novel Nonparametric Methods for Event Time Data. Completed: Spring 2006. Currently at Department of Biostatistics, Bioinformatics and Epidemiology, Medical University of South Carolina, Charleston, SC 29425.

 

•       DeSale Habtzghi (jointly with M. Meyer), Maximum Likelihood Based Estimation of Hazard Function under Shape Restrictions and Related Statistical Inference. Completed: Spring 2006. Currently at Department of Statistics, University of Akron, Akron, OH 44325.

 

•       Lan Ling, Inference for Multistate Models.  Completed: Summer 2008. Currently at Department of Biostatistics and Epidemiology, Medical College of Georgia, Augusta, GA 30912.

 

•       Vasyl Pihur (jointly with Susmita Datta), Statistical Methods for High-Dimensional Genomics Data Analysis. Completed: Summer 2009. Currently at Department of Biostatistics (Irizarry Lab), Johns Hopkins University, Baltimore, MD 21218.

 

•       Jie Fan, Inference for Time to Event and Sojourn Time Data under Right Censoring Using Reweighting Approaches. Completed: Summer 2010. Currently at Lombardi Cancer Center, Georgetown University, Washington, D.C. 20007.

 

•       Doug Lorenz (jointly with R. Gill), Regression Models for Waiting Times in Multistate Models. Completed: Spring 2011.

 

•       Farida Mostajabi (jointly with Susmita Datta), Regression Methods for Survival and Multistate Models. Completed: Summer 2011.

 

•       Nicole Ferguson (jointly with G. Brock), Methods and Software for Nonparametric Estimation in Multistate Models. Completed: Summer 2011.

 

•       Sutirtha Chakraborty (jointly with Susmita Datta). Expected completion: Summer 2013.

 

MS

 

•       Cathleen Gillespie, Intra-Individual Variation in Serum Vitamin A Measures Among Participants in the Third National Health and Nutrition Examination Survey 1988-1994. Completed: Spring 2002.

 

•       Yang Fan, A New Bivariate Survival Function Estimator under Random Right Censoring. Completed: Spring 2005.

 

•       Vasyl Pihur, Weighted Rank Aggregation of Cluster Validation Measures: A Monte Carlo Cross-Entropy Approach. Completed: Spring 2007.

•       Jie Fan (jointly with G. Brock),  Imputation Based Statistical Tests for Right Censored Data. Completed: Summer 2007.

 

•       Bart Brown (jointly with G. Brock), A Novel Method for Reference Interval Estimation Using the Inverted Q-Q Plot. Completed: Summer 2007.

 

•       Ming Wang (jointly with M. Kong), Analysis for Clustered Longitudinal Data. Completed: Summer 2008.

 

•       Daniel Riggs, An Investigation of Sliced Inverse Regression with Censored Data. Completed, Summer 2010.

 

INVITED TALKS

 

•       Steklov Mathematical Institute of Academy of Sciences, St. Petersburg, Russia, June 17, 2011.

 

•       School of Public Health, University of Tampere, Tampere,  Finland, June 13, 2011.

 

•       3rd Nordic-Baltic Biometric Conference,  Turku, Finland, June 6-9, 2011.

 

•       Applied Stochastic Models and Data Analysis (ASMDA 2011), Rome, Italy, June 7 - 10, 2011.

 

•       Workshop on Statistical Challenges in Life History Analysis at the Centre de Recherches Mathematiques,Montreal, Canada, May 16-19, 2011.

 

•       DUSDAA, The First International Conference on Theory and Applications of Statistics, Dhaka University, Dhaka, Bangladesh, December 26-29, 2010.

 

•       XXXII National Congress of Statistics and Operations Research and the VI Meeting on Public Statistics, A Coruña, Spain, September 14-17, 2010.

 

•       Department of Statistics and OR, University of Vigo, Vigo, Spain, September 13, 2010.

 

•       LinStat'2010 - International Conference on Trends and Perspectives in Linear Statistical Inference, Tomar, Portugal, July 27-31, 2010. KEYNOTE LECTURE.

 

•       Conference on Nonparametric Statistics and Statistical Learning, The Ohio State University, Columbus, OH, May 19 - 22, 2010.

 

•       Discussant, Session on Current Issues in Statistical Proteomics, ENAR 2010, New Orleans, USA, March 21-24, 2010.

 

•       The International Symposium on Stochastic Models in Reliability Engineering, Life Sciences, and Operations Management (SMRLO'10),  Beer Sheva, Israel, February 8-11, 2010.

 

•       VIII IISA Joint Statistical Meeting, Visakhapatnam, India,  January 4-8, 2010.

 

•       Seventh International Triennial Calcutta Symposium on Probability and Statistics, Kolkata, India, December 28 - 31, 2009.

 

•       Biostatistics Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC, September 15, 2009.

 

•       Joint Statistical Meetings, Washington, DC, August 1 - 6, 2009.

 

•       First IMS-Pacific Rim Meeting, Discussant in an invited session on "Statistics in Health Sciences", Seoul, June 28-July 1, 2009.

 

•       Symposium on New Directions in Asymptotic Statistics, University of Georgia, Athens, May 15-16, 2009.

 

•       Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands, May 12, 2009.

 

•       School of Public Health, University of Tampere, Tampere,  Finland, May 6, 2009.

 

•       Winemiller 2008: Conference on Survival Analysis and Its Applications, October 16-18, 2008, Columbia, Missouri.

 

•       Joint Statistical Meetings, August 3 - 7, 2008, Denver, Colorado.

 

•       Nonparametric Statistics and Mixture Models: Past, Present and Future, May 22-25, 2008, State College, PA.

 

•       Conference on Recent Advances in Statistics - In honor of Hira Koul's 65th birthday, The Re-weighting Approach in Survival Analysis, May 15-17, 2008, E. Lansing, MI.

 

•       ENAR 2008, Nonparametric Estimation of State Waiting Time Distributions in a Markov Multistate Model, Arlington, Virginia. March 16-19, 2008.

 

•       Discussant, Session on Multistate Models under Complex Censoring, JSM 2007, July 29, 2007, Salt Lake City, UT, USA.

 

•       Discussant, Session on Interval Censored Data, ENAR 2007, March 12, 2007, Atlanta, GA, USA.

•       Classification Competition on Clinical Mass Spectrometry Proteomic Diagnosis Data: Presentation of Results, Leiden University Medical Center, March 1, 2007, Leiden, The Netherlands.

•       International Conference on Statistics, Probability and Related Areas by IISA, January 2-5, 2007, Cochin, India.

•       International Conference on Multivariate Statistical Methods, Dec 28-29, 2006, Kolkata, India.

•       Discussant, Session on Genomics & Proteomics, International Biometric Society Conference IBC 2006, Montreal, Canada, July, 2006.

•       International Multi-Symposiums on Computer and Computional Sciences (IMSCCS|06), June 20-24, 2006, Zhejiang University, Hangzhou, China.

•       SCMA 2005 / FIM XII, Twelfth International Conference on Statistics, Combinatorics, Mathematics and Applications, December 2-4, 2005, Auburn University, Auburn, AL, USA.

•       Workshop on Statistical Analysis of Complex Event History Data, Norwegian Academy of Science and Letters, August 31-September 2, 2005, Oslo, Norway.

•       Joint Annual Meeting of the Interface and the Classification Society of North America, June 8, 2005 - June 12, 2005, Washington University School of Medicine, St. Louis, MO.

•       International Conference on Future of Statistical Theory, Practice and Education, December 29, 2004 - January 1, 2005, Hyderabad, India.

•       Eleventh International Conference on Interdisciplinary Mathematical and Statistical Techniques, SCRA 2004, December 27-29, 2004, Lucknow, India.

•       International  Conference  on Statistics in Health Sciences, June 23-23, 2004, Nantes, France.

•       IISA Conference, May 2004, University of Georgia, Athens, USA.

•       International Conference on Reliability and Survival Analysis 2003, May 2003, Department of Statistics,  University of South Carolina,  Columbia,  USA.

•       SCRA 2002, International Conference on Statistics, Combinatorics and Related Areas and the Ninth International Conference of the Forum for Interdisciplinary Mathematics, December 2002, Allahabad, India.

•       International Conference on Current Advances and Trends in Nonparametric Statistics, July 2002, Crete, Greece.

•       IISA International Conference on Statistics, Probability and Related Areas, June 2002, Dekalb, Illinois, USA.

•       SCRA 2001, International Conference on Statistics, Combinatorics, and Related Areas, December 2001, Wollongong, Australia.

•       IISA-JSM-INDIA 2000-2001, International conference on Statistics and Probability, December 2000-January 2001, New Delhi, India.

•       Sixth International Conference on Statistics, Combinatorics, and Related Areas, December 1999, Mobile, Alabama, USA.

•       ENAR Spring Meeting, March, 1999, Atlanta, Georgia.

•       IISA Conference, October, 1998, McMaster University, Hamilton, Canada.

•       Conference in honor of Jim Hannan, May 1998, Michigan State University, East Lansing, MI, USA.

•       Special Session on Applied Probability, AMS meeting, October, 1996, Chattanooga, TN, USA.

•       Symposium on Estimating Functions, March 1996, Athens, Georgia, USA.

•       SRCOS/ASA Summer Research Conference (Discussion Leader), June 1995, Indialantic, Florida.

•       INFORMS Applied Probability Conference, June 1995, Atlanta, Georgia, USA.

•       IMS, ENAR Joint Spring Meeting, March, 1995, Birmingham, Alabama, USA.

•       Second International Triennial Calcutta Symposium on Probability and Statistics, December 1994, Calcutta, India.

•       First IMS North American New Researcher's Meeting, August 1993, Berkeley,  California.

•       The Third Canadian Conference in Applied Statistics, May 1991, Statistics Canada, Montreal, Canada.

•       214 IMS Meeting (special topic Bootstrap), May 1990, East Lansing, USA.

Colloquia:

•       Department of Statistics, University of California, Davis, November, 2008.

•       Department of Statistics and Probability, Michigan State University, E. Lansing, March 2007.

•       Department of Statistics, University of Kentucky, Lexington, October 2005.

•       ASA Kentucky Chapter, Frankfort, September 2005.

•       Department of Statistics and Applied Probability, National University of Singapore, December 2004.

•       Department of Bioinformatics and Biostatistics, University of Louisville, November 2004.

•       Department of Biostatistics, University of Minnesota, March 2004.

•       CHEDA user group, BimCore and Department of Biostatistics, Emory University, March 2004.

•       Department of Mathematics, Univ. of N. Carolina, Charlotte, April 2003.

•       Department of  Biostatistics, Emory University, March 2003.

•       Department of Statistics, Univ. of S. Carolina, Columbia, October 2001.

•       Department of Biostatistics, Univ. of Alabama, Birmingham, August 2001.

•       School of Industrial and Systems Engineering, Georgia Tech., April 2001.

•       Indian Statistical Institute, Calcutta, July 1999.

•       Department of Statistics, Texas A&M University, College Station, May 1996.

•       Department of Statistics, Univ. of North Carolina, Chapel Hill, April 1995.

•       Department of Statistics, SUNY at Buffalo, Buffalo, February 1995.

•       Department of Mathematics, Univ. of North Carolina, Charlotte, February 1995.

•       Division of Statistics and Mathematics, Indian Statistical Institute, Calcutta, India, September 1992.

•       Computer Science Unit, Indian Statistical Institute, Calcutta, India, August 1992.

•       Department of Statistics, Iowa State University, Ames, September 1989.

•       Department of Statistics, University of Wisconsin, Madison, September 1989.

•       Department of Statistics and Probability, Michigan State University, East Lansing, June 1989.

•       Department of Statistics, Purdue University, West Lafayette, February 1988.

•       Department of Mathematics, McGill University, Montreal, Canada, January 1988.

 

REFEREED TALKS

•       MCP 2009: The 6th International Conference on Multiple Comparison Procedures, Tokyo, Japan, 2009.

•       29th Annual Conference of the International Society of Clinical Biostatistics, August 17-21, 2008, Copenhagen, Denmark.

•       CAMDA 2007, Analysis of CSF data, December 13-14, Valencia, Spain.

•       The Second Asia Pacific Bioinformatics Conference 18-22 Jan, 2004, (full paper accepted), Dunedin, New Zealand.

 

CONTRIBUTED TALKS/POSTERS

•       JSM, Vancouver, Canada, August 2010. Topics Contributed.

•       ISMB, Vienna, Austria, July 2007.

•       Research Louisville, Louisville, October 2005.

•       JSM 2004, Toronto, August 2004.

•       ENAR Spring Meeting, Pittsburgh, March 2004.

•       IBS Meeting, Cape Town, South Africa, December   

        1998.

•       IBS Meeting, Amsterdam, The Netherlands, July

          1996.

•       56th IMS Annual Meeting, San Francisco, August

          1993.

•       Second International Symposium on Probability and Its Applications, Bloomington, March 1993.

•       Special Contributed Session, 5th Purdue Symposium on Statistical Decision Theory and Related Topics, W. Lafayette, Indiana, June 1992.

 

WORKSHOPS/MEETINGS ATTENDED

•       IMS Annual Meeting, Gothenburg, Sweden, August 2010.

•       Rocky '08, 6th Annual Rocky Mountain Bioinformatics Conference, Snowmass, CO, December, 2008.

•       NIEHS SNPs Workshop, Brown Hotel, Louisville, KY, January 2008.

•       UT-ORNL-KBRIN Bioinformatics Summit 2008, Lake Barkley State Park, KY, April, 2008.

•       UT-ORNL-KBRIN Bioinformatics Summit 2006, Lake Barkley State Park, KY, April, 2006.

 

SOMNATH DATTA

Revised: August 2011

Made with Namu6