Vita

I am a tenured Full Professor and a University Scholar in the Department of Bioinformatics and Biostatistics, School of Public Health and Information Sciences, at the University of Louisville. I am an elected member of the International Statistical Institute. My biography is included in the 2010 (64th) edition of Marquis Who's Who in America, as well as in Who's Who in the World, Who's Who in Science and Engineering, and Who's Who in Women. I serve as editor, associate editor, or editorial board member for five statistics and bioinformatics journals. I am currently the PI on two federally funded projects (NIH, NSF) and a co-I on two R01s, an NIH P30, and a P01. I have extensive research experience in population biology, statistical genetics, infectious disease modeling, non-linear regression modeling, and survival analysis. My bioinformatics interests span transcriptomics (microarray data analysis), proteomics (MALDI-MS, MALDI-MS/MS, and SELDI data), and metabolomics. My current research involves developing new methodology for high-dimensional data analysis using modern multivariate clustering and classification techniques and statistical inference, especially for identifying cancer biomarkers. I have generated my own proteomic data to detect biomarkers of fetal alcohol related disorder. I am the biostatistician for the NIH-funded UofL Alcohol Research Center and the Biostatistics group leader for the Bioinformatics, Biostatistics and Computational Biology (BBCB) Core of the Center for Genomics and Integrative Biology (CGIB). I have been designated Co-director of the Biostatistics Consortium for the Research Development Core of the CTSI proposal. I have always been involved in developing new programs and new courses. My students are well placed in academia and federal research institutes.

 

Address:

 

Department of Bioinformatics & Biostatistics

School of Public Health and Information Sciences

University of Louisville

Louisville, KY 40292

(502) 852 0081 (phone)

(502) 852 3294 (fax)

E-mail: susmita.datta_AT_louisville_DOT_edu

 

Education:

 

• Ph.D. Statistics, 1995, University of Georgia, Athens, USA.  Dissertation Title: Dynamics of Cytonuclear Disequilibria and Related Statistical Tests for The Neutrality of Mitochondrial DNA markers for Hybrid Zone Data  (under the direction of Prof. Jonathan Arnold, Department of Genetics, University of Georgia, Athens)

 

• M.S. Statistics, University of Georgia, Athens, USA.

 

• B.S. Physics major, University of Calcutta, India.

 

Positions Held:

 

• 2010 (June)-present, University Scholar, University of Louisville, Louisville.

 

• 2010 (January)-present, Professor, Department of Bioinformatics & Biostatistics, University of Louisville, Louisville.

 

• 2005- 2009, Associate Professor (tenured), Department of Bioinformatics & Biostatistics, University of Louisville, Louisville.

 

• 2002 - 2005, Associate Professor (tenured), Department of Mathematics and Statistics and Department of Biology, Georgia State University, Atlanta.

 

• 1997 - 2002, Assistant Professor, Department of Mathematics and Statistics, Georgia State University, Atlanta.

• 1995 - 1997, NRSA Post Doctoral Fellow, Department of Biostatistics, Emory University, Atlanta.

• Fall 2000-Summer 2001: Visiting Assistant Professor, Department of Genetics, University of Georgia, Athens.

 

Research Interests:

Bioinformatics, Proteomics, Infectious Disease Modeling, Statistical Genetics, Statistical Issues in Population Biology, Systems Biology, Birth Defect research and Survival Analysis.

 

Professional/Editorial:

 

Elected member of International Statistical Institute, 2007 - present.

Member: 

 

• International Society for Computational Biology.

• American Statistical Association.

• Institute of Mathematical Statistics. 

• International Biometric Society (ENAR).

• International Indian Statistical Association (Life)

• American Association for the Advancement of Science.

• Forum for Interdisciplinary Mathematics (Life).

 

Editorial Services: 

 

• Editorial Board Member, Briefings in Bioinformatics, 2010-

• Associate Editor, BMC Research Notes,  2008 -

• Associate Editor, Bioinformation, 2007-   

• Editorial Board Member, Bioinformation, 2006-   

• Special Issue Editor (Gene Expression Analysis), Bioinformation, 2007

• Associate Editor, Statistical Methodology, 2007-   

• Associate Editor, Statistics & Probability Letters, 2007-   

 

Member/Reviewer: 

 

• Program Committee Member, 19th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB) and ECCB10, the 10th European Conference on Computational Biology, Vienna, July 15-16, 2011.

 

• Elected Representative-at-Large Caucus for Women in Statistics for the term 2011-12.

 

• National Science Foundation, Biology Program, Mail Reviewer, May 2008.

 

• National Institutes of Health, Bio-defense Study Section, April 2003.

 

• (Invited) Emtech Bio Scientific Advisory Board members and Seed Grant Reviewers: Georgia Tech, Atlanta, October 2002.

 

• Member, Advisory Panel for MRI Program, National Science Foundation, 2001-2002.

 

• National Institutes of Health proposal review.

 

• Mathematical Reviews, several years.

 

• Book review for Statistics in Medicine, 2000.

 

• National Science Foundation proposal review, several years.

 

• Paper reviewer of multiple ISCB and CAMDA conferences as program committee member, several years.

 

 

Referee:

 

Bioinformatics, Briefings in Bioinformatics, BMC Bioinformatics, Proceedings of the National Academy of Sciences, Nucleic Acids Research, BMC Research Notes, BMC Health Services, Biometrics, JASA, Statistics and Its Interface, Journal of Chemometrics, Journal of Statistical Planning and Inference, Statistics in Medicine, Journal of Applied Statistics, Computational Statistics & Data Analysis, Journal of Multivariate Statistics, Scandinavian Journal of Statistics, Communications in Statistics, Mathematical Biosciences, Biotechnology, Genomics, Journal of Proteome Research, Pattern Recognition, International Journal of Data Mining and Bioinformatics, Physica A: Statistical Mechanics and Its Applications, Int. J. Developmental Neuroscience, Computer Methods and Programs in Biomedicine, etc.

 

Other:

 

• Program Committee Member ECCB10, The 9th European Conference on Computational Biology, Ghent, Belgium, Sept 26-29, 2010.

 

• ASA Scientific & Public Affairs Advisory Committee, 2010-2012.

 

• CAMDA 2009 Conference Scientific Committee, Chicago, IL, U.S.A., October 5-6, 2009.

 

• CAMDA 2008 Conference Scientific Committee, Vienna, Austria, December 2008.

 

• Invited session organizer at JSM 2008, A New Paradigm of Statistical Data Analysis: Omics Data, Denver, August 2008.

 

• Program Committee member, Frontiers of Probability and Statistical Science, Connecticut-Storrs, May 2008.

 

• CAMDA 2007 Conference Scientific Committee, Valencia, Spain, December 2007.

 

• Program Committee Member, ISMB 2007, Vienna, Austria, July 2007.

 

• Chair, Invited session at JSM 2007, Inference for Multistate Data under Complex Censoring Structures, Salt Lake City, July - August, 2007.

 

• Invited session organizer, Statistics in Genomics and Proteomics, International Biometric Society Conference, IBC 2006, Montreal, Canada, July, 2006

 

• Program Committee Member, ISMB 2005, Michigan, July 2005.

 

• Invited session organizer, Statistics in Genomics, JSM Toronto, August 2004.

 

• Invited session organizer, Genetic Data Analysis, International Conference on Statistics in Health Sciences, Nantes, France, June 2004.

 

• Co-organizer, student paper competition for the IISA conference, Athens, GA, May 2004.

 

• Organized (and chaired) an invited session titled "Recent Contributions in Bioinformatics" at JSM San Francisco, August, 2003.

 

• Executive Board Member and President of Young Professional Statisticians, IISA.

 

• Organized an invited session on Bioinformatics at SCRA 2002-FIM IX: Ninth International Conference of Forum for Interdisciplinary Mathematics on Statistics Combinatorics and Related Areas, Department of Statistics and Department of Mathematics: University of Allahabad, Allahabad, UP 211 002, India, December 21-23, 2002.

 

• Organized an invited session titled Survival Skills for Young Statisticians at the IISA International Conference on Statistics, Probability and Related Areas, Dekalb, Illinois, June 2002.

 

• Organized an invited session on Statistics in Bioinformatics at the International Conference on Statistics, Combinatorics and Related Areas and the Eighth International Conference of the Forum for Interdisciplinary Mathematics, Wollongong, Australia, December 2001.

 

• Chair, invited session "Bioinformatics: Statistical Perspectives and Controversies" at the International Conference on Statistics, Combinatorics and Related Areas and the Eighth International Conference of the Forum for Interdisciplinary Mathematics, December 2001.

 

• Invited Session Organizer, ENAR, 2001 Joint Statistical Meeting, 2001, Atlanta, Georgia.

• Session Chair, Statistical Genetics, ENAR Spring Meeting, 1999, Atlanta, Georgia.

 

• Local Organizing Committee, ENAR Spring Meeting, 1999, Atlanta, Georgia.

• Session Chair, Applications of State-Space Modeling in the Science, Special Contributed Session,  Joint Statistical Meeting, 1999, Baltimore, Maryland.

 

Publications:

 

Datta, S., Fu, Y. X., Arnold, J. (1996). Dynamics and equilibrium behavior of cytonuclear disequilibria under genetic drift, mutation, and migration, Theoretical Population Biology, 50, 298-324.

 

Datta, S. and Arnold, J. (1996). Diagnostics and a statistical test of neutrality hypothesis using the dynamics of cytonuclear disequilibria, Biometrics, 52, 1042-1054.

 

Datta, S., Rand, D. M., and Arnold, J. (1996). A statistical test of a neutral model using the dynamics of cytonuclear disequilibria, Genetics, 144, 1985-1992.

 

Longini, I. M., Datta, S., and Halloran, E. (1996). Measuring vaccine efficacy for both susceptibility to infection and reduction in infectiousness for prophylactic HIV-1 vaccines, Journal of Acquired Immune Deficiency Syndromes and Human Retrovirology, 13, 440-447.

 

Datta, S., Longini, I. M., and Halloran, E. (1997). Measuring vaccine efficacy for different HIV vaccine trials, Statistics in Medicine, 17, 185-200.

 

Datta, S. and Arnold, J. (1998). Dynamics of cytonuclear disequilibria in subdivided populations, Journal of Theoretical Biology, 192, 99-111.

 

Datta, S. (1999). Hypotheses testing for different selection models using multi-generation cytonuclear data, Proceedings of American Statistical Association, Biometrics Section, 157-161, Alexandria, USA.

 

Scribner, K. T., Datta, S., Arnold, J., and Avise, J. C. (1999). Empirical evaluation of cytonuclear models incorporating genetic drift and tests for neutrality of mtDNA variants: data from experimental Gambusia hybrid zones, Genetica, 105, 101-108.

 

Datta, S., Halloran, E. M. and Longini, I. M. (1999). Efficiency of estimating vaccine efficacy for susceptibility and infectiousness: randomization by individual versus household, Biometrics, 55, 792-798.

 

Datta, S. (2000). Some statistical aspects of cytonuclear disequilibria. In Statistics in Molecular Biology and Genetics, Ed: Francoise Seillier-Moiseiwitsch, IMS Lecture Notes-Monograph Series, 33, 21-37.

 

Datta, S., Satten, G. A. and Datta, S. (2000). Nonparametric estimation for the three stage irreversible illness-death model, Biometrics, 56, 841-847.

 

Datta, S. (2000). Some statistical issues involving multi-generation cytonuclear data, In Advances on Methodological and Applied Aspects of Probability and Statistics, N. Balakrishnan, Ed., Gordon and Breach, 525-546.

 

Datta, S., Satten, G. A. and Datta, S. (2000). Estimation of stage occupation probabilities in multistage models, In Advances on Theoretical and Methodological Aspects of Probability and Statistics, N. Balakrishnan, Ed., Gordon and Breach, 493-506.

 

Datta, S. (2000). Book Review: Statistics in Human Genetics by Pak Sham. Statistics in Medicine, 19, 1384-1385.

 

Datta, S. (2001). Estimation of selection parameters using multi-generation cytonuclear data, Biometrical Journal, 43, 219-233.

 

Datta, S. (2001). Exploring relationships in gene expressions: A partial least squares approach, Gene Expression, 9, 257-264.

Datta, S. (2001). Testing neutrality of mtDNA using multigeneration cytonuclear data, Selected Proceedings of the Symposium on Inference for Stochastic Processes, Eds.: I. V. Basawa, C. C. Heyde and R. L. Taylor,  IMS Lecture Notes - Monograph Series, 37, 173-184, IMS, Beachwood, OH.

 

Datta, S. and Arnold, J. (2002). Some comparisons of clustering and classification techniques applied to transcriptional profiling data. In Advances in Statistics, Combinatorics and Related Areas, Eds.: C. Gulati, Y-X. Lin, S. Mishra, and J. Rayner, World Scientific, 63-74.

 

Datta, S. (2003). Statistical techniques for microarray data: A partial overview, Communications in Statistics-Theory and Methods, 32, 263-280.

 

Datta, S. and Datta, S. (2003) Comparisons and validation of statistical clustering techniques for microarray gene expression data,  Bioinformatics, 19,  459-466. 

 

Arnold, J., Schuttler, H.-B., Logan, D., Griffith, J., Arpinar, B., Datta, S., Kochut, K. J., Kraemer, E., Miller, J. A., Sheth, A., Aleman-Meza, B., Doss, J., Harris, L. and Nyong, A. (2003). Metabolomics, In Handbook of Industrial Mycology, Chapter 23. Marcel-Dekker, New York, NY.

 

Schut, G., Brehm, S., Datta, S., and Adams, M. W. W. (2003). Whole Genome DNA microarray of a hyperthermophile and an archaeon: Pyrococcus furiosus grown on peptides and carbohydrate, Journal of Bacteriology, 185, 3935-3947.

 

Datta, S.,  Satten, G. A., Benos, D. J., Xia, J.,  Heslin, M., and Datta, S. (2004). An empirical Bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments, Bioinformatics, 20, 235-242.

 

Datta, S. and Datta, S. (2004). An empirical Bayes adjustment to multiple p-values for the detection of differentially expressed genes in microarray experiments. In Bioinformatics 2004, Conferences in Research and Practice in Information Technology - Second Asia-Pacific Bioinformatics Conference, 29, Y-P. P. Chen, Ed., 155-159, Australian Computer Society, Sydney.

 

Warrenfeltz, Z., Pavlik, S., Datta, S., Kraemer, E., Benedict, B., McDonald, J. F. (2004). Gene expression profiling of epithelial ovarian tumors correlated with malignant potential. Molecular Cancer, 3:27.

 

Datta, S.  and Datta, S. (2005). Empirical Bayes screening (EBS) of many p-values with applications to microarray studies, Bioinformatics, 21, 1987-1994.

 

Weinberg, M. V., Schut, G. J., Brehm, S., Datta, S., and Adams, M. W. W. (2005). A hyperthermophilic cold shock response: the archaeon Pyrococcus furiosus synthesizes novel membrane-bound glycoproteins at a sub-optimal growth temperature. Journal of Bacteriology, 187, 336-348.

 

Datta, S. (2005). Statistics in Genetics, In Encyclopedia of Statistical Sciences, Second edition, Wiley, New York.

 

Datta, S. (2005). Statistics in Microarray Analysis, In Encyclopedia of Statistical Sciences, Second edition, Wiley, New York.

 

Datta, S. (2005). Statistics in Vaccine Studies, In Encyclopedia of Statistical Sciences, Second edition, Wiley, New York.

 

Datta, S. and de Padilla, L.M. (2006). Feature selection and machine learning with mass spectrometry data for distinguishing cancer and non-cancer samples, Statistical Methodology (Special Issue on Bioinformatics), 3, 79-92.

 

Datta, S. and Datta, S. (2006). Validation measures for clustering algorithms incorporating biological information, IEEE Proceedings of International Multi-Symposiums on Computer and Computational Sciences (IMSCCS|06), (J. Ni, J. Dongarra, Y. Zheng, G. Gu, G. Wolfgang and H. Jin, Eds.), 1, 131-135.

 

Datta, S.  and Datta, S. (2006). Evaluation of clustering algorithms for gene expression data, BMC Bioinformatics, 7 (Suppl 4): S17.

 

Datta, S.  and Datta, S. (2006).  Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes, BMC Bioinformatics, 7, 397. (Highly Accessed)

 

Datta, S. and Datta, S. (2006). Validation of statistical clustering using biological information, Proceedings of INTERFACE 2005 (CD-ROM).

 

Boratyn, G. M., Datta, S. and Datta, S. (2006). Biologically supervised hierarchical clustering algorithms for gene expression data, Proceedings of the 28th IEEE  EMBS Annual International Conference, New York City, USA, 5515-5518.

 

Datta, S., Le-Rademacher, J. and Datta, S. (2007). Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO, Biometrics, 63, 259-271.

 

Datta, S., Datta, S., Parrish, R. S. and Thompson, C. M. (2007). Microarray data analysis. In Computational Methods in Biomedical Research, R. Khatree and D. Naik, eds., Chapman & Hall/CRC Biostatistics Series, Volume 24, 1-43.

 

Boratyn, G. M., Datta, S. and Datta, S. (2007). Incorporation of biological knowledge into distance for clustering genes. Bioinformation, 1, 396-405.

 

Pihur, V., Datta, S. and Datta, S. (2007). Weighted rank aggregation of cluster validation measures: A Monte Carlo cross-entropy approach. Bioinformatics,  23, 1607-1615.

 

Pihur, V., Datta, S. and Datta, S. (2007). Understanding Chronic Fatigue Syndrome (CFS) from CAMDA data: A systems biology approach. Proceedings of CAMDA 2007, full paper, online @ http://camda.bioinfo.cipf.es/camda07/agenda/detailed.html.

 

Brock, G., Pihur, V., Datta, S. and Datta, S. (2008). clValid, an R package for cluster validation. Journal of Statistical Software, 25, 4.

 

Pihur, V., Datta, S. and Datta, S. (2008). Finding cancer genes through meta-analysis of microarray experiments: Rank aggregation via the cross entropy algorithm. Genomics, 92, 400-403.

 

Pihur, V., Datta, S. and Datta, S. (2008). Reconstruction of genetic association networks from microarray data: A partial least squares approach. Bioinformatics,  24, 561-568.

 

Datta, S., Turner, D., Singh, R., Ruset, B., Pierce, W. M.,  and Knudsen, T. B. (2008). Fetal alcohol syndrome in mice detected through proteomics screening of the amniotic fluid. Birth Defects Research Part A: Clinical and Molecular Teratology, 82, 177-186.

 

Datta, S. and Pihur, V. (2009). Feature selection and machine learning with mass spectrometry data. In Clinical Proteomics: Methods, Applications and Tools, R. Matthiesen, ed., Humana Press, pp. 205-229.

 

Pihur, V., Brock, G., Datta, S. and Datta, S. (2009). Cluster validation for microarray data: An appraisal. In Multivariate Statistical Methods, A. SenGupta, ed., ISI Platinum Jubilee series, Vol 5, World Scientific Press, pp. 79-94.

 

Datta, S. and Datta, S. (2009). Computational biology touches all bases. Genome Biology, 10:303.

 

Pihur, V., Datta, S. and Datta, S. (2009). RankAggreg, an R package for weighted rank aggregation. BMC Bioinformatics, 10:62. (Highly Accessed)

 

Yoo, J. K., Patterson, B. S. and Datta, S. (2009). OLS-based predictor test in single index model to predict transcription rate by histone acetylation level, Statistics & Probability Letters, 79: 20, 2109-2114.

 

Atlas, M. and Datta, S. (2009). Monoisotopic Peak Detection for Mass Spectrometry Data, Journal of Proteomics and Bioinformatics, 2.5, 202-216.

 

Gill, R., Datta, S. and Datta, S. (2010). A statistical framework for differential network analysis  from microarray data using partial least squares, BMC Bioinformatics, 11:95.

 

Datta, S., Pihur, V. and Datta, S. (2010). An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data, BMC Bioinformatics, 11:427.

 

Datta, S., Datta, S., Kim, S., Chakraborty, S. and Gill, R. S. (2010). Statistical Analyses of Next Generation Sequence Data: A Partial Overview, Journal of Proteomics & Bioinformatics, 3: 183-190.

 

Ndukum, J., Fonseca, L. L., Santos, H., Voit, E. O. and Datta, S. (2010). Statistical Inference for Sparse Biological Time Series Data, under revision.

 

Li, X., Gill, R., Cooper, N. G. F., Yoo, J. K., and Datta, S. (2010). Modeling microRNA-mRNA Interactions Using PLS Regression in Human Colon Cancer, under revision.

 

Mostajabi, F., Datta, S., Datta, S. (2010) Predicting Patient Survival from Proteomic Profile using MALDI-TOF Mass Spectrometry Data in Non-small Cell Lung Cancer Patients. Under revision.

 

Ndukum, J., Atlas, M., Datta, S. (2011) pkDACLASS: open source software for analyzing MALDI-TOF, submitted.

 

Litchfield, L. M., Riggs, K. A., Emberts, C. G., Hockenberry, A. M., McConda, D. B, Oliver, L. D., Fox, J. M., Hu, C., Cai, J., Pierce, W. M., Jr., Ivanova, M. M., Bates, P. J., Martin, R. G. C., Appana, S. N., Datta, S., Kulesza, P., and Klinge, C. M (2011). Regulation of COUP-TFII transcriptional activity by interaction with nucleolin in human breast cancer cells and tumors, submitted.

 

Manavalan, T. T., Teng, Y., Bhimani, S., Clark, S., Datta, S., Kalbfleisch, T. S., Li, Y., and Klinge, C. M. (2011). Differential expression of microRNA expression in tamoxifen-sensitive MCF-7 versus tamoxifen-resistant LY2 human breast cancer cells, ready to be submitted.

 

Stallons, L. J., Kalbfleisch, T., Cambon, A., Datta, S., Kucherlapati, R., Kunkel, T. A. and McGregor, W. G. (2011). DNA polymerase iota alters PLK signaling for mitotic entry and exit after UV, ready to be submitted.

 

 

   

Grants:

 

• Co-director of Biostatistics Consortium for the Research Development Core 15% effort submitted CTSI proposal, 2010.

 

• Biostatistics Advisor on Bioinformatics core, 10% effort submitted CTSI proposal, 2010.

• PI (no co-PI), 25% effort, National Institutes of Health, NCI R15 CA133844-01. Funded. Development of Statistical Methods for Analyzing Proteomic Cancer Data, 7/1/2009-3/31/2012.

 

• PI 17% effort, National Science Foundation, Statistics Program (DMS), Standard Grant, Statistical peak detection, adaptive classification and protein-protein network construction using mass spectra, DMS-0805559, 2008-2011.

 

• Co-PI, U of Louisville Subcontract, 5% effort, NIH R01, Environmental Interactions with the Genome and Epigenome. 2008-2013.

• Co-I/Biostatistician, 10% effort, NIH P01 AA017103-01, Alcohol, Liver Disease and Alcohol Nutrient Interactions, (Craig McClain, PI) 2008-2013.

 

• Co-I, 8% effort, DNA Sequences Impact Estrogen and Anti-Estrogen Activities, (Carolyn M. Klinge, PI) NIH R01 DK053220-10A1, 2008-2013.

 

• Biostatistics Group Leader, Bioinformatics, Biostatistics and Computational Biology Core, Center for Environmental Genomics and Integrative Biology (K. Ramos, PI), 20% effort, NIEHS-NIH, 2007-2011.

 

• PI 30% effort, Proteomics Based Approach for Early Detection of Fetal Alcohol Syndrome, P20-RR/DE17702, NIH COBRE (PI, R. Green), 2006-2007.

• Co-I  (M. J. Kennedy, PI, Louisville) 5% effort, Aminoglycoside Urinary Proteomics, 2007-2009.

 

• Biostatistician (J. Klein, PI, Louisville) 10% effort, Pediatric Clinical Proteomics Center, Department of Energy, 2005-2008.

 

• PI  U of L subcontract (E. Voit, PI, Georgia Tech.) 10% effort, The Trehalose Cycle as Paradigm, National Science Foundation, 2006-2009.

 

• Co-I (P. Epstein, PI, Louisville) 10% effort, NIH R01, Podocytes and Oxidative stress in diabetic Kidney, 2006-2007.

 

• Co-PI (K. B. Grant, PI) Brains and Behavior Seed Grant, GSU, $25414, 2005-2006.

 

• Co-PI (I. Weber, PI) Research Program Enhancement Award, GSU, student support, $36000, 2004-2007.

 

• Investigator, Student & Travel support for five years, $75000, Georgia Cancer Coalition (Michael Eriksen PI), 2004-2009.

 

• Statistician (2 months of summer salary), BimCore, Emory University, summer 2004.

 

• Co PI (M. Brinton, PI) Biomedical Computing Center Seed Grant, GSU, $13467, summer 2004.

 

• Consultant on a NSF funded project in Structural Biology (B. C. Wang, PI), University of Georgia, $8087, summer 2003.

• Co-PI (J. Arnold, PI) Genomics and Computational Biology: A REU Site, National Science Foundation, Joint Program between UGA, GA State and Clark Atlanta University, $210,000, 2003-2005.

• Co-PI (G. Chen, PI) Tech Fee Grant , GSU, $58002, 2003.

• PI (no co-PI) Statistical Analysis of Microarray Gene Expression Data, National Science Foundation, $127,671, 2000-2002.

 

• PI (no co-PI), Research Experience for Undergraduates in Fungal Genomics and Computational Biology: GSU VPRSP grant, $18,666, Summer 2001.

• PI (no co PI) A Pilot Project for Developing Statistical Tools for Bioinformatics. GSU faculty initiation grant, $5000, 2000-2001.

 

• Co PI (D. Vidacovic, PI) Instructional Improvement Grant, GSU Center for Teaching & Learning, $5000, 2000-2001.

 

• Co PI (E. Dubinsky, PI) IPCURT Project Course and Curriculum Development, National Science Foundation, $100,000, 1998 - 1999.

Honors/Awards/Press:

 

• Designated as a University Scholar at University of Louisville, June 2010.

 

• Selected to be included in the 2010 (64th Edition) of Marquis Who's Who in America.

 

• Elected Member of International Statistical Institute, October 2007.

 

• Nominated for Provost's Award for exemplary advising, May 2007.

• Appeared on Fox News Atlanta, May 2004.

 

• Featured Research faculty in College of Arts and Sciences, Feb., 2003.

 

• Co-recipient of the CURO Excellence in Undergraduate Research Mentoring Award from University of Georgia, April 2002.

• Press coverage Atlanta Business Chronicle, April, 2002.

• Press coverage Georgia State University Magazine, Fall, 2002.

• NCI travel award for "Workshops for Junior Biostatisticians, 2001 ENAR", Charlotte, N. Carolina.

• Phi Kappa Phi honor society, April 2000.

• Outstanding Junior Faculty Award nomination, Georgia State University, Atlanta, Georgia, April 2000.

 

• NSF Travel Award for IBC98, Cape Town, South Africa, December 1998.

• NSF Travel Award for Pathways to the Future workshop, Dallas, Texas, August 1998.

 

• Student paper award in SRCOS/ASA summer conference, Melbourne, Florida, June 1995.

• Best Theoretical Student Award, Department of Statistics, University of Georgia, Athens, Georgia, 1994.

Presentations:  

 

Invited Talks at Professional/Research Meetings:

 

 

• “Rank aggregation and its use in bioinformatics problems”, The First International Conference on Theory and Application of Statistics, Dhaka, Bangladesh, 26-28 December 2010.

 

• "Rank aggregation and its use in bioinformatics problems", LinStat 2010, Tomar, Portugal, July 27, 2010.

• "Monoisotopic Peak Detection and Disease Classification for Mass Spectrometry Data", ENAR Spring Meeting, New Orleans, March 22, 2010.

 

• "Improved Automated Monoisotopic Peak Detection Method for Mass Spectrometry Data", UT-ORNL-KBRIN Bioinformatics Summit, Lake Barkley State Resort Park, Cadiz, KY, March 19, 2010.

• "Predicting Survival from High Dimensional Data", International Symposium on Stochastic Models in Reliability Engineering, Life Science and Operations Management, February 10, 2010, Beer Sheva, Israel.

 

• “Improved Automated Monoisotopic Peak Detection Method for Mass Spectrometry Data”, Seventh International Triennial Calcutta Symposium, Kolkata, India, Dec. 28, 2009.

 

• “Improved Automated Monoisotopic Peak Detection Method for Mass Spectrometry Data”, International Conference on Frontiers of Interface between Statistics and Sciences, Hyderabad, India, Dec. 31, 2009.

 

• "A Statistical Framework for Differential Network Analysis (DNA) from Microarray Data Using Partial Least Squares", FACSS, October 18-22, 2009, Louisville, KY.

 

• “Monoisotopic Peak Detection and detecting protein-protein interaction”, First Institute of Mathematical Statistics Asia Pacific Rim Meeting, Seoul, June 28-July 1, 2009.

 

• “Reverse Engineering To Construct Protein-Protein Interaction Network”, Joint Statistical Meetings, August 1-7, 2008, Denver, Colorado.

 

• “Construction of Genetic Association Networks”, International Indian Statistical Association Conference: Frontiers of Probability and Statistical Science, May 22-26, 2008, University of Connecticut-Storrs.

 

• “Determination of optimal clustering algorithm by weighted rank aggregation: Cross entropy algorithm”, UT-ORNL-KBRIN Bioinformatics Summit 2008, March 28, 2008, Cadiz, KY.

 

• International Conference on Statistics, Probability and Related Areas by IISA, January 2-5, 2007, Cochin, India.

• International Conference on Multivariate Statistical Methods, Dec 28-29, 2006, Kolkata, India.

 

• “Combining functional information in validation of statistical clustering”, International Multi-Symposiums on Computer and Computational Sciences (IMSCCS|06), June 20-24, 2006, Zhejiang University, Hangzhou, China.

• “Clustering Microarray Data”, UT-ORNL-KBRIN Bioinformatics Summit 2006, April 21-23, 2006, Cadiz, Kentucky.

• “Feature Selection in Mass Spectrometry Data for Cancer Classification”, SCMA 2005 / FIM XII, International Conference on Statistics, Combinatorics, Mathematics and Applications: 12th Annual Conference of the Forum for Interdisciplinary Mathematics, December 2-4, 2005, Auburn University, AL, USA.

• “Selecting an appropriate clustering algorithm for analyzing microarray data”, Joint Annual Meeting of the Interface and the Classification Society of North America, June 8, 2005 - June 12, 2005, Washington University School of Medicine, St. Louis, Missouri.

• International Conference on Future of Statistical Theory, Practice and Education, December 29, 2004 - January 1, 2005, Hyderabad, India.

 

• Eleventh International Conference on Interdisciplinary Mathematical and Statistical Techniques, SCRA 2004, December 27-29, 2004, Lucknow, India.

 

• “Parametric and Nonparametric Empirical Bayes Adjustments to Multiple P-values for the Detection of Differentially Expressed Genes in Microarray Experiments”, Joint Statistical Meeting, August 7-12, 2004, Toronto, Canada.

• “Empirical Bayes Screening of Many P-values with Applications to Microarray Studies”, Microarray Data Analysis Conference arranged by Infocast Inc., June 28-29, 2004, Rockville, MD, USA.

 

• “Empirical Bayes Screening of Many P-values with Applications to Microarray Studies”, International Conference on Statistics in Health Sciences, June 23-25, 2004, Nantes, France.

 

• “Empirical Bayes Analyses of Multiple p-values For the Detection of Differentially Expressed Genes in Microarray Experiments”, IISA 2004 Meeting, May 7-9, 2004, Athens, Georgia, USA.

 

• The Second Asia Pacific Bioinformatics Conference, 18-22 Jan, 2004, Dunedin, New Zealand (full paper).

 

• "Comparisons of clustering algorithms for grouping genes based on expression profiles", SCRA 2002-FIM IX: Ninth International Conference of Forum for Interdisciplinary Mathematics on Statistics Combinatorics and Related Areas, Department of Statistics and Department of Mathematics: University of Allahabad, Allahabad, UP 211 002, India, December 21-23, 2002.

 

• "Statistical Techniques to Analyze Microarray Data: A Partial Overview", Applying Bioinformatics, from Genes to Systems: The University System of Georgia Research Symposium, Georgia State University, Atlanta, GA, October 3-4, 2002.

 

• "Clustering algorithms for microarray data: overview and comparative studies", International Conference on Current Advances and Trends in Nonparametric Statistics, Crete, Greece, July 2002.

 

• "Clustering algorithms for microarray data: overview and comparative studies", IISA International Conference on Statistics, Probability and Related Areas, Dekalb, Illinois, June 2002.

 

• "Use of Partial Least Squares in Microarray Data", International Conference on Statistics, Combinatorics and Related Areas and the Eighth International Conference of the Forum for Interdisciplinary Mathematics, Wollongong, Australia, December 2001.

 

• "Microarray data and bioinformatics: a statistical future?", IISA-JSM-2000-2001 Conference, New Delhi, India, January 2001.

 

• "Microarray data and bioinformatics: a statistical future?", Triennial Calcutta Symposium, Calcutta, India, December 2000.

 

• "Estimation of Selection Parameters Using Multigeneration Cytonuclear Data", Symposium on Inference for Stochastic Processes, University of Georgia, Athens, Georgia, May 2000.

 

• Sixth International Conference of Forum for Interdisciplinary Mathematics on Statistics, Combinatorics and Related Areas, Mobile, Alabama, December 1999.

 

• Pathways to the Future workshop, Dallas, Texas, June 1997.

 

• AMS - IMS - SIAM Joint Summer Research Conference on  Statistics in Molecular Biology, Seattle, Washington, June 1997.

 

• IISA International Conference on  Statistics: IISA98, Hamilton, Canada, October 1998.

 

• Second IMS new researchers' meeting, Kingston, Canada, July 1995.

 

• SRCOS/ASA summer conference, Melbourne, FL, Student paper award, June 1995.

Refereed Talks at Professional/Research Meetings:

 

• "Next Generation Sequencing: Statistical Challenges and Opportunities", CAMDA 2009, Oct 5-6, 2009, Chicago, IL.

 

• "Improved monoisotopic peak detection in mass spectrometry data", MCP 2009,  Tokyo, Japan, March 27, 2009.

 

• "A Statistical Framework for Differential Network Analysis (DNA) from Microarray Data using Partial Least Squares",  Rocky '08, 6th Annual Rocky Mtn Bioinformatics Conference, Snowmass, CO, December 5-8, 2008.

 

• "Fetal Alcohol Syndrome Detected through Proteomics Screening of the Amniotic Fluid in High-Risk (B6J) and Low-Risk (B6N) C57BL/6 Mice", Teratology Society 47th Annual Meeting, Pittsburgh, PA, June 2007.

 

Colloquia, Seminars and Professional Courses:

 

• "High Dimensional Data: A New Paradigm of Biomedical Research". Research Incubation Meeting, SPHIS, University of Louisville, Louisville, Kentucky, 10 March, 2010.

 

• "Bioinformatics in Cancer Research: A Hype or Hope?", JGB Cancer Center, University of Louisville, March 2008.

• Department of Quantitative Health Sciences, Cleveland Clinic, January 2008.

 

• Department of Statistics, Bioinformatics Seminar, Texas A&M University, November 2007.

• National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, April 2007.

 

• Department of Epidemiology, Michigan State University, East Lansing, March 2007.

 

• Department of Statistics, University of Kentucky, Lexington, October 2006.

 

• ASA, Kentucky Chapter, December, 2005.

• Proteomics Center, University of Louisville, Louisville, November 2005.

 

• Department of Statistics and Applied Probability, National University of Singapore, December 2004.

• “Empirical Bayes Screening (EBS) of Many P-values with Applications to Microarray Studies”.  Mathematical Biosciences Institute, Ohio State University. Invited presenter in the workshop titled “Genomics, Proteomics, and Bioinformatics”, October 13th 2004.

 

• Department of Biostatistics, Univ. of Minnesota, April 2004.

• Department of Mathematics, Univ. of North Carolina at Charlotte, April 2003.

• Department of Mathematics, Georgia Institute of Technology,  March 2003.

• Department of Mathematics and Statistics, Georgia State University, March 2003.

• A one-day professional training course on Poisson regression, Centers for Disease Control and Prevention, Atlanta, GA, February 2001.

 

• Department of Biostatistics, University of Alabama, Birmingham, AL, August 2001.

• Department of Statistics, University of Georgia, Athens, GA, April 2001.

 

• Bioinformatics group, Jonathan Arnold Lab, Department of Genetics, University of Georgia, Athens, GA, December 2000.

 

• A two-day professional training course on Smoothing using S+, Centers for Disease Control and Prevention, Atlanta, GA, November 2000.

 

• Computer Science Unit and Mathematical Science Unit, Indian Statistical Institute, Calcutta, India, June 1999.

 

• Department of Mathematics and Statistics, University of North Carolina, Charlotte, April 1999.

• Department of Statistics, Purdue University, April 1999.

• Department of Mathematics and Computer Science, Georgia State University, March 1997.

• Department of Genetics, University of Georgia, December, 1996.

 

• Population Biology Group, Emory University, November, 1995.

 

• Department of Biostatistics, Emory University, October, 1995.

 

• Department of Mathematics & Statistics, University of North Florida, April 1995.

 

Contributed Talks at Professional/Research Meetings:

 

• ENAR Spring Meeting, Pittsburgh, March 2004.

• Digital Biology: The Emerging Paradigm, NIH/BISTI, Bethesda, Maryland, November 2003.

• Joint Statistical Meeting, Baltimore, Maryland, August 1999.

 

• ENAR Spring Meeting, Atlanta, Georgia, March 1999.

 

• 19th International Biometric Conference: IBC-98, Cape Town, South Africa, December 1998.

 

• 18th International Biometric Conference: IBC-96, Amsterdam, The Netherlands, June 1996.

 

• IMS/ENAR meeting, Richmond, Virginia, March 1996.

 

• ASA Winter Conference, Raleigh, North Carolina, January 1995. 

Refereed Posters in Professional/Research Meetings:

 

• "Stochastic Modeling and Statistical Inference of Time Course Metabolic Data". Integrative BioSystems Institute Conference, October 18-21, 2008, Atlanta, Georgia.

 

• "Finding cancer genes through meta-analysis of microarray experiments: Rank aggregation via the cross entropy algorithm". CAMDA 2007, December 2007, Valencia, Spain.

 

• "Selecting a clustering algorithm for statistical consistency and biological relevance for gene expression data". June 27, 2005, ISMB 2005, Detroit, Michigan.

 

• "clValid, an R package for cluster validation". July 2007, ISMB 2007, Vienna, Austria.

Posters in Professional/Research Meetings:

 

• Research Louisville 2005, University of Louisville.

 

Meetings Attended:

 

• RECOMB Satellite Conference on Computational Proteomics 2010, La Jolla, California, March 2010.

 

• Joint Statistical Meeting, Washington DC, August 2009.

 

• Joint Statistical Meeting, Seattle, Washington, August 2006.

 

• International Biometric Society Conference IBC 2006, Montreal, Canada, July, 2006.

 

• Joint Statistical Meeting, San Francisco, California, August 2003.

 

• Joint Statistical Meeting, Atlanta, Georgia, August 2001.

• Workshops for Junior Biostatisticians, 2001 ENAR, Charlotte, March 2001.

 

• Genomics and Medicine Symposium, Emory University, Atlanta, September 2000.

 

• Bioinformatics Conference, Georgia Research Alliance, Atlanta, June 2000 (invited).

 

• Beyond Genome 2000, San Francisco, June 2000.

 

• Joint Statistical Meeting, Dallas, Texas, August 1998.

Graduate Students Direction:

 

PhD (at UofL)

 

• Mourad Atlas, completed Summer 2009 (joined FDA).

 

• Vasyl Pihur (jointly with Somnath Datta), completed Summer 2009 (joined Johns Hopkins as a post-doc).

 

• Juliet Ndukum, Expected Completion, Summer 2012.

 

MS (at UofL) 

 

• Jasmit Shah, Expected completion, Summer, 2011.

 

• Xiaohong Li (jointly with R. Gill), Completed, Spring, 2010.

 

• Becky Patterson (jointly with P. Yoo), Testing the Effects of Predictors Using Data Generated by Non-identity Link Functions of the Single-Index Model: A Monte Carlo Approach, graduated Spring 2008.

 

• Christopher N. Barnes, Feature Selection and Classification in High-throughput Data Analysis, graduated Summer 2007.

MS  (at GSU)

 

• Sylvie Bougi,  A Comparative Power Study of Statistical Tests of Neutrality of DNA Markers Using Multigeneration Cytonuclear Data; graduated Spring 2001.

• Israel Hora, Time Series Analysis of Georgia Employment Data and Future Prediction of Employment Status, graduated Fall 2002.

 

• Ying, Yang (Biology),  non-thesis report, graduated Summer, 2002.

 

• Jennifer Le-Rademacher, Partial Least Squares in Censored Survival Regression,  graduated Summer 2004.

 

• Xiao Hong Zhu, graduated Summer 2004.

 

• Usha Ramakrishnan, graduated Fall 2004.

 

• Mourad Atlas, graduated Spring 2005.

• Lace Depadilla, graduated Spring 2005.

 

• Baofu Ma, graduated Summer 2005.

• Saurav Karmaker, graduated Summer 2005.

 

Teaching:     

 

@  University of Louisville

 

• PHST 675-01: (Designed and Taught) Statistics for Proteomics - Fall 2009.

• PHST 675-01: (Designed and Taught) Stochastic Modelling and Statistical Inference for Time-Course Data - Summer 2009.

 

• PHBI 750 Statistics for Bioinformatics (Designed and Taught). A course for the PhD students.

 

• PHBI 751 High-throughput Data Analysis (Designed and Taught). A course for the PhD students.

 

• PHST 662 Mathematical Statistics. A required course for the MS students.

 

@  Georgia State University

 

• Math 4544 Biostatistics. A split level methods course. Text: Fundamentals of Biostatistics, Fourth edition, by Bernard Rosner.

• Math 1070 Elementary Statistics. An introductory statistics course for undergraduates. Text: The Basic Practice of Statistics, 2nd Edition by David Moore, W. H. Freeman and Company, 1999.

 

• Stat 8440 (taught as Stat 8690) Survival Analysis. Graduate level course on basics of survival analysis. Text: Survival Analysis, Techniques for Censored and Truncated Data by John P. Klein and M. Moeschberger, Springer, New York.

 

• Stat 8540 Advanced Methods in Biostatistics (Designed and Taught). Graduate level course on modern/classical statistical/biostatistical methods including smoothing techniques and data summaries, linear models, generalized linear models, modern nonlinear regression techniques, multivariate statistics, survival analysis using S-Plus. Text: Modern Applied Statistics with S-Plus by Venables and Ripley, 3rd Edn, Springer, New York.

 

• Stat 8050 Statistics for Bioinformatics. A new interdisciplinary course taught in the Fall of 2002.

 

• PERS 2002N A perspective course on World Hunger. Jointly with M. Cody of Nutrition and W. Fritz of Ecology.

 

• Stat 8820 Research in Statistics. An independent studies course leading to a supervised research project.

 

 

Program Development:

 

@  University of Louisville

•       Contributed significantly towards the development of the Biostatistics PhD program

•       Developed Emphasis in Bioinformatics within the Biostatistics PhD program

•       Developed interdisciplinary Bioinformatics PhD program

@  Georgia State University

•       Developed concentration in Biostatistics within the MS program in Mathematics

•       Developed concentration in Bioinformatics within the MS program in Mathematics

•       Developed PhD program in Bioinformatics between the Biology and Mathematics departments

 

Susmita Datta

(Last updated February, 2011) 
