Statistical Computing Series: Mealtime
The Statistical Computing Series hosted by Vanderbilt University Medical Center's Department of Biostatistics features presentations on the implementation of statistical models and methods, statistical computation, and graphics. These informal meetings allow experienced statisticians and developers to share their expertise on computing topics with practitioners across Vanderbilt. On Monday, November 25, 2024, at 1:30pm on Teams, principal biostatistician Josh DeClercq will present "Mealtime: A Shiny app for meal planning." Here is his description of the talk:
Have you ever come home from the grocery store and realize you have no idea what to cook? Do you have stacks of cookbooks that largely go unused? Mealtime is a data-driven approach to meal planning. This presentation will touch on how to deconstruct a recipe into its essential components to derive its "makeability" score. I'll explain how I scale this process to accommodate large numbers of recipes and ingredients and discuss how I leverage data to accommodate dietary restrictions. Finally, I will demonstrate how it all comes together in a Shiny application, sharing some tips and challenges I encountered along the way.
For access to this webinar, contact series organizer Ryan Moore.
Vanderbilt Biostatistics at ENAR 2025 - Invited Preliminary Program
ENAR 2025 has published a preliminary lineup of invited sessions - we congratulate the department members and alumni whose proposals have been accepted! They include:
Assistant professor Gustavo Amorim - speaker, Methodological Considerations for the Design and Analysis of Observational Studies Reliant on Electronic Health Records Data
Professor Benjamin French - organizer/chair, Modern Statistical Challenges of Electronic Health Records Data
PhD candidate Yeji Ko - speaker, Modern Statistical Challenges of Electronic Health Records Data
Alumna Lucy McGowan (PhD 2018) - speaker, Missing Data and Multiple Imputation and Their Applications
PhD student Ashley Mullan - organizer, Collaboration 101: What a Scientist Seeks in a Statistician vs. What a Statistician Seeks in a Scientist
Professor Bryan Shepherd - speaker, Precision in EHR Data: Overcoming Challenges of Measurement Error in Health Outcomes
PhD candidate Jiangmei (Ruby) Xiong - organizer, Collaboration 101: What a Scientist Seeks in a Statistician vs. What a Statistician Seeks in a Scientist
The conference will take place in New Orleans from March 23 through March 26, 2025.
Greenlight study demonstrates effective early intervention in preventing childhood obesity
The study published in JAMA was co-authored by professor Jonathan Schildcrout and principal biostatistician Aihua Bian.
Asthma drug does not speed COVID-19 recovery: study
Dr. Sean Collins: the success of ACTIV-6 “is a testament to the extensive amount of work and expertise behind the scenes by the Vanderbilt Coordinating Center and the Department of Biostatistics at VUMC.”
Blockbuster obesity drugs also may slow kidney disease
Co-authors of this study include senior biostatistician Zhihong Yu, data scientist Hua-Chang Chen, and associate professor Ran Tao.
All-department team publishes paper in Computational and Structural Biotechnology Journal
Congratulations to former visiting student Chia-Jung "Charlene" Chang (PhD candidate in biomedical engineering at National Cheng Kung University), research assistant professor Chih-Yuan Hsu, and professors Qi Liu and Yu Shyr on the publication of "VICTOR: Validation and inspection of cell type annotation through optimal regression." The article on this new method appeared online ahead of print on October 15 and will go to press as part of Computational and Structural Biotechnology Journal's December issue. As described in the paper's abstract:
Single-cell RNA sequencing provides unprecedent opportunities to explore the heterogeneity and dynamics inherent in cellular biology. An essential step in the data analysis involves the automatic annotation of cells. Despite development of numerous tools for automated cell annotation, assessing the reliability of predicted annotations remains challenging, particularly for rare and unknown cell types. Here, we introduce VICTOR: Validation and inspection of cell type annotation through optimal regression. VICTOR aims to gauge the confidence of cell annotations by an elastic-net regularized regression with optimal thresholds. We demonstrated that VICTOR performed well in identifying inaccurate annotations, surpassing existing methods in diagnostic ability across various single-cell datasets, including within-platform, cross-platform, cross-studies, and cross-omics settings.
Figure 1 in the paper provides an " One example in diagnosing the reliability of cell annotations. a) diagnostic performance of singleR, scmap, SCINA, scPred, CHETAH, scClassify, and Seurat. b) diagnostic performance of VICTOR when applied to annotations from singleR, scmap, SCINA, scPred, CHETAH, scClassify, and Seurat.
New evaluation system to render AI chatbots safe, empathetic
Alumnus Zhijun Yin (MS, 2017) is co-PI of this ARPA-H funded project, and professor Bradley Malin is a member of the team.
Vanderbilt Biostatistics at AMIA 2024
The 2024 AMIA (American Medical Informatics Association) Annual Symposium will take place in San Francisco from November 9 through November 13. Department members with work to be presented at the symposium include:
Saturday, November 9
Workshop 17, "REDCap on FHIR: Implementing and Using Clinical Data Interoperability Services" - professor Paul Harris, co-instructor/author
Sunday, November 10
Workshop 27, "Advancing Biomedical Research Using Multi-omics Data in the All of Us Researcher Workbench," 8:30 am - co-authored by Paul Harris
Session 7, "Pediatric Health Informatics - Kid Coders," 3:30 pm
"Revealing Patterns of Child Maltreatment Policy Differences and Demographic Dynamics using BERT-Networks and Clustering Approach" - co-authored by associate professor Rameela Raman
Monday, November 11
Session 17, "LIEAF: Artificial Intelligence and Data Science in Health Informatics Education," 8:30 am
Enhancing Causes of Death Prediction from Electronic Health Records through Multi-Modal Integration of Structured and Unstructured EHR Data - co-authored by professor Michael Matheny
Session 22, "AI Fairness and Ethics - Justice League," 8:30 am
- "Fairness of AI Collaboration and Suppression in Emergency Triage" - co-authored by professor Bradley Malin
- "Enhancement of Fairness in AI for Chest X-ray Classification" - co-authored by Bradley Malin
Session 53, "Utilization Data and Data Utilization - Auditory Audits, Listening to the Data," 3:30 pm
"Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs" - co-authored by Bradley Malin
Session 54, "Patient Generated Data - Organic Certified," 3:30 pm
"Examining Oral Anti-Cancer Medication Continuation Using Questionnaires, Prescription Refills, and Structured Electronic Health Records" - co-authored by professor Qingxia Chen, Bradley Malin, and alumnus Zhijun Yin (MS 2017)
Poster session 1, 5:00 pm
P114: "Machine Learning Methods for Estimating Gestational Age at Birth from Electronic Health Records" - co-authored by professor Leena Choi
P118: "Large Language Models Enhance the Identification of Emergency Department Visits for Symptomatic Kidney Stones" - co-authored by PhD candidate Siwei Zhang and assistant professor Yaomin Xu
P171: "Comparing EHR-recorded Race/Ethnicity to Self-reported Race/Ethnicity: Insights from the All of Us Research Program" - coauthored by Xiaoke (Sarah) Feng (first author), biostatistician Andrew Guide, assistant in biostatistics Shawn Garbett, and Qingxia Chen
Tuesday, November 12
Session 98, "Wearable Sensor Data - Data on the Go," 3:30 pm
"'I worry we’ll blow right by it': Barriers to Uptake of the STRATIFY CDSS for ED Discharge in Acute Heart Failure" - co-authored by associate professor Dandan Liu
P05: "Utilizing Large Language Models (LLM) to Optimize Domain-Specific Natural Language Processing (NLP) for Identifying Patients with No Reason for Not Prescribing ACEI/ARB in Chronic Kidney Disease (CKD) Management" - co-authored by Michael Matheny
P27: "Assessing ChatGPT Responses to Alzheimer’s Disease Myths" - co-authored by Bradley Malin and Zhijun Yin
P117: "Algorithmic Matching of Unique Device Information to Electronic Health Record Data" - co-authored by Michael Matheny
P178: "A Study of Challenges In Algorithmic Transportability Between VHA Sites" - co-authored by Michael Matheny
P188: "Real-Time Automated Billing for Tobacco Treatment: A CDS Hook Approach for Simulating Clinician Facing Coding Prompts Within EHRs" - co-authored by Michael Matheny
Wednesday, November 13
Session 102, "Self-Service Software Tools for Clinical and Translational Research: Rationale, Benefits, Limitations, Challenges, and the Future," 8:00 am - Paul Harris, speaker
Updated 11.11.2024 to include P01.
Vanderbilt Biostatistics at WSDS 2024
The 2024 Women in Statistics and Data Science Conference is underway in Reston, Virginia, from October 16 through 18. We are proud of the department members and alumni involved with this year's meeting. They include:
MS student Zongyue Teng
- First and presenting author of "Going for gold: Using record linkage and Bayesian hierarchical modeling to select winning gymnasts at the 2024 Paris Olympics" (speed session Wednesday, poster Thursday; graphic via WakeForestStats)
Sarah Lotspeich (PhD 2021)
- Co-author of "Quantifying the impact of measurement error on health disparities models" (speed session Wednesday, poster Thursday)
- Organizer of and speaker in "Mastering Data: Insights into Master's Degrees in Statistics and Analytics" (panel, Thursday)
- Organizer of "More than Statistics: Improving Maternal and Infant Health with Data" (invited session, Thursday)
- Co-author of "Adjusting for covariate misclassification to quantify the relationship between diabetes and local access to healthy food" (speed session 3, Thursday)
- Co-organizer of and speaker in "Statistical Methods For HIV Research: Battling An Epidemic With Linked, Missing, And Error-prone Data" (invited session, Thursday)
- Panelist for "Statistical Storytelling: Insights Into Effective Presentation Strategies" (Friday)
- Organizer of "Cause For Celebration: Adapting Causal Inference Methods For Challenging Datasets" (invited session, Friday)
PhD student Ashley Mullan
- First and presenting author of "Adjusting for covariate misclassification to quantify the relationship between diabetes and local access to healthy food" (speed session and poster, Thursday)
- Chair of and panelist for "Statistical Storytelling: Insights Into Effective Presentation Strategies" (Friday)
Lead biostatistician Amy Perkins
- First and presenting author of "Machine Learning Model Robustness and Performance Stability in Future Years when Predicting Adverse Events in a Veteran Population and a Diabetic Subpopulation" (speed session and poster, Thursday). Co-authors include assistant professor Amber Hackstadt and professor Michael Matheny.
Lucy D'Agostino McGowan (PhD 2018):
- Speaker in "Statistical Methods for Missing Data Imputation" (panel, Thursday)
- Co-organizer of "Statistical Methods For HIV Research: Battling An Epidemic With Linked, Missing, And Error-prone Data" (invited session, Thursday)
- Panelist for "Statistical Storytelling: Insights Into Effective Presentation Strategies" (Friday)
- Speaker in "Cause For Celebration: Adapting Causal Inference Methods For Challenging Datasets" (invited session, Friday)
Statistical Computing Series: Intro to GitHub
The Department of Biostatistics' Statistical Computing Series focuses on the implementation of statistical models and methods, statistical computation and graphics. These informal meetings allow experienced statisticians and developers to share their expertise on computing topics with practitioners across Vanderbilt. On Thursday, October 31, at 1:00 pm, application developer Savannah Obregon will present "Introduction to GitHub," on Microsoft Teams:
Use GitHub for seamless collaboration and robust version control in your projects. This presentation will guide you through the essential features and best practices for using GitHub in a team setting. Learn how to manage repositories, branches, pull requests, and issues to streamline your workflow.
For an example of what's possible on GitHub, see Obregon's own website: https://smobregon.github.io. She was recently named winner of the department's IT Innovation Award and has delivered presentations at conferences such as R/Medicine.
To obtain a link to this webinar, contact series organizer Ryan Moore.