I am currently a postdoctoral fellow in computer science at Harvard SEAS, advised by Prof. Finale Doshi-Velez. We are actively exploring applications of machine learning to clinical medicine, especially combination therapies for major depression and interventions in the Intensive Care Unit (ICU).
Our recent methods cover two exciting areas of core ML research: (1) Semi-supervised learning: We have new objectives for training semi-supervised latent variable models that can simultaneously discover disease subtypes and suggest useful treatments. (2) Explainable AI: Our upcoming AAAI '18 paper shows how to optimize deep neural networks to have more interpretable decision boundaries, especially for clinical tasks.
I completed my Ph.D. in computer science at Brown University in May 2016, advised by Prof. Erik Sudderth. My thesis studied large-scale unsupervised clustering problems like organizing every New York Times article from the last 20 years or segmenting the human genome to find patterns in the epigenetic modifiers that amplify or inhibit expression. My technical focus was developing reliable non-convex optimization algorithms for a broad family of Bayesian nonparametric models that include mixtures, topic models, sequential models, and relational models. We have released an open-source Python package called BNPy. Please try it out!
I am actively looking for tenure-track faculty positions this season (2017-18). Please reach out if you have questions.
[Jan 2018] Paper accepted to AISTATS 2018.
Our paper -- Semi-Supervised Prediction-Constrained Topic Models (PDF coming soon) -- describes a new framework for training topic models and other latent variable models to improve *supervised* predictions while still providing good generative models with interpretable topics. The new approach fixes core issues with past methods like sLDA, and shines especially in semi-supervised tasks, when only a small fraction of training examples are labeled.
[Dec 2017] Presenting at NIPS 2017 Workshops
• Poster and Talk (by Mike Wu): "Optimizing deep models with tree-regularization" at the Transparent and Interpretable ML workshop (NIPS TIML 2017) (will also appear in AAAI '18)
[Nov 2017] Paper accepted to AAAI 2018.
Our paper describes a new regularization method to optimize recurrent neural networks to have more interpretable decision boundaries (closer to the decision trees that clinicians like).
[Nov 2017] Invited talk at MIT Lincoln Laboratory
"Optimizing Machine Learning Models for Clinical Interpretability"
Slides: [slides.pdf, 5 MB]
[Sep 2017] Organizing Machine Learning for Health (ML4H) workshop at NIPS 2017
Please submit some awesome papers!
[Mar 2017] Presented paper on ICU intervention prediction at AMIA CRI '17
Nominated for Clinical Informatics Research Award (1 of 7 nominees)
[Feb 2017] Invited talk on BNPy at Boston Bayesians meetup
[Dec 2016] BNPy software tutorial at NIPS 2016 workshop
New: BNPy project website with example gallery
[Dec 2016] Posters at NIPS 2016 Workshops
[Sep 2016] Organizing Workshop at NIPS '16: Practical Bayesian Nonparametrics
Please consider submitting to our workshop: https://sites.google.com/site/nipsbnp2016/
[Aug 2016] Started post-doc at Harvard
You can now find me at my new office in Maxwell-Dworkin (MD 209).
[May 2016] Successful Ph.D. defense!
Many thanks to family and friends who supported me along the way.
[Jan 2016] Invited talks on my thesis.
I visited several research groups at Northeastern, U. Washington, and MIT to discuss results from my thesis work on making variational inference for clustering effective at the scale of millions of examples. [slides PDF] [slides PPTX]
[Dec 2015] Invited talk at NIPS 2015 workshop.
I gave an invited talk at the Bayesian Nonparametrics: The Next Generation workshop about my thesis work building effective variational inference for models based on the Dirichlet process and its hierarchical variants. [slides PDF]
[Sep 2015] Paper accepted at NIPS 2015.
Our paper [PDF] describes a new algorithm for Bayesian nonparametric hidden Markov models that can handle hundreds of sequences and add or remove hidden states during a single training run.
[May 2015] Paper accepted at AISTATS 2015.
Our paper [PDF] describes a new algorithm for topic models that can effectively remove redundant or junk topics during a single training run.