Tanmay Gangwani

I am a principal applied scientist at Microsoft, where I build collaborative AI agents that power intelligent experiences in Microsoft Teams.

Previously, I was at Amazon, where I worked on architectures and methods for personalized search, product ranking, and recommendation at scale. I hold a Ph.D. from the University of Illinois Urbana-Champaign, where my research focused on applied reinforcement learning and imitation learning algorithms (thesis), and an M.S. from the same institution, where I worked at the intersection of computer architecture and compilers (thesis).

LinkedIn  /  Google Scholar  /  X  /  Email  


Publications

Publications through 2022; for some work beyond that, see Google Scholar
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan, Jian Peng, Tanmay Gangwani
International Conference on Learning Representations (ICLR), 2022
[code] [video]
Imitation Learning from Observations under Transition Model Disparity
Tanmay Gangwani, Yuan Zhou, Jian Peng
International Conference on Learning Representations (ICLR), 2022
Learning Guidance Rewards with Trajectory-space Smoothing
Tanmay Gangwani, Yuan Zhou, Jian Peng
Conference on Neural Information Processing Systems (NeurIPS), 2020
[code] [poster] [video]
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Tanmay Gangwani, Jian Peng, Yuan Zhou
Conference on Robot Learning (CoRL), 2020
[code] [slides] [video]
State-only Imitation with Transition Dynamics Mismatch
Tanmay Gangwani, Jian Peng
International Conference on Learning Representations (ICLR), 2020
[code] [slides] [video]
Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Michael Wan, Tanmay Gangwani, Jian Peng
The Conference on Uncertainty in Artificial Intelligence (UAI), 2020
[slides] [video]
Learning Belief Representations for Imitation Learning in POMDPs
Tanmay Gangwani, Joel Lehman, Qiang Liu, Jian Peng
The Conference on Uncertainty in Artificial Intelligence (UAI), 2019
[code]
Learning Self-Imitating Diverse Policies
Tanmay Gangwani, Qiang Liu, Jian Peng
International Conference on Learning Representations (ICLR), 2019
[code] [slides] [poster]
Policy Optimization by Genetic Distillation
Tanmay Gangwani, Jian Peng
International Conference on Learning Representations (ICLR), 2018
[poster]
Distributed and Secure ML using Self-tallying Multi-party Aggregation
Yunhui Long*, Tanmay Gangwani*, Haris Mughees, Carl Gunter
(* denotes equal contribution)
NeurIPS workshop on Privacy Preserving Machine Learning (PPML), 2018
[code] [slides]
Architectural Support for Relaxed Concurrent Priority Queueing in Chip Multiprocessors
Azin Heidarshenas*, Tanmay Gangwani*, Serif Yesil, Adam Morrison, Josep Torrellas
(* denotes equal contribution)
International Conference on Supercomputing (ICS), 2020
[slides]
Breaking Serialization in Lock-Free Multicore Synchronization
Tanmay Gangwani, Adam Morrison, Josep Torrellas
Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2016
[poster] [slides]

Website design borrowed from here