Tanmay Gangwani
I am an Applied Scientist at Amazon, working towards solving various user-centric problems via the application of machine learning concepts. I enjoy designing solutions that leverage academic and industrial research in the areas of reinforcement learning, deep learning, and natural language processing.
I received my Ph.D. in Computer Science from the University of Illinois, Urbana Champaign, where I was advised by Prof. Jian Peng. My Ph.D. thesis focused on applied algorithms for deep reinforcement learning (RL). Concretely, I studied approaches that utilize expert demonstrations for RL (imitation learning), address the issues of exploration and sparse environmental rewards, and improve sample efficiency with the transfer-RL and meta-RL paradigms.
Before this, my research was at the intersection of computer architecture and compilers. I earned my Master's degree from the same school, advised by Prof. Josep Torrellas.
Email  / 
CV  / 
Google Scholar  / 
Twitter  / 
Github  
|
|
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan,
Jian Peng,
Tanmay Gangwani
International Conference on Learning Representations (ICLR), 2022
|
Imitation Learning from Observations under Transition Model Disparity
Tanmay Gangwani,
Yuan Zhou,
Jian Peng
International Conference on Learning Representations (ICLR), 2022
|
Learning Guidance Rewards with Trajectory-space Smoothing
Tanmay Gangwani,
Yuan Zhou,
Jian Peng
Conference on Neural Information Processing Systems (NeurIPS), 2020
code /
poster /
bibtex
|
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Tanmay Gangwani,
Jian Peng,
Yuan Zhou
Conference on Robot Learning (CoRL), 2020
code /
slides /
video /
bibtex
|
State-only Imitation with Transition Dynamics Mismatch
Tanmay Gangwani,
Jian Peng
International Conference on Learning Representations (ICLR), 2020
code /
slides /
video /
bibtex
|
Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Michael Wan,
Tanmay Gangwani,
Jian Peng
The Conference on Uncertainty in Artificial Intelligence (UAI), 2020
slides /
video /
bibtex
|
Learning Belief Representations for Imitation Learning in POMDPs
Tanmay Gangwani,
Joel Lehman,
Qiang Liu,
Jian Peng
The Conference on Uncertainty in Artificial Intelligence (UAI), 2019
code /
bibtex
|
Learning Self-Imitating Diverse Policies
Tanmay Gangwani,
Qiang Liu,
Jian Peng
International Conference on Learning Representations (ICLR), 2019
code /
slides /
poster /
bibtex
|
Policy Optimization by Genetic Distillation
Tanmay Gangwani,
Jian Peng
International Conference on Learning Representations (ICLR), 2018
poster /
bibtex
|
Distributed and Secure ML using Self-tallying Multi-party Aggregation
Yunhui Long*,
Tanmay Gangwani*,
Haris Mughees,
Carl Gunter
(* denotes co-first authorship)
NeurIPS workshop on Privacy Preserving Machine Learning (PPML), 2018
code /
slides /
bibtex
|
Architectural Support for Relaxed Concurrent Priority Queueing in Chip Multiprocessors
Azin Heidarshenas*,
Tanmay Gangwani*,
Serif Yesil,
Adam Morrison,
Josep Torrellas
(* denotes co-first authorship)
International Conference on Supercomputing (ICS), 2020
slides /
bibtex
|
Breaking Serialization in Lock-Free Multicore Synchronization
Tanmay Gangwani,
Adam Morrison,
Josep Torrellas
Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2016
poster /
slides /
bibtex
|
Partial Redundancy Elimination using Lazy Code Motion
Sandeep Dasgupta,
Tanmay Gangwani
Technical Report, 2015
code /
bibtex
|
Template imitation-learning (pun intended) using this!
|
|