[ICML 2024] “Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles”

Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brain M. Sadler, Amrit Singh Bedi, Dinesh Manocha.

[ICML 2023] “Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic.”

Wesley A. Suttle, Amrit Singh Bedi, Bhrij Patel, Alec Koppel, Brain M. Sadler, Dinesh Manocha.

[JoQC 2022]“In pursuit of interpretable, fair and accurate machine learning for criminal recidivism prediction”

Caroline Wang, Bin Han, Bhrij Patel, Cynthia Rudin

Preprints

“Embodied Question Answering via Multi-LLM Systems”

Bhrij Patel, Vishnu Sashank Dorbala, Dinesh Manocha, Amrit Singh Bedi.

“Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals”

Vishnu Sashank Dorbala,Bhrij Patel, Amrit Singh Bedi, Dinesh Manocha.

“Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation”

Bhrij Patel, Kasun Weerakoon, Wesley A. Suttle, Alec Koppel, Brian M. Sadler, Tianyi Zhou, Amrit Singh Bedi, Dinesh Manocha.