Beyond Exponentially Fast Mixing in Average-Reward RL via Multi-Level Monte Carlo Actor-Critic

Wesley A. Suttle*, Amrit Singh Bedi*, __Bhrij Patel__, Alec Koppel, Brian M. Sadler, Dinesh Manocha. "Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic." Proceedings of the 40th International Conference on Machine Learning (2023).

In pursuit of interpretable, fair and accurate machine learning for criminal recidivism prediction

Caroline Wang*, Bin Han*, __Bhrij Patel__, Cynthia Rudin (2022). "In pursuit of interpretable, fair and accurate machine learning for criminal recidivism prediction." Journal of Quantitative Criminology. https://link.springer.com/article/10.1007/s10940-022-09545-w

Preprints

Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation

__Bhrij Patel__, Kasun Weerakoon, Wesley A. Suttle, Alec Koppel, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha.