Below is a distillation of my professional journey. You can also find my resume here.
My work at Amazon can be divided into 3 separate tracks:
I have also co-mentored several strong PhD interns. Our previous interns include Martin Klissarov, Zuxin Liu, Jesse Zhang, and Ming Yin.
While at Brown, I had the pleasure of studying the fundamentals of RL with one of the most elite RL intellectuals, Michael Littman. While working with Michael, I studied the importance of smoothness in RL ingredients, such as softmax operators, transition models, and value-function architectures.
As a PhD student, I also did two internships at MSR, where I primarily worked with Jason Williams, a pioneer in dialog systems. Jason and I mainly explored the application of RL to dialog agents and language models.
I learned the fundamentals of RL with function approximation under the founder of modern RL, Rich Sutton. I also worked closely with Rich’s then post-doc Joseph Modayil. My work focused primarily on usefully combining model-based and model-free RL. Together, Rich, Joseph, and I proposed the Cascade Architecture.
I learned the basics of computer science and quickly developed a strong interest in AI. At the time, learning with supervision somehow felt like cheating to me (because I thought it put too much of a burden on the human expert to provide supervision). In sharp contrast, the RL framework felt very natural to me. This led me to study the RL book as a sophomore. Having finished the book, I wrote to Rich asking him to take me as his student, and the rest is history! I also somehow made it into the Errata and Notes of the 1st edition of the RL book.