DiJia Andy Su received his ML PhD from Princeton.
Andy was advised by Vincent Poor and John Mulvey, and he also worked closely with Jason Lee and Chi Jin.
During his time at Google, Andy worked with Dale Schuurmans and Craig Boutilier.
Selected Works
Shibo Hao, Sainbayar Sukhbaatar, DiJia Andy Su, Xian Li, Zhiting Hu, Jason Weston, Yuandong Tian
Training Large Language Models to Reason in a Continuous Latent Space
Preprint. arXiv.
[1] DiJia Andy Su, Sainbayar Sukhbaatar, Michael Rabbat, Yuandong Tian, Qinqing Zheng
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
Preprint. arXiv.
[2] Lucas Lehnert, Sainbayar Sukhbaatar, DiJia Andy Su, Qinqing Zheng, Paul Mcvay, Michael Rabbat, Yuandong Tian
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Conference on Language Modeling (COLM) 2024
[3] DiJia Andy Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier
ConQUR: Mitigating Delusional Bias in Deep Q-Learning
International Conference on Machine Learning (ICML) 2021
- Note: Follow-up work to Non-delusional Q-learning (NeurIPS 2018 Best Paper)
[4] DiJia Andy Su, Bertrand Douillard, Rami Al-Rfou, Cheol Park, Benjamin Sapp
Narrowing the Coordinate-Frame Gap in Behavior Prediction Models: Distillation for efficient and accurate scene-centric motion forecasting (training 15B model)
International Conference on Robotics and Automation (ICRA) 2022
- Note: Won 1st Place at Open Motion Prediction Challenge, as of Sep 1
[5] DiJia Andy Su, Jason D Lee, John M Mulvey, H Vincent Poor
Competitive Multi-Agent Reinforcement Learning with Self-Supervised Representation
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
[6] DiJia Andy Su, Jason D Lee, John M Mulvey, H Vincent Poor
Under Review at ICML.
[7] DiJia Andy Su, DF Su, Jason D Lee, John M Mulvey, H Vincent Poor
Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies
IEEE Transactions on Artificial Intelligence
[8] DiJia Andy Su, John M Mulvey, H Vincent Poor
Improving Portfolio Performance via Natural Language Processing Methods
Journal of Financial Data Science
[9] DiJia Andy Su*, Zihan Ding*, Qinghua Liu, Chi Jin
Under review.
[10] DiJia Andy Su, et al.
Machine Learning Optimization with Crypto Currency
Under Review at INFORMS Journal of Operations Research
[11] DiJia Andy Su, SA Mousavifar, Cyril Leung
Secrecy capacity and wireless energy harvesting in amplify-and-forward relay networks
IEEE Pacific Rim Conference on Communications, Computers and Signal Processing
- Note: Won best paper award.
[12] Yunfei Zhang, Clarence W de Silva, DiJia Andy Su, Youtai Xue
International Conference on Computer Science (ICCSE)
- Note: Won best paper award.
Patents:
Tyler Lu, Jayden Ooi, Craig Boutilier, Dale Schuurmans, DiJia Andy Su
Mitigating delusional bias in deep Q-learning for robotic and/or other agents
Patent Number: US20220101111A1
Bertrand Douillard, DiJia Andy Su
Training agent trajectory prediction neural networks using distillation
Patent Number: US20230082079A1