Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2
Published in arXiv, 2020
Recommended citation: Jiajun Fan, He Ba, Xian Guo, Jianye Hao, "Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning." arXiv, 2020. https://arxiv.org/abs/2011.06752
Published in the proceedings of AAAI-22 Workshop on Reinforcement Learning in Games, 2021
Recommended citation: Jiajun Fan, "A Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges, and Solutions." In the proceedings of AAAI-22 Workshop on Reinforcement Learning in Games, 2021. https://arxiv.org/abs/2112.04145.html
Published in arXiv, 2021
Recommended citation: Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng, "An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning." arXiv, 2021. https://arxiv.org/abs/2106.00707
Published in arXiv, 2021
Recommended citation: Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng, "CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation." arXiv, 2021. https://arxiv.org/abs/2105.03923
Published in the proceedings of AAAI-22 Workshop on Reinforcement Learning in Games, 2021
Recommended citation: Jiajun Fan, Changnan Xiao, Yue Huang, "GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning." In the proceedings of AAAI-22 Workshop on Reinforcement Learning in Games, 2021. https://arxiv.org/abs/2106.06232
Published in the proceedings of Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
Recommended citation: Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng, Haiyan Yin, "CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration." In the proceedings of Deep Reinforcement Learning Workshop NeurIPS 2022, 2022. https://arxiv.org/abs/2105.03923.html
Published in arXiv, 2022
Recommended citation: Hao Wang, Zhichao Chen, Jiajun Fan, Yuxin Huang, Weiming Liu, Xinggao Liu, "Entire Space Counterfactual Learning: Tuning, Analytical Properties and Industrial Applications." arXiv, 2022. https://doi.org/10.48550/arXiv.2210.11039
Published in the proceedings of International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, 2022
Recommended citation: Jiajun Fan, Changnan Xiao, "Generalized Data Distribution Iteration." In the proceedings of International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, 2022. https://proceedings.mlr.press/v162/fan22c.html
Published:
This is a description of your talk, which is a markdown file that can be markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk. Note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.