Career Profile
I received my bachelor's and master's degrees from Harbin Institute of Technology. My bachelor's thesis was on model-based MARL and was supervised by Prof. Dibangoye. My research interests lie in reinforcement learning, particularly multi-agent RL.
I am currently working at Tencent as a reinforcement learning engineer.
News
The paper "Correcting Biased Value Estimation in Mixing Value-Based Multi-Agent Reinforcement Learning by Multiple Choice Learning" was accepted by Engineering Applications of Artificial Intelligence (IF: 7.802, Q1).
Joined Tencent as a full-time reinforcement learning engineer.
The paper "Multi-level credit assignment for cooperative multi-agent reinforcement learning" was accepted by Applied Sciences (IF: 2.838, Q2).
Joined Netease Fuxi AI lab as an intern.
Joined Parametrix.ai, a great company that focuses on applying AI in games.
The paper "Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing" was accepted by ICML 2020.
Submitted a paper to ICML 2020.
Joined CITI-Lab at INRIA and INSA de Lyon to study MARL under the supervision of Prof. Dibangoye.
Paper accepted at IIHMSP in Jilin, China.
Publications
Engineering Applications of Artificial Intelligence, 2022
Applied Sciences, 2022
IIHMSP/FITAT, 2019
Education
Experiences
- Proposed the belief occupancy state as a summary that recasts Dec-POMDPs under one-sided information sharing as a boMDP, which is itself an MDP.
- Implemented belief-occupancy-state heuristic search and value iteration algorithms to solve the boMDP.
- Applied linear programming and tabular methods to improve scalability.
- Quantized the floating-point data of deep learning networks to 16 or 8 bits in Caffe.
- Applied KL divergence to reduce the loss caused by 8-bit quantization to just 1.5 for MobileNet-SSD.
- Verified the 16-bit quantization scheme on an FPGA.
- Designed and trained a deep learning network for pneumonia diagnosis in Caffe and implemented it on Zynq.
- Won 2nd Place in the 16th Challenge Cup and Silver Award in the 9th Zuguang Cup.
- Manufactured and debugged a control module for a wireless charging system.
