Zibin Dong / 董子斌

Intern at Huawei Noah's Ark Lab. Master's student at TJU Deep Reinforcement Learning Lab.

photo.jpg

I am a master’s student at the TJU Deep Reinforcement Learning Lab, advised by Prof. Jianye Hao and co-advised by Prof. Yan Zheng. I received my Bachelor’s degree from the Harbin Institution of Technology (Shenzhen) (HITsz) in 2023, advised by Prof. Qingbin Gao.

My research interests primarily focus on developing general embodied agents for decision-making. To accomplish this goal, my current research encompasses Reinforcement Learning (RL): Offline RL, Model-based RL, RL from Human Feedback (RLHF), Diffusion Models for decision-making, and LLM agents. I aspire to leverage the transformative power of AI in shaping the world. I am highly self-motivated and possess a strong passion for AI research. I am constantly seeking opportunities for research collaborations, research interns, and PhD. You can find detailed information about me in my CV. Feel free to contact me at zibindong@outlook.com or wechat dongzibin1112.

news

Sep 26, 2024 🔥 Three papers accepted by NeurIPS 2024: “CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making”, “DiffuserLite: Towards Real-time Diffusion Planning”, “PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation”
Two papers accepted by NeurIPS 2024 Workshops: “Self-Supervised Bisimulation Action Chunk Representation for Efficient RL”, “A Method on Searching Better Activation Functions”
Jun 03, 2024 One papers accepted by ICML 2024: “KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations”
Jan 15, 2024 Two papers accepted by ICLR 2024: “Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model”, “Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback”
Aug 05, 2023 One paper accepted by CIKM 2023: “A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving”

selected publications

  1. dong2024cleandiffuser.png
    CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
    In The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track, NeurIPS , 2024
  2. dong2024diffuserlite.png
    DiffuserLite: Towards Real-time Diffusion Planning
    Zibin Dong , Jianye Hao , Yifu Yuan , and 4 more authors
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, NeurIPS , 2024
  3. dong2024aligndiff.png
    AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
    Zibin Dong , Yifu Yuan , Jianye HAO , and 7 more authors
    In The Twelfth International Conference on Learning Representations, ICLR , 2024
  4. ni2024peria.png
    PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, NeurIPS , 2024
  5. kou2024kisa.png
    KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations
    Longxin Kou , Fei Ni , YAN ZHENG , and 4 more authors
    In Forty-first International Conference on Machine Learning, ICML , 2024
  6. yuan2024unirlhf.png
    Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
    Yifu Yuan , Jianye HAO , Yi Ma , and 6 more authors
    In The Twelfth International Conference on Learning Representations, ICLR , 2024
  7. liang2023hierarchical.png
    A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving
    Hebin Liang , Zibin Dong , Yi Ma , and 3 more authors
    In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM , 2023
  8. yuan2024moduli.png
    MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
    Yifu Yuan , Zhenrui Zheng , Zibin Dong , and 1 more author
    In arXiv preprint arXiv:2408.15501 , 2024