英文字典中文字典


英文字典中文字典51ZiDian.com



中文字典辞典   英文字典 a   b   c   d   e   f   g   h   i   j   k   l   m   n   o   p   q   r   s   t   u   v   w   x   y   z       







请输入英文单字,中文词皆可:


请选择你想看的字典辞典:
单词字典翻译
369143查看 369143 在百度字典中的解释百度英翻中〔查看〕
369143查看 369143 在Google字典中的解释Google英翻中〔查看〕
369143查看 369143 在Yahoo字典中的解释Yahoo英翻中〔查看〕





安装中文字典英文字典查询工具!


中文字典英文字典工具:
选择颜色:
输入中英文单字

































































英文字典中文字典相关资料:


  • Continuous reinforcement learning via advantage value difference reward . . .
    To address the challenge posed by sparse reward structures in continuous control tasks, this paper proposes the advantage value difference reward shaping framework
  • Continuous reinforcement learning via advantage value difference reward . . .
    In the context of engineering applications, this paper proves the superiority of AVD in continuous control tasks within the multi-joint dynamics with contact (MuJoCo) environment
  • Continuous reinforcement learning via advantage value difference reward . . .
    In the context of engineering applications, this paper proves the superiority of AVD in continuous control tasks within the multi-joint dynamics with contact (MuJoCo) environment
  • Weizhi Xian - dblp
    Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng, Zhaowei Shang, Mingliang Zhou: Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective
  • Improving reward shaping in Deep RL for avoiding user’s biases and . . .
    In the context of engineering applications, this paper proves the superiority of AVD in continuous control tasks within the multi-joint dynamics with contact (MuJoCo) environment
  • Search | OpenReview
    Results for " ~Leong_Hou_U2 " Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng, Zhaowei Shang, Mingliang Zhou Published: 31 Dec 2024, Last Modified: 24 May 2026 Eng Appl Artif Intell 2025 Readers
  • Xuekai Wei - dblp
    Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng, Zhaowei Shang, Mingliang Zhou: Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective
  • 检索结果 · 重庆大学机构知识库
    In the context of engineering applications, this paper proves the superiority of AVD in continuous control tasks within the multi-joint dynamics with contact (MuJoCo) environment
  • Jiawei Lin - researchr alias
    Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng 0002, Zhaowei Shang, Mingliang Zhou eaai, 151:110676, 2025 [doi]





中文字典-英文字典  2005-2009