action policy短语⁶²³⁹⁹ 基本例句 🌏动作策略;行动方针;实施方针 The agent's fuzzy reward is proposed under the fuzzy knowledge of different decision goals, and a gradient learning algorithm is described to learn the agent's action policy under fuzzy reward. 通过建立代理决策目标的模糊知识,我们给出了基于模糊收益的多代理决策模型,并研究了基于梯度的代理策略学习算法。 cnki