深度Q-RBF网络下的瓶装食品装箱机械臂无碰轨迹规划

doi:10.3969/j.issn.1001-3881.2023.05.014

首页 > 过刊浏览>2023年第51卷第5期 >89-95. DOI:10.3969/j.issn.1001-3881.2023.05.014

深度Q-RBF网络下的瓶装食品装箱机械臂无碰轨迹规划
DOI:
                        10.3969/j.issn.1001-3881.2023.05.014
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金青年科学基金项目(62001198)；甘肃省青年科技基金计划(20JR10RA186;21JR7RA247)

Collision-Free Trajectory Planning for Manipulator Based on Deep Q-RBF Reinforcement Learning Network

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

由于机械臂存在高度的灵活性，可模拟人类手臂完成易碎瓶装食品的装箱工作，并实时矫正机械臂轨迹规划所存在的误差，提升稳定性与精准度。分析装箱机械臂的基本架构，提出基于深度Q-RBF强化学习网络的机械臂无碰轨迹规划模型，通过资源分配自适应方法，根据待建模的样本，调整RBF网络隐含层单元，从而提升网络学习速率与在线学习能力，结合自适应Q强化学习算法，获得最优操作集合。并选用学习率调参法完成网络的参数学习。仿真与实验结果表明：与其他两种方法对比，此方法具有较强的避障能力，机械臂能够较好地依据预定轨迹行进；避碰进程变化缓和，且能够尽快收敛并逐步趋向稳定。

Abstract:

Due to the high flexibility of the robot arm,it can simulate human arm to complete packing work of fragile bottled food,and correct errors existing in trajectory planning in real time to improve the stability and accuracy.Based on the basic architecture of the boxed manipulator,a non-collision trajectory planning model for the manipulator was proposed based on deep Q-RBF reinforcement learning network.The hidden layer units of the RBF network were adjusted according to the samples to be modeled through the resource allocation adaptive method,so as to improve the network learning rate and online learning ability.The optimal operation set was obtained by combining the adaptive Q-reinforcement learning algorithm.The learning rate adjustment method was used to complete the parameter learning of the network.The simulation and experimental results show that,compared with the other two methods,the method has strong obstacle avoidance ability,and the manipulator can move according to the predetermined trajectory better; the change of collision avoidance process is moderate,and can converge as soon as possible and gradually tend to be stable.

参考文献

相似文献

引证文献

引用本文

任银广.深度Q-RBF网络下的瓶装食品装箱机械臂无碰轨迹规划[J].机床与液压,2023,51(5):89-95.
REN Yinguang. Collision-Free Trajectory Planning for Manipulator Based on Deep Q-RBF Reinforcement Learning Network[J]. Machine Tool & Hydraulics,2023,51(5):89-95

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2023-04-07
出版日期: 2023-03-15

欢迎访问机床与液压官方网站!

网站首页

杂志简介

编委会

投稿须知

广告合作

联系我们

ENGLISH

引用本文

分享

文章指标

历史