陈钧, 宋薇, 周洋. 一种多模块神经网络与遗传算法相结合的单目位姿估计方法[J]. 机器人, 2023, 45(2): 187-196, 237. DOI: 10.13973/j.cnki.robot.210415
引用本文: 陈钧, 宋薇, 周洋. 一种多模块神经网络与遗传算法相结合的单目位姿估计方法[J]. 机器人, 2023, 45(2): 187-196, 237. DOI: 10.13973/j.cnki.robot.210415
CHEN Jun, SONG Wei, ZHOU Yang. A Monocular Pose Estimation Method Based on Multi-module Neural Network and Genetic Algorithm[J]. ROBOT, 2023, 45(2): 187-196, 237. DOI: 10.13973/j.cnki.robot.210415
Citation: CHEN Jun, SONG Wei, ZHOU Yang. A Monocular Pose Estimation Method Based on Multi-module Neural Network and Genetic Algorithm[J]. ROBOT, 2023, 45(2): 187-196, 237. DOI: 10.13973/j.cnki.robot.210415

一种多模块神经网络与遗传算法相结合的单目位姿估计方法

A Monocular Pose Estimation Method Based on Multi-module Neural Network and Genetic Algorithm

  • 摘要: 针对机器人抓取场景中存在的工件位姿不确定、堆叠遮挡等问题,提出一种多模块神经网络与遗传算法相结合的单目位姿估计方法,实现由目标工件检测到平面定位、再到位姿全方位立体匹配的逐级优化过程。首先,利用神经网络识别工件并分割工件区域,结合预测的中心位置构建L形边界,从而得到工件投影的局部有效模型。然后提取工件区域内的边缘信息来生成基于倾角分层的倒角距离函数,结合局部有效模型的形状构建匹配度函数,以适应遮挡情况。采用大范围搜索和小范围优化相结合的策略,利用遗传算法实现6D位姿的快速收敛。基于ArUco码对工件进行数据集构建和实验测试,结果表明该方法能在0.5 s左右实现对工件的位姿估计,在420 mm的观察距离下,横向平移误差能控制在1 mm左右,旋转角度平均误差控制在2°以内。通过实验对比分析可知,本方法能有效应对复杂环境下工件位姿的准确估计,提升机器人工作效率。

     

    Abstract: Aiming at the problems such as uncertain pose and stacking occlusion of industrial parts in robotic grasping scene, a monocular pose estimation method is proposed based on multi-module neural network and genetic algorithm, to realize a step-by-step optimization from the detection of industrial parts to their positioning in 2D plane, and to the all-round stereo matching. Firstly, the industrial parts are identified and their location area is segmented by neural network, and the L-shaped boundary is constructed by combining the predicted center position, so as to obtain the local effective model of industrial parts projection. Then, the edge information in the area of industrial parts is extracted to generate the chamfer distance function which is delimited by direction angles, and the matching function is constructed combined with the shape of the partially effective model to adapt to the occlusion. Lastly, the strategy of large-scale search and small-scale optimization is adopted, and the quick convergence of 6D pose is realized by genetic algorithm. Moreover, a dataset of the industrial parts with ArUco markers is constructed for experimental verification. Results show that the proposed method can estimate the pose of the industrial parts in about 0.5 s. In the observation distance of 420 mm, the lateral translation error can be controlled within 1 mm, the average rotation angle error can be controlled within 2°. Experimental comparison shows that the proposed method can effectively deal with the accurate estimation of industrial part pose in complex environment and improve the working efficiency of robot.

     

/

返回文章
返回