Autonomous person-following telepresence robot using monocular camera and deep learning YOLO

Telepresence robots (TRs) are increasingly important for remote communication and collaboration, particularly in situations where physical presence is not possible. One key feature of TRs is person-following, which relies on the detection and distance estimation of individuals. This study proposes a...

全面介绍

Saved in:
书目详细资料
Main Authors: Mat Lazim, Izzuddin, Sakri, Ahmad Amin Firdaus, Mauzi, Suffian At-Tsauri, Sahrim, Musab, Ramli, Liyana, Noordin, Aminurrashid
格式: Article
语言:English
出版: ARQII Publication 2024
在线阅读:http://eprints.utem.edu.my/id/eprint/27514/2/01084260420249539774.PDF
http://eprints.utem.edu.my/id/eprint/27514/
http://arqiipubl.com/ojs/index.php/AMS_Journal/article/view/574
标签: 添加标签
没有标签, 成为第一个标记此记录!
实物特征
总结:Telepresence robots (TRs) are increasingly important for remote communication and collaboration, particularly in situations where physical presence is not possible. One key feature of TRs is person-following, which relies on the detection and distance estimation of individuals. This study proposes an autonomous person-following TR using a monocular camera and deep-learning YOLO for person detection and distance estimation. To compensate for the monocular camera's inability to provide depth information, a novel distance estimation algorithm based on focal length and person width is introduced. The estimated width information of the detected person is extracted from the bounding box generated by YOLO. A pre-trained model using the MS COCO dataset is employed with YOLO for the person detection task. For robot movement control, a region-based controller is proposed to enable the robot to move based on the detected person's location in the image captured by the camera. Finally, integration and deployment of the proposed method in the TR is carried out using the Robot Operating System (ROS). Experimental results demonstrate that the TR can successfully follow a person using the proposed algorithm, thus highlighting its effectiveness for person-following tasks.