JP7709648B2

JP7709648B2 - Contact object estimation device, method, and program

Info

Publication number: JP7709648B2
Application number: JP2023569019A
Authority: JP
Inventors: 太智金田; 大祐佐藤; 雅人宮原; 匡人福田; 成宗松村; 仁志瀬下
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2021-12-24
Filing date: 2021-12-24
Publication date: 2025-07-17
Anticipated expiration: 2041-12-24
Also published as: WO2023119657A1; JPWO2023119657A1

Description

この発明の一態様は、例えばロボットを用いて対象物体を遠隔操作するシステムで使用される接触対象推定装置、方法およびプログラムに関する。One aspect of the present invention relates to a contact object estimation device, method and program for use in a system for remotely controlling a target object using, for example, a robot.

近年、操作者が遠隔地に配置されたロボットをネットワークを介して動作させることにより、遠隔地の物体を遠隔操作するシステムが種々開発されている。この種のシステムでは、操作対象となる物体に関する情報を先行して推定することで、よりスムーズな操作が期待できる。In recent years, various systems have been developed that allow an operator to remotely control an object in a remote location by operating a robot placed in the remote location via a network. This type of system is expected to enable smoother operation by estimating information about the object to be operated in advance.

そこで、例えば非特許文献１または非特許文献２では、ヘッドマウントディスプレイ（HMD）を用いて操作者の視線情報を取得し、取得された視線情報をもとに遠隔側のロボットが把持しようとしている物体を先行して推定し、これによりロボットの動作の遅延を低減する技術が提案されている。Therefore, for example, Non-Patent Document 1 or Non-Patent Document 2 propose a technology that acquires the operator's gaze information using a head-mounted display (HMD) and, based on the acquired gaze information, estimates in advance the object that the remote robot is attempting to grasp, thereby reducing delays in the robot's operations.

Negishi et.al “Operation Assistance Using Visual Feedback with Considering Human Intention on Master-Slave Systems” ROBIO 2016Negishi et.al “Operation Assistance Using Visual Feedback with Considering Human Intention on Master-Slave Systems” ROBIO 2016 Kojima et.al “Early Estimation of User’s Intention of Tele-Operation Using Object Affordance and Hand Motion in a Dual First-Person Vision” CVPR 2019Kojima et.al “Early Estimation of User’s Intention of Tele-Operation Using Object Affordance and Hand Motion in a Dual First-Person Vision” CVPR 2019

ところが、非特許文献１または２に記載された技術では、操作者の視線情報を必要とすることからHMDや映像を伝送するためのカメラが別途必要となる。このため、システム構成が複雑化し易いという課題を有している。However, the technology described in Non-Patent Documents 1 and 2 requires information on the operator's line of sight, which necessitates a separate HMD and a camera for transmitting images. This poses the problem that the system configuration can easily become complicated.

また、非特許文献１または２に記載された技術では、ロボット側に設置されたカメラにより撮像された映像をもとに、物体の位置を測定して、操作の接触対象となりそうな物体を推定するようにしている。このため、映像に複数の物体が含まれていると、これら複数の物体のすべてについて位置を測定し、その結果を総合して接触対象となる物体を推定する処理を行うことになり、処理に多くの時間が必要となる。この結果、制御処理のリアルタイム性を維持することが困難となる。 In addition, in the technology described in Non-Patent Documents 1 and 2, the position of an object is measured based on an image captured by a camera installed on the robot, and an object that is likely to be a contact target during operation is estimated. Therefore, if the image contains multiple objects, the positions of all of these objects are measured and the results are combined to estimate the object that is likely to be a contact target, which requires a lot of processing time. As a result, it becomes difficult to maintain real-time control processing.

この発明は上記事情に着目してなされたもので、システム構成の複雑化を防止し、かつ接触候補となる対象が複数存在する場合でも短時間に接触対象を推定できるようにして処理のリアルタイム性の向上を図った技術を提供しようとするものである。 This invention has been made in light of the above-mentioned circumstances, and aims to provide technology that prevents the system configuration from becoming complicated and improves the real-time processing by enabling the contact object to be estimated in a short period of time even when there are multiple potential contact objects.

上記課題を解決するために、この発明に係る接触対象推定装置または推定方法の一態様は、操作者の操作に応じて遠隔地のロボットを動作させることにより前記遠隔地の作業エリアに存在する物体を遠隔操作するシステムにあって、接触対象推定装置として動作する情報処理装置に、第１の処理部または処理過程と、第２の処理部または処理過程と、第３の処理部または処理過程とを備える。 In order to solve the above problem, one aspect of the contact object estimation device or estimation method of the present invention is a system that remotely controls an object present in a remote work area by operating a robot at the remote location in response to an operator's operation, and includes an information processing device that operates as the contact object estimation device, the information processing device having a first processing unit or processing step, a second processing unit or processing step, and a third processing unit or processing step.

第１の処理部または処理過程は、作業エリアを撮像した映像情報からその映像フレームに写っている複数の物体を検出し、検出された複数の前記物体の各々に対し第１の干渉領域を定義する。第２の処理部または処理過程は、前記複数の物体の各々について、前記第１の干渉領域と、前記操作者の操作情報と、この操作者の操作により動作するロボットに対し設定される第２の干渉領域とをもとに、前記ロボットとの間の干渉の有無を判定する。第３の処理部または処理過程は、干渉有りと判定された複数の物体の中から、当該物体と前記ロボットとの位置関係を表す情報に基づいて、前記ロボットが接触しようとしている前記物体を選定する。A first processing unit or process detects a plurality of objects captured in a video frame from video information capturing an image of a work area, and defines a first interference area for each of the detected objects. A second processing unit or process determines whether or not there is interference between the robot and each of the objects based on the first interference area, the operation information of the operator, and a second interference area set for the robot operated by the operation of the operator. A third processing unit or process selects, from among the objects determined to be interfering, the object with which the robot is about to come into contact, based on information representing the positional relationship between the object and the robot.

この発明の一態様によれば、複数の物体を含む作業エリアを撮像した映像情報と、操作者によるロボットの操作情報とに基づいて、ロボットが接触しようとしている物体が推定される。このため、例えば、操作者の視線情報を用いることなく接触対象となる対象を推定できるようになり、これによりHMD等の追加デバイスを不要にしてシステムの複雑化を回避することができる。According to one aspect of the present invention, the object that the robot is about to contact is estimated based on image information of a work area including multiple objects and information on the operation of the robot by the operator. Therefore, for example, it becomes possible to estimate the target to be contacted without using the line of sight information of the operator, which makes it possible to avoid the complication of the system by eliminating the need for additional devices such as an HMD.

また、映像情報から検出された各物体に対しそれぞれ干渉領域が定義され、この物体の干渉領域とロボットアームの干渉領域とをもとに、ロボットと干渉する可能性がある物体が選択される。そして、選択された物体の中から、例えばロボットアームが接触する可能性が最も高い物体が選定される。従って、作業エリアに接触候補となる複数の物体が存在する場合でも、接触対象となる可能性が最も高い物体を短時間に推定できるようになり、これにより処理のリアルタイム性を維持することが可能となる。 In addition, an interference area is defined for each object detected from the video information, and objects that may interfere with the robot are selected based on the interference area of this object and the interference area of the robot arm. Then, from among the selected objects, an object that is most likely to come into contact with the robot arm, for example, is selected. Therefore, even if there are multiple objects in the working area that are potential contact objects, it is possible to quickly estimate the object that is most likely to be the subject of contact, thereby making it possible to maintain real-time processing.

すなわちこの発明の一態様によれば、システム構成の複雑化を防止し、かつ接触候補の対象が複数存在する場合でも短時間に接触対象の物体を推定できるようにして処理のリアルタイム性の向上を図った技術を提供することができる。In other words, according to one aspect of the present invention, it is possible to provide a technology that prevents the system configuration from becoming complicated and improves the real-time processing by enabling the object to be contacted to be estimated in a short period of time even when there are multiple potential contact objects.

図１は、この発明の一実施形態に係る遠隔制御システムの一例を示す図である。FIG. 1 is a diagram showing an example of a remote control system according to an embodiment of the present invention. 図２は、この発明の一実施形態に係る接触対象推定装置のハードウェア構成の一例を示すブロック図である。FIG. 2 is a block diagram showing an example of a hardware configuration of a contact object estimation device according to an embodiment of the present invention. 図３は、この発明の一実施形態に係る接触対象推定装置のソフトウェア構成の一例を示すブロック図である。FIG. 3 is a block diagram showing an example of a software configuration of the contact object estimation device according to the embodiment of the present invention. 図４は、図３に示した接触対象推定装置の制御部が実行する動作の全体の処理手順と処理内容の一例を示すフローチャートである。FIG. 4 is a flowchart showing an example of the overall processing procedure and processing contents of the operation executed by the control unit of the contact object estimation device shown in FIG. 図５は、図４に示した処理手順のうち物体情報の検出および干渉領域の定義に係る処理の処理手順と処理内容の一例を示すフローチャートである。FIG. 5 is a flowchart showing an example of the processing procedure and processing contents related to the detection of object information and the definition of an interference area, among the processing procedures shown in FIG. 図６は、図４に示した処理手順のうち干渉判定処理の処理手順と処理内容の一例を示すフローチャートである。FIG. 6 is a flowchart showing an example of the procedure and contents of the collision detection process from the procedure shown in FIG. 図７は、図４に示した処理手順のうち接触対象選定処理の処理手順と処理内容の一例を示すフローチャートである。FIG. 7 is a flowchart showing an example of the procedure and contents of the contact target selection process from the procedure shown in FIG. 図８は、映像情報に含まれるバウンディングボックスの一例を示す図である。FIG. 8 is a diagram showing an example of a bounding box included in video information. 図９は、図３に示した手情報記憶部に記憶される手情報の一例を示す図である。FIG. 9 is a diagram illustrating an example of hand information stored in the hand information storage unit illustrated in FIG. 3 . 図１０は、図３に示した物体情報記憶部に記憶される物体情報の一例を示す図である。FIG. 10 is a diagram showing an example of object information stored in the object information storage unit shown in FIG. 3. As shown in FIG.

以下、図面を参照してこの発明に係わる実施形態を説明する。 Below, an embodiment of the present invention is described with reference to the drawings.

［一実施形態］
（構成例）
（１）システム
図１は、この発明の一実施形態に係る遠隔制御システムの一例を示す図である。 [One embodiment]
(Configuration example)
(1) System FIG. 1 is a diagram showing an example of a remote control system according to an embodiment of the present invention.

一実施形態の遠隔制御システムは、遠隔地の作業現場等に配置されたロボットアームＲＢを、操作者ＵＳがネットワークＮＷを介して遠隔制御することにより、例えば部品や工具、装置等の操作対象となる物体ＴＬに対し、所望の操作を行うようにしたものである。 In one embodiment of the remote control system, a robot arm RB placed at a remote work site or the like is remotely controlled by an operator US via a network NW to perform a desired operation on an object TL to be operated, such as a part, tool, or device.

遠隔地には、上記ロボットアームＲＢおよびその動作を制御するロボット制御装置ＲＵに加え、カメラＣＭと、接触対象推定装置ＥＵが配置されている。このうちカメラＣＭは、例えばRGB-Dカメラからなり、現場の物体ＴＬ群とロボットアームＲＢとを含むエリアを撮像して当該エリアの映像情報を出力する。In addition to the robot arm RB and the robot control device RU that controls its operation, a camera CM and a contact object estimation device EU are placed at the remote location. Of these, the camera CM is, for example, an RGB-D camera, and captures an area including the objects TL at the site and the robot arm RB, and outputs video information of the area.

一方、操作側には、操作側端末装置ＯＵと、モニタＭＴとが設けられ、さらに操作者ＵＳには例えば手の動きを検出する動きセンサＳＳが装着されている。モニタＭＴは、遠隔地の上記カメラＣＭにより撮像された映像情報を表示するために使用される。On the other hand, the operating side is provided with an operating side terminal device OU and a monitor MT, and furthermore, the operator US is equipped with a motion sensor SS for detecting, for example, hand movements. The monitor MT is used to display video information captured by the camera CM at a remote location.

動きセンサＳＳは、例えば赤外線式のセンサからなり、操作者ＵＳの手の動きを手の所定の部位の位置の変化として検出する。例えば、動きセンサＳＳは、操作者ＵＳの手の中心の位置、各指の関節の位置、各指先の位置および手首の位置を、それぞれ３次元座標情報として検出する。また、動きセンサＳＳは、操作者ＵＳが左右どちらの手を操作しているかと、手の表側と裏側のいずれを用いて操作しているかを検出する。なお、手の表側とは手の甲の側を、また手の裏側とは手の平の側をそれぞれ示す。The motion sensor SS is, for example, an infrared sensor, and detects the movement of the operator US's hand as a change in the position of a specific part of the hand. For example, the motion sensor SS detects the position of the center of the operator US's hand, the position of each finger joint, the position of each fingertip, and the position of the wrist, each as three-dimensional coordinate information. The motion sensor SS also detects whether the operator US is using his/her left or right hand, and whether he/she is using the front or back of the hand to operate. The front of the hand refers to the back of the hand, and the back of the hand refers to the palm side.

操作側端末装置ＯＵは、遠隔地の接触対象推定装置ＥＵからネットワークＮＷを介して伝送された映像情報を受信して上記モニタＭＴに表示させる機能と、上記動きセンサＳＳにより検出された操作者ＵＳの手の動きを表すセンサ情報をネットワークＮＷを介して遠隔地の接触対象推定装置ＥＵへ送信する機能とを備える。センサ情報には、操作者ＵＳの手の各部位の位置の変化を示す３次元座標情報に加え、左右どちらの手であるかを示す情報と、手の表裏の向きを表す情報も含まれる。The operating side terminal device OU has a function of receiving video information transmitted from a remote contact object estimation device EU via a network NW and displaying it on the monitor MT, and a function of transmitting sensor information representing the hand movement of the operator US detected by the motion sensor SS to the remote contact object estimation device EU via the network NW. The sensor information includes three-dimensional coordinate information indicating the change in position of each part of the hand of the operator US, as well as information indicating whether the hand is left or right and information indicating the front or back orientation of the hand.

（２）接触対象推定装置ＥＵ
図２および図３は、それぞれ接触対象推定装置ＥＵのハードウェア構成およびソフトウェア構成の一例を示すブロック図である。 (2) Contact object estimation device EU
2 and 3 are block diagrams showing an example of a hardware configuration and a software configuration, respectively, of the contact object estimation device EU.

接触対象推定装置ＥＵは、パーソナルコンピュータまたはサーバコンピュータ等の情報処理装置からなり、中央処理ユニット（Central Processing Unit：ＣＰＵ）および画像処理ユニット（Graphics Processing Unit：ＧＰＵ）等のハードウェアプロセッサを使用した制御部１を備えている。制御部１には、バス７を介して、プログラム記憶部２およびデータ記憶部３を有する記憶ユニットと、カメラインタフェース（以後インタフェースをＩ／Ｆと略称する）部４と、通信Ｉ／Ｆ部５と、ロボットＩ／Ｆ部６が接続されている。なお、制御部１は、ＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等を用いて構成されてもよい。The contact object estimation device EU is composed of an information processing device such as a personal computer or a server computer, and has a control unit 1 using hardware processors such as a central processing unit (CPU) and a graphics processing unit (GPU). A storage unit having a program storage unit 2 and a data storage unit 3, a camera interface (hereinafter the interface is abbreviated as I/F) unit 4, a communication I/F unit 5, and a robot I/F unit 6 are connected to the control unit 1 via a bus 7. The control unit 1 may be configured using a programmable logic device (PLD), a field programmable gate array (FPGA), or the like.

カメラＩ／Ｆ部４は、カメラＣＭから出力される映像情報を取り込むために使用される。通信Ｉ／Ｆ部５は、ネットワークＮＷにより定義される通信プロトコルに従い、上記カメラＣＭにより撮像された映像情報をネットワークＮＷを介して操作側端末装置ＯＵへ送信すると共に、操作側端末装置ＯＵからネットワークＮＷを介して伝送されたセンサ情報を受信する。ロボットＩ／Ｆ部６は、接触対象の選定結果を表す情報をロボット制御装置ＲＵへ出力する。The camera I/F unit 4 is used to capture video information output from the camera CM. The communication I/F unit 5 transmits video information captured by the camera CM to the operating terminal device OU via the network NW in accordance with a communication protocol defined by the network NW, and receives sensor information transmitted from the operating terminal device OU via the network NW. The robot I/F unit 6 outputs information representing the selection result of the contact target to the robot control device RU.

プログラム記憶部２は、例えば、記憶媒体としてＳＳＤ（Solid State Drive）等の随時書込みおよび読出しが可能な不揮発性メモリと、ＲＯＭ（Read Only Memory）等の不揮発性メモリとを組み合わせて構成したもので、ＯＳ（Operating System）等のミドルウェアに加えて、一実施形態に係る各種制御処理を実行するために必要なアプリケーション・プログラムを格納する。なお、以後ＯＳと各アプリケーション・プログラムとをまとめてプログラムと称する。The program storage unit 2 is configured by combining a non-volatile memory such as a solid state drive (SSD) as a storage medium that can be written to and read from at any time, and a non-volatile memory such as a read only memory (ROM), and stores application programs necessary for executing various control processes according to one embodiment, in addition to middleware such as an operating system (OS). Hereinafter, the OS and each application program will be collectively referred to as a program.

データ記憶部３は、例えば、記憶媒体として、ＳＳＤ等の随時書込みおよび読出しが可能な不揮発性メモリと、ＲＡＭ（Random Access Memory）等の揮発性メモリと組み合わせたもので、一実施形態を実施するために必要な主たる記憶領域として、映像情報記憶部３１と、手情報記憶部３２と、物体情報記憶部３３とを備えている。The data storage unit 3 is, for example, a combination of a non-volatile memory such as an SSD, which can be written to and read from at any time, and a volatile memory such as a RAM (Random Access Memory), as a storage medium, and includes an image information storage unit 31, a hand information storage unit 32, and an object information storage unit 33 as the main storage areas required to implement one embodiment.

映像情報記憶部３１は、カメラＣＭから取得した映像情報を、後述する物体検出処理のために一時保存する。 The video information storage unit 31 temporarily stores the video information obtained from the camera CM for the object detection processing described below.

手情報記憶部３２は、動きセンサＳＳにより検出された操作者ＵＳの手の動きを表すセンサ情報と、このセンサ情報をもとに生成される手の動きの速度ベクトル記憶するために使用される。また、手情報記憶部３２には、ロボットアームＲＢの操作手の干渉領域を表す情報も記憶される。The hand information storage unit 32 is used to store sensor information representing the hand movement of the operator US detected by the motion sensor SS and a velocity vector of the hand movement generated based on this sensor information. The hand information storage unit 32 also stores information representing the interference area of the operator hand of the robot arm RB.

物体情報記憶部３３は、カメラＣＭの映像情報から検出された複数の物体を表す情報をそれぞれ時系列に記憶するために使用される。物体を表す情報には、検出された物体に対しユニークに付与される物体ＩＤと、位置および速度と、物体の干渉領域を表す情報と、速度ベクトルと、干渉判定結果を表す情報とが含まれる。物体情報の詳細は動作例において述べる。The object information storage unit 33 is used to store information representing multiple objects detected from the video information of the camera CM in chronological order. The information representing the objects includes an object ID uniquely assigned to the detected object, its position and speed, information representing the object's interference area, a speed vector, and information representing the interference determination result. Details of the object information will be described in the operation example.

制御部１は、一実施形態を実施するために必要な処理機能として、映像情報取得処理部１１と、映像情報送信処理部１２と、手情報生成処理部１３と、物体検出処理部１４と、干渉領域定義処理部１５と、干渉判定処理部１６と、接触対象選定処理部１７と、推定情報出力処理部１８とを備える。上記処理部１１～１８は、何れもプログラム記憶部２に格納されたアプリケーション・プログラムを、制御部１のハードウェアプロセッサに実行させることにより実現される。The control unit 1 includes, as processing functions necessary for implementing one embodiment, a video information acquisition processing unit 11, a video information transmission processing unit 12, a hand information generation processing unit 13, an object detection processing unit 14, an interference area definition processing unit 15, an interference determination processing unit 16, a contact target selection processing unit 17, and an estimated information output processing unit 18. All of the above processing units 11 to 18 are realized by causing the hardware processor of the control unit 1 to execute application programs stored in the program storage unit 2.

映像情報取得処理部１１は、カメラＣＭから出力される時系列の映像情報を入出力Ｉ／Ｆ部４を介して取得し、取得された上記各映像情報を映像情報記憶部３１に一旦記憶させる処理を行う。The video information acquisition processing unit 11 acquires the time-series video information output from the camera CM via the input/output I/F unit 4, and performs a process of temporarily storing each of the acquired video information in the video information storage unit 31.

映像情報は、バウンディングボックス（Bounding Box）を含む映像フレームと、点群情報とから構成される。バウンディングボックスは、映像フレームに写っている物体の位置を矩形枠で表示するもので、例えば映像フレームの左上端の座標を（0, 0）としたとき、矩形枠の左上端の端点の座標と右下端の端点の座標とにより表される。点群情報は、例えばデプス情報であり、映像フレームのカメラＣＭからの画素（ピクセル）ごとの距離分布を表している。 Video information consists of video frames including bounding boxes, and point cloud information. A bounding box displays the position of an object in a video frame as a rectangular frame, and is represented by the coordinates of the upper left corner and the lower right corner of the rectangular frame, for example, when the coordinates of the upper left corner of the video frame are (0, 0). Point cloud information is, for example, depth information, and represents the distance distribution of each pixel from the camera CM of the video frame.

映像情報送信処理部１２は、上記カメラＣＭにより撮像された映像情報のうち、バウンディングボックスを含む映像フレームを、通信Ｉ／Ｆ部５から操作側端末装置ＯＵへ送信する処理を行う。The video information transmission processing unit 12 performs processing to transmit video frames including bounding boxes from the video information captured by the camera CM from the communication I/F unit 5 to the operating side terminal device OU.

手情報生成処理部１３は、操作側端末装置ＯＵから伝送されるセンサ情報を通信Ｉ／Ｆ部５を介して受信し、受信された上記センサ情報をもとに手の動きの速度ベクトルを算出する。そして、手情報生成処理部１３は、受信された上記センサ情報に含まれる、手の各部位の位置を表す情報、左右どちらの手であるかを示す情報、および操作側が手の表裏のいずれであるかを表す情報と、算出された上記速度ベクトルとを、手情報記憶部３２に記憶させる。The hand information generation processing unit 13 receives sensor information transmitted from the operating side terminal device OU via the communication I/F unit 5, and calculates a velocity vector of the hand movement based on the received sensor information. The hand information generation processing unit 13 then stores in the hand information storage unit 32 the information included in the received sensor information indicating the positions of each part of the hand, information indicating whether the hand is the left or right hand, and information indicating whether the operating side is the front or back of the hand, as well as the calculated velocity vector.

また、手情報生成処理部１３は、ロボットアームＲＢの操作手の中心を基準に予め設定された干渉領域を表す情報を、手情報記憶部３２に記憶させる処理も行う。なお、干渉領域を表す情報の具体的な設定手法については動作例において述べる。The hand information generation processing unit 13 also performs processing to store information representing a pre-set interference area based on the center of the operating hand of the robot arm RB in the hand information storage unit 32. A specific method for setting the information representing the interference area will be described in the operation example.

物体検出処理部１４は、映像情報記憶部３１から映像情報をフレームごとに読み込み、読み込まれた映像フレームに写っている複数の物体とロボットアームＲＢの操作手を表す情報を検出する。この検出は、バウンディングボックスを用いることで行われる。物体検出処理部１４は、検出された上記バウンディングボックスで囲まれる領域の映像データと点群情報を出力する。The object detection processing unit 14 reads the video information from the video information storage unit 31 frame by frame, and detects the multiple objects and information representing the operating hand of the robot arm RB that appear in the read video frames. This detection is performed using a bounding box. The object detection processing unit 14 outputs the video data and point cloud information of the area surrounded by the detected bounding box.

干渉領域定義処理部１５は、上記物体検出処理部１４により検出された物体ごとに、ロボットアームＲＢの操作範囲外となる物体を除外する処理を行う。また、干渉領域定義処理部１５は、上記除外処理により除外されなかった物体の各々について、当該物体の干渉領域を定義する処理を行う。上記除外処理および干渉領域を定義する処理の一例は、動作例において説明する。The interference area definition processing unit 15 performs a process of excluding objects that are outside the operational range of the robot arm RB for each object detected by the object detection processing unit 14. In addition, the interference area definition processing unit 15 performs a process of defining an interference area for each object that was not excluded by the above exclusion process. An example of the above exclusion process and the process of defining an interference area will be described in the operation example.

干渉判定処理部１６は、手情報記憶部３２に記憶された操作者ＵＳの手情報と、物体情報記憶部３３に記憶された複数の物体の物体情報とをもとに、物体ごとに、予め用意された判定式を用いて、各物体と、上記操作者ＵＳの手情報に対応するロボットアームＲＢの操作手との間の干渉の有無を判定する。そして、その判定結果を表す干渉判定情報を物体情報記憶部３３に追加する。干渉判定処理の一例についても動作例において説明する。The interference determination processing unit 16 determines whether or not there is interference between each object and the operating hand of the robot arm RB corresponding to the hand information of the operator US, using a determination formula prepared in advance for each object, based on the hand information of the operator US stored in the hand information storage unit 32 and the object information of multiple objects stored in the object information storage unit 33. Then, interference determination information representing the determination result is added to the object information storage unit 33. An example of the interference determination process will also be described in the operation example.

接触対象選定処理部１７は、先ず手情報記憶部３２に記憶された操作者ＵＳの手情報と、上記干渉判定処理部１６により干渉有りと判定された物体に関する物体情報を読み込む。続いて接触対象選定処理部１７は、読み込まれた上記各情報をもとに、干渉有りと判定された物体の各々について、物体に対し操作者ＵＳの手の表（手の甲側）と手の裏（手の平側）のいずれが対向しているかを表す情報と、操作者ＵＳの手が上記物体の中心をどれだけ捉えているかを示す指標である中心捕捉度をそれぞれ求める。そして、接触対象選定処理部１７は、上記手の表裏のいずれが対向しているかを表す情報と、上記中心捕捉度とに基づいて、スコア計算を行い、計算されたコストに基づいてロボットアームＲＢの操作手が接触する可能性が最も高い物体を選定する処理を行う。The contact target selection processing unit 17 first reads the hand information of the operator US stored in the hand information storage unit 32 and the object information on the object determined to be in interference by the interference determination processing unit 16. Next, based on the above-mentioned information read, the contact target selection processing unit 17 obtains, for each object determined to be in interference, information indicating which of the front (back side of the hand) or the back (palm side) of the operator US's hand faces the object, and a center capture degree, which is an index indicating how well the hand of the operator US captures the center of the object. Then, the contact target selection processing unit 17 performs a score calculation based on the information indicating which of the front and back of the hand faces and the center capture degree, and performs a process of selecting an object that is most likely to be contacted by the operator hand of the robot arm RB based on the calculated cost.

推定情報出力処理部１８は、上記接触対象選定処理部１７により選定された物体に関する物体情報を、ロボットアームＲＢの動作の補助情報として、ロボットＩ／Ｆ部６からロボット制御装置ＲＵへ出力する処理を行う。The estimated information output processing unit 18 performs processing to output object information regarding the object selected by the contact target selection processing unit 17 from the robot I/F unit 6 to the robot control device RU as auxiliary information for the operation of the robot arm RB.

（動作例）
次に、以上のように構成された接触対象推定装置ＥＵの動作例を説明する。
図４は、接触対象推定装置ＥＵの制御部１が実行する全体の処理手順と処理内容の一例を示すフローチャートである。 (Example of operation)
Next, an example of the operation of the contact object estimation device EU configured as above will be described.
FIG. 4 is a flowchart showing an example of the overall processing procedure and processing contents executed by the control unit 1 of the contact object estimation device EU.

（１）映像情報の取得と送信
接触対象推定装置ＥＵの制御部１は、ステップＳ１０において遠隔操作の開始を監視している。この状態で、操作者ＵＳがロボットアームＲＢを用いて物体を遠隔操作するために、システムを起動したとする、そうすると接触対象推定装置ＥＵの制御部１は、映像情報取得処理部１１の制御の下、ステップＳ１１において、カメラＣＭにより撮像された作業エリアの映像情報をカメラＩ／Ｆ部４を介して取得し、取得された上記映像情報を映像情報記憶部３１に記憶させる。 (1) Acquisition and transmission of video information The control unit 1 of the contact object estimation device EU monitors the start of remote operation in step S10. In this state, assume that the operator US starts the system to remotely operate an object using the robot arm RB. Then, under the control of the video information acquisition processing unit 11, the control unit 1 of the contact object estimation device EU acquires video information of the work area captured by the camera CM via the camera I/F unit 4 in step S11, and stores the acquired video information in the video information storage unit 31.

このとき、カメラＣＭの撮像範囲は、作業エリアに存在する複数の物体と、ロボットアームＲＢの操作手（把持部）とを含むように設定される。このため、映像情報取得処理部１１では、映像フレーム中に、上記作業エリアに存在する複数の物体の画像と、ロボットアームＲＢの操作手の画像とが含まれる映像情報が得られる。また、カメラＣＭとしてはRGB-Dカメラが用いられるため、バウンディングボックスが付された映像データと点群情報とを含む映像情報が取得される。At this time, the imaging range of the camera CM is set to include multiple objects present in the work area and the operator's hand (gripping part) of the robot arm RB. Therefore, the video information acquisition processing unit 11 obtains video information including images of multiple objects present in the work area and an image of the operator's hand of the robot arm RB in a video frame. In addition, since an RGB-D camera is used as the camera CM, video information including video data with bounding boxes and point cloud information is obtained.

上記映像情報が取得されると、接触対象推定装置ＥＵの制御部１は、映像情報送信処理部１２の制御の下、ステップＳ１２において、上記映像情報記憶部３１に記憶された映像情報に含まれる映像データと点群情報とのうち、映像データのみを読み出す。そして、読み出された映像データを、通信Ｉ／Ｆ部５からネットワークＮＷを介して操作側端末装置ＯＵへ送信する。When the video information is acquired, the control unit 1 of the contact target estimation device EU, under the control of the video information transmission processing unit 12, reads out only the video data from the video data and point cloud information contained in the video information stored in the video information storage unit 31 in step S12. Then, the read out video data is transmitted from the communication I/F unit 5 to the operating side terminal device OU via the network NW.

この結果、操作側では、操作側端末装置ＯＵにより上記映像データがモニタＭＴに表示される。従って、以後操作者ＵＳは、モニタＭＴに表示された上記作業エリアの映像を見ながら物体に対する遠隔操作を行うことが可能となる。As a result, on the operating side, the image data is displayed on the monitor MT by the operating side terminal device OU. Therefore, the operator US can then remotely control the object while watching the image of the work area displayed on the monitor MT.

（２）センサ情報の取得と手情報の生成および記憶
ロボットアームＲＢを遠隔操作するために操作者ＵＳが手を動かすと、接触対象推定装置ＥＵの制御部１は、手情報生成処理部１３の制御の下、先ずステップＳ１３により、動きセンサＳＳにより検出された操作者ＵＳの手の動きを表すセンサ情報を、所定の周期で通信Ｉ／Ｆ部５を介して受信する。センサ情報には、例えば操作者ＵＳの手の中心の位置、各指の関節の位置、各指先の位置および手首の位置をそれぞれ表す３次元座標情報と、左右いずれの手であるかを示す情報が含まれる。 (2) Acquisition of Sensor Information, and Generation and Storage of Hand Information When the operator US moves his/her hand to remotely operate the robot arm RB, the control unit 1 of the contact object estimation device EU, under the control of the hand information generation processing unit 13, first receives sensor information representing the hand movement of the operator US detected by the movement sensor SS at a predetermined cycle via the communication I/F unit 5 in step S13. The sensor information includes, for example, three-dimensional coordinate information representing the position of the center of the hand of the operator US, the positions of the joints of each finger, the positions of each fingertip, and the position of the wrist, as well as information indicating whether it is the left or right hand.

手情報生成処理部１３は、続いてステップＳ１４において、センサ情報が受信されるごとに、当該センサ情報に含まれる上記操作者ＵＳの手の各部位の位置を表す３次元座標情報と、一つ前のタイミングで受信されたセンサ情報に含まれる上記各部位の位置を表す３次元座標情報との差分から、速度ベクトルを算出する。Next, in step S14, each time sensor information is received, the hand information generation processing unit 13 calculates a velocity vector from the difference between the three-dimensional coordinate information representing the position of each part of the hand of the operator US contained in the sensor information and the three-dimensional coordinate information representing the position of each part of the hand contained in the sensor information received at the previous timing.

そして、手情報生成処理部１３は、受信された上記センサ情報に含まれる各部位の位置を表す３次元座標情報および左右いずれの手であるかを示す情報と、算出された上記速度ベクトルを、手情報記憶部３２に記憶させる。また、手情報生成処理部１３は、ロボットアームＲＢの操作手に対し設定される干渉領域を表す情報を、手情報記憶部３２に記憶させる。上記干渉領域を表す情報は、例えば、ロボットアームＲＢの操作手の掌の中心から直径９０mmの範囲を表す球体として設定される。図９は、手情報記憶部３２に記憶された手情報の一例を示すものである。The hand information generation processing unit 13 then stores in the hand information storage unit 32 the three-dimensional coordinate information representing the position of each part included in the received sensor information, information indicating whether the hand is the left or right hand, and the calculated velocity vector. The hand information generation processing unit 13 also stores in the hand information storage unit 32 information representing an interference area set for the operating hand of the robot arm RB. The information representing the interference area is set, for example, as a sphere representing a range of 90 mm in diameter from the center of the palm of the operating hand of the robot arm RB. Figure 9 shows an example of hand information stored in the hand information storage unit 32.

（３）物体の検出および干渉領域の定義
上記映像情報が取得されると接触対象推定装置ＥＵの制御部１は、ステップＳ１５において、映像フレームごとにその映像データから物体を検出すると共に、検出された各物体について干渉領域を定義する処理を以下のように実行する。 (3) Detection of Objects and Definition of Interference Areas When the above-mentioned video information is acquired, in step S15, the control unit 1 of the contact target estimation device EU detects objects from the video data for each video frame and executes a process of defining an interference area for each detected object as follows.

図５は、上記ステップＳ１５において、物体検出処理部１４および干渉領域定義処理部１５が実行する一連の処理の処理手順と処理内容の一例を示すフローチャートである。 Figure 5 is a flowchart showing an example of the processing steps and processing contents of a series of processes performed by the object detection processing unit 14 and the interference area definition processing unit 15 in step S15 above.

物体検出処理部１４は、ステップＳ２０において、映像情報記憶部３１から映像フレームを読み込み、読み込まれた映像フレームに写っている複数の物体とロボットアームＲＢの操作手を、これらに付与されているバウンディングボックスをもとに検出する。そして、物体検出処理部１４は、ステップＳ２１により、検出された上記各物体に対しユニークな物体ＩＤを付した後、当該各物体についてバウンディングボックスで囲まれた領域の映像データと点群情報を出力する。In step S20, the object detection processing unit 14 reads a video frame from the video information storage unit 31 and detects multiple objects and the operating hand of the robot arm RB that appear in the read video frame based on the bounding boxes assigned to them. Then, in step S21, the object detection processing unit 14 assigns a unique object ID to each of the detected objects, and outputs the video data and point cloud information of the area surrounded by the bounding box for each object.

図８は、検出された物体ＴＬ１，ＴＬ２，ＴＬ３と、当該各物体ＴＬ１，ＴＬ２，ＴＬ３の画像領域を表すバウンディングボックスＢ１，Ｂ２，Ｂ３の一例を示すものである。同図に示すように、バウンディングボックスＢ１，Ｂ２，Ｂ３の形状とサイズは、物体ＴＬ１，ＴＬ２，ＴＬ３の形状とサイズに応じて変化する。 Figure 8 shows an example of the detected objects TL1, TL2, TL3 and the bounding boxes B1, B2, B3 representing the image areas of the objects TL1, TL2, TL3. As shown in the figure, the shapes and sizes of the bounding boxes B1, B2, B3 change depending on the shapes and sizes of the objects TL1, TL2, TL3.

干渉領域定義処理部１５は、先ずステップＳ２２において、検出された上記物体の１つを選択してこの物体とロボットアームＲＢの操作手との間の距離を計算する。この距離は、物体およびロボットアームＲＢの操作手それぞれの３次元座標情報をもとに計算される。なお、物体およびロボットアームＲＢの操作手の３次元座標情報は、それぞれのバウンディングボックスの位置を示す座標情報と点群情報とから得られる。First, in step S22, the interference area definition processing unit 15 selects one of the detected objects and calculates the distance between this object and the operator's hand of the robot arm RB. This distance is calculated based on the three-dimensional coordinate information of the object and the operator's hand of the robot arm RB. The three-dimensional coordinate information of the object and the operator's hand of the robot arm RB is obtained from the coordinate information and point cloud information indicating the position of each bounding box.

干渉領域定義処理部１５は、続いてステップＳ２３において、算出された上記距離を予め設定されたしきい値と比較する。そして、距離がしきい値を超える物体を、ロボットアームＲＢでは操作することが不可能な物体であると判断し、当該物体を判定対象から除外する。In step S23, the interference area definition processing unit 15 then compares the calculated distance with a preset threshold value. If the distance exceeds the threshold value, the interference area definition processing unit 15 determines that the object cannot be operated by the robot arm RB, and excludes the object from the objects to be determined.

干渉領域定義処理部１５は、続いてステップＳ２４において、上記除外処理により除外されなかった物体、つまり距離がしきい値の範囲内にある各物体に対し干渉領域を定義する処理を行う。干渉領域の定義は、例えば物体の中心位置を基準とする球体を用いて行われる。干渉領域を示す球体の直径を算出するには、先ずバウンディングボックス内に存在する物体の両端を探索する。続いて、物体の両端となる点の候補の中から、バウンディングボックスの中心位置を通る直線を結び、その距離が最も大きい２点を選び、選んだ２点間の距離を球体の直径とする。In step S24, the interference area definition processing unit 15 then performs a process of defining an interference area for each object that was not excluded by the above exclusion process, that is, for each object whose distance is within the threshold range. The interference area is defined, for example, by using a sphere based on the center position of the object. To calculate the diameter of the sphere that represents the interference area, first, both ends of the object that exists within the bounding box are searched for. Next, from among the candidates for the points that are the both ends of the object, a straight line passing through the center position of the bounding box is drawn to connect the two points with the greatest distance, and the distance between the two selected points is set as the diameter of the sphere.

干渉領域定義処理部１５は、最後にステップＳ２５において、上記一連の処理により得られた物体を表す情報を物体情報記憶部３３に記憶させる。このとき記憶される物体を表す情報には、物体ＩＤと、物体の位置および速度と、物体の干渉領域を表す球体情報と、速度ベクトルが含まれる。図１０に、物体情報記憶部３３に記憶される物体情報の一例を示す。Finally, in step S25, the interference area definition processing unit 15 stores the information representing the object obtained by the above series of processes in the object information storage unit 33. The information representing the object stored at this time includes the object ID, the position and speed of the object, the sphere information representing the interference area of the object, and the speed vector. Figure 10 shows an example of the object information stored in the object information storage unit 33.

（４）干渉するか否かの判定
接触対象推定装置ＥＵの制御部１は、次に干渉判定処理部１６の制御の下、ステップＳ１６において、検出された各物体とロボットアームＲＢの操作手との間の干渉の有無を判定する処理を以下のように実行する。 (4) Determination of Whether or Not Interference Will Occur Next, under the control of the interference determination processing unit 16, in step S16, the control unit 1 of the contact object estimation device EU executes a process of determining whether or not there is interference between each detected object and the operating hand of the robot arm RB as follows.

図６は、上記ステップＳ１６において干渉判定処理部１６が実行する判定処理の処理手順と処理内容の一例を示すフローチャートである。 Figure 6 is a flowchart showing an example of the processing procedure and processing content of the determination processing performed by the interference determination processing unit 16 in step S16 above.

干渉判定処理部１６は、先ずステップＳ３０により、物体情報記憶部３３から物体情報のリストを読み込むと共に、手情報記憶部３２から操作者ＵＳの手操作の動きを表す手情報を読み込む。そして、干渉判定処理部１６は、読み込まれた上記物体情報と上記手情報とをもとに、物体ごとに予め用意された判定式を用いて干渉の有無を判定する処理を、ステップＳ３１～ステップ３６において実行する。First, in step S30, the interference determination processing unit 16 reads a list of object information from the object information storage unit 33, and also reads hand information representing the hand operation movement of the operator US from the hand information storage unit 32. Then, in steps S31 to S36, the interference determination processing unit 16 executes a process of determining the presence or absence of interference based on the read object information and hand information using a determination formula prepared in advance for each object.

すなわち、干渉判定で用いる要素は、操作者ＵＳの手および物体の各位置ベクトルと各速度ベクトルである。ある時刻t [秒]における物体Aおよび手Hの各位置ベクトルおよび各速度ベクトルはそれぞれ
と表される。 That is, the elements used in collision detection are the position vectors and velocity vectors of the hand of the operator US and the object. The position vectors and velocity vectors of the object A and the hand H at a certain time t [seconds] are respectively
This is expressed as:

同様に、操作者ＵＳの手と物体との相対位置および相対速度は、
と表される。 Similarly, the relative position and the relative velocity between the hand of the operator US and the object are expressed as follows:
This is expressed as:

現在時刻をt₀秒としたとき、任意の時刻t [秒]の時点における操作者ＵＳの手と物体との相対位置は、
と表すことができる。 When the current time is t ₀ seconds, the relative position between the hand of the operator US and the object at any time t [seconds] is
It can be expressed as:

この式において、３次元的な動きを扱うため、上記位置ベクトルおよび速度ベクトルをそれぞれ３軸1 ,2 ,3 で表すと、
のようになる。 In this formula, in order to handle three-dimensional movement, the position vector and velocity vector are expressed by three axes 1, 2, and 3, respectively, as follows:
It will look like this.

式(1) を解くと、
…(2)
となる。 Solving equation (1), we get
…(2)
It becomes.

ここで、a ，b ，c はそれぞれ
のように表される。 Here, a, b, and c are
It can be expressed as follows.

また、解である干渉開始時刻t₁ および干渉終了時刻t₂ はそれぞれ、
となる。 The solution, interference start time _t1 and interference end time _t2 , are respectively
It becomes.

上記式の中の
の値は、物体情報に含まれている干渉領域を表す球体の半径と、手情報に含まれる操作者ＵＳの手の干渉領域を表す球体の半径との和を示すものである。 In the above formula
The value of indicates the sum of the radius of the sphere representing the interference region included in the object information and the radius of the sphere representing the interference region of the hand of the operator US included in the hand information.

干渉判定処理部１６は、上記(2) 式のルートの中が正である場合には「干渉有り」、一方負の場合には「干渉無し」と判定される。「干渉有り」と判定された場合、上記判定式では、干渉開始時刻t₁ [秒]と干渉終了時刻t₂ [秒]との間で等速直線運動することを前提として計算をしている。その結果、現在位置、速度および干渉時刻から、干渉位置を
として求めることができる。 The interference determination processing unit 16 determines that "interference exists" if the value in the root of the above formula (2) is positive, and determines that "no interference exists" if the value is negative. When it is determined that "interference exists," the above determination formula is calculated on the assumption that there is uniform linear motion between the interference start time _t1 [seconds] and the interference end time _t2 [seconds]. As a result, the interference position can be determined from the current position, speed, and interference time.
It can be calculated as:

干渉判定処理部１６は、判定対象となった物体について、干渉の有無の判定結果を表す情報と、上記計算により得られた干渉時刻および干渉位置とを含む干渉情報を、物体ＩＤに対応付けて物体情報記憶部３３に追加記憶する。The interference determination processing unit 16 additionally stores, in the object information storage unit 33, interference information including information indicating the result of the determination of whether or not there is interference for the object being determined, and the interference time and interference position obtained by the above calculation, in association with the object ID.

以後同様に干渉判定処理部１６は、上記一連の干渉判定処理を物体情報リストに含まれるすべての物体についてそれぞれ行う。 The collision detection processing unit 16 then performs the above series of collision detection processes for each of the objects included in the object information list.

（５）接触対象となる物体の選定
接触対象推定装置ＥＵの制御部１は、最後に接触対象選定処理部１７の制御の下、ステップＳ１７において、ロボットアームＲＢの操作手の接触対象となる可能性が最も高い物体を選定する処理を以下のように実行する。 (5) Selection of object to be contacted Finally, in step S17, under the control of the contact target selection processing unit 17, the control unit 1 of the contact target estimation device EU executes a process of selecting an object that is most likely to be a contact target of the operating hand of the robot arm RB as follows.

図７は、接触対象選定処理部１７が実行する選定処理の処理手順と処理内容の一例を示すフローチャートである。 Figure 7 is a flowchart showing an example of the processing procedure and processing content of the selection processing performed by the contact target selection processing unit 17.

すなわち、接触対象選定処理部１７は、先ずステップＳ４１により、手情報記憶部３２から操作者ＵＳの手情報を読み込むと共に、物体情報記憶部３３から干渉有りと判定された物体に係る物体情報を読み込む。そして、接触対象選定処理部１７は、干渉有りと判定された各物体の各々について、ステップＳ４２～Ｓ４３によりスコア計算を行う。That is, first, in step S41, the contact target selection processing unit 17 reads hand information of the operator US from the hand information storage unit 32, and also reads object information related to the objects determined to be interfering from the object information storage unit 33. Then, the contact target selection processing unit 17 performs score calculation for each of the objects determined to be interfering in steps S42 to S43.

例えば、接触対象選定処理部１７は、干渉有りと判定された物体の各々について、当該物体に対し操作者ＵＳの手の表側と裏側のいずれが対向しているかを表す情報FBと、操作者ＵＳの手が物体の中心をどれだけ捉えているかを表す指標である中心捕捉度CCをそれぞれ算出する。For example, for each object determined to have interfered, the contact target selection processing unit 17 calculates information FB indicating whether the front or back of the hand of the operator US faces the object, and a central capture degree CC, which is an index indicating how well the hand of the operator US captures the center of the object.

接触対象選定処理部１７は、次に、ステップＳ４４において、上記手の表裏のいずれが対向しているかを表す情報FBと、上記中心捕捉度CCとをもとに、接触の可能性を示すコストCostを
Cost＝CC×FB
により計算する。 Next, in step S44, the contact target selection processing unit 17 calculates a cost Cost indicating the possibility of contact based on the information FB indicating whether the hand is facing the front or back and the center capture degree CC.
Cost＝CC×FB
It is calculated as follows.

ここで、上記中心捕捉度CCは、操作者ＵＳの手と物体それぞれの干渉領域が最接近したときの距離であり、
として計算される。 Here, the central capture degree CC is the distance when the interference areas of the hand of the operator US and the object are closest to each other,
It is calculated as:

すなわち、中心捕捉度CCは、干渉が発生する期間の開始時刻t₁ と終了時刻t₂ との中間の時刻である（t₁＋t₂）／2における、操作者ＵＳの手と物体との間の距離として計算される。 That is, the central capture degree CC is calculated as the distance between the hand of the operator US and the object at (t ₁ + t ₂ )/2, which is the midpoint between the start time t ₁ and the end time t ₂ of the period in which interference occurs.

また、物体に対し操作者ＵＳの手の表側と裏側のいずれが対向しているかを表す情報FBは、物体に対し手の平が対向するときの値が、手の甲が対向するときの値よりも小さくなるように予め設定されるパラメータであり、例えば、手の裏側に対しては“1.0”、手の表側に対しては“1.2”がそれぞれ設定される。これは、一般に物体に対する接触・把持の操作が操作者ＵＳの手の平、つまりロボットアームＲＢの手の平側で行われるからである。なお、上記パラメータの値はいずれも任意に調整可能である。 Information FB, which indicates whether the front or back of the hand of the operator US is facing the object, is a parameter that is preset so that the value when the palm of the hand is facing the object is smaller than the value when the back of the hand is facing the object; for example, "1.0" is set for the back of the hand and "1.2" is set for the front of the hand. This is because contact and grasping of an object is generally performed with the palm of the operator US, that is, the palm side of the robot arm RB. The values of all of the above parameters can be adjusted arbitrarily.

そして、接触対象選定処理部１７は、物体間で上記コストCostを比較し、コストCostの値が最も小さい物体を、操作者ＵＳの手が把持する可能性が最も高い物体と判断して、接触対象の物体として選定する。
なお、選定に際しては、上記したように可能性が最も高い物体を１つ選定してもよいが、可能性が高い順に上位の所定個数の物体を選定するようにしてもよい。 Then, the contact target selection processing unit 17 compares the above-mentioned costs between the objects, and determines that the object with the smallest cost value is the object most likely to be grasped by the hand of the operator US, and selects it as the object to be contacted.
In addition, when making the selection, the one object with the highest possibility may be selected as described above, but a predetermined number of objects with the highest possibility may also be selected.

接触対象推定装置ＥＵの制御部１は、推定情報出力処理部１８の制御の下、ステップＳ１８において、選定された上記物体に係る物体情報をロボットＩ／Ｆ部６からロボット制御装置ＲＵへ出力する。ロボット制御装置ＲＵは、上記物体情報を加味してロボットアームＲＢの動作を制御する。In step S18, the control unit 1 of the contact object estimation device EU outputs object information relating to the selected object from the robot I/F unit 6 to the robot control device RU under the control of the estimated information output processing unit 18. The robot control device RU controls the operation of the robot arm RB taking into account the object information.

（作用・効果）
以上述べたように一実施形態では、接触対象推定装置ＥＵにおいて、カメラＣＭにより撮像された作業エリアの映像情報から当該映像フレームに映っている各物体をロボットアームＲＢの手の部位と共に検出し、検出された各物体のうちロボットアームＲＢでは操作不可能な距離にある物体を除外した後、除外されなかった各物体についてそれぞれ干渉領域を定義する。一方、それと並行して、操作者ＵＳに装着された動きセンサＳＳにより得られたセンサ情報から、操作者ＵＳの手の位置を表す手情報を生成する。 (Action and Effects)
As described above, in one embodiment, the contact target estimation device EU detects each object shown in the image frame from the image information of the working area captured by the camera CM together with the hand part of the robot arm RB, and after excluding objects that are at a distance that cannot be operated by the robot arm RB from each of the detected objects, defines an interference area for each object that has not been excluded. Meanwhile, in parallel with this, hand information indicating the position of the hand of the operator US is generated from sensor information obtained by the motion sensor SS attached to the operator US.

次に、上記各物体について、当該物体および操作者ＵＳの手の位置ベクトルと速度ベクトルをもとに、物体の干渉領域がロボットアームＲＢの操作手の干渉領域と干渉するか否かを判定する。そして、干渉すると判定された各物体について、操作者ＵＳの手の表裏のいずれが対向しているかを表す情報FBと、操作者ＵＳの手が物体の中心をどれだけ捉えているかを表す中心捕捉度CCとをもとにコストCostを算出し、算出されたコストCostをもとに上記各物体の中でロボットアームＲＢの手が把持する可能性が最も高い物体を選定し、選定された物体に係る物体情報をロボット制御に使用するようにしている。Next, for each of the above objects, it is determined whether the interference area of the object interferes with the interference area of the operator's hand of the robot arm RB, based on the position vector and velocity vector of the object and the hand of the operator US. Then, for each object determined to interfere, a cost Cost is calculated based on information FB indicating whether the front or back of the operator's hand is facing, and a center capture degree CC indicating how well the operator's hand captures the center of the object, and the object that is most likely to be grasped by the hand of the robot arm RB is selected from among the above objects based on the calculated cost Cost, and the object information related to the selected object is used for robot control.

従って、一実施形態によれば、操作者ＵＳの視線情報を用いることなく、操作者ＵＳの操作に応じてロボットアームＲＢが把持しようとする物体を推定できるようになる。その結果、HMD 等の追加デバイスを不要にしてシステムの複雑化を回避することが可能となる。Therefore, according to one embodiment, it becomes possible to estimate the object that the robot arm RB is trying to grasp in response to the operation of the operator US without using the line of sight information of the operator US. As a result, it becomes possible to eliminate the need for additional devices such as an HMD and to avoid complicating the system.

また、映像中から検出された各物体に対しそれぞれ干渉領域を定義して、ロボットアームＲＢの手に設定された干渉領域と干渉する可能性がある物体を特定し、特定された物体の中からロボットアームＲＢが把持する可能性が最も高い物体を選定するようにしている。このため、作業エリアに操作対象の候補となる複数の物体が存在する場合でも、操作対象の物体を短時間に推定できるようになり、これにより処理のリアルタイム性を維持することが可能となる。 In addition, an interference area is defined for each object detected in the video, objects that may interfere with the interference area set in the hand of the robot arm RB are identified, and from among the identified objects, the object that is most likely to be grasped by the robot arm RB is selected. As a result, even if there are multiple objects in the working area that are candidates for operation targets, it is possible to quickly estimate the object to be operated, thereby maintaining real-time processing.

さらに、上記各物体に対し干渉領域を定義する際に、各物体とロボットアームＲＢとの間の距離がしきい値を超える物体を事前に除外し、除外されなかった物体のみを対象に干渉領域を定義するようにしている。このため、干渉領域を定義する処理に要する時間を短縮することができ、これにより処理のリアルタイム性をさらに高めることが可能となる。 Furthermore, when defining the interference area for each of the above objects, objects whose distance between each object and the robot arm RB exceeds a threshold value are excluded in advance, and the interference area is defined for only the objects that have not been excluded. This makes it possible to reduce the time required for the process of defining the interference area, thereby further improving the real-time nature of the process.

［その他の実施形態］
（１）一実施形態では、遠隔地に接触対象推定装置ＥＵを独立して設けた場合を例にとって説明したが、接触対象推定装置ＥＵのすべてまたは一部の機能をロボット制御装置ＲＵ内に設けるようにしてもよい。また、接触対象推定装置ＥＵは、必ずしも遠隔地側に配置する必要はなく操作側に配置してもよく、さらにはWeb 上またはクラウド上に配置してもよい。また、その際、接触対象推定装置ＥＵの処理機能を、遠隔地側と操作側にそれぞれ配置した情報処理装置、或いはWeb 上またはクラウド上に配置されたサーバ装置に分散配置するようにしてもよい。 [Other embodiments]
(1) In the embodiment, the contact object estimation device EU is independently provided at a remote location, but all or part of the functions of the contact object estimation device EU may be provided in the robot control device RU. The contact object estimation device EU does not necessarily have to be provided at the remote location, and may be provided at the operating side, or may be provided on the Web or on the cloud. In this case, the processing functions of the contact object estimation device EU may be distributed to information processing devices provided at the remote location and the operating side, or a server device provided on the Web or on the cloud.

（２）一実施形態では、RGB-D カメラを用いた場合を例にとって説明した。しかし、通常のカメラを用いてもよい。この場合、接触対象推定装置ＥＵに、カメラにより撮像された映像情報から物体映像を検出する機能と、検出された物体映像からデプス情報を生成する機能を持たせることで、一実施形態と同様に接触対象となる物体を推定することが可能となる。 (2) In the embodiment, an example has been described in which an RGB-D camera is used. However, a normal camera may also be used. In this case, by providing the contact object estimation device EU with a function for detecting an object image from video information captured by the camera and a function for generating depth information from the detected object image, it becomes possible to estimate an object to be contacted in the same manner as in the embodiment.

（３）物体映像を検出する手段としては、例えば、映像フレームを入力として、当該映像フレームに写っている物体のクラスと位置を表す情報を出力する機械学習モデルを使用することができる。また、デプス情報を生成する手段としては、例えば、映像フレームを入力として、当該映像フレームに写っている画像のカメラＣＭからの画素毎の距離分布を表すデプス情報を出力する距離情報検出用の機械学習モデルを使用することができる。(3) As a means for detecting an object image, for example, a machine learning model that takes a video frame as input and outputs information representing the class and position of an object appearing in the video frame can be used. Also, as a means for generating depth information, for example, a machine learning model for detecting distance information that takes a video frame as input and outputs depth information representing the distance distribution of each pixel from the camera CM of the image appearing in the video frame can be used.

（４）一実施形態では、接触対象として工具や部品、装置等の物体を検出する場合を例にとって説明した。しかし、接触対象としては、他に人や動物、植物などの生物であってもよい。(4) In one embodiment, the case where objects such as tools, parts, and devices are detected as contact targets is described as an example. However, the contact target may also be other living things such as people, animals, and plants.

（５）カメラＣＭは１台に限らず複数台配置して、これらのカメラＣＭの映像を選択的に用いて、或いは合成して物体の検出を行うようにしてもよい。さらに、操作者の動きを検出する動きセンサとしては加速度センサを用いてもよく、動きの検出対象となる部位は手に限らず足や腰、首であってもよい。(5) The number of cameras CM may not be limited to one, but may be multiple, and the images from these cameras CM may be selectively used or combined to detect an object. Furthermore, an acceleration sensor may be used as a motion sensor to detect the motion of the operator, and the body parts to be detected for motion detection may be not only the hands, but also the legs, waist, and neck.

その他、接触対象推定装置の機能構成や処理手順および処理内容等については、この発明の要旨を逸脱しない範囲で種々変形して実施可能である。In addition, the functional configuration, processing procedures, processing contents, etc. of the contact object estimation device can be modified in various ways without departing from the spirit and scope of this invention.

以上、この発明の実施形態を詳細に説明してきたが、前述までの説明はあらゆる点においてこの発明の例示に過ぎない。この発明の範囲を逸脱することなく種々の改良や変形を行うことができることは言うまでもない。つまり、この発明の実施にあたって、実施形態に応じた具体的構成が適宜採用されてもよい。 Although the embodiments of the present invention have been described in detail above, the above description is merely an example of the present invention in every respect. It goes without saying that various improvements and modifications can be made without departing from the scope of the present invention. In other words, in implementing the present invention, specific configurations according to the embodiments may be appropriately adopted.

要するにこの発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態に亘る構成要素を適宜組み合せてもよい。In short, this invention is not limited to the above-described embodiment as it is, and in the implementation stage, the components can be modified and embodied without departing from the gist of the invention. Furthermore, various inventions can be formed by appropriate combinations of multiple components disclosed in the above-described embodiment. For example, some components may be deleted from all of the components shown in the embodiment. Furthermore, components from different embodiments may be appropriately combined.

ＥＵ…接触対象推定装置
ＣＭ…カメラ
ＲＵ…ロボット制御装置
ＲＢ…ロボットアーム
ＵＳ…操作者
ＯＵ…操作側端末装置
ＳＳ…動きセンサ
ＭＴ…モニタ
１…制御部
２…プログラム記憶部
３…データ記憶部
４…カメラＩ／Ｆ部
５…通信Ｉ／Ｆ部
６…ロボットＩ／Ｆ部
７…バス
１１…映像情報取得処理部
１２…映像情報送信処理部
１３…手情報生成処理部
１４…物体検出処理部
１５…干渉領域定義処理部
１６…干渉判定処理部
１７…接触対象選定処理部
１８…推定情報出力処理部
３１…映像情報記憶部
３２…手情報記憶部
３３…物体情報記憶部
EU...contact object estimation device CM...camera RU...robot control device RB...robot arm US...operator OU...operating side terminal device SS...motion sensor MT...monitor 1...control unit 2...program storage unit 3...data storage unit 4...camera I/F unit 5...communication I/F unit 6...robot I/F unit 7...bus 11...video information acquisition processing unit 12...video information transmission processing unit 13...hand information generation processing unit 14...object detection processing unit 15...interference area definition processing unit 16...interference determination processing unit 17...contact object selection processing unit 18...estimated information output processing unit 31...video information storage unit 32...hand information storage unit 33...object information storage unit

Claims

1. A contact object estimation device used in a system for remotely controlling an object present in a work area at a remote location by operating a robot at the remote location in response to an operation by an operator, comprising:
a first processing unit that detects a plurality of the objects appearing in a video frame of the video information captured in the work area, and defines a first interference region for each of the detected plurality of the objects;
a second processing unit that determines whether or not there is interference between each of the plurality of objects and the robot, based on the first interference region defined for the object, a second interference region set for the robot, and operation information of the operator; and
and a third processing unit that selects the object that the robot is about to come into contact with from among the plurality of objects determined to be interfering, based on information representing a positional relationship between the object and the robot.

The first processing unit includes:
a processing unit that excludes an object from the plurality of objects, the object being located beyond a preset range of distance between the object and the robot;
The contact object estimation device according to claim 1 , further comprising: a processing unit that defines the first interference region for the object that has not been excluded.

The contact object estimation device of claim 1, wherein the second processing unit calculates a relative position between the object and the robot at any time based on the position vectors and velocity vectors of the object and the robot, and determines whether or not there is interference between the object and the robot based on the calculated relative position.

The contact object estimation device of claim 3, wherein the second processing unit calculates the relative position in three-dimensional space using the position vector of three axes and the velocity vector of three axes.

The contact target estimation device of claim 1, wherein the third processing unit selects the object that the robot is about to contact based on the distance between the robot and the object when the robot and the object are closest to each other.

The contact target estimation device of claim 5, wherein the third processing unit selects the object that the robot is about to contact by taking into account the distance between the robot and the object when the robot and the object are closest to each other, and information indicating whether the object is facing the front or back of the operator's hand.

1. A contact target estimation method executed by an information processing device used in a system for remotely controlling an object present in a work area at a remote location by operating a robot at the remote location in response to an operation by an operator, comprising:
a first processing step of detecting a plurality of objects captured in a video frame of the video information captured in the work area, and defining a first interference region for each of the detected plurality of objects;
a second processing step of determining whether or not there is interference between each of the plurality of objects and the robot, based on the first interference region defined for the object, a second interference region set for the robot, and operation information of the operator;
and a third processing step of selecting the object that the robot is about to come into contact with from among the plurality of objects determined to be interfering, based on information representing a positional relationship between the object and the robot.

A program for causing a processor provided in a contact target estimation device to execute at least one of the first processing unit, the second processing unit, and the third processing unit provided in the contact target estimation device described in any one of claims 1 to 6.