JP7694885B2

JP7694885B2 - Method and computing system for performing or facilitating physical edge detection - Patents.com

Info

Publication number: JP7694885B2
Application number: JP2022118399A
Authority: JP
Inventors: ユ，ジンゼ; ロドリゲス，ホセジェロニモモレイラ
Original assignee: Mujin Inc
Current assignee: Mujin Inc
Priority date: 2020-06-04
Filing date: 2022-07-26
Publication date: 2025-06-18
Anticipated expiration: 2041-06-03
Also published as: JP2022534342A; JP2022136177A; JP7118382B2; CN114096331A; WO2021246476A1

Description

関連出願の相互参照
本出願は、２０２０年６月４日に出願された、「ＲＯＢＯＴＩＣＳＹＳＴＥＭＷＩＴＨＶＩＳＩＯＮＭＥＣＨＡＮＩＳＭ」と題する、米国仮特許出願第第６３／０３４，４０３号の優先権を主張する、２０２１年５月２７日に出願された、「ＭＥＴＨＯＤＡＮＤＣＯＭＰＵＴＩＮＧＳＹＳＴＥＭＦＯＲＰＥＲＦＯＲＭＩＮＧＯＲＦＡＣＩＬＩＴＡＴＩＮＧＰＨＹＳＩＣＡＬＥＤＧＥＤＥＴＥＣＴＩＯＮ」と題する、米国特許出願第１７／３３１，８７８号の優先権を主張し、その全体の内容が参照により本明細書に組み込まれる。 CROSS-REFERENCE TO RELATED APPLICATIONS This application claims priority to U.S. Provisional Patent Application No. 63/034,403, entitled "ROBOTIC SYSTEM WITH VISION MECHANISME," filed on June 4, 2020, which claims priority to U.S. Provisional Patent Application No. 17/331,878, entitled "METHOD AND COMPUTING SYSTEM FOR PERFORMING OR FACILITATING PHYSICAL EDGE DETECTION," filed on May 27, 2021, the entire contents of which are incorporated herein by reference.

本開示は、物理的エッジ検出を実行または促進するための計算システムおよび方法に関する。 The present disclosure relates to computational systems and methods for performing or facilitating physical edge detection.

自動化がより一般的になるに従い、倉庫保管および小売環境など、より多くの環境においてロボットが使用される。例えば、ロボットは、倉庫の中にある物体と相互作用するように使用され得る。ロボットの動作は、一定であってもよく、または倉庫の中のセンサーによって生成された情報などの、入力に基づいてもよい。 As automation becomes more common, robots are used in more environments, such as warehousing and retail environments. For example, robots may be used to interact with objects in a warehouse. The robot's actions may be fixed or may be based on inputs, such as information generated by sensors in the warehouse.

本開示の一態様は、計算システム、または計算システムによって行われる方法に関する。計算システムは、通信インターフェイスおよび少なくとも一つの処理回路を含み得る。通信インターフェイスは、ロボット、およびカメラ視野を有するカメラと通信するように構成され得る。少なくとも一つの処理回路は、物体のグループがカメラ視野の中にあるとき、カメラによって生成される、物体のグループを表す画像情報を受信することと、画像情報から、物体のグループに関連付けられる複数の候補エッジを識別することであって、複数の候補エッジが、物体のグループの物理的エッジを表すためのそれぞれの候補を形成する、画像位置または物理的位置のそれぞれのセットであるか、またはそれらを含むことと、複数の候補エッジが、第一の画像領域と第二の画像領域との間の境界に基づいて形成される第一の候補エッジを含む場合、画像情報が、第一の候補エッジで定義された暗さ条件を満たすかどうかを判定することであって、第一の画像領域が、第二の画像領域よりも暗く、第一の画像領域および第二の画像領域が、画像情報によって記述されるそれぞれの領域であることと、複数の候補エッジのサブセットを選択して、物体のグループの物理的エッジを表すための候補エッジの選択されたサブセットを形成することであって、選択することが、画像情報が第一の候補エッジで定義された暗さ条件を満たすかどうかに基づいて、候補エッジの選択されたサブセット内に第一の候補エッジを含めることによって、物体のグループの物理的エッジの少なくとも一つを表す候補として、第一の候補エッジを保持するかどうかを判定することを含むことと、を実行するように構成され得る。 One aspect of the present disclosure relates to a computing system, or a method performed by a computing system. The computing system may include a communications interface and at least one processing circuit. The communications interface may be configured to communicate with a robot and a camera having a camera field of view. At least one processing circuit may be configured to perform the following: receiving image information representing the group of objects, generated by the camera when the group of objects is within the camera field of view; identifying from the image information a plurality of candidate edges associated with the group of objects, the plurality of candidate edges being or including respective sets of image locations or physical locations forming respective candidates for representing a physical edge of the group of objects; if the plurality of candidate edges includes a first candidate edge formed based on a boundary between a first image region and a second image region, determining whether the image information satisfies a darkness condition defined by the first candidate edge, the first image region being darker than the second image region, the first image region and the second image region being respective regions described by the image information; and selecting a subset of the plurality of candidate edges to form a selected subset of candidate edges for representing the physical edge of the group of objects, the selecting including determining whether to retain the first candidate edge as a candidate for representing at least one of the physical edges of the group of objects by including the first candidate edge in the selected subset of candidate edges based on whether the image information satisfies the darkness condition defined by the first candidate edge.

本明細書の実施形態と合致する、物理的エッジ検出を行うため、または促進するためのシステムを示す。1 illustrates a system for performing or facilitating physical edge detection consistent with embodiments herein. 本明細書の実施形態と合致する、物理的エッジ検出を行うため、または促進するためのシステムを示す。1 illustrates a system for performing or facilitating physical edge detection consistent with embodiments herein. 本明細書の実施形態と合致する、物理的エッジ検出を行うため、または促進するためのシステムを示す。1 illustrates a system for performing or facilitating physical edge detection consistent with embodiments herein. 本明細書の実施形態と合致する、物理的エッジ検出を行うため、または促進するためのシステムを示す。1 illustrates a system for performing or facilitating physical edge detection consistent with embodiments herein.

本明細書の実施形態と合致する、物理的エッジ検出を行うためまたは促進するために構成される、計算システムを示すブロック図を提供する。1 provides a block diagram illustrating a computing system configured to perform or facilitate physical edge detection consistent with embodiments herein. 本明細書の実施形態と合致する、物理的エッジ検出を行うためまたは促進するために構成される計算システムを示すブロック図を提供する。1 provides a block diagram illustrating a computing system configured to perform or facilitate physical edge detection consistent with embodiments herein. 本明細書の実施形態と合致する、物理的エッジ検出を行うためまたは促進するために構成される計算システムを示すブロック図を提供する。1 provides a block diagram illustrating a computing system configured to perform or facilitate physical edge detection consistent with embodiments herein. 本明細書の実施形態と合致する、物理的エッジ検出を行うためまたは促進するために構成される計算システムを示すブロック図を提供する。1 provides a block diagram illustrating a computing system configured to perform or facilitate physical edge detection consistent with embodiments herein.

本明細書の実施形態による、物理的エッジ検出が行われ得る環境を示す。1 illustrates an environment in which physical edge detection may be performed, according to embodiments herein. 本明細書の実施形態による、物理的エッジ検出が行われ得る環境を示す。1 illustrates an environment in which physical edge detection may be performed, according to embodiments herein. 本明細書の実施形態による、物理的エッジ検出が行われ得る環境を示す。1 illustrates an environment in which physical edge detection may be performed, according to embodiments herein.

本明細書の実施形態による、候補エッジを処理する方法を示すフロー図を提供する。1 provides a flow diagram illustrating a method for processing candidate edges according to an embodiment herein.

本明細書の実施形態による、候補エッジが識別され得る例示の画像情報を示す。1 illustrates example image information in which candidate edges may be identified, according to embodiments herein. 本明細書の実施形態による、候補エッジが識別され得る例示の画像情報を示す。1 illustrates example image information in which candidate edges may be identified, according to embodiments herein.

本明細書の実施形態による、画像情報から識別された候補エッジを示す。4 illustrates candidate edges identified from image information according to embodiments herein. 本明細書の実施形態による、画像情報から識別された候補エッジを示す。4 illustrates candidate edges identified from image information according to embodiments herein.

本明細書の実施形態による、画像情報から識別された候補エッジを示す。4 illustrates candidate edges identified from image information according to embodiments herein. 本明細書の実施形態による、画像情報から識別された候補エッジを示す。4 illustrates candidate edges identified from image information according to embodiments herein. 本明細書の実施形態による、画像情報から識別された候補エッジを示す。4 illustrates candidate edges identified from image information according to embodiments herein.

本明細書の実施形態による、入射光がカメラ視野内の物体を反射する拡散反射に基づいて画像が生成され得る環境を示す。1 illustrates an environment in which an image may be generated based on diffuse reflection of incident light off objects within the camera's field of view, according to embodiments herein.

本明細書の実施形態による、候補エッジの周りの画像強度プロファイルを示す。1 illustrates an image intensity profile around a candidate edge according to an embodiment herein. 本明細書の実施形態による、候補エッジの周りの画像強度プロファイルを示す。1 illustrates an image intensity profile around a candidate edge according to an embodiment herein. 本明細書の実施形態による、候補エッジの周りの画像強度プロファイルを示す。1 illustrates an image intensity profile around a candidate edge according to an embodiment herein. 本明細書の実施形態による、候補エッジに対応する位置の周りの領域の奥行き値を示す。1 illustrates depth values of regions around locations corresponding to candidate edges, according to embodiments herein.

本明細書の実施形態による、候補エッジの周りの画像強度プロファイルを示す。1 illustrates an image intensity profile around a candidate edge according to an embodiment herein. 本明細書の実施形態による、候補エッジの周りの画像強度プロファイルを示す。1 illustrates an image intensity profile around a candidate edge according to an embodiment herein. 本明細書の実施形態による、候補エッジの周りの画像強度プロファイルを示す。1 illustrates an image intensity profile around a candidate edge according to an embodiment herein.

本明細書の実施形態による、物理的エッジ部の周りの位置の奥行き値を示す一方、図１１Ｄは、物理的エッジを表す画像の画像強度プロファイルを示す。FIG. 11D illustrates an image intensity profile of an image representing a physical edge, while FIG. 11D illustrates depth values of positions around a physical edge, according to embodiments herein. 本明細書の実施形態による、物理的エッジ部の周りの位置の奥行き値を示す。1 illustrates depth values for positions around a physical edge, according to embodiments herein. 本明細書の実施形態による、物理的エッジ部の周りの位置の奥行き値を示す。1 illustrates depth values for positions around a physical edge, according to embodiments herein. 本明細書の実施形態による、物理的エッジを表す画像の画像強度プロファイルを示す。1 illustrates an image intensity profile of an image representing a physical edge, according to embodiments herein.

本明細書の実施形態による、隣接する画像領域よりも暗い画像領域を有する物体を表す、画像に対する画像強度プロファイルを示す。1 illustrates an image intensity profile for an image depicting an object having an image region that is darker than adjacent image regions, according to embodiments herein. 本明細書の実施形態による、隣接する画像領域よりも暗い画像領域を有する物体を表す、画像に対する画像強度プロファイルを示す。1 illustrates an image intensity profile for an image depicting an object having an image region that is darker than adjacent image regions, according to embodiments herein. 本明細書の実施形態による、隣接する画像領域よりも暗い画像領域を有する物体を表す、画像に対する画像強度プロファイルを示す。1 illustrates an image intensity profile for an image depicting an object having an image region that is darker than adjacent image regions, according to embodiments herein.

本明細書の実施形態による、物理的ギャップによって分離され、一方の物体が他方の物体よりも暗い、二つの物体を表す、画像に対する画像強度プロファイルを示す。1 illustrates an image intensity profile for an image representing two objects separated by a physical gap, one object being darker than the other, according to embodiments herein. 本明細書の実施形態による、物理的ギャップによって分離され、一方の物体が他方の物体よりも暗い、二つの物体を表す、画像に対する画像強度プロファイルを示す。1 illustrates an image intensity profile for an image representing two objects separated by a physical gap, one object being darker than the other, according to embodiments herein. 本明細書の実施形態による、物理的ギャップによって分離され、一方の物体が他方の物体よりも暗い、二つの物体を表す、画像に対する画像強度プロファイルを示す。1 illustrates an image intensity profile for an image representing two objects separated by a physical gap, one object being darker than the other, according to embodiments herein.

本開示の一態様は、物体のグループを表す画像情報を使用して、物体のグループの物理的エッジを検出またはその他の方法で識別することに関する。例えば、２Ｄ画像は、箱のグループを表してもよく、箱のグループの物理的エッジを潜在的に表し得る候補エッジを含んでもよい。計算システムは、画像情報内の候補エッジを使用して、画像情報内に表される個々の物体を区別し得る。一部の実例では、計算システムは、個々の箱を識別する情報を使用して、個々の箱に関与するロボット相互作用を制御し得る。例えば、ロボット相互作用は、ロボットのエンドエフェクター装置が物体の一つに接近し、物体をピックアップし、物体を目的地の位置に動かす、パレットから降ろす動作を含み得る。 One aspect of the disclosure relates to detecting or otherwise identifying physical edges of a group of objects using image information representing the group of objects. For example, a 2D image may represent a group of boxes and may include candidate edges that may potentially represent physical edges of the group of boxes. A computing system may use the candidate edges in the image information to distinguish between individual objects represented in the image information. In some instances, a computing system may use information identifying the individual boxes to control robotic interactions involving the individual boxes. For example, the robotic interactions may include an unpalletizing operation in which an end effector device of a robot approaches one of the objects, picks up the object, and moves the object to a destination location.

一部のシナリオでは、２Ｄ画像または他の画像情報は、カメラ視野内の物体の実際の物理的エッジに対応しない候補エッジであり得る、偽エッジである候補エッジを含んでもよい。従って、本開示の一態様は、候補エッジを評価して、候補エッジが、偽エッジであるのとは対照的に、実際の物理的エッジに対応する信頼レベルを決定することに関する。実施形態では、こうした決定は、特定の物理的エッジが画像にどのように現れる可能性が高いかに関する予期または予測に基づいてもよい。より具体的には、こうした決定は、物理的エッジが物体間の物理的ギャップと関連付けられる場合（例えば、物理的エッジが物理的ギャップの一方の側面を形成する）、こうした物理的ギャップが画像において非常に暗いように見えてもよく、および／または物理的ギャップに対応する画像領域の画像強度のスパイク減少を特徴とする画像強度プロファイルを有し得るという予期に基づいてもよい。従って、本開示の方法または計算システムは、物体間の物理的ギャップ、特に狭い物理的ギャップが、物理的ギャップが、画像の中でどのように暗いかに関連する特定の特性を有する画像によって表され得るとの予期に基づいて動作し得る。画像のこうした特徴または特徴は、ダークプライアと呼んでもよく、本開示は、ダークプライアを検出することに関連してもよく、ダークプライアの存在は、候補エッジが実際の物理的エッジに対応するかどうかに関する信頼レベルを増加させ得る。 In some scenarios, a 2D image or other image information may include candidate edges that are false edges, which may be candidate edges that do not correspond to actual physical edges of objects in the camera field of view. Thus, one aspect of the present disclosure relates to evaluating candidate edges to determine a confidence level that the candidate edge corresponds to an actual physical edge as opposed to being a false edge. In an embodiment, such a determination may be based on an expectation or prediction regarding how a particular physical edge is likely to appear in an image. More specifically, such a determination may be based on an expectation that when a physical edge is associated with a physical gap between objects (e.g., the physical edge forms one side of the physical gap), such a physical gap may appear very dark in the image and/or may have an image intensity profile characterized by a spiked decrease in image intensity in the image region corresponding to the physical gap. Thus, a method or computational system of the present disclosure may operate based on an expectation that a physical gap between objects, particularly a narrow physical gap, may be represented by an image having certain characteristics related to how dark the physical gap is in the image. Such features or characteristics of an image may be referred to as dark priors, and the present disclosure may relate to detecting dark priors, the presence of which may increase the confidence level as to whether a candidate edge corresponds to an actual physical edge.

実施形態では、本開示の方法またはシステムは、画像が、候補エッジで定義された暗さ条件を満たすかどうかを判定してもよく、定義された暗さ条件は、ダークプライアを検出することに関連し得る。より具体的には、定義された暗さ条件は、以下でより詳細に論じる、暗さ閾値基準、および／またはスパイク強度プロファイル基準によって定義され得る。本実施形態では、計算システムまたは方法が、画像が候補エッジで定義された暗さ条件を満たすと判定する場合、候補エッジが、二つの物体の間の物理的ギャップの一方の側面を形成する物理的エッジなど、実際の物理的エッジに対応する、より大きな信頼レベルがあり得る。一部の実例では、画像が候補エッジで定義された暗さ条件を満たさない場合、候補エッジが偽エッジである可能性がより高い場合がある。 In an embodiment, a method or system of the present disclosure may determine whether an image satisfies a candidate edge-defined darkness condition, which may be related to detecting a dark prior. More specifically, the defined darkness condition may be defined by a darkness threshold criterion and/or a spike intensity profile criterion, which are discussed in more detail below. In this embodiment, if the computing system or method determines that an image satisfies a candidate edge-defined darkness condition, there may be a greater confidence level that the candidate edge corresponds to an actual physical edge, such as a physical edge that forms one side of a physical gap between two objects. In some instances, if an image does not satisfy a candidate edge-defined darkness condition, the candidate edge may be more likely to be a false edge.

本開示の一態様は、２Ｄ画像情報を使用して、３Ｄ画像情報の限定を補償することに関し、その逆も同様である。例えば、二つ以上の箱などの複数の物体が互いに密接に隣り合わせに置かれ、狭い物理的ギャップによって分離される場合、３Ｄ画像情報は、物理的ギャップを捕捉する、またはそうでなければそれを表すのに十分な高い解像度を有しなくてもよい。従って、３Ｄ画像情報は、特に、複数の物体が、３Ｄ画像情報を生成するカメラに対して同じ奥行きを有する場合、複数の物体の個々の物体を区別するために使用される能力に限界を有し得る。こうした実施例では、複数の物体間の物理的ギャップは、２Ｄ画像情報に表され得る。より具体的には、物理的ギャップは、定義された暗さ条件を満たす画像領域によって表され得る。従って、こうした画像領域に関連付けられる候補エッジは、高レベルの信頼性で、物体の物理的エッジを表し得る。こうした状況では、２Ｄ画像情報の中の候補エッジは、物体のグループの個々の物体を区別するのに有用であり得る。従って、２Ｄ画像情報は、特定の状況において、個々の物体を区別する能力を高めることができる。 One aspect of the present disclosure relates to using 2D image information to compensate for limitations of 3D image information, and vice versa. For example, when multiple objects, such as two or more boxes, are placed closely next to each other and separated by a narrow physical gap, the 3D image information may not have a high enough resolution to capture or otherwise represent the physical gap. Thus, the 3D image information may have limitations in its ability to be used to distinguish individual objects of a group of objects, especially when the objects have the same depth relative to the camera generating the 3D image information. In such an embodiment, the physical gap between the objects may be represented in the 2D image information. More specifically, the physical gap may be represented by image regions that meet a defined darkness condition. Thus, the candidate edges associated with such image regions may represent the physical edges of the objects with a high level of confidence. In such a situation, the candidate edges in the 2D image information may be useful in distinguishing individual objects of a group of objects. Thus, the 2D image information may enhance the ability to distinguish individual objects in certain situations.

特定の状況では、３Ｄ画像情報は、２Ｄ画像情報の制限を補償し得る。例えば、２Ｄ画像は、２Ｄ画像中の特定の候補エッジで定義された暗さ条件を満たさなくてもよい。こうした例では、候補エッジは、カメラ視野内の任意の実際の物理的エッジ物体に対応する低信頼レベルを有し得る。３Ｄ画像情報は、２Ｄ画像情報の中の候補エッジが３Ｄ画像情報の中の候補エッジに対応する場合、２Ｄ画像情報におけるこの制限を補償するために使用され得る。より具体的には、２Ｄ画像情報中の候補エッジは、奥行きの急激な変化がある３Ｄ画像情報中の位置または位置のセットにマッピングされ得る。こうした状況では、３Ｄ画像情報は、２Ｄ画像情報の中の候補エッジが実際の物理的エッジに対応する信頼レベルを増加させるために使用され得る。 In certain circumstances, the 3D image information may compensate for limitations of the 2D image information. For example, the 2D image may not meet a darkness condition defined at a particular candidate edge in the 2D image. In such an example, the candidate edge may have a low confidence level that corresponds to any real physical edge object in the camera field of view. The 3D image information may be used to compensate for this limitation in the 2D image information if the candidate edge in the 2D image information corresponds to a candidate edge in the 3D image information. More specifically, a candidate edge in the 2D image information may be mapped to a location or set of locations in the 3D image information where there is an abrupt change in depth. In such circumstances, the 3D image information may be used to increase the confidence level that the candidate edge in the 2D image information corresponds to a real physical edge.

実施形態では、３Ｄ画像情報は、物体の表面（例えば、上部表面）を識別するために使用されてもよく、候補エッジは、二つの表面間の遷移がある位置に基づいて識別され得る。例えば、表面は、定義された測定分散閾値を超えて互いに逸脱しない３Ｄ画像情報中のそれぞれの奥行き値を有する位置のセットに基づいて識別され得る。定義された測定分散閾値は、３Ｄ画像情報の奥行き測定値にランダムな変動をもたらし得る、撮像ノイズ、製造公差、またはその他の要因の影響を記述し得る。識別される表面は、それぞれの奥行き値の平均である奥行き値と関連付けられ得る。いくつかの実施では、候補エッジは、定義された奥行き差閾値を超える、３Ｄ画像情報で識別された二つの表面間の奥行きの遷移を識別することに基づいて、３Ｄ画像情報で検出され得る。 In an embodiment, the 3D image information may be used to identify surfaces (e.g., top surfaces) of an object, and a candidate edge may be identified based on a location where there is a transition between two surfaces. For example, a surface may be identified based on a set of locations having respective depth values in the 3D image information that do not deviate from each other by more than a defined measurement variance threshold. The defined measurement variance threshold may account for the effects of imaging noise, manufacturing tolerances, or other factors that may cause random variations in the depth measurements of the 3D image information. The identified surface may be associated with a depth value that is the average of the respective depth values. In some implementations, a candidate edge may be detected in the 3D image information based on identifying a depth transition between two surfaces identified in the 3D image information that exceeds a defined depth difference threshold.

図１Ａは、一つまたは複数の物体の物理的エッジを検出またはその他の方法で識別するために、一つまたは複数の物体を表す画像情報を使用することを伴い得る、物理的エッジ検出を実行または促進するためのシステム１０００を示す。より詳細には、システム１０００は、計算システム１１００およびカメラ１２００を含み得る。この実施例では、カメラ１２００は、カメラ１２００が位置する環境を描写するか、もしくはそうでなければ表し、またはより具体的には、カメラ１２００の視野（カメラ視野とも呼ぶ）中の環境を表す、画像情報を生成するように構成され得る。環境は、例えば、倉庫、製造工場、小売空間、またはいくつかの他の施設であり得る。こうした実例では、画像情報が、箱、ビン、ケース、木枠または他の容器などの、こうした施設に位置する物体を表し得る。システム１０００は、以下でより詳細に論じるように、画像情報を使用して、カメラ視野内の個々の物体を区別すること、画像情報に基づいて物体認識または物体登録を行うこと、および／または画像情報に基づいてロボット運動計画を行うことなど、画像情報を生成、受信、および／または処理するよう構成され得る（用語「および／または」および「または」は、本開示では互換的に使用される）。ロボット運動計画作成は、例えば、ロボットと容器または他の物体との間のロボット相互作用を促進するように、施設でロボットを制御するために使用され得る。計算システム１１００およびカメラ１２００が、同じ施設に位置してもよく、または互いと遠隔に位置し得る。例えば、計算システム１１００は、倉庫または小売空間から遠隔のデータセンターでホストされる、クラウドコンピューティングプラットフォームの一部であってもよく、ネットワーク接続を介して、カメラ１２００と通信し得る。 FIG. 1A illustrates a system 1000 for performing or facilitating physical edge detection, which may involve using image information representative of one or more objects to detect or otherwise identify the physical edges of the one or more objects. More specifically, system 1000 may include a computing system 1100 and a camera 1200. In this example, camera 1200 may be configured to generate image information that depicts or otherwise represents an environment in which camera 1200 is located, or more specifically, represents an environment in a field of view (also referred to as a camera field of view) of camera 1200. The environment may be, for example, a warehouse, a manufacturing plant, a retail space, or some other facility. In such an instance, the image information may represent objects located in such a facility, such as boxes, bins, cases, crates, or other containers. The system 1000 may be configured to generate, receive, and/or process image information, such as using the image information to distinguish between individual objects in the camera field of view, perform object recognition or object registration based on the image information, and/or perform robotic motion planning based on the image information, as discussed in more detail below (the terms "and/or" and "or" are used interchangeably in this disclosure). Robotic motion planning may be used to control a robot at a facility, for example, to facilitate robotic interaction between the robot and a container or other object. The computing system 1100 and the camera 1200 may be located at the same facility or may be located remotely from each other. For example, the computing system 1100 may be part of a cloud computing platform hosted at a data center remote from a warehouse or retail space and may communicate with the camera 1200 via a network connection.

実施形態では、カメラ１２００（画像感知装置とも呼ぶ）は、２Ｄカメラおよび／または３Ｄカメラであり得る。例えば、図１Ｂは、計算システム１１００、ならびにカメラ１２００Ａおよびカメラ１２００Ｂ（その両方がカメラ１２００の実施形態であり得る）を含む、システム１０００Ａ（システム１０００の実施形態であり得る）を示す。この実施例では、カメラ１２００Ａは、カメラの視野中にある環境の視覚的外観を記述する２Ｄ画像を含む、または形成する、２Ｄ画像情報を生成するように構成される、２Ｄカメラであり得る。カメラ１２００Ｂは、カメラの視野中の環境に関する空間構造情報を含む、または形成する３Ｄ画像情報を生成するように構成される、３Ｄカメラ（空間構造感知カメラまたは空間構造感知装置とも呼ばれる）であり得る。空間構造情報は、カメラ１２００の視野中にあるさまざまな物体の表面上の位置など、カメラ１２００Ｂに対するさまざまな位置のそれぞれの奥行き値を記述する、奥行き情報（例えば、奥行きマップ）を含んでもよい。カメラの視野または物体の表面のこれらの位置はまた、物理的位置と呼んでもよい。この実施例の奥行き情報は、物体が３次元（３Ｄ）空間の中で空間的にどのように配設されるかを推定するために使用され得る。一部の実例では、空間構造情報は、カメラ１２００Ｂの視野中にある物体の一つまたは複数の表面上の位置を記述する、点群を含んでもよく、またはそれを生成するために使用され得る。より具体的には、空間構造情報が、物体の構造（物体構造とも呼ぶ）上のさまざまな位置を記述し得る。 In an embodiment, the camera 1200 (also referred to as an image sensing device) may be a 2D camera and/or a 3D camera. For example, FIG. 1B shows a system 1000A (which may be an embodiment of the system 1000) including a computing system 1100, and a camera 1200A and a camera 1200B (both of which may be embodiments of the camera 1200). In this example, the camera 1200A may be a 2D camera configured to generate 2D image information that includes or forms a 2D image describing the visual appearance of the environment in the camera's field of view. The camera 1200B may be a 3D camera (also referred to as a spatial structure sensing camera or a spatial structure sensing device) configured to generate 3D image information that includes or forms spatial structure information about the environment in the camera's field of view. The spatial structure information may include depth information (e.g., a depth map) that describes the respective depth values of various positions relative to the camera 1200B, such as positions on the surface of various objects in the field of view of the camera 1200. These locations on the camera's field of view or the surface of the object may also be referred to as physical locations. The depth information of this example may be used to estimate how the object is spatially arranged in three-dimensional (3D) space. In some instances, the spatial structure information may include, or be used to generate, a point cloud that describes locations on one or more surfaces of an object in the field of view of camera 1200B. More specifically, the spatial structure information may describe various locations on the structure of the object (also referred to as the object structure).

実施形態では、システム１０００が、カメラ１２００の環境でロボットとさまざまな物体との間のロボット相互作用を促進するための、ロボット操作システムであり得る。例えば、図１Ｃは、図１Ａおよび図１Ｂのシステム１０００／１０００Ａの実施形態であり得る、ロボット操作システム１０００Ｂを示す。ロボット操作システム１０００Ｂは、計算システム１１００、カメラ１２００、およびロボット１３００を含んでもよい。上述のように、ロボット１３００は、カメラ１２００の環境の中にある一つまたは複数の物体、例えば、箱、木枠、ビン、またはその他の容器と相互作用するために使用され得る。例えば、ロボット１３００は、一つの位置から容器を拾い上げ、それらを別の位置に移動するように構成され得る。一部の事例では、ロボット１３００は、容器または他の物体のグループが降ろされて、例えば、コンベヤーベルトに移動される、パレットから降ろす動作を実施するために使用され得る。一部の実装形態では、カメラ１２００は、ロボット１３００のロボットアームなど、ロボット１３００に取り付けられてもよい。一部の実装形態では、カメラ１２００は、ロボット１３００から分離し得る。例えば、カメラ１２００は、倉庫または他の構造の天井に装着されてもよく、構造に対して静止したままであり得る。 In an embodiment, the system 1000 may be a robotic manipulation system for facilitating robotic interaction between the robot and various objects in the environment of the camera 1200. For example, FIG. 1C shows a robotic manipulation system 1000B, which may be an embodiment of the system 1000/1000A of FIGS. 1A and 1B. The robotic manipulation system 1000B may include a computing system 1100, a camera 1200, and a robot 1300. As described above, the robot 1300 may be used to interact with one or more objects, such as boxes, crates, bins, or other containers, in the environment of the camera 1200. For example, the robot 1300 may be configured to pick up containers from one location and move them to another location. In some cases, the robot 1300 may be used to perform an unpalletizing operation, where a group of containers or other objects are unloaded and moved, for example, to a conveyor belt. In some implementations, the camera 1200 may be attached to the robot 1300, such as to a robotic arm of the robot 1300. In some implementations, the camera 1200 may be separate from the robot 1300. For example, the camera 1200 may be mounted to the ceiling of a warehouse or other structure and may remain stationary relative to the structure.

実施形態では、図１Ａ～図１Ｃの計算システム１１００は、ロボット操作システム１０００Ｂの一部である、ロボット制御システム（ロボットコントローラーとも呼ぶ）を形成しても、またはその一部であり得る。ロボット制御システムは、例えば、ロボット１３００と容器または他の物体との間のロボット相互作用を制御するためのロボット相互作用移動コマンドなどの、ロボット１３００用のコマンドを生成するように構成されるシステムであり得る。こうした実施形態では、計算システム１１００は、例えば、カメラ１２００／１２００Ａ／１２００Ｂによって生成された画像情報に基づいて、このようなコマンドを生成するように構成され得る。例えば、計算システム１１００は、画像情報に基づいて運動計画を決定するように構成されてもよく、運動計画は、例えば、物体を掴むか、または他の方法でピックアップすることを意図し得る。計算システム１１００は、運動計画を実行するために、一つまたは複数のロボット相互作用移動コマンドを生成し得る。 In an embodiment, the computing system 1100 of FIGS. 1A-1C may form or be part of a robot control system (also referred to as a robot controller) that is part of the robot manipulation system 1000B. The robot control system may be a system configured to generate commands for the robot 1300, such as, for example, robot interaction move commands for controlling the robot interaction between the robot 1300 and a container or other object. In such an embodiment, the computing system 1100 may be configured to generate such commands based on, for example, image information generated by the cameras 1200/1200A/1200B. For example, the computing system 1100 may be configured to determine a motion plan based on the image information, where the motion plan may be, for example, intended to grasp or otherwise pick up an object. The computing system 1100 may generate one or more robot interaction move commands to execute the motion plan.

実施形態では、計算システム１１００は、視覚システムを形成しても、またはその一部であり得る。視覚システムは、例えば、ロボット１３００が位置する環境を記述する、すなわちより具体的には、カメラ１２００が位置する環境を記述する、視覚情報を生成するシステムであり得る。視覚情報が、上で考察された３Ｄ画像情報、および／または２Ｄ画像情報、またはいくつかの他の画像情報を含んでもよい。一部のシナリオでは、計算システム１１００が、視覚システムを形成する場合、視覚システムは、上で考察されたロボット制御システムの一部であってもよく、またはロボット制御システムから分離し得る。視覚システムは、ロボット制御システムから分離する場合、視覚システムは、ロボット１３００が位置する環境を記述する、情報を出力するように構成され得る。情報は、視覚システムからこうした情報を受信し、情報に基づいて、運動計画を実施し、および／またはロボット相互作用移動コマンドを生成することができる、ロボット制御システムに出力され得る。 In an embodiment, the computing system 1100 may form or be part of a vision system. The vision system may be, for example, a system that generates visual information describing the environment in which the robot 1300 is located, i.e., more specifically, the environment in which the camera 1200 is located. The visual information may include the 3D image information discussed above, and/or 2D image information, or some other image information. In some scenarios, when the computing system 1100 forms a vision system, the vision system may be part of the robot control system discussed above, or may be separate from the robot control system. When the vision system is separate from the robot control system, the vision system may be configured to output information describing the environment in which the robot 1300 is located. The information may be output to a robot control system that can receive such information from the vision system and, based on the information, perform motion planning and/or generate robot interaction movement commands.

実施形態では、計算システム１１００は、ＲＳ－２３２インターフェイス、ユニバーサルシリアルバス（ＵＳＢ）インターフェイスなどの専用有線通信インターフェイスを介して、および／もしくは周辺構成要素相互接続（ＰＣＩ）バスなどのローカルコンピューターバスを介して提供される接続など、直接接続によってカメラ１２００ならびに／またはロボット１３００と通信し得る。実施形態では、計算システム１１００が、ネットワークを介してカメラ１２００および／またはロボット１３００と通信し得る。ネットワークは、パーソナルエリアネットワーク（ＰＡＮ）、例えば、イントラネットといったローカルエリアネットワーク（ＬＡＮ）、メトロポリタンエリアネットワーク（ＭＡＮ）、ワイドエリアネットワーク（ＷＡＮ）、またはインターネットなど、いかなるタイプおよび／または形態のネットワークであり得る。ネットワークは、例えば、イーサネットプロトコル、インターネットプロトコル群（ＴＣＰ／ＩＰ）、ＡＴＭ（ＡｓｙｎｃｈｒｏｎｏｕｓＴｒａｎｓｆｅｒＭｏｄｅ）技術、ＳＯＮＥＴ（ＳｙｎｃｈｒｏｎｏｕｓＯｐｔｉｃａｌＮｅｔｗｏｒｋｉｎｇ）プロトコル、またはＳＤＨ（ＳｙｎｃｈｒｏｎｏｕｓＤｉｇｉｔａｌＨｉｅｒａｒｃｈｙ）プロトコルを含む、プロトコルの異なる技術、および層またはスタックを利用し得る。 In an embodiment, the computing system 1100 may communicate with the camera 1200 and/or the robot 1300 by a direct connection, such as a connection provided through a dedicated wired communication interface, such as an RS-232 interface, a universal serial bus (USB) interface, and/or a local computer bus, such as a peripheral component interconnect (PCI) bus. In an embodiment, the computing system 1100 may communicate with the camera 1200 and/or the robot 1300 over a network. The network may be any type and/or form of network, such as a personal area network (PAN), a local area network (LAN), e.g., an intranet, a metropolitan area network (MAN), a wide area network (WAN), or the Internet. The network may utilize different technologies and layers or stacks of protocols, including, for example, the Ethernet protocol, the Internet Protocol suite (TCP/IP), Asynchronous Transfer Mode (ATM) technology, Synchronous Optical Networking (SONET) protocol, or Synchronous Digital Hierarchy (SDH) protocol.

実施形態では、計算システム１１００は、カメラ１２００および／もしくはロボット１３００と直接情報を伝達してもよく、または中間記憶装置、もしくはより広くは、中間の非一時的コンピューター可読媒体を介して通信し得る。例えば、図１Ｄは、計算システム１１００の外部にあり得る非一時的コンピューター可読媒体１４００を含む、システム１０００／１０００Ａ／１０００Ｂの実施形態であってもよく、例えば、カメラ１２００によって生成される画像情報を記憶するための外部バッファまたはリポジトリとして作用し得る、システム１０００Ｃを示す。こうした一実施例では、計算システム１１００は、非一時的コンピューター可読媒体１４００から、画像情報を検索するか、さもなければ受信することができる。非一時的コンピューター可読媒体１４００の例としては、電子記憶装置、磁気記憶装置、光学記憶装置、電磁記憶装置、半導体記憶装置、またはそれらの任意の好適な組み合わせが挙げられる。非一時的コンピューター可読媒体は、例えば、コンピューターディスケット、ハードディスクドライブ（ＨＤＤ）、ソリッドステートドライブ（ＳＤＤ）、ランダムアクセスメモリー（ＲＡＭ）、読み出し専用メモリー（ＲＯＭ）、消却可能プログラム可能読み出し専用メモリー（ＥＰＲＯＭまたはフラッシュメモリー）、スタティックランダムアクセスメモリー（ＳＲＡＭ）、携帯型コンパクトディスク読み出し専用メモリー（ＣＤ－ＲＯＭ）、デジタル多目的ディスク（ＤＶＤ）、および／またはメモリースティックを形成し得る。 In an embodiment, the computing system 1100 may communicate directly with the camera 1200 and/or the robot 1300, or may communicate via an intermediate storage device, or more broadly, an intermediate non-transitory computer-readable medium. For example, FIG. 1D illustrates a system 1000C, which may be an embodiment of the system 1000/1000A/1000B, including a non-transitory computer-readable medium 1400 that may be external to the computing system 1100, and may act as an external buffer or repository for storing image information generated by the camera 1200, for example. In one such example, the computing system 1100 may retrieve or otherwise receive image information from the non-transitory computer-readable medium 1400. Examples of the non-transitory computer-readable medium 1400 include electronic storage devices, magnetic storage devices, optical storage devices, electromagnetic storage devices, semiconductor storage devices, or any suitable combination thereof. The non-transitory computer readable medium may form, for example, a computer diskette, a hard disk drive (HDD), a solid state drive (SDD), a random access memory (RAM), a read only memory (ROM), an erasable programmable read only memory (EPROM or flash memory), a static random access memory (SRAM), a portable compact disc read only memory (CD-ROM), a digital versatile disc (DVD), and/or a memory stick.

上述のように、カメラ１２００は、３Ｄカメラおよび／または２Ｄカメラであり得る。２Ｄカメラは、カラー画像またはグレースケール画像などの、２Ｄ画像を生成するように構成され得る。３Ｄカメラは、例えば、飛行時間（ＴＯＦ）カメラもしくは構造化光カメラなどの、奥行き感知カメラ、またはいかなる他のタイプの３Ｄカメラであり得る。一部の事例では、２Ｄカメラおよび／または３Ｄカメラは、電荷結合素子（ＣＣＤ）センサーおよび／または相補型金属酸化膜半導体（ＣＭＯＳ）センサーなど、イメージセンサーを含み得る。実施形態では、３Ｄカメラは、レーザー、ＬＩＤＡＲデバイス、赤外線デバイス、明／暗センサー、運動センサー、マイクロ波検出器、超音波検出器、レーダー検出器、または奥行き情報、または空間構造情報を取り込むように構成される任意の他のデバイスを含み得る。 As described above, the camera 1200 may be a 3D camera and/or a 2D camera. The 2D camera may be configured to generate a 2D image, such as a color image or a grayscale image. The 3D camera may be a depth-sensing camera, such as a time-of-flight (TOF) camera or a structured light camera, or any other type of 3D camera. In some cases, the 2D camera and/or the 3D camera may include an image sensor, such as a charge-coupled device (CCD) sensor and/or a complementary metal-oxide semiconductor (CMOS) sensor. In an embodiment, the 3D camera may include a laser, a LIDAR device, an infrared device, a light/dark sensor, a motion sensor, a microwave detector, an ultrasonic detector, a radar detector, or any other device configured to capture depth information or spatial structure information.

上述のように、画像情報が、計算システム１１００によって処理され得る。実施形態では、計算システム１１００は、サーバー（例えば、一つまたは複数のサーバーブレード、プロセッサーなどを有する）、パーソナルコンピューター（例えば、デスクトップコンピューター、ラップトップコンピューターなど）、スマートフォン、タブレットコンピューター装置、および／もしくは他の任意の他の計算システムを含んでもよく、またはそれらとして構成され得る。実施形態では、計算システム１１００の機能性の全ては、クラウドコンピューティングプラットフォームの一部として行われてもよい。計算システム１１００は、単一のコンピューター装置（例えば、デスクトップコンピューター）であってもよく、または複数のコンピューター装置を含んでもよい。 As described above, the image information may be processed by computing system 1100. In an embodiment, computing system 1100 may include or be configured as a server (e.g., having one or more server blades, processors, etc.), a personal computer (e.g., a desktop computer, a laptop computer, etc.), a smartphone, a tablet computing device, and/or any other computing system. In an embodiment, all of the functionality of computing system 1100 may be performed as part of a cloud computing platform. Computing system 1100 may be a single computing device (e.g., a desktop computer) or may include multiple computing devices.

図２Ａは、計算システム１１００の実施形態を示す、ブロック図を提供する。計算システム１１００は、少なくとも一つの処理回路１１１０、および非一時的コンピューター可読媒体（または複数の媒体）１１２０を含む。実施形態では、処理回路１１１０は、一つまたは複数のプロセッサー、一つまたは複数の処理コア、プログラマブルロジックコントローラー（「ＰＬＣ」）、特定用途向け集積回路（「ＡＳＩＣ」）、プログラマブルゲートアレイ（「ＰＧＡ」）、フィールドプログラマブルゲートアレイ（「ＦＰＧＡ」）、それらの任意の組み合わせ、または任意の他の処理回路を含む。 2A provides a block diagram illustrating an embodiment of a computing system 1100. The computing system 1100 includes at least one processing circuit 1110 and a non-transitory computer-readable medium (or media) 1120. In an embodiment, the processing circuit 1110 includes one or more processors, one or more processing cores, a programmable logic controller ("PLC"), an application specific integrated circuit ("ASIC"), a programmable gate array ("PGA"), a field programmable gate array ("FPGA"), any combination thereof, or any other processing circuit.

実施形態では、計算システム１１００の一部である、非一時的コンピューター可読媒体１１２０が、上で考察された中間の非一時的コンピューター可読媒体１４００の代替または追加であり得る。非一時的コンピューター可読媒体１１２０は、電子記憶装置、磁気記憶装置、光学記憶装置、電磁記憶装置、半導体記憶装置、またはそれらの任意の好適な組み合わせなどの記憶装置であってもよく、例えば、コンピューターディスケット、ハードディスクドライブ（ＨＤＤ）、ソリッドステートドライブ（ＳＳＤ）、ランダムアクセスメモリー（ＲＡＭ）、読み出し専用メモリー（ＲＯＭ）、消却可能プログラム可能読み出し専用メモリー（ＥＰＲＯＭまたはフラッシュメモリー）、スタティックランダムアクセスメモリー（ＳＲＡＭ）、携帯型コンパクトディスク読み出し専用メモリー（ＣＤ－ＲＯＭ）、デジタル多目的ディスク（ＤＶＤ）、メモリースティック、それらの任意の組み合わせ、または任意の他の記憶装置などであり得る。一部の実例では、非一時的コンピューター可読媒体１１２０は、複数の記憶装置を含み得る。特定の実装形態では、非一時的コンピューター可読媒体１１２０が、カメラ１２００によって生成され、計算システム１１００によって受信される画像情報を記憶するように構成される。一部の実例では、非一時的コンピューター可読媒体１１２０は、物体認識操作を実施するために使用される一つまたは複数の物体認識テンプレートを記憶し得る。非一時的コンピューター可読媒体１１２０が、処理回路１１１０によって実行されるとき、処理回路１１１０に、図４に関して記載する操作など、本明細書に記載する一つまたは複数の手法を行わせるコンピューター可読プログラム命令を、代替的または追加的に記憶し得る。 In an embodiment, the non-transitory computer readable medium 1120 that is part of the computing system 1100 may be an alternative or addition to the intermediate non-transitory computer readable medium 1400 discussed above. The non-transitory computer readable medium 1120 may be a storage device such as an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination thereof, such as a computer diskette, a hard disk drive (HDD), a solid state drive (SSD), a random access memory (RAM), a read only memory (ROM), an erasable programmable read only memory (EPROM or flash memory), a static random access memory (SRAM), a portable compact disk read only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, any combination thereof, or any other storage device. In some instances, the non-transitory computer readable medium 1120 may include multiple storage devices. In certain implementations, the non-transitory computer-readable medium 1120 is configured to store image information generated by the camera 1200 and received by the computing system 1100. In some instances, the non-transitory computer-readable medium 1120 may store one or more object recognition templates used to perform object recognition operations. The non-transitory computer-readable medium 1120 may alternatively or additionally store computer-readable program instructions that, when executed by the processing circuit 1110, cause the processing circuit 1110 to perform one or more techniques described herein, such as the operations described with respect to FIG. 4.

図２Ｂは、計算システム１１００の実施形態であり、通信インターフェイス１１３０を含む、計算システム１１００Ａを描写する。通信インターフェイス１１３０は、例えば、図１Ａ～１Ｄのカメラ１２００によって生成された画像情報を受信するように構成され得る。画像情報は、上で考察された中間の非一時的コンピューター可読媒体１４００もしくはネットワークを介して、またはカメラ１２００と計算システム１１００／１１００Ａとの間のより直接的な接続を介して受信され得る。実施形態では、通信インターフェイス１１３０は、図１Ｃのロボット１３００と通信するように構成され得る。計算システム１１００が、ロボット制御システムの外部にある場合、計算システム１１００の通信インターフェイス１１３０が、ロボット制御システムと通信するように構成され得る。通信インターフェイス１１３０はまた、通信構成要素または通信回路と呼ばれる場合があり例えば、有線または無線プロトコルによって通信を行うように構成される通信回路を含んでもよい。実施例として、通信回路が、ＲＳ－２３２ポートコントローラー、ＵＳＢコントローラー、イーサネットコントローラー、Ｂｌｕｅｔｏｏｔｈ（登録商標）コントローラー、ＰＣＩバスコントローラー、任意の他の通信回路、またはそれらの組み合わせを含んでもよい。 FIG. 2B depicts an embodiment of the computing system 1100, the computing system 1100A, including a communication interface 1130. The communication interface 1130 may be configured to receive image information generated, for example, by the camera 1200 of FIGS. 1A-1D. The image information may be received via an intermediate non-transitory computer-readable medium 1400 or network discussed above, or via a more direct connection between the camera 1200 and the computing system 1100/1100A. In an embodiment, the communication interface 1130 may be configured to communicate with the robot 1300 of FIG. 1C. If the computing system 1100 is external to the robot control system, the communication interface 1130 of the computing system 1100 may be configured to communicate with the robot control system. The communication interface 1130 may also be referred to as a communication component or communication circuitry, and may include, for example, communication circuitry configured to communicate via a wired or wireless protocol. As an example, the communications circuitry may include an RS-232 port controller, a USB controller, an Ethernet controller, a Bluetooth (registered trademark) controller, a PCI bus controller, any other communications circuitry, or a combination thereof.

実施形態では、図２Ｃでは、非一時的コンピューター可読媒体１１２０は、カメラ１２００によって生成された画像情報から識別された複数の候補エッジを記述し得る、エッジ検出情報１１２６を記憶し得る。以下でより詳細に論じるように、画像情報が物体のグループを表す場合、候補エッジの各々は、物体のグループの複数の物理的エッジのうちの少なくとも一つを表すための候補であってもよく、または候補を形成し得る。一部の実例では、計算システム１１００／１１００Ａ／１１００Ｂは、エッジ検出情報１１２６内の特定の候補エッジを使用して、物体のグループの物理的エッジの少なくとも一つを表すべきかどうかを判定し得る。こうした判定は、候補エッジが、偽エッジであるのとは対照的に、物理的エッジを実際に表しているかどうかに関連する信頼レベルを評価することを伴い得る。一実施例では、こうした評価は、候補エッジが、物理的エッジを表すことから生じる画像特性と関連付けられるかどうかに基づいてもよい。こうした特徴は、以下でより詳細に論じる、ダークプライアと称される画像特徴と関連付けられてもよい。一部のシナリオでは、計算システム１１００は、複数の候補エッジから、物体のグループの物理的エッジを実際に表す、十分に高い信頼レベルを有する候補エッジのサブセットを選択してもよく、一方で、サブセットから除外される候補エッジは、物体のグループの物理的エッジを表す、十分に高い信頼レベルを有しなくてもよい。従って、計算システム１１００／１１００Ａ／１１００Ｂが、物理的エッジの少なくとも一つを表すために特定の候補エッジを使用することを決定した場合、計算システムは、そのサブセットに候補エッジを含み得る。計算システム１１００／１１００Ａ／１１００Ｂが、物理的エッジの少なくとも一つを表すために特定の候補エッジを使用しないと決定する場合、計算システムは、サブセット内に候補エッジを含まないと決定し得る。サブセットに含まれない候補エッジは、エッジ検出情報１１２６から除去されてもよく、またはより広くは、物体のグループの物理的エッジの少なくとも一つを表すための候補としてさらなる検討から除外され得る。 In an embodiment, in FIG. 2C, the non-transitory computer-readable medium 1120 may store edge detection information 1126 that may describe a plurality of candidate edges identified from the image information generated by the camera 1200. As discussed in more detail below, when the image information represents a group of objects, each of the candidate edges may be or form a candidate for representing at least one of a plurality of physical edges of the group of objects. In some instances, the computing system 1100/1100A/1100B may determine whether a particular candidate edge in the edge detection information 1126 should be used to represent at least one of the physical edges of the group of objects. Such a determination may involve evaluating a confidence level associated with whether the candidate edge actually represents a physical edge, as opposed to being a false edge. In one example, such an evaluation may be based on whether the candidate edge is associated with image characteristics that result from representing a physical edge. Such characteristics may be associated with an image feature referred to as a dark prior, which is discussed in more detail below. In some scenarios, the computing system 1100 may select a subset of candidate edges from the plurality of candidate edges that have a sufficiently high confidence level that they actually represent a physical edge of the group of objects, while the candidate edges excluded from the subset may not have a sufficiently high confidence level that they represent a physical edge of the group of objects. Thus, if the computing system 1100/1100A/1100B determines that a particular candidate edge is to be used to represent at least one of the physical edges, the computing system may include the candidate edge in the subset. If the computing system 1100/1100A/1100B determines that a particular candidate edge is not to be used to represent at least one of the physical edges, the computing system may determine not to include the candidate edge in the subset. Candidate edges that are not included in the subset may be removed from the edge detection information 1126, or more broadly, may be excluded from further consideration as candidates for representing at least one of the physical edges of the group of objects.

実施形態では、処理回路１１１０が、非一時的コンピューター可読媒体１１２０に記憶される、一つまたは複数のコンピューター可読プログラム命令によってプログラムされ得る。例えば、図２Ｄは、処理回路１１１０が、物理的エッジ検出モジュール１１２５、物体認識／登録モジュール１１２８、および／または運動計画モジュール１１２９を含む、一つまたは複数のモジュールによってプログラムされる、計算システム１１００／１１００Ａ／１１００Ｂの実施形態である、計算システム１１００Ｃを示す。 In an embodiment, the processing circuit 1110 may be programmed by one or more computer-readable program instructions stored on a non-transitory computer-readable medium 1120. For example, FIG. 2D illustrates computing system 1100C, an embodiment of computing system 1100/1100A/1100B, in which the processing circuit 1110 is programmed by one or more modules, including a physical edge detection module 1125, an object recognition/registration module 1128, and/or a motion planning module 1129.

実施形態では、物理的エッジ検出モジュール１１２５は、物体のグループを表す画像情報に現れる複数の候補エッジの中から、物体のグループの物理的エッジを表すためにどの候補エッジを使用するべきかを決定するように構成され得る。いくつかの実施では、物理的エッジ検出モジュール１１２５は、以下でより詳細に論じるように、定義された暗さ条件が満たされるかどうか、および／または奥行き不連続状態が満たされるかどうかに基づいて、こうした決定を実行し得る。一部の実例では、物理的エッジ検出モジュール１１２５はまた、画像情報から複数の候補エッジを識別するように構成され得る。一部の実例では、物理的エッジ検出モジュール１１２５は、画像情報によって表される個々の物体を区別することを伴い得る、画像セグメンテーション（例えば、点群セグメンテーション）を行うように構成され得る。例えば、モジュール１１２５は、物体のグループの一つの物体を表す画像情報の画像セグメント（画像部分とも呼ぶ）を抽出するか、またはその他の方法で識別し得る。いくつかの実施では、画像セグメンテーションは、例えば、モジュール１１２５が物体のグループの物理的エッジを表すために使用すべきであると決定した候補エッジに基づいて行われてもよい。 In an embodiment, the physical edge detection module 1125 may be configured to determine which candidate edge should be used to represent the physical edge of the group of objects from among multiple candidate edges appearing in the image information representing the group of objects. In some implementations, the physical edge detection module 1125 may perform such a determination based on whether a defined darkness condition is met and/or whether a depth discontinuity condition is met, as discussed in more detail below. In some instances, the physical edge detection module 1125 may also be configured to identify multiple candidate edges from the image information. In some instances, the physical edge detection module 1125 may be configured to perform image segmentation (e.g., point cloud segmentation), which may involve distinguishing individual objects represented by the image information. For example, the module 1125 may extract or otherwise identify an image segment (also referred to as an image portion) of the image information representing one object of the group of objects. In some implementations, the image segmentation may be performed based on, for example, the candidate edge that the module 1125 has determined should be used to represent the physical edge of the group of objects.

実施形態では、物体認識／登録モジュール１１２８は、物理的エッジ検出モジュール１１２５からの結果に基づいて、物体認識操作または物体登録モジュールを実行するように構成され得る。例えば、物理的エッジ検出モジュール１１２５が、物体のグループの一つの物体を表す画像セグメントを識別する場合、物体認識／登録モジュール１１２８は、例えば、画像セグメントが物体認識テンプレートと十分に合致するかを判定し、および／または画像セグメントに基づいて新しい物体認識テンプレートを生成するように構成され得る。 In an embodiment, object recognition/registration module 1128 may be configured to perform an object recognition operation or object registration based on results from physical edge detection module 1125. For example, if physical edge detection module 1125 identifies an image segment that represents an object of a group of objects, object recognition/registration module 1128 may be configured, for example, to determine whether the image segment sufficiently matches an object recognition template and/or to generate a new object recognition template based on the image segment.

実施形態では、運動計画モジュール１１２９は、物理的エッジ検出モジュール１１２５の結果に基づいて、および／または物体認識／登録モジュール１１２８の結果に基づいて、ロボット運動計画を実行するように構成され得る。上述のように、ロボット運動計画は、ロボット（例えば、１３００）と、物体のグループの少なくとも一つの物体との間のロボット相互作用のためのものであり得る。一部の実例では、ロボット運動計画は、例えば、物体をピックアップするためのロボットの構成要素（例えば、エンドエフェクター装置）による動き、および／または物体をピックアップした後に後続する構成要素の軌道の決定を伴い得る。 In an embodiment, the motion planning module 1129 may be configured to perform robot motion planning based on the results of the physical edge detection module 1125 and/or based on the results of the object recognition/registration module 1128. As described above, the robot motion planning may be for a robotic interaction between a robot (e.g., 1300) and at least one object of a group of objects. In some instances, the robot motion planning may involve, for example, a movement by a component of the robot (e.g., an end effector device) to pick up an object and/or a determination of a trajectory of a subsequent component after picking up an object.

さまざまな実施形態では、「コンピューター可読命令」および「コンピューター可読プログラム命令」という用語は、さまざまなタスクおよび操作を遂行するように構成される、ソフトウェア命令またはコンピューターコードを記述するために使用される。さまざまな実施形態では、「モジュール」という用語は、処理回路１１１０に一つまたは複数の機能タスクを行わせるように構成される、ソフトウェア命令またはコードの集まりを広く指す。モジュールおよびコンピューター可読命令は、処理回路または他のハードウェアコンポーネントが、モジュールもしくはコンピューター可読命令を実行しているときに、さまざまな操作またはタスクを行うものとして説明され得る。 In various embodiments, the terms "computer-readable instructions" and "computer-readable program instructions" are used to describe software instructions or computer code configured to perform various tasks and operations. In various embodiments, the term "module" refers broadly to a collection of software instructions or code configured to cause the processing circuitry 1110 to perform one or more functional tasks. The modules and computer-readable instructions may be described as performing various operations or tasks when the processing circuitry or other hardware component is executing the module or computer-readable instructions.

図３Ａ～３Ｃは、候補エッジの処理、すなわちより具体的には、物理的エッジ検出が実行され得る、例示的な環境を示す。より具体的には、図３Ａは、計算システム１１００、ロボット３３００、およびカメラ３２００を含む、システム３０００（図１Ａ～図１Ｄのシステム１０００／１０００Ａ／１０００Ｂ／１０００Ｃの実施形態であり得る）を描写する。カメラ３２００は、カメラ１２００の実施形態であってもよく、カメラ３２００のカメラ視野３２１０内の情景を表す、またはより具体的には、物体３５１０、３５２０、３５３０、３５４０、および３５５０などのカメラ視野３２１０内の物体を表す、画像情報を生成するように構成され得る。一実施例では、物体３５１０～３５４０のおのおのは、例えば、箱または木枠などの容器であってもよく、一方で、物体３５５０は、例えば、容器が配置されるパレットであり得る。 3A-3C show an exemplary environment in which the processing of candidate edges, or more specifically, physical edge detection, may be performed. More specifically, FIG. 3A depicts a system 3000 (which may be an embodiment of systems 1000/1000A/1000B/1000C of FIGS. 1A-1D) including a computing system 1100, a robot 3300, and a camera 3200. Camera 3200 may be an embodiment of camera 1200 and may be configured to generate image information representing a scene within a camera field of view 3210 of camera 3200, or more specifically, representing objects within camera field of view 3210, such as objects 3510, 3520, 3530, 3540, and 3550. In one example, each of objects 3510-3540 may be a container, such as, for example, a box or crate, while object 3550 may be, for example, a pallet on which the container is placed.

物体３５１０～３５４０が、より具体的には、物体の物理的エッジを示す図３Ｂに示される。より具体的には、図は、物体３５１０の上部表面の物理的エッジ部３５１０Ａ～３５１０Ｄ、物体３５２０の上部表面の物理的エッジ部３５２０Ａ～３５２０Ｄ、物体３５３０の上部表面の物理的エッジ部３５３０Ａ～３５３０Ｄ、および物体３５４０の上部表面の物理的エッジ部３５４０Ａ～３５４０Ｄを示す。図３Ｂの物理的エッジ（例えば、３５１０Ａ～３５１０Ｄ、３５２０Ａ～３５２０Ｄ、３５３０Ａ～３５３０Ｄ、および３５４０Ａ～３５４０Ｄ）は、物体３５１０～３５４０のそれぞれの上部表面の外縁であり得る。一部の実例では、物体の表面の物理的エッジ（例えば、３５１０Ａ～３５１０Ｄ）は、表面の輪郭を画定し得る。物体が、複数の非同一平面表面（複数の面とも呼ばれる）を有する多面体（例えば、立方体）を形成する場合、一つの表面の物理的エッジは、表面が物体の別の表面と交わる境界を形成し得る。 3B, which more specifically shows the physical edges of the objects. More specifically, the figure shows physical edges 3510A-3510D of the top surface of object 3510, physical edges 3520A-3520D of the top surface of object 3520, physical edges 3530A-3530D of the top surface of object 3530, and physical edges 3540A-3540D of the top surface of object 3540. The physical edges of FIG. 3B (e.g., 3510A-3510D, 3520A-3520D, 3530A-3530D, and 3540A-3540D) may be the outer edges of the top surfaces of each of objects 3510-3540. In some instances, the physical edges of the object's surfaces (e.g., 3510A-3510D) may define the contours of the surface. If the object forms a polyhedron (e.g., a cube) with multiple non-coplanar surfaces (also called faces), the physical edge of one surface may form the boundary where the surface meets another surface of the object.

実施形態では、カメラ視野内の物体は、物体の外側表面上に、視覚的マーキングなどの視覚的詳細（可視詳細とも呼ぶ）を有し得る。例えば、図３Ａおよび３Ｂでは、物体３５１０、３５２０、３５３０、３５４０は、物体３５１０～３５４０のそれぞれの外側表面（例えば、上部表面）上に、それぞれ印刷されるか、またはそうでなければ配置される視覚的マーキング３５１２、３５２２、３５３２、３５４２を有し得る。実施例として、視覚的マーキングは、可視線（例えば、直線または曲線）、多角形、視覚的パターン、または他の視覚的マーキングなどの可視形状を含み得る。一部のシナリオでは、視覚的マーキング（例えば、可視線）は、物体の外側表面上に表示されるシンボルまたは図面を形成してもよく、またはその一部であり得る。記号は、例えば、ロゴまたは文字（例えば、英数字）を含み得る。一部のシナリオでは、容器の外側表面または他の物体上の視覚的詳細は、容器の外側表面上に配置される材料の層（例えば、包装テープのストリップまたは郵送用ラベルのシート）の輪郭によって形成され得る。 In an embodiment, an object within the camera field of view may have visual details, such as visual markings (also referred to as visible details), on the outer surface of the object. For example, in FIGS. 3A and 3B, objects 3510, 3520, 3530, 3540 may have visual markings 3512, 3522, 3532, 3542, respectively, printed or otherwise disposed on the outer surface (e.g., top surface) of each of objects 3510-3540. As examples, the visual markings may include visible shapes, such as visible lines (e.g., straight or curved lines), polygons, visual patterns, or other visual markings. In some scenarios, the visual markings (e.g., visible lines) may form or be part of a symbol or drawing displayed on the outer surface of the object. Symbols may include, for example, logos or characters (e.g., alphanumeric characters). In some scenarios, visual details on the outer surface of a container or other object may be formed by the contour of a layer of material (e.g., a strip of packaging tape or a sheet of mailing labels) disposed on the outer surface of the container.

実施形態では、図３Ａのシステム３０００は、光源３６００などの一つまたは複数の光源を含み得る。光源３６００は、例えば、発光ダイオード（ＬＥＤ）、ハロゲンランプ、または任意の他の光源であってもよく、可視光、赤外線、または物体３５１０～３５５０の表面に向かって任意の他の形態の光を放射するように構成され得る。実施形態によっては、計算システム１１００は、光源３６００と通信して、光源３６００が起動されたときを制御するように構成され得る。他の実装では、光源３６００は、計算システム１１００とは独立して動作し得る。 In an embodiment, the system 3000 of FIG. 3A may include one or more light sources, such as light source 3600. Light source 3600 may be, for example, a light emitting diode (LED), a halogen lamp, or any other light source, and may be configured to emit visible light, infrared light, or any other form of light toward the surface of objects 3510-3550. In some embodiments, computing system 1100 may be configured to communicate with light source 3600 to control when light source 3600 is activated. In other implementations, light source 3600 may operate independently of computing system 1100.

実施形態では、図３Ｃに示すように、システム３０００は、カメラ視野３２１０Ａを有するカメラ３２００Ａ（カメラ１２００Ａの実施形態であり得る）を含み、カメラ視野３２１０Ｂを有するカメラ３２００Ｂ（カメラ１２００Ｂの実施形態であり得る）を含む、複数のカメラを含んでもよい。カメラ３２００Ａは、例えば、２Ｄ画像または他の２Ｄ画像情報を生成するように構成される、２Ｄカメラであってもよく、一方で、カメラ３２００Ｂは、例えば、３Ｄ画像情報を生成するように構成される、３Ｄカメラであり得る。２Ｄ画像（例えば、カラー画像またはグレースケール画像）は、カメラ視野３２１０／３２１０Ａにおける、物体３５１０～３５５０などの一つまたは複数の物体の外観を記述し得る。例えば、２Ｄ画像は、物体３５１０～３５４０の外側表面（例えば、上部表面）上に配置される視覚的マーキング３５１２～３５４２、および／またはそれらの外側表面の輪郭などの視覚的詳細を捕捉してもよく、または他の方法で、表し得る。実施形態では、３Ｄ画像情報は、物体３５１０～３５５０のうちの一つまたは複数の構造を記述してもよく、物体の構造は、物体の構造または物体の物理的構造とも呼ばれ得る。例えば、３Ｄ画像情報は、奥行きマップを含んでもよく、より一般的には、カメラ３２００Ｂに対する、またはいくつかの他の基準点に対する、カメラ視野３２１０／３２１０Ｂのさまざまな位置のそれぞれの奥行き値を記述し得る、奥行き情報を含んでもよい。それぞれの奥行き値に対応する位置は、物体３５１０～３５５０のそれぞれの上部表面上の位置などの、カメラ視野３２１０／３２１０Ｂのさまざまな表面上の位置（物理的位置とも称する）であり得る。一部の実例では、３Ｄ画像情報は、物体３５１０～３５５０、またはカメラ視野３２１０／３２１０Ｂ内のいくつかの他の物体の一つまたは複数の外側表面上のさまざまな位置を記述する、複数の３Ｄ座標を含み得る、点群を含み得る。 In an embodiment, as shown in FIG. 3C, system 3000 may include multiple cameras, including camera 3200A (which may be an embodiment of camera 1200A) having camera field of view 3210A, and camera 3200B (which may be an embodiment of camera 1200B) having camera field of view 3210B. Camera 3200A may be, for example, a 2D camera configured to generate 2D images or other 2D image information, while camera 3200B may be, for example, a 3D camera configured to generate 3D image information. The 2D images (e.g., color or grayscale images) may describe the appearance of one or more objects, such as objects 3510-3550, in camera field of view 3210/3210A. For example, the 2D images may capture or otherwise represent visual details, such as visual markings 3512-3542 disposed on the outer surfaces (e.g., top surfaces) of objects 3510-3540 and/or the contours of their outer surfaces. In an embodiment, the 3D image information may describe the structure of one or more of the objects 3510-3550, which may also be referred to as the structure of the object or the physical structure of the object. For example, the 3D image information may include a depth map, or more generally, depth information that may describe respective depth values of various positions of the camera field of view 3210/3210B relative to the camera 3200B or relative to some other reference point. The positions corresponding to the respective depth values may be positions (also referred to as physical positions) on various surfaces of the camera field of view 3210/3210B, such as positions on the top surface of each of the objects 3510-3550. In some instances, the 3D image information may include a point cloud, which may include a plurality of 3D coordinates that describe various positions on one or more outer surfaces of the objects 3510-3550, or some other object within the camera field of view 3210/3210B.

図３Ａおよび図３Ｂの実施例では、ロボット３３００（ロボット１３００の実施形態であり得る）は、ロボット基部３３１０に取り付けられる一端を有し、かつロボットグリッパなどのエンドエフェクター装置３３３０に取り付けられるか、またはそれらによって形成される別の端を有する、ロボットアーム３３２０を含み得る。ロボット基部３３１０は、ロボットアーム３３２０を装着するために使用され得るが、ロボットアーム３３２０、より具体的には、エンドエフェクター装置３３３０は、ロボット３３００の環境で一つまたは複数の物体（例えば、３５１０／３５２０／３５３０／３５４０）と相互作用するために使用され得る。相互作用（ロボット相互作用とも呼ぶ）は、例えば、物体３５１０～３５４０の少なくとも一つをつかむか、または他の方法でピックアップすることを含み得る。例えば、ロボット相互作用は、ロボット３３００が、物体３５５０（例えば、パレットまたは他のプラットフォーム）から物体３５１０～３５４０（例えば、箱）をピックアップし、および物体３５１０～３５４０を目的地の位置に動かすために使用される、パレットから降ろす動作の一部であり得る。 In the example of Figures 3A and 3B, a robot 3300 (which may be an embodiment of robot 1300) may include a robot arm 3320 having one end attached to a robot base 3310 and another end attached to or formed by an end effector device 3330, such as a robot gripper. The robot base 3310 may be used to mount the robot arm 3320, while the robot arm 3320, and more specifically, the end effector device 3330, may be used to interact with one or more objects (e.g., 3510/3520/3530/3540) in the environment of the robot 3300. The interaction (also referred to as robot interaction) may include, for example, grasping or otherwise picking up at least one of the objects 3510-3540. For example, the robot interaction may be part of an unpalletizing operation in which the robot 3300 picks up objects 3510-3540 (e.g., boxes) from an object 3550 (e.g., a pallet or other platform) and is used to move the objects 3510-3540 to a destination location.

上で論じたように、本開示の一態様は、一つまたは複数の物体を表す画像情報に基づいて、箱のグループなど、物体のグループの一つまたは複数の物理的エッジの検出を実施することまたは促進することに関する。図４は、物理的エッジ検出を実施または促進するための、あるいはより具体的には、候補エッジを使用して、物体のグループの物理的エッジの少なくとも一つを表すべきかどうかを判定するための、例示的な方法４０００のフロー図を示す。より具体的には、本方法は、物理的エッジを表すことができる候補エッジを有するか、または偽エッジであり得る画像情報を受信することを伴い得る。偽エッジは、例えば、物体のグループのうちの一つの表面上に表示される可視線または他の視覚的マーキングを表す候補エッジであり得る。視覚的マーキングは、物理的エッジに似ているが、実際にはいかなる物理的エッジにも対応しない外観を有し得る。従って、方法４０００は、実施形態では、候補エッジが実際の物理的エッジに対応するかどうか、または候補エッジが偽エッジである可能性が高いかどうかについての信頼レベルまたは可能性を評価するために使用され得る。候補エッジが偽エッジである可能性が高い場合、および／または実際の物理的エッジに対応する十分な高い信頼レベルを持たない場合、方法４０００は、実施形態では、物体のグループの任意の物理的エッジを表すためのさらなる検討から候補エッジを除去するまたはより広くは除外し得る。 As discussed above, one aspect of the disclosure relates to performing or facilitating detection of one or more physical edges of a group of objects, such as a group of boxes, based on image information representing one or more objects. FIG. 4 illustrates a flow diagram of an example method 4000 for performing or facilitating physical edge detection, or more specifically, for determining whether a candidate edge should be used to represent at least one of the physical edges of the group of objects. More specifically, the method may involve receiving image information having a candidate edge that may represent a physical edge or may be a false edge. A false edge may be, for example, a candidate edge that represents a visible line or other visual marking displayed on a surface of one of the group of objects. The visual marking may have an appearance that resembles a physical edge but does not actually correspond to any physical edge. Thus, the method 4000 may be used in embodiments to evaluate a confidence level or likelihood as to whether a candidate edge corresponds to an actual physical edge, or whether the candidate edge is likely to be a false edge. If a candidate edge is likely to be a false edge and/or does not have a high enough confidence level that it corresponds to a real physical edge, the method 4000, in an embodiment, may remove or more broadly exclude the candidate edge from further consideration for representing any physical edges of the group of objects.

実施形態では、方法４０００は、例えば、図２Ａ～図２Ｄ、または図３Ａまたは３Ｃの計算システム１１００によって、またはより具体的には、計算システム１１００の少なくとも一つの処理回路１１１０によって行われてもよい。一部のシナリオでは、少なくとも一つの処理回路１１００が、非一時的コンピューター可読媒体（例えば、１１２０）上に記憶される命令を実行することによって、方法４０００を行ってもよい。例えば、命令によって、処理回路１１１０に、方法４０００を行い得る、図２Ｄに示されたモジュールのうちの一つまたは複数を実行させてもよい。実施例として、以下で論じるステップ４００２～４００８のうちの一つまたは複数は、物理的エッジ検出モジュール１１２５によって行われてもよい。方法４０００が、物体認識および／または物体登録を行うステップを含む場合、ステップは、例えば、物体認識／登録モジュール１１２８によって行われてもよい。方法４０００が、ロボット相互作用を計画すること、またはロボット相互作用移動コマンドを生成することを伴う場合、こうしたステップは、例えば、運動計画モジュール１１２９によって行われてもよい。実施形態では、方法４０００は、計算システム１１００が、図３Ａならびに図３Ｃのロボット３３００およびカメラ３２００／３２００Ａ／３２００Ｂなど、ロボットおよびカメラと、または本開示で論じる任意の他のカメラもしくはロボットと通信している環境で行われてもよい。図３Ａおよび図３Ｃに示されるような一部のシナリオでは、カメラ（例えば、３２００）は、静止構造（例えば、部屋の天井）に装着され得る。他のシナリオでは、カメラは、ロボットアーム（例えば、３３２０）、またはより具体的には、ロボット（例えば、３３００）のエンドエフェクター装置（例えば、３３３０）上に装着され得る。 In an embodiment, the method 4000 may be performed, for example, by the computing system 1100 of FIG. 2A-FIG. 2D or FIG. 3A or 3C, or more specifically, by at least one processing circuit 1110 of the computing system 1100. In some scenarios, at least one processing circuit 1100 may perform the method 4000 by executing instructions stored on a non-transitory computer-readable medium (e.g., 1120). For example, the instructions may cause the processing circuit 1110 to perform one or more of the modules shown in FIG. 2D, which may perform the method 4000. As an example, one or more of steps 4002-4008 discussed below may be performed by the physical edge detection module 1125. If the method 4000 includes steps of performing object recognition and/or object registration, the steps may be performed, for example, by the object recognition/registration module 1128. Where the method 4000 involves planning robot interactions or generating robot interaction movement commands, such steps may be performed, for example, by the motion planning module 1129. In an embodiment, the method 4000 may be performed in an environment in which the computing system 1100 is in communication with a robot and camera, such as the robot 3300 and camera 3200/3200A/3200B of FIG. 3A and FIG. 3C, or with any other camera or robot discussed in this disclosure. In some scenarios, such as those shown in FIG. 3A and FIG. 3C, the camera (e.g., 3200) may be mounted on a stationary structure (e.g., the ceiling of a room). In other scenarios, the camera may be mounted on a robot arm (e.g., 3320), or more specifically, on an end effector device (e.g., 3330) of the robot (e.g., 3300).

実施形態では、方法４０００の一つまたは複数のステップは、物体のグループ（例えば、３５１０～３５５０）が、現在、カメラ（例えば、３２００／３２００Ａ／３２００Ｂ）のカメラ視野（例えば、３２１０／３２１０Ａ／３２１０Ｂ）にあるときに行われてもよい。例えば、方法４０００の一つまたは複数のステップは、物体のグループがカメラ視野（例えば、３２１０／３２１０Ａ／３２１０Ｂ）内に入った直後に、またはより一般的には、物体のグループがカメラ視野内にある間に、行われてもよい。一部のシナリオでは、方法４０００の一つまたは複数のステップは、物体のグループがカメラ視野内にあるときに行われてもよい。例えば、物体のグループがカメラ視野（例えば、３２１０／３２１０Ａ／３２１０Ｂ）にあるとき、カメラ（例えば、３２００／３２００Ａ／３２００Ｂ）は、物体のグループを表す画像情報を生成してもよく、画像情報を計算システム（例えば、１１００）に通信し得る。計算システムは、物体のグループがまだカメラ視野内にある間、または物体のグループがもはやカメラ視野内にないときでさえも、画像情報に基づいて方法４０００の一つまたは複数のステップを実施し得る。 In an embodiment, one or more steps of method 4000 may be performed when a group of objects (e.g., 3510-3550) is currently in the camera field of view (e.g., 3210/3210A/3210B) of a camera (e.g., 3200/3200A/3200B). For example, one or more steps of method 4000 may be performed immediately after the group of objects enters the camera field of view (e.g., 3210/3210A/3210B), or more generally, while the group of objects is in the camera field of view. In some scenarios, one or more steps of method 4000 may be performed while the group of objects is in the camera field of view. For example, when a group of objects is in a camera field of view (e.g., 3210/3210A/3210B), the camera (e.g., 3200/3200A/3200B) may generate image information representative of the group of objects and communicate the image information to a computing system (e.g., 1100). The computing system may perform one or more steps of method 4000 based on the image information while the group of objects is still in the camera field of view, or even when the group of objects is no longer in the camera field of view.

実施形態では、方法４０００は、計算システム１１００が、カメラ（例えば、３２００／３２００Ａ／３２００Ｂ）のカメラ視野（例えば、３２１０／３２１０Ａ／３２１０Ｂ）内の物体のグループを表す画像情報を受信する、ステップ４００２から開始されてもよく、またはそうでなければ、ステップ４００２を含んでもよい。画像情報は、物体のグループがカメラ視野にある（またはあった）ときに、カメラ（例えば、３２００／３２００Ａ／３２００Ｂ）によって生成されてもよく、および、例えば、２Ｄ画像情報および／または３Ｄ画像情報を含み得る。例えば、図５Ａは、２Ｄ画像情報、またはより具体的には、カメラ３２００／３２００Ａによって生成され、図３Ａおよび図３Ｃの物体３５１０～３５５０を表す、２Ｄ画像５６００を示す。より具体的には、２Ｄ画像５６００（例えば、グレースケール、またはカラー画像）は、カメラ３２００／３２００Ａの視点からの物体３５１０～３５５０の外観を記述し得る。実施形態では、２Ｄ画像５６００は、カラー画像の単一色チャネル（例えば、赤、緑、または青のチャネル）に対応し得る。カメラ３２００／３２００Ａが物体３５１０～３５５０の上方に配置される場合、２Ｄ画像５６００は、物体３５１０～３５５０のそれぞれの上部表面の外観を表し得る。図５Ａの実施例では、２Ｄ画像５６００は、物体３５１０～３５５０のそれぞれの表面（例えば、上部表面）を表す、それぞれの部分５６１０、５６２０、５６３０、５６４０、および５６５０（画像部分とも呼ぶ）を含み得る。図５Ａでは、２Ｄ画像５６００の画像部分５６１０～５６５０の各画像部分は、画像領域、すなわちより具体的には、ピクセル領域（画像がピクセルによって形成される場合）であり得る。より具体的には、画像領域は、画像の領域であってもよく、ピクセル領域は、ピクセルの領域であり得る。一つまたは複数の画像部分５６１０～５５５０は、物体の表面上に目に見えるか、または現れる、視覚的マーキングまたは他の視覚的詳細を捕捉するか、または他の方法で表し得る。例えば、画像部分５６１０は、物体３６１０の上部表面上に印刷または他の方法で配置され得る図３Ｂの視覚的マーキング３６１２を表し得る。 In an embodiment, method 4000 may begin with or otherwise include step 4002, in which computing system 1100 receives image information representing a group of objects within a camera field of view (e.g., 3210/3210A/3210B) of a camera (e.g., 3200/3200A/3200B). The image information may be generated by a camera (e.g., 3200/3200A/3200B) when the group of objects is (or was) in the camera field of view, and may include, for example, 2D image information and/or 3D image information. For example, FIG. 5A shows 2D image information, or more specifically, 2D image 5600 generated by camera 3200/3200A and representing objects 3510-3550 of FIGS. 3A and 3C. More specifically, the 2D image 5600 (e.g., a grayscale or color image) may describe the appearance of the objects 3510-3550 from the viewpoint of the camera 3200/3200A. In an embodiment, the 2D image 5600 may correspond to a single color channel (e.g., a red, green, or blue channel) of a color image. If the camera 3200/3200A is positioned above the objects 3510-3550, the 2D image 5600 may represent the appearance of the top surfaces of each of the objects 3510-3550. In the example of FIG. 5A, the 2D image 5600 may include respective portions 5610, 5620, 5630, 5640, and 5650 (also referred to as image portions) that represent the surfaces (e.g., top surfaces) of each of the objects 3510-3550. In FIG. 5A, each image portion of image portions 5610-5650 of 2D image 5600 may be an image region, or more specifically, a pixel region (where an image is formed by pixels). More specifically, an image region may be a region of an image, and a pixel region may be a region of pixels. One or more image portions 5610-5550 may capture or otherwise represent visual markings or other visual details that are visible or appearing on the surface of an object. For example, image portion 5610 may represent visual marking 3612 of FIG. 3B, which may be printed or otherwise disposed on the top surface of object 3610.

図５Ｂは、ステップ４００２の画像情報が３Ｄ画像情報５７００を含む、実施例を示す。より具体的には、３Ｄ画像情報５７００は、物体３５１０～３５５０の一つまたは複数の表面（例えば、上部表面、または他の外側表面）上のさまざまな位置のそれぞれの奥行き値を示す、例えば、奥行きマップまたは点群を含み得る。例えば、３Ｄ画像情報５７００は、物体３５１０の表面上の一組の位置５７１０_１～５７１０_ｎ（物理的位置とも呼ぶ）のそれぞれの奥行き値を示す第一の部分５７１０（画像部分とも呼ぶ）と、物体３５２０の表面上の一組の位置５７２０_１～５７２０_ｎのそれぞれの奥行き値を示す第二の部分５７２０と、物体３５３０の表面上の一組の位置５７３０_１～５７３０_ｎのそれぞれの奥行き値を示す第三の部分５７３０と、物体３５４０の表面上の一組の位置５７４０_１～５７４０_ｎのそれぞれの奥行き値を示す第四の部分５７４０と、物体３５５０の表面上の一組の位置５７５０_１～５７５０_ｎのそれぞれの奥行き値を示す第五の部分５７５０とを含み得る。それぞれの奥行き値は、３Ｄ画像情報を生成するカメラ（例えば、３２００／３２００Ｂ）に対するものであってもよく、またはいくつかの他の基準点に対するものであり得る。一部の実装形態では、３Ｄ画像情報は、カメラ視野（例えば、３２１０／３２１０Ｂ）の中にある物体の構造上のさまざまな位置に対するそれぞれの座標を含む、点群を含み得る。図５Ｂの実施例では、点群は、物体３５１０～３５５０のそれぞれの表面上の位置を記述する、それぞれの座標セットを含み得る。座標は、［ＸＹＺ］座標などの３Ｄ座標であってもよく、カメラ座標系、またはいくつかの他の座標系に対する値を有し得る。実施例として、カメラ座標系は、図３Ａ、３Ｃ、および５Ｂに示される、Ｘ、Ｙ、Ｚによって定義される。 5B shows an example in which the image information of step 4002 includes 3D image information 5700. More specifically, the 3D image information 5700 may include, for example, a depth map or point cloud indicating respective depth values for various locations on one or more surfaces (e.g., top surfaces, or other exterior surfaces) of the objects 3510-3550. For example, the 3D image information 5700 may include a first portion 5710 (also referred to as an image portion) indicating the depth values of each of a set of positions 5710 ₁ to 5710 _n (also referred to as physical positions) on the surface of the object 3510, a second portion 5720 indicating the depth values of each of the set of positions 5720 ₁ to 5720 _n on the surface of the object 3520, a third portion 5730 indicating the depth values of each of the set of positions 5730 ₁ to 5730 _n on the surface of the object 3530, a fourth portion ₅₇₄₀ indicating the depth values of each of the set of positions 5740 ₁ to 5740 _n on the surface of the object 3540, and a fifth portion 5750 indicating the depth values of each of the set of positions 5750 1 to 5750 _n on the surface of the object 3550. Each depth value may be relative to the camera (e.g., 3200/3200B) generating the 3D image information, or may be relative to some other reference point. In some implementations, the 3D image information may include a point cloud including respective coordinates for various positions on the structure of the object within the camera field of view (e.g., 3210/3210B). In the example of FIG. 5B, the point cloud may include respective sets of coordinates describing positions on the surface of each of the objects 3510-3550. The coordinates may be 3D coordinates, such as [X Y Z] coordinates, and may have values relative to the camera coordinate system, or some other coordinate system. As an example, the camera coordinate system is defined by X, Y, Z, as shown in FIGS. 3A, 3C, and 5B.

実施形態では、ステップ４００２は、２Ｄ画像情報および３Ｄ画像情報の両方を受信することを伴い得る。一部の実例では、計算システム１１００は、２Ｄ画像情報を使用して、３Ｄ画像情報の限定を補償してもよく、その逆も可能である。例えば、カメラ視野の中の複数の物体が互いに近接して配置され、カメラ（例えば、３２００Ｂ）に対して実質的に等しい奥行きを有する場合、３Ｄ画像情報（例えば、５７００）は、実質的に等しい奥行き値を有する複数の位置を記述してもよく、特に、物体間の間隔が３Ｄ画像情報の解像度に対して狭すぎる場合、３Ｄ画像情報で表される個々の物体を区別するための詳細を欠いてもよい。一部の実例では、３Ｄ画像情報は、個々の物体を区別することの困難さをさらに増大させ得る、ノイズまたは他の誤差源のために、誤りまたは欠落した情報を有し得る。この実施例では、２Ｄ画像情報は、個々の物体間の物理的エッジを捕捉またはその他の方法で表すことによって、この詳細の欠如を補償し得る。しかしながら、一部の実例では、２Ｄ画像情報は、以下で論じるように、実際の物理的エッジに対応しない候補エッジであり得る、偽エッジを含み得る。いくつかの実装では、計算システム１１００は、ステップ４００６に関して以下で論じるように、２Ｄ画像情報が候補エッジで定義された暗さ条件を満たすかどうかを判定することによって、候補エッジが偽エッジである可能性を評価し得る。いくつかの実装では、計算システム１１００は、候補エッジが、３Ｄ画像情報が奥行きの急激な変化を記述する物理的位置に対応する時など、候補エッジが３Ｄ画像情報内の物理的エッジに対応するかどうかを判定し得る。こうしたシナリオでは、３Ｄ画像情報は、候補エッジが偽エッジであるかをチェックするために使用されてもよく、定義された暗さ条件の使用を補完または置換して、候補エッジが偽エッジであるか、または候補エッジが実際の物理的エッジに対応するかどうかを判定する、より堅牢な方法を提供し得る。 In an embodiment, step 4002 may involve receiving both 2D image information and 3D image information. In some instances, the computing system 1100 may use the 2D image information to compensate for limitations of the 3D image information, or vice versa. For example, if multiple objects in a camera field of view are positioned close to each other and have substantially equal depths relative to the camera (e.g., 3200B), the 3D image information (e.g., 5700) may describe multiple positions with substantially equal depth values and may lack detail to distinguish individual objects represented in the 3D image information, especially if the spacing between the objects is too narrow for the resolution of the 3D image information. In some instances, the 3D image information may have erroneous or missing information due to noise or other error sources that may further increase the difficulty of distinguishing individual objects. In this example, the 2D image information may compensate for this lack of detail by capturing or otherwise representing physical edges between individual objects. However, in some instances, the 2D image information may include false edges, which may be candidate edges that do not correspond to actual physical edges, as discussed below. In some implementations, the computing system 1100 may evaluate the likelihood that a candidate edge is a false edge by determining whether the 2D image information satisfies a darkness condition defined at the candidate edge, as discussed below with respect to step 4006. In some implementations, the computing system 1100 may determine whether the candidate edge corresponds to a physical edge in the 3D image information, such as when the candidate edge corresponds to a physical location where the 3D image information describes an abrupt change in depth. In such scenarios, the 3D image information may be used to check whether the candidate edge is a false edge, which may complement or replace the use of the defined darkness condition to provide a more robust method of determining whether the candidate edge is a false edge or whether the candidate edge corresponds to an actual physical edge.

図４に戻ると、方法４０００は、一実施形態において、計算システム１１００が、ステップ４００２の画像情報から、物体のグループ（例えば、３５１０～３５５０）に関連付けられる複数の候補エッジを識別する、ステップ４００４を含み得る。実施形態では、候補エッジは、物体または物体のグループの物理的エッジを表すための候補を形成する、画像位置または物理的位置のセットであってもよく、またはそれを含んでもよい。一実施例では、画像情報が、一つまたは複数の物体を表すための２Ｄ画像を含む場合、候補エッジは、ピクセル位置（例えば、ピクセル位置［ｕ_１ｖ_１］～［ｕ_ｋｖ_ｋ］）などの画像位置のセットを指し得る。ピクセル位置のセットは、物理的エッジに集合的に類似しているピクセルのセットに対応し得る。例えば、図６Ａは、計算システム１１００が、２Ｄ画像５６００から候補エッジ５６０１_１、５６０１_２、５６０１_３、５６０１_４、５６０１_５、５６０１_６、．．．５６０１_ｎを識別する実施例を示す。候補エッジ５６０１_１～５６０１_ｎの各候補エッジは、例えば、２Ｄ画像が画像強度の急激な変化を有する線または線分を画定する、ピクセル位置のそれぞれのセットを含んでもよく、またはそれらによって形成され得る。画像強度の急激な変化は、例えば、一方の画像領域が他方よりも暗い、互いに直に隣接する二つの画像領域の間で発生し得る。以下でより詳細に論じるように、候補エッジは、二つの画像領域の間の境界に基づいて形成され得る。こうした実施例では、境界は、上で論じた線または線分によって形成され得る。２Ｄ画像（例えば、５６００）から識別される候補エッジは、２Ｄ候補エッジまたは２Ｄエッジと呼んでもよい。 Returning to FIG. 4, method 4000 may include step 4004 in which, in one embodiment, computing system 1100 identifies from the image information of step 4002 a number of candidate edges associated with the group of objects (e.g., 3510-3550). In an embodiment, the candidate edges may be or include image locations or sets of physical locations that form candidates for representing a physical edge of an object or group of objects. In one example, where the image information includes 2D images for representing one or more objects, the candidate edges may refer to a set of image locations, such as pixel locations (e.g., pixel locations [u ₁ v ₁ ]-[u _k v _k ]). The set of pixel locations may correspond to a set of pixels that collectively resemble a physical edge. For example, FIG. 6A illustrates computing system 1100 identifying candidate edges 5601 ₁ , 5601 ₂ , 5601 ₃ , 5601 ₄ , 5601 ₅ , 5601 ₆ , . 5601 _n . Each of candidate edges 5601 ₁ -5601 _n may include or be formed, for example, by a respective set of pixel locations where the 2D image defines a line or line segment having an abrupt change in image intensity. An abrupt change in image intensity may occur, for example, between two image regions immediately adjacent to one another, where one image region is darker than the other. As discussed in more detail below, the candidate edge may be formed based on a boundary between the two image regions. In such an example, the boundary may be formed by the line or line segment discussed above. A candidate edge identified from a 2D image (e.g., 5600) may be referred to as a 2D candidate edge or a 2D edge.

実施形態では、画像情報は、実際の物理的エッジに対応するいくつかの候補エッジを含んでもよく、偽エッジであるいくつかの候補エッジを含んでもよい。例えば、図６Ａの候補エッジ５６０１_１、５６０１_２、５６０１_５、５６０１_６は、物体のグループ３５１０～３５５０の実際の物理的エッジに対応するか、より具体的には、物理的物体３５１０に対応することができ、一方で、候補エッジ５６０１_３および５６０１_４は偽エッジであり得る。候補エッジ５６０１_３および５６０１_４は、例えば、物体３５１０の表面上に表示される可視線またはその他の視覚的マーキングを表し得る。これらの可視線は、物理的エッジに類似し得るが、物体３５１０～３５５０の実際の物理的エッジには対応しない。従って、ステップ４００８に関して以下で論じるように、方法４０００は、一実施形態では、物体のグループの物理的エッジのうちの少なくとも一つを表すために、特定の候補エッジを使用するべきかどうかを判定することを伴い得る。 In an embodiment, the image information may include some candidate edges that correspond to actual physical edges and some candidate edges that are false edges. For example, candidate edges 5601 ₁ , 5601 ₂ , 5601 ₅ , 5601 ₆ in FIG. 6A may correspond to actual physical edges of the group of objects 3510-3550, or more specifically, to physical object 3510, while candidate edges 5601 ₃ and 5601 ₄ may be false edges. Candidate edges 5601 ₃ and 5601 ₄ may represent, for example, visible lines or other visual markings displayed on the surface of object 3510. These visible lines may resemble physical edges but do not correspond to actual physical edges of objects 3510-3550. Thus, as discussed below with respect to step 4008, method 4000, in one embodiment, may involve determining whether a particular candidate edge should be used to represent at least one of the physical edges of the group of objects.

一実施例では、画像情報が３Ｄ情報を含む場合、候補エッジは、画像位置のセットまたは物理的位置のセットを指し得る。実施例として、画像位置がピクセル位置である場合、それらは、物理的エッジのように見えるピクセルのセットに対応し得る。別の実施例では、３Ｄ画像情報が奥行きマップを含む場合、候補エッジは、奥行きマップに奥行きの急激な変化がある境界を形成し得る、例えば、線または線分を画定する、ピクセル位置のセットを含み得る。３Ｄ画像情報が、物体の表面上の（例えば、点群を介して）物理的位置の３Ｄ座標を記述する場合、候補エッジは、点群または他の３Ｄ画像情報の奥行きの急激な変化がある境界を形成し得る、例えば、仮想線または線分を画定する、物理的位置のセットを含み得る。例えば、図６Ｂは、計算システム１１００が、３Ｄ画像情報５７００から候補エッジ５７０１_１、５７０１_２、５７０１_３、５７０１_ｎを識別した実施例を示す。候補エッジ５７０１_１～５７０１_ｎの各候補エッジは、例えば、奥行きの急激な変化が生じる境界を画定する、物理的位置［Ｘ_１Ｙ_１Ｚ_１］～［Ｘ_ｐＹ_ｐＺ_１］のセットを含み得る。３Ｄ画像情報から識別される候補エッジは、３Ｄ候補エッジまたは３Ｄエッジと呼んでもよい。 In one example, if the image information includes 3D information, the candidate edges may refer to a set of image locations or a set of physical locations. As an example, if the image locations are pixel locations, they may correspond to a set of pixels that appear to be physical edges. In another example, if the 3D image information includes a depth map, the candidate edges may include a set of pixel locations that may form a boundary where there is an abrupt change in depth in the depth map, e.g., defining a line or line segment. If the 3D image information describes 3D coordinates of physical locations on the surface of an object (e.g., via a point cloud), the candidate edges may include a set of physical locations that may form a boundary where there is an abrupt change in depth of the point cloud or other 3D image information, e.g., defining a virtual line or line segment. For example, FIG. 6B illustrates an example where the computing system 1100 has identified candidate edges 5701 ₁ , 5701 ₂ , 5701 ₃ , 5701 _n from the 3D image information 5700. Each candidate edge 5701 ₁ -5701 _n may, for example, include a set of physical locations [X ₁ Y ₁ Z ₁ ] - [X _p Y _p Z ₁ ] that define a boundary where an abrupt change in depth occurs. Candidate edges identified from the 3D image information may be referred to as 3D candidate edges or 3D edges.

実施形態では、計算システム１１００が２Ｄ候補エッジおよび３Ｄ候補エッジの両方を識別する場合、計算システム１１００は、２Ｄ候補エッジ（例えば、５６０１_５）のいずれかが、３Ｄ候補エッジ（例えば、５７０１_１）のうちの一つと共通の物理的エッジを表すか、またはその逆であるかを判定するように構成され得る。言い換えれば、計算システム１１００は、２Ｄ候補エッジのいずれかが３Ｄ候補エッジのうちの一つにマッピングするか、またはその逆であるかを判定し得る。マッピングは、例えば、２次元候補エッジの座標を、２Ｄ画像情報の座標系で表されることから３Ｄ画像情報の座標系で表されることまで変換すること、または３Ｄ次元候補エッジの座標を、３Ｄ画像情報の座標系で表されることから２Ｄ画像情報の座標系で表されることまで変換することに基づいてもよい。２Ｄ候補エッジから３Ｄ候補エッジまでのマッピングは、「ＭＥＴＨＯＤＡＮＤＣＯＭＰＵＴＩＮＧＳＹＳＴＥＭＦＯＲＰＲＯＣＥＳＳＩＮＧＣＡＮＤＩＤＡＴＥＥＤＧＥＳ」と題する、米国特許出願第１６／７９１，０２４号（弁理士整理番号ＭＪ００４９－ＵＳ／００７７－０００９ＵＳ１）でより詳細に論じられ、その全体の内容が参照により本明細書に組み込まれる。 In an embodiment, if the computing system 1100 identifies both 2D and 3D candidate edges, the computing system 1100 may be configured to determine whether any of the 2D candidate edges (e.g., 5601 ₅ ) represent a common physical edge with one of the 3D candidate edges (e.g., 5701 ₁ ), or vice versa. In other words, the computing system 1100 may determine whether any of the 2D candidate edges map to one of the 3D candidate edges, or vice versa. The mapping may be based, for example, on transforming the coordinates of the two-dimensional candidate edge from being represented in the coordinate system of the 2D image information to being represented in the coordinate system of the 3D image information, or on transforming the coordinates of the 3D dimensional candidate edge from being represented in the coordinate system of the 3D image information to being represented in the coordinate system of the 2D image information. Mapping of 2D candidate edges to 3D candidate edges is discussed in more detail in U.S. patent application Ser. No. 16/791,024, entitled "METHOD AND COMPUTTING SYSTEM FOR PROCESSING CANDIDATE EDGES" (Attorney Docket No. MJ0049-US/0077-0009US1), the entire contents of which are incorporated herein by reference.

上述のように、計算システム１１００は、画像強度（例えば、ピクセル強度）の急激な変化がある２Ｄ画像情報内の画像位置（例えば、ピクセル位置）を識別することによって、２Ｄ画像または他の２Ｄ画像情報から候補エッジを識別し得る。いくつかの実施では、急激な変化は、一つの画像領域が他の画像領域よりも暗い、二つの画像領域の間の境界で生じ得る。例えば、二つの画像領域は、第一の画像領域および第二の画像領域を含み得る。第一の画像領域は、第二の画像領域を含み得る、一つまたは複数の直に隣接する領域よりも暗い２Ｄ画像の領域であり得る。画像領域の暗さは、画像情報（例えば、５６００）を生成するカメラ（例えば、３２００／３２００Ａ）によって、対応する物理的領域からどれだけの量の反射光が検出されたか、または他の方法で感知されたかを示し得る。より具体的には、暗い画像領域は、カメラが、対応する物理領域から比較的少量の反射光（または反射光なし）を感知したことを示し得る。実施形態によっては、画像領域の暗さは、画像領域の中の画像強度が、最小の可能な画像強度値（例えば、ゼロ）にどれほど近いかを示し得る。これらの実装では、暗い画像領域が、画像領域の画像強度値（複数可）がゼロに近いことを示してもよく、それよりも暗くない画像領域が、画像領域の画像強度値（複数可）が、最大の可能な画像強度値に近いことを示し得る。 As described above, the computing system 1100 may identify candidate edges from a 2D image or other 2D image information by identifying image locations (e.g., pixel locations) within the 2D image information where there is an abrupt change in image intensity (e.g., pixel intensity). In some implementations, the abrupt change may occur at a boundary between two image regions where one image region is darker than the other image region. For example, the two image regions may include a first image region and a second image region. The first image region may be a region of the 2D image that is darker than one or more immediately adjacent regions, which may include the second image region. The darkness of an image region may indicate how much reflected light has been detected or otherwise sensed from the corresponding physical region by a camera (e.g., 3200/3200A) generating the image information (e.g., 5600). More specifically, a dark image region may indicate that the camera has sensed a relatively small amount (or no) reflected light from the corresponding physical region. In some embodiments, the darkness of an image region may indicate how close the image intensity within the image region is to the minimum possible image intensity value (e.g., zero). In these implementations, a dark image region may indicate that the image intensity value(s) of the image region are closer to zero, and a less dark image region may indicate that the image intensity value(s) of the image region are closer to the maximum possible image intensity value.

実施形態では、第二の画像領域は、長方形バンドまたは線または線分などの細長い形状を有し得る。実施例として、図７Ａは、それぞれ、直に隣接する画像領域５６０５_１、５６０５_２、５６０５_３、５６０５_４、５６０５_５、５６０５_６よりも暗い、画像領域５６０３_１、５６０３_２、５６０３_３、５６０３_４、５６０３_５、５６０３_６の例を示す。画像５６００がピクセルを含む場合、図７Ａの画像領域は、ピクセル領域であり得る。画像領域５６０３_１～５６０３_４はそれぞれ、長方形バンドなどのピクセルのバンドであってもよく、および複数のピクセルの幅を有してもよく、画像領域５６０３_５および５６０３_６はそれぞれ、一つのピクセルの幅を有するピクセルの線であってもよく、または形成し得る。上述のように、候補エッジは、第一の画像領域（例えば、画像領域５６０３_１～５６０３_６のうちの一つ）と第二の画像領域（例えば、画像領域５６０５_１～５６０５_６のうちの一つ）との間よって形成されてもよく、またはそれらの境界に基づいて形成されてもよく、第一の画像領域は、第二の画像領域に直に隣接してもよく、第二の画像領域よりも暗くてもよく、画像強度における急激な変化が二つの画像領域の間の境界で生じてもよい。例えば、図７Ｂは、画像領域５６０３_１～５６０３_６と対応する直に隣接する画像領域５６０５_１～５６０５_６との間のそれぞれの境界によって画定または形成される、候補エッジ５６０１_１～５６０１_６を示す。実施例として、計算システム１１００は、候補エッジ５６０１_１を、一つの画像領域５６０５_１と別の暗い画像領域５６０３_１との間の境界を画定するピクセル位置のセットとして識別し得る。追加の実施例として、計算システム１１００は、候補エッジ５６０１_５を、一つの画像領域５６０５_５と暗い画像領域５６０３_５との間の境界を画定するピクセル位置のセットとして識別し得る。一部の実例では、候補エッジ５６０１_５のピクセル位置は、暗い画像領域５６０３_５に位置し得る。より具体的には、候補エッジ５６０１_５は、例えば、単一のピクセルの幅を有するピクセルの線であり得る画像領域５６０３_５であり得るか、または合致し得る。 In an embodiment, the second image region may have an elongated shape such as a rectangular band or a line or line segment. As an example, Figure 7A shows examples of image regions ₅₆₀₃₁ , ₅₆₀₃₂ , ₅₆₀₃₃ , _{56034, 56035} _, ₅₆₀₃₆ that are darker than immediately adjacent image regions ₅₆₀₅₁ , ₅₆₀₅₂ , ₅₆₀₅₃ , 56054, ₅₆₀₅₅ , ₅₆₀₅₆ _, respectively. If image 5600 contains pixels, the image regions of Figure _7A may be pixel regions. Image regions _56031-56034 may each be a band of pixels, such as a rectangular band, and may have a width of multiple pixels, while image regions ₅₆₀₃₅ and ₅₆₀₃₆ may each be, or form, a line of pixels having a width of one pixel. As discussed above, a candidate edge may be formed by or based on a boundary between a first image region (e.g., one of image regions 5603 ₁ -5603 ₆ ) and a second image region (e.g., one of image regions 5605 ₁ -5605 ₆ ), where the first image region may be immediately adjacent to the second image region and may be darker than the second image region, and an abrupt change in image intensity may occur at the boundary between the two image regions. For example, FIG. 7B shows candidate edges 5601 ₁ -5601 ₆ defined or formed by respective boundaries between image regions 5603 ₁ -5603 ₆ and corresponding immediately adjacent image regions 5605 ₁ -5605 _6. As an example, computing system 1100 may identify candidate edge 5601 ₁ as a set of pixel locations that define a boundary between one image region 5605 ₁ and another dark image region 5603 ₁ . As an additional example, the computing system 1100 may identify a candidate edge 5601 ₅ as a set of pixel locations that define a boundary between one image region 5605 ₅ and a dark image region 5603 _5. In some instances, the pixel locations of the candidate edge 5601 ₅ may be located in the dark image region 5603 _5. More specifically, the candidate edge 5601 ₅ may be or may coincide with the image region 5603 ₅ , which may be, for example, a line of pixels having a width of a single pixel.

実施形態では、計算システム１１００は、例えば、画像強度の急激な変化を検出し得る、画像エッジ検出技術に基づいて、候補エッジ５６０１_１～５６０１_６の一つなどの候補エッジを検出またはその他の方法で識別し得る。例えば、計算システム１１００は、ソベルオペレーター、プレウィットオペレーター、または２Ｄ画像内の強度勾配を決定するための他の技術を適用することによって、および／またはＣａｎｎｙエッジ検出器または他のエッジ検出技術を適用することによって２Ｄ画像内の候補エッジまたは他の画像情報を検出するように構成され得る。 In an embodiment, computing system 1100 may detect or otherwise identify a candidate edge, such as one of candidate edges 5601 ₁ -5601 ₆ , based on an image edge detection technique, which may, for example, detect an abrupt change in image intensity. For example, computing system 1100 may be configured to detect candidate edges or other image information in the 2D image by applying a Sobel operator, a Prewitt operator, or other technique for determining intensity gradients in the 2D image, and/or by applying a Canny edge detector or other edge detection technique.

実施形態では、計算システム１１００が、一つまたは複数の直に隣接する画像領域よりも暗いピクセルのバンドである２Ｄ画像内の画像領域を識別する場合、画像領域は、いくつかの環境では、候補エッジよりも多くを形成するのに十分な幅であり得る。例えば、図７Ｃは、画像領域５６０３_１～５６０３_４に基づいて、追加の候補エッジ５６０１_７～５６０１_１０を識別する計算システム１１００を示す。より具体的には、追加の候補エッジ５６０１_７～５６０１_１０は、画像領域５６０３_１～５６０３_４と、直に隣接する画像領域５６０７_１～５６０７_４との間のそれぞれの境界を画定する、ピクセル位置のそれぞれのセットであり得る。より具体的な例として、この実施例の画像領域５６０３_２は、画像エッジ検出技術が、図７Ｂに示されるように、画像領域５６０３_２の一方の側（例えば、右側）と直に隣接する領域５６０５_２との間の境界によって形成される候補エッジ５６０１_２を識別し、および図７Ｃに示されるように、画像領域５６０３_２の反対側（例えば、左側）と直に隣接する領域５６０７_２との間の境界によって形成される別の候補エッジ５６０１_８をさらに識別するように、十分に広くてもよい。実施形態では、画像領域は、画像エッジ検出技術が、画像領域から単一の候補エッジのみを識別し得るように非常に狭くてもよい。こうした実施形態では、画像領域は、単一のピクセル、または数ピクセルの幅を有し得る。例えば、上述のように、画像領域５６０３_５は、ピクセルの線を形成してもよく、一つのピクセルの幅を有し得る。この実施例では、計算システム１１００は、画像領域５６０３_５に基づいて単一の候補エッジ５６０１_５のみを識別してもよく、候補エッジ５６０１_５は、例えば、画像候補エッジ５６０１_５が、画像領域５６０３_５を形成するピクセルの線であり得るか、または重複し得るように、画像領域５６０３_５と合致し得る。 In an embodiment, where the computing system 1100 identifies an image region in a 2D image that is a band of pixels darker than one or more immediately adjacent image regions, the image region may in some circumstances be wide enough to form _more than a candidate edge. For example, Figure 7C shows computing system 1100 identifying additional candidate edges _56017-560110 based on image regions _56031-56034 . More specifically, _the additional candidate _edges _56017-560110 may be respective sets of pixel locations that define respective boundaries between image regions _56031-56034 and immediately _adjacent image regions _56071-56074 _. As a more specific example, image region 5603 ₂ in this example may be sufficiently wide such that image edge detection techniques identify a candidate edge 5601 ₂ formed by the boundary between one side (e.g., the right side) of image region 5603 ₂ and immediately adjacent region 5605 ₂ , as shown in FIG. 7B, and further identify another candidate edge 5601 ₈ formed by the boundary between the opposite side (e.g., the left side) of image region 5603 ₂ and immediately adjacent region 5607 ₂ , as shown in FIG. 7C. In an embodiment, the image region may be very narrow such that image edge detection techniques may identify only a single candidate edge from the image region. In such an embodiment, the image region may have a width of a single pixel, or a few pixels. For example, as described above, image region 5603 ₅ may form a line of pixels and may have a width of one pixel. In this example, the computing system 1100 may identify only a single candidate edge 5601 ₅ based on the image region 5603 ₅ , and the candidate edge 5601 ₅ may match the image region 5603 ₅ , for example, such that the image candidate edge 5601 ₅ may be a line of pixels that form the image region 5603 ₅ or may overlap.

図４に戻ると、方法４０００は、実施形態で、ステップ４００４の複数の候補エッジが、第一の画像領域と第二の画像領域との間の境界によって形成される第一の候補エッジを含むときに行われてもよい、ステップ４００６を含み得、第一の画像領域は、画像強度において第二の画像領域よりも暗くてもよく、第二の画像領域に直に隣接し得る。この実施例では、第一の画像領域および第二の画像領域は、２Ｄ画像（例えば、５６００）または他の画像情報によって記述される領域であり得る。例えば、図７Ａ～７Ｃは、複数の候補エッジ５６０１_１～５６０１_ｎの例を提供し、ステップ４００６の第一の候補エッジは、複数の候補エッジ５６０１_１～５６０１_ｎのうちのいずれか一つであり得る。上述のように、第一の候補エッジは、第一の画像領域と第二の明るい画像領域との間の境界によって形成されてもよく、またはそれらに基づいて形成され得る。実施例として、第一の候補エッジが候補エッジ５６０１_１である場合、第一の画像領域は、画像領域５６０３_１であってもよく、第二の画像領域は、５６０５_１であり得る。別の例として、第一の候補エッジが候補エッジ５６０１_２である場合、第一の画像領域は画像領域５６０３_２であってもよく、第二の画像領域は画像領域５６０５_２であり得る。 Returning to FIG. 4, method 4000 may include step 4006, which may be performed in an embodiment when the plurality of candidate edges of step 4004 includes a first candidate edge formed by a boundary between a first image region and a second image region, where the first image region may be darker in image intensity than the second image region and may be immediately adjacent to the second image region. In this example, the first image region and the second image region may be regions described by a 2D image (e.g., 5600) or other image information. For example, FIGS. 7A-7C provide examples of a plurality of candidate edges 5601 ₁ -5601 _n , and the first candidate edge of step 4006 may be any one of the plurality of candidate edges 5601 ₁ -5601 _n . As discussed above, the first candidate edge may be formed by or based on a boundary between a first image region and a second bright image region. As an example, if the first candidate edge is candidate edge _5601-1 , the first image region may be image region _5603-1 and the second image region may be _5605-1 . As another example, if the first candidate edge is candidate edge 5601-2, the first image region may be image _region _5603-2 and the second image region may be image region _5605-2 .

ステップ４００６で、計算システム１１００は、画像情報（例えば、２Ｄ画像５６００）が、第一の候補エッジ（例えば、５６０１_２）で定義された暗さ条件を満たすかどうかを判定し得る。こうした決定は、より具体的には、例えば、第一の画像領域（例えば、５６０３_２）が定義された暗さ条件を満たすかどうかを判定することを含み得る。実施形態では、定義された暗さ条件を使用して、第一の候補エッジ（例えば、５６０１_２）が、カメラ視野（例えば、３２１０／３２１０Ａ）中の物体（例えば、３５１０）の実際の物理的エッジに対応するか、または第一の候補エッジが偽エッジであるかを判定することができる。 At step 4006, the computing system 1100 may determine whether the image information (e.g., 2D image 5600) satisfies a defined darkness condition at the first candidate edge (e.g., 5601 ₂ ). Such a determination may more specifically include, for example, determining whether the first image region (e.g., 5603 ₂ ) satisfies the defined darkness condition. In an embodiment, the defined darkness condition may be used to determine whether the first candidate edge (e.g., 5601 ₂ ) corresponds to an actual physical edge of the object (e.g., 3510) in the camera field of view (e.g., 3210/3210A) or whether the first candidate edge is a false edge.

実施形態では、定義された暗さ条件は、画像プライア、またはより具体的には、ダークプライアを検出するために使用され得る。画像プライアは、特定の状況中に画像内に現れる可能性を有するか、または画像内に予期され得る、画像特徴を指し得る。より具体的には、画像プライアは、画像が、カメラ視野において互いに隣り合わせに置かれる箱または他の物体のグループを表すために生成される状況など、このような状況で生成される画像にどのような画像特徴（複数可）があるかの予期、期待、または予測に対応し得る。実例によっては、ダークプライアは、高レベルの暗さを有する、および／またはスパイク形状の画像強度プロファイル（例えば、ピクセル強度プロファイル）を有する画像特徴を指し得る。スパイク形状の画像強度プロファイルは、暗さのスパイク増加および／または画像強度のスパイク減少を伴い得る。ダークプライアは、物体の一部または全ての間に狭い、物理的ギャップのみが存在するように、カメラ視野内の箱または他の物体のグループが互いに十分に近接して配置される状況に対応し得る。より具体的には、ダークプライアは、物体のグループを表すためにそのような状況で画像が生成されるとき、物理的ギャップが画像内で非常に暗く見えるであろうという予期、期待、または予測に対応し得る。より具体的には、ダークプライアは、以下でより詳細に論じるように、物理的ギャップを表す画像中の画像領域が高レベルの暗さを有するとの予期または予測に対応してもよく、および／またはスパイク形状の画像強度プロファイルを有し得る。いくつかの実施では、候補エッジに関連付けられる画像領域が、二つの物体間の物理的ギャップに対応するかどうかを評価することによって、候補エッジが、物理的エッジに対応するかどうかを判定するために、ダークプライアを使用し得る。 In an embodiment, the defined darkness conditions may be used to detect image priors, or more specifically, dark priors. Image priors may refer to image features that have the potential to appear in an image during a particular situation or that may be expected in an image. More specifically, image priors may correspond to expectations, expectations, or predictions of what image feature(s) will be present in an image generated in such a situation, such as a situation where an image is generated to represent a group of boxes or other objects placed next to each other in a camera field of view. In some instances, dark priors may refer to image features that have a high level of darkness and/or have a spike-shaped image intensity profile (e.g., pixel intensity profile). A spike-shaped image intensity profile may involve a spike increase in darkness and/or a spike decrease in image intensity. A dark prior may correspond to a situation where a group of boxes or other objects in a camera field of view are placed close enough to each other that there are only narrow, physical gaps between some or all of the objects. More specifically, the dark prior may correspond to an expectation, expectation, or prediction that a physical gap will appear very dark in an image when the image is generated in such a situation to represent a group of objects. More specifically, the dark prior may correspond to an expectation or prediction that an image region in an image representing a physical gap will have a high level of darkness and/or may have a spike-shaped image intensity profile, as discussed in more detail below. In some implementations, the dark prior may be used to determine whether a candidate edge corresponds to a physical edge by evaluating whether an image region associated with the candidate edge corresponds to a physical gap between two objects.

実施形態では、一部のシナリオでは、ダークプライアを検出するための条件であり得る、定義された暗さ条件は、二つの物体間（例えば、図３Ａ～３Ｃの３５１０および３５２０）の物理的ギャップがどのようになっているのかというモデルに基づいてもよく、特に物理的ギャップが狭い場合（例えば、５ｍｍ未満または１０ｍｍ未満）、２Ｄ画像に表示される必要がある、または表示される可能性が高い。例えば、定義された暗さ条件は、拡散反射率のランバートモデルに基づいてもよい。反射率のこうしたモデルは、一つまたは複数の表面または領域、特に入射光の拡散反射を引き起こす表面または領域から光がどのように反射するかを推定し得る。従って、モデルは、表面または表面領域からの反射光の強度を推定してもよく、反射光を感知するカメラ（例えば、３２００／３２００Ａ）によって生成される画像において、表面または表面領域がどれほど明るく、またはどれほど暗いかを示し得る。 In an embodiment, the defined darkness condition, which in some scenarios may be a condition for detecting a dark prior, may be based on a model of what the physical gap between two objects (e.g., 3510 and 3520 in FIGS. 3A-3C) is, and should or is likely to be visible in the 2D image, especially if the physical gap is narrow (e.g., less than 5 mm or less than 10 mm). For example, the defined darkness condition may be based on a Lambertian model of diffuse reflectance. Such a model of reflectance may estimate how light reflects from one or more surfaces or regions, especially surfaces or regions that cause diffuse reflection of incident light. Thus, the model may estimate the intensity of reflected light from a surface or surface region, and may indicate how bright or dark the surface or surface region is in an image generated by a camera (e.g., 3200/3200A) that senses the reflected light.

ランバートモデルが物体のグループ（例えば、図３Ａ～３Ｃの３５１０～３５５０）にどのように適用されるかの実施例として、図８は、物体３５１０、３５２０のさまざまな表面からくる反射光を感知することによって、少なくとも物体３５１０および３５２０を表す画像（例えば、５６００）を生成するように構成される、カメラ３２００Ａを描写する。一部のシナリオでは、反射光は、光源３６００からの放射光の反射であり得る。より具体的には、光源３６００は、物体３５１０、３５２０に向かって少なくともベクトルに沿って光を放射し得る。

この実施例では、物体３５１０および３５２０は、互いに隣接して配置されてもよく、物体３５１０の物理的エッジ部３５１０Ｂと物体３５２０の物理的エッジ部３５２０Ｄとの間の空間であり得る狭い物理的ギャップｇによって分離され得る。図８のシナリオでは、画像（例えば、５６００）中の物理的ギャップｇの画像強度は、物体３５１０の表面３５１０Ｅ（例えば、側面）などの物理的ギャップを形成するさまざまな表面から反射光の量に依存し得る。拡散反射率のランバートモデルの下で、特定の表面からの反射光の強度は、ｃｏｓαに比例するか、または他の方法でそれ基づいてもよく、ここで、αは、入射光のベクトル

と表面（例えば、３５１０Ｅ）の法線ベクトル

との間の角度である。図８Ａでは、ベクトル

と表面３５１０Ｅの法線ベクトル

との間の角度αは、実質的に９０度であってもよく、その結果、表面３５１０Ｅからの反射光の強度が極めて低くてもよく、または一部の状況ではゼロであり得る。従って、二つの物体間の物理的ギャップを表す画像領域は、極めて高いレベルの暗さを有し得る。従って、ステップ４００６の定義された暗さ条件は、いくつかの実装では、画像領域が、二つの物体間の物理的ギャップを表すことを示すために十分に暗いかどうかを評価するために使用され得る、定義された暗さ閾値基準を含んでもよい。 As an example of how the Lambertian model is applied to a group of objects (e.g., 3510-3550 in FIGS. 3A-3C), FIG. 8 depicts a camera 3200A configured to generate an image (e.g., 5600) representing at least the

objects

3510 and 3520 by sensing reflected light coming from various surfaces of the

objects

3510, 3520. In some scenarios, the reflected light may be a reflection of emitted light from a light source 3600. More specifically, the light source 3600 may emit light toward the

objects

3510, 3520 at least along a vector.

In this example, objects 3510 and 3520 may be positioned adjacent to one another and separated by a narrow physical gap g, which may be the space between physical edge 3510B of object 3510 and physical edge 3520D of object 3520. In the scenario of FIG. 8 , the image intensity of physical gap g in an image (e.g., 5600) may depend on the amount of reflected light from various surfaces that form the physical gap, such as surface 3510E (e.g., sides) of object 3510. Under the Lambertian model of diffuse reflectance, the intensity of reflected light from a particular surface may be proportional to or otherwise based on cos α, where α is the vector of incident light

and the normal vector of the surface (e.g., 3510E)

In FIG. 8A, the vector

and the normal vector of the surface 3510E

The angle α between may be substantially 90 degrees, such that the intensity of reflected light from surface 3510E may be extremely low, or in some circumstances zero. Thus, an image region representing a physical gap between two objects may have an extremely high level of darkness. Thus, the defined darkness condition of step 4006 may, in some implementations, include a defined darkness threshold criterion that may be used to evaluate whether an image region is sufficiently dark to indicate that it represents a physical gap between two objects.

一部の状況では、物理的ギャップは、その周辺よりもその真ん中で暗いように見え得る。すなわち、何らかの反射光が物理的ギャップを離れる場合、より多くの反射光が、物理的ギャップの真ん中からよりも物理的ギャップの外周から来てもよい。周辺部は、例えば、物理的エッジ部３５２０Ｄまたは物理的エッジ部３５１０Ｂに近い、物理的ギャップ内の位置を指し得る。一部のシナリオでは、物理的ギャップの真ん中に暗さのピークレベルが生じ得る。従って、物理的ギャップを表す画像領域は、画像強度プロファイルが、暗さのスパイク増加または画像領域内の画像強度のスパイク減少を有する、スパイク形状の画像強度プロファイル（例えば、ピクセル強度プロファイル）を有し得る。従って、ステップ４００６で定義された暗さ条件は、一部の状況では、画像領域が、例えば、スパイク形状の画像強度プロファイル（例えば、ステップ形状の画像強度プロファイルとは対照的に）を有するかどうかを評価するために、定義されたスパイク強度プロファイル基準を含み得る。 In some circumstances, a physical gap may appear darker in its middle than its periphery. That is, if any reflected light leaves the physical gap, more reflected light may come from the periphery of the physical gap than from the middle of the physical gap. The periphery may refer to a location within the physical gap, for example, closer to physical edge 3520D or physical edge 3510B. In some scenarios, a peak level of darkness may occur in the middle of the physical gap. Thus, an image region representing a physical gap may have a spike-shaped image intensity profile (e.g., pixel intensity profile), where the image intensity profile has a spike increase in darkness or a spike decrease in image intensity within the image region. Thus, the darkness condition defined in step 4006 may, in some circumstances, include a spike intensity profile criterion defined to evaluate whether the image region has, for example, a spike-shaped image intensity profile (as opposed to a step-shaped image intensity profile).

実施形態では、定義された暗さ条件は、例えば、非一時的コンピューター可読媒体１１２０または他の場所に格納される一つまたは複数の規則、基準、または他の情報によって定義され得る。例えば、情報は、暗さ閾値基準を満たすことによってのみ、スパイク強度プロファイル基準を満たすことによってのみ、両方の基準を満たすことによってのみ、または暗さ閾値基準もしくはスパイク強度プロファイル基準のいずれかを満たすことによってのみ、暗さ条件が満たされるかどうかを定義し得る。一部の実例では、定義された暗さ条件が、事前に定義された暗さ条件であり得るように、情報は、手動でまたは他の方法で事前に定義されてもよく、非一時的コンピューター可読媒体１１２０に記憶され得る。実例によっては、暗さ条件についての情報が動的に定義され得る。 In an embodiment, the defined darkness condition may be defined by one or more rules, criteria, or other information stored, for example, on the non-transitory computer-readable medium 1120 or elsewhere. For example, the information may define whether the darkness condition is met only by meeting a darkness threshold criterion, only by meeting a spike intensity profile criterion, only by meeting both criteria, or only by meeting either the darkness threshold criterion or the spike intensity profile criterion. In some instances, the information may be predefined manually or otherwise and stored on the non-transitory computer-readable medium 1120 such that the defined darkness condition may be a predefined darkness condition. In some instances, the information about the darkness condition may be dynamically defined.

実施形態では、定義された暗さ閾値基準および／または定義されたスパイク強度プロファイル基準は、非一時的コンピューター可読媒体１１２０または他の場所に格納される情報によって定義され得る。情報は、定義された暗さ閾値基準および／または定義されたスパイク強度プロファイル基準が、所定の基準（単数または複数）であり得るように、事前に定義され得る。実施形態では、本開示のさまざまな所定の閾値または他の所定の値は、非一時的コンピューター可読媒体１１２０または他の場所に保存値として手動で定義され得る。例えば、以下で論じる定義された暗さ閾値または定義された奥行き差閾値は、コンピューター可読媒体１１２０上に記憶される値であり得る。これらは、所定の値であってもよく、または動的に定義され得る。 In an embodiment, the defined darkness threshold criteria and/or the defined spike intensity profile criteria may be defined by information stored on the non-transitory computer readable medium 1120 or elsewhere. The information may be predefined such that the defined darkness threshold criteria and/or the defined spike intensity profile criteria may be a predetermined criteria or criteria. In an embodiment, the various predetermined thresholds or other predetermined values of the present disclosure may be manually defined as stored values on the non-transitory computer readable medium 1120 or elsewhere. For example, the defined darkness thresholds or defined depth difference thresholds discussed below may be values stored on the computer readable medium 1120. These may be predetermined values or may be dynamically defined.

図９Ａ～９Ｃは、画像情報（例えば、５６００）が候補エッジ５６０１_２で定義された暗さ条件を満たすか、より具体的には、画像領域５６０３_２が定義された暗さ条件を満たすかどうかを評価するための実施形態を示す。画像領域５６０３_２は、図８の物体３５１０などの第一の物体と、物体３５２０などの第二の物体との間の物理的ギャップを表し得る。実施形態では、候補エッジ５６０１_２は、物体３５１０の物理的エッジ５６１０Ｂを表してもよく、画像領域５６０３_２と直に隣接する画像領域５６０５_２との間の境界によって、またはその境界に基づいて形成され得る。この実施例では、画像領域５６０３_２は、第一の画像領域であってもよく、画像領域５６０５_２は、第二の画像領域であり得る。より具体的には、画像領域５６０３_２は、ピクセルのバンドを形成する第一のピクセル領域であってもよく、一方、画像領域５６０５_２は、第二のピクセル領域であってもよく、故に候補エッジ５６０１_２は、例えば、第一のピクセル領域と第二のピクセル領域との間の境界を画定するピクセル位置のセットを含んでもよく、またはそれによって形成され得る。上で論じたように、計算システム１１００は、例えば、画像領域５６０３_２、５６０５_２の間の画像強度（例えば、ピクセル強度）の急激な変化を検出することによって、候補エッジ５６０１_２を識別し得る。 9A-9C illustrate an embodiment for evaluating whether image information (e.g., 5600) satisfies a darkness condition defined by a candidate edge 5601 ₂ , and more specifically, whether image region 5603 ₂ satisfies the defined darkness condition. Image region 5603 ₂ may represent a physical gap between a first object, such as object 3510 of FIG. 8, and a second object, such as object 3520. In an embodiment, candidate edge 5601 ₂ may represent a physical edge 5610B of object 3510, and may be formed by or based on a boundary between image region 5603 ₂ and an immediately adjacent image region 5605 _2. In this example, image region 5603 ₂ may be a first image region and image region 5605 ₂ may be a second image region. More specifically, image region 5603 ₂ may be a first pixel region forming a band of pixels, while image region 5605 ₂ may be a second pixel region, and thus candidate edge 5601 ₂ may include or be formed by, for example, a set of pixel locations that define a boundary between the first pixel region and the second pixel region. As discussed above, computing system 1100 may identify candidate edge 5601 ₂ by, for example, detecting an abrupt change in image intensity (e.g., pixel intensity) between image regions 5603 ₂ , 5605 ₂ .

実施形態では、候補エッジが、第一の画像領域と第二の画像領域との間の境界に基づいて形成され、第一の画像領域が第二の画像領域よりも暗い場合、計算システム１１００は、第一の画像領域が定義されたスパイク強度プロファイル基準を満たす場合、定義された暗さ条件を満たすと決定し得る。より具体的には、計算システムは、第一の画像領域が、画像強度が、第一の画像領域内の暗闇において、第一の画像領域内の位置での暗さのピークレベルに向かって増加し、そしてその後、暗闇の中で減少する、形などの、その画像強度プロファイル（例えば、ピクセル強度プロファイル）に対して特定の形状を持っている場合、第一の画像領域（例えば、５６０３_２）が、定義されたスパイク強度プロファイル基準を満たすと決定することができる。こうした基準は、画像強度プロファイルが、画像領域内の暗さのスパイク増加、または画像領域内の強度のスパイク減少を有する、スパイク形状の強度プロファイルと一致し得る。こうした基準は、画像に現れる任意の物理的ギャップが、ギャップの外周に対して、ギャップの真ん中でより暗く見えることが予期される、ダークプライアを検出することと関連付けられ得る。 In an embodiment, a candidate edge is formed based on a boundary between a first image region and a second image region, and if the first image region is darker than the second image region, the computing system 1100 may determine that the first image region meets the defined darkness condition if it meets the defined spike intensity profile criteria. More specifically, the computing system may determine that the first image region (e.g., 5603 2 ) meets the defined spike intensity profile criteria if it has a particular shape to its image intensity profile (e.g., pixel intensity profile), such as a shape in _which the image intensity increases in darkness within the first image region toward a peak level of darkness at a location within the first image region, and then decreases in darkness. Such criteria may be consistent with a spike-shaped intensity profile, where the image intensity profile has a spike increase in darkness within the image region, or a spike decrease in intensity within the image region. Such criteria may be associated with detecting dark priors, where any physical gap that appears in an image is expected to appear darker in the middle of the gap relative to the perimeter of the gap.

図９Ｂは、定義されたスパイク強度プロファイル基準を満たすことができる、画像強度プロファイル９００１、またはより具体的には、ピクセル強度プロファイルを示す。より具体的には、画像強度プロファイルは、画像強度、すなわちより具体的には、ピクセル強度が、ピクセル位置などの画像位置の関数としてどのように変化するかを記述する情報を含み得る。実施形態によっては、画像強度プロファイルは、画像内の位置の関数として、画像強度の値を記述する曲線またはグラフによって表され得る。例えば、図９Ｂは、軸５６０９に沿った特定の方向におけるピクセル位置の関数として、画像強度の値、すなわちより具体的には、ピクセル強度値を記述する、曲線またはグラフを、画像強度プロファイル９００１として描写する。軸５６０９は、画像領域５６０３_２の幅寸法を横切って、それに沿って延在する軸であってもよく、軸５６０９に沿った方向は、軸５６０９に沿った特定の方向であり得る。図９Ｂの実施例では、幅寸法は、例えば、図５Ａの画像５６００の座標軸ｕと整列され得、軸５６０９に沿った方向は、その方向に沿ったピクセル座標［ｕ，ｖ］が、ｕの値が増加する、正の方向であり得る。 FIG. 9B illustrates an image intensity profile 9001, or more specifically, a pixel intensity profile, that may meet the defined spike intensity profile criteria. More specifically, the image intensity profile may include information describing how image intensity, or more specifically, pixel intensity, varies as a function of image location, such as pixel location. In some embodiments, the image intensity profile may be represented by a curve or graph describing the value of the image intensity as a function of location in the image. For example, FIG. 9B depicts as image intensity profile 9001 a curve or graph describing the value of the image intensity, or more specifically, pixel intensity value, as a function of pixel location in a particular direction along axis 5609. Axis 5609 may be an axis extending across and along a width dimension of image region 5603 ₂ , and the direction along axis 5609 may be a particular direction along axis 5609. In the example of FIG. 9B, the width dimension may be aligned, for example, with the coordinate axis u of image 5600 of FIG. 5A, and the direction along axis 5609 may be the positive direction along which pixel coordinates [u,v] increase in value of u.

実施形態では、計算システム１１００は、画像領域５６０３_２が、（ｉ）画像強度（例えば、ピクセル強度）が第一の方向（例えば、軸５６０９に沿った正の方向）に沿った位置の関数としての第一の画像領域内の暗さにおいて増加し、第一の画像領域内の位置ｕ_１で暗さのピークレベル（例えば、９００２）に到達する、第一のプロファイル部分（例えば、９０１１）と、その後に（ｉｉ）画像強度が、同じ方向（例えば、正の方向）に沿った位置の関数として、暗さのピークレベルから離れて、第一の画像領域内の暗さにおいて減少する、第二のプロファイル部分（例えば、９０１２）とを含む、ピクセル強度プロファイル（例えば、９００１）を有するかどうかを判定することによって、画像領域５６０３_２が定義されたスパイク強度プロファイル基準を満たすかどうかを判定することができる。図９Ｂの画像強度プロファイル９００１は、より具体的には、画像領域９５０３_２内の画像強度のスパイク減少を有するスパイク形状の強度プロファイルであり得る。 In an embodiment, the computing system 1100 can determine whether the image region 5603 ₂ meets the defined spike intensity profile criteria by determining whether the image region 5603 2 has a pixel intensity profile (e.g., 9001) that includes (i) a first profile portion (e.g., 9011) in which the image intensity (e.g., pixel intensity) increases in darkness in the first image region as a function of position along _a first direction (e.g., a positive direction along the axis 5609) and reaches a peak level of darkness (e.g., 9002) at a position u 1 in the first image region, followed by (ii) a second profile portion (e.g., 9012) in which the image intensity decreases in darkness in the first image region away from the peak level of darkness as a function of position along the same direction (e.g., a positive direction). The image intensity profile 9001 of FIG. _9B can more specifically be a spike-shaped intensity profile having a spike decrease in image intensity in the image region 9503 ₂ .

いくつかの実施では、暗さが増大する画像強度プロファイルは、画像強度が減少している値を有する画像強度プロファイルに対応し得る。例えば、画像（例えば、５６００）は、最小の可能なピクセル強度値（例えば、ゼロ）から、最大の可能なピクセル強度値（例えば、８ビットでコードされたピクセル強度値に対して２５５）の範囲内にあるピクセル強度値を有し得る。この実施例では、ピクセル強度値が低いと、明るさのレベルが低いため、暗さのレベルが高くてもよい、一方で、ピクセル強度値が高いと、明るさのレベルが高いため、暗さのレベルが低くてもよい。さらにこの実施例では、画像強度プロファイルの暗さのピークレベル（例えば、９００２）は、画像強度プロファイル（例えば、９００１）の最小画像強度値に対応し得る。 In some implementations, an image intensity profile of increasing darkness may correspond to an image intensity profile having values of decreasing image intensity. For example, an image (e.g., 5600) may have pixel intensity values that range from a minimum possible pixel intensity value (e.g., zero) to a maximum possible pixel intensity value (e.g., 255 for 8-bit coded pixel intensity values). In this example, lower pixel intensity values may have higher levels of darkness due to lower levels of brightness, while higher pixel intensity values may have lower levels of darkness due to higher levels of brightness. Further in this example, a peak level of darkness in the image intensity profile (e.g., 9002) may correspond to a minimum image intensity value in the image intensity profile (e.g., 9001).

上の実施例では、計算システム１１００は、画像強度プロファイルが、画像強度値（例えば、ピクセル強度値）が、最小画像強度値に向かって画像強度の減少を開始し、その後、最小画像強度値から離れて画像強度を増加させるように切り替わる形状を有するかどうかを判定することによって、画像領域が、定義されたスパイク強度プロファイル基準を満たすかどうかを判定し得る。例えば、図９Ｂの画像強度プロファイル９００１は、画像領域５６０３_２の幅寸法にわたって延在する一連のピクセルに対するそれぞれのピクセル強度値を記述し得る。計算システム１１００は、画像強度プロファイルが、それぞれのピクセル強度値が画像領域５６０３_２内の最小ピクセル強度値に向かって減少し、その後、最小ピクセル強度値から離れて画像領域５６０３_２内で増加するように切り替わる形状を有するかどうかを判定することによって、画像領域５６０３_２がスパイク強度プロファイル基準を満たすかどうかを判定することができる。この実施例では、最小ピクセル強度値は、画像強度プロファイル９００１における暗さのピークレベル９００２に対応し得る。 In the above example, the computing system 1100 may determine whether an image region meets the defined spike intensity profile criteria by determining whether the image intensity profile has a shape where the image intensity values (e.g., pixel intensity values) begin decreasing in image intensity towards a minimum image intensity value and then switch to increasing image intensity away from the minimum image intensity value. For example, the image intensity profile 9001 of FIG. 9B may describe respective pixel intensity values for a series of pixels extending across the width dimension of the image region 5603 _2. The computing system 1100 may determine whether the image region 5603 ₂ meets the spike intensity profile criteria by determining whether the image intensity profile has a shape where the respective pixel intensity values decrease towards a minimum pixel intensity value in the image region 5603 ₂ and then switch to increasing in image region 5603 ₂ away from the minimum pixel intensity value. In this example, the minimum pixel intensity value may correspond to a peak level of darkness 9002 in the image intensity profile 9001.

実施形態では、候補エッジが、第一の画像領域と第二の画像領域との間の境界に基づいて形成され、第一の画像領域が第二の画像領域よりも暗い場合、定義された暗さ閾値基準を満たすことは、定義された暗さ閾値との比較を伴い得る。こうした基準は、画像中に存在する任意の物理的ギャップが、外観上非常に暗いと予想される、ダークプライアを検出することに対応し得る。図９Ｃは、画像領域５６０３_２および直に隣接する画像領域に対する別の画像強度プロファイル９００３を示す。この実施例では、第一の画像領域は、第二のより明るい画像領域（例えば、５６０５_２）に直に隣接する画像領域５６０３_２であってもよく、一方で暗い画像領域５６０３_２は、第一の画像領域であり得る。上述のように、画像領域５６０３_２は、ピクセルのバンドを形成し得る。図９Ｃで、計算システム１１００は、画像領域５６０３_２が、定義された暗さ閾値τ_{ｄａｒｋ＿ｐｒｉｏｒ}よりも画像強度が暗い少なくとも一つの部分を有するかどうかを判定することによって、画像領域５６０３_２が定義された暗さ閾値基準を満たすかどうかを判定し得る。上述のように、より高いレベルの暗さは、一部の例では、より低い画像強度値に対応し得る。こうした実例では、計算システム１１００は、画像領域５６０３_２が、定義された暗さ閾値のτ_{ｄａｒｋ＿ｐｒｉｏｒ}よりも小さい画像強度値（例えば、ピクセル強度値）を有する画像強度プロファイルを有するかどうかを判定し得る。一部の状況では、計算システム１１００は、より具体的には、画像強度プロファイル９００３の最小強度値が、定義された暗さ閾値であるτ_{ｄａｒｋ＿ｐｒｉｏｒ}以下であるかを判定してもよく、最小強度値は、強度プロファイル９００３の暗さのピークレベル９００４に対応し得る。実施形態では、画像領域５６０３_２が画像強度プロファイル９００３を有する場合、定義された暗さ閾値基準および定義されたスパイク強度プロファイル基準の両方を満たすことができる。 In an embodiment, a candidate edge is formed based on a boundary between a first image region and a second image region, and if the first image region is darker than the second image region, meeting a defined darkness threshold criterion may involve a comparison with a defined darkness threshold. Such a criterion may correspond to detecting a dark prior, where any physical gaps present in the image are expected to be very dark in appearance. Figure 9C shows another image intensity profile 9003 for image region 5603 ₂ and an immediately adjacent image region. In this example, the first image region may be image region 5603 2 that is immediately adjacent to a second lighter image region (e.g., 5605 ₂ ), while the dark image region 5603 ₂ may be the first _{image region. As mentioned above, image region 5603 2} _may form a band of pixels. 9C, the computing system 1100 may determine whether the image region 5603 ₂ meets the defined darkness threshold criteria by determining whether the image region 5603 ₂ has at least one portion with an image intensity darker than the defined darkness threshold τ _{dark_prior} . As discussed above, a higher level of darkness may correspond to a lower image intensity value in some examples. In such an example, the computing system 1100 may determine whether the image region 5603 ₂ has an image intensity profile with image intensity values (e.g., pixel intensity values) that are less than the defined darkness threshold τ _{dark_prior} . In some circumstances, the computing system 1100 may more specifically determine whether the minimum intensity value of the image intensity profile 9003 is less than or equal to the defined darkness threshold τ _{dark_prior} , which may correspond to a peak level of darkness 9004 of the intensity profile 9003. In an embodiment, if an image region 5603 ₂ has an image intensity profile 9003, it may satisfy both the defined darkness threshold criteria and the defined spike intensity profile criteria.

実施形態では、計算システム１１００は、定義された暗さ閾値基準または定義されたスパイク強度プロファイル基準のうちの少なくとも一つが、上記の基準のうちのいずれか一つが、定義された暗さ条件を満たすために使用され得るように、満たされた場合、候補エッジおよび／または画像領域に対して定義された暗さ条件が満たされると判定し得る。実施形態では、計算システム１１００は、定義されたスパイク強度プロファイル基準が、（定義された暗さ閾値基準が満たされるかどうかに関係なく）満たされるという判定に応答してのみ、定義された暗さ閾値基準が、（定義されたスパイク強度プロファイル基準が満たされるかどうかに関係なく）満たされるという判定に応答してのみ、または、定義された暗さ閾値基準と定義されたスパイク強度プロファイル基準の両方が満たされるという判定に応答してのみ、定義された暗さ条件が、満たされると判定することができる。 In an embodiment, the computing system 1100 may determine that the defined darkness condition is met for a candidate edge and/or image region if at least one of the defined darkness threshold criteria or the defined spike intensity profile criteria is met, such that any one of the above criteria may be used to meet the defined darkness condition. In an embodiment, the computing system 1100 may determine that the defined darkness condition is met only in response to a determination that the defined spike intensity profile criteria is met (regardless of whether the defined darkness threshold criteria is met), only in response to a determination that the defined darkness threshold criteria is met (regardless of whether the defined spike intensity profile criteria is met), or only in response to a determination that both the defined darkness threshold criteria and the defined spike intensity profile criteria are met.

実施形態では、計算システム１１００は、２Ｄ画像５６００などの２Ｄ画像情報に基づいて、候補エッジ（例えば、５６０１_２）を識別し、候補エッジが定義された暗さ条件を満たすかどうかを判定し得る。上述のように、計算システム１１００が２Ｄ画像情報および３Ｄ画像情報の両方を受信する場合、計算システム１１００は、２Ｄ画像情報を使用して、３Ｄ画像情報の制限を補償するか、または３Ｄ画像情報の欠如を補償してもよく、その逆もまた可能である。例えば、カメラ（例えば、３２００Ｂ）が物体のグループを表すために３Ｄ画像情報を生成するとき、３Ｄ画像情報は、特に物体のグループがカメラに対して等しい奥行き値を有する場合、グループ内の個々の物体を区別するための情報を欠いてもよい。より具体的には、３Ｄ画像情報は、物体間の狭い物理的ギャップを検出するための情報を欠いてもよく、それゆえ、物理的ギャップに関連する物理的エッジを識別する際に限定的な有用性を有し得る。 In an embodiment, the computing system 1100 may identify candidate edges (e.g., 5601 ₂ ) based on 2D image information, such as 2D image 5600, and determine whether the candidate edges meet a defined darkness condition. As described above, when the computing system 1100 receives both 2D image information and 3D image information, the computing system 1100 may use the 2D image information to compensate for limitations of the 3D image information or compensate for the lack of 3D image information, or vice versa. For example, when a camera (e.g., 3200B) generates 3D image information to represent a group of objects, the 3D image information may lack information to distinguish individual objects in the group, especially when the group of objects has equal depth values relative to the camera. More specifically, the 3D image information may lack information to detect narrow physical gaps between objects and therefore may have limited usefulness in identifying physical edges associated with physical gaps.

実施例として、図９Ｄは、図５Ｂの３Ｄ画像情報５７００の部分５７１５に関連する奥行き値を示す。より具体的には、部分５７１５は、物体３５２０の上部表面上の物理的位置５７２０_ａ～５７２０_ａ＋５、および物体３５１０の上部表面上の位置５７１０_ｂ～５７１０_ｂ＋４に対するそれぞれの奥行き値を記述し得る。これらの物理的位置は、画像領域５６０３_２内またはその周りにある画像位置、または候補エッジ５６０１_２の周りにある画像位置に対してマッピングされてもよく、またはそうでなければそれに対応し得る。図８に示すように、画像領域５６０３_２は、物体３５１０、３５２０の間の物理的ギャップｇを表し得る。上述のように、計算システム１１００は、３Ｄ画像情報を使用して、奥行きの急激な変化がある一つまたは複数の位置を検出しようとし得る。しかし、図８の物理的ギャップは、３Ｄ画像情報５７００によって捕捉される３Ｄ画像情報の解像度に対して狭すぎるか、またはそうでなければ小さすぎてもよい。従って、図９Ｄの実施例では、計算システム１１００は、位置５７２０_ａ～５７２０_ａ＋５および５７１０_ｂ～５７１０_ｂ＋４に、奥行きの急激な変化がないと判定し、それゆえ、３Ｄ画像情報がそれらの位置でいずれの候補エッジも示さないと判定し得る。さらに、一部の状況では、３Ｄ画像情報は、位置５７２０_ａ～５７２０_ａ＋５および５７１０_ｂ～５７１０_ｂ＋４の一部について、またはより具体的には、候補エッジ５６０１_１に対応する一つまたは複数の位置について、奥行き情報を欠落し得る。一部の状況では、候補エッジ５６０１_１をマッピングするか、または他の方法で対応する３Ｄ画像情報の一部分は、非一時的コンピューター可読媒体１１２０に定義された値であり得る、定義されたノイズ許容閾値よりも大きい撮像ノイズのレベルによって影響され得る。上記の実施例では、２Ｄ画像情報は、物体３５１０の物理的エッジ３５１０Ｂを表す候補エッジ５６０１_１を含んでもよく、物体３５１０と物体３５２０との間の物理的ギャップを表さない画像領域５６０３_２を含んでもよいため、２Ｄ画像情報は、３Ｄ画像情報のこれらの制限を補償し得る。３Ｄ画像情報の限定を含む上記の実施例では、計算システム１１００は、物体のグループの物理的エッジ（例えば、３５１０Ｂ）のうちの一つを表すために、候補エッジ５６０１_２を使用するべきかどうかを判定するために、定義された暗さ条件をさらに使用し得る。 As an example, Figure 9D illustrates depth values associated with portion 5715 of 3D image information 5700 of Figure 5B. More specifically, portion 5715 may describe respective depth values for physical locations 5720a _through _5720a ₊₅ on the top surface of object 3520, and locations 5710b through 5710b ₊₄ on the top surface of object 3510. These physical locations may be mapped to or otherwise correspond to image locations within or around image region ₅₆₀₃₂ , or around candidate edge _56012. As shown in Figure 8, image region ₅₆₀₃₂ may represent a physical gap g between objects 3510, 3520. As discussed above, computing system 1100 may use the 3D image information to attempt to detect one or more locations where there is an abrupt change in depth. However, the physical gap of Figure 8 may be too narrow or otherwise too small for the resolution of the 3D image information captured by the 3D image information 5700. Thus, in the example of Figure 9D, the computing system 1100 may determine that there is no abrupt change in depth at locations _5720a through 5720a ₊₅ and _5710b through 5710b ₊₄ , and therefore determine that the 3D image information does not show any candidate edges at those locations. Furthermore, in some circumstances, the 3D image information may lack depth information for some of locations 5720a _through 5720a ₊₅ _and 5710b through 5710b ₊₄ , or more specifically, for one or more locations that correspond to the candidate edge _5601-1 . In some circumstances, a portion of the 3D image information that maps to or otherwise corresponds to the candidate edge _5601-1 may be affected by a level of imaging noise that is greater than a defined noise tolerance threshold, which may be a value defined in the non-transitory computer-readable medium 1120. In the above example, the 2D image information may compensate for these limitations of the 3D image information because the 2D image information may include candidate edge 5601 ₁ that represents a physical edge 3510B of object 3510, and may also include image regions 5603 ₂ that do not represent a physical gap between object 3510 and object 3520. In the above example involving limitations of the 3D image information, the computing system 1100 may further use the defined darkness condition to determine whether candidate edge 5601 ₂ should be used to represent one of the physical edges (e.g., 3510B) of the group of objects.

図１０Ａ～１０Ｃは、候補エッジ５６０１_４および／または画像領域５６０３_４が定義された暗さ条件を満たすかどうかを判定するための実施例を示す。画像領域５６０３_４は、物体３５１０の上部表面上に印刷された可視線などの、図３Ｂの視覚的マーキング３５１２を表し得る。この実施例では、画像領域５６０３_４は、画像領域５６０５_４および５６０７_４など、直に隣接している画像領域よりも暗くてもよい。候補エッジ５６０１_４は、暗い画像領域５６０３_４と直に隣接する領域５６０５_４との間の境界に基づいて形成され得る。 10A-10C show an example for determining whether a candidate edge 5601 ₄ and/or image region 5603 ₄ meets a defined darkness condition. Image region 5603 ₄ may represent visual marking 3512 of FIG. 3B, such as a visible line printed on the top surface of object 3510. In this example, image region 5603 ₄ may be darker than immediately adjacent image regions, such as image regions 5605 ₄ and 5607 _4. Candidate edge 5601 ₄ may be formed based on the boundary between dark image region 5603 ₄ and immediately adjacent region 5605 ₄ .

図１０Ｂは、画像強度プロファイル１０００１が、画像強度プロファイル１０００１が、暗さのピークレベルに向かって位置の関数として暗さが増大し、その後暗さのピークレベルから離れて暗さが減少する形で、画像領域５６０３_４内で変化しないため、定義されたスパイク強度プロファイル基準を満たさない画像強度プロファイル１０００１を有する画像領域５６０３_４を示す。この実施例の画像強度プロファイル１０００１は、画像領域５６０３_４の幅寸法と整列され得る、軸５６０８に沿ったピクセル位置の関数として、ピクセル強度値を記述し得る。上述のように、スパイク強度プロファイル基準は、物体の物理的エッジ間の物理的ギャップが、物理的ギャップの外周に対して、物理的ギャップの真ん中でより暗く見えてもよいという予期に対応し得る。従って、物理的ギャップを表す画像領域は、画像領域内の画像位置の関数として変化し、より具体的には、物理的ギャップの周辺に対応する画像位置から物理的ギャップの中央に対応する画像位置へ、特定の方向に沿った画像位置の関数として暗さが増加し、およびその後、同じ方向に沿った位置の関数として暗さが減少する、画像強度プロファイルを有し得る。より具体的には、画像強度プロファイルは、物理的ギャップを表す画像領域における暗さのスパイク増加または画像強度のスパイク減少を有し得る。実施形態では、代わりに視覚的線または他の視覚的マーキングを表す画像領域は、こうした画像強度プロファイルを欠いてもよく、代わりに、画像領域内により均一なレベルの暗さを有し得る。従って、図１０Ｂに示すように、視覚的マーキング３５１２の一部を表す画像領域５６０３_４は、プロファイル１０００１が、画像領域５６０３_４内で実質的に変化しないように、画像領域５６０３_４内でより均一である画像強度プロファイル１０００１を有し得る。さらに、画像強度プロファイル１０００１は、画像領域５６０３_４の境界で、より明るい隣接画像領域（例えば、５６０５_４）と関連付けられる画像強度から、画像領域５６０３_４内の一様な画像強度まで、画像強度の段差形状の変化を有し得る。従って、画像強度プロファイル１０００１は、画像領域５６０３_４を横断する特定の方向に沿って、暗さのピークレベルに向かって、暗さを増大させることによって始まり、その後、その方向に沿って暗さを減少することに切り替える形状を有しない。より具体的には、画像強度プロファイル１０００１は、画像強度のスパイク減少を示さない。従って、この実施例の計算システム１１００は、画像領域５６０３_４が、２Ｄ画像５６００が、候補エッジ５６０１_４および／または画像領域５６０３_４で定義された暗さ条件を満たさないとの判定をもたらし得る、定義されたスパイク強度プロファイル基準を満たさないと判定することができる。 10B illustrates image region 5603.4 having an image intensity profile 10001 that does not meet the defined spike intensity profile criteria because image intensity profile ₁₀₀₀₁ does not vary within image region 5603.4 in a manner that increases in darkness as a function of position toward a peak level of darkness and then decreases in darkness away from the peak level of darkness. Image intensity profile 10001 in this example may describe pixel intensity values as a function of pixel position along axis ₅₆₀₈ , which may be aligned with the width dimension of image region _5603.4 . As discussed above, the spike intensity profile criteria may correspond to an expectation that a physical gap between physical edges of objects may appear darker in the middle of the physical gap relative to the perimeter of the physical gap. Thus, an image region representing a physical gap may have an image intensity profile that varies as a function of image position within the image region, more specifically, increasing in darkness as a function of image position along a particular direction from image locations corresponding to the periphery of the physical gap to image locations corresponding to the center of the physical gap, and then decreasing in darkness as a function of position along the same direction. More specifically, the image intensity profile may have a spike of increasing darkness or a spike of decreasing image intensity in the image region representing the physical gap. In embodiments, an image region representing a visual line or other visual marking instead may lack such an image intensity profile, and instead have a more uniform level of darkness within the image region. Thus, as shown in FIG. 10B, an image region _{5603_4} representing a portion of a visual marking 3512 may have an image intensity profile 10001 that is more uniform within image region _{5603_4} , such that profile 10001 does not vary substantially within image region _{5603_4} . Additionally, image intensity profile 10001 may have a step-shaped change in image intensity at the boundaries of image region _{5603_4} from the image intensity associated with a brighter adjacent image region (e.g., _{5605_4} ) to a uniform image intensity within image region _{5603_4} . Thus, image intensity profile 10001 does not have a shape that begins by increasing darkness along a particular direction across image region _{5603_4} to a peak level of darkness and then switches to decreasing darkness along that direction. More specifically, image intensity profile 10001 does not exhibit a spike decrease in image intensity. Thus, computing system 1100 of this example may determine that image region _{5603_4} does not satisfy the defined spike intensity profile criteria, which may result in a determination that 2D image 5600 does not satisfy the defined darkness condition at candidate edge _{5601_4} and/or image region _{5603_4} .

図１０Ｃは、画像強度プロファイル１０００３が、画像領域５６０３_４が十分に暗くないことを示し得るため、定義された暗さ閾値基準を満たすことができない場合がある、画像強度プロファイル１０００３を有する、画像領域５６０３_４を描写する。より具体的には、計算システム１１００は、画像領域５６０３_４の画像強度プロファイル１０００３内のピクセル強度値のほとんどまたは全てが、定義された暗さ閾値のτ_{ｄａｒｋ＿ｐｒｉｏｒ}を超えると決定し得る。従って、図１０Ｃの計算システム１１００は、画像領域５６０３_４が、画像５６００が、候補エッジ５６０１_４および／または画像領域５６０３_４で定義された暗さ条件を満たさないとの判定をもたらし得る、定義された暗さ閾値基準を満たさないと判定し得る。画像強度プロファイル１０００３はまた、図１０Ｂについて上で論じたように、定義されたスパイク強度プロファイル基準を満たさなくてもよい。 10C depicts image region _{5603_4} having an image intensity profile 10003 that may fail to meet a defined darkness threshold criterion because image intensity profile 10003 may indicate that image region _{5603_4} is not dark enough. More specifically, computing system 1100 may determine that most or all of the pixel intensity values in image intensity profile 10003 of image region 5603_4 exceed a defined darkness threshold τ _{dark_prior} . Thus, computing system 1100 of FIG. _10C may determine that image region _{5603_4} does not meet a defined darkness threshold criterion, which may result in a determination that image 5600 does not meet a defined darkness condition at candidate edge _{5601_4} and/or image region _{5603_4} . Image intensity profile 10003 may also fail to meet a defined spike intensity profile criterion, as discussed above with respect to FIG. 10B.

実施形態では、画像領域は、画像領域が定義されたスパイク強度プロファイル基準を満たすかどうかの信頼できる評価を行うには小さすぎる幅を有し得る。例えば、画像領域は、単一のピクセルのみの幅、または数ピクセルのみの幅を有し得る。一部の実例では、計算システム１１００は、こうした画像領域が定義された暗さ条件を満たすものではないと判定し得る。他の例では、計算システム１１００は、画像領域が定義された暗さ閾値基準を満たすかどうかに基づいて、画像領域が定義された暗さ条件を満たすかどうかを判定し得る。一部の実例では、計算システム１１００は、定義された暗さ条件に関して、こうした画像領域または関連する候補エッジを評価しないことを決定し得る。 In embodiments, an image region may have a width that is too small to make a reliable assessment of whether the image region meets the defined spike intensity profile criteria. For example, the image region may be only a single pixel wide, or only a few pixels wide. In some instances, the computing system 1100 may determine that such an image region does not meet the defined darkness condition. In other instances, the computing system 1100 may determine whether the image region meets the defined darkness condition based on whether the image region meets the defined darkness threshold criteria. In some instances, the computing system 1100 may decide not to evaluate such an image region or associated candidate edges for the defined darkness condition.

上述のように、本開示の一態様は、計算システム１１００が、２Ｄ画像５６００などの、少なくとも２Ｄ画像情報に基づいて、複数の候補エッジを識別する状況に関する。こうした実施形態では、複数の候補エッジは、２Ｄ画像に基づいて識別される少なくとも第一の候補エッジ（例えば、５６０１_１／５６０１_２／５６０１_３／５６０１_４）を含み得る。例えば、第一の候補エッジは、２Ｄ画像の二つの画像領域の間の境界に基づいて形成され得る。一部の実例では、計算システム１１００は、２Ｄ画像情報および３Ｄ画像情報に基づいて、複数の候補エッジを識別し得る。こうした例では、複数の候補エッジは、上述のように、２Ｄ画像情報からの第一の候補エッジを含んでもよく、３Ｄ画像情報に基づいて識別される、第二の候補エッジ（例えば、図６Ｂの５７０１_１）をさらに含んでもよい。 As mentioned above, one aspect of the disclosure relates to situations where the computing system 1100 identifies a plurality of candidate edges based on at least 2D image information, such as the 2D image 5600. In such an embodiment, the plurality of candidate edges may include at least a first candidate edge (e.g., 5601 ₁ /5601 ₂ /5601 ₃ /5601 ₄ ) identified based on the 2D image. For example, the first candidate edge may be formed based on a boundary between two image regions of the 2D image. In some instances, the computing system 1100 may identify a plurality of candidate edges based on the 2D image information and the 3D image information. In such an example, the plurality of candidate edges may include a first candidate edge from the 2D image information, as described above, and may further include a second candidate edge (e.g., 5701 ₁ of FIG. 6B ) identified based on the 3D image information.

実施例として、図１１Ａ～１１Ｂは、３Ｄ画像情報５７００に基づいて、複数の候補エッジの第二の候補エッジとして、候補エッジ５７０１_１を識別する、計算システム１１００を示す。この実施例では、計算システム１１００は、３Ｄ画像情報５７００の第一の部分５７１０Ａと第二の部分５７５０Ａとの間の候補エッジ５７０１_１での奥行きの急激な変化を検出することに基づいて、候補エッジ５７０１_１を識別し得る。第一の部分５７１０Ａは、例えば、図３Ａ～３Ｃの物体３５１０の上部表面上の位置の領域を表し得る一方で、第二の部分５７５０Ａは、例えば、物体３５５０の上部表面上の位置の領域を表し得る。より具体的には、図１１Ｂに示すように、３Ｄ画像情報の第一の部分５７１０Ａは、物体３５１０の上部表面上にある位置５７１０_ｃ～５７１０_ｃ＋５に対するそれぞれの奥行き値を含んでもよく、一方で、第二の部分５７５０Ａは、物体３５５０の上部表面上にある位置５７５０_ｄ～５７５０_ｄ＋４に対するそれぞれの奥行き値を含んでもよい。この実施例では、位置５７１０_ｃ～５７１０_ｃ＋５および５７５０_ｄ～５７５０_ｄ＋４は、図１１ＡのＹ軸に沿って整列された一連の位置であり得る。 11A-11B illustrate computing system 1100 identifying candidate edge _5701.1 as a second candidate edge of a plurality of candidate edges based on 3D image information 5700. In this example, computing system 1100 may identify candidate edge _5701.1 based on detecting an abrupt change in depth at candidate edge _5701.1 between first portion 5710A and second portion 5750A of 3D image information 5700. First portion 5710A may represent, for example, a region of locations on a top surface of object 3510 of FIGS. 3A-3C, while second portion 5750A may represent, for example, a region of locations on a top surface of object 3550. More specifically, as shown in Figure 11B, a first portion 5710A of the 3D image information may include respective depth values for locations _5710c through 5710c ₊₅ on the top surface of the object 3510, while a second portion 5750A may include respective depth values for locations 5750d _through 5750d ₊₄ on the top surface of the object 3550. In this example, locations 5710c _through 5710c ₊₅ and _5750d through 5750d ₊₄ may be a series of locations aligned along the Y-axis in Figure 11A.

実施形態では、計算システム１１００は、３Ｄ画像情報５７００によって記述される一連の位置の二つの連続的な位置またはそうでなければ隣接する位置の間の奥行きの急激な変化を検出することに基づき、図１１Ｂの候補エッジ５７０１_１を識別し得る。こうした急激な変化は、奥行き不連続状態と呼んでもよい。急激な変化は、例えば、二つの位置のそれぞれの奥行き値間の差が、定義された奥行き差閾値を超えるときに検出され得る。例えば、計算システム１１００は、位置５７１０_ｃ＋５の奥行き値と位置５７５０_ｄの奥行き値との間の差が、定義された奥行き差閾値を超えると決定し得る。結果として、計算システム１１００は、これら二つの位置５７１０_ｃ＋５、５７５０_ｄに基づいて、候補エッジ５７０１_１を識別し得る。例えば、候補エッジ５７０１_１は、Ｙ軸上の位置５７１０_ｃ＋５、５７５０_ｄの間にある位置を含むように識別され得る。 In an embodiment, the computing system 1100 may identify the candidate edge 5701 1 of FIG. 11B based on detecting an abrupt change in depth between two consecutive or otherwise adjacent positions in the sequence of positions described by the 3D image information 5700. Such an abrupt change may be referred to as a depth discontinuity. The abrupt change may be detected, for example, when the difference between the respective depth values of the two positions exceeds a defined depth difference threshold. For example, _the computing system 1100 may determine that the difference between the depth value of the position 5710 _c+5 and the depth value of the position 5750 _d exceeds a defined depth difference threshold. As a result, the computing system 1100 may identify the candidate edge 5701 ₁ based on these two positions 5710 _c+5 , 5750 _d . For example, the candidate edge 5701 ₁ may be identified to include a position that is between the positions 5710 _c+5 , 5750 _d on the Y-axis.

実施形態では、計算システム１１００は、３Ｄ画像情報に基づいて、定義された奥行き差閾値を超える奥行き差を有する二つの表面を識別することによって、候補エッジを識別し得る。例えば、図１１Ｃに示すように、計算システム１１００は、３Ｄ画像情報５７００によって記述される位置の第一のセットに基づいて、物体のグループ３５１０～３５５０の第一の表面を識別してもよく、位置の第一のセットは、定義された測定分散閾値を超えて互いに逸脱しないそれぞれの奥行き値を有する。同様に、計算システム１１００は、３Ｄ画像情報５７００によって記述される位置の第二のセットに基づいて、物体のグループ３５１０～３５５０の第二の表面を識別してもよく、位置の第二のセットは、定義された測定分散閾値を超えて互いに逸脱しないそれぞれの奥行き値を有する。図１１Ｃの実施例では、位置の第一のセットは、物体５７１０の上部表面を表し得る位置５７１０_ｃ～５７１０_ｃ＋５を含んでもよく、一方で、位置の第二のセットは、物体５７５０の上部表面を表し得る位置５７５０_ｄ～５７５０_ｄ＋４を含んでもよい。 In an embodiment, the computing system 1100 may identify candidate edges by identifying two surfaces having a depth difference that exceeds a defined depth difference threshold based on the 3D image information. For example, as shown in FIG. 11C, the computing system 1100 may identify a first surface of the group of objects 3510-3550 based on a first set of locations described by the 3D image information 5700, the first set of locations having respective depth values that do not deviate from each other by more than a defined measurement variance threshold. Similarly, the computing system 1100 may identify a second surface of the group of objects 3510-3550 based on a second set of locations described by the 3D image information 5700, the second set of locations having respective depth values that do not deviate from each other by more than a defined measurement variance threshold. In the example of FIG. 11C , the first set of positions may include positions 5710 _c through 5710 _c+5 , which may represent the top surface of object 5710, while the second set of positions may include positions 5750 _d through 5750 _d+4 , which may represent the top surface of object 5750.

本実施形態では、定義された測定分散閾値は、撮像ノイズ、製造公差、またはカメラ（例えば、３２００Ｂ）によって計測された奥行き測定値にランダムな変動をもたらし得る他の要因の影響を記述し得る。ランダム変動のこうした源は、たとえさまざまな位置が共通表面の一部であり、実際にカメラに対して同じ奥行きを有するとしても、さまざまな位置の奥行き値に何らかの自然分散をもたらす。一部の実例では、定義された測定分散閾値は、奥行き測定値における予想されるランダム変動、またはより広くは、カメラがノイズまたは他の誤差源に対してどの程度感受性であるかを記述するために使用される、公称標準偏差に等しいか、またはそれに基づいてもよい。公称標準偏差は、カメラによって生成される奥行き値または他の奥行き情報に予想される、ベースライン標準偏差または他の形態の分散を記述し得る。公称標準偏差、またはより広くは、定義された測定分散閾値は、例えば、非一時的コンピューター可読媒体１１２０に格納される値であってもよく、所定の値または動的に定義された値であり得る。実施形態では、位置のセットが、定義された測定分散閾値を超えて互いに逸脱しないそれぞれの奥行き値を有する場合、計算システム１１００は、位置のセットが共通表面の一部であると決定し得る。より具体的な実施形態では、計算システム１１００は、それらのそれぞれの奥行き値の標準偏差（例えば、Ｓｔｄ_５７１０またはＳｔｄ_５７５０）が、定義された測定分散閾値よりも小さい場合、位置のセットが共通表面の一部であると決定し得る。 In this embodiment, the defined measurement variance threshold may describe the effects of imaging noise, manufacturing tolerances, or other factors that may result in random variations in the depth measurements taken by the camera (e.g., 3200B). Such sources of random variation result in some natural variation in the depth values of various locations, even if the various locations are part of a common surface and actually have the same depth relative to the camera. In some instances, the defined measurement variance threshold may be equal to or based on a nominal standard deviation, which is used to describe the expected random variation in the depth measurements, or more broadly, how susceptible the camera is to noise or other error sources. The nominal standard deviation may describe a baseline standard deviation or other form of variance expected in the depth values or other depth information generated by the camera. The nominal standard deviation, or more broadly, the defined measurement variance threshold, may be a value stored, for example, in the non-transitory computer-readable medium 1120, and may be a predetermined value or a dynamically defined value. In an embodiment, the computing system 1100 may determine that a set of locations are part of a common surface if the set of locations have respective depth values that do not deviate from one another by more than a defined measurement variance threshold. In a more specific embodiment, the computing system 1100 may determine that a set of locations are part of a common surface if the standard deviation (e.g., Std ₅₇₁₀ or Std ₅₇₅₀ ) of their respective depth values is less than a defined measurement variance threshold.

上の実施形態では、計算システム１１００は、十分な奥行き差を有する二つの表面に基づいて３Ｄ画像情報から候補エッジを識別し得る。例えば、物体５７１０の上部表面を記述する図１１Ｃの位置の第一のセットは、平均奥行き値Ａｖｇ_５７１０を有するか、または他の方法でそれに関連付けることができる。同様に、物体５７５０の上部表面を記述する位置の第二のセットは、平均奥行き値Ａｖｇ_５７５０を有するか、または他の方法でそれに関連付けることができる。計算システム１１００は、Ａｖｇ_５７１０とＡｖｇ_５７５０との間の差が、定義された奥行き差閾値以上であるかを判定し得る。一部の実例では、定義された奥行き差閾値は、定義された測定分散閾値の倍数（例えば、定義された測定分散閾値の２倍、または定義された測定分散閾値の５倍）として決定され得る。二つの表面に関連付けられるＡｖｇ_５７１０とＡｖｇ_５７５０との間の差が、定義された奥行き差閾値以上である場合、計算システム１１００は、奥行き不連続状態が満たされると判断し得る。より具体的には、計算システム１１００は、候補エッジ（例えば、５７０１_１）が、二つの表面の間の位置に存在していると判定してもよく、より具体的には、二つの表面間の遷移がある位置に基づいて、候補エッジを識別し得る。 In the above embodiment, the computing system 1100 may identify candidate edges from the 3D image information based on two surfaces having sufficient depth difference. For example, the first set of locations in FIG. 11C describing the top surface of the object 5710 may have or be otherwise associated with an average depth value Avg _5710. Similarly, the second set of locations describing the top surface of the object ₅₇₅₀ may have or be otherwise associated with an average depth value Avg 5750. The computing system 1100 may determine whether the difference between Avg ₅₇₁₀ and Avg ₅₇₅₀ is greater than or equal to a defined depth difference threshold. In some instances, the defined depth difference threshold may be determined as a multiple of a defined measurement variance threshold (e.g., twice the defined measurement variance threshold or five times the defined measurement variance threshold). If the difference between Avg ₅₇₁₀ and Avg ₅₇₅₀ associated with the two surfaces is greater than or equal to a defined depth difference threshold, then computing system 1100 may determine that a depth discontinuity condition is met. More specifically, computing system 1100 may determine that a candidate edge (e.g., 5701 ₁ ) exists at a location between the two surfaces, and more specifically may identify the candidate edge based on the location at which the transition between the two surfaces is located.

上述のように、本開示の一態様は、３Ｄ画像情報が、２Ｄ画像情報の限定を補償し得る（およびその逆）ように、２Ｄ画像情報および３Ｄ画像情報を使用して互いを補償することに関する。実例によっては、３Ｄ画像情報から検出された物理的エッジは、２Ｄ画像情報からのみ検出された物理的エッジよりも高いレベルの信頼と関連付けられ得る。場合によっては、物理的エッジ（例えば、図３Ｂの３５１０Ａ）が２Ｄ画像情報および３Ｄ画像情報の両方で表される場合、計算システム１１００は、２Ｄ画像情報の物理的エッジを表す候補エッジ（例えば、５６０１_５）を識別し、３Ｄ画像情報の物理的エッジを表す対応する候補エッジ（例えば、５７０１_１）を識別することができる。上述のように、対応する候補エッジは、互いにマッピングし得る。例えば、３Ｄ画像情報（例えば、５７０１_１）中の候補エッジは、２Ｄ画像情報中の候補エッジ（例えば、５６０１_５）にマッピングされ得る。候補エッジ（例えば、５６０１_５）は、５６０５_５および５６５０などの二つの画像領域の間の境界に基づいて形成され得る。しかしながら、一部の状況では、計算システム１１００は、２Ｄ画像情報からの候補エッジ（例えば、５６０１_５）が実際の物理的エッジに対応するかどうかを、高い信頼レベルで決定することができなくてもよい。例えば、図１１Ｄでは、２Ｄ画像５６００は、候補エッジ５６０１_５で画像強度の段差形状の変化を有し得る。この実施例では、計算システム１１００は、２Ｄ画像５６００が、候補エッジ５６０１_５で、またはより具体的には、二つの画像領域５６０５_５および５６５０で、定義された暗さ条件を満たさないと決定し得る。従って、計算システム１１００は、物理的エッジを表す候補エッジ５６０１_５と関連付けられる十分に高い信頼レベルがないと決定し得る。 As mentioned above, one aspect of the present disclosure relates to using 2D image information and 3D image information to compensate for each other, such that the 3D image information can compensate for the limitations of the 2D image information (and vice versa). In some instances, a physical edge detected from the 3D image information can be associated with a higher level of confidence than a physical edge detected only from the 2D image information. In some instances, when a physical edge (e.g., 3510A in FIG. 3B ) is represented in both the 2D image information and the 3D image information, the computing system 1100 can identify a candidate edge (e.g., 5601 ₅ ) that represents a physical edge of the 2D image information and identify a corresponding candidate edge (e.g., 5701 ₁ ) that represents a physical edge of the 3D image information. As mentioned above, corresponding candidate edges can map to each other. For example, a candidate edge in the 3D image information (e.g., 5701 ₁ ) can be mapped to a candidate edge in the 2D image information (e.g., 5601 ₅ ). A candidate edge (e.g., 5601 ₅ ) may be formed based on a boundary between two image regions, such as 5605 ₅ and 5650. However, in some circumstances, the computing system 1100 may not be able to determine with a high level of confidence whether a candidate edge (e.g., 5601 ₅ ) from the 2D image information corresponds to an actual physical edge. For example, in FIG. 11D , the 2D image 5600 may have a step-shaped change in image intensity at the candidate edge 5601 _5. In this example, the computing system 1100 may determine that the 2D image 5600 does not meet a defined darkness condition at the candidate edge 5601 ₅ , or more specifically, at the two image regions 5605 ₅ and 5650. Thus, the computing system 1100 may determine that there is not a sufficiently high level of confidence associated with the candidate edge 5601 ₅ that represents a physical edge.

こうした状況では、計算システム１１００は、３Ｄ画像情報を使用して、追加の入力を提供し得る。より具体的には、計算システム１１００は、図１１Ａ～１１Ｃに関して上述されたように、３Ｄ画像情報に基づき、候補エッジ５７０１_１を識別してもよく、２Ｄ画像情報の中の候補エッジ５６０１_５が、３Ｄ画像情報の中の候補エッジ５７０１_１にマッピングされてもよく、または他の方法でそれに対応し得る。実例によっては、候補エッジ５７０１_１が奥行き情報に基づいて識別されるので、計算システム１１００は、物理的エッジ、すなわち、図３Ｂの物理的エッジ３５１０Ａを表す候補エッジ５７０１_１が存在する可能性が十分高いと判定し得る。従って、２Ｄ画像情報は、物理的エッジ３５１０Ａの検出につながらない、または物理的エッジ３５１０Ａの検出に低い信頼レベルでつながり得る一方、３Ｄ画像情報は、計算システム１１００によって、より高い信頼レベルで物理的エッジ３５１０Ａを検出するために使用され得る。 In such a situation, the computing system 1100 may use the 3D image information to provide additional input. More specifically, the computing system 1100 may identify the candidate edge 5701 ₁ based on the 3D image information as described above with respect to FIGS. 11A-11C, and the candidate edge 5601 ₅ in the 2D image information may be mapped to or otherwise correspond to the candidate edge 5701 ₁ in the 3D image information. In some instances, since the candidate edge 5701 ₁ is identified based on the depth information, the computing system 1100 may determine that there is a sufficiently high probability that a physical edge, i.e., the candidate edge 5701 ₁ representing the physical edge 3510A of FIG. 3B, exists. Thus, the 2D image information may not lead to the detection of the physical edge 3510A or may lead to the detection of the physical edge 3510A with a low confidence level, while the 3D image information may be used by the computing system 1100 to detect the physical edge 3510A with a higher confidence level.

図４に戻ると、方法４０００は、一実施形態において、計算システム１１００が、複数の候補エッジ（例えば、複数の候補エッジ５６０１_１～５６０１_ｎのサブセット）のサブセットを選択して、物体のグループ（例えば、３５１０～３５４０）の物理的エッジを表すための候補エッジの選択されたサブセットを形成し得る、ステップ４００８を含み得る。実施形態では、このステップは、サブセットから、それぞれが偽エッジである可能性が高い一つまたは複数の候補エッジを除外することを伴い得る。偽エッジである可能性が高い一つまたは複数の候補エッジは、候補エッジのサブセットから除去されてもよく、またはより一般的には、物体のグループ（例えば、３５１０～３５４０）の物理的エッジを表すためのさらなる検討から無視され得る。一実施例では、計算システム１１００は、複数の候補エッジから除去する候補エッジ（複数可）を決定することによって、複数の候補エッジのサブセットを選択してもよく、複数の候補エッジは、除去された後、結果として生じるサブセットを形成する。一実施例では、複数の候補エッジが、図２Ｃのエッジ検出情報１１２６によって表される、または記述される場合、候補エッジを除去することは、その候補エッジに関する情報をエッジ検出情報１１２６から削除することを伴い得る。 Returning to FIG. 4, method 4000 may include step 4008, in which in one embodiment, computing system 1100 may select a subset of the plurality of candidate edges (e.g., a subset of the plurality of candidate edges 5601 ₁ -5601 _n ) to form a selected subset of candidate edges for representing a physical edge of the group of objects (e.g., 3510-3540). In an embodiment, this step may involve excluding from the subset one or more candidate edges, each of which is likely to be a false edge. The one or more candidate edges that are likely to be false edges may be removed from the subset of candidate edges, or more generally, may be ignored from further consideration for representing a physical edge of the group of objects (e.g., 3510-3540). In one example, computing system 1100 may select the subset of the plurality of candidate edges by determining which candidate edge(s) to remove from the plurality of candidate edges, which, after removal, form the resulting subset. In one example, when multiple candidate edges are represented or described by the edge detection information 1126 of FIG. 2C , removing a candidate edge may involve deleting information about that candidate edge from the edge detection information 1126.

上述のように、複数の候補エッジ（例えば、５６０１_１～５６０１_ｎ、または５６０１_１～５６０１_ｎおよび５７０１_１～５７０１_ｎ）は、第一の画像領域と第一の画像領域よりも暗い第二の画像領域との間の境界に基づいて形成される、少なくとも第一の候補エッジ（例えば、５６０１_１または５６０１_４）を含み得る。さらに、第一の候補エッジは、２Ｄ画像情報から識別され得る。実施形態では、ステップ４００８は、サブセット（候補エッジのサブセットとも呼ばれる）に第一の候補エッジを含むかどうかを決定することを伴い得る。サブセットに第一の候補エッジ（例えば、５６０１_１）を含むことで、第一の候補エッジを使用して、カメラ視野内の物体のグループの少なくとも一つの物理的エッジ（例えば、３５１０Ｂ）を表すことが可能になり得る。より具体的には、第一の候補エッジ（例えば、５６０１_１）がサブセットに含まれる場合、このような包含は、第一の候補エッジ（例えば、５６０１_１）が、物体のグループの物理的エッジの少なくとも一つを表すために考慮されたままである候補であるという表示であり得る。言い換えれば、計算システム１１００は、少なくとも一つの物理的エッジを表すための候補として第一の候補エッジを保持するかどうかを決定し得る。計算システム１１００が、そのような候補として第一の候補エッジを保持すると決定する場合、それは、サブセット（候補エッジの選択されたサブセットとも呼ぶことができる）内に第一の候補エッジを含み得る。この決定は、複数の候補エッジのサブセットを選択するステップの一部であってもよく、画像が第一の候補エッジで定義された暗さ条件を満たすかどうかに基づいて行われてもよい。実例によっては、サブセット内の第一の候補エッジの包含は、第一の候補エッジが偽エッジである可能性が充分に低いという表示であり得る。場合によっては、サブセットに第一の候補エッジを含めることは、計算システム１１００が、物体のグループの少なくとも一つの物理的エッジを表すために第一の候補エッジを使用するか、または物体のグループの少なくとも一つの物理的エッジを表すために少なくとも第一の候補エッジを考慮し続けるように、第一の候補エッジ（例えば、５６０１_１）が、物体のグループの実際の物理的エッジに対応する十分に高い信頼レベルを有することを示し得る。計算システム１１００が、第一の候補エッジ（例えば、５６０１_４）がフィルターリングされるか、またはそうでなければサブセットから除外されるように、第一の候補エッジ（例えば、５６０１_４）をサブセットに含めないと決定する場合、こうした除外は、第一の候補エッジ（例えば、５６０１_４）が、物体のグループの物理的エッジの少なくとも一つを表すための候補ではなくなったという表示であり得る。実例によっては、サブセットからの第一の候補エッジの除外は、第一の候補エッジ（例えば、５６０１_４）が偽エッジである可能性が高いという表示であり得る。 As discussed above, the plurality of candidate edges (e.g., 5601 ₁ -5601 _n , or 5601 ₁ -5601 _n and 5701 ₁ -5701 _n ) may include at least a first candidate edge (e.g., 5601 ₁ or 5601 ₄ ) formed based on a boundary between a first image region and a second image region that is darker than the first image region. Additionally, the first candidate edge may be identified from the 2D image information. In an embodiment, step 4008 may involve determining whether to include the first candidate edge in the subset (also referred to as a subset of candidate edges). Including the first candidate edge (e.g., 5601 ₁ ) in the subset may enable the first candidate edge to be used to represent at least one physical edge (e.g., 3510B) of a group of objects in the camera field of view. More specifically, if the first candidate edge (e.g., 5601 ₁ ) is included in the subset, such inclusion may be an indication that the first candidate edge (e.g., 5601 ₁ ) is a candidate that remains under consideration for representing at least one of the physical edges of the group of objects. In other words, the computing system 1100 may determine whether to retain the first candidate edge as a candidate for representing at least one physical edge. If the computing system 1100 determines to retain the first candidate edge as such a candidate, it may include the first candidate edge in the subset (which may also be referred to as a selected subset of the candidate edges). This determination may be part of the step of selecting a subset of the multiple candidate edges, and may be made based on whether the image satisfies a darkness condition defined by the first candidate edge. In some instances, the inclusion of the first candidate edge in the subset may be an indication that the first candidate edge is sufficiently unlikely to be a false edge. In some cases, including the first candidate edge in the subset may indicate that the first candidate edge (e.g., 5601 1 ) has a sufficiently high confidence level that it corresponds to an actual physical edge of the group of objects such that the computing system 1100 continues to use the first candidate edge to represent at least one physical edge of the group of objects or to consider at least the first candidate edge to represent at least one physical edge of the group of objects. If the computing system 1100 determines not to include the first candidate edge (e.g., 5601 ₄ ) in the subset such that the first candidate edge (e.g., 5601 ₄ ) is filtered or otherwise removed from the subset, such removal may be an indication that the first candidate edge (e.g., 5601 ₄ ) is no longer a candidate for representing at least one physical edge of the group of objects. In some instances, removal _of the first candidate edge from the subset may be an indication that the first candidate edge (e.g., 5601 ₄ ) is likely to be a false edge.

実施形態では、候補エッジの選択されたサブセットに第一の候補エッジを含むかどうかの判定は、上述のように、画像情報（例えば、５６００）が第一の候補エッジで定義された暗さ条件を満たすかどうかに基づいてもよい。いくつかの実施では、画像情報が第一の候補エッジで定義された暗さ条件を満たす場合、こうした結果は、第一の候補エッジが偽エッジである可能性が充分に低いことを示し得る。なぜなら、このような状況における第一の候補エッジは、二つの物体間の物理的ギャップを表す画像領域と関連付けられる可能性が高いからである。従って、第一の候補エッジは、物理的ギャップの一方の側面を形成する物理的エッジを表し得る。こうした状況では、計算システム１１００は、選択されたサブセットに第一の候補エッジを含むように決定し得る。一部の実例では、画像情報が第一の候補エッジで定義された暗さ条件を満たさない場合、計算システム１１００は、第一の候補エッジを選択されたサブセットに含めないように決定し得る。一部の実例では、計算システムが、２Ｄ画像情報が第一の候補エッジで定義された暗さ条件を満たさないと判断した場合、計算システム１１００は、３Ｄ画像情報を使用して第一の候補エッジをさらに評価し得る。例えば、計算システム１１００が、２Ｄ画像５６００が候補エッジ５６０１_５で定義された暗さ条件を満たさないと決定した場合、計算システム１１００は、その候補エッジ５６０１_５が３Ｄ画像情報によって記述された候補エッジ５７０１_１にマッピングされるかどうか、および３Ｄ画像情報の候補エッジ５７０１_１が、図１１Ａ～１１Ｄに関して上で論じたように、定義された奥行き差閾値よりも大きい奥行き変化を示すかどうかを決定することができる。 In an embodiment, the determination of whether to include the first candidate edge in the selected subset of candidate edges may be based on whether the image information (e.g., 5600) satisfies the darkness condition defined by the first candidate edge, as described above. In some implementations, if the image information satisfies the darkness condition defined by the first candidate edge, such a result may indicate that the first candidate edge is sufficiently unlikely to be a false edge, because the first candidate edge in such a situation is likely to be associated with an image region that represents a physical gap between two objects. Thus, the first candidate edge may represent a physical edge that forms one side of the physical gap. In such a situation, the computing system 1100 may determine to include the first candidate edge in the selected subset. In some instances, if the image information does not satisfy the darkness condition defined by the first candidate edge, the computing system 1100 may determine not to include the first candidate edge in the selected subset. In some instances, if the computing system determines that the 2D image information does not satisfy the darkness condition defined by the first candidate edge, the computing system 1100 may further evaluate the first candidate edge using the 3D image information. For example, if the computing system 1100 determines that the 2D image 5600 does not meet the darkness condition defined by the candidate edge 5601 ₅ , the computing system 1100 can determine whether that candidate edge 5601 _{5 maps to a candidate edge 5701 1} _described by the 3D image information, and whether the candidate edge 5701 ₁ of the 3D image information exhibits a depth change greater than a defined depth difference threshold, as discussed above with respect to FIGS. 11A-11D.

実施形態では、方法４０００は、ステップ４００６および／または４００８を複数回（例えば、複数の反復を介して）実行して、画像情報が複数の候補エッジで定義された暗さ条件を満たすかどうかを決定し、これらの決定に基づいて上で論じたサブセットを選択し得る。実施例として、複数の候補エッジが、少なくとも候補エッジ５６０１_１～５６０１_ｎを含む場合、計算システム１１００は、ステップ４００６を複数回実行して、２Ｄ画像５６００が、例えば、候補エッジ５６０１_１～５６０１_ｎで、定義された暗さ条件を満たすかどうかを判定し得る。計算システム１１００はさらに、ステップ４００８を複数回実行して、これらの候補エッジのうちどれがサブセットに含まれ、物理的エッジを表すための候補のままであるかを、およびこれらの候補エッジのうちどれがサブセットから除外され、従って物理的エッジを表すための候補ではなくなるかを決定し得る。例えば、計算システム１１００は、２Ｄ画像５６００がそれらの候補エッジで定義された暗さ条件を満たすため、サブセットが候補エッジ５６０１_１および５６０１_２を含むこと、および２Ｄ画像がそれらの候補エッジで定義された暗さ条件を満たさないため、サブセットが候補エッジ５６０１_３および５６０１_４を含まないことを決定し得る。一部の状況では、計算システム１１００は、２Ｄ画像５６００が候補エッジで定義された暗さ条件を満たすことができないため、候補エッジ５６０１_５をサブセットに含めないと判断し得る。一部の状況では、計算システム１１００は、候補エッジ５６０１_５が、奥行き差閾値を超える奥行き変化を示す３Ｄ画像情報の候補エッジ５７０１_１にマッピングされる場合、候補エッジ５６０１_５をサブセットに依然として含むように決定し得る。 In an embodiment, method 4000 may perform steps 4006 and/or 4008 multiple times (e.g., via multiple iterations) to determine whether the image information satisfies the defined darkness condition at multiple candidate edges and select the subset discussed above based on these determinations. As an example, if the multiple candidate edges include at least candidate edges 5601 ₁ -5601 _n , computing system 1100 may perform step 4006 multiple times to determine whether 2D image 5600 satisfies the defined darkness condition at, for example, candidate edges 5601 ₁ -5601 _n . Computing system 1100 may further perform step 4008 multiple times to determine which of these candidate edges are included in the subset and remain candidates for representing a physical edge, and which of these candidate edges are excluded from the subset and therefore are no longer candidates for representing a physical edge. For example, the computing system 1100 may determine that the subset includes candidate edges 5601 ₁ and 5601 ₂ because the 2D image 5600 meets the darkness conditions defined at those candidate edges, and that the subset does not include candidate edges 5601 ₃ and 5601 ₄ because the 2D image does not meet the darkness conditions defined at those candidate edges. In some circumstances, the computing system 1100 may determine not to include candidate edge 5601 ₅ in the subset because the 2D image 5600 cannot meet the darkness conditions defined at the candidate edges. In some circumstances, the computing system 1100 may determine to still include candidate edge 5601 ₅ in the subset if candidate edge 5601 ₅ maps to candidate edge 5701 ₁ of the 3D image information that exhibits a depth change that exceeds a depth difference threshold.

実施形態では、方法４０００は、計算システム１１００がロボット相互作用移動コマンドを出力するステップを含み得る。ロボット相互作用移動コマンドは、ロボット（例えば、３３００）と、物体のグループ（例えば、３５１０～３５５０）の少なくとも一つの物体との間のロボット相互作用に使用され得る。ロボット相互作用は、例えば、ロボット（例えば、３３００）がパレットから物体（例えば、箱）をピックアップし、物体を目的地の位置に動かす、パレットから降ろす動作または他の動作を行うことを伴い得る。 In an embodiment, method 4000 may include a step in which computing system 1100 outputs a robot interaction move command. The robot interaction move command may be used for robot interaction between a robot (e.g., 3300) and at least one object of a group of objects (e.g., 3510-3550). The robot interaction may involve, for example, the robot (e.g., 3300) picking up an object (e.g., a box) from a pallet, moving the object to a destination location, unloading the object from the pallet, or performing other actions.

実施形態では、ロボット相互作用移動コマンドは、ステップ４００８の候補エッジの選択されたサブセットに基づいて生成され得る。例えば、計算システム１１００は、候補エッジの選択されたサブセットを使用して、画像情報によって記述される物体のグループの中から個々の物体を区別し得る。一部の実例では、計算システム１１００は、選択されたサブセットを使用して、画像情報のセグメンテーションを実行し得る。例えば、画像情報が点群を含む場合、計算システムは、候補エッジの選択されたサブセットを使用して、物体のグループの中で個々の物体に対応する点群の一部分を識別することを伴い得る、点群セグメンテーションを行い得る。点群セグメンテーションは、米国特許出願第１６／７９１，０２４号（弁理士整理番号ＭＪ００４９－ＵＳ／００７７－０００９ＵＳ１）、その全体が参照により本明細書に組み込まれる。一実施例では、画像情報が２Ｄ画像情報を含む場合、計算システム１１００は、候補エッジの選択されたサブセットを使用して、物体のグループの中から個々の物体に対応する２Ｄ画像情報の一部分を分離し得る。分離された部分は、例えば、物体認識操作または物体登録操作（例えば、モジュール１１２８による）を実施するために使用される、ターゲット画像またはターゲット画像部分として使用され得る。物体登録および物体認識については、米国特許出願第１６／９９１，４６６号（弁理士整理番号ＭＪ００５４－ＵＳ／００７７－００１２ＵＳ１）、および米国特許出願第１７／１９３，２５３号（弁理士整理番号ＭＪ００６０－ＵＳ／００７７－００１７ＵＳ１）でより詳細に論じられ、それらの内容全体が参照により本明細書に組み込まれる。こうした実施例では、ロボット相互作用移動コマンドは、物体認識操作または物体登録操作の結果に基づいて生成され得る。例えば、物体認識操作は、画像情報によって、またはその一部によって、どの物体または物体タイプを表しているかの推定であり得る、検出仮説を生成し得る。一部の実例では、検出仮説は、例えば、物体３５１０～３５４０の一方の物理的構造を記述する情報を含み得る、物体認識テンプレートと関連付けられ得る。この情報は、計算システム１１００によって使用され、物体（例えば、モジュール１１２９を介して）を取り出して移動するためのロボット（例えば、３３００）の移動を計画し得る。 In an embodiment, the robot interaction movement command may be generated based on the selected subset of the candidate edges of step 4008. For example, the computing system 1100 may use the selected subset of the candidate edges to distinguish individual objects from among a group of objects described by the image information. In some instances, the computing system 1100 may use the selected subset to perform segmentation of the image information. For example, if the image information includes a point cloud, the computing system may use the selected subset of the candidate edges to perform point cloud segmentation, which may involve identifying portions of the point cloud that correspond to individual objects from among a group of objects. Point cloud segmentation is described in U.S. Patent Application Serial No. 16/791,024 (Attorney Docket No. MJ0049-US/0077-0009US1), the entirety of which is incorporated herein by reference. In one example, if the image information includes 2D image information, the computing system 1100 may use the selected subset of the candidate edges to separate portions of the 2D image information that correspond to individual objects from among a group of objects. The separated portions may be used, for example, as target images or target image portions used to perform object recognition or object registration operations (e.g., by module 1128). Object registration and object recognition are discussed in more detail in U.S. Patent Application No. 16/991,466 (Attorney Docket No. MJ0054-US/0077-0012US1) and U.S. Patent Application No. 17/193,253 (Attorney Docket No. MJ0060-US/0077-0017US1), the entire contents of which are incorporated herein by reference. In such examples, robot interaction movement commands may be generated based on the results of the object recognition or object registration operations. For example, the object recognition operation may generate a detection hypothesis, which may be an estimate of which object or object type is represented by the image information, or a portion thereof. In some instances, the detection hypothesis may be associated with an object recognition template, which may include information describing the physical structure of one of the objects 3510-3540, for example. This information may be used by the computing system 1100 to plan the movements of a robot (e.g., 3300) to retrieve and move objects (e.g., via module 1129).

方法４０００の上記のステップが、図３Ａ～３Ｃの物体３５１０～３５５０に関し示される一方で、図１２Ａは、物体１２５１０に関する上記のステップを示す一方で、図１３Ａは、物体１３５１０～１３５２０に関する上記のステップを示す。実施形態では、図１２Ａの物体１２５１０は、第二の、直に隣接する物理的領域１２５１４よりも暗い第一の物理的領域１２５１２を有する上部表面を有する箱であり得る。例えば、第一の物理的領域１２５１２は、第二の物理的領域１２５１４と比較して、その上により多くのインクを印刷し得る。物体１２５１０は、箱が配置されるパレットであり得る、物体１２５２０上に配置され得る。図１２Ｂは、物体１２５１０を表すために生成され得る２Ｄ画像１２６００を示す。より具体的には、２Ｄ画像１２６００は、第一の物理的領域１２５１２を表す第一の画像領域１２６０３を含んでもよく、第二の物理的領域１２５１４を表す第二の画像領域１２６０５を含んでもよい。本実施形態の計算システム１１００は、第一の画像領域１２６０３と第二の画像領域１２６０５との間の境界に基づいて、第一の候補エッジ１２６０１_１を識別し得る。 While the above steps of method 4000 are illustrated with respect to objects 3510-3550 in Figures 3A-3C, Figure 12A illustrates the above steps with respect to object 12510, while Figure 13A illustrates the above steps with respect to objects 13510-13520. In an embodiment, object 12510 in Figure 12A may be a box having a top surface with a first physical area 12512 that is darker than a second, immediately adjacent physical area 12514. For example, first physical area 12512 may have more ink printed thereon compared to second physical area 12514. Object 12510 may be placed on object 12520, which may be a pallet on which the box is placed. Figure 12B illustrates a 2D image 12600 that may be generated to represent object 12510. More specifically, the 2D image 12600 may include a first image region 12603 representing a first physical region 12512, and may include a second image region 12605 representing a second physical region 12514. The computing system 1100 of the present embodiment may identify a first candidate edge 12601 ₁ based on a boundary between the first image region 12603 and the second image region 12605.

実施形態では、計算システム１１００は、２Ｄ画像１２６００が、第一の候補エッジ１２６０１_１で定義された暗さ条件を満たさないと決定し得る。例えば、計算システム１１００は、２Ｄ画像１２６００が、第一の候補エッジ１２６０１_１で画像強度の段差形状の変化を有する画像強度プロファイル１２００１を有すると決定し得る。画像強度プロファイルは、画像のｕ軸に沿って延在する軸１２６０９に沿って測定され得る。いくつかの実施では、計算システム１１００は、画像強度プロファイル１２００１、またはより具体的には、画像領域１２６０３および１２６０５が、スパイク強度プロファイル基準を満たさないと決定し得る。計算システム１１００は、定義された暗さ条件が第一の候補エッジ１２６０１_１で満たされていないとさらに判断し得る。結果として、計算システム１１００は、エッジ検出情報１１２６から第一の候補エッジ１２６０１_１を除去し得る。 In an embodiment, the computing system 1100 may determine that the 2D image 12600 does not satisfy the darkness condition defined at the first candidate edge 12601 _1. For example, the computing system 1100 may determine that the 2D image 12600 has an image intensity profile 12001 that has a step-shaped change in image intensity at the first candidate edge 12601 _1. The image intensity profile may be measured along an axis 12609 that extends along the u-axis of the image. In some implementations, the computing system 1100 may determine that the image intensity profile 12001, or more specifically, image regions 12603 and 12605, do not satisfy the spike intensity profile criteria. The computing system 1100 may further determine that the defined darkness condition is not satisfied at the first candidate edge 12601 _1. As a result, the computing system 1100 may remove the first candidate edge 12601 ₁ from the edge detection information 1126.

図１３Ａの実施形態において、物体１３５１０および１３５２０はそれぞれ箱であってもよく、パレットまたは他のプラットフォームであり得る物体１３５３０上に配置され得る。本実施形態では、物体１３５１０は、物体１３５２０よりも暗くてもよい（例えば、より暗い段ボールまたは他の材料で作製された結果として）。さらに、二つの物体は、狭い物理的ギャップｇによって分離され得る。図１３Ｂは、第一の物体１３５１０を表す第一の画像領域１３６０３と、第二の物体１３５３０を表す第二の画像領域１３６０５とを含む、２Ｄ画像１３６００を示す。本実施形態の計算システム１１００は、二つの画像領域１３６０５、１３６０５の間の境界に基づいて、候補エッジ１３６０１_１を識別し得る。 In the embodiment of Fig. 13A, objects 13510 and 13520 may each be a box and may be placed on object 13530, which may be a pallet or other platform. In this embodiment, object 13510 may be darker than object 13520 (e.g., as a result of being made of darker cardboard or other material). Furthermore, the two objects may be separated by a narrow physical gap g. Fig. 13B shows a 2D image 13600 including a first image region 13603 representing a first object 13510 and a second image region 13605 representing a second object 13530. The computing system 1100 of this embodiment may identify candidate edges 13601 ₁ based on the boundary between the two image regions 13605, 13605.

図１２Ｂおよび１３Ｂに示すように、画像１２６００および１３６００が類似の外観を有し得る。しかしながら、図１３Ｃに示すように、画像１３６００は、画像強度のスパイク減少を含む画像強度プロファイルを有し得る。より具体的には、画像１３６００の画像領域１３６０３は、より具体的には、物体１３５１０、１３５２０の間の物理的ギャップｇを表すための画像領域１３６０３_１を含んでもよく、物体１３５２０を表すための画像領域１３６０３_２を含んでもよい。本実施形態では、画像領域１３６０３_１は、画像強度のスパイク減少を含んでもよく、定義された暗さ閾値よりも小さい最小ピクセル強度値を有し得る。従って、計算システム１１００は、画像領域１３６０３_１が、定義されたスパイク強度プロファイル基準および／または定義された暗さ閾値基準を満たすと決定し得る。結果として、計算システム１１００は、画像１３６００が、第一の候補エッジ１３６０３_１で定義された暗さ条件を満たすと決定し得る。従って、計算システム１１００は、第一の候補エッジ１３６０３_１を使用して、物体１３５１０、１３５２０の物理的エッジのうちの一つを表すように決定し得る。 As shown in Figures 12B and 13B, images 12600 and 13600 may have similar appearances. However, as shown in Figure 13C, image 13600 may have an image intensity profile that includes a spike decrease in image intensity. More specifically, image region 13603 of image 13600 may include image region _13603-1 to represent the physical gap g between objects 13510, 13520, and may include image region _13603-2 to represent object 13520. In this embodiment, image region _13603-1 may include a spike decrease in image intensity and may have a minimum pixel intensity value that is less than a defined darkness threshold. Thus, computing system 1100 may determine that image region _13603-1 meets the defined spike intensity profile criteria and/or the defined darkness threshold criteria. As a result, the computing system 1100 may determine that the image 13600 satisfies the darkness condition defined by the first candidate edge 13603 _1. Thus, the computing system 1100 may determine that the first candidate edge 13603 ₁ is used to represent one of the physical edges of the objects 13510, 13520.

さまざまな実施形態に関する追加の考察： Additional considerations regarding various embodiments:

実施形態１は、計算システム、または計算システムによって実施される方法を含む。計算システムは、通信インターフェイスおよび少なくとも一つの処理回路を含んでもよい。通信インターフェイスは、ロボット、およびカメラ視野を有するカメラと通信するように構成され得る。少なくとも一つの処理回路は、物体のグループがカメラ視野の中にあるとき、カメラによって生成される、物体のグループを表す画像情報を受信することと、画像情報から、物体のグループに関連付けられる複数の候補エッジを識別することであって、複数の候補エッジが、物体のグループの物理的エッジを表すためのそれぞれの候補を形成する、画像位置または物理的位置のそれぞれのセットであるか、またはそれらを含むことと、複数の候補エッジが、第一の画像領域と第二の画像領域との間の境界に基づいて形成される第一の候補エッジを含む場合、画像情報が、第一の候補エッジで定義された暗さ条件を満たすかどうかを判定することであって、第一の画像領域が、第二の画像領域よりも暗く、第一の画像領域および第二の画像領域が、画像情報によって記述されるそれぞれの領域であることと、複数の候補エッジのサブセットを選択して、物体のグループの物理的エッジを表すための候補エッジの選択されたサブセットを形成することであって、選択することが、画像情報が第一の候補エッジで定義された暗さ条件を満たすかどうかに基づいて、候補エッジの選択されたサブセット内に第一の候補エッジを含めることによって、物体のグループの物理的エッジの少なくとも一つを表す候補として、第一の候補エッジを保持するかどうかを判定することを含むことと、ロボット相互作用移動コマンドを出力することであって、ロボット相互作用移動コマンドが、ロボットと物体のグループの少なくとも一つの物体との間のロボット相互作用のためであり、候補エッジの選択されたサブセットに基づいて生成されることと、を実行するように構成され得る。本実施形態では、少なくとも一つの処理回路は、第一の画像領域が、定義された暗さ閾値基準または定義されたスパイク強度プロファイル基準のうちの少なくとも一つを満たすという決定に応答して、画像情報が、第一の候補エッジで定義された暗さ条件を満たすと決定するように構成される。さらに本実施形態では、少なくとも一つの処理回路は、第一の画像領域が、定義された暗さ閾値よりも画像強度が暗い少なくとも一つの部分を有するかどうかを判定することによって、第一の画像領域が定義された暗さ閾値基準を満たすかどうかを判定するように構成される。さらに本実施形態では、少なくとも一つの処理回路は、第一の画像領域が、（ｉ）画像強度が、位置の関数として第一の画像領域内の暗さが増加し、第一の画像領域内の位置で暗さのピークレベルに到達する、第一のプロファイル部分と、続いて（ｉｉ）画像強度が、位置の関数として、暗さのピークレベルから離れて、第一の画像領域内の暗さが減少する、第二のプロファイル部分とを含む、画像強度プロファイルを有するかどうかを判定することによって、第一の画像領域がスパイク強度プロファイル基準を満たすかどうかを判定するように構成される。 Embodiment 1 includes a computing system or a method implemented by the computing system. The computing system may include a communications interface and at least one processing circuit. The communications interface may be configured to communicate with a robot and a camera having a camera field of view. The at least one processing circuit receives image information representing a group of objects generated by the camera when the group of objects is within the camera field of view, and identifies from the image information a plurality of candidate edges associated with the group of objects, the plurality of candidate edges being or including respective sets of image locations or physical locations that form respective candidates for representing physical edges of the group of objects, and if the plurality of candidate edges includes a first candidate edge formed based on a boundary between a first image region and a second image region, determines whether the image information satisfies a darkness condition defined by the first candidate edge, the first image region being darker than the second image region, and the first image region and the second image region being darker than the respective candidate edges described by the image information. The image processing system may be configured to perform the following: selecting a subset of the plurality of candidate edges to form a selected subset of candidate edges for representing a physical edge of the group of objects, where selecting includes determining whether to retain the first candidate edge as a candidate for representing at least one of the physical edges of the group of objects by including the first candidate edge in the selected subset of candidate edges based on whether the image information satisfies a darkness condition defined by the first candidate edge; and outputting a robot interaction move command, where the robot interaction move command is for a robot interaction between the robot and at least one object of the group of objects and is generated based on the selected subset of candidate edges. In this embodiment, the at least one processing circuit is configured to determine that the image information satisfies the darkness condition defined by the first candidate edge in response to determining that the first image region satisfies at least one of the defined darkness threshold criterion or the defined spike intensity profile criterion. Further in this embodiment, the at least one processing circuit is configured to determine whether the first image region satisfies the defined darkness threshold criterion by determining whether the first image region has at least one portion with an image intensity darker than the defined darkness threshold. Further, in this embodiment, at least one processing circuit is configured to determine whether the first image region meets the spike intensity profile criteria by determining whether the first image region has an image intensity profile including (i) a first profile portion where the image intensity increases in darkness within the first image region as a function of position and reaches a peak level of darkness at a position within the first image region, followed by (ii) a second profile portion where the image intensity decreases in darkness within the first image region away from the peak level of darkness as a function of position.

実施形態２は、実施形態１に記載の計算システムを含み、第一の画像領域が、物体のグループの第一の物体と第二の物体との間の物理的ギャップを表すピクセルのバンドを形成する第一のピクセル領域であり、第二の画像領域が、第一の候補エッジを形成する境界が第一のピクセル領域と第二のピクセル領域との間にあるように、第一のピクセル領域に直に隣接する第二のピクセル領域である。 Embodiment 2 includes the computing system of embodiment 1, where the first image region is a first pixel region forming a band of pixels representing a physical gap between a first object and a second object of a group of objects, and the second image region is a second pixel region immediately adjacent to the first pixel region such that a boundary forming a first candidate edge is between the first pixel region and the second pixel region.

実施形態３は、実施形態２に記載の計算システムを含み、少なくとも一つの処理回路が、第一の画像領域が定義された暗さ閾値よりも小さいピクセル強度値を有するかどうかを判定することによって、第一の画像領域が、定義された暗さ閾値基準を満たすかどうかを判定するように構成される。 Embodiment 3 includes the computing system of embodiment 2, where at least one processing circuit is configured to determine whether the first image region meets a defined darkness threshold criterion by determining whether the first image region has pixel intensity values less than a defined darkness threshold.

実施形態４は、実施形態２または３の計算システムを含み、第一の画像領域の画像強度プロファイルが、第一の画像領域の幅寸法にわたって延びる一連のピクセルのそれぞれのピクセル強度値を記述し、少なくとも一つの処理回路が、画像強度プロファイルが、それぞれのピクセル強度値が、第一の画像領域における最小ピクセル強度値に向かって減少し、そして次に、最小ピクセル強度値から離れて増加するように切り替わる、形状を有するかどうかを判定することによって、第一の画像領域がスパイク強度プロファイル基準を満たすかどうかを判定するように構成され、最小ピクセル強度値が、第一の画像領域における暗さのピークレベルに関連している。 Embodiment 4 includes the computing system of embodiment 2 or 3, wherein an image intensity profile of the first image region describes pixel intensity values for each of a series of pixels extending across a width dimension of the first image region, and at least one processing circuit is configured to determine whether the first image region meets a spike intensity profile criterion by determining whether the image intensity profile has a shape in which each pixel intensity value decreases toward a minimum pixel intensity value in the first image region and then switches to increase away from the minimum pixel intensity value, the minimum pixel intensity value being associated with a peak level of darkness in the first image region.

実施形態５は、実施形態１～４のいずれか一つに記載の計算システムを含み、少なくとも一つの処理回路が、第一の画像領域がスパイク強度プロファイル基準を満たすという決定に応答してのみ、第一の画像領域が定義された暗さ条件を満たすと判定するように構成される。 Embodiment 5 includes a computing system according to any one of embodiments 1 to 4, in which at least one processing circuit is configured to determine that the first image region meets a defined darkness condition only in response to determining that the first image region meets the spike intensity profile criterion.

実施形態６は、実施形態１～５のいずれか一つに記載の計算システムを含み、少なくとも一つの処理回路が、第一の画像領域が定義された暗さ閾値基準を満たすという決定に応答してのみ、第一の画像領域が定義された暗さ条件を満たすと判定するように構成される。 Embodiment 6 includes a computing system according to any one of embodiments 1 to 5, wherein at least one processing circuit is configured to determine that the first image region meets a defined darkness condition only in response to determining that the first image region meets a defined darkness threshold criterion.

実施形態７は、実施形態１に記載の計算システムを含み、少なくとも一つの処理回路が、第一の画像領域が定義された暗さ閾値基準および定義されたスパイク強度プロファイル基準の両方を満たすという決定に応答してのみ、第一の画像領域が定義された暗さ条件を満たすと判定するように構成される。 Embodiment 7 includes the computing system of embodiment 1, where at least one processing circuit is configured to determine that the first image region meets the defined darkness condition only in response to determining that the first image region meets both the defined darkness threshold criterion and the defined spike intensity profile criterion.

実施形態８は、実施形態１～７のいずれか一つに記載の計算システムを含み、少なくとも一つの処理回路が、画像情報が２Ｄ画像情報および３Ｄ画像情報を含むとき、２Ｄ画像情報に基づいて、第一の画像領域と第二の画像領域との間の境界に基づいて形成される第一の候補エッジを識別するように構成され、３Ｄ画像情報が、カメラ視野内の位置の奥行き情報を含む。 Embodiment 8 includes the computing system of any one of embodiments 1 to 7, and at least one processing circuit is configured to identify a first candidate edge formed based on a boundary between a first image region and a second image region based on the 2D image information when the image information includes 2D image information and 3D image information, and the 3D image information includes depth information of a position within the camera field of view.

実施形態９は、実施形態８の計算システムを含み、少なくとも一つの処理回路が、（ｉ）３Ｄ画像情報に、第一の候補エッジに対応する一つまたは複数の位置の奥行き情報が欠落し、（ｉｉ）第一の候補画像に対応する３Ｄ画像情報の一部が、定義されたノイズ許容閾値よりも大きい撮像ノイズのレベルの影響を受ける場合に、物体のグループの物理的エッジの少なくとも一つを表すための候補として第一の候補エッジを保持するかどうかの判定を行うように構成される。 Embodiment 9 includes the computing system of embodiment 8, wherein at least one processing circuit is configured to determine whether to retain a first candidate edge as a candidate for representing at least one physical edge of a group of objects when (i) the 3D image information is missing depth information at one or more locations corresponding to the first candidate edge, and (ii) a portion of the 3D image information corresponding to the first candidate image is subject to a level of imaging noise greater than a defined noise tolerance threshold.

実施形態１０は、実施形態８または９の計算システムを含み、少なくとも一つの処理回路が、３Ｄ画像情報が第一の候補エッジに対応する一つまたは複数の位置で、定義された奥行き不連続状態を満たさない時に、物体のグループの物理的エッジのうちの少なくとも一つを表すための候補として、第一の候補エッジを保持するかどうかの判定を行うように構成される。 Embodiment 10 includes the computing system of embodiment 8 or 9, wherein at least one processing circuit is configured to determine whether to retain a first candidate edge as a candidate for representing at least one of the physical edges of the group of objects when the 3D image information does not satisfy a defined depth discontinuity condition at one or more locations corresponding to the first candidate edge.

実施形態１１は、実施形態１０の計算システムを含み、少なくとも一つの処理回路が、３Ｄ画像情報が、定義された奥行き差閾値を超える一つまたは複数の位置で奥行き変化を記述しないという決定に応答して、３Ｄ画像情報が、第一の候補エッジに対応する一つまたは複数の位置で、定義された奥行き不連続状態を満たさないと判定するように構成される。 Embodiment 11 includes the computing system of embodiment 10, wherein at least one processing circuit is configured to determine that the 3D image information does not satisfy a defined depth discontinuity condition at one or more locations corresponding to a first candidate edge in response to determining that the 3D image information does not describe a depth change at one or more locations that exceeds a defined depth difference threshold.

実施形態１２は、実施形態８～１１のうちのいずれか一つの計算システムを含み、少なくとも一つの処理回路が、３Ｄ画像情報に基づいて、複数の候補エッジの第二の候補エッジを識別するように構成される。 Embodiment 12 includes any one of the computing systems of embodiments 8 to 11, in which at least one processing circuit is configured to identify a second candidate edge of the plurality of candidate edges based on the 3D image information.

実施形態１３は、請求項１２に記載の計算システムを含み、少なくとも一つの処理回路は、定義された測定分散閾値を超えて互いに逸脱しないそれぞれの奥行き値を有する、３Ｄ画像情報によって記述される位置の第一のセットに基づいて、物体のグループの第一の表面を識別することと、定義された測定分散閾値内にあるそれぞれの奥行き値を有する、３Ｄ画像情報によって記述される位置の第二のセットに基づいて、物体のグループの第二の表面を識別することと、第一の平均奥行き値として、第一の表面に関連付けられる平均奥行き値を決定することと、第二の平均奥行き値として、第二の表面に関連付けられる平均奥行き値を決定することと、第一の平均奥行き値と第二の平均奥行き値との間の差が、定義された奥行き差閾値を超えるという決定に応答して、第一の表面と第二の表面との間の遷移がある位置に基づいて、第二の候補エッジを識別することと、によって、３Ｄ画像情報に基づいて第二の候補エッジを識別するように構成される。 Embodiment 13 includes the computing system of claim 12, wherein at least one processing circuit is configured to identify a second candidate edge based on the 3D image information by: identifying a first surface of the group of objects based on a first set of locations described by the 3D image information having respective depth values that do not deviate from each other by more than a defined measurement variance threshold; identifying a second surface of the group of objects based on a second set of locations described by the 3D image information having respective depth values that are within the defined measurement variance threshold; determining an average depth value associated with the first surface as the first average depth value; determining an average depth value associated with the second surface as the second average depth value; and, in response to determining that the difference between the first average depth value and the second average depth value exceeds the defined depth difference threshold, identifying a second candidate edge based on a location at which there is a transition between the first surface and the second surface.

実施形態１４は、実施形態１２または１３の計算システムを含み、少なくとも一つの処理回路が、第二の候補エッジが、２Ｄ画像情報の中にあり、定義された暗さ条件を満たさない二つの画像領域の間の境界に基づいて形成される、候補エッジにマッピングされるときに、３Ｄ画像情報に基づいて第二の候補エッジを識別するように構成される。 Embodiment 14 includes the computing system of embodiment 12 or 13, wherein at least one processing circuit is configured to identify a second candidate edge based on the 3D image information when the second candidate edge is mapped to a candidate edge in the 2D image information and formed based on a boundary between two image regions that do not satisfy a defined darkness condition.

実施形態１５は、実施形態１～１４の計算システムを含み、少なくとも一つの処理回路が、候補エッジの選択されたサブセットに基づいて、物体認識操作または物体登録操作を行うように構成され、ロボット相互作用移動コマンドが、物体認識操作または物体登録操作の結果に基づいて生成される。 Embodiment 15 includes the computing system of embodiments 1-14, wherein at least one processing circuit is configured to perform an object recognition operation or an object registration operation based on a selected subset of the candidate edges, and a robot interaction move command is generated based on a result of the object recognition operation or the object registration operation.

実施形態１６は、実施形態１～１５のいずれか一つに記載の計算システムを含み、少なくとも一つの処理回路が、複数の候補エッジからどの候補エッジをフィルターリングするかを決定することによって、複数の候補エッジのサブセットを選択するように構成され、複数の候補エッジが、フィルターリングされた後、候補エッジのサブセットを形成する。 Embodiment 16 includes the computing system of any one of embodiments 1 to 15, wherein at least one processing circuit is configured to select a subset of the plurality of candidate edges by determining which candidate edges to filter from the plurality of candidate edges, and the plurality of candidate edges form the subset of candidate edges after filtering.

関連分野の当業者にとって、本明細書に記載する方法および用途への、その他の好適な修正ならびに適応が、実施形態のうちのいずれの範囲から逸脱することなく成すことができることは明らかであろう。上に記載する実施形態は、説明に役立つ実施例であり、本発明がこれらの特定の実施形態に限定されると解釈されるべきではない。本明細書に開示するさまざまな実施形態は、記載および添付の図に具体的に提示する組み合わせとは異なる組み合わせで、組み合わせてもよいことが理解されるべきである。実施例によって、本明細書に記載するプロセスもしくは方法のいずれのある特定の行為または事象は、異なるシーケンスで行われてもよく、追加、統合、または完全に省略し得ることも理解されるべきである（例えば、記載した全ての行為または事象は、方法またはプロセスを実施するのに必要ではない場合がある）。一部の実例では、方法４０００は、ステップ４００２を省略するように修正され得る。上述のさまざまな実施形態は、方法４０００のステップ４００２～４００８に関連するが、本開示の別の方法は、図１１Ｂまたは１１Ｃに関して論じられるように、３Ｄ画像情報に基づいて候補エッジを特定することを含んでもよく、およびステップ４００２～４００８を省略し得る。加えて、本明細書の実施形態のある特定の特徴を、明確にするために、単一コンポーネント、モジュール、またはユニットにより行われていると記載しているものの、本明細書に記載する特徴および機能は、構成要素、モジュール、またはユニットのいかなる組み合わせによって行われてもよいことは理解されるべきである。従って、添付の特許請求の範囲で定義されるような、発明の趣旨または範囲から逸脱することなく、さまざまな変更および修正を当業者が及ぼし得る。

It will be apparent to those skilled in the relevant art that other suitable modifications and adaptations to the methods and applications described herein can be made without departing from the scope of any of the embodiments. The embodiments described above are illustrative examples, and the present invention should not be construed as being limited to these particular embodiments. It should be understood that the various embodiments disclosed herein may be combined in different combinations than those specifically presented in the description and accompanying figures. It should also be understood that, by way of example, certain acts or events of any of the processes or methods described herein may be performed in a different sequence, added, combined, or omitted entirely (e.g., not all acts or events described may be necessary to implement a method or process). In some instances, method 4000 may be modified to omit step 4002. Although the various embodiments described above relate to steps 4002-4008 of method 4000, another method of the present disclosure may include identifying candidate edges based on 3D image information, as discussed with respect to FIG. 11B or 11C, and may omit steps 4002-4008. In addition, although certain features of the embodiments herein are described for clarity as being performed by a single component, module, or unit, it should be understood that the features and functions described herein may be performed by any combination of components, modules, or units. Thus, various changes and modifications may be effected by one skilled in the art without departing from the spirit or scope of the invention, as defined in the appended claims.

Claims

1. A computing system comprising:
a communication interface configured to communicate with the robot and a camera having a camera field of view;
at least one processing circuit;
The at least one processing circuit, when a group of objects is within the camera field of view,
receiving image information generated by the camera and representative of the group of objects;
identifying candidate edges from the image information;
determining whether a portion of the image information adjacent the candidate edge satisfies an intensity profile criterion based on whether the portion of the image information includes (i) a first profile portion where image intensity increases in darkness followed by (ii) a second profile portion where image intensity decreases in darkness;
Outputting a robot interactive movement command;
configured to run
A computing system, wherein the robot interaction move command is for robot interaction between the robot and at least one object of the group of objects and is based on the candidate edge.

The computing system of claim 1, wherein if the intensity profile criterion is met, the portion of the image information adjacent the candidate edge is a first pixel region that forms a band of pixels representing a physical gap between a first object and a second object of the group of objects.

an image intensity profile of the portion of the image information adjacent the candidate edge describing respective pixel intensity values for a series of pixels extending across a width dimension of the portion;
2. The computing system of claim 1, wherein the at least one processing circuit is configured to determine whether the portion of the image information adjacent the candidate edge meets a spike intensity profile criterion by determining whether the image intensity profile has a shape in which the respective pixel intensity values decrease toward a minimum pixel intensity value and then switch to increase away from the minimum pixel intensity value, the minimum pixel intensity value being associated with a peak level of darkness in the portion of the image information adjacent the candidate edge.

The at least one processing circuit comprises:
only in response to determining that the portion of the image information adjacent the candidate edge satisfies the spike strength profile criterion;
The computing system of claim 3 , configured to determine that the portion of the image information adjacent the candidate edge satisfies the intensity profile criterion.

The at least one processing circuit comprises:
When the image information includes 2D image information and 3D image information,
configured to identify the candidate edges based on the 2D image information;
The computing system of claim 1 , wherein the 3D image information includes depth information for locations within the camera field of view.

The at least one processing circuit comprises:
(i) the 3D image information lacks depth information at one or more locations corresponding to the candidate edge, or (ii) a portion of the 3D image information corresponding to the candidate edge is subject to a level of imaging noise greater than a defined noise tolerance threshold;
The computing system of claim 5 , configured to determine whether to retain the candidate edge as a candidate for representing at least one physical edge of the group of objects.

The at least one processing circuit comprises:
when the 3D image information does not satisfy a defined depth discontinuity condition at one or more locations corresponding to the candidate edge;
The computing system of claim 5 , configured to determine whether to retain the candidate edge as a candidate for representing at least one physical edge of the group of objects.

The at least one processing circuit comprises:
in response to determining that the 3D image information does not describe a depth change at one or more locations that exceeds a defined depth difference threshold;
The computing system of claim 7 , configured to determine that the 3D image information does not satisfy the defined depth discontinuity condition at the one or more locations corresponding to the candidate edge.

The computing system of claim 5, wherein the at least one processing circuit is configured to identify a second candidate edge based on the 3D image information.

The at least one processing circuit comprises:
identifying a first surface of the group of objects based on a first set of locations described by the 3D image information having respective depth values that do not deviate from each other by more than a defined measurement variance threshold;
identifying a second surface of the group of objects based on a second set of locations described by the 3D image information having respective depth values that are within the defined measurement variance threshold;
determining an average depth value associated with the first surface as a first average depth value;
determining an average depth value associated with the second surface as a second average depth value;
in response to determining that the difference between the first average depth value and the second average depth value exceeds a defined depth difference threshold, identifying the second candidate edge based on a location of a transition between the first surface and the second surface;
10. The computing system of claim 9, configured to identify the second candidate edge based on the 3D image information by:

The at least one processing circuit comprises:
when the second candidate edge is mapped to a candidate that is in the 2D image information and is formed based on a boundary between two image regions that do not satisfy the intensity profile criterion;
The computing system of claim 9 , configured to identify the second candidate edge based on the 3D image information.

The computing system of claim 1, wherein the at least one processing circuit is configured to perform an object recognition operation or an object registration operation based on the candidate edges.

A non-transitory computer-readable medium having instructions,
The instructions, when executed by at least one processing circuit of a computing system, cause the at least one processing circuit to:
receiving image information by the at least one processing circuit of the computing system, the computing system being configured to communicate with (i) a robot and (ii) a camera having a camera field of view, the image information being representative of a group of objects within the camera field of view and generated by the camera;
identifying candidate edges from the image information;
determining whether a portion of the image information adjacent the candidate edge satisfies an intensity profile criterion based on whether the portion of the image information includes (i) a first profile portion where image intensity increases in darkness followed by (ii) a second profile portion where image intensity decreases in darkness;
Outputting a robot interactive movement command;
The above is executed.
A non-transitory computer-readable medium, wherein the robot interaction move command is for robot interaction between the robot and at least one object of the group of objects and is based on the candidate edge.

The non-transitory computer-readable medium of claim 13, wherein if the intensity profile criterion is met, the portion of the image information adjacent the candidate edge is a first pixel region that forms a band of pixels representing a physical gap between a first object and a second object of the group of objects.

1. A method performed by a computing system, comprising:
receiving image information by the computing system, the computing system configured to communicate with (i) a robot and (ii) a camera having a camera field of view, the image information being for representing a group of objects within the camera field of view and generated by the camera;
identifying candidate edges from the image information;
determining whether a portion of the image information adjacent the candidate edge satisfies an intensity profile criterion based on whether the portion of the image information includes (i) a first profile portion where image intensity increases in darkness followed by (ii) a second profile portion where image intensity decreases in darkness;
Outputting a robot interactive movement command;
Including,
The method, wherein the robot interaction move command is for robot interaction between the robot and at least one object of the group of objects and is based on the candidate edge.