JP7396202B2

JP7396202B2 - Generation program, generation method, and information processing device

Info

Publication number: JP7396202B2
Application number: JP2020092813A
Authority: JP
Inventors: 広之江刺; 哲也俣野
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2020-05-28
Filing date: 2020-05-28
Publication date: 2023-12-12
Anticipated expiration: 2040-05-28
Also published as: JP2021189629A

Description

本発明は、生成プログラム、生成方法、および情報処理装置に関する。 The present invention relates to a generation program, a generation method, and an information processing device.

例えば、製造、仕分け、およびその他の分野において部品および製品などを検査したり、形状を判定したりすることが行われている。こうした検査および判定などの処理は、処理対象の数や種類が膨大となることがあり、人手で行うことが難しいことがある。そのため、画像解析により処理する試みが成されている。 For example, parts and products are inspected and shapes are determined in manufacturing, sorting, and other fields. Processing such as inspection and determination may be difficult to perform manually because the number and types of objects to be processed may be enormous. Therefore, attempts have been made to process it by image analysis.

また、撮影装置で被写体を撮影する際に、例えば、障害物などに起因して被写体を好ましい向きから撮影できないことがある。例えば、この様な場合に、撮影装置で撮影した被写体が写る画像を、利用において好ましい向きから見た被写体が写る画像に変換する画像再投影の技術が利用されている。画像再投影の技術として、例えば、テクスチャマッピングおよび投影テクスチャマッピングなどの技術が知られている。 Further, when photographing a subject with a photographing device, it may not be possible to photograph the subject from a preferred direction due to, for example, an obstacle. For example, in such cases, an image reprojection technique is used to convert an image of a subject photographed by a photographing device into an image of the subject viewed from a preferred orientation for use. As techniques for image reprojection, techniques such as texture mapping and projected texture mapping are known, for example.

また、画像変換に関連する技術が知られている（例えば、特許文献１）。 Furthermore, techniques related to image conversion are known (for example, Patent Document 1).

特開平１１－０２５２９１号公報Japanese Patent Application Publication No. 11-025291

しかしながら、例えば、テクスチャマッピングおよび投影テクスチャマッピングなどの技術を用いて画像再投影を行う場合、被写体の３次元形状モデルを生成するため、計算負荷が高くなることがある。そのため、例えば、部品および製品の検査および判定などの現場のように、大量の撮影画像を処理する場合、これらの技術で処理することが難しいことがある。 However, when reprojecting an image using techniques such as texture mapping and projection texture mapping, a three-dimensional shape model of the subject is generated, which may increase the calculation load. Therefore, when processing a large number of captured images, such as in the case of inspection and judgment of parts and products, for example, it may be difficult to process using these techniques.

１つの側面では、本発明は、被写体が写る撮影画像から、その被写体が別の向きで写る画像を生成する処理にかかる負荷を低減することを目的とする。 In one aspect, an object of the present invention is to reduce the load required for processing to generate an image in which the subject is shown in a different direction from a captured image in which the subject is shown.

本発明の一つの態様の生成プログラムは、各画素が被写体の深度を示す深度情報と対応づけられている撮影画像の撮影に用いられた撮影装置の光学中心と、撮影画像が投影される撮影投影面と、視点と、視点から被写体を写した視点画像が投影される視点投影面とを、仮想空間上に配置し、視点から視点投影面上に配置した視点画像の処理対象画素に向けて延ばした視線と、光学中心から撮影投影面上に配置した撮影画像の画素に向けて延ばした投射線との交差位置と、撮影画像の画素と対応する深度情報が示す被写体の深度とに基づいて、視点画像の処理対象画素と対応する画素値を、撮影画像の画素のうちから取得し、処理対象画素に対して取得した画素値を用いて視点画像を生成する、処理をコンピュータに実行させる。 A generation program according to one aspect of the present invention includes the optical center of the photographing device used to photograph the photographed image, in which each pixel is associated with depth information indicating the depth of the subject, and the photographic projection onto which the photographed image is projected. A plane, a viewpoint, and a viewpoint projection surface onto which a viewpoint image of the subject is projected from the viewpoint are arranged in virtual space, and the viewpoint image placed on the viewpoint projection surface is extended from the viewpoint toward the processing target pixel. Based on the intersection position of the line of sight and the projection line extending from the optical center toward the pixels of the photographed image placed on the photographic projection plane, and the depth of the subject indicated by the depth information corresponding to the pixels of the photographed image, A computer is caused to execute a process of acquiring a pixel value corresponding to a processing target pixel of a viewpoint image from among the pixels of the photographed image, and generating a viewpoint image using the acquired pixel value for the processing target pixel.

被写体が写る撮影画像から、その被写体が別の向きで写る画像を生成する処理にかかる負荷を低減できる。 It is possible to reduce the load required for processing to generate an image of the subject in a different orientation from a photographed image of the subject.

実施形態が利用される例示的な撮影システムを示す図である。1 is a diagram illustrating an exemplary imaging system in which embodiments may be utilized; FIG. 実施形態に係る情報処理装置のブロック構成を例示する図である。FIG. 1 is a diagram illustrating a block configuration of an information processing device according to an embodiment. 実施形態に係る撮影画像からの視点画像の生成の流れを例示する図である。FIG. 3 is a diagram illustrating a flow of generating a viewpoint image from a photographed image according to an embodiment. 実施形態に係る撮影画像からの視点画像の生成の流れを例示する図である。FIG. 3 is a diagram illustrating a flow of generating a viewpoint image from a photographed image according to an embodiment. 実施形態に係る撮影画像からの視点画像の生成の流れを例示する図である。FIG. 3 is a diagram illustrating a flow of generating a viewpoint image from a photographed image according to an embodiment. 実施形態に係る死角領域について例示する図である。FIG. 3 is a diagram illustrating a blind spot area according to an embodiment. 実施形態に係る遮蔽領域について例示する図である。FIG. 3 is a diagram illustrating a shielding area according to an embodiment. 実施形態に係る視点画像の生成処理の動作フローを例示する図である。FIG. 3 is a diagram illustrating an operational flow of viewpoint image generation processing according to the embodiment. 実施形態に係る視線と対応する投射線の特定処理の動作フローを例示する図である。FIG. 3 is a diagram illustrating an operation flow of a process for specifying a projection line corresponding to a line of sight according to an embodiment. 実施形態に係る視線と対応する投射線の特定処理の流れを例示する図である。FIG. 3 is a diagram illustrating the flow of a process for specifying a projection line corresponding to a line of sight according to an embodiment. 実施形態に係る撮影画像と視点画像とを例示する図である。FIG. 3 is a diagram illustrating a photographed image and a viewpoint image according to an embodiment. 実施形態に係る複数の位置から撮影された撮影画像を用いた視点画像の生成について例示する図である。FIG. 3 is a diagram illustrating generation of a viewpoint image using captured images captured from a plurality of positions according to an embodiment. 実施形態に係る複数の位置から撮影された撮影画像を用いた視点画像の生成について例示する図である。FIG. 3 is a diagram illustrating generation of a viewpoint image using captured images captured from a plurality of positions according to an embodiment. 実施形態に係る異なる２つの位置から撮影された撮影画像に基づく視点画像の生成処理の動作フローを例示する図である。FIG. 3 is a diagram illustrating an operation flow of a viewpoint image generation process based on captured images captured from two different positions according to the embodiment. 実施形態に係る情報処理装置を実現するためのコンピュータのハードウェア構成を例示する図である。1 is a diagram illustrating a hardware configuration of a computer for realizing an information processing apparatus according to an embodiment.

以下、図面を参照しながら、本発明のいくつかの実施形態について説明する。なお、複数の図面において対応する要素には同一の符号を付す。 Hereinafter, some embodiments of the present invention will be described with reference to the drawings. Note that the same reference numerals are given to corresponding elements in a plurality of drawings.

図１は、実施形態が利用される例示的な撮影システム１００を示す図である。撮影システム１００は、例えば、情報処理装置１０１、撮影装置１０２、設置台１０５などを含む。設置台１０５は、例えば、被写体１１０が設置される台であり、一例ではベルトコンベアなどの製造ラインであってよい。設置台１０５には、例えば、部品、製品、製造物、および成形物などの撮影対象物１０６が設置される。撮影装置１０２は、設置台１０５に設置された撮影対象物１０６を撮影する。 FIG. 1 is a diagram illustrating an exemplary imaging system 100 in which embodiments may be utilized. The photographing system 100 includes, for example, an information processing device 101, a photographing device 102, an installation stand 105, and the like. The installation table 105 is, for example, a table on which the subject 110 is placed, and may be, for example, a production line such as a belt conveyor. An object to be photographed 106 such as a part, a product, a manufactured product, and a molded object is installed on the installation stand 105, for example. The photographing device 102 photographs an object to be photographed 106 installed on an installation stand 105 .

情報処理装置１０１は、例えば、撮影装置１０２と有線または無線で通信接続されている。情報処理装置１０１は、例えば、パーソナルコンピュータ（ＰＣ）、モバイルＰＣ、タブレット端末、または、ネットワークを介して撮影装置１０２と接続されるサーバコンピュータなどの画像処理機能を備えるコンピュータであってよい。 The information processing device 101 is, for example, communicatively connected to the photographing device 102 by wire or wirelessly. The information processing device 101 may be, for example, a personal computer (PC), a mobile PC, a tablet terminal, or a computer equipped with an image processing function such as a server computer connected to the photographing device 102 via a network.

撮影装置１０２は、例えば、ＲＧＢ－Ｄカメラおよびステレオカメラなどの撮影画像と、その撮影画像の画素と対応する深度の情報を取得可能な撮影装置であってよい。一例では、撮影装置１０２は、深度画像を撮影する。撮影装置１０２は、例えば、被写体１１０が写る画像を撮影する。被写体１１０は、例えば、画像に写り込む物体であってよい。図１の例では、設置台１０５および撮影対象物１０６が撮影画像に被写体１１０として写っていてよい。撮影対象物１０６は、例えば、部品、製品、製造物、および成形物などである。撮影装置１０２は、例えば、撮影した被写体１１０が写る撮影画像および深度画像を情報処理装置１０１に送信する。 The photographing device 102 may be, for example, a photographing device such as an RGB-D camera or a stereo camera that can acquire a photographed image and depth information corresponding to the pixels of the photographed image. In one example, the imaging device 102 captures a depth image. The photographing device 102 photographs, for example, an image in which the subject 110 appears. The subject 110 may be, for example, an object that appears in an image. In the example of FIG. 1, the installation stand 105 and the object to be photographed 106 may be shown as the object 110 in the photographed image. The object to be photographed 106 is, for example, a part, a product, a manufactured product, a molded object, or the like. The photographing device 102 transmits, for example, a photographed image and a depth image showing the photographed subject 110 to the information processing device 101.

なお、図１では、例えば、製造、仕分け、およびその他の分野における部品および製品の検査および判定などの現場での実施形態の利用を例示しているが、実施形態はこれに限定されるものではない。実施形態は、被写体１１０が写る撮影画像を、その被写体１１０を別の向きから見た画像に変換する処理が行われる様々な状況で利用することが可能である。 Note that although FIG. 1 illustrates the use of the embodiment in the field, for example, for inspection and determination of parts and products in manufacturing, sorting, and other fields, the embodiment is not limited thereto. do not have. The embodiment can be used in various situations in which processing is performed to convert a photographed image in which the subject 110 is seen into an image of the subject 110 viewed from a different direction.

なお、図１に示すように、例えば、ＲＧＢ－Ｄカメラなどの撮影装置１０２で撮影を行う場合、その撮影画像と対応する深度画像も得られる。そのため、深度画像の深度の情報を用いて３次元形状モデルを生成することができる。そして、生成した３次元形状モデルに撮影画像のテクスチャを貼り付けて回転させることで、様々な視点から被写体１１０を撮影した撮影画像を得ることができる。しかしながら、３次元形状モデルは、大量のポリゴンを処理して生成されるため処理負荷が高く、高速に処理を行ったり、大量の撮影画像を処理したりする用途では利用が難しいことがある。そのため、被写体が写る撮影画像から、その被写体が別の向きで写る画像を生成する画像変換にかかる処理負荷を低減できる更なる技術の提供が望まれている。 Note that, as shown in FIG. 1, when a photograph is taken using the photographing device 102 such as an RGB-D camera, a depth image corresponding to the photographed image is also obtained. Therefore, a three-dimensional shape model can be generated using the depth information of the depth image. Then, by attaching the texture of the captured image to the generated three-dimensional shape model and rotating it, captured images of the subject 110 captured from various viewpoints can be obtained. However, since the three-dimensional shape model is generated by processing a large number of polygons, the processing load is high, and it may be difficult to use it in applications that require high-speed processing or processing a large number of captured images. Therefore, it is desired to provide a further technique that can reduce the processing load required for image conversion for generating an image in which the subject is viewed in a different direction from a photographed image in which the subject appears.

以下で述べる実施形態では、撮影画像の画素と、撮影画像に基づいて生成を行う別の視点から被写体１１０を写した画像の画素とを、投射線および視線の光線追跡により特定する。なお、撮影画像に基づいて生成を行う別の視点から被写体１１０を写した画像を、以下では、視点画像と呼ぶことがある。また、投射線は、例えば、撮影画像の撮影に用いられた撮影装置の光学中心から撮影画像の画素に向かう直線で表される。視線は、例えば、視点から視点画像の画素に向かう直線で表される。実施形態によれば、被写体１１０の３次元形状モデルを生成しなくても、視点画像を生成することができるため、視点画像の生成にかかる処理負荷を低減することができる。 In the embodiment described below, pixels of a photographed image and pixels of an image of the subject 110 from another viewpoint, which is generated based on the photographed image, are identified by ray tracing of a projection line and a line of sight. Note that an image of the subject 110 that is generated from a different viewpoint based on the photographed image may be referred to as a viewpoint image below. Further, the projection line is represented by, for example, a straight line extending from the optical center of the photographing device used to photograph the photographed image toward the pixel of the photographed image. The line of sight is represented, for example, by a straight line from the viewpoint to the pixel of the viewpoint image. According to the embodiment, since a viewpoint image can be generated without generating a three-dimensional shape model of the subject 110, it is possible to reduce the processing load required to generate a viewpoint image.

なお、実施形態において視点画像の生成に用いる撮影画像は、予め前処理を実行して鏡面反射光の成分が除去された拡散反射光の画像であってよい。撮影画像の鏡面反射光の成分は、光源の影響を受ける成分であるため、視点が変わると変化し得る。そのため、例えば、視点画像における被写体１１０の色味などをその後の処理で用いる場合には、視点画像の生成に用いる撮影画像から予め除去しておくことは好ましい。しかしながら、実施形態はこれに限定されるものではない。例えば、視点画像を被写体１１０の輪郭や形状の外観検査に用いる場合には、鏡面反射光の成分が含まれていてもエッジの形状を抽出することが可能であるため、鏡面反射光の成分の除去は実行されなくてもよい。以下、実施形態を更に詳細に説明する。 Note that in the embodiment, the photographed image used to generate the viewpoint image may be an image of diffuse reflected light in which a specular reflected light component is removed by performing preprocessing in advance. The specularly reflected light component of the photographed image is a component that is affected by the light source, so it can change when the viewpoint changes. Therefore, for example, if the color of the subject 110 in the viewpoint image is to be used in subsequent processing, it is preferable to remove it in advance from the photographed image used to generate the viewpoint image. However, embodiments are not limited thereto. For example, when using a viewpoint image to inspect the appearance of the outline and shape of the subject 110, it is possible to extract the edge shape even if the specularly reflected light component is included. No removal may be performed. Hereinafter, embodiments will be described in more detail.

図２は、実施形態に係る情報処理装置１０１のブロック構成を例示する図である。情報処理装置１０１は、例えば、制御部２０１、記憶部２０２、通信部２０３、および表示装置２０４を含む。制御部２０１は、情報処理装置１０１の各部を制御してよい。制御部２０１は、例えば、配置部２１１、取得部２１２、および生成部２１３などを含み、またその他の機能部を含んでもよい。記憶部２０２は、例えば、後述する動作フローを実行するためのプログラム、およびプログラムの実行で参照する情報などを記憶している。通信部２０３は、例えば、制御部２０１の指示に従って他の装置と通信する。通信部２０３は、例えば、制御部２０１の指示に従って撮影装置１０２と通信し、撮影装置１０２から被写体１１０の写る撮影画像を取得してよい。表示装置２０４は、例えば、制御部２０１の指示に従って、表示画面に情報を表示する。これらの各部の詳細および記憶部２０２に格納されている情報の詳細については後述する。 FIG. 2 is a diagram illustrating a block configuration of the information processing apparatus 101 according to the embodiment. The information processing device 101 includes, for example, a control unit 201, a storage unit 202, a communication unit 203, and a display device 204. The control unit 201 may control each unit of the information processing device 101. The control unit 201 includes, for example, a placement unit 211, an acquisition unit 212, a generation unit 213, and may also include other functional units. The storage unit 202 stores, for example, a program for executing an operation flow described later, information to be referred to when executing the program, and the like. The communication unit 203 communicates with other devices according to instructions from the control unit 201, for example. The communication unit 203 may, for example, communicate with the photographing device 102 according to instructions from the control unit 201 and obtain a photographed image of the subject 110 from the photographing device 102. The display device 204 displays information on a display screen according to instructions from the control unit 201, for example. The details of each of these units and the information stored in the storage unit 202 will be described later.

図３から図５は、実施形態に係る撮影画像からの視点画像の生成の流れを例示する図である。なお、以下の説明において、図面が３次元の仮想空間内での物体の配置を示している場合、軸の方向を表す矢印を付している。例えば、図３では、図面の縦方向にＺ軸、図面の横方向にＸ軸、および図面の奥行方向にＹ軸がとられている。なお、実施形態に係る仮想空間の軸の方向はこれに限定されるものではなく、空間内の座標を指定することができれば、軸は任意の方向にとられてよい。 3 to 5 are diagrams illustrating the flow of generating a viewpoint image from a photographed image according to the embodiment. In the following description, when a drawing shows the arrangement of objects in a three-dimensional virtual space, an arrow indicating the direction of the axis is attached. For example, in FIG. 3, the Z axis is in the vertical direction of the drawing, the X axis is in the horizontal direction of the drawing, and the Y axis is in the depth direction of the drawing. Note that the direction of the axis of the virtual space according to the embodiment is not limited to this, and the axis may be taken in any direction as long as the coordinates in the space can be specified.

図３に示すように、撮影画像からの視点画像の生成処理において、制御部２０１は、仮想空間に撮影装置１０２の光学中心３０１を配置する。また、制御部２０１は、光学中心３０１から光軸３０２の前方の焦点距離の位置に、光軸に対して垂直に撮影画像の投影面として撮影投影面３０３を配置する。なお、撮影投影面３０３に投影される投影像に重ね合わせて撮影装置１０２で撮影した撮影画像および深度画像を配置することができる。この場合、撮影画像および深度画像の画像中心は、例えば、光軸３０２上にある。更に、制御部２０１は、撮影投影面３０３に対して平行に３次元平面３０５を配置する。なお、図３において、光学中心３０１から延びる破線は撮影視野を表している。 As shown in FIG. 3, in the process of generating a viewpoint image from a photographed image, the control unit 201 places the optical center 301 of the photographing device 102 in virtual space. Further, the control unit 201 arranges a photographic projection plane 303 as a projection plane of a photographed image perpendicularly to the optical axis at a position of a focal length in front of the optical axis 302 from the optical center 301 . Note that a captured image and a depth image captured by the imaging device 102 can be arranged to be superimposed on the projected image projected onto the imaging projection surface 303. In this case, the image centers of the photographed image and the depth image are on the optical axis 302, for example. Further, the control unit 201 arranges a three-dimensional plane 305 parallel to the imaging projection plane 303. In addition, in FIG. 3, a broken line extending from the optical center 301 represents the photographing field of view.

また、撮影投影面３０３に配置した深度画像の深度の情報は、被写体１１０の位置を表している。図３には、深度画像の画素ごとの深度の情報に基づいて、被写体１１０として設置台１０５および撮影対象物１０６の位置が破線で示されている。被写体１１０は、例えば、光学中心３０１から撮影投影面３０３上に配置した深度画像の画素３０７に向かって延ばした直線上の画素３０７の深度情報で示される深度の位置にある。なお、撮影投影面３０３上に配置した撮影画像および深度画像の画素に向かう直線を、画素の深度情報で示される深度まで延ばした矢印を、以下では、例えば、投射線３０８と呼ぶ。 Furthermore, the depth information of the depth image placed on the photographic projection plane 303 represents the position of the subject 110. In FIG. 3, the positions of the installation table 105 and the object to be photographed 106 as the object 110 are indicated by broken lines based on the depth information for each pixel of the depth image. The subject 110 is, for example, at a depth position indicated by the depth information of the pixel 307 on a straight line extending from the optical center 301 toward the pixel 307 of the depth image arranged on the imaging projection plane 303. Note that an arrow extending a straight line toward a pixel of a photographed image and a depth image arranged on the photographic projection plane 303 to a depth indicated by the depth information of the pixel is hereinafter referred to as a projection line 308, for example.

続いて、制御部２０１は、３次元平面３０５に対して光学中心３０１のある側の半空間の任意の位置に視点３１０を配置する。制御部２０１は、視点３１０の前方に、視点３１０に透視投影する際の投影面として視点投影面３１１を配置する。なお、図３において視点３１０から延びる破線は視点視野を表している。 Subsequently, the control unit 201 places the viewpoint 310 at an arbitrary position in the half space on the side of the optical center 301 with respect to the three-dimensional plane 305. The control unit 201 arranges a viewpoint projection surface 311 in front of the viewpoint 310 as a projection surface for perspective projection onto the viewpoint 310 . Note that in FIG. 3, a broken line extending from the viewpoint 310 represents the viewpoint field of view.

この場合に、実施形態では制御部２０１は、視点３１０の位置から視点投影面３１１上に投影される被写体１１０の視点画像を生成する処理を実行する。なお、以下では、視点３１０から視点投影面３１１上に配置された視点画像の或る画素３１２に向かって延びる直線を、例えば、視線３１３と呼ぶ。 In this case, in the embodiment, the control unit 201 executes processing to generate a viewpoint image of the subject 110 projected onto the viewpoint projection plane 311 from the position of the viewpoint 310. Note that, hereinafter, a straight line extending from the viewpoint 310 toward a certain pixel 312 of a viewpoint image arranged on the viewpoint projection plane 311 is referred to as a line of sight 313, for example.

続いて、制御部２０１は、視線３１３と交わる投射線３０８のうちから、視線３１３と投射線３０８との交差位置の深度と、投射線３０８が撮影投影面３０３上で通る画素の深度情報が示す被写体１１０の深度との深度差が最小となる投射線３０８を特定する。例えば、図４には、投射線３０８ａから投射線３０８ｃの３つの投射線３０８が示されている。ここで、投射線３０８ｃは、視線３１３と交差していないため排除される。一方、投射線３０８ａおよび投射線３０８ｂは、視線３１３と交差している。また、投射線３０８ａは、視線３１３と点４０１の位置で交差しており、投射線３０８ａが撮影投影面３０３上で通る深度画像の画素４１２の深度情報は点４０２の位置における被写体１１０の深度を示している。投射線３０８ａでは、視線３１３との交差位置（図４の点４０１）の深度と、投射線３０８ａが撮影投影面３０３上で通る画素４１２の深度情報が示す被写体（図４の点４０２）までの深度との深度差は大きい。一方、投射線３０８ｂでは、視線３１３との交差位置（図４の点４０３）の深度と、投射線３０８ｂが撮影投影面３０３上で通る画素４１３の深度情報が示す被写体１１０の深度とがほぼ同じ深度であり、深度差が小さい。そのため、制御部２０１は、深度差が最小となる投射線３０８として投射線３０８ｂを特定してよい。 Next, the control unit 201 determines, from among the projection lines 308 that intersect with the line of sight 313, the depth of the intersection position of the line of sight 313 and the projection line 308, and the depth information of the pixels through which the projection line 308 passes on the imaging projection plane 303. The projection line 308 with the minimum depth difference from the depth of the subject 110 is identified. For example, three projection lines 308 are shown in FIG. 4, from projection line 308a to projection line 308c. Here, the projection line 308c is excluded because it does not intersect the line of sight 313. On the other hand, the projection line 308a and the projection line 308b intersect the line of sight 313. Furthermore, the projection line 308a intersects the line of sight 313 at the position of the point 401, and the depth information of the pixel 412 of the depth image through which the projection line 308a passes on the imaging projection plane 303 indicates the depth of the subject 110 at the position of the point 402. It shows. For the projection line 308a, the depth at the intersection with the line of sight 313 (point 401 in FIG. 4) and the distance to the subject (point 402 in FIG. 4) indicated by the depth information of the pixel 412 through which the projection line 308a passes on the imaging projection plane 303 are determined. The difference in depth is large. On the other hand, for the projection line 308b, the depth at the intersection with the line of sight 313 (point 403 in FIG. 4) is almost the same as the depth of the subject 110 indicated by the depth information of the pixel 413 through which the projection line 308b passes on the imaging projection plane 303. depth, and the difference in depth is small. Therefore, the control unit 201 may specify the projection line 308b as the projection line 308 with the minimum depth difference.

なお、例えば、投射線３０８が通る撮影投影面３０３上の撮影画像の画素と、視線３１３が通る視点投影面３１１上の視点画像の画素とが被写体の同じ領域を写しているとする。この場合、投射線３０８と視線３１３との交差位置の深度と、投射線３０８が通る撮影投影面３０３上の深度画像の画素の深度情報が示す被写体１１０の深度とはおおよそ一致する。そのため、深度差が最小となり、所定の誤差範囲（例えば、５ｍｍ以下）内に小さくなる投射線３０８を特定することで、制御部２０１は、視線３１３が通る視点投影面３１１上の画素と同じ被写体１１０を写す撮影投影面３０３上の画素を特定することができる。 Note that, for example, it is assumed that the pixels of the photographed image on the photographic projection plane 303 through which the projection line 308 passes and the pixels of the viewpoint image on the viewpoint projection plane 311 through which the line of sight 313 passes reflect the same area of the subject. In this case, the depth at the intersection of the projection line 308 and the line of sight 313 approximately matches the depth of the subject 110 indicated by the depth information of the pixel of the depth image on the imaging projection plane 303 through which the projection line 308 passes. Therefore, by specifying the projection line 308 in which the depth difference is the minimum and is within a predetermined error range (for example, 5 mm or less), the control unit 201 can detect the same subject as the pixel on the viewpoint projection plane 311 through which the line of sight 313 passes. 110 can be specified on the imaging projection plane 303.

なお、深度差は、一例では、撮影投影面３０３と並行な３次元平面３０５からの視線３１３と投射線３０８との交差位置までの深度と、３次元平面３０５からの被写体１１０までの深度との差分から求めることができる。３次元平面３０５からの被写体１１０までの深度は、例えば、３次元平面３０５を撮影投影面３０３上に配置した場合、撮影投影面３０３上の深度画像の画素の深度情報が示す深度から焦点距離を差し引くことで求めることができる。 Note that the depth difference is, for example, the depth between the depth from the three-dimensional plane 305 parallel to the imaging projection plane 303 to the intersection position of the line of sight 313 and the projection line 308, and the depth from the three-dimensional plane 305 to the subject 110. It can be determined from the difference. The depth from the three-dimensional plane 305 to the subject 110 is determined by, for example, when the three-dimensional plane 305 is placed on the photographic projection plane 303, the focal length is calculated from the depth indicated by the depth information of the pixel of the depth image on the photographic projection plane 303. It can be found by subtracting it.

そして、制御部２０１は、例えば、図５に示すように、特定した投射線３０８が通る撮影投影面３０３上の画素４１３の画素値を、視線３１３が通る視点投影面３１１上の画素３１２の画素値として用いることができる。 Then, for example, as shown in FIG. 5, the control unit 201 changes the pixel value of the pixel 413 on the imaging projection plane 303 through which the specified projection line 308 passes, to the pixel value of pixel 312 on the viewpoint projection plane 311 through which the line of sight 313 passes. Can be used as a value.

更に、制御部２０１は、視点投影面３１１上の視点画像の全ての画素について、同様に撮影投影面３０３上の撮影画像に対応する画素があれば、その画素から画素値を取得することができる。それによって、制御部２０１は、視点３１０の位置から見た視点画像を生成することができる。 Furthermore, for all pixels of the viewpoint image on the viewpoint projection surface 311, if there is a pixel corresponding to the photographed image on the photographic projection surface 303, the control unit 201 can obtain a pixel value from that pixel. . Thereby, the control unit 201 can generate a viewpoint image seen from the position of the viewpoint 310.

また、以上で述べた撮影画像からの視点画像の生成処理では、例えば、被写体１１０の３次元形状モデルを生成しなくても、視点画像を生成することが可能である。そのため、処理負荷を抑えて視点画像を生成することができ、高速な処理が求められる状況、および大量の画像を処理する状況で、視点画像を得ることが可能になる。 Further, in the process of generating a viewpoint image from a photographed image described above, for example, it is possible to generate a viewpoint image without generating a three-dimensional shape model of the subject 110. Therefore, a viewpoint image can be generated with a reduced processing load, and a viewpoint image can be obtained in situations where high-speed processing is required and in situations where a large amount of images are to be processed.

続いて、実施形態に係る視点画像における死角領域の識別について説明する。 Next, identification of a blind spot area in a viewpoint image according to an embodiment will be described.

（死角領域の識別）
例えば、撮影画像において、手前にある別の被写体１１０により隠されて見えなかった領域が、視点を動かした結果、見えるようになることがある。この場合に、撮影画像において見えていない視点画像上の領域を、例えば、死角領域と呼ぶ。死角領域は、撮影画像においてその領域が死角となっており、画像情報が撮れていないため、描画できない視点画像の領域を表わしている。 (Identification of blind spot area)
For example, in a captured image, an area that was hidden by another subject 110 in the foreground may become visible as a result of moving the viewpoint. In this case, an area on the viewpoint image that is not visible in the captured image is called a blind spot area, for example. The blind spot area represents an area of the viewpoint image that cannot be drawn because the area is a blind spot in the captured image and no image information has been captured.

図６は、実施形態に係る死角領域の特定について例示する図である。例えば、図６（ａ）に示すように、投射線３０８が通る撮影投影面３０３上の画素の深度情報が示す深度よりも深い位置で、投射線３０８と視線３１３とが交差することがある。この場合、視線３１３が通る画素に写るはずの被写体１１０の領域が、撮影画像においては死角となっており、その部分の画像情報が取得できていないことを表している。 FIG. 6 is a diagram illustrating identification of a blind spot area according to the embodiment. For example, as shown in FIG. 6A, the projection line 308 and the line of sight 313 may intersect at a position deeper than the depth indicated by the depth information of the pixel on the imaging projection plane 303 through which the projection line 308 passes. In this case, the area of the subject 110 that is supposed to be captured in the pixel through which the line of sight 313 passes is a blind spot in the captured image, indicating that image information for that part cannot be acquired.

例えば、図６（ａ）の例では、視線３１３ａは被写体１１０の点６０３の位置を指しており、視点３１０から撮影を行った場合に得られる視点画像には、点６０３の位置の被写体１１０が写る。しかしながら、この点６０３の部分は光学中心３０１から撮影した撮影画像では被写体１１０の裏に隠れて死角となっている。そのため、撮影画像には点６０３の位置の画像情報が含まれていない。この場合、視点画像においては点６０３の位置などの死角領域の画像は生成されなくてよい。また、この場合に、制御部２０１は、視点画像において死角領域に所定の無効を表す色を設定するなど、死角領域が識別可能に表示されるようにしてよい。それにより、視点画像を見たユーザは、死角領域には撮影画像には写っていない何かが写っていることを知ることができる。 For example, in the example of FIG. 6A, the line of sight 313a points to the position of point 603 on the subject 110, and the viewpoint image obtained when shooting from the viewpoint 310 shows the subject 110 at the position of point 603. Take a picture. However, in the photographed image taken from the optical center 301, this point 603 is hidden behind the subject 110 and becomes a blind spot. Therefore, the captured image does not include image information at the position of point 603. In this case, an image of a blind spot area such as the position of point 603 does not need to be generated in the viewpoint image. Further, in this case, the control unit 201 may set the blind spot area in a viewpoint image in a predetermined color representing invalidity, so that the blind spot area is displayed in an identifiable manner. Thereby, the user viewing the viewpoint image can know that something that is not shown in the captured image is shown in the blind spot area.

なお、死角領域の識別では、制御部２０１は、まず、視線３１３と投射線３０８との交差位置の深度と、投射線３０８が通る撮影投影面３０３上の画素の深度情報が示す被写体１１０の深度との深度差が最小になる投射線３０８を特定する。図６の例では、点６０１を指す投射線３０８が特定されたとする。この場合、制御部２０１は、視線３１３と投射線３０８との交差位置（例えば、点６０２）と、投射線３０８が撮影投影面３０３上で通る画素の深度情報が示す被写体１１０の位置（例えば、点６０１）とが所定の条件を満たして離れているか否かを判定する。そして、交差位置（例えば、点６０２）と、深度情報が示す被写体１１０の位置（例えば、点６０１）とが所定の条件を満たして離れている場合に、視線３１３が視点投影面３１１上で通る視点画像の画素を死角領域と判定してよい。 Note that in identifying the blind spot area, the control unit 201 first determines the depth of the intersection of the line of sight 313 and the projection line 308 and the depth of the subject 110 indicated by the depth information of the pixel on the imaging projection plane 303 through which the projection line 308 passes. A projection line 308 with the minimum depth difference between the two points is identified. In the example of FIG. 6, it is assumed that a projection line 308 pointing to a point 601 has been identified. In this case, the control unit 201 determines the intersection position of the line of sight 313 and the projection line 308 (for example, point 602) and the position of the subject 110 indicated by the depth information of the pixel through which the projection line 308 passes on the imaging projection plane 303 (for example, It is determined whether or not the point 601) satisfies a predetermined condition and is far away from the point 601). Then, when the intersection position (for example, point 602) and the position of the subject 110 indicated by the depth information (for example, point 601) meet a predetermined condition and are separated, the line of sight 313 passes on the viewpoint projection plane 311. A pixel of the viewpoint image may be determined to be a blind spot area.

所定の条件は、例えば、視線３１３と投射線３０８との交差位置（例えば、点６０２）と、投射線３０８が撮影投影面３０３上で通る画素の深度情報が示す被写体１１０の位置（例えば、点６０１）との距離が所定の距離以上であることである。別な例では、所定の条件は、視線３１３と投射線３０８との交差位置（例えば、点６０２）の深度と、投射線３０８が撮影投影面３０３上で通る画素の深度情報が示す深度（例えば、点６０１の深度）との差が所定の長さ以上であることであってよい。なお、図６（ｂ）は、図６（ａ）の一部を拡大した図である。図６（ｂ）に示すように、深度差は、距離差の深度方向の成分を表している。投射線３０８が、撮影投影面３０３と平行な面となす角度θが鋭角に近づくと、深度差は小さくなってしまう。そのため、より高精度に死角領域を判定する場合、距離差を用いることが好ましい。 The predetermined conditions include, for example, the intersection position of the line of sight 313 and the projection line 308 (for example, point 602), and the position of the subject 110 indicated by the depth information of the pixel through which the projection line 308 passes on the photographic projection plane 303 (for example, the point 602). 601) is greater than or equal to a predetermined distance. In another example, the predetermined condition is the depth of the intersection position (for example, point 602) of the line of sight 313 and the projection line 308, and the depth indicated by the depth information of the pixel through which the projection line 308 passes on the imaging projection plane 303 (for example, , depth of point 601) may be a predetermined length or more. Note that FIG. 6(b) is a partially enlarged view of FIG. 6(a). As shown in FIG. 6(b), the depth difference represents a component of the distance difference in the depth direction. When the angle θ that the projection line 308 makes with a plane parallel to the imaging projection plane 303 approaches an acute angle, the depth difference becomes small. Therefore, when determining the blind spot area with higher accuracy, it is preferable to use the distance difference.

続いて、実施形態に係る視点画像における遮蔽領域の識別について説明する。 Next, identification of a shielding area in a viewpoint image according to an embodiment will be described.

（遮蔽領域の識別）
例えば、撮影画像において見えている領域が、視点を動かした結果、手前に別の被写体１１０が被り見えなくなることがある。このように、撮影画像において見えているのに視点画像において見えなくなる視点画像の領域を、例えば、遮蔽領域と呼ぶ。 (Identification of shielded area)
For example, as a result of moving the viewpoint, an area that is visible in the photographed image may be obscured by another subject 110 in the foreground. In this way, a region of the viewpoint image that is visible in the photographed image but becomes invisible in the viewpoint image is called, for example, a shielding region.

図７は、実施形態に係る遮蔽領域について例示する図である。例えば、図７に示すように、１つの視線３１３が、複数の投射線３０８と被写体１１０上で交差することがある。図７では、視線３１３は、投射線３０８ａと被写体１１０上の点７０１の位置で交差しており、投射線３０８ｂと被写体１１０上の点７０２の位置で交差している。 FIG. 7 is a diagram illustrating the shielding area according to the embodiment. For example, as shown in FIG. 7, one line of sight 313 may intersect with a plurality of projection lines 308 on the subject 110. In FIG. 7, the line of sight 313 intersects the projection line 308a at a point 701 on the subject 110, and intersects the projection line 308b at a point 702 on the subject 110.

しかしながら、点７０１の位置は、視点３１０からは被写体１１０の裏に隠れて見えない。そのため、視線３１３が通る視点投影面３１１上の視点画像の画素の値には、投射線３０８ｂが通る撮影投影面３０３上の撮影画像の画素の画素値を用いることが正しい。このように視線３１３が、複数の投射線３０８と被写体１１０上で交差することがある。この場合に、制御部２０１は、例えば、より視点３１０に近い側で交差する投射線３０８が選択されるように、視線３１３と対応する投射線３０８を特定してよい。それにより、視点３１０の位置から見た視点画像を正しく生成することができる。 However, the position of the point 701 is hidden behind the subject 110 and cannot be seen from the viewpoint 310. Therefore, it is correct to use the pixel values of the pixels of the photographed image on the photographic projection plane 303 through which the projection line 308b passes as the values of the pixels of the viewpoint image on the viewpoint projection plane 311 through which the line of sight 313 passes. In this way, the line of sight 313 may intersect with a plurality of projection lines 308 on the subject 110. In this case, the control unit 201 may specify the projection line 308 that corresponds to the line of sight 313 so that, for example, the projection line 308 that intersects on the side closer to the viewpoint 310 is selected. Thereby, a viewpoint image viewed from the position of viewpoint 310 can be correctly generated.

例えば、以上で述べたように、撮影画像から視点画像を生成することができる。以下、実施形態に係る視点画像の生成処理の動作フローを説明する。 For example, as described above, a viewpoint image can be generated from a captured image. The operation flow of the viewpoint image generation process according to the embodiment will be described below.

図８は、実施形態に係る視点画像の生成処理の動作フローを例示する図である。例えば、制御部２０１は、深度情報を有する撮影画像に対する視点変換の実行指示が入力されると、図８の動作フローを開始してよい。深度情報を有する撮影画像は、例えば、ＲＧＢ－Ｄカメラで撮影された一組の撮影画像と深度画像であってよい。別の例では、深度情報を有する撮影画像は、ステレオカメラで撮影された画像であってもよい。以下、深度情報を有する撮影画像が、一組の撮影画像と深度画像である場合を例に説明を行う。 FIG. 8 is a diagram illustrating an operational flow of viewpoint image generation processing according to the embodiment. For example, the control unit 201 may start the operation flow shown in FIG. 8 when an instruction to perform viewpoint conversion on a photographed image having depth information is input. The photographed image having depth information may be, for example, a set of photographed image and depth image photographed with an RGB-D camera. In another example, the photographed image having depth information may be an image photographed with a stereo camera. Hereinafter, an example will be described in which a photographed image having depth information is a set of a photographed image and a depth image.

ステップ８０１（以降、ステップを“Ｓ”と記載し、例えば、Ｓ８０１と表記する）において制御部２０１は、仮想空間上に撮影装置１０２の透視投影モデルを配置する。例えば、制御部２０１は、図３で述べたように、深度情報を有する撮影画像の撮影に用いられた撮影装置１０２のカメラパラメータを取得し、仮想空間上に光学中心３０１、光軸３０２、撮影投影面３０３などの透視投影モデルを配置してよい。カメラパラメータは、例えば、予め記憶部２０２に記憶されていてよい。 In step 801 (hereinafter, step will be abbreviated as "S", for example, S801), the control unit 201 arranges a perspective projection model of the imaging device 102 on the virtual space. For example, as described in FIG. 3, the control unit 201 acquires the camera parameters of the photographing device 102 used to photograph the photographed image having depth information, and displays the optical center 301, optical axis 302, photographing image, etc. on the virtual space. A perspective projection model such as the projection plane 303 may be arranged. The camera parameters may be stored in the storage unit 202 in advance, for example.

Ｓ８０２において制御部２０１は、例えば、撮影投影面３０３上に撮影画像および深度画像を配置する。 In S802, the control unit 201 arranges the photographed image and the depth image on the photographic projection plane 303, for example.

Ｓ８０３において制御部２０１は、例えば、仮想空間上に視点３１０と視点画像を投影する視点投影面３１１を配置し、また、撮影投影面３０３に平行に３次元平面３０５を配置する。３次元平面３０５は、一例では、撮影投影面３０３と重なるように配置されてよい。 In S803, the control unit 201 arranges, for example, a viewpoint 310 and a viewpoint projection plane 311 for projecting a viewpoint image on the virtual space, and also arranges a three-dimensional plane 305 parallel to the imaging projection plane 303. In one example, the three-dimensional plane 305 may be arranged to overlap the imaging projection plane 303.

Ｓ８０４において制御部２０１は、視点投影面３１１上に生成する視点画像の画素のうちから１つの画素を処理対象画素として選択する。 In S804, the control unit 201 selects one pixel from among the pixels of the viewpoint image generated on the viewpoint projection plane 311 as a processing target pixel.

Ｓ８０５において制御部２０１は、視点３１０から視点画像の選択した処理対象画素を通る視線３１３と対応する投射線３０８を特定する。制御部２０１は、例えば、図４および図５を参照して述べたように、投射線３０８が通る画素の深度情報が示す被写体１１０の深度と、投射線３０８と視線３１３との交差位置の深度との深度差が最小となる投射線３０８を特定してよい。なお、例示的な投射線３０８の特定処理の動作フローを、図９を参照して後述する。 In S805, the control unit 201 identifies the projection line 308 corresponding to the line of sight 313 passing through the selected processing target pixel of the viewpoint image from the viewpoint 310. For example, as described with reference to FIGS. 4 and 5, the control unit 201 determines the depth of the subject 110 indicated by the depth information of the pixel through which the projection line 308 passes, and the depth of the intersection position of the projection line 308 and the line of sight 313. The projection line 308 that has the minimum depth difference between the projection line and the projection line 308 may be identified. Note that the operational flow of an exemplary process for specifying the projection line 308 will be described later with reference to FIG. 9 .

Ｓ８０６において制御部２０１は、図６を参照して述べたように、視点画像の処理対象画素が死角領域であるか否かを判定する。例えば、制御部２０１は、深度差が最小となる投射線３０８と対応する撮影画像の画素の深度が示す位置と、深度差が最小となる投射線３０８と視線３１３との交差位置とが所定の条件を満たして離れている場合に、処理対象画素が死角領域であると判定してよい。 In S806, the control unit 201 determines whether the processing target pixel of the viewpoint image is in a blind spot area, as described with reference to FIG. For example, the control unit 201 determines that the position indicated by the depth of the pixel of the photographed image corresponding to the projection line 308 where the depth difference is the minimum, and the intersection position of the projection line 308 and the line of sight 313 where the depth difference is the minimum are predetermined. If the conditions are met and the pixel is far away, it may be determined that the pixel to be processed is in the blind spot area.

Ｓ８０６において処理対象画素が死角領域である場合（Ｓ８０６がＹＥＳ）、フローはＳ８０９に進む。Ｓ８０９において制御部２０１は、視点画像において処理対象画素を死角領域に設定し、フローはＳ８１０に進む。 If the pixel to be processed is a blind spot area in S806 (YES in S806), the flow advances to S809. In S809, the control unit 201 sets the processing target pixel in the viewpoint image as a blind spot area, and the flow advances to S810.

一方、Ｓ８０６において処理対象画素が死角領域でない場合（Ｓ８０６がＮＯ）、フローはＳ８０７に進む。Ｓ８０７において制御部２０１は、特定した深度差が最小となる投射線３０８が通る撮影投影面３０３上の撮影画像の画素から画素値を取得する。 On the other hand, if the pixel to be processed is not a blind spot area in S806 (NO in S806), the flow advances to S807. In S807, the control unit 201 acquires pixel values from pixels of the photographed image on the photographic projection plane 303 through which the projection line 308 through which the identified depth difference is the minimum passes.

Ｓ８０８において制御部２０１は、取得した画素値を、視点画像の処理対象画素に設定し、フローはＳ８１０に進む。 In S808, the control unit 201 sets the acquired pixel value as the processing target pixel of the viewpoint image, and the flow advances to S810.

Ｓ８１０において制御部２０１は、視点画像の全ての画素を処理したか否かを判定する。未処理の画素が視点画像にまだ含まれている場合、フローはＳ８０４に戻り、未処理の画素を処理対象画素として選択して処理を繰り返す。一方、Ｓ８１０において視点画像の全ての画素の処理が完了している場合、フローはＳ８１１に進む。 In S810, the control unit 201 determines whether all pixels of the viewpoint image have been processed. If the viewpoint image still contains unprocessed pixels, the flow returns to S804, selects the unprocessed pixels as processing target pixels, and repeats the process. On the other hand, if processing of all pixels of the viewpoint image has been completed in S810, the flow advances to S811.

Ｓ８１１において制御部２０１は、視点画像の画素に対して設定した画素値および死角領域の情報に基づいて、視点画像を生成する。 In S811, the control unit 201 generates a viewpoint image based on the pixel value set for the pixel of the viewpoint image and the blind spot area information.

Ｓ８１２において制御部２０１は、視点画像を出力し、本動作フローは終了する。制御部２０１は、例えば、視点画像の画像データを記憶部２０２に保存してもよいし、別な例では、ディスプレイなどの表示装置２０４に視点画像を表示させてもよい。 In S812, the control unit 201 outputs the viewpoint image, and this operation flow ends. For example, the control unit 201 may store the image data of the viewpoint image in the storage unit 202, or in another example, may display the viewpoint image on a display device 204 such as a display.

また、例えば、表示装置２０４に視点画像を表示させる場合に、制御部２０１は、視点画像のうちの死角領域を所定の表示形式で表示させることで、識別可能に表示させてよい。それにより、視点画像を見たユーザは、死角領域には視点の位置から見た場合に見えるようになった、撮影画像には写っていない何かが写っていることを知ることができる。 Further, for example, when displaying a viewpoint image on the display device 204, the control unit 201 may display the blind spot area of the viewpoint image in a predetermined display format so that it can be identified. Thereby, the user viewing the viewpoint image can know that something that is not shown in the photographed image is visible in the blind spot when viewed from the viewpoint position.

以上で述べたように、図８の動作フローによれば、深度情報を有する撮影画像に写る被写体１１０を任意の視点から見た視点画像を生成することができる。 As described above, according to the operation flow shown in FIG. 8, it is possible to generate a viewpoint image in which the subject 110 appearing in the photographed image having depth information is viewed from an arbitrary viewpoint.

また、図８の動作フローで述べたように、実施形態では、撮影画像と視点画像の透視投影モデルを仮想空間上に配置し、光線追跡により視点画像の画素と対応する撮影画像の画素を特定している。そのため、例えば、被写体の３次元形状モデルを生成しなくても、視点画像を生成することができる。 In addition, as described in the operation flow of FIG. 8, in the embodiment, a perspective projection model of the photographed image and the viewpoint image is placed in the virtual space, and the pixels of the photographed image that correspond to the pixels of the viewpoint image are identified by ray tracing. are doing. Therefore, for example, a viewpoint image can be generated without generating a three-dimensional shape model of the subject.

例えば、イメージベーストレンダリングの技術としてテクスチャマッピングおよび投影テクスチャマッピングなどの技術が知られている。これらのテクスチャマッピングおよび投影テクスチャマッピングでは、被写体の３次元形状モデルを生成するため、メモリの使用量が多く、また、計算量も多くなる。上述の実施形態では、３次元形状モデルを生成しなくても視点画像を生成できるため、視点画像の生成にかかるメモリ使用量を低減することができる。また、視点画像を高速に生成することが可能である。 For example, techniques such as texture mapping and projected texture mapping are known as image-based rendering techniques. These texture mappings and projected texture mappings generate a three-dimensional shape model of the subject, and therefore use a large amount of memory and also require a large amount of calculation. In the above-described embodiment, since a viewpoint image can be generated without generating a three-dimensional shape model, it is possible to reduce the amount of memory used for generating a viewpoint image. Furthermore, it is possible to generate viewpoint images at high speed.

また、上述の実施形態では、仮想空間上で撮影時の実際の空間と同じスケールで演算を行っており、例えば、投影座標系での計算や正規化を行わなくてもよいため、実空間の長さの単位で厳密に視点を変換した視点画像を生成することができる。 In addition, in the above embodiment, calculations are performed in the virtual space at the same scale as the actual space at the time of shooting, and for example, there is no need to perform calculations or normalization in the projected coordinate system. It is possible to generate a viewpoint image in which the viewpoint is strictly converted in units of length.

また、例えば、画像変換の技術としてホモグラフィ変換が知られている。ホモグラフィ変換では、変換後に別の被写体１１０で遮蔽されて見えなくなるはずの領域が、変換後の画像に表示されてしまうことがある。上述の実施形態によれば、例えば図７を参照して例示したように、視点の変換後に別の被写体で遮蔽されて見えなくなる遮蔽領域を指す投射線３０８は選択されないため、遮蔽領域の遮蔽を視点画像で正しく表現することができる。 Further, for example, homography conversion is known as an image conversion technique. In homography conversion, an area that should be hidden by another subject 110 and become invisible after conversion may end up being displayed in the converted image. According to the above-described embodiment, as illustrated with reference to FIG. 7, for example, the projection line 308 that points to an occluded area that is occluded by another subject and becomes invisible after the viewpoint conversion is not selected, so that the occluded area is not occluded. It can be expressed correctly using a viewpoint image.

なお、上述の実施形態では、光学中心３０１、光軸３０２、撮影投影面３０３、視点３１０、視点投影面３１１などの立体配置を規定し、これらが規定した立体配置に配置されているものとして演算を行っている。換言すると、これらの光学中心３０１、光軸３０２、撮影投影面３０３、視点３１０、視点投影面３１１は、実際にポリゴンなどとして仮想空間上に配置しなくても演算を行うことができる。また、上述の実施形態では、３次元平面３０５をポリゴンとして仮想空間上に配置している。３次元平面３０５をポリゴンとして仮想空間に配置することで、ＧＰＵ（Graphics Processing Unit）のピクセルシェーダに透視投影結果のピクセルを渡して高速に処理することが可能になる。しかしながら、３次元平面３０５のポリゴンは、一例では、仮想空間上に配置されなくてもよい。 In the above embodiment, the three-dimensional arrangement of the optical center 301, the optical axis 302, the imaging projection plane 303, the viewpoint 310, the viewpoint projection plane 311, etc. is defined, and the calculation is performed assuming that these are arranged in the defined three-dimensional arrangement. It is carried out. In other words, the optical center 301, optical axis 302, imaging projection plane 303, viewpoint 310, and viewpoint projection plane 311 can be calculated without actually being arranged as polygons or the like in the virtual space. Furthermore, in the embodiment described above, the three-dimensional plane 305 is arranged as a polygon in the virtual space. By arranging the three-dimensional plane 305 as a polygon in the virtual space, it becomes possible to pass the pixels of the perspective projection result to a pixel shader of a GPU (Graphics Processing Unit) and process them at high speed. However, in one example, the polygons of the three-dimensional plane 305 do not have to be placed in the virtual space.

従って、実施形態によれば被写体が写る撮影画像から、その被写体が別の向きで写る視点画像を生成する処理の負荷を低減することができる。 Therefore, according to the embodiment, it is possible to reduce the processing load of generating a viewpoint image in which the subject is seen in a different direction from a photographed image in which the subject is seen.

続いて、実施形態に係る視線３１３と対応する投射線３０８の特定処理を例示する。図９は、実施形態に係る視線３１３と対応する投射線３０８の特定処理の動作フローを例示する図である。また、図１０は、実施形態に係る視線３１３と対応する投射線３０８の特定処理の流れを例示する図である。例えば、制御部２０１は、図８のＳ８０５に進むと、図９の動作フローを開始してよい。 Next, a process for specifying the projection line 308 corresponding to the line of sight 313 according to the embodiment will be illustrated. FIG. 9 is a diagram illustrating the operational flow of the process of specifying the projection line 308 corresponding to the line of sight 313 according to the embodiment. Further, FIG. 10 is a diagram illustrating the flow of the process of specifying the projection line 308 corresponding to the line of sight 313 according to the embodiment. For example, when the control unit 201 proceeds to S805 in FIG. 8, it may start the operation flow in FIG. 9.

Ｓ９０１において制御部２０１は、例えば、視点投影面３１１上に配置した視点画像から選択した処理対象画素３１２を通る視点３１０からの３次元平面３０５へのベクトル１００１を求める（図１０（ａ）の（１））。なお、以下、このベクトルを視線ベクトル１００１と呼ぶことがある。 In S901, the control unit 201 obtains, for example, a vector 1001 from the viewpoint 310 to the three-dimensional plane 305 passing through the processing target pixel 312 selected from the viewpoint image arranged on the viewpoint projection plane 311 (((a) in FIG. 10A). 1)). Note that, hereinafter, this vector may be referred to as the line-of-sight vector 1001.

Ｓ９０２において制御部２０１は、例えば、視線ベクトル１００１と３次元平面３０５との交点を求める。 In S902, the control unit 201 determines, for example, the intersection between the line-of-sight vector 1001 and the three-dimensional plane 305.

Ｓ９０３において制御部２０１は、例えば、３次元平面３０５と平行に所定の深度に層１００２を配置する（図１０（ａ）の（２））。なお、所定の深度は、例えば、撮影画像に写る最前面の被写体１１０よりも浅い深度に層１００２が配置されるように設定されてよい。 In S903, the control unit 201 arranges the layer 1002 at a predetermined depth parallel to the three-dimensional plane 305 ((2) in FIG. 10A), for example. Note that the predetermined depth may be set, for example, so that the layer 1002 is placed at a shallower depth than the foreground subject 110 in the captured image.

Ｓ９０４において制御部２０１は、例えば、視線３１３と層１００２との交点１００３を求める（図１０（ａ）の（３））。 In S904, the control unit 201 determines, for example, the intersection 1003 between the line of sight 313 and the layer 1002 ((3) in FIG. 10A).

Ｓ９０５において制御部２０１は、例えば、交点１００３へと向かう光学中心３０１からの投射線ベクトル１００４を求める（図１０（ａ）の（４））。 In S905, the control unit 201 obtains, for example, a projected line vector 1004 from the optical center 301 toward the intersection 1003 ((4) in FIG. 10(a)).

Ｓ９０６において制御部２０１は、例えば、投射線ベクトル１００４と撮影投影面３０３との交点を求め、交点の位置にある深度画像の画素３０７を特定する（図１０（ａ）の（５））。例えば、制御部２０１は、光学中心３０１、焦点距離、および投射線ベクトル１００４に基づいて撮影投影面３０３との交点を求め、交点の位置にある深度画像の画素３０７を特定してよい。 In S906, the control unit 201, for example, finds the intersection between the projection line vector 1004 and the imaging projection plane 303, and specifies the pixel 307 of the depth image located at the intersection ((5) in FIG. 10A). For example, the control unit 201 may find the intersection with the imaging projection plane 303 based on the optical center 301, the focal length, and the projection line vector 1004, and specify the pixel 307 of the depth image located at the intersection.

Ｓ９０７において制御部２０１は、例えば、深度画像から特定した画素３０７の深度情報から被写体１１０の深度を特定する（図１０（ａ）の（６））。なお、図１０（ａ）の（６）の例では、画素３０７の深度情報から特定した被写体１１０の深度として、３次元平面３０５からの被写体１１０の深度が示されている。３次元平面３０５からの被写体１１０の深度は、例えば、画素３０７の深度情報が示す深度から焦点距離を引くことで求めることができる。 In S907, the control unit 201 identifies the depth of the subject 110, for example, from the depth information of the pixel 307 identified from the depth image ((6) in FIG. 10A). Note that in the example (6) of FIG. 10A, the depth of the subject 110 from the three-dimensional plane 305 is shown as the depth of the subject 110 specified from the depth information of the pixel 307. The depth of the subject 110 from the three-dimensional plane 305 can be determined, for example, by subtracting the focal length from the depth indicated by the depth information of the pixel 307.

Ｓ９０８において制御部２０１は、例えば、画素３０７の深度情報から特定した被写体１１０の深度と、視線ベクトル１００１と投射線ベクトル１００４との交差位置となる層１００２の深度との深度方向の配置が反転したか否かを判定する。 In S908, the control unit 201 determines that, for example, the arrangement in the depth direction between the depth of the subject 110 identified from the depth information of the pixel 307 and the depth of the layer 1002 at the intersection of the line-of-sight vector 1001 and the projection line vector 1004 is reversed. Determine whether or not.

例えば、図１０（ａ）では、層１００２の初期の配置において、画素３０７の深度情報から特定した被写体１１０の深度よりも、視線ベクトル１００１と投射線ベクトル１００４との交差位置となる層１００２の深度の方が浅くなっている。この場合、Ｓ９０８の処理では、画素３０７の深度情報から特定した被写体１１０の深度よりも、層１００２の深度の方が深くなった場合に、被写体１１０の深度と、層１００２の深度方向の配置が反転したと判定してよい。被写体１１０の深度と、層１００２の深度方向の配置が反転していない場合、制御部２０１は、Ｓ９０８においてＮＯと判定し、フローはＳ９０９に進む。 For example, in FIG. 10A, in the initial arrangement of the layer 1002, the depth of the layer 1002 at the intersection of the line-of-sight vector 1001 and the projection line vector 1004 is greater than the depth of the subject 110 identified from the depth information of the pixel 307. is shallower. In this case, in the process of S908, if the depth of the layer 1002 is deeper than the depth of the object 110 specified from the depth information of the pixel 307, the depth of the object 110 and the arrangement of the layer 1002 in the depth direction are It may be determined that the image has been reversed. If the depth of the subject 110 and the arrangement of the layer 1002 in the depth direction are not reversed, the control unit 201 determines NO in S908, and the flow advances to S909.

Ｓ９０９において制御部２０１は、画素３０７の深度情報が示す被写体１１０の深度のある方向に、層１００２を所定の移動量で移動させてフローはＳ９０４に戻り、処理を繰り返す。例えば、制御部２０１は、図１０（ｂ）の（７）に示すように、層１００２の位置を移動させて、画素３０７が示す被写体１１０の深度の或る方向に層１００２を近づけてよい。これにより、制御部２０１は、層１００２の位置を徐々に変更して、画素３０７の深度情報が示す被写体１１０の深度と近い深度で、視線ベクトル１００１と投射線ベクトル１００４とが交差する層１００２を探索することができる。 In S909, the control unit 201 moves the layer 1002 by a predetermined amount of movement in the direction of the depth of the subject 110 indicated by the depth information of the pixel 307, and the flow returns to S904 to repeat the process. For example, the control unit 201 may move the position of the layer 1002 to bring the layer 1002 closer to a certain direction of the depth of the subject 110 indicated by the pixel 307, as shown in (7) of FIG. 10(b). As a result, the control unit 201 gradually changes the position of the layer 1002 to locate the layer 1002 where the line-of-sight vector 1001 and the projection line vector 1004 intersect at a depth close to the depth of the subject 110 indicated by the depth information of the pixel 307. You can explore.

また、Ｓ９０８において、例えば、被写体１１０の深度と、層１００２の深度方向の配置が反転した場合、制御部２０１は、ＹＥＳと判定し、フローはＳ９１０に進む。Ｓ９１０において制御部２０１は、層１００２の移動量と移動方向を所定回数変更済みであるか否かを判定する。 Further, in S908, for example, if the depth of the subject 110 and the arrangement of the layer 1002 in the depth direction are reversed, the control unit 201 determines YES, and the flow advances to S910. In S910, the control unit 201 determines whether the amount and direction of movement of the layer 1002 have been changed a predetermined number of times.

Ｓ９１０において層１００２の移動量と移動方向を所定回数変更済みでない場合（Ｓ９１０がＮＯ）、フローはＳ９１１に進む。Ｓ９１１において制御部２０１は、層１００２の移動方向を逆方向に反転させて、かつ、移動量を１／２～１／１０など所定の割合で小さく変更し、フローはＳ９０９に戻る。例えば、制御部２０１は、図１０（ｃ）の（８）に示すように、層１００２の移動方向を逆方向に反転させて、移動量を小さくし、探索を継続してよい。それにより、画素３０７の深度情報が示す被写体１１０の深度と近い深度で、視線ベクトル１００１と投射線ベクトル１００４とが交差する層１００２を更に細かく探索することができる。 If the amount and direction of movement of the layer 1002 have not been changed a predetermined number of times in S910 (NO in S910), the flow advances to S911. In S911, the control unit 201 reverses the moving direction of the layer 1002 and changes the moving amount by a predetermined ratio such as 1/2 to 1/10, and the flow returns to S909. For example, as shown in (8) of FIG. 10C, the control unit 201 may reverse the moving direction of the layer 1002 to reduce the moving amount and continue the search. Thereby, the layer 1002 where the line-of-sight vector 1001 and the projection line vector 1004 intersect can be searched more precisely at a depth close to the depth of the subject 110 indicated by the depth information of the pixel 307.

また、Ｓ９１０において層１００２の移動量と移動方向を所定回数変更済みである場合（Ｓ９１０がＹＥＳ）、フローはＳ９１２に進む。Ｓ９１２において制御部２０１は、最終的な層１００２と視線ベクトル１００１との交点に向けて延ばした投射線ベクトル１００４を視線３１３と対応する投射線３０８として特定し、本動作フローは終了し、フローはＳ８０６に進む。 Further, if the amount and direction of movement of the layer 1002 have been changed a predetermined number of times in S910 (YES in S910), the flow advances to S912. In S912, the control unit 201 specifies the projection line vector 1004 extended toward the intersection of the final layer 1002 and the line of sight vector 1001 as the projection line 308 corresponding to the line of sight 313, and this operation flow ends. The process advances to S806.

以上で述べたように、図９の動作フローによれば、投射線３０８が通る深度画像の画素が示す被写体１１０の深度と、投射線３０８と視線３１３との交差位置の深度との深度差が最小となる投射線３０８を特定することができる。なお、深度差が最小となる投射線３０８とは、一例では、光学中心３０１から撮影画像の画素に下した投射線のうちで深度差が最小となる投射線３０８であってよい。別の例では、深度差が最小となる投射線３０８とは、探索された投射線３０８の中で深度差が最小となる投射線３０８であってもよい。 As described above, according to the operation flow of FIG. 9, the depth difference between the depth of the subject 110 indicated by the pixels of the depth image through which the projection line 308 passes and the depth at the intersection position of the projection line 308 and the line of sight 313 is The minimum projection line 308 can be identified. Note that the projection line 308 with the minimum depth difference may be, for example, the projection line 308 with the minimum depth difference among the projection lines drawn from the optical center 301 to the pixels of the photographed image. In another example, the projection line 308 with the minimum depth difference may be the projection line 308 with the minimum depth difference among the searched projection lines 308.

なお、Ｓ９１０における所定回数は、回数が多ければ、画素３０７の深度情報が示す被写体１１０の深度と近い深度で、視線ベクトル１００１と投射線ベクトル１００４とが交差する層１００２の探索精度が向上する。一方で、回数が多くなると、計算量が多くなり、処理速度の低下を招く。そのため、所定回数は、実施形態を適用する状況に応じて好ましい値に設定されてよい。例えば、所定回数は、２回～５回に設定されていてよく、一例では、３回に設定されていてよい。 Note that if the predetermined number of times in S910 is large, the search accuracy for the layer 1002 where the line-of-sight vector 1001 and the projection line vector 1004 intersect will be improved at a depth close to the depth of the subject 110 indicated by the depth information of the pixel 307. On the other hand, when the number of times increases, the amount of calculation increases, leading to a decrease in processing speed. Therefore, the predetermined number of times may be set to a preferable value depending on the situation in which the embodiment is applied. For example, the predetermined number of times may be set to 2 to 5 times, and in one example, may be set to 3 times.

また、図９の動作フローでは、３次元平面３０５との平行に配置される層１００２の初期の深度は、被写体１１０よりも浅い深度となるように設定されていてよい。それにより、例えば、図７を参照して述べたように、視線３１３と被写体１１０上で交差する複数の投射線３０８が存在する場合にも、より浅い深度で視線３１３と交差する投射線３０８を特定することができる。それにより、撮影画像において見えていた領域が視点を移動した視点画像において、手前に別の被写体１１０が被さり見えなくなる場合にも、その遮蔽を視点画像において正しく表現することができる。 Further, in the operation flow of FIG. 9, the initial depth of the layer 1002 arranged parallel to the three-dimensional plane 305 may be set to be shallower than the depth of the subject 110. As a result, for example, as described with reference to FIG. 7, even when there are a plurality of projection lines 308 that intersect the line of sight 313 on the subject 110, the projection lines 308 that intersect the line of sight 313 at a shallower depth can be can be specified. Thereby, even when another subject 110 is covered in the foreground in a viewpoint image in which the viewpoint has been moved to an area that was visible in the photographed image, the occlusion can be accurately expressed in the viewpoint image.

また、図９の処理に示すように、視点３１０から光学中心３０１に向けて光輸送経路を逆追跡することで、最小限の光線のみを対象に計算を行うことができる。例えば、視点画像において用いられていない撮影画像上の画素については処理対象から省くことができる。また、視線３１３と交差する投射線３０８のみを対象として探索を行うことができる。そのため、計算量を削減することができる。 Further, as shown in the process of FIG. 9, by tracing the light transport path back from the viewpoint 310 toward the optical center 301, calculation can be performed using only the minimum number of light rays. For example, pixels on the captured image that are not used in the viewpoint image can be omitted from the processing target. Further, it is possible to search only for the projection line 308 that intersects the line of sight 313. Therefore, the amount of calculation can be reduced.

図１１は、実施形態に係る撮影画像と視点画像とを例示する図である。図１１では、中央に撮影画像１１０１が示されている。また、撮影画像１１０１の上下左右には、それぞれ光軸上に注視点を固定した上で光学中心を原点とするカメラ座標系のｘ軸またはｙ軸の方向に視点を移動させた場合の視点画像１１０２が示されている。視点画像１１０２には、死角領域１１０５が含まれており、撮影画像１１０１には写っていない被写体１１０の領域が死角領域１１０５として表示されている。 FIG. 11 is a diagram illustrating a photographed image and a viewpoint image according to the embodiment. In FIG. 11, a captured image 1101 is shown in the center. Additionally, on the top, bottom, left and right of the photographed image 1101, there are viewpoint images obtained by fixing the gaze point on the optical axis and moving the viewpoint in the direction of the x-axis or y-axis of the camera coordinate system with the optical center as the origin. 1102 is shown. The viewpoint image 1102 includes a blind spot area 1105, and an area of the subject 110 that is not captured in the captured image 1101 is displayed as the blind spot area 1105.

また、実施形態によれば、図１１に示すように、様々な視点３１０から見た視点画像１１０２を生成することができる。そして、様々な視点３１０から見た視点画像１１０２を、被写体１１０として写る部品、製品、製造物、および成形物などの形状並びに色味の検査、判定、および識別に利用することができる。 Further, according to the embodiment, as shown in FIG. 11, viewpoint images 1102 viewed from various viewpoints 310 can be generated. The viewpoint images 1102 seen from various viewpoints 310 can be used for inspecting, determining, and identifying the shape and color of parts, products, manufactured products, molded objects, etc. that are photographed as the subject 110.

また、例えば、障害物などの存在により、撮影装置１０２の設置位置に制約があり、被写体１１０の位置認識および形状認識などの処理に入力するのに適した角度から被写体１１０を撮影できないことがある。実施形態によれば、撮影画像を様々な視点３１０から見た視点画像に変換できるため、撮影装置１０２の設置位置に制約がある場合にも、例えば、位置認識および形状認識などの処理に入力するのに適した視点画像を生成することができる。 Furthermore, for example, due to the presence of an obstacle, there are restrictions on the installation position of the photographing device 102, and it may not be possible to photograph the subject 110 from an angle suitable for inputting it into processes such as position recognition and shape recognition of the subject 110. . According to the embodiment, since captured images can be converted into viewpoint images seen from various viewpoints 310, even when there are restrictions on the installation position of the imaging device 102, the images can be input into processing such as position recognition and shape recognition. It is possible to generate viewpoint images suitable for

なお、図１１に示すように、１か所から撮影された深度情報を有する撮影画像１１０１を用いて視点の移動を行い視点画像１１０２の生成を行うと、被写体１１０の凹凸などに起因して視点画像１１０２に死角領域が含まれてしまうことがある。この様な死角領域は、別の方向から撮影された死角領域が写る撮影画像を用いて被写体１１０を写した画素に置き換えることができる。以下、複数の位置から撮影された撮影画像を用いた視点画像の生成について説明する。 Note that, as shown in FIG. 11, when a viewpoint image 1102 is generated by moving the viewpoint using a photographed image 1101 having depth information taken from one location, the viewpoint may change due to unevenness of the subject 110, etc. The image 1102 may include a blind spot area. Such a blind spot area can be replaced with pixels that capture the subject 110 using a photographed image in which the blind spot area is photographed from a different direction. Hereinafter, generation of a viewpoint image using captured images captured from a plurality of positions will be explained.

図１２は、実施形態に係る複数の位置から撮影された撮影画像を用いた視点画像の生成について例示する図である。図１２において、第１の光学中心３０１ａから撮影投影面３０３ａの像を撮影した画像を第１の撮影画像とする。また、第２の光学中心３０１ｂから撮影投影面３０３ｂ上の像を撮影した画像を第２の撮影画像とする。なお、第１の撮影画像および第２の撮影画像は、いずれも画素が深度情報と対応づけられている。そして、第１の撮影画像と第２の撮影画像とを用いて視点３１０から被写体１１０を見た場合の視点画像を生成する例を以下に述べる。 FIG. 12 is a diagram illustrating generation of a viewpoint image using captured images captured from a plurality of positions according to the embodiment. In FIG. 12, an image obtained by photographing an image of the photographic projection plane 303a from the first optical center 301a is defined as a first photographed image. Further, an image obtained by photographing an image on the photographing projection surface 303b from the second optical center 301b is set as a second photographed image. Note that in both the first captured image and the second captured image, pixels are associated with depth information. An example of generating a viewpoint image when viewing the subject 110 from the viewpoint 310 using the first captured image and the second captured image will be described below.

図１２において、例えば、点１２１１は、第１の光学中心３０１ａからは手前にある被写体１１０で隠されて死角となっており、見ることができない。そのため、第１の光学中心３０１ａから撮影した第１の撮影画像には、点１２１１における被写体１１０の画素値を表す情報は含まれていない。一方、第２の光学中心３０１ｂからは点１２１１は見えており、第２の光学中心３０１ｂから撮影した第２の撮影画像には、点１２１１における被写体１１０の画素値を表す情報が含まれている。そのため、視点３１０から見た視点画像を生成する際に、第１の撮影画像だけを用いた場合には死角領域となってしまう画素の画素値を、第２の撮影画像から補うことが可能である。 In FIG. 12, for example, a point 1211 cannot be seen from the first optical center 301a because it is hidden by the subject 110 in the foreground and forms a blind spot. Therefore, the first captured image captured from the first optical center 301a does not include information representing the pixel value of the subject 110 at the point 1211. On the other hand, the point 1211 is visible from the second optical center 301b, and the second photographed image taken from the second optical center 301b includes information representing the pixel value of the subject 110 at the point 1211. . Therefore, when generating the viewpoint image seen from the viewpoint 310, it is possible to supplement the pixel values of pixels that would be in the blind area if only the first captured image was used, from the second captured image. be.

例えば、制御部２０１は、異なる２つの位置から撮影された２つの撮影画像のそれぞれから視点画像を得る。そして、得られた２つの視点画像のうちの一方で死角領域となっている画素が、他方の視点画像で画素値を有するとする。この場合、制御部２０１は、その画素の画素値として他方の視点画像から得た画素値を用いるように２つの視点画像を合成して視点画像を生成してよい。それにより、死角領域を減らした視点画像を生成することができる。なお、このように複数の視点画像を合成して得られる視点画像を、以下では、合成視点画像と呼ぶことがある。 For example, the control unit 201 obtains a viewpoint image from each of two captured images captured from two different positions. It is assumed that a pixel that is a blind spot area in one of the two obtained viewpoint images has a pixel value in the other viewpoint image. In this case, the control unit 201 may generate a viewpoint image by combining the two viewpoint images so that the pixel value obtained from the other viewpoint image is used as the pixel value of the pixel. Thereby, it is possible to generate a viewpoint image with reduced blind spot areas. Note that a viewpoint image obtained by combining a plurality of viewpoint images in this manner may be referred to as a composite viewpoint image below.

また、例えば、異なる２つの位置から撮影された２つの撮影画像のそれぞれから得た視点画像の両方が、或る画素に対して画素値を有することもある。図１３は、この場合を例示しており、被写体１１０の点１３１１の位置は、第１の光学中心３０１ａと第２の光学中心３０１ｂの双方から見えている。この場合、制御部２０１は、投射線３０８と、視線３１３との成す角がより小さい角度で撮影された撮影画像から画素値を取得してよい。例えば、図１３の例では、投射線３０８ａと、視線３１３との成す角度θ１よりも、投射線３０８ｂと、視線３１３との成す角度θ２の方が小さい。そのため、制御部２０１は、投射線３０８ｂが通る第２の撮影画像上の画素１３０１から、視線が通る視点画像上の画素１３０３の画素値を取得して、合成視点画像において用いてよい。それにより、視線３１３と近しい方向から視点画像を撮影している撮影画像から画素値を取得することができる。そのため、生成される合成視点画像の色味などの精度の向上を図ることができる。 Furthermore, for example, both viewpoint images obtained from two photographed images taken from two different positions may have a pixel value for a certain pixel. FIG. 13 illustrates this case, and the position of the point 1311 of the subject 110 is visible from both the first optical center 301a and the second optical center 301b. In this case, the control unit 201 may acquire pixel values from a captured image captured at a smaller angle between the projection line 308 and the line of sight 313. For example, in the example of FIG. 13, the angle θ2 between the projection line 308b and the line of sight 313 is smaller than the angle θ1 between the projection line 308a and the line of sight 313. Therefore, the control unit 201 may acquire the pixel value of pixel 1303 on the viewpoint image through which the line of sight passes from pixel 1301 on the second captured image through which the projection line 308b passes, and use it in the composite viewpoint image. Thereby, pixel values can be acquired from a photographed image in which the viewpoint image is photographed from a direction close to the line of sight 313. Therefore, it is possible to improve the accuracy of the color tone, etc. of the generated synthetic viewpoint image.

また、制御部２０１は、例えば、異なる２つの位置から撮影された２つの撮影画像のそれぞれから得たいずれの視点画像においても死角領域に設定されている画素については、合成視点画像においてその画素を死角領域に設定してよい。 In addition, for example, for a pixel that is set in a blind spot area in each of the viewpoint images obtained from two captured images taken from two different positions, the control unit 201 controls the pixel in the composite viewpoint image. It may be set in the blind spot area.

以上で述べたように、異なる位置から撮影された複数の撮影画像を用いることで、死角領域の画素値を補って合成視点画像を生成することができる。なお、図１３の例では、２つの撮影位置から撮影した撮影画像を用いる例が示されているが、実施形態はこれに限定されるものではなく、３以上の撮影位置から撮影された撮影画像を用いて１つの視点画像が生成されてもよい。 As described above, by using a plurality of captured images captured from different positions, it is possible to generate a composite viewpoint image by supplementing the pixel values in the blind spot area. Note that although the example in FIG. 13 shows an example in which photographed images photographed from two photographing positions are used, the embodiment is not limited to this, and photographed images photographed from three or more photographing positions are used. One viewpoint image may be generated using .

また、或る撮影位置から撮影した際に死角となってしまう領域が写るように別の撮影位置から撮影することで、死角領域の画素の画素値を効率的に補うことのできる撮影画像を得ることができる。 In addition, by shooting from another shooting position so that the area that becomes a blind spot when shooting from a certain shooting position is captured, a captured image can be obtained that can efficiently compensate for the pixel values of pixels in the blind spot area. be able to.

図１４は、実施形態に係る異なる２つの位置から撮影された撮影画像に基づく視点画像の生成処理の動作フローを例示する図である。例えば、制御部２０１は、視点画像の生成指示が入力されると、図１４の動作フローを開始してよい。 FIG. 14 is a diagram illustrating an operational flow of viewpoint image generation processing based on captured images captured from two different positions according to the embodiment. For example, the control unit 201 may start the operation flow shown in FIG. 14 when an instruction to generate a viewpoint image is input.

Ｓ１４０１において制御部２０１は、例えば、複数の撮影位置から撮影された深度情報を有する撮影画像のそれぞれから視点画像を生成する。なお、制御部２０１は、例えば、図８および図９の動作フローで例示した処理により、深度情報を有する撮影画像から視点画像を生成してよい。 In S1401, the control unit 201 generates a viewpoint image from each of the captured images having depth information captured from a plurality of shooting positions, for example. Note that the control unit 201 may generate a viewpoint image from a photographed image having depth information, for example, by the processing illustrated in the operational flows of FIGS. 8 and 9.

Ｓ１４０２において制御部２０１は、撮影位置ごとに生成された複数の視点画像を重ね合わせる。 In S1402, the control unit 201 superimposes a plurality of viewpoint images generated for each shooting position.

Ｓ１４０３において制御部２０１は、重ね合わせた視点画像内の各画素について、画素値が１つの視点画像のみで得られている画素については、得られている画素値を設定してよい。例えば、２つの撮影位置で撮影した２つの撮影画像のそれぞれから生成された視点画像において、一方の視点画像においては死角領域と設定されている画素が、他方の視点画像において画素値を有するとする。この場合、制御部２０１は、その画素の画素値として、他方の視点画像における画素値を用いるように合成視点画像を生成してよい。 In S1403, the control unit 201 may set the obtained pixel value for each pixel in the superimposed viewpoint images for a pixel whose pixel value is obtained from only one viewpoint image. For example, suppose that in perspective images generated from two captured images taken at two shooting positions, a pixel that is set as a blind spot area in one perspective image has a pixel value in the other perspective image. . In this case, the control unit 201 may generate a composite viewpoint image using the pixel value in the other viewpoint image as the pixel value of that pixel.

Ｓ１４０４において制御部２０１は、重ね合わせた視点画像内の各画素について、画素値が複数の視点画像で得られている画素については、視線３１３との成す角が最小の投射線３０８と対応する画素から得られた画素値を、合成視点画像の画素に設定する。それにより、視線３１３と近しい方向から撮影された撮影画像から画素値を取得することができる。 In S1404, for each pixel in the superimposed viewpoint images, the control unit 201 selects a pixel whose pixel value is obtained from a plurality of viewpoint images, the pixel corresponding to the projection line 308 that forms the smallest angle with the line of sight 313. The pixel value obtained from is set to the pixel of the composite viewpoint image. Thereby, pixel values can be acquired from a photographed image photographed from a direction close to the line of sight 313.

Ｓ１４０５において制御部２０１は、死角領域を設定する。例えば、制御部２０１は、重ね合わせた複数の視点画像の全てにおいて死角領域と設定されている画素については、合成視点画像においても死角領域に設定してよい。 In S1405, the control unit 201 sets a blind spot area. For example, the control unit 201 may set pixels that are set as blind spot areas in all of the plurality of superimposed viewpoint images to be blind spot areas in the composite viewpoint image as well.

Ｓ１４０６において制御部２０１は、Ｓ１４０３からＳ１４０５の処理で設定した画素の値に基づいて、複数の視点画像を合成し、複数の撮影画像に基づく合成視点画像を生成する。 In S1406, the control unit 201 synthesizes a plurality of viewpoint images based on the pixel values set in the processing from S1403 to S1405, and generates a composite viewpoint image based on the plurality of captured images.

Ｓ１４０７において制御部２０１は、複数の撮影画像に基づく視点画像を出力し、本動作フローは終了する。制御部２０１は、例えば、視点画像の画像データを記憶部２０２に保存してもよいし、別な例では、ディスプレイなどの表示装置２０４に視点画像を表示させてもよい。 In S1407, the control unit 201 outputs a viewpoint image based on the plurality of captured images, and this operation flow ends. For example, the control unit 201 may store the image data of the viewpoint image in the storage unit 202, or in another example, may display the viewpoint image on a display device 204 such as a display.

以上で述べたように、図１４の動作フローによれば、複数の撮影画像に基づいて視点画像を生成することで、死角領域を補って視点画像を生成することができる。 As described above, according to the operation flow shown in FIG. 14, by generating a viewpoint image based on a plurality of captured images, it is possible to generate a viewpoint image while compensating for a blind spot area.

以上において、実施形態を例示したが、実施形態はこれに限定されるものではない。例えば、上述の動作フローは例示であり、実施形態はこれに限定されるものではない。可能な場合には、動作フローは、処理の順番を変更して実行されてもよく、別に更なる処理を含んでもよく、または、一部の処理が省略されてもよい。 Although the embodiments have been illustrated above, the embodiments are not limited thereto. For example, the above-described operational flow is an example, and the embodiments are not limited thereto. If possible, the operational flow may be executed by changing the order of processing, may include additional processing, or may omit some processing.

また、上述の実施形態では、視線３１３と対応する投射線３０８を、深度画像の画素が示す被写体１１０の深度と、投射線３０８と視線３１３との交差位置の深度との深度差に基づいて特定する例を述べている。しかしながら、実施形態はこれに限定されるものではない。別の実施形態では、制御部２０１は、深度画像の画素が示す被写体１１０の深度に基づく位置と、投射線３０８と視線３１３との交差位置との距離が最小になるように投射線３０８を特定してもよい。 Furthermore, in the embodiment described above, the projection line 308 corresponding to the line of sight 313 is specified based on the depth difference between the depth of the subject 110 indicated by the pixels of the depth image and the depth at the intersection position of the projection line 308 and the line of sight 313. An example is given. However, embodiments are not limited thereto. In another embodiment, the control unit 201 specifies the projection line 308 such that the distance between the position based on the depth of the subject 110 indicated by the pixels of the depth image and the intersection position of the projection line 308 and the line of sight 313 is minimized. You may.

また、上述の実施形態では、情報処理装置１０１が、視点画像の生成を行う例を述べているが実施形態はこれに限定されるものではない。例えば、別の実施形態では、クライアント‐サーバシステムなどを用いて、上述の処理を複数の装置で分担して実行してもよい。 Further, in the above-described embodiment, an example is described in which the information processing apparatus 101 generates a viewpoint image, but the embodiment is not limited to this. For example, in another embodiment, the processing described above may be shared and performed by multiple devices using a client-server system or the like.

なお、上述の実施形態において制御部２０１は、例えば、Ｓ８０１～Ｓ８０３の処理では配置部２１１として動作する。また、制御部２０１は、例えば、Ｓ８０７の処理では取得部２１２として動作する。制御部２０１は、例えば、Ｓ８１１の処理では生成部２１３として動作する。 Note that in the above-described embodiment, the control unit 201 operates as the placement unit 211 in, for example, the processing of S801 to S803. Further, the control unit 201 operates as the acquisition unit 212 in the process of S807, for example. For example, the control unit 201 operates as the generation unit 213 in the process of S811.

図１５は、実施形態に係る情報処理装置１０１を実現するためのコンピュータ１５００のハードウェア構成を例示する図である。図１５の情報処理装置１０１を実現するためのハードウェア構成は、例えば、プロセッサ１５０１、メモリ１５０２、記憶装置１５０３、読取装置１５０４、通信インタフェース１５０６、および入出力インタフェース１５０７を備える。なお、プロセッサ１５０１、メモリ１５０２、記憶装置１５０３、読取装置１５０４、通信インタフェース１５０６、入出力インタフェース１５０７は、例えば、バス１５０８を介して互いに接続されている。 FIG. 15 is a diagram illustrating a hardware configuration of a computer 1500 for realizing the information processing apparatus 101 according to the embodiment. A hardware configuration for realizing the information processing apparatus 101 in FIG. 15 includes, for example, a processor 1501, a memory 1502, a storage device 1503, a reading device 1504, a communication interface 1506, and an input/output interface 1507. Note that the processor 1501, memory 1502, storage device 1503, reading device 1504, communication interface 1506, and input/output interface 1507 are connected to each other via a bus 1508, for example.

プロセッサ１５０１は、例えば、シングルプロセッサであっても、マルチプロセッサやマルチコアであってもよい。プロセッサ１５０１は、メモリ１５０２を利用して例えば上述の動作フローの手順を記述したプログラムを実行することにより、上述した制御部２０１の一部または全部の機能を提供する。例えば、情報処理装置１０１のプロセッサ１５０１は、記憶装置１５０３に格納されているプログラムを読み出して実行することで、配置部２１１、取得部２１２、および生成部２１３として動作する。 The processor 1501 may be, for example, a single processor, a multiprocessor, or a multicore. The processor 1501 provides some or all of the functions of the control unit 201 described above by using the memory 1502 to execute, for example, a program that describes the procedure of the operation flow described above. For example, the processor 1501 of the information processing device 101 operates as the placement unit 211, the acquisition unit 212, and the generation unit 213 by reading and executing a program stored in the storage device 1503.

メモリ１５０２は、例えば半導体メモリであり、ＲＡＭ領域およびＲＯＭ領域を含んでいてよい。記憶装置１５０３は、例えばハードディスク、フラッシュメモリ等の半導体メモリ、または外部記憶装置である。なお、ＲＡＭは、Random Access Memoryの略称である。また、ＲＯＭは、Read Only Memoryの略称である。 Memory 1502 is, for example, a semiconductor memory and may include a RAM area and a ROM area. The storage device 1503 is, for example, a hard disk, a semiconductor memory such as a flash memory, or an external storage device. Note that RAM is an abbreviation for Random Access Memory. Further, ROM is an abbreviation for Read Only Memory.

読取装置１５０４は、プロセッサ１５０１の指示に従って着脱可能記憶媒体１５０５にアクセスする。着脱可能記憶媒体１５０５は、例えば、半導体デバイス、磁気的作用により情報が入出力される媒体、光学的作用により情報が入出力される媒体などにより実現される。なお、半導体デバイスは、例えば、ＵＳＢ（Universal Serial Bus）メモリである。また、磁気的作用により情報が入出力される媒体は、例えば、磁気ディスクである。光学的作用により情報が入出力される媒体は、例えば、ＣＤ－ＲＯＭ、ＤＶＤ、Blu-ray Disc等（Blu-rayは登録商標）である。ＣＤは、Compact Discの略称である。ＤＶＤは、Digital Versatile Diskの略称である。 Reading device 1504 accesses removable storage medium 1505 according to instructions from processor 1501. The removable storage medium 1505 is realized by, for example, a semiconductor device, a medium in which information is input/output by magnetic action, a medium in which information is input/output by optical action, or the like. Note that the semiconductor device is, for example, a USB (Universal Serial Bus) memory. Further, a medium in which information is input/output by magnetic action is, for example, a magnetic disk. Examples of media on which information is input and output through optical action include CD-ROMs, DVDs, and Blu-ray Discs (Blu-ray is a registered trademark). CD is an abbreviation for Compact Disc. DVD is an abbreviation for Digital Versatile Disk.

記憶部２０２は、例えばメモリ１５０２、記憶装置１５０３、および着脱可能記憶媒体１５０５を含んでいる。例えば、情報処理装置１０１の記憶装置１５０３には、例えば、上述の動作フローの処理を記述したプログラムおよび処理の実行で参照する情報が格納されていてよい。 The storage unit 202 includes, for example, a memory 1502, a storage device 1503, and a removable storage medium 1505. For example, the storage device 1503 of the information processing apparatus 101 may store, for example, a program that describes the processing of the above-described operational flow and information that is referenced when executing the processing.

通信インタフェース１５０６は、プロセッサ１５０１の指示に従って、他の装置と通信する。例えば、通信インタフェース１５０６は、有線または無線通信で撮影装置１０２から撮影画像および深度画像などの撮影データを受信してよい。通信インタフェース１５０６は、例えば、上述の通信部２０３の一例である。 Communication interface 1506 communicates with other devices according to instructions from processor 1501. For example, the communication interface 1506 may receive imaging data such as a captured image and a depth image from the imaging device 102 via wired or wireless communication. The communication interface 1506 is, for example, an example of the communication unit 203 described above.

入出力インタフェース１５０７は、例えば、入力装置および出力装置との間のインタフェースであってよい。入力装置は、例えばユーザからの指示を受け付けるキーボード、マウス、タッチパネルなどのデバイスである。出力装置は、例えばスピーカなどの音声装置である。また、入出力インタフェース１５０７は、例えば、上述の表示装置２０４と接続されていてよい。 Input/output interface 1507 may be, for example, an interface between an input device and an output device. The input device is, for example, a device such as a keyboard, mouse, or touch panel that receives instructions from a user. The output device is, for example, an audio device such as a speaker. Furthermore, the input/output interface 1507 may be connected to, for example, the display device 204 described above.

実施形態に係る各プログラムは、例えば、下記の形態で情報処理装置１０１に提供される。
（１）記憶装置１５０３に予めインストールされている。
（２）着脱可能記憶媒体１５０５により提供される。
（３）プログラムサーバなどのサーバから提供される。 Each program according to the embodiment is provided to the information processing apparatus 101 in the following format, for example.
(1) Installed in the storage device 1503 in advance.
(2) Provided by a removable storage medium 1505.
(3) Provided by a server such as a program server.

なお、図１５を参照して述べた情報処理装置１０１を実現するためのコンピュータ１５００のハードウェア構成は、例示であり、実施形態はこれに限定されるものではない。例えば、上述の構成の一部が、削除されてもよく、また、新たな構成が追加されてもよい。また、別の実施形態では、例えば、上述の情報処理装置１０１の一部または全部の機能がＦＰＧＡ、ＳｏＣ、ＡＳＩＣ、およびＰＬＤなどによるハードウェアとして実装されてもよい。なお、ＦＰＧＡは、Field Programmable Gate Arrayの略称である。ＳｏＣは、System-on-a-chipの略称である。ＡＳＩＣは、Application Specific Integrated Circuitの略称である。ＰＬＤは、Programmable Logic Deviceの略称である。例えば、上述の視点画像の生成処理が、ハードウェアとして実装されてもよい。また、この視点画像の生成処理を実行するハードウェアは、例えば、撮影装置１０２に搭載されるなど、撮影装置１０２と統合されていてもよい。 Note that the hardware configuration of the computer 1500 for realizing the information processing apparatus 101 described with reference to FIG. 15 is an example, and the embodiment is not limited to this. For example, some of the configurations described above may be deleted, or new configurations may be added. In another embodiment, for example, some or all of the functions of the information processing device 101 described above may be implemented as hardware such as an FPGA, an SoC, an ASIC, and a PLD. Note that FPGA is an abbreviation for Field Programmable Gate Array. SoC is an abbreviation for System-on-a-chip. ASIC is an abbreviation for Application Specific Integrated Circuit. PLD is an abbreviation for Programmable Logic Device. For example, the above-described viewpoint image generation processing may be implemented as hardware. Further, the hardware that executes this viewpoint image generation processing may be integrated with the imaging device 102, such as being installed in the imaging device 102, for example.

以上において、いくつかの実施形態が説明される。しかしながら、実施形態は上記の実施形態に限定されるものではなく、上述の実施形態の各種変形形態および代替形態を包含するものとして理解されるべきである。例えば、各種実施形態は、その趣旨および範囲を逸脱しない範囲で構成要素を変形して具体化できることが理解されよう。また、前述した実施形態に開示されている複数の構成要素を適宜組み合わせることにより、種々の実施形態が実施され得ることが理解されよう。更には、実施形態に示される全構成要素からいくつかの構成要素を削除して、または実施形態に示される構成要素にいくつかの構成要素を追加して種々の実施形態が実施され得ることが当業者には理解されよう。 Above, several embodiments are described. However, the embodiments are not limited to the embodiments described above, but should be understood to include various modifications and alternative forms of the embodiments described above. For example, it will be understood that the various embodiments can be embodied by changing the components without departing from the spirit and scope thereof. Furthermore, it will be understood that various embodiments can be implemented by appropriately combining the plurality of components disclosed in the embodiments described above. Furthermore, various embodiments may be implemented by deleting some components from all the components shown in the embodiments or adding some components to the components shown in the embodiments. It will be understood by those skilled in the art.

１００撮影システム
１０１情報処理装置
１０２撮影装置
１０５設置台
１０６撮影対象物
１１０被写体
２０１制御部
２０２記憶部
２０３通信部
２０４表示装置
２１１配置部
２１２取得部
２１３生成部
３０１光学中心
３０２光軸
３０３撮影投影面
３０５３次元平面
３０８投射線
３１０視点
３１１視点投影面
３１３視線
１５００コンピュータ
１５０１プロセッサ
１５０２メモリ
１５０３記憶装置
１５０４読取装置
１５０５着脱可能記憶媒体
１５０６通信インタフェース
１５０７入出力インタフェース
１５０８バス Reference Signs List 100 Photographing system 101 Information processing device 102 Photographing device 105 Installation stand 106 Photographing object 110 Subject 201 Control unit 202 Storage unit 203 Communication unit 204 Display device 211 Arrangement unit 212 Acquisition unit 213 Generation unit 301 Optical center 302 Optical axis 303 Photographic projection plane 305 Three-dimensional plane 308 Projection line 310 Viewpoint 311 Viewpoint projection plane 313 Line of sight 1500 Computer 1501 Processor 1502 Memory 1503 Storage device 1504 Reading device 1505 Removable storage medium 1506 Communication interface 1507 Input/output interface 1508 Bus

Claims

The optical center of the photographing device used to photograph the photographed image, in which each pixel is associated with depth information indicating the depth of the subject, the photographic projection plane on which the photographed image is projected, the viewpoint, and the A viewpoint projection plane onto which a viewpoint image depicting the subject is projected is arranged in virtual space,
A line of sight extending from the viewpoint toward a pixel to be processed of the viewpoint image arranged on the viewpoint projection surface, and a projection line extending from the optical center toward a pixel of the photographed image arranged on the photographic projection surface. and the depth of the subject indicated by the depth information corresponding to the pixels of the photographed image, the pixel value corresponding to the processing target pixel of the viewpoint image is determined from among the pixels of the photographed image. retrieved from
generating the viewpoint image using the pixel value acquired for the processing target pixel;
A generation program that causes a computer to perform a process.

The acquiring process includes determining the depth indicated by the intersection position of the line of sight and the projection line extending from the optical center toward the pixel, among a plurality of pixels of the photographed image arranged on the photographic projection plane, and the pixel. 2. The generation program according to claim 1, wherein the pixel value of the processing target pixel is acquired from a pixel that has a minimum depth difference from the depth of the subject based on the depth information.

The acquiring process is based on the intersection position of the line of sight and a projection line extended from the optical center toward the pixel where the depth difference is the minimum, and the depth information of the pixel where the depth difference is the minimum. 3. The generation program according to claim 2, wherein when the distance to the subject position is large and satisfies a predetermined condition, the processing target pixel of the viewpoint image is set to a blind spot area.

The acquiring process includes, when the line of sight intersects a plurality of projection lines on the subject, the captured image corresponding to the projection line having the shallowest intersection among the intersection positions of the plurality of projection lines and the line of sight. 4. The generation program according to claim 2, wherein the pixel value of the pixel to be processed is acquired from the pixel of .

The generating process further includes:
Generating a second viewpoint image of the subject from the viewpoint based on a second captured image in which each pixel is associated with the depth information indicating the depth of the subject;
When the processing target pixel is set in the blind spot area in either the viewpoint image or the second viewpoint image, the pixel value of the other viewpoint image is set as the pixel value of the processing target pixel. 4. The generation program according to claim 3, comprising a process for using.

The generating process further includes:
Generating a second viewpoint image of the subject from the viewpoint based on a second captured image in which each pixel is associated with the depth information indicating the depth of the subject;
When both the viewpoint images and the second viewpoint image have pixel values at the processing target pixels, the pixel values at the processing target pixels of both viewpoint images are determined by the processing from the viewpoint. 5. The method according to claim 1, further comprising a process of setting a pixel value obtained from a pixel corresponding to a projection line extending toward the target pixel and forming a smaller angle with the line of sight as the pixel value of the target pixel. The generation program according to any one of the items.

The optical center of the photographing device used to photograph the photographed image in which each pixel is associated with depth information indicating the depth of the subject, the photographic projection plane on which the photographed image is projected, the viewpoint, and the A viewpoint projection plane onto which a viewpoint image depicting the subject is projected is arranged in virtual space,
A line of sight extending from the viewpoint toward a pixel to be processed of the viewpoint image arranged on the viewpoint projection surface, and a projection line extending from the optical center toward a pixel of the photographed image arranged on the photographic projection surface. and the depth of the subject indicated by the depth information corresponding to the pixels of the photographed image, the pixel value corresponding to the processing target pixel of the viewpoint image is determined from among the pixels of the photographed image. retrieved from
generating the viewpoint image using the pixel value acquired for the processing target pixel;
computer-implemented generation methods, including

The optical center of the photographing device used to photograph the photographed image, in which each pixel is associated with depth information indicating the depth of the subject, the photographic projection plane on which the photographed image is projected, the viewpoint, and the an arrangement unit that arranges, in virtual space, a viewpoint projection surface onto which a viewpoint image depicting the subject is projected;
A line of sight extending from the viewpoint toward a pixel to be processed of the viewpoint image arranged on the viewpoint projection surface, and a projection line extending from the optical center toward a pixel of the photographed image arranged on the photographic projection surface. and the depth of the subject indicated by the depth information corresponding to the pixels of the photographed image, the pixel value corresponding to the processing target pixel of the viewpoint image is determined from among the pixels of the photographed image. an acquisition unit that acquires from
a generation unit that generates the viewpoint image using the pixel value acquired for the processing target pixel;
Information processing equipment, including.