JP7056576B2

JP7056576B2 - Transmitter, transmit method, receiver and receive method

Info

Publication number: JP7056576B2
Application number: JP2018551582A
Authority: JP
Inventors: 郁夫塚越
Original assignee: Sony Corp; Sony Group Corp
Current assignee: Sony Corp; Sony Group Corp
Priority date: 2016-11-15
Filing date: 2017-11-07
Publication date: 2022-04-19
Anticipated expiration: 2037-11-07
Also published as: JPWO2018092642A1; EP3544304A4; WO2018092642A1; EP3544304A1

Description

本技術は、送信装置、送信方法、受信装置および受信方法に関し、詳しくは、合成可能な複数のビューの符号化画像データからなる符号化ストリームを送信する送信装置等に関する。 The present art relates to a transmitting device, a transmitting method, a receiving device, and a receiving method, and more particularly to a transmitting device for transmitting a coded stream composed of coded image data of a plurality of views that can be synthesized.

例えば、特許文献１には、複数の画像を繋ぎ合わせて合成し、広い視野を再現するパノラマ画像を得ることが記載されている。合成可能な複数のビューの符号化画像データからなる符号化ストリームが送られてくる場合、受信側では、合成画像の表示を行うためには、複数のビューの画像データを高精度につなぎ合わせるアルゴリズムが必要になる。ここで、合成位置の決定は、１ピクセルずつ移動させながら、それぞれの場合での画素相関値を取って比較を行うことで行われる。このように、複数のビューの画像データを高精度につなぎ合わせる処理は、合成位置の決定処理が伴うことから処理負荷の重いものとなる。 For example, Patent Document 1 describes that a plurality of images are joined together and combined to obtain a panoramic image that reproduces a wide field of view. When a coded stream consisting of coded image data of multiple views that can be combined is sent, the receiving side is an algorithm that connects the image data of multiple views with high accuracy in order to display the combined image. Is required. Here, the composite position is determined by moving one pixel at a time and taking a pixel correlation value in each case for comparison. As described above, the process of connecting the image data of a plurality of views with high accuracy involves a process of determining the composite position, which makes the processing load heavy.

特開２０００－２９９８０４号公報Japanese Unexamined Patent Publication No. 2000-299804

本技術の目的は、受信側における画像合成処理の処理負荷を低減可能とすることにある。 An object of this technique is to make it possible to reduce the processing load of image composition processing on the receiving side.

本技術の概念は、
合成可能な複数のビューの符号化画像データからなる符号化ストリームを送信する送信部と、
上記各ビューの符号化画像データに合成点の情報を挿入する情報挿入部を備える
送信装置にある。The concept of this technology is
A transmitter that sends a coded stream consisting of coded image data from multiple views that can be combined, and a transmitter.
It is in a transmission device including an information insertion unit that inserts information of a synthesis point into the coded image data of each of the above views.

本技術において、送信部により、合成可能な複数のビューの符号化画像データからなる符号化ストリームが送信される。情報挿入部により、各ビューの符号化画像データに合成点の情報が挿入される。例えば、合成点には、合成時の基準点と非基準点が含まれる、ようにされてもよい。また、例えば、合成点の情報は、画像のトップレフトを原点とするｘ座標およびｙ座標からなる、ようにされてもよい。 In the present technique, the transmission unit transmits a coded stream composed of coded image data of a plurality of views that can be synthesized. The information insertion unit inserts information on the composite point into the coded image data of each view. For example, the composite point may include a reference point and a non-reference point at the time of synthesis. Further, for example, the information of the composite point may be composed of x-coordinates and y-coordinates having the top left of the image as the origin.

このように本技術においては、各ビューの符号化画像データに合成点の情報を挿入して送信するものである。そのため、受信側では、合成点の情報に基づいて合成処理を行うことが可能となり、処理負荷の低減が可能となる。 As described above, in the present technique, the information of the synthesis point is inserted into the coded image data of each view and transmitted. Therefore, on the receiving side, it is possible to perform the synthesis processing based on the information of the synthesis point, and it is possible to reduce the processing load.

なお、本技術において、例えば、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、ビューを識別する情報が付加される、ようにされてもよい。これにより、受信側では、各ビューの識別を容易に行うことができる。 In the present technique, for example, information for identifying the view may be added to each of the information of the synthesis point inserted in the coded image data of each view. As a result, the receiving side can easily identify each view.

また、本技術において、例えば、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、代表ビューであるか否かを示す情報が付加される、ようにされてもよい。これにより、受信側では、合成画像のうちデフォルト表示すべき一部領域として代表ビューに対応した領域を選択することが可能となる。 Further, in the present technique, for example, information indicating whether or not the view is a representative view may be added to each of the information of the composite point inserted in the coded image data of each view. This makes it possible for the receiving side to select an area corresponding to the representative view as a part of the composite image to be displayed by default.

また、本技術において、例えば、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、合成ビューの組を示す情報が付加される、ようにされてもよい。これにより、受信側では、合成ビューの組を指定して所望の合成画像を得ることが可能となる。 Further, in the present technique, for example, information indicating a set of composite views may be added to each of the composite point information inserted in the coded image data of each view. This makes it possible for the receiving side to specify a set of composite views and obtain a desired composite image.

また、本技術の他の概念は、
合成可能な複数のビューの符号化画像データからなる符号化ストリームを受信する受信部を備え、
上記各ビューの符号化画像データには、合成点の情報が挿入されており、
上記複数のビューから選択された所定数のビューの画像データを、上記合成点の情報に基づいて合成して合成画像データを得る合成部をさらに備える
受信装置にある。In addition, other concepts of this technology
It has a receiver that receives a coded stream consisting of coded image data of multiple views that can be synthesized.
Information on the composition point is inserted in the coded image data of each of the above views.
The receiving device further includes a compositing unit that synthesizes image data of a predetermined number of views selected from the plurality of views based on the information of the compositing points to obtain composite image data.

本技術において、受信部により、合成可能な複数のビューの符号化画像データからなる符号化ストリームが受信される。各ビューの符号化画像データには、合成点の情報が挿入されている。合成部により、複数のビューから選択された所定数のビューの画像データが、合成点の情報に基づいて合成されて合成画像データが得られる。 In the present technique, the receiving unit receives a coded stream composed of coded image data of a plurality of views that can be synthesized. Information on the composition point is inserted in the coded image data of each view. The compositing unit combines image data of a predetermined number of views selected from a plurality of views based on the information of the compositing points to obtain composite image data.

このように本技術においては、複数のビューから選択された所定数のビューの画像データを、各ビューの符号化画像データに挿入されて送られてくる合成点の情報に基づいて合成するものである。そのため、処理負荷の低減が可能となる。 As described above, in the present technique, the image data of a predetermined number of views selected from a plurality of views is synthesized based on the information of the synthesis point inserted into the coded image data of each view and sent. be. Therefore, the processing load can be reduced.

なお、本技術において、例えば、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、合成ビューの組を示す情報が付加されており、合成部は、合成ビューの組の指定情報で指定された組に属するビューの画像データを合成する、ようにされてもよい。これにより、合成ビューの組を指定して所望の合成画像を得ることが可能となる。 In the present technique, for example, information indicating a set of composite views is added to each of the information of the composite points inserted in the coded image data of each view, and the composite unit is the set of composite views. The image data of the views belonging to the set specified by the specified information may be combined. This makes it possible to specify a set of composite views and obtain a desired composite image.

また、本技術において、例えば、合成画像データから一部の領域の画像データを選択的に取り出して表示用画像データを得る表示処理部をさらに備える、ようにされてもよい。これにより、合成画像の一部領域の画像を選択的に表示することが可能となる。 Further, in the present technique, for example, a display processing unit for selectively extracting image data of a part of a region from the composite image data to obtain display image data may be further provided. This makes it possible to selectively display an image in a part of the composite image.

この場合、例えば、一部の領域をユーザが操作する操作部をさらに備える、ようにされてもよい。これにより、ユーザは、合成画像内の表示領域を任意に調整することが可能となる。そして、この場合、例えば、表示用画像データに、取り出された一部領域の合成画像内の位置を示すための表示データを重畳する、ようにされてもよい。これにより、ユーザは、合成画像内の表示領域の位置を容易に認識できる。 In this case, for example, an operation unit in which a user operates a part of the area may be further provided. This allows the user to arbitrarily adjust the display area in the composite image. Then, in this case, for example, the display data for indicating the position in the composite image of the extracted partial region may be superimposed on the display image data. This allows the user to easily recognize the position of the display area in the composite image.

また、本技術において、例えば、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、代表ビューであるか否かを示す情報が付加されており、表示処理部は、デフォルトでは、合成画像データから代表ビューの領域の画像データを取り出して表示用画像データを得る、ようにされてもよい。これにより、デフォルトでは、合成画像のうち代表ビューの画像を表示することが可能となる。 Further, in the present technology, for example, information indicating whether or not the view is a representative view is added to each of the information of the composite point inserted in the coded image data of each view, and the display processing unit defaults. Then, the image data of the area of the representative view may be taken out from the composite image data to obtain the image data for display. As a result, by default, it is possible to display the representative view image among the composite images.

本技術によれば、受信側における画像合成処理の処理負荷を低減することが可能となる。なお、ここに記載された効果は必ずしも限定されるものではなく、本開示中に記載されたいずれかの効果であってもよい。 According to this technique, it is possible to reduce the processing load of the image composition processing on the receiving side. The effects described herein are not necessarily limited, and may be any of the effects described in the present disclosure.

実施の形態としての送受信システムの構成例を示すブロック図である。It is a block diagram which shows the configuration example of the transmission / reception system as an embodiment. 送信装置の構成例を示すブロック図である。It is a block diagram which shows the configuration example of a transmission device. スティッチング・データ・ＳＥＩメッセージの構造例を示す図である。It is a figure which shows the structural example of a stitching data SEI message. スティッチング・データ・ＳＥＩメッセージの構造例における主要な情報の内容を示す図である。It is a figure which shows the content of the main information in the structural example of a stitching data SEI message. Ｎ個のビューの符号化画像データのシーケンスの一例を示す図である。It is a figure which shows an example of the sequence of the coded image data of N views. 受信装置の構成例を示すブロック図である。It is a block diagram which shows the configuration example of a receiving device. 情報「tier」の値による合成ビューの組の識別例を示す図である。It is a figure which shows the identification example of the set of the composite view by the value of information "tier". 「tier1」で指定された合成ビューの組に属するビュー１、ビュー２、ビュー３の画像の一例を示す図である。It is a figure which shows an example of the image of the view 1, the view 2, and the view 3 belonging to the set of the composite view designated by "tier1". 合成処理の結果の例を示す図である。It is a figure which shows the example of the result of the synthesis processing. ビュー１，２に含まれる合成点であるマスターポイントＭ１とターゲットポイントＴ１の位置関係の一例を示す図である。It is a figure which shows an example of the positional relationship between the master point M1 which is a composite point included in views 1 and 2 and the target point T1. 合成処理の手順の一例を説明するための図である。It is a figure for demonstrating an example of the procedure of a synthesis process. 合成処理の手順の一例を説明するための図である。It is a figure for demonstrating an example of the procedure of a synthesis process. 画像合成/表示選択部における合成処理のフローを示す図である。It is a figure which shows the flow of the composition processing in an image composition / display selection part. 表示部に合成画像（合成ビュー）の全体を表示する場合の表示の一例を示す図である。It is a figure which shows an example of the display when the whole composite image (composite view) is displayed on the display part. 成画像（合成ビュー）の一部領域を表示する場合であって、ユーザインタラクションによる合成画像の表示の一例を示す図である。It is a figure which shows the example of the display of the composite image by the user interaction in the case of displaying a part area of a synthetic image (composite view).

以下、発明を実施するための形態（以下、「実施の形態」とする）について説明する。なお、説明は以下の順序で行う。
１．実施の形態
２．変形例Hereinafter, embodiments for carrying out the invention (hereinafter referred to as “embodiments”) will be described. The explanation will be given in the following order.
1. 1. Embodiment 2. Modification example

＜１．実施の形態＞
［送受信システム］
図１は、実施の形態としての送受信システム１０の構成例を示している。この送受信システム１０は、送信装置１００と、受信装置２００とを有する構成となっている。<1. Embodiment>
[Transmission / reception system]
FIG. 1 shows a configuration example of a transmission / reception system 10 as an embodiment. The transmission / reception system 10 has a transmission device 100 and a reception device 200.

送信装置１００は、コンテナとしてのトランスポートストリームＴＳを放送波に載せて送信する。このトランスポートストリームＴＳには、合成可能な複数のビューの符号化画像データからなる符号化ストリーム（ビデオストリーム）が含まれる。この場合、符号化ストリームは、１本で構成されていてもよく、あるいは複数本で構成されていてもよい。 The transmission device 100 carries the transport stream TS as a container on a broadcast wave and transmits it. The transport stream TS includes a coded stream (video stream) composed of coded image data of a plurality of views that can be synthesized. In this case, the coded stream may be composed of one stream or a plurality of coded streams.

各ビューの符号化画像データには、合成点の情報が挿入される。このように各ビューの符号化画像データに合成点の情報を挿入されることで、受信側では、その合成点の情報に基づいて合成処理を行うことが可能となり、処理負荷の低減が可能となる。 Information on the composition point is inserted into the coded image data of each view. By inserting the information of the composition point into the coded image data of each view in this way, the receiving side can perform the composition processing based on the information of the composition point, and it is possible to reduce the processing load. Become.

各ビューの画像には２点以上の合成点が存在する。この合成点には、合成時に基準となる基準点とそれ以外の非基準点が含まれる。以下、適宜、基準点を「マスターポイント」と呼び、非基準点を「ターゲットポイント」と呼ぶ。ここで、合成点の情報は、例えば、画像のトップレフト（top-left）を原点とするｘ座標「x_coordinate」およびｙ座標「y_coordinate」からなる。 There are two or more composite points in the image of each view. This composite point includes a reference point that serves as a reference at the time of synthesis and other non-reference points. Hereinafter, the reference point is appropriately referred to as a “master point”, and the non-reference point is referred to as a “target point”. Here, the information of the composite point is composed of, for example, the x-coordinate "x_coordinate" and the y-coordinate "y_coordinate" with the top-left of the image as the origin.

各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、ビューを識別する情報「view_id」が付加される。このようにビューを識別する情報が付加されることで、受信側では、各ビューの識別が容易に可能となる。 Information "view_id" that identifies the view is added to each of the information of the composite point inserted in the coded image data of each view. By adding the information for identifying the views in this way, the receiving side can easily identify each view.

また、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、代表ビューであるか否かを示す情報「masterview_flag」が付加される。このように代表ビューであるか否かを示す情報が付加されることで、受信側では、合成画像のうちデフォルト表示すべき一部領域として代表ビューに対応した領域を選択することが可能となる。 Further, information "masterview_flag" indicating whether or not the view is a representative view is added to each of the information of the composite point inserted in the coded image data of each view. By adding information indicating whether or not the view is a representative view in this way, the receiving side can select an area corresponding to the representative view as a part of the composite image to be displayed by default. ..

また、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、合成ビューの組を示す情報「tier」が付加される。このように合成ビューの組を示す情報が付加されることで、受信側では、合成ビューの組を指定して所望の合成画像を得ることが可能となる。 Further, information "tier" indicating a set of composite views is added to each of the composite point information inserted in the coded image data of each view. By adding the information indicating the set of composite views in this way, the receiving side can specify the set of composite views and obtain a desired composite image.

受信装置２００は、送信装置１００から放送波に載せて送られてくる上述のトランスポートストリームＴＳを受信する。このトランスポートストリームＴＳには、上述したように、合成可能な複数のビューの符号化画像データからなる符号化ストリーム（ビデオストリーム）が含まれている。そして、各ビューの符号化画像データには、合成点の情報が挿入されている。 The receiving device 200 receives the above-mentioned transport stream TS transmitted on the broadcast wave from the transmitting device 100. As described above, the transport stream TS includes a coded stream (video stream) composed of coded image data of a plurality of views that can be synthesized. Then, information on the composite point is inserted in the coded image data of each view.

受信装置２００は、複数のビューから選択された所定数のビューの画像データを、合成点の情報に基づいて合成して合成画像データを得る。例えば、受信装置２００は、合成ビューの組の指定情報で指定された組に属するビューの画像データを合成する。合成ビューの組の指定は、例えば、ユーザの操作により行われる。 The receiving device 200 obtains composite image data by synthesizing image data of a predetermined number of views selected from a plurality of views based on the information of the synthesis point. For example, the receiving device 200 synthesizes image data of a view belonging to the set specified by the specified information of the set of the composite view. The designation of the set of the composite view is performed by, for example, a user operation.

受信装置２００は、例えば、合成画像データから一部の領域、例えば１つのビューの画像サイズ分の画像データを選択的に取り出して表示用画像データを得る。これにより、合成画像の一部領域の画像が選択的に表示される。例えば、デフォルトでは、代表ビューの情報に基づいて、代表ビューの領域の画像データが取り出されて、代表ビューの画像表示がなされる。この一部の領域の調整は、例えば、ユーザの操作により行われる。そして、この場合、受信装置２００は、例えば、取り出された一部領域の合成画像内の位置を表示画像上に表示し、ユーザの便に供する。 The receiving device 200 selectively extracts, for example, image data for a part of a region, for example, an image size of one view from the composite image data, and obtains image data for display. As a result, the image of a part of the composite image is selectively displayed. For example, by default, the image data in the area of the representative view is fetched based on the information of the representative view, and the image of the representative view is displayed. The adjustment of this part of the area is performed by, for example, a user operation. Then, in this case, the receiving device 200 displays, for example, the position in the composite image of the extracted partial region on the display image and provides it for the user's convenience.

「送信装置の構成」
図２は、送信装置１００の構成例を示している。この送信装置１００は、制御部１０１と、ユーザ操作部１０２ａと、表示部１０２ｂと、非圧縮データバッファ（ｄｐｂ：decoded picture buffer）１０３と、ビデオエンコーダ１０４と、圧縮データバッファ（ｃｐｂ：coded picture buffer）１０５と、ＴＳフォーマッタ１０６と、送信部１０７を有している。"Configuration of transmitter"
FIG. 2 shows a configuration example of the transmission device 100. The transmission device 100 includes a control unit 101, a user operation unit 102a, a display unit 102b, an uncompressed data buffer (dpb: decoded picture buffer) 103, a video encoder 104, and a compressed data buffer (cpb: coded picture buffer). ) 105, a TS formatter 106, and a transmission unit 107.

制御部１０１は、ＣＰＵ（Central Processing Unit）を備えており、送信装置１００の各部の動作を制御する。この制御部１０１には、ユーザインタフェースを構成するユーザ操作部１０２ａおよび表示部１０２ｂが接続されている。ユーザは、ユーザ操作部１０２ａにより種々の入力操作を行うことができる。例えば、ユーザ操作部１０２ａから、スティッチ合成情報の設定操作を行うことができる。このスティッチ合成情報には、合成点（マスターポイント、ターゲットポイント）の情報、ビュー識別情報、代表ビュー情報、合成ビュー組情報などが含まれる。 The control unit 101 includes a CPU (Central Processing Unit) and controls the operation of each unit of the transmission device 100. A user operation unit 102a and a display unit 102b constituting the user interface are connected to the control unit 101. The user can perform various input operations by the user operation unit 102a. For example, the stitch composition information can be set from the user operation unit 102a. This stitch composition information includes information on composition points (master point, target point), view identification information, representative view information, composition view group information, and the like.

ビデオエンコーダ１０４は、非圧縮データバッファ１０３を通じて、合成可能な複数の、ここではＮ個のビューの動画像データＶＤ１、ＶＤ２，・・・，ＶＤＮを入力し、ＡＶＣあるいはＨＥＶＣなどで符号化し、各ビューの符号化画像データからなる符号化ストリーム（ビデオストリーム）を得る。この場合、符号化ストリームは、１本あるいは複数本で構成される。 The video encoder 104 inputs video data VD1, VD2, ..., VDN of a plurality of view to be synthesized, here N views, through the uncompressed data buffer 103, encodes them with AVC, HEVC, or the like, and each of them is encoded. Obtain a coded stream (video stream) consisting of the coded image data of the view. In this case, the coded stream is composed of one or more streams.

この際、ビデオエンコーダ１０４は、各ビューの符号化画像データに、スティッチ合成情報を挿入する。具体的には、ピクチャ毎の符号化画像データを持つ各アクセスユニット（ＡＵ）の“ＳＥＩｓ”の部分に、スティッチ合成情報を持つ新規定義するスティッチング・データ・ＳＥＩメッセージ（Stitching_data SEI message）を挿入する。 At this time, the video encoder 104 inserts stitch composition information into the coded image data of each view. Specifically, a newly defined stitching data SEI message (Stitching_data SEI message) having stitch composition information is inserted in the "SEIs" part of each access unit (AU) having encoded image data for each picture. do.

図３は、スティッチング・データ・ＳＥＩメッセージの構造例(Syntax)を示している。図４は、その構造例における主要な情報の内容（Semantics）を示している。「byte_length」の８ビットフィールドは、このＳＥＩメッセージの以降のサイズをバイト数で示す。「number_of_stitching_sets」の１６ビットフィールドは、スティッチ合成情報の数を示す。この数の分だけ、以下のフィールドが繰り返し存在する。 FIG. 3 shows a structural example (Syntax) of a stitching data SEI message. FIG. 4 shows the contents (Semantics) of the main information in the structural example. The 8-bit field of "byte_length" indicates the subsequent size of this SEI message in bytes. The 16-bit field of "number_of_stitching_sets" indicates the number of stitch composition information. The following fields exist repeatedly for this number.

「stp_id」の１６ビットフィールドは、スティッチ合成するための点（合成点）の識別情報を示す。「view_id」の１６ビットフィールドは、スティッチ合成点が含まれるビューの識別情報を示す。 The 16-bit field of "stp_id" indicates the identification information of the point (combination point) for stitch synthesis. The 16-bit field of "view_id" indicates the identification information of the view including the stitch composition point.

「master_point_x_coordinate」の１６ビットフィールドは、画像のトップレフト（top-left）を原点としたマスターポイントのｘ座標を示す。「master_point_y_coordinate」の１６ビットフィールドは、画像のトップレフトを原点としたマスターポイントのｙ座標を示す。「target_point_x_coordinate」の１６ビットフィールドは、画像のトップレフトを原点としたターゲットポイントのｘ座標を示す。「target_point_y_coordinate」の１６ビットフィールドは、画像のトップレフトを原点としたターゲットポイントのｙ座標を示す。 The 16-bit field of "master_point_x_coordinate" indicates the x-coordinate of the master point with the top-left of the image as the origin. The 16-bit field of "master_point_y_coordinate" indicates the y-coordinate of the master point with the top left of the image as the origin. The 16-bit field of "target_point_x_coordinate" indicates the x-coordinate of the target point with the top left of the image as the origin. The 16-bit field of "target_point_y_coordinate" indicates the y-coordinate of the target point with the top left of the image as the origin.

「tier」の４ビットフィールドは、合成ビューの組を示す。「masterview_flag」の１ビットフィールドは、スティッチ合成点が含まれるビューが代表ビューであるか否かを示す。例えば、“１”は代表ビューであることを示し、“０”は代表ビューでないことを示す。 The 4-bit field of "tier" indicates a set of composite views. The 1-bit field of "masterview_flag" indicates whether or not the view including the stitch composition point is a representative view. For example, "1" indicates that it is a representative view, and "0" indicates that it is not a representative view.

図５は、Ｎ個のビューの符号化画像データのシーケンスの一例を示している。各ピクチャの符号化画像データ（アクセスユニット）にそれぞれ上述のスティッチング・データ・ＳＥＩメッセージが挿入される。図示の例においては、１番目のピクチャと２番目のピクチャに関しては、ビュー１（View1）の符号化画像データに挿入されるスティッチ合成情報において「Masterview = 1」とされ、ビュー１が代表ビューに設定されている。 FIG. 5 shows an example of a sequence of coded image data of N views. The stitching data SEI message described above is inserted into the coded image data (access unit) of each picture. In the illustrated example, regarding the first picture and the second picture, "Masterview = 1" is set in the stitch composition information inserted in the coded image data of view 1, and view 1 becomes the representative view. It is set.

また、３番目以降のピクチャに関しては、ビュー２（View2）の符号化画像データに挿入されるスティッチ合成情報において「Masterview = 1」とされ、ビュー２が代表ビューに設定されている。代表ビューは、後述するように、合成画像の一部領域を選択的に表示する場合に、デフォルトで、優先表示される。 Further, regarding the third and subsequent pictures, "Masterview = 1" is set in the stitch composition information inserted in the coded image data of the view 2, and the view 2 is set as the representative view. As will be described later, the representative view is preferentially displayed by default when a part of the composite image is selectively displayed.

図２に戻って、圧縮データバッファ(ｃｐｂ)１０５は、ビデオエンコーダ１０４で生成された符号化ストリームを、一時的に蓄積する。ＴＳフォーマッタ１０６は、圧縮データバッファ１０４に蓄積されている符号化ストリームを読み出し、ＰＥＳパケット化し、さらにトランスポートパケット化して多重し、多重化ストリームとしてのトランスポートストリームＴＳを得る。送信部１０７は、ＴＳフォーマッタ１０６で得られたトランスポートストリームＴＳを、放送波あるいはネットのパケットに載せて、受信装置２００に送信する。 Returning to FIG. 2, the compressed data buffer (cpb) 105 temporarily stores the coded stream generated by the video encoder 104. The TS formatter 106 reads out the coded stream stored in the compressed data buffer 104, converts it into a PES packet, further converts it into a transport packet and multiplexes it, and obtains a transport stream TS as a multiplexed stream. The transmission unit 107 carries the transport stream TS obtained by the TS formatter 106 on a broadcast wave or a net packet and transmits it to the receiving device 200.

図２に示す送信装置１００の動作を簡単に説明する。合成可能なＮ個のビューの動画像データＶＤ１、ＶＤ２，・・・，ＶＤＮは、非圧縮データバッファ(ｄｐｂ)１０３を介して、ビデオエンコーダ１０４に供給される。ビデオエンコーダ１０４では、動画像データＶＤ１、ＶＤ２，・・・，ＶＤＮのそれぞれがＡＶＣあるいはＨＥＶＣなどで符号化されて、符号化ストリーム（ビデオストリーム）が得られる。 The operation of the transmission device 100 shown in FIG. 2 will be briefly described. The moving image data VD1, VD2, ..., VDN of N views that can be combined are supplied to the video encoder 104 via the uncompressed data buffer (dpb) 103. In the video encoder 104, each of the moving image data VD1, VD2, ..., VDN is encoded by AVC, HEVC, or the like to obtain a coded stream (video stream).

また、ビデオエンコーダ１０４では、各ビューの符号化画像データに、合成点（マスターポイント、ターゲットポイント）の情報、ビュー識別情報、代表ビュー情報、合成ビュー組情報などが含まれるスティッチ合成情報が挿入される。具体的には、ビデオエンコーダ１０４では、アクセスユニット（ＡＵ）の“ＳＥＩｓ”の部分に、スティッチ合成情報を持つスティッチング・データ・ＳＥＩメッセージ（図３参照）が挿入される。 Further, in the video encoder 104, stitch composition information including composition point (master point, target point) information, view identification information, representative view information, composition view group information, and the like is inserted into the coded image data of each view. To. Specifically, in the video encoder 104, a stitching data SEI message (see FIG. 3) having stitch synthesis information is inserted into the “SEIs” portion of the access unit (AU).

ビデオエンコーダ１０４で生成された符号化ストリームは、圧縮データバッファ(ｃｐｂ)１０５に一時的に蓄積される。ＴＳフォーマッタ１０６では、圧縮データバッファ１０５に蓄積されている符号化ストリームが読み出され、ＰＥＳパケット化され、さらにトランスポートパケット化されて多重され、多重化ストリームとしてのトランスポートストリームＴＳが得られる。 The coded stream generated by the video encoder 104 is temporarily stored in the compressed data buffer (cpb) 105. In the TS formatter 106, the coded stream stored in the compressed data buffer 105 is read out, converted into a PES packet, further converted into a transport packet, and multiplexed to obtain a transport stream TS as a multiplexed stream.

ＴＳフォーマッタ１０６で得られたトランスポートストリームＴＳは、送信部１０７に送られる。送信部１０７では、ＴＳフォーマッタ１０６で得られたトランスポートストリームＴＳが、放送波あるいはネットのパケットに載せて、受信装置２００に送信される。 The transport stream TS obtained by the TS formatter 106 is sent to the transmission unit 107. In the transmission unit 107, the transport stream TS obtained by the TS formatter 106 is carried on a broadcast wave or a net packet and transmitted to the receiving device 200.

「受信装置の構成」
図６は、受信装置２００の構成例を示している。この受信装置２００は、制御部（Central Processing Unit）２０１と、ユーザ操作部２０１ａと、受信部２０２と、ＴＳ解析部２０３と、圧縮データバッファ（ｃｐｂ：coded picture buffer）２０４と、ビデオデコーダ２０５と、非圧縮データバッファ（ｄｐｂ：decoded picture buffer）２０６と、画像合成/表示選択部２０７と、合成部２０８と、表示部２０９を有している。"Receiver configuration"
FIG. 6 shows a configuration example of the receiving device 200. The receiving device 200 includes a control unit (Central Processing Unit) 201, a user operation unit 201a, a receiving unit 202, a TS analysis unit 203, a compressed data buffer (cpb: coded picture buffer) 204, and a video decoder 205. It has an uncompressed data buffer (dpb: decoded picture buffer) 206, an image composition / display selection unit 207, a composition unit 208, and a display unit 209.

制御部２０１は、ＣＰＵ（Central Processing Unit）を備えており、受信装置２００の各部の動作を制御する。この制御部２０１には、ユーザインタフェースを構成するユーザ操作部２０１ａが接続されている。ユーザは、ユーザ操作部２０１ａにより種々の入力操作を行うことができる。例えば、受信機側で合成画像のうち表示すべき一部領域の選択が可能な場合には、ユーザ操作部２０１ａから選択操作を行うことできる。 The control unit 201 includes a CPU (Central Processing Unit) and controls the operation of each unit of the receiving device 200. A user operation unit 201a constituting a user interface is connected to the control unit 201. The user can perform various input operations by the user operation unit 201a. For example, if the receiver can select a part of the composite image to be displayed, the user operation unit 201a can perform the selection operation.

受信部２０２は、送信装置１００から放送波あるいはネットのパケットに載せて送られてくるトランスポートストリームＴＳを受信する。ＴＳ解析部２０３は、トランスポートストリームＴＳから、それに含まれる符号化ストリーム（各ビューの符号化画像データ）を取り出して圧縮データバッファ（ｃｐｂ）２０４に送る。 The receiving unit 202 receives the transport stream TS transmitted from the transmitting device 100 on a broadcast wave or a packet on the net. The TS analysis unit 203 takes out the coded stream (coded image data of each view) included in the transport stream TS and sends it to the compressed data buffer (cpb) 204.

ビデオデコーダ２０５は、受信装置２００のデコード能力に応じて、全てのビュー、あるいはユーザ操作部２０１ａからユーザにより指定された合成ビューの組に属する各ビューの符号化画像データをデコードする。合成ビューの組の指定は、情報「tier」の値を所定の値に設定することで行われる。この場合、指定された合成ビューの組に属するビューであるかは、設定された情報「tier」の値に関連付けられた合成点を持つか否かで識別される。 The video decoder 205 decodes all the views or the encoded image data of each view belonging to the set of composite views designated by the user from the user operation unit 201a according to the decoding ability of the receiving device 200. The specification of the set of the composite view is performed by setting the value of the information "tier" to a predetermined value. In this case, whether or not the view belongs to the specified set of composite views is identified by whether or not it has a composite point associated with the value of the set information "tier".

図７は、情報「tier」の値による合成ビューの組の識別例を示している。なお、この例は、合成点（マスターポイント、ターゲットポイント）の情報に基づいて、各ビューの画像が既に合成されている状態を示している。例えば、合成ビューの組の指定が情報「tier」の値を「tier1」に設定することで行われる場合、ビュー１（view1）、ビュー２（view2）、ビュー３（view3）の各ビューが「tier1」に関連付けられた合成点を持っていることから、指定された合成ビューの組に属するビューとなる。また、例えば、合成ビューの組の指定が情報「tier」の値を「tier2」に設定することで行われる場合、ビュー１（view1）、ビュー４（view4）、ビュー５（view5）の各ビューが「tier2」に関連付けられた合成点を持っていることから、指定された合成ビューの組に属するビューとなる。 FIG. 7 shows an example of identifying a set of composite views by the value of the information “tier”. Note that this example shows a state in which the images of each view have already been combined based on the information of the composite points (master point, target point). For example, when the composition view set is specified by setting the value of the information "tier" to "tier1", each view of view 1 (view1), view 2 (view2), and view 3 (view3) is "view3". Since it has a composite point associated with "tier 1", it is a view that belongs to the specified composite view set. Further, for example, when the set of the composite view is specified by setting the value of the information "tier" to "tier2", each view of view 1 (view1), view 4 (view4), and view 5 (view5) is specified. Has a composite point associated with "tier2", so it is a view that belongs to the specified composite view set.

ビデオデコーダ２０５は、圧縮データバッファ２０４に蓄積されている各ビューにおける各ピクチャの符号化画像データを、それぞれ、そのピクチャのＤＴＳ（Decoding Time stamp）で与えられるデコードタイミングで読み出してデコードする。デコードして得られた各ビューの各ピクチャの画像データは非圧縮データバッファ２０６に一時的に蓄積される。この非圧縮データバッファ２０６から各ピクチャの画像データがそのピクチャのＰＴＳ（Presentation Time Stamp）で与えられるタイミングで読み出されて、各ビューの動画像データＶＤが得られ、画像合成/表示選択部２０７に送られる。 The video decoder 205 reads and decodes the encoded image data of each picture in each view stored in the compressed data buffer 204 at the decoding timing given by the DTS (Decoding Time stamp) of the picture. The image data of each picture of each view obtained by decoding is temporarily stored in the uncompressed data buffer 206. The image data of each picture is read from the uncompressed data buffer 206 at the timing given by the PTS (Presentation Time Stamp) of the picture, the moving image data VD of each view is obtained, and the image composition / display selection unit 207 is obtained. Will be sent to.

また、ビデオデコーダ２０５は、ビデオストリームに挿入されているパラメータセットやＳＥＩメッセージを抽出し、制御部２０１に送る。この抽出情報には、上述した各ビューの符号化画像データのアクセスユニットに挿入されているスティッチング・データ・ＳＥＩメッセージ（図３参照）も含まれる。これにより、制御部２０１は、各ビューのスティッチ合成情報を認識する。上述したように、スティッチ合成情報には、合成点（マスターポイント、ターゲットポイント）の情報、ビュー識別情報、代表ビュー情報、合成ビュー組情報などが含まれている。 Further, the video decoder 205 extracts the parameter set and the SEI message inserted in the video stream and sends them to the control unit 201. The extracted information also includes a stitching data SEI message (see FIG. 3) inserted in the access unit of the coded image data of each view described above. As a result, the control unit 201 recognizes the stitch composition information of each view. As described above, the stitch composition information includes information on the composition point (master point, target point), view identification information, representative view information, composition view group information, and the like.

画像合成/表示選択部２０７は、非圧縮データバッファ２０６で得られた各ビューの画像データに合成処理を施して合成画像データを得る。合成処理では、同じ識別情報「stp_id」を持つ合成点を重ね合わせることで画像データを合成する。この場合、基準となるマスタ―ポイントを起点に、ターゲットポイント同士が重なるように、必要に応じて、回転やスケーリングを行って合成する。 The image composition / display selection unit 207 performs composition processing on the image data of each view obtained in the uncompressed data buffer 206 to obtain composite image data. In the compositing process, image data is synthesized by superimposing composite points having the same identification information "stp_id". In this case, starting from the reference master point, the target points are rotated and scaled as necessary so that the target points overlap each other.

図８は、上述の図７において、「tier1」で指定された合成ビューの組に属するビュー１（view1）、ビュー２（view2）、ビュー３（view3）の画像の一例を示している。ビュー１のスティッチ合成情報には、「stp_id1」で識別される合成点に係る各種情報が含まれている。ここでは、「view_id1」により、この合成点はビュー１に含まれること示されている。また、マスターポイントの座標が（ｘ１１，ｙ１１）であることが示されている。また、ターゲットポイントの座標が（ｘ１２，ｙ１２）であることが示されている。 FIG. 8 shows an example of images of view 1, view 2 (view 2), and view 3 (view 3) belonging to the set of composite views specified by “tier 1” in FIG. 7 above. The stitch composition information of the view 1 includes various information related to the composition point identified by "stp_id1". Here, "view_id1" indicates that this composite point is included in view 1. It is also shown that the coordinates of the master point are (x11, y11). It is also shown that the coordinates of the target point are (x12, y12).

ビュー２のスティッチ合成情報には、「stp_id1」で識別される合成点に係る各種情報が含まれている。ここでは、「view_id2」により、この合成点はビュー２に含まれること示されている。また、マスターポイントの座標が（ｘ２１，ｙ２１）であることが示されている。また、ターゲットポイントの座標が（ｘ２２，ｙ２２）であることが示されている。 The stitch composition information of the view 2 includes various information related to the composition point identified by "stp_id1". Here, "view_id2" indicates that this composite point is included in view 2. It is also shown that the coordinates of the master point are (x21, y21). It is also shown that the coordinates of the target point are (x22, y22).

また、ビュー２のスティッチ合成情報には、「stp_id2」で識別される合成点に係る各種情報が含まれている。ここでは、「view_id2」により、この合成点はビュー２に含まれること示されている。また、マスターポイントの座標が（ｘ２３，ｙ２３）であることが示されている。また、ターゲットポイントの座標が（ｘ２４，ｙ２４）であることが示されている。 Further, the stitch composition information of the view 2 includes various information related to the composition point identified by "stp_id2". Here, "view_id2" indicates that this composite point is included in view 2. It is also shown that the coordinates of the master point are (x23, y23). It is also shown that the coordinates of the target point are (x24, y24).

ビュー３のスティッチ合成情報には、「stp_id2」で識別される合成点に係る各種情報が含まれている。ここでは、「view_id3」により、この合成点はビュー３に含まれること示されている。また、マスターポイントの座標が（ｘ３１，ｙ３１）であることが示されている。また、ターゲットポイントの座標が（ｘ３２，ｙ３２）であることが示されている。 The stitch composition information of the view 3 includes various information related to the composition point identified by "stp_id2". Here, "view_id3" indicates that this composite point is included in view 3. It is also shown that the coordinates of the master point are (x31, y31). It is also shown that the coordinates of the target point are (x32, y32).

ビュー１とビュー２の合成処理では、「stp_id1」で識別される合成点（マスターポイント、ターゲットポイント）を重ね合わせることで画像データを合成する。また、ビュー２とビュー３の合成処理では、「stp_id2」で識別される合成点（マスターポイント、ターゲットポイント）を重ね合わせることで画像データを合成する。これにより、ビュー１、ビュー２およびビュー３の画像データを合成した合成画像データが得られる。図９は、合成処理の結果の例を示している。 In the composition process of view 1 and view 2, image data is combined by superimposing the composition points (master point, target point) identified by "stp_id1". Further, in the compositing process of the view 2 and the view 3, the image data is synthesized by superimposing the compositing points (master point, target point) identified by "stp_id2". As a result, a composite image data obtained by synthesizing the image data of views 1, 2 and 3 can be obtained. FIG. 9 shows an example of the result of the synthesis process.

合成処理の手順について説明する。ここでは、ビュー１とビュー２の合成処理を例にとって説明する。この場合、ビュー１とビュー２に含まれる「stp_id1」で識別される合成点（マスターポイント、ターゲットポイント）を用いて合成処理が行われる。 The procedure of the synthesis process will be described. Here, the composition process of view 1 and view 2 will be described as an example. In this case, the composition process is performed using the composition points (master point, target point) identified by "stp_id1" included in the view 1 and the view 2.

図１０（ａ）は、ビュー１に含まれる合成点であるマスターポイントＭ１とターゲットポイントＴ１の位置関係の一例を示している。マスターポイントＭ１の座標は（ｘ１１，ｙ１１）であり、ターゲットポイントＴ１の座標は（ｘ１２、ｙ１２）である。マスターポイントＭ１を基点として、ターゲットポイントＴ１は、垂直軸からθ１だけ回転した位置にある。また、マスターポイントＭ１とターゲットポイントＴ１の間の距離はｒ１である。 FIG. 10A shows an example of the positional relationship between the master point M1 and the target point T1, which are synthetic points included in the view 1. The coordinates of the master point M1 are (x11, y11), and the coordinates of the target point T1 are (x12, y12). With the master point M1 as the base point, the target point T1 is at a position rotated by θ1 from the vertical axis. Further, the distance between the master point M1 and the target point T1 is r1.

図１０（ｂ）は、ビュー２に含まれる合成点であるマスターポイントＭ２とターゲットポイントＴ２の位置関係の一例を示している。マスターポイントＭ２の座標は（ｘ２１，ｙ２１）であり、ターゲットポイントＴ２の座標は（ｘ２２、ｙ２２）である。マスターポイントＭ２を基点として、ターゲットポイントＴ２は、垂直軸からθ２だけ回転した位置にある。また、マスターポイントＭ２とターゲットポイントＴ２の間の距離はｒ２である。なお、ここでは、θ２＜θ１、ｒ２＞ｒ１であるとする。 FIG. 10B shows an example of the positional relationship between the master point M2 and the target point T2, which are composite points included in the view 2. The coordinates of the master point M2 are (x21, y21), and the coordinates of the target point T2 are (x22, y22). With the master point M2 as the base point, the target point T2 is located at a position rotated by θ2 from the vertical axis. Further, the distance between the master point M2 and the target point T2 is r2. Here, it is assumed that θ2 <θ1, r2> r1.

ステップ１では、図１１（ａ）に示すように、ビュー１のマスターポイントＭ１とビュー２のマスターポイントＭ２の座標が重なるように、２つのビュー（画像）の位置設定の処理をする。このように位置設定をした場合、マスターポイントＭ１，Ｍ２の座標が重なるが、上述したようにθ２＜θ１、ｒ２＞ｒ１であるので、ビュー１のターゲットポイントＴ１とビュー２のターゲットポイントＴ２の座標は重ならない。 In step 1, as shown in FIG. 11A, the positions of the two views (images) are set so that the coordinates of the master point M1 of the view 1 and the master point M2 of the view 2 overlap. When the position is set in this way, the coordinates of the master points M1 and M2 overlap, but since θ2 <θ1, r2> r1 as described above, the coordinates of the target point T1 of the view 1 and the target point T2 of the view 2 Do not overlap.

以降のステップでは、ターゲットポイントＴ１，Ｔ２の座標が重なるように位置設定の処理をさらに進めていく。なお、θ２＝θ１、ｒ２＝ｒ１である場合には、このステップ１の処理で、ターゲットポイントＴ１，Ｔ２の座標も重なることから、２つのビュー（画像）の位置設定の処理が終了することになる。 In the subsequent steps, the position setting process is further advanced so that the coordinates of the target points T1 and T2 overlap. When θ2 = θ1 and r2 = r1, the coordinates of the target points T1 and T2 also overlap in the process of this step 1, so that the process of setting the positions of the two views (images) is completed. Become.

ステップ２では、図１１（ｂ）に示すように、ビュー１のターゲットポイントＴ１とビュー２のターゲットポイントＴ２の座標を重ねるための第１段階として、ビュー２を、マスターポイントＭ２を支点として、（θ１-θ２）だけ回転する。これにより、ビュー２のターゲットポイントＴ２は、マスターポイントＭ１，Ｍ２とビュー１のターゲットポイントＴ１とを結ぶ線の延長上に位置した状態となる。なお、この場合、逆に、ビュー１を、マスターポイントＭ１を支点として、－（θ１-θ２）だけ回転することも考えられる。 In step 2, as shown in FIG. 11B, as the first step for superimposing the coordinates of the target point T1 of the view 1 and the target point T2 of the view 2, the view 2 is set as a fulcrum and the master point M2 is used as a fulcrum. Rotate by θ1-θ2). As a result, the target point T2 of the view 2 is located on the extension of the line connecting the master points M1 and M2 and the target point T1 of the view 1. In this case, conversely, it is conceivable to rotate the view 1 by − (θ1-θ2) with the master point M1 as the fulcrum.

ステップ３では、図１１（ｃ）に示すように、ビュー１のターゲットポイントＴ１とビュー２のターゲットポイントＴ２の座標を重ねるための第２段階として、ビュー２のターゲットポイントＴ２がビュー１のターゲットポイントＴ１に重なるように、マスターポイントＭ１ないしＭ２を起点にスケーリングする。この場合、ビュー２のマスターポイントＭ２とターゲットポイントＴ２の間の距離は、ビュー１のマスターポイントＭ１とターゲットポイントＴ１の間の距離ｒ１と等しくなる。なお、この場合、逆に、ビュー１のターゲットポイントＴ１がビュー２のターゲットポイントＴ２に重なるようにスケーリングすることも考えられる。 In step 3, as shown in FIG. 11C, the target point T2 of the view 2 is the target point of the view 1 as the second step for superimposing the coordinates of the target point T1 of the view 1 and the target point T2 of the view 2. Scaling is performed starting from the master points M1 to M2 so as to overlap T1. In this case, the distance between the master point M2 of the view 2 and the target point T2 is equal to the distance r1 between the master point M1 of the view 1 and the target point T1. In this case, conversely, it is conceivable to scale the target point T1 of the view 1 so as to overlap the target point T2 of the view 2.

ステップ４では、ビュー１とビュー２の画質（輝度、色など）が一致するように調整する。この場合、例えば、輝度調整の場合は、図１２（ａ）に示すように、上述したようにマスターポイント同士、ターゲットポイント同士が重なった状態で、マスターポイントとターゲットポイントで規制される長方形領域を設定する。そして、この長方形領域におけるビュー１、ビュー２のそれぞれの輝度ヒストグラムをとり、それが同じになるように調整する。図１２（ｂ）は、輝度調整された後の状態を示している。詳細説明は省略するが、色調整の場合も同様である。なお、輝度調整および色調整の双方を行う場合他、いずれか一方のみを行うことも考えられる。 In step 4, the image quality (luminance, color, etc.) of view 1 and view 2 is adjusted to match. In this case, for example, in the case of brightness adjustment, as shown in FIG. 12A, the rectangular area regulated by the master point and the target point is formed in a state where the master points and the target points overlap each other as described above. Set. Then, the luminance histograms of views 1 and 2 in this rectangular area are taken and adjusted so that they are the same. FIG. 12B shows the state after the luminance is adjusted. Although detailed description is omitted, the same applies to color adjustment. In addition to the case where both brightness adjustment and color adjustment are performed, it is also conceivable to perform only one of them.

図１３は、画像合成/表示選択部２０７における合成処理のフローを示している。ステップＳＴ１において、処理を開始する。次に、ステップＳＴ２において、同じ「stp_id」の値を持つ２つのビュー（画像）のマスターポイント（Master Point）同士が同一の座標に重なるように位置設定をする（図１１（ａ）参照）。 FIG. 13 shows the flow of the compositing process in the image compositing / display selection unit 207. In step ST1, the process is started. Next, in step ST2, the positions are set so that the master points of the two views (images) having the same “stp_id” value overlap with each other at the same coordinates (see FIG. 11A).

次に、ステップＳＴ３において、マスターポイント同士が同一の座標に重なっても２つのターゲットポイント（Target Pint）が座標上で重ならない場合であってθ２とθ１が異なるときは、マスターポイントを中心として、θ２がθ１と等しくなるように、あるいはθ１がθ２に等しくなるように、いずれかのビューを回転する（図１１（ｂ）参照）。 Next, in step ST3, when the two target points (Target Pint) do not overlap on the coordinates even if the master points overlap on the same coordinates, and θ2 and θ1 are different, the master point is set as the center. Rotate either view so that θ2 is equal to θ1 or θ1 is equal to θ2 (see FIG. 11 (b)).

次に、ステップＳＴ４において、θ２がθ１と等しくなっても２つのターゲットポイントが座標上で重ならない場合は、２つのターゲットポイントが重なるように、一方のビューを縮小または拡大するスケーリングをする（図１１（ｃ）参照）。次に、ステップＳＴ５において、マスターポイントとターゲットポイントが形成する長方形領域で輝度および/または色に関するヒストグラムが同じになるように、２つのビューの画質調整をする（図１２参照）。 Next, in step ST4, if the two target points do not overlap on the coordinates even if θ2 becomes equal to θ1, scaling is performed to reduce or enlarge one view so that the two target points overlap (Fig.). 11 (c)). Next, in step ST5, the image quality of the two views is adjusted so that the histograms regarding brightness and / or color are the same in the rectangular region formed by the master point and the target point (see FIG. 12).

他に合成すべきビューが存在すれば、その分だけ上述のステップＳＴ２からステップＳＴ５までを繰り返し、全てのビューの合成が終了したとき、ステップＳＴ６において、合成処理を終了する。 If there is another view to be combined, the above steps ST2 to ST5 are repeated by that amount, and when the composition of all the views is completed, the composition process is completed in step ST6.

図６に戻って、画像合成/表示選択部２０７は、表示部２０９の表示画枠サイズやユーザ操作部２０１ａからのユーザの選択操作に応じて、合成画像データからその全領域あるいは一部領域の画像データを取り出して表示用画像データとして出力する。例えば、表示部２０９の表示画枠サイズが合成画像の全体表示が可能であるサイズの場合には、画像合成/表示選択部２０７は、合成画像データからその全領域の画像データを取り出して表示用画像データとして出力する。 Returning to FIG. 6, the image composition / display selection unit 207 may use the composite image data as a whole area or a part of the image composition / display selection unit 207 according to the display image frame size of the display unit 209 and the user selection operation from the user operation unit 201a. The image data is taken out and output as image data for display. For example, when the display frame size of the display unit 209 is a size capable of displaying the entire composite image, the image composite / display selection unit 207 extracts the image data of the entire area from the composite image data and displays it. Output as image data.

また、例えば、表示部２０９の表示画枠サイズが合成画像の一部の表示、例えば１つのビュー分だけの表示が可能であるサイズの場合には、画像合成/表示選択部２０７は、合成画像データからその一部領域の画像データを取り出して表示用画像データとして出力する。この際、画像合成/表示選択部２０７は、デフォルトでは、代表ビューの領域の画像データを取り出して表示用画像データとして出力する。 Further, for example, when the display frame size of the display unit 209 is a size capable of displaying a part of the composite image, for example, displaying only one view, the image composition / display selection unit 207 may display the composite image. The image data of a part of the area is taken out from the data and output as the image data for display. At this time, by default, the image composition / display selection unit 207 takes out the image data in the area of the representative view and outputs it as display image data.

また、このように画像合成/表示選択部２０７が合成画像データからその一部領域の画像データを取り出して表示用画像データとして出力する場合、ユーザ操作部２０１ａからのユーザの選択操作に応じて画像データを取り出す一部領域を変更し、その変更後の一部領域の画像データを取り出して表示用画像データとして出力する。これにより、ユーザのスクロール操作に応じたスクロール表示も可能となる。なお、スクロール操作ではなく、各ビューの領域を表示領域として順次選択することも考えられる。 Further, when the image composition / display selection unit 207 extracts the image data of a part of the area from the composite image data and outputs the image data as display image data in this way, the image corresponds to the user's selection operation from the user operation unit 201a. Part of the area from which data is to be taken out is changed, and the image data of the changed part of the area is taken out and output as image data for display. This enables scroll display according to the scroll operation of the user. It is also conceivable to sequentially select the area of each view as the display area instead of the scroll operation.

制御部２０１は、合成画像内のどの位置の画像データが取り出されている状態にあるかを示すためのＵＩ表示データを生成する。この場合、合成画像データからその全領域の画像データが取り出されている場合には、このＵＩ表示データによりその旨が表示され、スクロール成分がないことが示される。一方、合成画像データからその一部領域の画像データが取り出されている場合には、このＵＩ表示データによりその取り出し位置が表示され、スクロール成分があることが示される。 The control unit 201 generates UI display data for indicating which position of the image data in the composite image is in the fetched state. In this case, when the image data of the entire area is extracted from the composite image data, this UI display data indicates that there is no scroll component. On the other hand, when the image data of a part of the area is extracted from the composite image data, the extraction position is displayed by this UI display data, and it is shown that there is a scroll component.

合成部２０８は、画像合成/表示選択部２０７で生成された表示用画像データおよび制御部２０１で発生されるＵＩ表示データを合成して、表示部２０９に送る。表示部２０９は、例えば、ＬＣＤ(Liquid Crystal Display)、有機ＥＬ（Organic Electro-Luminescence）パネル等で構成されている。なお、この表示部２０９は、受信装置２００に接続される外部機器であってもよい。 The compositing unit 208 synthesizes the display image data generated by the image compositing / display selection unit 207 and the UI display data generated by the control unit 201, and sends them to the display unit 209. The display unit 209 is composed of, for example, an LCD (Liquid Crystal Display), an organic EL (Organic Electro-Luminescence) panel, or the like. The display unit 209 may be an external device connected to the receiving device 200.

図１４は、表示部２０９に、合成画像（合成ビュー）の全体を表示する場合の表示の一例を示している。表示画面には、合成画像の他に、ＵＩ表示データにより、合成画像の全体表示であり、スクロール成分がないことが示されている。図１５は、成画像（合成ビュー）の一部領域を表示する場合であって、ユーザインタラクションによる合成画像の表示の一例を示している。この場合、ユーザのスクロール操作により、表示領域がスクロールしていく。表示画面には、一部領域の画像の他に、ＵＩ表示データにより、合成画像の一部表示であることがその位置と共に示され、さらにスクロール成分があることが方向を示す矢印を伴って示されている。 FIG. 14 shows an example of display when the entire composite image (composite view) is displayed on the display unit 209. On the display screen, in addition to the composite image, the UI display data indicates that the composite image is the entire display and that there is no scroll component. FIG. 15 shows a case where a part of the adult image (composite view) is displayed, and shows an example of displaying the composite image by user interaction. In this case, the display area is scrolled by the scroll operation of the user. On the display screen, in addition to the image of a partial area, the UI display data indicates that the composite image is partially displayed together with its position, and that there is a scroll component with an arrow indicating the direction. Has been done.

図１５（ａ）は、合成画像のうち左端の代表ビューの領域の表示領域となっている場合のＵＩ画面の一例を示している。この場合、右向きの矢印により、表示画像位置が左端にあり、右側へのスクロールのみが可能であることが示されている。図１５（ｂ）は、右側にＰ１だけ表示領域をスクロールした場合のＵＩ画面の一例を示している。この場合、縦棒で合成画像の全体に対する表示領域の相対位置が示され、さらに右左の両方に向いた矢印により、右左の両方へのスクロールが可能であることが示されている。 FIG. 15A shows an example of the UI screen when the display area is the leftmost representative view area of the composite image. In this case, the arrow pointing to the right indicates that the display image position is at the left end and only scrolling to the right is possible. FIG. 15B shows an example of the UI screen when the display area is scrolled by P1 on the right side. In this case, the vertical bar indicates the relative position of the display area with respect to the entire composite image, and the arrows pointing to both the right and left indicate that scrolling to both the right and left is possible.

図１５（ｃ）は、さらに右側にＰ２だけ表示領域をスクロールした場合のＵＩ画面の一例を示している。この場合、縦棒で合成画像の全体に対する表示領域の相対位置が示され、さらに右左の両方に向いた矢印により、右左の両方へのスクロールが可能であることが示されている。図１５（ｄ）は、さらに右側にＰ３だけ表示領域をスクロールした場合のＵＩ画面の一例を示している。この場合、左向きの矢印により、表示画像位置が右端にあり、左側へのスクロールのみが可能であることが示されている。 FIG. 15C shows an example of the UI screen when the display area is further scrolled to the right by P2. In this case, the vertical bar indicates the relative position of the display area with respect to the entire composite image, and the arrows pointing to both the right and left indicate that scrolling to both the right and left is possible. FIG. 15D shows an example of the UI screen when the display area is further scrolled to the right by P3. In this case, the left-pointing arrow indicates that the display image position is at the right end and only scrolling to the left is possible.

図６に示す受信装置２００の動作を簡単に説明する。受信部２０２では、送信装置１００から放送波あるいはネットのパケットに載せて送られてくるトランスポートストリームＴＳが受信される。このトランスポートストリームＴＳは、ＴＳ解析部２０３に供給される。ＴＳ解析部２０３では、トランスポートストリームＴＳから、それに含まれる符号化ストリーム（各ビューの符号化画像データ）を取り出され、圧縮データバッファ２０４に送られる。 The operation of the receiving device 200 shown in FIG. 6 will be briefly described. The receiving unit 202 receives the transport stream TS transmitted from the transmitting device 100 on a broadcast wave or a packet on the net. This transport stream TS is supplied to the TS analysis unit 203. The TS analysis unit 203 takes out the coded stream (coded image data of each view) included in the transport stream TS and sends it to the compressed data buffer 204.

ビデオデコーダ２０５では、受信装置２００のデコード能力に応じて、全てのビュー、あるいはユーザ操作部２０１ａからユーザにより指定された合成ビューの組に属する各ビューの符号化画像データがデコードされる。合成ビューの組の指定は、情報「tier」の値を所定の値に設定することで行われる。この場合、ビデオデコーダ２０５では、圧縮データバッファ２０４に蓄積されている各ビューにおける各ピクチャの符号化画像データが、それぞれ、そのピクチャのＤＴＳ（Decoding Time stamp）で与えられるデコードタイミングで読み出されてデコードされる。 In the video decoder 205, the coded image data of all the views or each view belonging to the set of composite views designated by the user is decoded from the user operation unit 201a according to the decoding ability of the receiving device 200. The specification of the set of the composite view is performed by setting the value of the information "tier" to a predetermined value. In this case, in the video decoder 205, the encoded image data of each picture in each view stored in the compressed data buffer 204 is read out at the decoding timing given by the DTS (Decoding Time stamp) of the picture. It is decoded.

ビデオデコーダ２０５で得られた各ビューの各ピクチャの画像データは、非圧縮データバッファ２０６に一時的に蓄積される。この非圧縮データバッファ２０６から各ピクチャの画像データがそのピクチャのＰＴＳ（Presentation Time Stamp）で与えられるタイミングで読み出されて、各ビューの動画像データＶＤが得られ、画像合成/表示選択部２０７に送られる。 The image data of each picture of each view obtained by the video decoder 205 is temporarily stored in the uncompressed data buffer 206. The image data of each picture is read from the uncompressed data buffer 206 at the timing given by the PTS (Presentation Time Stamp) of the picture, the moving image data VD of each view is obtained, and the image composition / display selection unit 207 is obtained. Will be sent to.

また、ビデオデコーダ２０５では、ビデオストリームに挿入されているパラメータセットやＳＥＩメッセージが抽出され、制御部２０１に送られる。この抽出情報には、各ビューの符号化画像データのアクセスユニットに挿入されているスティッチング・データ・ＳＥＩメッセージ（図３参照）も含まれる。これにより、制御部２０１では、各ビューのスティッチ合成情報が認識される。このスティッチ合成情報には、合成点（マスターポイント、ターゲットポイント）の情報、ビュー識別情報、代表ビュー情報、合成ビュー組情報などが含まれている。 Further, the video decoder 205 extracts the parameter set and the SEI message inserted in the video stream and sends them to the control unit 201. This extracted information also includes stitching data SEI messages (see FIG. 3) inserted in the access unit of the coded image data of each view. As a result, the control unit 201 recognizes the stitch composition information of each view. This stitch composition information includes information on composition points (master point, target point), view identification information, representative view information, composition view group information, and the like.

画像合成/表示選択部２０７では、制御部２０１の制御の下、非圧縮データバッファ２０６で得られた各ビューの画像データに合成処理が施されて合成画像データが得られる。合成処理では、同じ識別情報「stp_id」を持つ合成点が重ね合わせられることで画像データが合成される。この場合、基準となるマスタ―ポイントを起点に、ターゲットポイント同士が重なるように、必要に応じて、回転やスケーリングが行われて合成される。 In the image composition / display selection unit 207, under the control of the control unit 201, the image data of each view obtained in the uncompressed data buffer 206 is subjected to a composition process to obtain the composite image data. In the compositing process, image data is synthesized by superimposing composite points having the same identification information "stp_id". In this case, starting from the reference master point, rotation and scaling are performed as necessary so that the target points overlap each other, and the composition is performed.

また、画像合成/表示選択部２０７では、表示部２０９の表示画枠サイズやユーザ操作部２０１ａからのユーザの選択操作に応じて、合成画像データからその全領域あるいは一部領域の画像データが取り出されて表示用画像データとして出力される。この表示用画像データは、合成部２０８を介して表示部２０９に供給され、表示部２０９には、表示用画像データによる画像が表示される。このように、表示部２０９の表示画枠サイズに応じて、スケーラブルな表示が行なわれる。 Further, in the image composition / display selection unit 207, image data of all or a part of the composite image data is extracted from the composite image data according to the display frame size of the display unit 209 and the user's selection operation from the user operation unit 201a. Is output as image data for display. The display image data is supplied to the display unit 209 via the compositing unit 208, and the display unit 209 displays the image based on the display image data. In this way, scalable display is performed according to the display image frame size of the display unit 209.

例えば、表示部２０９の表示画枠サイズが合成画像の全体表示が可能であるサイズの場合には、画像合成/表示選択部２０７では、合成画像データからその全領域の画像データが取り出されて表示用画像データとして出力される。そのため、この場合には、表示部２０９には、合成画像の全体が表示される。 For example, when the display frame size of the display unit 209 is a size capable of displaying the entire composite image, the image composite / display selection unit 207 extracts and displays the image data of the entire area from the composite image data. It is output as image data. Therefore, in this case, the entire composite image is displayed on the display unit 209.

また、例えば、表示部２０９の表示画枠サイズが合成画像の一部の表示、例えば１つのビュー分だけの表示が可能であるサイズの場合には、画像合成/表示選択部２０７では、合成画像データからその一部領域の画像データが取り出されて表示用画像データとして出力される。そのため、この場合には、表示部２０９には、合成画像の一部領域が表示される。この際、画像合成/表示選択部２０７では、デフォルトにあっては、代表ビューの領域の画像データが取り出されるため、表示部２０９には代表ビューの画像が表示される。 Further, for example, when the display frame size of the display unit 209 is a size capable of displaying a part of the composite image, for example, displaying only one view, the image composition / display selection unit 207 displays the composite image. The image data of a part of the area is extracted from the data and output as display image data. Therefore, in this case, a part of the composite image is displayed on the display unit 209. At this time, in the image composition / display selection unit 207, since the image data in the area of the representative view is taken out by default, the image of the representative view is displayed on the display unit 209.

また、この際、ユーザ操作部２０１ａからのユーザの選択操作に応じて画像データを取り出す一部領域が変更されるとき、画像合成/表示選択部２０７では、その変更後の一部領域の画像データが取り出されて表示用画像データとして出力される。そのため、ユーザのスクロール操作に応じたスクロール表示も行われる。 Further, at this time, when the partial area for extracting the image data is changed according to the user's selection operation from the user operation unit 201a, the image composition / display selection unit 207 changes the image data of the partial area after the change. Is taken out and output as image data for display. Therefore, scroll display is also performed according to the scroll operation of the user.

また、制御部２０１では、合成画像内のどの位置の画像データが取り出されている状態にあるかを示すためのＵＩ表示データが生成される。このＵＩ表示データは、合成部２０８において、表示用画像データに合成される。この場合、合成画像データからその全領域の画像データが取り出されているときは、このＵＩ表示データによりその旨が表示され、スクロール成分がないことが示される。一方、合成画像データからその一部領域の画像データが取り出されているときは、このＵＩ表示データによりその取り出し位置が表示され、スクロール成分があることが示される。ユーザは、このＵＩ表示データにより、スクロール操作を容易に行い得る。 Further, the control unit 201 generates UI display data for indicating which position of the image data in the composite image is in the fetched state. This UI display data is combined with the display image data in the composition unit 208. In this case, when the image data of the entire area is taken out from the composite image data, this UI display data indicates that there is no scroll component. On the other hand, when the image data of a part of the area is extracted from the composite image data, the extraction position is displayed by this UI display data, and it is shown that there is a scroll component. The user can easily perform the scroll operation by using this UI display data.

以上説明したように、図１に示す送受信システム１０において、送信装置１００では、各ビューの符号化画像データに合成点（マスターポイント、ターゲットポイント）の情報を挿入して送信するものである。そのため、受信側では、合成点の情報に基づいて各ビューの合成処理を行うことが可能となり、処理負荷の低減が可能となる。 As described above, in the transmission / reception system 10 shown in FIG. 1, the transmission device 100 inserts the information of the synthesis point (master point, target point) into the coded image data of each view and transmits the data. Therefore, on the receiving side, it is possible to perform the composition processing of each view based on the information of the composition point, and it is possible to reduce the processing load.

また、図１に示す送受信システム１０において、送信装置１００では、各ビューの符号化画像データに挿入される合成点の情報のそれぞれには、ビューを識別する情報を付加するものである。そのため、受信側では、各ビューの識別を容易に行うことができる。 Further, in the transmission / reception system 10 shown in FIG. 1, in the transmission device 100, information for identifying the view is added to each of the information of the synthesis points inserted in the coded image data of each view. Therefore, the receiving side can easily identify each view.

また、図１に示す送受信システム１０において、送信装置１００では、各ビューの符号化画像データに挿入される合成点の情報のそれぞれに、代表ビューであるか否かを示す情報が付加するものである。そのため、受信側では、合成画像のうちデフォルト表示すべき一部領域として代表ビューに対応した領域を選択することが可能となる。 Further, in the transmission / reception system 10 shown in FIG. 1, in the transmission device 100, information indicating whether or not the view is a representative view is added to each of the information of the synthesis points inserted in the coded image data of each view. be. Therefore, on the receiving side, it is possible to select an area corresponding to the representative view as a part area to be displayed by default in the composite image.

また、図１に示す送受信システム１０において、送信装置１００では、各ビューの符号化画像データに挿入される合成点の情報のそれぞれに、合成ビューの組を示す情報を付加するものである。そのため、受信側では、合成ビューの組を指定して所望の合成画像を得ることが可能となる。 Further, in the transmission / reception system 10 shown in FIG. 1, the transmission device 100 adds information indicating a set of composite views to each of the information of the composite points inserted in the coded image data of each view. Therefore, on the receiving side, it is possible to specify a set of composite views and obtain a desired composite image.

また、図１に示す送受信システム１０において、受信装置２００では、合成画像内で表示すべき一部の領域をユーザが操作する操作部が備えられる。そのため、ユーザは、合成画像内の表示領域を任意に調整することが可能となる。そして、この場合、受信装置２００では、表示用画像データに、取り出された一部領域の合成画像内の位置を示すための表示データが重畳される。そのため、ユーザは、合成画像内の表示領域の位置を容易に認識でき、スクロール操作をスムーズに行うことができる。 Further, in the transmission / reception system 10 shown in FIG. 1, the receiving device 200 is provided with an operation unit in which the user operates a part of the area to be displayed in the composite image. Therefore, the user can arbitrarily adjust the display area in the composite image. Then, in this case, the receiving device 200 superimposes the display data for indicating the position in the composite image of the extracted partial region on the display image data. Therefore, the user can easily recognize the position of the display area in the composite image and can smoothly perform the scroll operation.

＜２．変形例＞
なお、上述実施の形態においては、合成点のセットを１つのマスターポイントと１つのターゲットポイントで構成する例を示した。しかし、合成点のセットを１つのマスターポイントと２つ以上のターゲットポイントで構成することも考えられる。その場合、合成処理では、基準となるマスタ―ポイントを起点に、２つ以上のターゲットポイント同士が重なるように、必要に応じて回転やスケーリング（非線形な回転やスケーリングも含まれる）が行われる。<2. Modification example>
In the above-described embodiment, an example is shown in which a set of synthesis points is composed of one master point and one target point. However, it is also conceivable to configure a set of composite points with one master point and two or more target points. In that case, in the synthesis process, rotation or scaling (including non-linear rotation and scaling) is performed as necessary so that two or more target points overlap each other starting from the reference master point.

また、上述実施の形態においては、送信装置１００と受信装置２００からなる送受信システム１０を示したが、本技術を適用し得る送受信システムの構成は、これに限定されるものではない。例えば、受信装置２００の部分が、例えば、（ＨＤＭＩ（High-Definition Multimedia Interface）などのデジタルインタフェースで接続されたセットトップボックスおよびモニタの構成などであってもよい。なお、「ＨＤＭＩ」は、登録商標である。 Further, in the above-described embodiment, the transmission / reception system 10 including the transmission device 100 and the reception device 200 is shown, but the configuration of the transmission / reception system to which the present technology can be applied is not limited to this. For example, the portion of the receiving device 200 may be, for example, a configuration of a set-top box and a monitor connected by a digital interface such as (HDMI (High-Definition Multimedia Interface)). Note that “HDMI” is registered. It is a trademark.

また、上述実施の形態においては、コンテナがトランスポートストリーム（ＭＰＥＧ－２ＴＳ）である例を示した。しかし、本技術は、インターネット等のネットワークを利用して受信端末に配信される構成のシステムにも同様に適用できる。インターネットの配信では、ＩＳＯＢＭＦＦ（あるいはＭＰ４）やそれ以外のフォーマットのコンテナで配信されることが多い。つまり、コンテナとしては、デジタル放送規格で採用されているトランスポートストリーム（ＭＰＥＧ－２ＴＳ）、インターネット配信で使用されているＩＳＯＢＭＦＦ（あるいはＭＰ４）などの種々のフォーマットのコンテナが該当する。 Further, in the above-described embodiment, an example in which the container is a transport stream (MPEG-2 TS) is shown. However, this technique can be similarly applied to a system configured to be delivered to a receiving terminal using a network such as the Internet. In Internet distribution, it is often distributed in containers of ISOBMFF (or MP4) or other formats. That is, the container corresponds to a container of various formats such as the transport stream (MPEG-2 TS) adopted in the digital broadcasting standard and ISOBMFF (or MP4) used in the Internet distribution.

また、本技術は、以下のような構成を取ることもできる。
（１）合成可能な複数のビューの符号化画像データからなる符号化ストリームを送信する送信部と、
上記各ビューの符号化画像データに合成点の情報を挿入する情報挿入部を備える
送信装置。
（２）上記合成点には、合成時の基準点と非基準点が含まれる
前記（１）に記載の送信装置。
（３）上記合成点の情報は、画像のトップレフトを原点とするｘ座標およびｙ座標からなる
前記（１）または（２）に記載の送信装置。
（４）上記各ビューの符号化画像データに挿入される上記合成点の情報のそれぞれには、ビューを識別する情報が付加される
前記（１）から（３）のいずれかに記載の送信装置。
（５）上記各ビューの符号化画像データに挿入される上記合成点の情報のそれぞれには、代表ビューであるか否かを示す情報が付加される
前記（１）から（４）のいずれかに記載の送信装置。
（６）上記各ビューの符号化画像データに挿入される上記合成点の情報のそれぞれには、合成ビューの組を示す情報が付加される
前記（１）から（５）のいずれかに記載の送信装置。
（７）送信部が、合成可能な複数のビューの符号化画像データからなる符号化ストリームを送信する送信ステップと、
情報挿入部が、上記各ビューの符号化画像データに合成点の情報を挿入する情報挿入ステップを有する
送信方法。
（８）合成可能な複数のビューの符号化画像データからなる符号化ストリームを受信する受信部を備え、
上記各ビューの符号化画像データには、合成点の情報が挿入されており、
上記複数のビューから選択された所定数のビューの画像データを、上記合成点の情報に基づいて合成して合成画像データを得る合成部をさらに備える
受信装置。
（９）上記各ビューの符号化画像データに挿入される上記合成点の情報のそれぞれには、合成ビューの組を示す情報が付加されており、
上記合成部は、上記合成ビューの組の指定情報で指定された組に属するビューの画像データを合成する
前記（８）に記載の受信装置。
（１０）上記合成画像データから一部の領域の画像データを選択的に取り出して表示用画像データを得る表示処理部をさらに備える
前記（８）または（９）に記載の受信装置。
（１１）上記各ビューの符号化画像データに挿入される上記合成点の情報のそれぞれには、代表ビューであるか否かを示す情報が付加されており、
上記表示処理部は、デフォルトでは、上記合成画像データから上記代表ビューの領域の画像データを取り出して表示用画像データを得る
前記（１０）に記載の受信装置。
（１２）上記一部の領域をユーザが操作する操作部をさらに備える
前記（１０）または（１１）に記載の受信装置。
（１３）上記表示処理部は、上記表示用画像データに、上記取り出された一部領域の合成画像内の位置を示すため表示データを重畳する
前記（１２）に記載の受信装置。
（１４）受信部が、合成可能な複数のビューの符号化画像データからなる符号化ストリームを受信する受信ステップを有し、
上記各ビューの符号化画像データには、合成点の情報が挿入されており、
合成部が、上記複数のビューから選択された所定数のビューの画像データを、上記合成点の情報に基づいて合成する合成ステップをさらに有する
受信方法。In addition, the present technology can also have the following configurations.
(1) A transmitter for transmitting a coded stream composed of coded image data of a plurality of views that can be synthesized, and a transmission unit.
A transmission device including an information insertion unit that inserts information of a synthesis point into the coded image data of each of the above views.
(2) The transmission device according to (1) above, wherein the composite point includes a reference point and a non-reference point at the time of synthesis.
(3) The transmission device according to (1) or (2) above, wherein the information of the composite point is composed of x-coordinates and y-coordinates whose origin is the top left of the image.
(4) The transmission device according to any one of (1) to (3) above, wherein information for identifying the view is added to each of the information of the synthesis point inserted in the coded image data of each view. ..
(5) Any of the above (1) to (4), in which information indicating whether or not the view is a representative view is added to each of the information of the composite points inserted in the coded image data of each of the views. The transmitter described in.
(6) The description according to any one of (1) to (5) above, wherein information indicating a set of composite views is added to each of the information of the composite points inserted in the coded image data of each view. Transmitter.
(7) A transmission step in which the transmission unit transmits a coded stream composed of coded image data of a plurality of views that can be synthesized.
A transmission method in which the information insertion unit includes an information insertion step for inserting information of a synthesis point into the coded image data of each of the above views.
(8) A receiver for receiving a coded stream composed of coded image data of a plurality of views that can be synthesized is provided.
Information on the composition point is inserted in the coded image data of each of the above views.
A receiving device further comprising a compositing unit for synthesizing image data of a predetermined number of views selected from the plurality of views based on the information of the compositing points to obtain composite image data.
(9) Information indicating a set of composite views is added to each of the information of the composite points inserted in the coded image data of each view.
The receiving device according to (8) above, wherein the compositing unit synthesizes image data of a view belonging to the set specified by the designated information of the set of the composite view.
(10) The receiving device according to (8) or (9) above, further comprising a display processing unit that selectively extracts image data of a part of the area from the composite image data to obtain display image data.
(11) Information indicating whether or not the view is a representative view is added to each of the information of the composite points inserted in the coded image data of each of the views.
By default, the display processing unit extracts image data in a region of the representative view from the composite image data to obtain display image data. The receiving device according to (10).
(12) The receiving device according to (10) or (11), further comprising an operation unit in which a user operates a part of the above area.
(13) The receiving device according to (12), wherein the display processing unit superimposes display data on the display image data in order to indicate a position in the composite image of the extracted partial region.
(14) The receiving unit has a receiving step of receiving a coded stream composed of coded image data of a plurality of views that can be synthesized.
Information on the composition point is inserted in the coded image data of each of the above views.
A receiving method in which the compositing unit further includes a compositing step of composing image data of a predetermined number of views selected from the plurality of views based on the information of the compositing points.

本技術の主な特徴は、各ビューの符号化画像データに合成点（マスターポイント、ターゲットポイント）の情報を挿入して送信することで、受信側では、合成点の情報に基づいて各ビューの合成処理を行うことを可能とし、処理負荷の低減を可能としたことである（図２、図３参照）。 The main feature of this technique is that the information of the composite point (master point, target point) is inserted into the coded image data of each view and transmitted, and the receiving side receives the information of each view based on the information of the composite point. It is possible to perform the synthesis processing and reduce the processing load (see FIGS. 2 and 3).

１０・・・送受信システム
１００・・・送信装置
１０１・・・制御部
１０２ａ・・・ユーザ操作部
１０２ｂ・・・表示部
１０３・・・非圧縮データバッファ
１０４・・・ビデオエンコーダ
１０５・・・圧縮データバッファ
１０６・・・ＴＳフォーマッタ
１０７・・・送信部
２００・・・受信装置
２０１・・・制御部
２０１ａ・・・ユーザ操作部
２０２・・・受信部
２０３・・・ＴＳ解析部
２０４・・・圧縮データバッファ
２０５・・・ビデオデコーダ
２０６・・・非圧縮データバッファ
２０７・・・画像合成/表示選択部
２０８・・・合成部
２０９・・・表示部10 ... Transmission / reception system 100 ... Transmission device 101 ... Control unit 102a ... User operation unit 102b ... Display unit 103 ... Uncompressed data buffer 104 ... Video encoder 105 ... Compression Data buffer 106 ... TS formatter 107 ... Transmitter 200 ... Receiver 201 ... Control unit 201a ... User operation unit 202 ... Receiver 203 ... TS analysis unit 204 ... Compressed data buffer 205 ・・・ Video decoder 206 ・・・ Uncompressed data buffer 207 ・・・ Image composition / display selection unit 208 ・・・ Composition unit 209 ・・・ Display unit

Claims

A transmitter that sends a coded stream consisting of coded image data from multiple views that can be combined, and a transmitter.
It is provided with an information insertion unit for inserting information on a composite point including information on two points, a reference point and a non-reference point at the time of synthesis, into the coded image data of each of the above views by adding identification information of the composite point. Transmitter.

The transmission device according to claim 1, wherein the information of the synthesis point is composed of x-coordinates and y-coordinates whose origin is the top left of the image.

The transmission device according to claim 1, wherein information for identifying the view is added to each of the information of the synthesis point inserted in the coded image data of each view.

The transmission device according to claim 1, wherein information indicating whether or not the view is a representative view is added to each of the information of the composite points inserted in the coded image data of each of the views.

The transmission device according to claim 1, wherein information indicating a set of composite views is added to each of the information of the composite points inserted in the coded image data of each of the views.

A transmission step in which the transmitter transmits a coded stream consisting of coded image data of a plurality of views that can be synthesized.
The information insertion unit inserts the information of the composite point including the information of the two points of the reference point and the non-reference point at the time of synthesis into the coded image data of each of the above views by adding the identification information of the composite point. A transmission method with an information insertion step.

It has a receiver that receives a coded stream consisting of coded image data of multiple views that can be synthesized.
In the coded image data of each of the above views, the information of the composite point including the information of the two points of the reference point and the non-reference point at the time of synthesis is inserted with the identification information of the composite point added .
A receiving device further comprising a compositing unit for synthesizing image data of a predetermined number of views selected from the plurality of views based on the information of the compositing points to obtain composite image data.

Information indicating a set of composite views is added to each of the information of the composite points inserted in the coded image data of each of the views.
The receiving device according to claim 7 , wherein the compositing unit synthesizes image data of a view belonging to the set designated by the designated information of the set of the composite view.

The receiving device according to claim 7 , further comprising a display processing unit for selectively extracting image data of a part of the area from the composite image data to obtain image data for display.

Information indicating whether or not the view is a representative view is added to each of the information of the composite points inserted in the coded image data of each of the views.
The receiving device according to claim 9 , wherein the display processing unit extracts image data in a region of the representative view from the composite image data to obtain image data for display by default.

The receiving device according to claim 9 , further comprising an operation unit in which the user operates a part of the above area.

The receiving device according to claim 11 , wherein the display processing unit superimposes display data on the display image data in order to indicate a position in the composite image of the extracted partial region.

The receiver has a receive step of receiving a coded stream consisting of coded image data of a plurality of views that can be synthesized.
In the coded image data of each of the above views, the information of the composite point including the information of the two points of the reference point and the non-reference point at the time of synthesis is inserted with the identification information of the composite point added .
A receiving method in which the compositing unit further includes a compositing step of composing image data of a predetermined number of views selected from the plurality of views based on the information of the compositing points.