JP6468897B2

JP6468897B2 - Image processing apparatus, image processing method, and program

Info

Publication number: JP6468897B2
Application number: JP2015049640A
Authority: JP
Inventors: 政博有泉; 賀保橋口; 大介中嶋
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2015-03-12
Filing date: 2015-03-12
Publication date: 2019-02-13
Anticipated expiration: 2035-03-12
Also published as: JP2016170602A

Description

本発明は、特に、画像を座標など属性が類似した複数の領域に分割するために用いて好適な画像処理装置、画像処理方法及びプログラムに関する。 The present invention particularly relates to an image processing apparatus, an image processing method, and a program suitable for use in dividing an image into a plurality of regions having similar attributes such as coordinates.

近年、画像を色やテクスチャ、座標など属性が類似した複数の領域に分割する技術が注目されている。このように分割された領域はスーパーピクセルと呼ばれ、分割された領域の単位で符号化処理や、画像処理、画像認識処理等を実行できるため、様々な画像処理装置におけて応用することができる。 In recent years, a technique for dividing an image into a plurality of regions having similar attributes such as color, texture, and coordinates has attracted attention. The area divided in this way is called a super pixel, and since encoding processing, image processing, image recognition processing, and the like can be executed in units of the divided area, it can be applied to various image processing apparatuses. it can.

従来、画像の領域分割に関して様々な手法が提案されている。特に、画素をクラスタリングすることによって画像を複数の領域に分割する手法が多く提案されている。これらの手法では、色やテクスチャ、座標を要素にもつデータをクラスタリングし、クラスタリングでは、各クラス（領域）の代表データを求め、入力データを最近傍代表データに割り当てる。ここで、最近傍代表データとは、入力データに対する複数の代表候補データの中で、入力データとの距離が最短の代表候補データのことである。 Conventionally, various methods have been proposed for image segmentation. In particular, many techniques for dividing an image into a plurality of regions by clustering pixels have been proposed. In these methods, data having colors, textures, and coordinates as elements are clustered. In the clustering, representative data of each class (region) is obtained, and input data is assigned to the nearest representative data. Here, the nearest representative data is representative candidate data having a shortest distance from the input data among a plurality of representative candidate data for the input data.

非特許文献１には、ＳＬＩＣ（Simple Linear Iterative Clustering）と呼ばれるクラスタリング手法を用いて複数の領域に分割する方法が提案されている。ＳＬＩＣは、Ｋ−ｍｅａｎｓクラスタリングの最近傍代表データの探索範囲を画像中の局所領域に限定した手法である。ＳＬＩＣでは、クラスタリングの距離関数として座標空間の距離と色空間の距離との重み付き総和を用いている。非特許文献１には、これら二つの距離の重みを適応的に設定する方法としてＡＳＬＩＣ（Adaptive-SLIC）が提案されている。ＡＳＬＩＣでは、各領域の代表データからの色空間の距離と座標空間の距離との最大値を用いて色距離と座標距離とを正規化することにより二つの重みを適応的に設定している。 Non-Patent Document 1 proposes a method of dividing into a plurality of regions using a clustering technique called SLIC (Simple Linear Iterative Clustering). SLIC is a technique in which the search range of nearest neighbor representative data in K-means clustering is limited to a local region in an image. In SLIC, a weighted sum of a distance in a coordinate space and a distance in a color space is used as a distance function for clustering. Non-Patent Document 1 proposes ASLIC (Adaptive-SLIC) as a method of adaptively setting the weights of these two distances. In ASLIC, two weights are adaptively set by normalizing the color distance and the coordinate distance using the maximum value of the color space distance and the coordinate space distance from the representative data of each region.

また、非特許文献２に記載の手法では、入力データと代表候補データとの距離計算において、注目する入力データの近傍に異なるクラスの入力データが存在する程距離が大きくなる項を導入している。これにより、分割される形状が円形に近くなるようにクラスタリングしている。 Further, in the method described in Non-Patent Document 2, in the distance calculation between input data and representative candidate data, a term is introduced in which the distance increases as different classes of input data exist in the vicinity of the input data of interest. . Thus, clustering is performed so that the shape to be divided is close to a circle.

Radhakrishna Achanta, et al., "SLIC Superpixels Compared to State-of-the-Art Superpixel Methods," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, No. 11, pp. 2274-2282, Nov. 2012.Radhakrishna Achanta, et al., "SLIC Superpixels Compared to State-of-the-Art Superpixel Methods," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, No. 11, pp. 2274-2282, Nov. 2012. J. Wang, and X. Wang, "VCells: Simple and Efficient Superpixels Using Edge-Weighted Centroidal Voronoi Tessellations," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, No. 6, pp. 1241-1247, Jun. 2012.J. Wang, and X. Wang, "VCells: Simple and Efficient Superpixels Using Edge-Weighted Centroidal Voronoi Tessellations," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, No. 6, pp. 1241-1247, Jun. 2012.

クラスタリングは画像の領域分割においてよく用いられる技術である。ただし、クラスタリングによる領域分割では、フラグメンテーションが発生することがある。フラグメンテーションとは、同一クラス内の画素が座標空間上で連続していない状態のことである。領域分割結果を応用する場合には、フラグメンテーションが発生しないことが好ましい。 Clustering is a technique often used in image segmentation. However, fragmentation may occur in region division by clustering. Fragmentation is a state in which pixels in the same class are not continuous in the coordinate space. When applying the region division result, it is preferable that no fragmentation occurs.

フラグメンテーションの発生度合いは、色距離の重み及び座標距離の重みの設定によって変化する。色距離の重みを大きくした場合は、色のエッジで領域が分割される傾向が強くなるが、フラグメンテーションが発生し易い傾向となる。一方、座標距離の重みを大きくした場合は逆の傾向となり、フラグメンテーションは発生し難い傾向となるが、色のエッジで領域分割され難くなる傾向となる。このように領域分割精度とフラグメンテーションとではトレードオフの関係にあり、領域分割の応用先が必要とする領域分割精度とフラグメンテーションの発生度合いとを満たす重みを決定することが難しいという課題がある。 The degree of occurrence of fragmentation varies depending on the setting of the color distance weight and the coordinate distance weight. When the weight of the color distance is increased, the tendency of the area to be divided at the edge of the color becomes strong, but fragmentation tends to occur easily. On the other hand, when the weight of the coordinate distance is increased, the tendency tends to be opposite, and fragmentation tends not to occur, but it tends to be difficult to divide the region at the color edge. Thus, there is a trade-off relationship between the region division accuracy and fragmentation, and there is a problem that it is difficult to determine a weight that satisfies the region division accuracy and the degree of fragmentation required by the application destination of region division.

非特許文献１に開示されているＡＳＬＩＣでは、前述したように、各領域の代表点データからの色距離の最大値と座標距離の最大値とを用いて色距離と座標距離とを正規化し、二つの距離の重みを適応的に決定している。しかし、この方法は座標距離と色距離とを正規化しているのであって、フラグメンテーションの軽減という目標に基づいて重みを決定しているわけではない。したがって、ＡＳＬＩＣでは、領域分割の応用先が必要とする領域分割精度とフラグメンテーションの発生度合いとを満たす結果に必ずしもなっていないという課題がある。 In the ASLIC disclosed in Non-Patent Document 1, as described above, the color distance and the coordinate distance are normalized using the maximum value of the color distance and the maximum value of the coordinate distance from the representative point data of each region, The weight of the two distances is determined adaptively. However, this method normalizes the coordinate distance and the color distance, and does not determine the weight based on the goal of reducing fragmentation. Therefore, in ASLIC, there is a problem that the result does not necessarily satisfy the region division accuracy and the degree of fragmentation required by the application destination of region division.

また、非特許文献２に記載の方式では、フラグメンテーションの発生を抑制することができるが、各入力データについて代表候補データとの距離計算時に、近傍の入力データのクラス情報を参照する必要がある。このため、処理コストが大きいという課題がある。 Further, the method described in Non-Patent Document 2 can suppress the occurrence of fragmentation, but it is necessary to refer to the class information of neighboring input data when calculating the distance between each input data and representative candidate data. For this reason, there exists a subject that processing cost is large.

本発明は前述の問題点に鑑み、所望の領域分割精度で画像の領域分割を行う際に、処理コストを抑えてフラグメンテーションの発生を抑制できるようにすることを目的としている。 In view of the above-described problems, an object of the present invention is to suppress the generation of fragmentation while suppressing processing costs when performing image region division with desired region division accuracy.

本発明に係る画像処理装置は、画像中の複数の領域に対してそれぞれ代表データを設定する代表データ設定手段と、前記代表データとの座標及び色に係る距離に基づいて、前記画像中のそれぞれの画素を前記複数の領域の中の何れかの代表データと関連付ける関連付け手段と、前記関連付け手段によって関連付けされた画素の集合で構成された領域の評価値を算出する算出手段と、前記評価値の目標値を設定する目標値設定手段と、前記評価値が前記目標値に近づくように前記座標と前記色との間の重みを制御する重み制御手段とを備え、前記関連付け手段は、前記重み制御手段によって制御された重みに従って前記座標及び色に係る距離を算出し、前記算出された距離に基づいて、前記画像中のそれぞれの画素を前記何れかの代表データと関連付けることを特徴とする。 The image processing apparatus according to the present invention includes a representative data setting unit that sets representative data for each of a plurality of regions in an image, and the distance between the representative data based on coordinates and colors, respectively. An association means for associating each pixel with any representative data in the plurality of areas, a calculation means for calculating an evaluation value of an area formed by a set of pixels associated by the association means, Target value setting means for setting a target value; weight control means for controlling a weight between the coordinates and the color so that the evaluation value approaches the target value; and the association means includes the weight control Calculating a distance related to the coordinates and color according to the weight controlled by the means, and based on the calculated distance, each pixel in the image is set as any one of the representative data. Characterized in that it put with each other.

本発明によれば、所望の領域分割精度で画像の領域分割を行う際に、処理コストを抑えてフラグメンテーションの発生を抑制することができる。 According to the present invention, when image segmentation is performed with desired region segmentation accuracy, it is possible to suppress the processing cost and suppress the occurrence of fragmentation.

実施形態におけるクラスタリング処理の処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the process sequence of the clustering process in embodiment. 画像中のブロック及びその代表データを説明するための図である。It is a figure for demonstrating the block in an image, and its representative data. フラグメンテーションが発生している領域分割結果の一例を示す図である。It is a figure which shows an example of the area division | segmentation result in which fragmentation has generate | occur | produced. 実施形態におけるデータ処理装置のハードウェア構成例を示すブロック図である。It is a block diagram which shows the hardware structural example of the data processor in embodiment.

以下、本発明の好適な実施形態について、図面を参照しながら詳細に説明する。本実施形態では、本発明による画像領域分割方法を用いて画像を領域分割する例について説明する。 DESCRIPTION OF EXEMPLARY EMBODIMENTS Hereinafter, preferred embodiments of the invention will be described in detail with reference to the drawings. In the present embodiment, an example in which an image is divided into regions using the image region dividing method according to the present invention will be described.

＜データ処理装置の構成例＞
図４は、本実施形態におけるクラスタリング方法を実現可能なデータ処理装置４００のハードウェア構成例を示すブロック図である。
図４において、データ保存部４０１は、画像データを保持する記憶媒体である。データ保存部４０１は、ハードディスク、フレキシブルディスク、ＣＤ−ＲＯＭ、ＣＤ−ＲやＤＶＤ、メモリーカード、ＣＦカード、スマートメディア、ＳＤカード、メモリスティック、ｘＤピクチャーカード、ＵＳＢメモリ等で構成される。また、データ保存部４０１には、画像データの他にも、プログラムやその他のデータを保存することも可能である。あるいは、後述するＲＡＭ４０６の一部をデータ保存部４０１として用いてもよい。さらに、後述する通信部４０２により接続した先の機器の記憶装置を、通信部４０２を介して利用するというように仮想的に構成するのであってもよい。 <Configuration example of data processing device>
FIG. 4 is a block diagram illustrating a hardware configuration example of the data processing apparatus 400 that can implement the clustering method according to the present embodiment.
In FIG. 4, a data storage unit 401 is a storage medium that holds image data. The data storage unit 401 includes a hard disk, flexible disk, CD-ROM, CD-R, DVD, memory card, CF card, smart media, SD card, memory stick, xD picture card, USB memory, and the like. In addition to image data, the data storage unit 401 can also store programs and other data. Alternatively, a part of the RAM 406 described later may be used as the data storage unit 401. Furthermore, a storage device of a destination device connected by the communication unit 402 described later may be virtually configured to be used via the communication unit 402.

表示部４０３は、領域分割処理を行う前後の画像を表示したり、ＧＵＩ等の画像を表示したりする装置であり、一般的にはＣＲＴや液晶ディスプレイなどが用いられる。あるいは、ケーブル等で接続された外部のディスプレイ装置であっても構わない。 The display unit 403 is a device that displays images before and after performing the area division processing, or displays an image such as a GUI, and generally uses a CRT, a liquid crystal display, or the like. Alternatively, it may be an external display device connected by a cable or the like.

ＣＰＵ４０４は、本実施形態に係る主要な処理を実行すると共にデータ処理装置４００全体の動作を制御する。ＲＯＭ４０５及びＲＡＭ４０６は、その処理に必要なプログラム、データ、作業領域などをＣＰＵ４０４に提供する。後述する処理に必要なプログラムがデータ保存部４０１に格納されている場合や、ＲＯＭ４０５に格納されている場合には、ＲＡＭ４０６に一旦書き込まれてから後述する処理が実行される。また、通信部４０２を経由してデータ処理装置４００がプログラムを受信する場合には、データ保存部４０１に一旦記録した後にＲＡＭ４０６に書き込まれるか、通信部４０２からＲＡＭ４０６に直接書き込まれてから処理が実行される。 The CPU 404 executes main processing according to the present embodiment and controls the overall operation of the data processing device 400. The ROM 405 and the RAM 406 provide the CPU 404 with programs, data, work areas, and the like necessary for the processing. When a program necessary for processing to be described later is stored in the data storage unit 401 or stored in the ROM 405, the processing described later is executed after being once written in the RAM 406. When the data processing apparatus 400 receives a program via the communication unit 402, the program is temporarily recorded in the data storage unit 401 and then written in the RAM 406 or directly from the communication unit 402 to the RAM 406. Executed.

ＣＰＵ４０４は、ＲＡＭ４０６またはデータ保存部４０１に書き込まれた画像データを読み出して本実施形態に係る画像処理（クラスタリング処理）を実行する。代表データ等の処理途中のデータはＣＰＵ４０４によりＲＡＭ４０６に書き込まれ、必要な場合に読み出される。そして、クラスタリングの処理結果がＲＡＭ４０６に書き込まれる。また、ＣＰＵ４０４は、表示部４０３に画像を表示する制御を行ったり、通信部４０２を介して外部装置に画像データを送信する制御を行ったりする。 The CPU 404 reads image data written in the RAM 406 or the data storage unit 401 and executes image processing (clustering processing) according to the present embodiment. Data in the middle of processing such as representative data is written into the RAM 406 by the CPU 404 and is read out when necessary. Then, the clustering processing result is written in the RAM 406. Further, the CPU 404 performs control for displaying an image on the display unit 403, and performs control for transmitting image data to an external apparatus via the communication unit 402.

通信部４０２は、機器間の通信を行うためのＩ／Ｆである。Ｉ／Ｆとしては、例えば、公知のローカルエリアネットワーク、ＵＳＢ、ＩＥＥＥ１２８４、ＩＥＥＥ１３９４、電話回線などの有線による通信方式であってもよい。また、赤外線（ＩｒＤＡ）、ＩＥＥＥ８０２．１１ａ、ＩＥＥＥ８０２．１１ｂ、ＩＥＥＥ８０２．１１ｇ、ＩＥＥＥ８０２．１１ｎ、Ｂｌｕｅｔｏｏｔｈ（登録商標）、ＵＷＢ（ＵｌｔｒａｗｉｄｅＢａｎｄ）等の無線通信方式であってもよい。 The communication unit 402 is an I / F for performing communication between devices. As the I / F, for example, a well-known local area network, USB, IEEE1284, IEEE1394, a wired communication system such as a telephone line may be used. Further, a wireless communication system such as infrared (IrDA), IEEE 802.11a, IEEE 802.11b, IEEE 802.11g, IEEE 802.11n, Bluetooth (registered trademark), UWB (Ultra wide Band), or the like may be used.

なお、図４に示す例では、データ保存部４０１、表示部４０３が全て１つの装置内に含まれるような例を示しているが、これらの部分が公知の通信方式による通信路で接続されており、全体としてこのような構成となっているのであっても構わない。また、図４に示す例においては、ＣＰＵが１つだけの構成であるが、ＣＰＵを複数設けるような構成にしてもよい。システム構成については、上記以外にも様々な構成要素が存在してもよいが、本発明の主眼ではないのでその説明は省略する。 In the example shown in FIG. 4, an example is shown in which the data storage unit 401 and the display unit 403 are all included in one device, but these parts are connected by a communication path using a known communication method. Therefore, the configuration as a whole may be used. Further, in the example shown in FIG. 4, the configuration has only one CPU, but a configuration in which a plurality of CPUs are provided may be employed. As for the system configuration, various components other than those described above may exist, but the description thereof is omitted because it is not the main point of the present invention.

以下、図４に示したデータ処理装置４００が行う処理の概要について説明する。本実施形態では、画素データを二次元の座標と三次元の色特徴と計五次元の入力データとして使用し、この入力データを代表データにクラスタリングすることによって領域分割を行う。このとき、代表データと入力データとの間の距離は、座標空間上の距離（以下、座標距離）と色空間上の距離（以下、色距離）との重み付き総和で定義する。 Hereinafter, an outline of processing performed by the data processing apparatus 400 illustrated in FIG. 4 will be described. In this embodiment, pixel data is used as two-dimensional coordinates, three-dimensional color features, and five-dimensional input data in total, and this input data is clustered into representative data to perform region division. At this time, the distance between the representative data and the input data is defined by a weighted sum of a distance in the coordinate space (hereinafter referred to as coordinate distance) and a distance in the color space (hereinafter referred to as color distance).

座標距離及び色距離の重みは領域分割の結果を左右するパラメータであるため、領域分割の応用先で必要となる領域分割精度及び領域形状を得るためには適切な重みを設定する必要がある。そこで本実施形態では、領域分割結果を評価する評価関数とその目標値とを定義し、領域分割結果の評価値が目標値に近づくように重みを制御する。また、本実施形態では、領域のフラグメンテーションを軽減するために、評価関数として領域形状の複雑度を用い、各領域の複雑度が目標値に近づくように領域単位で重みを制御する。 Since the weights of the coordinate distance and the color distance are parameters that influence the result of area division, it is necessary to set appropriate weights in order to obtain the area division accuracy and area shape necessary for the application of area division. Therefore, in the present embodiment, an evaluation function for evaluating the region division result and its target value are defined, and the weight is controlled so that the evaluation value of the region division result approaches the target value. In this embodiment, in order to reduce fragmentation of a region, the complexity of the region shape is used as an evaluation function, and the weight is controlled in units of regions so that the complexity of each region approaches the target value.

＜領域分割のためのクラスタリング＞
図１は、本実施形態におけるクラスタリング処理の処理手順の一例を示すフローチャートである。なお、図１に示す各ステップはＣＰＵ４０４の動作を示すものとする。
まず、クラスタリング対象のデータについて説明する。以下の式（１）は第ｉ番目の入力データを示す。入力データは画像中の画素と対応する５次元のベクトルであり、第ｉ番目の入力データとは、画像をラスタスキャンして第ｉ番目に読み出された画素に関する情報である。
Ｘ_i＝（ｘ_i,1，ｘ_i,2，ｘ_i,3，ｘ_i,4，ｘ_i,5）・・・（１） <Clustering for region division>
FIG. 1 is a flowchart illustrating an example of a processing procedure of clustering processing in the present embodiment. Each step shown in FIG. 1 indicates the operation of the CPU 404.
First, clustering target data will be described. Equation (1) below shows the i-th input data. The input data is a five-dimensional vector corresponding to the pixel in the image, and the i-th input data is information regarding the i-th pixel read out by raster scanning the image.
X _i = (x _{i, 1} , x _{i, 2} , x _{i, 3} , x _{i, 4} , x _{i, 5} ) (1)

式（１）中のｘ_i,1、ｘ_i,2は二次元の座標空間上の位置を表し、それぞれ画像の縦方向、横方向の座標値を意味する。また、ｘ_i,3、ｘ_i,4、ｘ_i,5は画素の色成分を表わす三次元の色特徴である。 In formula (1), x _{i, 1} and x _{i, 2} represent positions in a two-dimensional coordinate space, and mean coordinate values in the vertical and horizontal directions of the image, respectively. X _{i, 3} , x _{i, 4} and x _{i, 5} are three-dimensional color features representing the color components of the pixels.

まず、ステップＳ１０１においては、初期化した代表データを生成する。具体的には、代表データ設定処理として、例えば図２に示した画像２０１を座標空間上のブロックに分割し、各々のブロック２０３の中に１個の代表データ２０５を設定する。そして、ブロック２０３内の入力データをランダムに選択し、選択した入力データの座標と色特徴とを要素とするベクトルをその代表データのベクトルとする。以下の式（２）は第ｊ番目の代表データＣ_jを示す。ここで、ｊとは、図２に示すように配置した代表データにラスター順に割り当てたインデックスである。
Ｃ_j＝（ｃ_j,1，ｃ_j,2，ｃ_j,3，ｃ_j,4，ｃ_j,5）・・・（２） First, in step S101, initialized representative data is generated. Specifically, as representative data setting processing, for example, the image 201 shown in FIG. 2 is divided into blocks on the coordinate space, and one representative data 205 is set in each block 203. Then, input data in the block 203 is selected at random, and a vector having the coordinates and color features of the selected input data as elements is set as a vector of the representative data. Equation (2) below shows the j-th representative data C _j . Here, j is an index assigned to the representative data arranged as shown in FIG. 2 in raster order.
C _j = (c _{j, 1} , c _{j, 2} , c _{j, 3} , c _{j, 4} , c _{j, 5} ) (2)

式（２）中のｃ_j,1、ｃ_j,2は二次元の座標空間上の位置を表し、それぞれ画像の縦方向、横方向の座標値を意味する。また、ｃ_j,3、ｃ_j,4、ｃ_j,5は画素の色成分を表わす三次元の色特徴である。 In equation (2), c _{j, 1} and c _{j, 2} represent positions in a two-dimensional coordinate space, and mean coordinate values in the vertical and horizontal directions of the image, respectively. Further, c _{j, 3} , c _{j, 4} , and c _{j, 5} are three-dimensional color features representing the color components of the pixels.

次にステップＳ１０２においては、領域分割結果の評価値の目標値を設定する。領域分割結果の評価関数及び目標値の定義については後述する。 In step S102, the target value of the evaluation value of the region division result is set. The definition of the evaluation function of the area division result and the target value will be described later.

ステップＳ１０３においては、所定の最大繰り返し回数を設定し、繰り返し回数ｔを制御してステップＳ１０４〜Ｓ１１３までの繰り返し計算のループを開始する。ｔの初期値は１とする。
ステップＳ１０４においては、入力データのループを開始する。画像の中の入力データＸ_iの順番ｉを制御し、ステップＳ１０５からステップＳ１０９までの処理を繰り返して全ての入力データをクラスタリングする。 In step S103, a predetermined maximum number of repetitions is set, the number of repetitions t is controlled, and a repetition calculation loop from steps S104 to S113 is started. The initial value of t is 1.
In step S104, a loop of input data is started. The order i of the input data X _i in the image is controlled, and the processing from step S105 to step S109 is repeated to cluster all input data.

まず、ステップＳ１０５において、探索範囲を設定する。具体的には、入力データの座標位置によって代表候補データの探索範囲を設定する。ここで、入力データの代表候補データとは、該当する入力データを割り当てる（帰属させる）代表データの候補である。 First, in step S105, a search range is set. Specifically, the search range for representative candidate data is set according to the coordinate position of the input data. Here, the representative candidate data of the input data is a candidate of representative data to which the corresponding input data is assigned (belongs to).

ここで、探索範囲２０２の幅および高さがそれぞれ５個の代表データの例について説明する。まず、処理対象の入力データ（以下、注目入力データ）２０６が含まれるブロックを中心に、代表候補データの探索範囲を設定する。この場合、探索範囲は５×５個の座標近傍の代表データである。探索範囲２０２は、縦方向が５〜９までであり、横方向が６〜１０までである。図２に示す例の場合、１ラインあたりの代表データ数が１５個であるため、探索範囲２０２に含まれるインデックスは、６６〜７０、８１〜８５、９６〜１００、１１１〜１１５、１２６〜１３０である。したがって、注目入力データ２０６を含め、ブロック２０４の範囲に入っている全ての入力データの探索範囲２０２は同じである。なお、入力データの位置と探索範囲の代表候補データとの関係は最初に固定され、繰り返し処理によって代表データの位置が変わっても、代表データのインデックスは変わらないものとする。 Here, an example of representative data in which the width and height of the search range 202 are each five will be described. First, a search range of representative candidate data is set around a block including input data to be processed (hereinafter referred to as attention input data) 206. In this case, the search range is representative data in the vicinity of 5 × 5 coordinates. The search range 202 has a vertical direction of 5 to 9, and a horizontal direction of 6 to 10. In the case of the example shown in FIG. 2, since the number of representative data per line is 15, the indexes included in the search range 202 are 66 to 70, 81 to 85, 96 to 100, 111 to 115, 126 to 130. It is. Therefore, the search range 202 of all input data included in the range of the block 204 including the target input data 206 is the same. It is assumed that the relationship between the position of the input data and the representative candidate data in the search range is fixed first, and the index of the representative data does not change even if the position of the representative data is changed by iterative processing.

ステップＳ１０６〜Ｓ１０８は、入力データの最近傍代表データを探索するステップである。ここで、最近傍代表データとは、入力データとの距離が最短な代表候補データ、即ち、距離空間上一番近い代表データである。各ステップにおける処理内容を説明する前に、本実施形態における入力データと代表候補データとの距離の計算方法について説明する。以下の式（３）は第ｉ番目の入力データと第ｊ番目の代表候補データとの間の距離Ｄ（Ｘ_i，Ｃ_j）を表す距離関数である。
Ｄ（Ｘ_i，Ｃ_j）＝Ｗ（Ｘ_i，ｔ）×Ｄ_s（Ｘ_i，Ｃ_j）＋Ｄ_c（Ｘ_i，Ｃ_j）・・・（３） Steps S106 to S108 are steps for searching the nearest representative data of the input data. Here, the nearest representative data is representative candidate data having the shortest distance from the input data, that is, the nearest representative data in the distance space. Before describing the processing contents in each step, a method for calculating the distance between input data and representative candidate data in the present embodiment will be described. The following equation (3) is a distance function representing a distance D (X _i , C _j ) between the i-th input data and the j-th representative candidate data.
D (X _i , C _j ) = W (X _i , t) × D _s (X _i , C _j ) + D _c (X _i , C _j ) (3)

式（３）中のＤ_sは座標距離を表し、Ｄ_cは色距離を表す。また、Ｗ（Ｘ_i，ｔ）は第ｉ番目の入力データＸ_iの、第ｔ番目の繰り返しにおける座標距離Ｄ_sの重みである。重みＷ（Ｘ_i，ｔ）は、以下の式（４）に示すように、入力データＸ_iが前回の繰り返しで帰属していた領域Ｓ_j,t-1に対応する重みＷ_j,tとなる。ただし、第１番目の繰り返しでは、全てのＸ_iに対して初期値Ｗ₁を座標距離の重みとする。本実施形態では初期値Ｗ₁の値を１とする。 In the formula (3), D _s represents a coordinate distance, and D _c represents a color distance. W (X _i , t) is a weight of the coordinate distance D _s in the t-th iteration of the i-th input data X _i . As shown in the following formula (4), the weight W (X _i , t) is the weight W _{j, t} corresponding to the region S _{j, t−1} to which the input data X _i belonged in the previous iteration. Become. However, in the first iteration, the initial value W ₁ is used as the weight of the coordinate distance for all X _i . In the present embodiment, the initial value W ₁ is 1.

座標距離の重みを大きくした場合、色距離よりも座標距離を重視して領域分割するため、代表データと入力データとで色が多少異なる場合であっても、代表データからの座標位置が近い入力データは同じ領域に分割される傾向となる。一方、座標距離の重みを小さくした場合は、座標距離よりも色距離を重視して領域分割するため、代表データからの座標位置が多少離れていたとしても、代表データと色が似ている入力データは同じ領域に分割される傾向となる。したがって、座標距離の重みを大きくするほどフラグメンテーションは軽減されるが、物体の境界で領域分割され難くなる傾向となる。一方、座標距離の重みを小さくするほど物体の境界で領域分割され易くなるが、フラグメンテーションが増加する傾向となる。 When the weight of the coordinate distance is increased, the area is divided with more importance on the coordinate distance than the color distance, so even if the color is slightly different between the representative data and the input data, the input of the coordinate position from the representative data is close. Data tends to be divided into the same areas. On the other hand, if the weight of the coordinate distance is reduced, the area is divided by placing more emphasis on the color distance than the coordinate distance, so even if the coordinate position from the representative data is slightly separated, the input is similar in color to the representative data. Data tends to be divided into the same areas. Therefore, the fragmentation is reduced as the weight of the coordinate distance is increased, but the region is more difficult to be divided at the boundary of the object. On the other hand, the smaller the weight of the coordinate distance, the easier the region is divided at the boundary of the object, but the fragmentation tends to increase.

式（３）に示すように、入力データと代表候補データとの間の距離Ｄは、座標距離Ｄ_sと色距離Ｄ_cとの重み付き総和により算出される。以下、このように複数の部分距離から算出される距離のことを総合距離と呼ぶ。以下の式（５）及び式（６）は、それぞれ座標距離Ｄ_s、色距離Ｄ_cを算出する計算式である。本実施形態では、ユークリッド距離を使用するが、これに限る訳ではない。 As shown in equation (3), the distance D between the input data and the representative candidate data is calculated by the weighted sum of the coordinate distance D _s and the color distance D _c. Hereinafter, such a distance calculated from a plurality of partial distances is referred to as a total distance. The following formulas (5) and (6) are calculation formulas for calculating the coordinate distance D _s and the color distance D _c , respectively. In the present embodiment, the Euclidean distance is used, but the present invention is not limited to this.

ステップＳ１０６においては、代表候補データのループを開始する。代表候補データＣ_jの順番ｊを制御し、全ての代表候補データを探索する。そして、ステップＳ１０７において、式（３）に従って第ｉ番目の入力データと第ｊ番目の代表候補データとの間の総合距離を算出する。ステップＳ１０８においては、全ての代表候補データとの総合距離の算出が完了したかどうかを判定する。この判定の結果、全ての代表候補データとの総合距離を算出した場合は、ステップＳ１０９に進み、代表候補データのループを終了する。一方、まだ総合距離を算出していない代表候補データがある場合は、ステップＳ１０７に戻り、次の代表候補データとの総合距離を計算する。 In step S106, a representative candidate data loop is started. The order j of the representative candidate data C _j is controlled to search for all representative candidate data. In step S107, the total distance between the i-th input data and the j-th representative candidate data is calculated according to equation (3). In step S108, it is determined whether or not the calculation of the total distance with all representative candidate data has been completed. As a result of the determination, if the total distance with all the representative candidate data is calculated, the process proceeds to step S109 to end the representative candidate data loop. On the other hand, if there is representative candidate data for which the total distance has not yet been calculated, the process returns to step S107 to calculate the total distance with the next representative candidate data.

続いてステップＳ１０９においては、ステップＳ１０６〜Ｓ１０８のループで算出された総合距離の中から距離が最短であった代表候補データ（最近傍代表データ）に入力データを帰属（関連付け）させる。そして、ステップＳ１１０において、全ての入力データを処理したかどうかを判定する。この判定の結果、全ての入力データを処理した場合は、ステップＳ１１１に進み、入力データのループを終了する。一方、まだ処理していない入力データがある場合はステップＳ１０４に戻り、次の入力データの処理を開始する。 Subsequently, in step S109, the input data is attributed (associated) to the representative candidate data (nearest neighbor representative data) having the shortest distance among the total distances calculated in the loop of steps S106 to S108. In step S110, it is determined whether all input data have been processed. If all the input data has been processed as a result of this determination, the process proceeds to step S111, and the input data loop is terminated. On the other hand, if there is input data that has not been processed yet, the process returns to step S104 to start processing the next input data.

ステップＳ１１１においては、各代表データに対して、帰属させた入力データのベクトルの平均ベクトルを計算する。そして、その平均ベクトルを該当代表データの新しいベクトルＣ_jとし、代表データを更新する。次に、ステップＳ１１２おいて、更新後の各領域の評価値を算出する。評価値の算出方法については後述する。 In step S111, an average vector of the assigned input data vectors is calculated for each representative data. Then, the average vector is set as a new vector C _j of the corresponding representative data, and the representative data is updated. Next, in step S112, the updated evaluation value of each area is calculated. A method for calculating the evaluation value will be described later.

続いてステップＳ１１３において、ステップＳ１０８において入力データと代表候補データとの間の総合距離を算出する際に使用する重みＷ（Ｘ_i，ｔ）を制御する。この処理では、ステップＳ１１２で算出した評価値が、ステップＳ１０２で設定した目標値に近づくように重みを制御する。この重みの制御については後述する。 Subsequently, in step S113, the weight W (X _i , t) used when calculating the total distance between the input data and the representative candidate data in step S108 is controlled. In this process, the weight is controlled so that the evaluation value calculated in step S112 approaches the target value set in step S102. This weight control will be described later.

次に、ステップＳ１１４において、最大繰り返し回数を超えるかどうか、又はクラスタリングの結果が収束したかどうかを判定する。この判定の結果、何れかの条件が成立する場合は、繰り返し計算を終了する。一方、いずれの条件も成立しない場合はステップＳ１０４に戻り、次の繰り返し計算を開始する。以上の処理が完了すると、ＣＰＵ４０４はクラスタリング結果（領域分割結果）を表示部４０３などに出力する。 Next, in step S114, it is determined whether or not the maximum number of repetitions has been exceeded, or whether or not the clustering result has converged. If any of the conditions is satisfied as a result of this determination, the iterative calculation is terminated. On the other hand, if none of the conditions is satisfied, the process returns to step S104 to start the next iterative calculation. When the above processing is completed, the CPU 404 outputs the clustering result (region division result) to the display unit 403 or the like.

＜領域の評価値計算＞
次に、ステップＳ１１２における評価値の算出について説明する。本実施形態では、各領域の評価値として領域形状の複雑度を用いる。各領域の評価値はステップＳ１１３の座標距離の重み制御に用いられる。また、各々の領域は、代表データに帰属する入力データの集合によって構成される。以下の式（７）から式（１１）に評価値の計算式を示す。 <Evaluation value calculation of area>
Next, calculation of the evaluation value in step S112 will be described. In the present embodiment, the complexity of the area shape is used as the evaluation value for each area. The evaluation value of each area is used for the weight control of the coordinate distance in step S113. Each area is constituted by a set of input data belonging to the representative data. The following formulas (7) to (11) show evaluation value calculation formulas.

領域Ｓ_j,tの評価値Ｅ（Ｓ_j,t）は、式（７）に示すように、領域Ｓ_j,tの面積Ｅ_s（Ｓ_j,t）と、領域Ｓ_j,tの輪郭長Ｅ_c（Ｓ_j,t）とを用いて計算される。面積Ｅ_s（Ｓ_j,t）は、式（８）に示すように領域Ｓ_j,tの要素数で定義される。輪郭長Ｅ_c（Ｓ_j,t）は、式（９）に示すように、領域Ｓ_j,tの要素である入力データＸ_iのうち、領域境界と判定された入力データの総数である。 Area S _j, evaluation value of _t E (S _{j, t),} as shown in Equation (7), region S _j, the area of _t E _s (S _{j, t)} and, region S _j, the contour of _t It is calculated using the length E _c (S _{j, t} ). The area E _s (S _{j, t} ) is defined by the number of elements in the region S _{j, t} as shown in Expression (8). The contour length E _c (S _{j, t} ) is the total number of input data determined as a region boundary among the input data X _i that is an element of the region S _{j, t} as shown in the equation (9).

Ｃｏｎｔｏｕｒ（Ｘ_i）は入力データＸ_iが領域境界であるか否かを判定する関数であり、式（１０）に示すように、入力データＸ_iの周囲８近傍の入力データが帰属する領域番号を参照して計算される。周囲８近傍の入力データが、全て注目入力データＸ_iと同じ領域に帰属していた場合、注目入力データを非領域境界と判定し、Ｃｏｎｔｏｕｒ（Ｘ_i）は０となる。周囲８近傍の入力データの中に、１つでも注目入力データＸ_iと異なる領域に帰属している入力データがある場合は、注目入力データを領域境界と判定し、Ｃｏｎｔｏｕｒ（Ｘ_i）は１となる。Ｌ（ｘ，ｙ）は座標位置（ｘ，ｙ）の入力データが属する領域番号を示し、δ（ｘ）はｘ＝０の時に１、それ以外のｘについては０となる関数である。 “Contour (X _i )” is a function for determining whether or not the input data X _i is a region boundary. As shown in the equation (10), the region number to which the input data in the vicinity of the input data X _i around 8 belongs. Calculated with reference to If all the input data in the vicinity of the surrounding 8 belong to the same region as the target input data X _i , the target input data is determined to be a non-region boundary, and Contour (X _i ) becomes zero. If there is at least one input data belonging to a region different from the target input data X _{i in the} vicinity of the surrounding eight input data, the target input data is determined to be a region boundary, and Control (X _i ) is 1. It becomes. L (x, y) represents a region number to which the input data at the coordinate position (x, y) belongs, and δ (x) is a function that is 1 when x = 0 and 0 for other x.

評価値Ｅ（Ｓ_j,t）は領域の複雑度を示している。例えば図３に示すように、領域３０１から領域３０３、領域３０４から領域３０７、領域３０８から領域３１５、および領域３１６から領域３１７ではフラグメンテーションが発生している。これらの領域のように、凹凸が大きい場合や、領域Ｓ_j,tが座標空間上で連続していない場合は複雑度が大きくなる。領域Ｓ_j,tに含まれる全ての入力データが互いに座標空間上で不連続である場合は、領域の輪郭長は面積と等しくなり、複雑度は１となる。一方、領域３１８のように凹凸が少なく、全ての入力データが座標空間上で隣接しているような場合は複雑度が小さくなる。以上のようにフラグメンテーションが生じている領域は複雑度が大きくなる傾向があり、フラグメンテーションが発生していない領域は複雑度が小さくなる傾向がある。 The evaluation value E (S _{j, t} ) indicates the complexity of the area. For example, as shown in FIG. 3, fragmentation occurs in the region 301 to the region 303, the region 304 to the region 307, the region 308 to the region 315, and the region 316 to the region 317. As in these regions, when the unevenness is large, or when the regions S _{j, t} are not continuous in the coordinate space, the complexity increases. When all input data included in the region S _{j, t} are discontinuous in the coordinate space, the contour length of the region is equal to the area, and the complexity is 1. On the other hand, when there are few unevenness | corrugations like the area | region 318 and all the input data are adjacent on coordinate space, complexity becomes small. As described above, an area where fragmentation occurs tends to increase in complexity, and an area where fragmentation does not occur tends to decrease in complexity.

＜重みの目標値設定＞
ステップＳ１０２では、ステップＳ１１２で計算する評価値の目標値を設定する。本実施形態では、評価値Ｅ（Ｓ_j,t）の目標値Ｔを以下の式（１２）により計算する。目標値Ｔは式（１２）に示すように、領域形状が横Ｌ_x、縦Ｌ_yの矩形形状となった場合の評価値に対して、２０％の複雑さを考慮した値である。横Ｌ_x及び縦Ｌ_yの長さは、それぞれ式（１３）、式（１４）に示すように、幅Ｉ_x、高さＩ_yの画像を横方向にＮ_x個、縦方向にＮ_y個の均一な矩形領域で分割した際の辺の長さを示す。 <Set weight target value>
In step S102, the target value of the evaluation value calculated in step S112 is set. In the present embodiment, the target value T of the evaluation value E (S _{j, t} ) is calculated by the following equation (12). Target value T is as shown in equation (12), the evaluation value when the area shape becomes horizontal L _x, a rectangular shape of vertical L _y, is a value in consideration of the 20% of the complexity. The lengths of the horizontal L _x and the vertical L _y are N _x images of width I _x and height I _{y in} the horizontal direction and N _{y in} the vertical direction, as shown in equations (13) and (14), respectively. The length of the side when dividing into the uniform rectangular area is shown.

＜重み制御＞
ステップＳ１１３では、ステップＳ１０２で設定した目標値Ｔと、ステップＳ１１２で計算した評価値Ｅ（Ｓ_j,t）とを用いて、第ｔ＋１番目の繰り返しにおける座標距離の重みＷ_j,t+1を計算する。以下の式（１５）は重みの制御式を示している。ｋは制御の強さを示す制御係数であり、本実施形態ではｋの値を０．１と設定する。第１番目の繰り返しでは重みの初期値Ｗ₁に対して制御を行い、第２番目以降の繰り返しでは現在の重みＷ_j,tに対して制御を行う。 <Weight control>
In step S113, using the target value T set in step S102 and the evaluation value E (S _{j, t} ) calculated in step S112, the weight W _{j, t + 1} of the coordinate distance in the (t + 1) th iteration is calculated. calculate. The following equation (15) shows a weight control equation. k is a control coefficient indicating the strength of control. In this embodiment, the value of k is set to 0.1. In the first th iteration and controls the initial value W ₁ of the weight, in the second and subsequent repetitive performs control for the current weight W _{j, t.}

式（１５）の制御により、領域の形状が目標よりも複雑である領域は、座標距離の重みが増加するため、次の領域分割では領域形状が単純になり、評価値が小さくなることが期待できる。領域の形状が目標よりも単純である領域は、座標距離の重みが減少するため、次の繰り返しでは領域形状が複雑になり、評価値が大きくなることが期待できる。 With the control of Expression (15), the weight of the coordinate distance is increased in the region where the shape of the region is more complex than the target, so that the region shape becomes simple in the next region division, and the evaluation value is expected to be small. it can. In the region where the shape of the region is simpler than the target, the weight of the coordinate distance decreases, so that the region shape becomes complicated and the evaluation value can be expected to increase in the next iteration.

以上説明したように、本実施形態の方式によれば、各領域の形状の複雑度が目標値に近づくように座標距離の重みを制御するようにしたので、フラグメンテーションを軽減することが可能である。 As described above, according to the method of the present embodiment, since the weight of the coordinate distance is controlled so that the complexity of the shape of each region approaches the target value, fragmentation can be reduced. .

（その他の実施形態）
前述した実施形態では、座標距離の重みを領域単位で制御する例を示したが、本発明はこれに限定するものではなく、画像のブロック単位で重みを制御することも可能である。以下の式（１６）は、注目入力データＸ_iの重みＷ（Ｘ_i，ｔ）を示している。 (Other embodiments)
In the embodiment described above, an example in which the weight of the coordinate distance is controlled in units of regions has been described, but the present invention is not limited to this, and the weight can be controlled in units of blocks of the image. Equation (16) below shows the weight W (X _i , t) of the input data of interest X _i .

式（１６）中のＢ_jは図２に示すブロックであり、入力データの集合である。例えば、注目入力データ２０６を含むブロックはブロック２０４である。図２に示すように、各ブロックには対応する代表データが存在する。そこで、第ｊ番目のブロックＢ_jに対応する重みＷ_j,tは、式（１５）に示すように、第ｊ番目の領域Ｓ_j,tを用いて制御する。前述の実施形態のように領域単位で重みを制御する場合は、入力データＸ_iが前回の繰り返しで帰属した領域番号を参照する必要があった。これに対してブロック単位で重みを制御する場合は、領域番号の参照が不要であるため、処理コストがより小さいという利点がある。 B _j in equation (16) is the block shown in FIG. 2 and is a set of input data. For example, the block including the attention input data 206 is the block 204. As shown in FIG. 2, each block has corresponding representative data. Therefore, the weight W _{j, t} corresponding to the j-th block B _j is controlled using the j-th region S _{j, t} as shown in Expression (15). When the weight is controlled in units of regions as in the above-described embodiment, it is necessary to refer to the region number to which the input data X _i belonged in the previous iteration. On the other hand, when the weight is controlled in units of blocks, there is an advantage that the processing cost is smaller because it is not necessary to refer to the area number.

また、以下の式（１７）を用いて画像単位で重みを設定することも可能である。Ｗ_tは第ｔ番目の繰り返しにおける重みである。Ｗ_tは以下の式（１８）に示すように各領域の評価値平均Ｅ（ｔ）を用いて求めることができる。評価値平均Ｅ（ｔ）は、以下の式（１９）により計算される。画像単位で重みを制御する場合は制御する重みが１つであるため、ブロック単位で重みを制御するよりもさらに処理コストが小さいという利点がある。 It is also possible to set a weight for each image using the following equation (17). W _t is a weight in the t-th iteration. W _t can be obtained by using the evaluation value average E (t) of each region as shown in the following formula (18). The evaluation value average E (t) is calculated by the following equation (19). When the weight is controlled in units of images, there is an advantage that the processing cost is lower than that in the case of controlling the weights in units of blocks because the weight to be controlled is one.

また、前述した実施形態では評価関数として、式（７）から式（１１）に示した領域形状の複雑度を用いたが、本発明はこれに限定するものではない。評価関数には領域分割結果を評価する任意の関数を適用することが可能である。例えば、領域の色特徴の統計量である分散値を評価値に使用し、各領域の色特徴の分散値が同程度になるように領域分割することも可能である。 In the above-described embodiment, the complexity of the region shape shown in Expression (7) to Expression (11) is used as the evaluation function, but the present invention is not limited to this. An arbitrary function for evaluating the region division result can be applied to the evaluation function. For example, it is also possible to divide the region so that the variance value of the color feature of each region is approximately the same, using a variance value that is a statistic of the color feature of the region as the evaluation value.

また、前述した実施形態では一つの評価関数、目標値によって一つの重みを制御したが、本発明はこれに限定するものではない。複数の評価関数、目標値によって複数の重みを制御することも可能である。 In the above-described embodiment, one weight is controlled by one evaluation function and target value, but the present invention is not limited to this. It is also possible to control a plurality of weights by a plurality of evaluation functions and target values.

また、前述した実施形態では各領域の評価値の目標値の算出方法として、式（１２）から式（１４）を用いたが、本発明はこれに限定するものではない。目標値は、評価関数が取りうる値域の中から任意の値を設定することも可能である。 In the above-described embodiment, Equations (12) to (14) are used as the method for calculating the target value of the evaluation value of each region, but the present invention is not limited to this. As the target value, an arbitrary value can be set from a range that the evaluation function can take.

また、前述した実施形態の処理による領域分割結果は、重みの初期値Ｗ₁に依存しているため、初期値の設定によっては良い領域分割結果が得られない可能性がある。そこで、重みの初期値を変えながら、図１のステップＳ１０１からＳ１１４の処理を複数回実施し、最も良い領域分割結果を最終的な出力として選択することも可能である。 In addition, since the region division result by the processing of the above-described embodiment depends on the initial value W _{1 of the} weight, a good region division result may not be obtained depending on the setting of the initial value. Therefore, it is possible to perform the processing of steps S101 to S114 in FIG. 1 a plurality of times while changing the initial value of the weight, and select the best region division result as the final output.

また、前述した実施形態では、重みを制御する際の制御係数ｋに定数値を設定したが、本発明はこれに限定するものではない。制御係数ｋを以下の式（２０）に示すような繰り返し数ｔによる単調減少関数ｋ（ｔ）として定義してもよい。なお、ｅは自然対数の底を示す。式（２０）は繰り返し回数ｔによる単調減少関数であるため、ステップＳ１０３の繰り返しの中で重みが振動することなく収束するようになる。 In the embodiment described above, a constant value is set for the control coefficient k when controlling the weight, but the present invention is not limited to this. The control coefficient k may be defined as a monotonically decreasing function k (t) with a repetition number t as shown in the following equation (20). Note that e indicates the base of the natural logarithm. Since equation (20) is a monotonously decreasing function depending on the number of repetitions t, the weights converge without oscillating during the repetition of step S103.

また、重みの初期値による影響を軽減するため、重みの制御式に外乱項Ｒ（ｔ）を加え、以下の式（２１）及び式（２２）を用いて重みを制御することも可能である。なお、ｒａｎｄ（−１，１）は値域［−１，１］の乱数であり、外乱項Ｒ（ｔ）は繰り返し回数ｔによって値域幅が単調減少する項である。式（２１）及び式（２２）を用いた場合には、ステップＳ１０３の繰り返しの初めの間は、評価値及び目標値に影響を受けない強い外乱によって重みが変化するため、初期値の影響を軽減することが可能である。さらに、制御係数ｋに式（２０）のような単調減少関数を用いる場合は、外乱項の収束速度を制御係数ｋの収束速度よりも大きく設定することにより、重みを安定して収束させることが可能である。 In order to reduce the influence of the initial value of the weight, the disturbance term R (t) can be added to the weight control expression, and the weight can be controlled using the following expressions (21) and (22). . Note that rand (−1, 1) is a random number in the range [−1, 1], and the disturbance term R (t) is a term in which the range width monotonously decreases with the number of repetitions t. When Expression (21) and Expression (22) are used, during the beginning of the repetition of Step S103, the weight changes due to strong disturbance that is not affected by the evaluation value and the target value. It can be reduced. Furthermore, when a monotonically decreasing function such as Equation (20) is used for the control coefficient k, the weight can be converged stably by setting the convergence speed of the disturbance term larger than the convergence speed of the control coefficient k. Is possible.

本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 The present invention supplies a program that realizes one or more functions of the above-described embodiments to a system or apparatus via a network or a storage medium, and one or more processors in a computer of the system or apparatus read and execute the program This process can be realized. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

４０１データ保存部
４０４ＣＰＵ 401 Data storage unit 404 CPU

Claims

Representative data setting means for setting representative data for each of a plurality of regions in the image;
An associating means for associating each pixel in the image with any representative data in the plurality of regions based on a coordinate and color distance with the representative data;
Calculating means for calculating an evaluation value of an area composed of a set of pixels associated by the associating means;
Target value setting means for setting a target value of the evaluation value;
Weight control means for controlling the weight between the coordinates and the color so that the evaluation value approaches the target value;
The associating means calculates a distance related to the coordinates and color according to the weight controlled by the weight control means, and based on the calculated distance, each pixel in the image is set as any one of the representative data. An image processing apparatus characterized by associating.

The image processing apparatus according to claim 1, wherein the representative data setting unit updates the respective representative data based on the associated set of pixels.

3. The image processing apparatus according to claim 1, wherein the weight control unit controls a weight between the coordinates and the color in units of an area formed by the set of associated pixels. 4. .

The image processing apparatus according to claim 1, wherein the weight control unit controls a weight between the coordinates and the color in units of blocks.

The image processing apparatus according to claim 1, wherein the weight control unit controls a weight between the coordinates and the color in units of images.

The image processing apparatus according to claim 1, wherein the calculation unit calculates, as the evaluation value, a complexity of a shape of an area configured by the associated set of pixels. .

The image processing according to claim 1, wherein the calculation unit calculates a statistic of a color feature in an area configured by the set of associated pixels as the evaluation value. apparatus.

The image processing according to any one of claims 1 to 7, further comprising output means for outputting a result of an area formed by the associated set of pixels when a predetermined condition is satisfied. apparatus.

A representative data setting step for setting representative data for each of a plurality of regions in the image;
An association step of associating each pixel in the image with any representative data in the plurality of regions based on a coordinate and color distance with the representative data;
A calculation step of calculating an evaluation value of an area configured by a set of pixels associated in the association step;
A target value setting step for setting a target value of the evaluation value;
A weight control step of controlling a weight between the coordinates and the color so that the evaluation value approaches the target value,
In the association step, a distance related to the coordinates and color is calculated according to the weight controlled in the weight control step, and each pixel in the image is set to any one of the representative data based on the calculated distance. An image processing method characterized by associating with.

A representative data setting step for setting representative data for each of a plurality of regions in the image;
An association step of associating each pixel in the image with any representative data in the plurality of regions based on a coordinate and color distance with the representative data;
A calculation step of calculating an evaluation value of an area configured by a set of pixels associated in the association step;
A target value setting step for setting a target value of the evaluation value;
Causing a computer to execute a weight control step of controlling a weight between the coordinates and the color so that the evaluation value approaches the target value;
In the association step, a distance related to the coordinates and color is calculated according to the weight controlled in the weight control step, and each pixel in the image is set to any one of the representative data based on the calculated distance. A program characterized by associating with.