JP7452149B2

JP7452149B2 - Learning device and computer program for block noise detection

Info

Publication number: JP7452149B2
Application number: JP2020050353A
Authority: JP
Inventors: 直希土屋; 佳寛真鍋; 進齋藤
Original assignee: Aisin Seiki Co Ltd; Aisin Corp
Current assignee: Aisin Corp
Priority date: 2020-03-20
Filing date: 2020-03-20
Publication date: 2024-03-19
Anticipated expiration: 2040-03-20
Also published as: JP2021149719A

Description

本発明は、画像に対して生じるブロックノイズ検出の為の学習装置及びコンピュータプログラムに関する。 The present invention relates to a learning device and a computer program for detecting block noise occurring in an image.

従来より、ナビゲーション装置、スマートフォン、パーソナルコンピュータ等の画像を表示する為の画像表示装置（液晶ディスプレイ、有機ＥＬディスプレイ等）を備えた各種機器では、ＣＰＵ等の演算手段によって生成された画像データを一旦メモリに書き込み、その後メモリから読み出した画像データをＧＰＵ等の画像処理を行う演算手段（ＣＰＵが行っても良い）によって画像表示装置に対して描画することにより画像の表示を行う。しかしながら、このような画像表示装置への画像の表示を行う際に、例えばメモリに対する画像データの書き込みと画像データの読み出しのタイミングが合わない等の理由により、メモリに書き込む画像データの生成に異常が生じる場合があり、その場合に画像表示装置に表示される画像にブロックノイズが生じることが知られている。 Conventionally, in various devices equipped with image display devices (liquid crystal displays, organic EL displays, etc.) for displaying images, such as navigation devices, smartphones, and personal computers, image data generated by a calculation means such as a CPU is temporarily processed. The image is displayed by writing the image data into the memory and then drawing the image data read out from the memory on the image display device by a calculation means (which may be performed by the CPU) that performs image processing such as a GPU. However, when displaying images on such an image display device, an abnormality may occur in the generation of image data to be written to the memory, for example, because the timing of writing the image data to the memory and reading the image data do not match. It is known that block noise may occur in the image displayed on the image display device.

上記ブロックノイズの発生を防止する為には、画像の描画を行う前に生成された画像データに対してブロックノイズを生じさせる異常があることを検出可能とする必要がある。そして、このような画像データの異常を描画前に検出する為には学習モデルの導入が有効である。例えば画像表示装置への画像表示に関する学習モデルとして特開２０１８－１３６５３７号公報には、表示部に実際に表示された映像に対応するデータを学習データ、表示部への表示を意図している理想的な映像に対応するデータを教師データとしてニューラルネットワークを用いた学習を行うことによって画素の階調を補正し、映像の品質を向上させる技術について開示されている。 In order to prevent the occurrence of block noise, it is necessary to be able to detect an abnormality that causes block noise in generated image data before drawing an image. In order to detect such abnormalities in image data before rendering, it is effective to introduce a learning model. For example, as a learning model for displaying images on an image display device, Japanese Patent Application Laid-Open No. 2018-136537 describes data corresponding to images actually displayed on the display as learning data, and data corresponding to the image actually displayed on the display as learning data, and data as learning data for the image that is intended to be displayed on the display. A technique has been disclosed that corrects the gradation of pixels and improves the quality of the video by performing learning using a neural network using data corresponding to a typical video as teacher data.

特開２０１２－１９２８７８号公報（第１５－１７頁）Japanese Patent Application Publication No. 2012-192878 (pages 15-17)

ここで、上記特許文献１の学習モデルでは教師データとして、表示部への表示を意図している理想的な映像に対応するデータを用いている。しかし、実際に学習モデルを構築するに際して、学習モデルが表示部に実際に表示された映像のデータを取得することは比較的容易な一方で、表示部に表示する前の理想的な映像（即ち表示される予定の映像）のデータを取得することは容易でないことが多い。従って、上記特許文献１の学習モデルは適用できるケースが限られ、実用上問題があった。 Here, in the learning model of Patent Document 1, data corresponding to an ideal video intended to be displayed on the display unit is used as the teacher data. However, when actually constructing a learning model, it is relatively easy for the learning model to acquire data of the image actually displayed on the display, while it is relatively easy to obtain the data of the image actually displayed on the display. In many cases, it is not easy to obtain data for images (videos scheduled to be displayed). Therefore, the learning model of Patent Document 1 has limited cases where it can be applied, and has problems in practical use.

本発明は前記従来における問題点を解消するためになされたものであり、正常画像に対して疑似的にブロックノイズを生じさせた画像を生成して教師データとして用いることによって、教師データの取得が容易となり、広く適用することが可能なブロックノイズ検出の為の学習装置及びコンピュータプログラムを提供することを目的とする。 The present invention has been made to solve the above-mentioned problems in the conventional art, and it is possible to obtain training data by generating an image in which pseudo block noise is generated from a normal image and using it as training data. It is an object of the present invention to provide a learning device and a computer program for block noise detection that can be easily and widely applied.

前記目的を達成するため本発明に係るブロックノイズ検出の為の学習装置は、ブロックノイズの生じていない学習用正常画像を取得する正常画像取得手段と、ブロック形状の所定エリアを疑似的にノイズ領域に設定したブロックノイズの画像である疑似ブロックノイズ画像を生成する疑似ブロックノイズ画像生成手段と、前記学習用正常画像に対して前記疑似ブロックノイズ画像を重畳して合成することによって、前記学習用正常画像に対して擬似的にブロックノイズを生じさせた疑似異常画像を、ブロックノイズを検出する為の学習モデルの学習に用いる教師データとして生成する教師データ生成手段と、を有し、前記疑似ブロックノイズ画像生成手段は、ノイズ領域とするブロック形状のサイズ、位置、ノイズ領域の色及び透過率を夫々設定して前記疑似ブロックノイズ画像を生成する。
In order to achieve the above object, a learning device for detecting block noise according to the present invention includes a normal image acquisition means for acquiring a normal image for learning in which block noise does not occur, and a predetermined area of a block shape as a pseudo noise region. a pseudo block noise image generating means that generates a pseudo block noise image which is an image of block noise set to a teacher data generating means for generating a pseudo abnormal image in which block noise is generated in the image as teacher data used for learning a learning model for detecting block noise; The image generating means generates the pseudo block noise image by setting the size and position of the block shape to be the noise area, the color and transmittance of the noise area, respectively .

また、本発明に係るコンピュータプログラムは、ブロックノイズ検出の為の学習に係るプログラムである。具体的には、コンピュータを、ブロックノイズの生じていない学習用正常画像を取得する正常画像取得手段と、ブロック形状の所定エリアを疑似的にノイズ領域に設定したブロックノイズの画像である疑似ブロックノイズ画像を生成する疑似ブロックノイズ画像生成手段と、前記学習用正常画像に対して前記疑似ブロックノイズ画像を重畳して合成することによって、前記学習用正常画像に対して擬似的にブロックノイズを生じさせた疑似異常画像を、ブロックノイズを検出する為の学習モデルの学習に用いる教師データとして生成する教師データ生成手段と、して機能させる為のコンピュータプログラムであって、前記疑似ブロックノイズ画像生成手段は、ノイズ領域とするブロック形状のサイズ、位置、ノイズ領域の色及び透過率を夫々設定して前記疑似ブロックノイズ画像を生成する。
Further, a computer program according to the present invention is a program related to learning for block noise detection. Specifically, the computer is equipped with a normal image acquisition means that acquires a normal image for learning without block noise, and a pseudo block noise image that is a block noise image in which a predetermined block-shaped area is set as a pseudo noise area. pseudo block noise image generation means for generating an image; and superimposing and compositing the pseudo block noise image on the normal learning image to generate pseudo block noise in the normal learning image. A computer program for operating a computer program as a teacher data generation means for generating pseudo abnormal images as teacher data used for learning a learning model for detecting block noise, the pseudo block noise image generation means comprising: , the size and position of the block shape to be the noise region, the color and transmittance of the noise region are set respectively, and the pseudo block noise image is generated .

前記構成を有する本発明に係るブロックノイズ検出の為の学習装置及びコンピュータプログラムによれば、正常画像に対して疑似的にブロックノイズを生じさせた画像を生成して教師データとして用い、ブロックノイズを検出する為の学習を行うので、従来と比べて教師データの取得が容易となる。その結果、画像表示装置に表示する画像の品質を向上させるという点において広く適用可能な学習モデルを構築することが可能となる。また、擬似的にブロックノイズを生じさせた疑似異常画像を生成する場合であっても、実際にブロックノイズが生じている異常画像を再現して教師データとして用いることが可能となる。 According to the learning device and computer program for block noise detection according to the present invention having the above-described configuration, an image in which block noise is generated in a pseudo normal image is generated and used as training data, and the block noise is detected. Since learning is performed for detection, it is easier to obtain training data than in the past. As a result, it becomes possible to construct a learning model that is widely applicable in terms of improving the quality of images displayed on image display devices. Furthermore, even in the case of generating a pseudo abnormal image in which block noise is generated in a pseudo manner, it is possible to reproduce an abnormal image in which block noise has actually occurred and use it as training data.

本実施形態に係る学習システムを示した概略構成図である。1 is a schematic configuration diagram showing a learning system according to this embodiment. 本実施形態に係る学習装置を示したブロック図である。FIG. 1 is a block diagram showing a learning device according to the present embodiment. 本実施形態に係るナビゲーション装置を示したブロック図である。FIG. 1 is a block diagram showing a navigation device according to the present embodiment. 本実施形態に係る学習処理プログラムのフローチャートである。3 is a flowchart of a learning processing program according to the present embodiment. 学習用画像において重複画像の除外を行う態様を示した図である。FIG. 6 is a diagram showing a mode in which duplicate images are excluded from learning images. 疑似ブロックノイズ画像の一例を示した図である。FIG. 2 is a diagram showing an example of a pseudo block noise image. 疑似異常画像の一例を示した図である。FIG. 3 is a diagram showing an example of a pseudo-abnormal image. ブロックノイズ領域の面積値の分布を示した図である。FIG. 3 is a diagram showing a distribution of area values of a block noise region. 学習結果の評価に伴う再学習の態様を示した図である。FIG. 7 is a diagram showing a mode of relearning accompanying evaluation of learning results. 本実施形態に係る画像表示処理プログラムのフローチャートである。3 is a flowchart of an image display processing program according to the present embodiment.

以下、本発明に係る学習装置を具体化した一実施形態について図面を参照しつつ詳細に説明する。先ず、本実施形態に係る学習装置１を含む学習システム２の概略構成について図１を用いて説明する。図１は本実施形態に係る学習システム２を示した概略構成図である。 DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of a learning device according to the present invention will be described in detail below with reference to the drawings. First, a schematic configuration of a learning system 2 including a learning device 1 according to the present embodiment will be described using FIG. 1. FIG. 1 is a schematic configuration diagram showing a learning system 2 according to this embodiment.

図１に示すように、本実施形態に係る学習システム２は、画像に生じるブロックノイズ検出の為の学習モデルの学習を行う学習装置１と、各種の画像を表示する手段を備えるとともに学習システム２によって表示画像のブロックノイズの低減を図る対象となるナビゲーション装置３と、を基本的に有する。尚、本実施形態では特に表示画像のブロックノイズの低減を図る対象として車載器であるナビゲーション装置３を例に挙げて説明しているが、画像を表示する手段を備える装置であればナビゲーション装置以外であっても良い。例えば、スマートフォン、タブレット端末、パーソナルコンピュータ等をナビゲーション装置３の代わりとしても良い。 As shown in FIG. 1, the learning system 2 according to the present embodiment includes a learning device 1 for learning a learning model for detecting block noise occurring in images, and means for displaying various images. The navigation device 3 basically includes a navigation device 3 that is a target for reducing block noise in a displayed image. In this embodiment, the navigation device 3, which is an on-vehicle device, is used as an example of a target for reducing block noise in a displayed image, but any device other than the navigation device may be used as long as it is equipped with means for displaying an image. It may be. For example, a smartphone, a tablet terminal, a personal computer, etc. may be used instead of the navigation device 3.

また、本実施形態では学習を行う主体である学習装置１は画像の表示を行うナビゲーション装置３と別体に設けているが、学習装置１はナビゲーション装置３の一部（即ち学習機能を有するナビゲーション装置３）としても良い。その場合には、学習機能を有するナビゲーション装置３が本発明に係る学習装置に該当する。 Further, in this embodiment, the learning device 1 which is the main body that performs learning is provided separately from the navigation device 3 which displays images, but the learning device 1 is a part of the navigation device 3 (i.e., a navigation device with a learning function). It may also be used as device 3). In that case, the navigation device 3 having a learning function corresponds to the learning device according to the present invention.

ここで、学習装置１は、ナビゲーション装置３に表示される画像に生じるブロックノイズを検出する為の学習モデルの学習を行う装置である。また、学習装置１は、学習モデルの学習や学習結果を評価する為のテストに用いる各種画像が記憶されたＤＢを備え、ＤＢに格納された画像を用いて後述のように学習モデルの学習や学習結果の評価を行う。特に本実施形態ではナビゲーション装置３において表示される画像を対象としたブロックノイズ検出の為の学習を行うので、ＤＢにはナビゲーション装置３において表示対象となる画像（例えば地図画像、移動案内画像、施設案内画像等）が格納される。また、最終的に学習装置１において学習した学習モデルをナビゲーション装置３へと組み込むことによって、学習結果をナビゲーション装置３へと反映する。 Here, the learning device 1 is a device that performs learning of a learning model for detecting block noise occurring in images displayed on the navigation device 3. The learning device 1 also includes a DB in which various images used for learning the learning model and tests for evaluating the learning results are stored, and uses the images stored in the DB to learn the learning model as described below. Evaluate learning results. In particular, in this embodiment, since learning for block noise detection is performed for images displayed on the navigation device 3, the DB contains images to be displayed on the navigation device 3 (for example, map images, movement guide images, facility images, etc.). guide images, etc.) are stored. Further, by finally incorporating the learning model learned in the learning device 1 into the navigation device 3, the learning results are reflected in the navigation device 3.

一方、ナビゲーション装置３は、車両４に搭載され、格納する地図データや通信により取得した地図データに基づいて自車位置周辺の地図を表示したり、地図画像上において車両の現在位置を表示したり、設定された案内経路に沿った移動案内を行う車載機である。特に本実施形態では、学習装置１により学習した学習モデルを組み込み、表示する画像のブロックノイズの発生を低減させる処理についても行う。尚、詳細については後述する。 On the other hand, the navigation device 3 is installed in the vehicle 4 and displays a map around the vehicle's location based on stored map data and map data acquired through communication, and displays the current location of the vehicle on a map image. , is an in-vehicle device that provides movement guidance along a set guidance route. In particular, in this embodiment, the learning model learned by the learning device 1 is incorporated, and processing for reducing the occurrence of block noise in images to be displayed is also performed. The details will be described later.

続いて、学習システム２に含まれる学習装置１のより具体的な構成について図２を用いて説明する。図２は本実施形態に係る学習装置１を示したブロック図である。 Next, a more specific configuration of the learning device 1 included in the learning system 2 will be described using FIG. 2. FIG. 2 is a block diagram showing the learning device 1 according to this embodiment.

学習装置１は、図２に示すように制御部１１と、制御部１１に接続された情報記録手段としての教師画像ＤＢ１２と、タッチパネルやキーボード等から構成される入力操作部１３と、を備える。 As shown in FIG. 2, the learning device 1 includes a control section 11, a teacher image DB 12 as an information recording means connected to the control section 11, and an input operation section 13 composed of a touch panel, a keyboard, and the like.

制御部１１は、学習装置１の全体の制御を行う制御ユニット（ＭＣＵ、ＭＰＵ等）であり、演算装置及び制御装置としてのＣＰＵ２１、並びにＣＰＵ２１が各種の演算処理を行うにあたってワーキングメモリとして使用されるＲＡＭ２２、制御用のプログラムのほか、後述の学習処理プログラム（図４参照）等が記録されたＲＯＭ２３、ＲＯＭ２３から読み出したプログラムを記憶するフラッシュメモリ２４等の内部記憶装置を備えている。尚、制御部１１は、処理アルゴリズムとしての各種手段を有する。例えば、正常画像取得手段は、ブロックノイズの生じていない学習用正常画像を取得する。疑似ブロックノイズ画像生成手段は、ブロック形状の所定エリアを疑似的にノイズ領域に設定したブロックノイズの画像である疑似ブロックノイズ画像を生成する。教師データ生成手段は、学習用正常画像に対して疑似ブロックノイズ画像を重畳して合成することによって、学習用正常画像に対して擬似的にブロックノイズを生じさせた疑似異常画像を、ブロックノイズを検出する為の学習モデルの学習に用いる教師データとして生成する。 The control unit 11 is a control unit (MCU, MPU, etc.) that controls the entire learning device 1, and is used as a CPU 21 as an arithmetic device and a control device, and as a working memory when the CPU 21 performs various arithmetic processes. In addition to a RAM 22 and a control program, it also includes an internal storage device such as a ROM 23 in which a learning processing program (see FIG. 4), which will be described later, is recorded, and a flash memory 24 that stores programs read from the ROM 23. Note that the control unit 11 has various means as processing algorithms. For example, the normal image acquisition means acquires a normal image for learning without block noise. The pseudo block noise image generation means generates a pseudo block noise image that is a block noise image in which a predetermined block-shaped area is pseudo-set as a noise region. The training data generation means superimposes and synthesizes the pseudo-block noise image on the normal learning image, thereby generating a pseudo-abnormal image in which block noise is generated in a pseudo-normal image for learning. It is generated as training data used for learning the learning model for detection.

また、教師画像ＤＢ１２は、学習モデルの学習や学習結果を評価する為のテストに用いる各種画像を記憶する為の記憶手段であり、例えば不揮発性のメモリやＨＤＤ等によって構成される。ここで、教師画像ＤＢ１２に格納される画像は大きく分けて３種類あり、学習に用いる学習用画像２５、学習結果を評価する為の一次テストに用いる一次テスト用画像２６、一次テスト後に行われる二次テストに用いる二次テスト用画像２７が存在する。教師画像ＤＢ１２に格納される画像数は例えば学習用画像２５が約９万枚、一次テスト用画像２６が約２万枚、二次テストに用いる二次テスト用画像２７が約７９万枚存在する。尚、学習用画像２５、一次テスト用画像２６、二次テスト用画像２７は一部が重複していても良い。 Further, the teacher image DB 12 is a storage means for storing various images used for learning of the learning model and tests for evaluating the learning results, and is configured by, for example, a non-volatile memory, an HDD, or the like. Here, the images stored in the teacher image DB 12 are roughly divided into three types: a learning image 25 used for learning, a primary test image 26 used for a primary test to evaluate learning results, and a secondary test image 26 used for a primary test to evaluate learning results. There is a secondary test image 27 to be used in the next test. The number of images stored in the teacher image DB 12 includes, for example, approximately 90,000 learning images 25, approximately 20,000 primary test images 26, and approximately 790,000 secondary test images 27 used in the secondary test. . Note that the learning image 25, the primary test image 26, and the secondary test image 27 may partially overlap.

ここで、本実施形態に係る学習装置１は、ナビゲーション装置３において表示される画像を対象としたブロックノイズを検出する為の学習モデルの学習を行うので、学習用画像２５、一次テスト用画像２６及び二次テスト用画像２７はいずれもナビゲーション装置３において表示対象となる画像（例えば地図画像、移動案内画像、施設案内画像等）となる。また、学習用画像２５は、過去にナビゲーション装置３において実際に表示された画像でも良いし、学習用に疑似的に生成した画像であっても良い。更に学習用画像２５は、基本的にブロックノイズが生じていない正常画像のみを含む。但し、後述のように学習時においては学習用画像２５の正常画像からブロックノイズが生じている異常画像が疑似的に生成され、正常画像と疑似的に生成された異常画像を教師画像として学習が行われる。 Here, since the learning device 1 according to the present embodiment learns a learning model for detecting block noise in images displayed on the navigation device 3, the learning image 25, the primary test image 26 and the secondary test image 27 are all images to be displayed on the navigation device 3 (for example, a map image, a movement guide image, a facility guide image, etc.). Further, the learning image 25 may be an image actually displayed on the navigation device 3 in the past, or may be an image pseudo-generated for learning. Furthermore, the learning images 25 basically include only normal images without block noise. However, as will be described later, during learning, an abnormal image with block noise is generated from the normal image of the learning image 25, and learning is performed using the normal image and the pseudo-generated abnormal image as a teacher image. It will be done.

一方、一次テスト用画像２６及び二次テスト用画像２７は、疑似的に生成した画像であっても良いが、より現実に近い状況を想定して学習効果を確認するために、過去にナビゲーション装置３において実際に表示された画像を用いるのが望ましい。一次テスト用画像２６及び二次テスト用画像２７には、ブロックノイズが生じていない正常画像と、ブロックノイズが生じている異常画像を予め含む。例えば正常画像５割、異常画像５割の比率とするがその比率は適宜変更可能とする。 On the other hand, the primary test image 26 and the secondary test image 27 may be pseudo-generated images, but in order to confirm the learning effect by assuming a more realistic situation, it is necessary to It is desirable to use the image actually displayed in step 3. The primary test image 26 and the secondary test image 27 include in advance a normal image without block noise and an abnormal image with block noise. For example, the ratio is set to 50% for normal images and 50% for abnormal images, but the ratio can be changed as appropriate.

また、入力操作部１３は、例えばキーボード、マウス、ディスプレイの前面に設けられたタッチパネル等によって構成され、操作者による各種操作を受け付け可能に構成されている。そして、制御部１１は、入力操作部１３を介して入力された操作に基づき、対応する各種の動作を実行すべく制御を行う。例えば本実施形態では後述のように疑似的に生成するブロックノイズのパラメータの設定を行う際や学習の目標値を設定する際などに操作される。 Further, the input operation unit 13 is configured to include, for example, a keyboard, a mouse, a touch panel provided on the front of the display, etc., and is configured to be able to accept various operations by an operator. Then, the control unit 11 performs control based on operations input via the input operation unit 13 to perform various corresponding operations. For example, in this embodiment, it is operated when setting the parameters of pseudo-generated block noise or when setting the target value for learning, as will be described later.

本実施形態に係る学習装置１において学習される学習モデルは、上述したようにナビゲーション装置３において表示される画像を対象としたブロックノイズ検出の為の学習モデルであるが、より具体的にはナビゲーション装置３において生成された画像データを画面上に表示する前に該画像データが異常画像（表示した際にブロックノイズが生じる画像）に該当するかを判定する学習モデルである。詳細については後述するがナビゲーション装置３は、ブロックノイズが生じると予測されるエリアの面積値が閾値以上である場合に異常画像であると判定し、特に学習モデルはその閾値について学習によって最適値を算出する。 The learning model learned by the learning device 1 according to the present embodiment is a learning model for block noise detection targeting images displayed on the navigation device 3 as described above, but more specifically, This is a learning model that determines whether the image data generated in the device 3 corresponds to an abnormal image (an image that causes block noise when displayed) before displaying the image data on the screen. Although the details will be described later, the navigation device 3 determines that the image is an abnormal image when the area value of the area where block noise is predicted to occur is equal to or greater than the threshold value, and in particular, the learning model calculates the optimal value by learning for the threshold value. calculate.

続いて、学習システム２に含まれるナビゲーション装置３の概略構成について図３を用いて説明する。図３は本実施形態に係るナビゲーション装置３を示したブロック図である。 Next, a schematic configuration of the navigation device 3 included in the learning system 2 will be described using FIG. 3. FIG. 3 is a block diagram showing the navigation device 3 according to this embodiment.

図３に示すように本実施形態に係るナビゲーション装置３は、ナビゲーション装置３が搭載された車両の現在位置を検出する現在位置検出部３１と、各種のデータが記録されたデータ記録部３２と、入力された情報に基づいて、各種の演算処理を行うナビゲーションＥＣＵ３３と、ユーザからの操作を受け付ける操作部３４と、ユーザに対して車両周辺の地図やナビゲーション装置３に設定された案内経路に関する情報等を表示する液晶ディスプレイ３５と、経路案内に関する音声ガイダンスや危険因子に対する警告等を出力するスピーカ３６と、記憶媒体であるＤＶＤを読み取るＤＶＤドライブ３７と、プローブセンタやＶＩＣＳ（登録商標：Vehicle Information and Communication System）センタ等の情報センタとの間で通信を行う通信モジュール３８と、を有している。また、ナビゲーション装置３は、ＣＡＮ等の車載ネットワークを介して、ナビゲーション装置３の搭載された車両に対して設置されたセンサや車両制御用のＥＣＵ等と接続されている。 As shown in FIG. 3, the navigation device 3 according to the present embodiment includes a current position detection unit 31 that detects the current position of the vehicle in which the navigation device 3 is mounted, a data recording unit 32 in which various data are recorded, A navigation ECU 33 that performs various arithmetic processing based on input information, an operation unit 34 that accepts operations from the user, and information regarding maps around the vehicle and guidance routes set in the navigation device 3 for the user. a liquid crystal display 35 that displays route guidance, a speaker 36 that outputs audio guidance regarding route guidance, warnings about risk factors, etc., a DVD drive 37 that reads a DVD as a storage medium, and a probe center and VICS (registered trademark: Vehicle Information and Communication). The communication module 38 communicates with an information center such as a system center. Further, the navigation device 3 is connected to sensors installed in the vehicle in which the navigation device 3 is mounted, an ECU for vehicle control, etc. via an in-vehicle network such as CAN.

以下に、ナビゲーション装置３が有する各構成要素について順に説明する。
現在位置検出部３１は、ＧＰＳ４１、車速センサ４２、ステアリングセンサ４３、ジャイロセンサ４４等からなり、現在の車両の位置、方位、車両の走行速度、現在時刻等を検出することが可能となっている。ここで、特に車速センサ４２は、車両の移動距離や車速を検出する為のセンサであり、車両の駆動輪の回転に応じてパルスを発生させ、パルス信号をナビゲーションＥＣＵ３３に出力する。そして、ナビゲーションＥＣＵ３３は発生するパルスを計数することにより駆動輪の回転速度や移動距離を算出する。尚、上記４種類のセンサをナビゲーション装置３が全て備える必要はなく、これらの内の１又は複数種類のセンサのみをナビゲーション装置３が備える構成としても良い。 Below, each component included in the navigation device 3 will be explained in order.
The current position detection unit 31 includes a GPS 41, a vehicle speed sensor 42, a steering sensor 43, a gyro sensor 44, etc., and is capable of detecting the current vehicle position, direction, vehicle speed, current time, etc. . Here, in particular, the vehicle speed sensor 42 is a sensor for detecting the travel distance and vehicle speed of the vehicle, generates pulses according to the rotation of the drive wheels of the vehicle, and outputs the pulse signal to the navigation ECU 33. Then, the navigation ECU 33 calculates the rotation speed and travel distance of the drive wheels by counting the generated pulses. Note that it is not necessary for the navigation device 3 to include all of the four types of sensors described above, and the navigation device 3 may be configured to include only one or more types of sensors.

また、データ記録部３２は、外部記憶装置及び記録媒体としてのハードディスク（図示せず）と、ハードディスクに記録された地図情報ＤＢ４５や学習情報ＤＢ４６や所定のプログラム等を読み出すとともにハードディスクに所定のデータを書き込む為のドライバである記録ヘッド（図示せず）とを備えている。尚、データ記録部３２としてはハードディスクの代わりにメモリーカードやＣＤやＤＶＤ等の光ディスクを有しても良い。また、地図情報ＤＢ４５は外部のサーバに格納させ、ナビゲーション装置３が通信により取得しても良い。 The data recording unit 32 also has a hard disk (not shown) as an external storage device and a recording medium, and reads map information DB 45, learning information DB 46, predetermined programs, etc. recorded on the hard disk, and also writes predetermined data to the hard disk. It is equipped with a recording head (not shown) which is a driver for writing. Note that the data recording section 32 may include a memory card, an optical disk such as a CD, or a DVD instead of a hard disk. Alternatively, the map information DB 45 may be stored in an external server and acquired by the navigation device 3 through communication.

ここで、地図情報ＤＢ４５は、例えば、道路（リンク）に関するリンクデータ、ノード点に関するノードデータ、施設に関する施設データ、経路探索処理に用いられる探索データ、地図を表示するための地図表示データ、各交差点に関する交差点データ、地点を検索するための検索データ等を含む。 Here, the map information DB 45 includes, for example, link data related to roads (links), node data related to node points, facility data related to facilities, search data used for route search processing, map display data for displaying a map, and each intersection. Contains intersection data related to, search data for searching points, etc.

また、学習情報ＤＢ４６は、学習装置１によって学習された学習モデルに関する情報が格納される記憶手段である。本実施形態では後述のようにナビゲーション装置３は、生成され液晶ディスプレイ３５に表示する前の画像データについて異常画像（表示した際にブロックノイズが生じる画像）に該当するかを判定し、異常画像に該当すると判定された場合に画像データの再生成を行うが、判定の際に用いる閾値が学習値として上記学習モデルに含まれている。尚、学習情報ＤＢ４６に格納される学習モデルに関する情報は、学習装置１から通信或いは記憶媒体を介して取得し、更に取得した情報に基づいて適宜更新可能とするのが望ましい。 Further, the learning information DB 46 is a storage means in which information regarding learning models learned by the learning device 1 is stored. In this embodiment, as will be described later, the navigation device 3 determines whether or not the generated image data before being displayed on the liquid crystal display 35 corresponds to an abnormal image (an image that causes block noise when displayed), and When it is determined that this applies, the image data is regenerated, and the threshold value used in the determination is included in the learning model as a learning value. Note that it is desirable that the information regarding the learning model stored in the learning information DB 46 be acquired from the learning device 1 via communication or a storage medium, and can be updated as appropriate based on the acquired information.

一方、ナビゲーションＥＣＵ（エレクトロニック・コントロール・ユニット）３３は、ナビゲーション装置３の全体の制御を行う電子制御ユニットであり、演算装置及び制御装置としてのＣＰＵ５１、並びにＣＰＵ５１が各種の演算処理を行うにあたってワーキングメモリとして使用されるとともに、経路が探索されたときの経路データ等が記憶されるＲＡＭ５２、制御用のプログラムのほか、後述の画像表示処理プログラム（図１０参照）等が記録されたＲＯＭ５３、ＲＯＭ５３から読み出したプログラムを記憶するフラッシュメモリ５４等の内部記憶装置を備えている。更に、特に画像の描画に関する演算を行うＧＰＵ５５、画像の描画に関する情報を格納するＶＲＡＭ（ビデオメモリ）５６についても備えている。但し、ＧＰＵ５５やＶＲＡＭ５６は必須ではなく、ＣＰＵ５１やＲＡＭ５２がそれらの動作を代わりに行っても良い。 On the other hand, the navigation ECU (electronic control unit) 33 is an electronic control unit that controls the entire navigation device 3, and includes a CPU 51 as an arithmetic unit and a control device, and a working memory when the CPU 51 performs various arithmetic processes. A RAM 52 stores route data when a route is searched, and a ROM 53 stores control programs as well as an image display processing program (see FIG. 10), which will be described later. It is equipped with an internal storage device such as a flash memory 54 for storing programs. Furthermore, it also includes a GPU 55 that performs calculations particularly related to image drawing, and a VRAM (video memory) 56 that stores information related to image drawing. However, the GPU 55 and VRAM 56 are not essential, and the CPU 51 and RAM 52 may perform these operations instead.

操作部３４は、走行開始地点としての出発地及び走行終了地点としての目的地を入力する際等に操作され、各種のキー、ボタン等の複数の操作スイッチ（図示せず）を有する。そして、ナビゲーションＥＣＵ３３は、各スイッチの押下等により出力されるスイッチ信号に基づき、対応する各種の動作を実行すべく制御を行う。尚、操作部３４は液晶ディスプレイ３５の前面に設けたタッチパネルを有しても良い。また、マイクと音声認識装置を有しても良い。 The operation unit 34 is operated when inputting a departure point as a travel start point and a destination as a travel end point, and has a plurality of operation switches (not shown) such as various keys and buttons. Then, the navigation ECU 33 performs control to execute various corresponding operations based on switch signals output by pressing each switch. Note that the operation unit 34 may include a touch panel provided in front of the liquid crystal display 35. Further, it may include a microphone and a voice recognition device.

また、液晶ディスプレイ３５には、道路を含む地図画像、交通情報、操作案内、操作メニュー、キーの案内、ナビゲーション装置３で設定されている案内経路、案内経路に沿った案内情報、ニュース、天気予報、時刻、メール、テレビ番組等が表示される。尚、液晶ディスプレイ３５の代わりに、ＨＵＤやＨＭＤを用いても良い。また、本実施形態では特にナビゲーション装置３に組み込まれた学習モデルを用いて、後述のように液晶ディスプレイ３５に表示される画像に対してブロックノイズの低減の為の処理が行われる。 The liquid crystal display 35 also displays map images including roads, traffic information, operation guidance, operation menus, key guidance, the guidance route set by the navigation device 3, guidance information along the guidance route, news, and weather forecasts. , time, email, TV programs, etc. are displayed. Note that a HUD or HMD may be used instead of the liquid crystal display 35. Furthermore, in this embodiment, using a learning model built into the navigation device 3, processing for reducing block noise is performed on an image displayed on the liquid crystal display 35 as described later.

また、スピーカ３６は、ナビゲーションＥＣＵ３３からの指示に基づいて案内経路に沿った走行を案内する音声ガイダンスや、交通情報の案内を出力する。 Further, the speaker 36 outputs audio guidance for guiding travel along the guide route and traffic information based on instructions from the navigation ECU 33.

また、ＤＶＤドライブ３７は、ＤＶＤやＣＤ等の記録媒体に記録されたデータを読み取り可能なドライブである。そして、読み取ったデータに基づいて音楽や映像の再生、地図情報ＤＢ４５や学習情報ＤＢ４６の更新等が行われる。尚、ＤＶＤドライブ３７に替えてメモリーカードを読み書きする為のカードスロットを設けても良い。 Further, the DVD drive 37 is a drive that can read data recorded on recording media such as DVDs and CDs. Then, based on the read data, music and video are played back, and the map information DB 45 and learning information DB 46 are updated. Note that a card slot for reading and writing a memory card may be provided in place of the DVD drive 37.

また、通信モジュール３８は、交通情報センタ、例えば、ＶＩＣＳセンタやプローブセンタ等から送信された交通情報を受信する為の通信装置であり、例えば携帯電話機やＤＣＭが該当する。また、学習装置１との間の通信について可能にしても良い。 Further, the communication module 38 is a communication device for receiving traffic information transmitted from a traffic information center, such as a VICS center or a probe center, and is, for example, a mobile phone or a DCM. Further, communication with the learning device 1 may be made possible.

続いて、上記構成を有する本実施形態に係る学習装置１においてＣＰＵ２１が実行する学習処理プログラムについて図４に基づき説明する。図４は本実施形態に係る学習処理プログラムのフローチャートである。ここで、学習処理プログラムは、前回の処理を実行した後から一定期間経過後に実行され、教師画像ＤＢ１２に格納された画像を用いてナビゲーション装置３に表示される画像に生じるブロックノイズ検出の為の学習モデルの学習を行い、より最適な学習モデルを生成するプログラムである。また、以下の図４にフローチャートで示されるプログラムは、制御部１１が備えているＲＡＭ２２、ＲＯＭ２３等に記憶されており、ＣＰＵ２１により実行される。 Next, a learning processing program executed by the CPU 21 in the learning device 1 according to the present embodiment having the above configuration will be described based on FIG. 4. FIG. 4 is a flowchart of the learning processing program according to this embodiment. Here, the learning processing program is executed after a certain period of time has passed since the previous processing was executed, and uses the images stored in the teacher image DB 12 to detect block noise occurring in the image displayed on the navigation device 3. This is a program that performs training on a learning model and generates a more optimal learning model. Further, a program shown in a flowchart in FIG. 4 below is stored in the RAM 22, ROM 23, etc. included in the control unit 11, and is executed by the CPU 21.

先ず、学習処理プログラムでは、ステップ（以下、Ｓと略記する）１において、ＣＰＵ２１は教師画像ＤＢ１２から学習モデルの学習に用いる学習用画像２５を抽出する。尚、学習用画像２５は、ナビゲーション装置３において表示対象となる画像（例えば地図画像、移動案内画像、施設案内画像等）であり、過去にナビゲーション装置３において実際に表示された画像でも良いし、学習用に疑似的に生成した画像であっても良い。過去にナビゲーション装置３において実際に表示された画像を収集する手段としては、例えば液晶ディスプレイ３５の表示画像を撮像し、撮像画像から正常画像のみを目視で抽出することにより行う。また、学習の過程で後述のように二次テスト用画像２７の一部が学習用画像２５として追加される場合もある（Ｓ１３）。前記Ｓ１で抽出される学習用画像２５は基本的にブロックノイズが生じていない正常画像のみであり、枚数は約９万枚とする。 First, in step (hereinafter abbreviated as S) 1 of the learning processing program, the CPU 21 extracts the learning image 25 used for learning the learning model from the teacher image DB 12. The learning image 25 is an image to be displayed on the navigation device 3 (for example, a map image, a movement guide image, a facility guide image, etc.), and may be an image actually displayed on the navigation device 3 in the past. The images may be pseudo-generated for learning purposes. The means for collecting images actually displayed in the navigation device 3 in the past is, for example, by capturing an image displayed on the liquid crystal display 35 and visually extracting only normal images from the captured image. Further, during the learning process, a part of the secondary test image 27 may be added as the learning image 25 as described later (S13). The learning images 25 extracted in S1 are basically only normal images without block noise, and the number of images is approximately 90,000.

尚、前記Ｓ１で抽出された約９万枚の学習用画像２５の中には、全く同一或いはごく一部のみが異なる実質的に同一な複数の画像（以下、重複画像）が多く含まれる。このような複数の重複画像の全てを対象として学習を行ったとしても、複数の重複画像の内から選択した一の重複画像のみを対象として学習を行った場合であっても学習効果に大きな差はない。従って、処理負担を軽減するために図５に示すように重複画像については一のみを残して他の重複画像は以降の処理対象から除外する（Ｓ２）。特にＳ２においてＣＰＵ２１は、ハッシュ関数を用いて前記Ｓ１で抽出された約９万枚の学習用画像２５の内から上記重複画像の探索を行う。尚、ハッシュ関数を用いた探索以外の手段によって重複画像を判定しても良い。前記Ｓ２の処理を行った結果、以降の処理で処理対象となる学習用画像２５の枚数は約４．５万枚まで削減される。 Note that the approximately 90,000 learning images 25 extracted in S1 include many substantially identical images (hereinafter referred to as duplicate images) that are completely the same or differ only in a small portion. Even if learning is performed using all of these multiple duplicate images as the target, there is a large difference in the learning effect even if learning is performed using only one duplicate image selected from the multiple duplicate images. There isn't. Therefore, in order to reduce the processing load, as shown in FIG. 5, only one of the duplicate images is left and the other duplicate images are excluded from subsequent processing (S2). In particular, in S2, the CPU 21 uses a hash function to search for the duplicated images from among the approximately 90,000 learning images 25 extracted in S1. Note that duplicate images may be determined by means other than searching using a hash function. As a result of performing the process in S2, the number of learning images 25 to be processed in subsequent processes is reduced to approximately 45,000.

続いて、Ｓ３においてＣＰＵ２１は、ブロック形状の所定エリアを疑似的にノイズ領域に設定したブロックノイズの画像（以下、疑似ブロックノイズ画像という）を生成する。ここで、図６は前記Ｓ３で生成される疑似ブロックノイズ画像６１の一例を示した図である。疑似ブロックノイズ画像６１のサイズは学習用画像２５と同サイズとする。また、ノイズ領域は一又は複数の正方形のブロック形状が連なった領域となるが、画像内においてノイズ領域となるブロック形状のサイズや位置はランダムに設定される。例えば以下の方法により設定する。
（１）周波数をランダムに取得し、ｓｉｎ波とｃｏｓ波の合成波を生成する。
（２）画像の縦方向に対してブロックサイズ単位のステップで合成波からサンプリング値をサンプリングする。
（３）画像の横方向に対しても上記（１）及び（２）の処理を行う。
（４）縦方向と横方向に夫々サンプリングされたサンプリング値について、閾値を基準に白（２５５）と黒（０）のいずれかに区分する。黒に設定された領域がノイズ領域となる。 Subsequently, in S3, the CPU 21 generates a block noise image (hereinafter referred to as a pseudo block noise image) in which a predetermined block-shaped area is pseudo-set as a noise region. Here, FIG. 6 is a diagram showing an example of the pseudo block noise image 61 generated in S3. The size of the pseudo block noise image 61 is set to be the same size as the learning image 25. Further, the noise area is an area in which one or more square blocks are connected, but the size and position of the block shapes that become the noise area are set randomly within the image. For example, use the following method to set it.
(1) Obtain frequencies at random and generate a composite wave of sine waves and cosine waves.
(2) Sampling values are sampled from the composite wave in steps of block size in the vertical direction of the image.
(3) The above processes (1) and (2) are also performed in the horizontal direction of the image.
(4) The sampling values sampled in the vertical and horizontal directions are classified into either white (255) or black (0) based on the threshold value. The area set to black becomes the noise area.

また、前記Ｓ３においてＣＰＵ２１は、前記Ｓ１で教師画像ＤＢ１２から抽出され、前記Ｓ２で重複画像を除外した後の学習用画像２５の数に対応する数（例えば約４．５万枚）だけ疑似ブロックノイズ画像６１を生成する。生成される複数の疑似ブロックノイズ画像６１毎に、ノイズ領域となるブロック形状のサイズや位置は上記（１）～（４）の手段を用いてランダムに設定される。 In addition, in S3, the CPU 21 generates pseudo blocks corresponding to the number of learning images 25 (for example, about 45,000 images) extracted from the teacher image DB 12 in S1 and after removing duplicate images in S2. A noise image 61 is generated. For each of the plurality of generated pseudo block noise images 61, the size and position of the block shape serving as the noise area are randomly set using the means (1) to (4) above.

その後、Ｓ４においてＣＰＵ２１は、前記Ｓ３で生成された疑似ブロックノイズ画像について、複数の疑似ブロックノイズ画像６１毎にノイズ領域の色をランダムに設定する。例えば、ＨＳＶ色空間から色をランダムに選択し、選択した色にノイズ領域の色を変換する。また、ノイズ領域の透過率についても同様にランダムに設定する。尚、ノイズ領域の色や透過率については一の画像内において同一としても良いし、一の画像内でもエリア毎に異なる態様に設定しても良い。 Thereafter, in S4, the CPU 21 randomly sets the color of the noise region for each of the plurality of pseudo block noise images 61 for the pseudo block noise images generated in S3. For example, a color is randomly selected from the HSV color space and the color of the noise region is converted to the selected color. Furthermore, the transmittance of the noise area is also randomly set. Note that the color and transmittance of the noise area may be the same within one image, or may be set differently for each area even within one image.

続いて、Ｓ５においてＣＰＵ２１は、前記Ｓ１で教師画像ＤＢ１２から抽出され、前記Ｓ２で重複画像を除外した後の学習用画像２５に対して、図７に示すように前記Ｓ３で生成された疑似ブロックノイズ画像６１を重畳して合成する。更に、Ｓ６においてＣＰＵ２１は、重畳した画像をグレースケール変換する。その結果、図７に示すように正常画像である学習用画像２５に対して擬似的にブロックノイズを生じさせた疑似異常画像６２が生成される。尚、疑似異常画像６２に生じているブロックノイズの位置、形状、透過率、色（グレースケールの色）については画像毎に上記Ｓ３及びＳ４の処理によりランダムに設定されることとなる。また、処理対象となる学習用画像２５の枚数が約４．５万枚であれば、疑似異常画像６２も約４．５万枚生成されることとなる。 Subsequently, in S5, the CPU 21 extracts the pseudo blocks generated in S3 as shown in FIG. The noise image 61 is superimposed and synthesized. Furthermore, in S6, the CPU 21 converts the superimposed image into a gray scale. As a result, as shown in FIG. 7, a pseudo-abnormal image 62 is generated in which block noise is artificially generated in the learning image 25, which is a normal image. Note that the position, shape, transmittance, and color (grayscale color) of the block noise occurring in the pseudo-abnormal image 62 are randomly set for each image by the processing in S3 and S4. Furthermore, if the number of learning images 25 to be processed is approximately 45,000, approximately 45,000 pseudo abnormal images 62 will also be generated.

その後、Ｓ７においてＣＰＵ２１は、前記Ｓ６で生成された疑似ブロックノイズ画像（即ち擬似的にブロックノイズを生じさせた異常画像）と教師画像ＤＢ１２から抽出した学習用画像２５（即ち正常画像）を教師画像として、ランダムに学習モデルに入力して学習を行う。尚、学習モデルに入力する正常画像と異常画像の割合は例えば正常画像：異常画像＝４：６とする。尚、割合は適宜変更可能であるが、異常画像の検出を漏れなく行う為に、異常画像の割合を多くするのが望ましい。上記学習モデルによる学習は、前記Ｓ６で生成された全ての疑似ブロックノイズ画像６１の入力が終了するまで行う。また、学習モデルは例えばＵ－Ｎｅｔモデルを用いる。 Thereafter, in S7, the CPU 21 converts the pseudo block noise image generated in S6 (i.e., the abnormal image in which block noise has been pseudo-generated) and the learning image 25 (i.e., the normal image) extracted from the teacher image DB 12 into the teacher image. is randomly input to the learning model and learning is performed. Note that the ratio of normal images and abnormal images input to the learning model is, for example, normal image: abnormal image = 4:6. Although the ratio can be changed as appropriate, it is desirable to increase the ratio of abnormal images in order to detect all abnormal images. Learning using the learning model is performed until all the pseudo block noise images 61 generated in S6 are input. Further, as the learning model, for example, the U-Net model is used.

前記Ｓ７における学習モデルによる学習としては、例えばランダムに入力された画像が“正常画像”又は“異常画像”のいずれに該当するかの予測を行う。そして、学習モデルが予測した結果と、予め用意している画像が正常画像か異常画像であるかの答えと照合させることで学習モデルの学習を行う。尚、学習モデルの学習としては“正常画像”又は“異常画像”の予測に加えて、画像内のどの領域にブロックノイズが生じているかを予測させ、学習モデルが予測した結果と、予め用意しているブロックノイズが生じている領域の答えと照合させることで学習モデルの学習を行うようにしても良い。 As for the learning using the learning model in S7, for example, it is predicted whether a randomly input image corresponds to a "normal image" or an "abnormal image." Then, the learning model is trained by comparing the results predicted by the learning model with a previously prepared answer as to whether the image is a normal image or an abnormal image. In addition to predicting "normal images" or "abnormal images," the learning model is trained to predict in which areas within the image block noise occurs, and the results predicted by the learning model are combined with the predictions prepared in advance. The learning model may be trained by comparing the answer with the answer of the area where block noise occurs.

その後、Ｓ８においてＣＰＵ２１は、生成された全ての疑似ブロックノイズ画像６１を対象として学習モデルの学習を行った後に、学習モデルの学習結果を評価する為に教師画像ＤＢ１２に格納された一次テスト用画像２６を用いて評価値ｍＩｏＵを算出する。具体的には、学習後の学習モデルに対して一次テスト用画像２６を入力し、一次テスト用画像２６においてブロックノイズが生じている領域を予測させる。そして、学習モデルが予測した結果と、予め用意しているブロックノイズが生じている領域の答えと照合させ、評価値ｍＩｏＵを算出する。尚、評価値ｍＩｏＵは予測結果と正解が完全に一致していれば１．０となり、ずれが大きくなる程０に近い値が算出される。例えばピクセル単位で混同行列を用いることによって評価値ｍＩｏＵを算出する。 After that, in S8, the CPU 21 trains the learning model using all the generated pseudo block noise images 61, and then uses the primary test image stored in the teacher image DB 12 to evaluate the learning results of the learning model. 26 to calculate the evaluation value mIoU. Specifically, the primary test image 26 is input to the learned model after learning, and the area where block noise occurs in the primary test image 26 is predicted. Then, the result predicted by the learning model is compared with a previously prepared answer for the area where block noise occurs, and an evaluation value mIoU is calculated. Note that the evaluation value mIoU is 1.0 if the prediction result and the correct answer completely match, and a value closer to 0 is calculated as the deviation becomes larger. For example, the evaluation value mIoU is calculated by using a confusion matrix for each pixel.

尚、一次テスト用画像２６は、過去にナビゲーション装置３において実際に表示された画像を用い、ブロックノイズが生じていない正常画像と、ブロックノイズが生じている異常画像を夫々含むようにする。一次テスト用画像２６の枚数は約２万枚とするが、２万枚の一次テスト用画像２６に対して夫々算出された評価値ｍＩｏＵの平均値を、今回の学習モデルの最終的な評価値ｍＩｏＵとする。但し、平均値ではなく中央値や最頻値としても良い。 The primary test image 26 is an image actually displayed on the navigation device 3 in the past, and includes a normal image without block noise and an abnormal image with block noise. The number of primary test images 26 is approximately 20,000, and the average value of the evaluation values mIoU calculated for each of the 20,000 primary test images 26 is the final evaluation value of this learning model. Let it be mIoU. However, the median value or the mode may be used instead of the average value.

その後、ＣＰＵ２１は上記Ｓ１～Ｓ８の処理を所定回数（例えば１００回）繰り返し行う。処理を繰り返す度に前記Ｓ３で生成される疑似ブロックノイズ画像６１のノイズ領域の位置、形状、色、透過率がランダムに変化する、即ち学習モデルで学習対象とする教師画像が変わるので、学習モデルの学習結果を評価する評価値ｍＩｏＵも繰り返す度に変わることとなる。 Thereafter, the CPU 21 repeats the processes of S1 to S8 described above a predetermined number of times (for example, 100 times). Each time the process is repeated, the position, shape, color, and transmittance of the noise region of the pseudo block noise image 61 generated in S3 change randomly, that is, the teacher image to be learned by the learning model changes, so the learning model The evaluation value mIoU used to evaluate the learning result will also change each time it is repeated.

次に、Ｓ９においてＣＰＵ２１は、所定回数繰り返し行った学習において、前記Ｓ８で算出された評価値ｍＩｏＵが最も高い値となった学習モデル（以下、ベスト学習モデルという）を選択する。そして、選択されたベスト学習モデルの学習結果を改めて評価する為に、ベスト学習モデルに対して教師画像ＤＢ１２に格納された二次テスト用画像２７を入力し、入力された二次テスト用画像２７においてブロックノイズが生じていると予測された領域（以下、ブロックノイズ領域という）の面積値を算出する。 Next, in S9, the CPU 21 selects the learning model that has the highest evaluation value mIoU calculated in S8 (hereinafter referred to as the best learning model) in the learning that has been repeated a predetermined number of times. Then, in order to reevaluate the learning results of the selected best learning model, the secondary test image 27 stored in the teacher image DB 12 is input to the best learning model, and the input secondary test image 27 The area value of a region where block noise is predicted to occur (hereinafter referred to as a block noise region) is calculated.

尚、二次テスト用画像２７は、一次テスト用画像２６と同様に過去にナビゲーション装置３において実際に表示された画像を用い、ブロックノイズが生じていない正常画像と、ブロックノイズが生じている異常画像を夫々含むようにする。二次テスト用画像２７の枚数は約７９万枚とし、７９万枚の二次テスト用画像２７に対して夫々ブロックノイズ領域の面積値を算出する。 The secondary test image 27 uses images actually displayed on the navigation device 3 in the past, similar to the primary test image 26, and includes a normal image with no block noise and an abnormal image with block noise. Include each image. The number of secondary test images 27 is approximately 790,000, and the area value of the block noise region is calculated for each of the 790,000 secondary test images 27.

続いて、Ｓ１０においてＣＰＵ２１は、入力された二次テスト用画像２７が正常画像である場合と異常画像である場合とに分けて、前記Ｓ９で算出されたブロックノイズ領域の面積値の分布（ヒストグラム）を生成する。例えば図８は前記Ｓ１０において生成されるブロックノイズ領域の面積値の分布の一例を示した図である。図８に示すように前記Ｓ１０において生成されるブロックノイズ領域の面積値の分布は、横軸を面積値、縦軸を度数とし、算出された面積値を所定範囲毎に区分してその範囲に含まれる度数を示すヒストグラムとする。 Next, in S10, the CPU 21 divides the input secondary test image 27 into a normal image and an abnormal image, and calculates the area value distribution (histogram) of the block noise region calculated in S9. ) is generated. For example, FIG. 8 is a diagram showing an example of the distribution of area values of the block noise region generated in S10. As shown in FIG. 8, the distribution of the area values of the block noise region generated in S10 is determined by dividing the calculated area values into predetermined ranges, with the horizontal axis representing the area value and the vertical axis representing the frequency. A histogram showing the included frequencies.

基本的に入力された二次テスト用画像２７が正常画像である場合のブロックノイズ領域の面積値の分布は、０付近に度数が集中することとなる。一方、入力された二次テスト用画像２７が異常画像である場合のブロックノイズ領域の面積値の分布は、ある程度分散して度数が分布することとなる。 Basically, in the distribution of the area values of the block noise region when the inputted secondary test image 27 is a normal image, the frequency is concentrated around 0. On the other hand, when the inputted secondary test image 27 is an abnormal image, the area values of the block noise region will be distributed in a somewhat dispersed frequency distribution.

次に、Ｓ１１においてＣＰＵ２１は、異常画像を異常画像と判断する再現率(Recall)が１００％となること、即ち異常画像に対する見逃しが無いことを第１の条件とし、且つ適合率（Precision、精度ともいう）が最も高くなることを第２の条件として閾値を算出する。本実施形態において学習モデルが学習するのは、ナビゲーション装置３において生成された画像データを画面上に表示する前に該画像データが異常画像（表示した際にブロックノイズが生じる画像）に該当するかを判定する為の学習である。そして、学習モデルはブロックノイズが生じると予測されるブロックノイズ領域の面積値が閾値以上である場合に異常画像であると判定することとし、Ｓ１１ではその閾値について学習によって最適値を算出する。 Next, in S11, the CPU 21 sets the first condition that the recall rate (Recall) for determining an abnormal image as an abnormal image is 100%, that is, that no abnormal image is overlooked, and The threshold value is calculated with the second condition being that the value (also referred to as ``(also referred to as ``(also referred to as ``(also referred to as ``(also referred to as "(also referred to as "))" is the highest" is the second condition. In this embodiment, the learning model learns whether the image data generated in the navigation device 3 corresponds to an abnormal image (an image that causes block noise when displayed) before displaying the image data on the screen. This is learning to judge. Then, the learning model determines that the image is an abnormal image when the area value of a block noise region where block noise is predicted to occur is equal to or larger than a threshold value, and in S11, an optimal value for the threshold value is calculated by learning.

ここで、閾値の算出において重要なのはナビゲーション装置３に反映した際に異常画像の見逃しが無いこと、即ち異常画像を正常画像と判定しないことである。また、適合率、即ちナビゲーション装置３が異常画像と判定した画像が実際に異常画像である割合はできる限り高くするのが望ましい。従って、図８に示すように生成されたブロックノイズ領域の面積値の分布において、入力された二次テスト用画像２７が異常画像である場合の度数が存在する最も低い面積値を閾値とする。 Here, what is important in calculating the threshold value is that no abnormal images are overlooked when reflected on the navigation device 3, that is, that abnormal images are not determined to be normal images. Furthermore, it is desirable to make the precision rate, that is, the percentage of images determined to be abnormal images by the navigation device 3 as actually abnormal images, as high as possible. Therefore, in the distribution of the area values of the block noise region generated as shown in FIG. 8, the lowest area value that has the frequency when the input secondary test image 27 is an abnormal image is set as the threshold value.

また、前記Ｓ１１においてＣＰＵ２１は、設定された閾値における適合率を算出する。ここで、適合率は上述したようにナビゲーション装置３に反映した際に異常画像と判定した画像が実際に異常画像である割合（正解率）である。前記Ｓ１０で生成されたブロックノイズ領域の面積値の分布において、閾値以上の面積値において正常画像と異常画像を合わせた度数に対する異常画像の度数の割合が適合率となる。即ち、閾値以上の面積値において正常画像の度数が存在しなければ適合率は理想値である１００％となるが、１００％の到達は困難である為、後述のように目標値として例えば９９％を設定する。 Further, in S11, the CPU 21 calculates the matching rate at the set threshold value. Here, the precision rate is the proportion (accuracy rate) of images that are determined to be abnormal images when reflected on the navigation device 3 and are actually abnormal images, as described above. In the distribution of the area values of the block noise region generated in S10, the ratio of the frequency of the abnormal image to the combined frequency of the normal image and the abnormal image for area values equal to or higher than the threshold value is the matching rate. In other words, if there is no frequency of normal images in area values greater than or equal to the threshold, the precision will be the ideal value of 100%, but since it is difficult to reach 100%, the target value is set to 99%, for example, as described below. Set.

その後、Ｓ１２においてＣＰＵ２１は、前記Ｓ１１で設定された閾値をナビゲーション装置３に反映した際の再現率(Recall)と適合率（Precision）の値が目標値に到達するか否かを判定する。尚、再現率の目標値は１００％とし、適合率の目標値は９９％とする。但し、目標値は適宜変更可能である。例えば適合率の目標値を９８％や９７％としても良い。また、本実施形態では上述のように再現率が１００％であることを条件に閾値を設定するので、前記Ｓ１２では再現率については必ず目標値に到達すると判定されることとなるが、例えば適合率が９９％であることを条件に閾値を設定し、Ｓ１２では再現率が目標値に到達するか否かの判定を行っても良い。 After that, in S12, the CPU 21 determines whether or not the values of the recall rate (Recall) and the precision rate (Precision) when the threshold value set in S11 is reflected in the navigation device 3 reach the target values. Note that the target value for the recall rate is 100%, and the target value for the precision rate is 99%. However, the target value can be changed as appropriate. For example, the target value of the precision rate may be set to 98% or 97%. In addition, in this embodiment, the threshold value is set on the condition that the recall rate is 100% as described above, so in S12, it is determined that the recall rate always reaches the target value. A threshold value may be set on the condition that the rate is 99%, and it may be determined in S12 whether the recall rate reaches the target value.

そして、前記Ｓ１１で設定された閾値をナビゲーション装置３に反映した際の再現率(Recall)と適合率（Precision）の値が目標値に到達すると判定された場合（Ｓ１２：ＹＥＳ）には、Ｓ１４へと移行する。それに対して、前記Ｓ１１で設定された閾値をナビゲーション装置３に反映した際の再現率(Recall)と適合率（Precision）の値が目標値に到達しないと判定された場合（Ｓ１２：ＮＯ）には、Ｓ１３へと移行する。 Then, if it is determined that the recall rate (Recall) and precision rate (Precision) when the threshold value set in S11 is reflected in the navigation device 3 reaches the target value (S12: YES), S14 transition to. On the other hand, if it is determined that the recall rate (Recall) and precision rate (Precision) when the threshold value set in S11 is reflected in the navigation device 3 does not reach the target value (S12: NO). Then, the process moves to S13.

Ｓ１３においてＣＰＵ２１は、適合率を向上させる為に、現在の学習モデルではブロックノイズ領域の予測を正しく行うことができなかった正常画像、即ち前記Ｓ１０で生成されたブロックノイズ領域の面積値の分布において、ブロックノイズ領域の面積値が閾値以上となっている正常画像について学習対象として追加する。具体的には図９に示すようにブロックノイズ領域の面積値の分布において、閾値以上の面積値となる正常画像を学習用画像２５に追加する。即ち、二次テスト用画像２７の一部が学習用画像２５に新たに追加されることとなる。 In S13, in order to improve the precision rate, the CPU 21 uses the normal image in which the block noise region could not be correctly predicted using the current learning model, that is, the distribution of area values of the block noise region generated in S10. , normal images in which the area value of the block noise region is greater than or equal to the threshold are added as learning targets. Specifically, as shown in FIG. 9, in the area value distribution of the block noise region, a normal image having an area value equal to or greater than a threshold value is added to the learning image 25. That is, a part of the secondary test image 27 is newly added to the learning image 25.

その後、Ｓ１へと戻りＣＰＵ２１は新たに画像の追加された学習用画像２５に基づいて学習モデルの学習を行う。ブロックノイズ領域の予測を正しくできなかった正常画像を学習対象として追加することにより、その後に新たな学習を行った学習モデルでは適合率（Precision）の向上が期待できる。 Thereafter, the process returns to S1 and the CPU 21 performs learning of the learning model based on the newly added learning image 25. By adding normal images that could not correctly predict block noise regions as learning targets, it is expected that the precision of the learning model that performs new learning after that will improve.

一方、Ｓ１４においてＣＰＵ２１は、前記Ｓ１１で設定された閾値を含むベスト学習モデルを学習結果としてナビゲーション装置３へと組み込む。尚、ベスト学習モデルの組み込みは、例えば通信や記憶媒体を介して必要な情報をナビゲーション装置３へと取得させることにより行う。その結果、ナビゲーション装置３は組み込んだベスト学習モデルを用いて後述のように表示する画像のブロックノイズの発生を低減させる処理が可能となる。 On the other hand, in S14, the CPU 21 incorporates the best learning model including the threshold set in S11 into the navigation device 3 as a learning result. Incidentally, the best learning model is incorporated by, for example, causing the navigation device 3 to acquire necessary information via communication or a storage medium. As a result, the navigation device 3 can use the incorporated best learning model to perform processing to reduce the occurrence of block noise in images to be displayed as described later.

続いて、本実施形態に係るナビゲーション装置３においてＣＰＵ５１やＧＰＵ５５が実行する画像表示処理プログラムについて図１０に基づき説明する。図１０は本実施形態に係る画像表示処理プログラムのフローチャートである。ここで、画像表示処理プログラムは、車両のＡＣＣ電源(accessory power supply)がＯＮされた後に実行され、前述の学習処理プログラム（図４参照）において学習装置１により学習した学習モデルを組み込むことによって、表示する画像のブロックノイズの発生を低減させるプログラムである。また、以下の図１０にフローチャートで示されるプログラムは、ナビゲーションＥＣＵ３３が備えているＲＡＭ５２、ＲＯＭ５３等に記憶されており、ＣＰＵ５１又はＧＰＵ５５により実行される。 Next, an image display processing program executed by the CPU 51 and the GPU 55 in the navigation device 3 according to this embodiment will be described based on FIG. 10. FIG. 10 is a flowchart of the image display processing program according to this embodiment. Here, the image display processing program is executed after the ACC power supply (accessory power supply) of the vehicle is turned on, and by incorporating the learning model learned by the learning device 1 in the above-mentioned learning processing program (see FIG. 4), This is a program that reduces the occurrence of block noise in displayed images. Further, a program shown in a flowchart in FIG. 10 below is stored in the RAM 52, ROM 53, etc. included in the navigation ECU 33, and is executed by the CPU 51 or GPU 55.

先ず、画像表示処理プログラムでは、Ｓ２１においてＣＰＵ５１はナビゲーション装置３において表示対象とする画像の画像データを生成する。尚、ナビゲーション装置３において表示対象とする画像は現在のナビゲーション装置３の状況によって異なるが、基本的には車両の現在位置周辺の地図画像を表示する。また、車両の移動案内の実行中であって案内交差点に接近した場合には、交差点拡大図等についても表示する。また、ユーザによって施設検索が行われた場合には検索された施設のリストや施設情報等についても表示する。 First, in the image display processing program, the CPU 51 generates image data of an image to be displayed in the navigation device 3 in S21. Note that the image to be displayed in the navigation device 3 varies depending on the current status of the navigation device 3, but basically a map image around the current position of the vehicle is displayed. Additionally, when the vehicle approaches a guidance intersection while the vehicle is being guided, an enlarged view of the intersection, etc. is also displayed. Furthermore, when the user performs a facility search, a list of searched facilities, facility information, etc. are also displayed.

続いて、Ｓ２２においてＣＰＵ５１は、学習情報ＤＢ４６に格納された学習モデルを用いて、前記Ｓ２１で生成された画像データについて液晶ディスプレイ３５に表示する前にブロックノイズの検出を行う。具体的には、ＣＰＵ５１は学習モデルを用いてブロックノイズが生じると予測されるエリアの面積値を特定し、面積値が閾値以上の場合に画像データにブロックノイズが生じると判定する。尚、閾値については前述の学習処理プログラム（図４参照）において最適値が算出されている（Ｓ１１）。 Subsequently, in S22, the CPU 51 uses the learning model stored in the learning information DB 46 to detect block noise before displaying the image data generated in S21 on the liquid crystal display 35. Specifically, the CPU 51 uses the learning model to specify the area value of an area where block noise is predicted to occur, and determines that block noise will occur in the image data if the area value is equal to or greater than a threshold value. Note that the optimal value of the threshold value is calculated in the aforementioned learning processing program (see FIG. 4) (S11).

その後、Ｓ２３においてＣＰＵ５１は、前記Ｓ２２の検出結果に基づいて前記Ｓ２１で生成された画像データにブロックノイズが生じると判定されたか否かを判定し、ブロックノイズが生じると判定された場合（Ｓ２３：ＹＥＳ）には、Ｓ２４へと移行する。 Thereafter, in S23, the CPU 51 determines whether it is determined that block noise occurs in the image data generated in S21 based on the detection result of S22, and if it is determined that block noise occurs (S23: If YES), the process moves to S24.

Ｓ２４においてＣＰＵ５１は、生成された画像データに異常が有ると推定し、画像データを再度生成する処理を行う。その後、Ｓ２２へと移行し、再生成した画像データについて改めてブロックノイズの検出を行う。 In S24, the CPU 51 estimates that there is an abnormality in the generated image data, and performs processing to generate the image data again. After that, the process moves to S22, and block noise is detected again on the regenerated image data.

一方、前記Ｓ２１で生成された画像データにブロックノイズが生じないと判定された場合（Ｓ２３：ＮＯ）には、Ｓ２５へと移行する。Ｓ２５では生成された画像データを一旦ＲＡＭ５２やＶＲＡＭ５６等のメモリに格納する。 On the other hand, if it is determined that block noise does not occur in the image data generated in S21 (S23: NO), the process moves to S25. In S25, the generated image data is temporarily stored in a memory such as the RAM 52 or VRAM 56.

その後、ＧＰＵ５５はメモリに格納された画像データを液晶ディスプレイ３５に描画する描画処理を行う。その結果、液晶ディスプレイ３５に表示される画像にブロックノイズが生じることを防止することが可能となる。尚、画像の描画処理についてはＣＰＵ５１が行っても良い。 Thereafter, the GPU 55 performs a drawing process to draw the image data stored in the memory on the liquid crystal display 35. As a result, it is possible to prevent block noise from occurring in the image displayed on the liquid crystal display 35. Note that the CPU 51 may perform image drawing processing.

以上詳細に説明した通り、本実施形態に係る学習装置１及び学習装置１で実行されるコンピュータプログラムでは、ブロックノイズの生じていない学習用正常画像を取得するとともに（Ｓ１）、ブロック形状の所定エリアを疑似的にノイズ領域に設定したブロックノイズの画像である疑似ブロックノイズ画像を生成し（Ｓ３）、学習用正常画像に対して疑似ブロックノイズ画像を重畳して合成することによって、学習用正常画像に対して擬似的にブロックノイズを生じさせた疑似異常画像を、ブロックノイズを検出する為の学習モデルの学習に用いる教師データとして生成し（Ｓ６）、生成された教師データを用いて学習モデルの学習を行う（Ｓ１～Ｓ１３）ので、従来と比べて教師データの取得が容易となる。その結果、画像表示装置（例えばナビゲーション装置、スマートフォン、タブレット端末、パーソナルコンピュータ）に表示する画像の品質を向上させるという点において広く適用可能な学習モデルを構築することが可能となる。 As explained in detail above, in the learning device 1 according to the present embodiment and the computer program executed by the learning device 1, a normal image for learning without block noise is acquired (S1), and a predetermined area of a block shape is acquired. A pseudo block noise image is generated (S3), which is an image of block noise that is set in a pseudo noise region, and the pseudo block noise image is superimposed and synthesized on the normal image for learning, thereby creating the normal image for learning. A pseudo abnormal image in which block noise is generated is generated as training data for use in learning a learning model for detecting block noise (S6), and the generated training data is used to train the learning model. Since learning is performed (S1 to S13), it is easier to obtain teacher data than in the past. As a result, it is possible to construct a learning model that is widely applicable in terms of improving the quality of images displayed on image display devices (eg, navigation devices, smartphones, tablet terminals, personal computers).

尚、本発明は前記実施形態に限定されるものではなく、本発明の要旨を逸脱しない範囲内で種々の改良、変形が可能であることは勿論である。
例えば、上記実施形態では、学習モデルの学習結果を評価する手段として再現率(Recall)と適合率（Precision）を用いているが、他の評価手段によって学習結果を評価するようにしても良い。 It should be noted that the present invention is not limited to the embodiments described above, and it goes without saying that various improvements and modifications can be made without departing from the gist of the present invention.
For example, in the embodiment described above, recall and precision are used as means for evaluating the learning results of the learning model, but the learning results may be evaluated using other evaluation means.

また、本実施形態ではブロックノイズの画像である疑似ブロックノイズ画像６１を生成する際に、ノイズ領域とするブロック形状のサイズ、位置、ノイズ領域の色及び透過率の各パラメータを夫々ランダムに変化させているが、各パラメータの一部（例えばノイズ領域とするブロック形状のサイズと位置）のみをランダムに設定しても良い。また、学習装置１の操作を行う操作者においてパラメータを設定可能にしても良い。 Furthermore, in this embodiment, when generating the pseudo block noise image 61, which is an image of block noise, each parameter of the size and position of the block shape to be the noise region, the color and transmittance of the noise region is changed at random. However, only a part of each parameter (for example, the size and position of the block shape used as the noise region) may be randomly set. Further, the operator who operates the learning device 1 may be able to set the parameters.

また、本実施形態では学習用画像２５に対して擬似的にブロックノイズを生じさせた疑似異常画像をグレースケール変換しているが、グレースケール変換については行わないようにしても良い。 Further, in this embodiment, the pseudo abnormal image in which block noise is generated in the learning image 25 is subjected to gray scale conversion, but the gray scale conversion may not be performed.

また、上記実施形態では学習処理プログラム（図４）は学習装置１が実施しているが、ナビゲーション装置３が実施するようにしても良い。その場合には、学習機能を有するナビゲーション装置３が本発明に係る学習装置に該当する。 Further, in the above embodiment, the learning processing program (FIG. 4) is executed by the learning device 1, but it may be executed by the navigation device 3. In that case, the navigation device 3 having a learning function corresponds to the learning device according to the present invention.

また、上記実施形態では表示画像のブロックノイズの低減を図る対象として車載器であるナビゲーション装置３を例に挙げて説明しているが、画像を表示する手段を備える装置であればナビゲーション装置以外であっても良い。例えば、スマートフォン、タブレット端末、パーソナルコンピュータ等をナビゲーション装置３の代わりとしても良い。 Furthermore, in the above embodiment, the navigation device 3, which is an on-vehicle device, is used as an example to reduce block noise in a displayed image. It's okay. For example, a smartphone, a tablet terminal, a personal computer, etc. may be used instead of the navigation device 3.

また、本発明に係るブロックノイズ検出の為の学習装置を具体化した実施例について上記に説明したが、ブロックノイズ検出の為の学習装置は以下の構成を有することも可能であり、その場合には以下の効果を奏する。 Moreover, although the embodiment embodying the learning device for detecting block noise according to the present invention has been described above, the learning device for detecting block noise can also have the following configuration, and in that case, has the following effects.

例えば、第１の構成は以下のとおりである。
ブロックノイズの生じていない学習用正常画像（２５）を取得する正常画像取得手段（２１）と、ブロック形状の所定エリアを疑似的にノイズ領域に設定したブロックノイズの画像である疑似ブロックノイズ画像（６１）を生成する疑似ブロックノイズ画像生成手段（２１）と、前記学習用正常画像に対して前記疑似ブロックノイズ画像を重畳して合成することによって、前記学習用正常画像に対して擬似的にブロックノイズを生じさせた疑似異常画像（６２）を、ブロックノイズを検出する為の学習モデルの学習に用いる教師データとして生成する教師データ生成手段（２１）と、を有する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、正常画像に対して疑似的にブロックノイズを生じさせた画像を生成して教師データとして用い、ブロックノイズを検出する為の学習を行うので、従来と比べて教師データの取得が容易となる。その結果、画像表示装置に表示する画像の品質を向上させるという点において広く適用可能な学習モデルを構築することが可能となる。 For example, the first configuration is as follows.
A normal image acquisition means (21) that acquires a normal image for learning (25) in which block noise does not occur, and a pseudo block noise image (21) that is an image of block noise in which a predetermined block-shaped area is pseudo-set as a noise region. 61), and a pseudo block noise image generating means (21) that generates a pseudo block noise image on the learning normal image by superimposing and composing the pseudo block noise image on the learning normal image. It has a teacher data generating means (21) that generates a pseudo abnormal image (62) in which noise is generated as teacher data used for learning a learning model for detecting block noise.
According to the learning device for block noise detection having the above configuration, an image in which block noise is generated in a pseudo normal image is generated and used as training data, and learning for detecting block noise is performed. Therefore, it is easier to obtain training data than in the past. As a result, it becomes possible to construct a learning model that is widely applicable in terms of improving the quality of images displayed on image display devices.

また、第２の構成は以下のとおりである。
前記疑似ブロックノイズ画像生成手段（２１）は、ノイズ領域とするブロック形状のサイズ、位置、ノイズ領域の色及び透過率を夫々設定して前記疑似ブロックノイズ画像（６１）を生成する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、擬似的にブロックノイズを生じさせた疑似異常画像を生成する場合であっても、実際にブロックノイズが生じている異常画像を再現して教師データとして用いることが可能となる。 Further, the second configuration is as follows.
The pseudo block noise image generation means (21) generates the pseudo block noise image (61) by setting the size and position of a block shape to be a noise region, the color and transmittance of the noise region, respectively.
According to the learning device for block noise detection having the above configuration, even when generating a pseudo abnormal image in which block noise is generated, it is possible to reproduce an abnormal image in which block noise actually occurs. This makes it possible to use the data as training data.

また、第３の構成は以下のとおりである。
前記疑似ブロックノイズ画像生成手段（２１）は、ノイズ領域とするブロック形状のサイズ、位置、ノイズ領域の色及び透過率の少なくとも一をパラメータとしてランダムに変化させて複数の前記疑似ブロックノイズ画像（６１）を生成し、前記教師データ生成手段（２１）は、複数の前記疑似ブロックノイズ画像（６１）を前記学習用正常画像（２５）に対して重畳して合成した複数の前記疑似異常画像（６２）を生成し、複数の前記疑似異常画像を教師データとして用いて前記学習モデルの学習を行う。
上記構成を有するブロックノイズ検出の為の学習装置によれば、擬似的にブロックノイズを生じさせた疑似異常画像を生成する場合に、パラメータをランダムに変えて様々なブロックノイズのパターンに基づく複数の疑似異常画像を生成し、生成した複数の疑似異常画像を教師データとして用いることによって、実際に生じ得る多様なブロックノイズを想定した高い学習効果について期待できる。 Moreover, the third configuration is as follows.
The pseudo block noise image generation means (21) generates a plurality of pseudo block noise images (61) by randomly changing at least one of the size and position of the block shape to be a noise region, the color and transmittance of the noise region as parameters. ), and the teacher data generation means (21) generates a plurality of pseudo abnormal images (62), which are synthesized by superimposing a plurality of pseudo block noise images (61) on the learning normal image (25). ), and the learning model is trained using the plurality of pseudo abnormal images as training data.
According to the learning device for detecting block noise having the above configuration, when generating a pseudo-abnormal image in which block noise is generated, the learning device randomly changes the parameters and generates a plurality of images based on various block noise patterns. By generating pseudo-abnormal images and using a plurality of generated pseudo-abnormal images as training data, a high learning effect can be expected assuming various block noises that may actually occur.

また、第４の構成は以下のとおりである。
教師データとして用いる前記疑似異常画像（６２）を変えて前記学習モデルの学習を複数回行い、複数回の学習毎に前記学習モデルの学習結果を評価する学習結果評価手段（２１）と、最も評価の高い学習結果を示した前記学習モデルを採用する学習モデル採用手段（２１）と、を有する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、教師データとする疑似異常画像を変化させて複数回の学習を行わせるとともに、最も評価の高い学習後の学習モデルを採用するので、仮に擬似的にブロックノイズを生じさせた疑似異常画像に実際には存在し得ない不自然な異常画像が含まれていたとしても、それらを教師データとして学習した学習モデルについては排除でき、より実際の異常画像に近い疑似異常画像を教師データとして学習した学習モデルを採用することが可能となる。 Further, the fourth configuration is as follows.
a learning result evaluation means (21) that performs learning of the learning model multiple times by changing the pseudo abnormal images (62) used as training data, and evaluates the learning results of the learning model for each of the multiple learnings; learning model employing means (21) for employing the learning model that has shown a high learning result.
According to the learning device for block noise detection having the above configuration, learning is performed multiple times by changing the pseudo abnormal images used as teaching data, and the learning model after learning with the highest evaluation is adopted. Even if a pseudo-abnormal image that artificially generates block noise contains unnatural abnormal images that cannot actually exist, the learning model that learned them as training data can eliminate them, making it more practical. It becomes possible to employ a learning model that is trained using pseudo-abnormal images close to the abnormal images of , as training data.

また、第５の構成は以下のとおりである。
前記学習結果評価手段（２１）は、複数回の学習毎に学習された学習モデルを用いて、ブロックノイズが生じているテスト用異常画像を含む第１のテスト用画像（２６）に対してブロックノイズが生じているエリアの予測判断を行わせ、ブロックノイズが生じていると予測されたエリアと実際にブロックノイズが生じているエリアとの一致の度合いを評価する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、ブロックノイズが生じているエリアの予測判断によって学習結果の評価を行うので、ブロックノイズを検出する為の学習評価の高い学習モデルを採用することが可能となる。 Further, the fifth configuration is as follows.
The learning result evaluation means (21) uses the learning model learned for each of the plurality of learnings to evaluate the first test image (26) including the abnormal test image in which block noise has occurred. A predictive determination of the area where the noise is occurring is made, and the degree of coincidence between the area where the block noise is predicted to occur and the area where the block noise is actually occurring is evaluated.
According to the learning device for block noise detection having the above configuration, since the learning results are evaluated by predictive judgment of the area where block noise occurs, a learning model with a high learning evaluation for detecting block noise is adopted. It becomes possible to do so.

また、第６の構成は以下のとおりである。
前記学習モデル採用手段（２１）により採用された前記学習モデルを用いて、ブロックノイズの生じていないテスト用正常画像とブロックノイズが生じているテスト用異常画像とを含む第２のテスト用画像（２７）群に対してブロックノイズが生じているエリアの予測を行わせるエリア予測手段（２１）と、前記エリア予測手段によりブロックノイズが生じていると予測されたエリアの面積分布を、前記テスト用正常画像と前記テスト用異常画像とに分けて生成する面積分布生成手段（２１）と、前記面積分布に基づいて前記学習モデル採用手段により採用された前記学習モデルを再評価する学習モデル再評価手段（２１）と、を有する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、ブロックノイズの生じていないテスト用正常画像とブロックノイズが生じているテスト用異常画像に対して夫々ブロックノイズが生じているエリアの予測判断を行い、ブロックノイズが生じていると予測されたエリアの面積分布に基づいて学習結果の評価を行うので、ブロックノイズの生じていない正常画像を正常画像と検出し、ブロックノイズの生じている異常画像を異常画像と検出することを評価対象とした学習モデルの評価が可能となる。 Further, the sixth configuration is as follows.
Using the learning model adopted by the learning model adoption means (21), a second test image ( 27) Area prediction means (21) that causes the group to predict areas where block noise has occurred; Area distribution generation means (21) that separately generates a normal image and the test abnormal image; and a learning model re-evaluation means that re-evaluates the learning model adopted by the learning model adoption means based on the area distribution. (21).
According to the learning device for block noise detection having the above configuration, the area where block noise occurs is predicted for a normal test image in which block noise does not occur and an abnormal test image in which block noise occurs, respectively. Since the learning result is evaluated based on the area distribution of areas predicted to have block noise, normal images without block noise are detected as normal images, and normal images with block noise are detected as normal images. It becomes possible to evaluate a learning model that targets detection of abnormal images as abnormal images.

また、第７の構成は以下のとおりである。
前記学習モデル再評価手段（２１）は、再現率と適合率が予め設定された目標値に到達したか否かによって評価を行い、再現率と適合率が目標値に到達していない場合には、再度、学習モデルの学習を行う再学習手段（２１）を有する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、特に再現率と適合率が予め設定された目標値に到達した学習モデルを採用可能となる一方で、再現率と適合率が予め設定された目標値に到達しない学習モデルについては再学習を行うことによって、再現率と適合率を向上させることが可能となる。 Moreover, the seventh configuration is as follows.
The learning model re-evaluation means (21) evaluates whether or not the recall rate and precision rate have reached preset target values, and if the recall rate and precision rate have not reached the target values, , has a relearning means (21) for learning the learning model again.
According to the learning device for detecting block noise having the above configuration, it is possible to adopt a learning model whose recall rate and precision rate have reached preset target values, while the recall rate and precision rate are set in advance. By relearning learning models that do not reach the set target value, it is possible to improve recall and precision.

また、第８の構成は以下のとおりである。
前記再学習手段（２１）は、前記第２のテスト用画像（２７）群に含まれるテスト用正常画像の内、前記学習モデルによってブロックノイズのエリアの予測が正しくできなかったテスト用正常画像を前記学習用正常画像（２５）に追加し、再度、学習モデルの学習を行う。
上記構成を有するブロックノイズ検出の為の学習装置によれば、再現率と適合率が予め設定された目標値に到達しない学習モデルについては、ブロックノイズのエリアの予測が正しくできなかったテスト用正常画像を学習対象に追加して再学習を行うことによって、再現率と適合率を向上させることが可能となる。 Further, the eighth configuration is as follows.
The relearning means (21) selects a normal test image for which block noise areas could not be predicted correctly by the learning model, among the normal test images included in the second test image (27) group. It is added to the normal image for learning (25) and the learning model is trained again.
According to the learning device for block noise detection having the above configuration, for the learning model whose recall rate and precision rate do not reach the preset target values, the test normality that cannot predict the block noise area correctly is determined. By adding images to the learning target and performing relearning, it is possible to improve recall and precision.

また、第９の構成は以下のとおりである。
画像表示装置（３５）に表示する為の画像データを生成する画像データ生成手段（５１）と、前記画像データ生成手段により生成された前記画像データに基づく画像の表示を行う前に、前記画像データに対して前記教師データに基づいて学習された学習モデルを用いてブロックノイズの検出を行うブロックノイズ検出手段（５１）と、前記ブロックノイズ検出手段によって前記ブロックノイズが生じると検出された画像データに対してブロックノイズを低減させるための処理を行うブロックノイズ低減手段（５１）と、を有する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、学習された学習モデルを用いることによって、画像表示装置に表示する前の画像データに対して予めブロックノイズを生じさせる画像データか否かを検出することが可能となる。そして、ブロックノイズが生じると検出された画像データに対してブロックノイズを低減させるための処理を行うことによって、ブロックノイズが生じた画像が画像表示装置に表示されることを防止できる。 Further, the ninth configuration is as follows.
An image data generation means (51) that generates image data to be displayed on an image display device (35); and before displaying an image based on the image data generated by the image data generation means, the image data block noise detection means (51) for detecting block noise using a learning model learned based on the teacher data; A block noise reduction means (51) that performs processing for reducing block noise.
According to the learning device for block noise detection having the above configuration, by using the learned learning model, it is possible to determine whether or not block noise is generated in image data before displaying it on the image display device. It becomes possible to detect. Then, by performing processing for reducing block noise on image data in which block noise has been detected, it is possible to prevent images in which block noise has occurred from being displayed on the image display device.

また、第１０の構成は以下のとおりである。
前記ブロックノイズ検出手段（５１）は、前記画像データにおいてブロックノイズが生じるエリアを予測するとともに、ブロックノイズが生じると予測されたエリアの面積値が閾値以上の場合に、前記画像データに前記ブロックノイズが生じると検出する。
上記構成を有するブロックノイズ検出の為の学習装置によれば、学習された学習モデルを用いて画像表示装置に表示する前の画像データに対してブロックノイズが生じるエリアを予測することによって、画像表示装置に表示する前の画像データがブロックノイズを生じさせる画像データか否かを正確に検出することが可能となる。 Moreover, the tenth configuration is as follows.
The block noise detection means (51) predicts an area where block noise will occur in the image data, and adds the block noise to the image data when the area value of the area where block noise is predicted to occur is greater than or equal to a threshold value. is detected when it occurs.
According to the learning device for detecting block noise having the above configuration, the learned learning model is used to predict areas where block noise will occur in image data before displaying it on an image display device. It becomes possible to accurately detect whether the image data before being displayed on the device is image data that causes block noise.

１学習装置
２学習システム
３ナビゲーション装置
１２教師画像ＤＢ
２１ＣＰＵ
２２ＲＡＭ
２３ＲＯＭ
２５学習用画像
２６一次テスト用画像
２７二次テスト用画像
３３ナビゲーションＥＣＵ
３５液晶ディスプレイ
５１ＣＰＵ
５５ＧＰＵ
６１疑似ブロックノイズ画像
６２疑似異常画像 1 Learning device 2 Learning system 3 Navigation device 12 Teacher image DB
21 CPU
22 RAM
23 ROM
25 Learning image 26 Primary test image 27 Secondary test image 33 Navigation ECU
35 LCD display 51 CPU
55 GPU
61 Pseudo block noise image 62 Pseudo abnormal image

Claims

normal image acquisition means for acquiring a normal image for learning without block noise;
pseudo block noise image generation means for generating a pseudo block noise image that is a block noise image in which a predetermined block-shaped area is pseudo-set as a noise region;
In order to detect block noise, a pseudo-abnormal image is generated in which block noise is pseudo-generated in the normal image for learning by superimposing and composing the pseudo-block noise image on the normal image for learning. a teacher data generation means for generating teacher data used for learning the learning model ;
The pseudo block noise image generation means is a learning device that generates the pseudo block noise image by setting the size and position of a block shape to be a noise region, the color and transmittance of the noise region, respectively .

The pseudo block noise image generation means generates a plurality of pseudo block noise images by randomly changing at least one of the size and position of a block shape to be a noise region, the color and transmittance of the noise region as parameters,
The teacher data generating means generates a plurality of pseudo abnormal images obtained by superimposing and synthesizing a plurality of pseudo block noise images on the learning normal image,
The learning device according to claim 1 , wherein the learning model is trained using a plurality of the pseudo abnormal images as training data.

Learning the learning model multiple times by changing the pseudo abnormal images used as training data,
learning result evaluation means for evaluating the learning results of the learning model every time the learning is performed a plurality of times;
3. The learning device according to claim 1, further comprising a learning model employing unit that employs the learning model that has shown the highest evaluation learning result.

The learning result evaluation means includes:
Using a learning model learned each time a plurality of learnings are performed, a predictive judgment of an area where block noise is occurring is performed on a first test image including an abnormal test image where block noise is occurring;
4. The learning device according to claim 3 , wherein the learning device evaluates the degree of coincidence between an area where block noise is predicted to occur and an area where block noise actually occurs.

Using the learning model adopted by the learning model adoption means, a second test image group including a normal test image in which no block noise occurs and an abnormal test image in which block noise occurs. Area prediction means for predicting an area where block noise is occurring;
Area distribution generating means for generating an area distribution of an area predicted to have block noise by the area predicting means, dividing it into the normal test image and the abnormal test image;
5. The learning device according to claim 3 , further comprising learning model re-evaluation means for re-evaluating the learning model adopted by the learning model adoption means based on the area distribution.

The learning model re-evaluation means evaluates whether the recall rate and the precision rate have reached preset target values,
6. The learning device according to claim 5 , further comprising a relearning means for learning the learning model again when the recall rate and the precision rate have not reached the target values.

The relearning means selects a normal test image for which the block noise area could not be correctly predicted by the learning model from among the normal test images included in the second test image group, into the normal learning image. The learning device according to claim 6 , wherein the learning model is added and the learning model is learned again.

image data generation means for generating image data to be displayed on an image display device;
Before displaying an image based on the image data generated by the image data generation means, block noise is detected using a learning model trained on the image data based on the teacher data. detection means;
8. A block noise reduction unit that performs processing to reduce block noise on image data in which block noise has been detected by the block noise detection unit. learning device.

The block noise detection means predicts an area where block noise will occur in the image data, and detects that the block noise will occur in the image data if the area value of the area where block noise is predicted to occur is greater than or equal to a threshold value. The learning device according to claim 8 , which detects.

computer,
normal image acquisition means for acquiring a normal image for learning without block noise;
pseudo block noise image generation means for generating a pseudo block noise image that is a block noise image in which a predetermined block-shaped area is pseudo-set as a noise region;
In order to detect block noise, a pseudo-abnormal image is generated in which block noise is pseudo-generated in the normal image for learning by superimposing and composing the pseudo-block noise image on the normal image for learning. a teacher data generation means for generating teacher data to be used for learning the learning model;
A computer program for functioning as
The pseudo block noise image generation means is a computer program that generates the pseudo block noise image by setting the size and position of a block shape to be a noise region, the color and transmittance of the noise region, respectively.