JP7604604B2

JP7604604B2 - Medical image processing device, medical image processing method and program

Info

Publication number: JP7604604B2
Application number: JP2023208259A
Authority: JP
Inventors: 好彦岩瀬; 学山添; 弘樹内田; 律也富田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2019-03-11
Filing date: 2023-12-11
Publication date: 2024-12-23
Anticipated expiration: 2039-10-03
Also published as: JP2024025807A; JP7406892B2; JP7297628B2; CN113557714B; JP7746514B2; JP2020166813A; JP2020166814A; JP2025029192A; CN113557714A

Description

The present invention relates to a medical image processing device, a medical image processing method, and a program.

In the medical field, images are acquired by various imaging devices to identify a subject's disease or observe its severity, and medical professionals perform image diagnosis based on those images. In radiology, for example, imaging devices include X-ray imaging devices, X-ray computed tomography (CT) devices, magnetic resonance imaging (MRI) devices, positron emission tomography (PET) devices, and single photon emission computed tomography (SPECT) devices. In ophthalmology, for example, they include fundus cameras, scanning laser ophthalmoscopes (SLO), optical coherence tomography (OCT) devices, and OCT angiography (OCTA) devices.

To perform image diagnosis accurately or to complete it quickly, the images acquired by the imaging device need high image quality: low noise, high resolution and spatial resolution, and appropriate gradation. Images in which the region or lesion of interest is emphasized can also be useful.

However, with many imaging devices, obtaining images suitable for image diagnosis, such as high-quality images, comes at some cost. For example, one way to obtain high-quality images is to purchase a high-performance imaging device, but this often requires a larger investment than a lower-performance device.

For example, in CT, the subject's radiation exposure may have to be increased to obtain an image with less noise. In MRI, a contrast agent that carries a risk of side effects may be used to obtain an image in which the region of interest is emphasized. In OCT, imaging may take longer when the region to be imaged is wide or when high spatial resolution is required. In addition, some imaging devices must acquire an image multiple times to obtain a high-quality image, which lengthens the imaging time accordingly.

Patent Document 1 discloses a technique in which an artificial intelligence engine converts a previously acquired image into a higher-resolution image, in order to keep pace with rapid advances in medical technology and to support simple imaging in emergencies. With such a technique, for example, an image acquired by simple imaging at little cost can be converted into a higher-resolution image.

Patent Document 1: Japanese Patent Application Laid-Open No. 2018-5841 (JP 2018-5841 A)

However, even a high-resolution image is not necessarily suitable for image diagnosis. For example, even at high resolution, the object to be observed may not be properly discernible when the image is noisy or has low contrast.

In view of this, one object of the present invention is to provide a medical image processing device, a medical image processing method, and a program capable of generating images that are more suitable for image diagnosis than conventional ones.

A medical image processing device according to one embodiment of the present invention comprises:
a designation means for designating, in response to an instruction from an examiner, a partial depth range within the depth range of a predetermined part of a subject in three-dimensional medical image data of the predetermined part;
an acquisition means for acquiring, using the three-dimensional medical image data, a first image that is a medical image of the predetermined part corresponding to the designated partial depth range; and
an image quality improvement unit that generates, from the first image, a second image with higher image quality than the first image by using an image quality improvement engine including a machine learning engine obtained using learning data that includes a plurality of medical images corresponding to a plurality of depth ranges of a predetermined part of a subject, and that generates a composite image by combining, for each pair of mutually corresponding pixels in the first image and the second image, their pixel values at a ratio that can be changed in response to an instruction from the examiner.
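The pixel-by-pixel combination at an examiner-adjustable ratio can be pictured as a weighted blend of the two images. The following is a minimal sketch only: the text here does not state the combining formula, so the linear weighting, the function name `blend_images`, and the parameter `ratio` are assumptions made for illustration. Images are represented as nested lists of pixel values.

```python
def blend_images(first, second, ratio):
    """Blend two same-sized images pixel by pixel.

    ratio is the weight given to the second (quality-improved) image,
    from 0.0 (first image only) to 1.0 (second image only).
    Linear weighting is an assumption, not taken from the specification.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0.0 and 1.0")
    if len(first) != len(second) or any(
        len(row_a) != len(row_b) for row_a, row_b in zip(first, second)
    ):
        raise ValueError("images must have the same dimensions")
    # Combine each pair of corresponding pixels with the chosen weights.
    return [
        [(1.0 - ratio) * a + ratio * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(first, second)
    ]
```

Under this assumption, a ratio of 0.0 reproduces the first image and a ratio of 1.0 reproduces the second, so an examiner could move between the two continuously.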

A medical image processing method according to another embodiment of the present invention comprises:
designating, in response to an instruction from an examiner, a partial depth range within the depth range of a predetermined part of a subject in three-dimensional medical image data of the predetermined part;
acquiring, using the three-dimensional medical image data, a first image that is a medical image of the predetermined part corresponding to the designated partial depth range; and
generating, from the first image, a second image with higher image quality than the first image by using an image quality improvement engine including a machine learning engine obtained using learning data that includes a plurality of medical images corresponding to a plurality of depth ranges of a predetermined part of a subject, and generating a composite image by combining, for each pair of mutually corresponding pixels in the first image and the second image, their pixel values at a ratio that can be changed in response to an instruction from the examiner.

According to one aspect of the present invention, images that are more suitable for image diagnosis than conventional ones can be generated.

An example of the configuration of a neural network for image quality improvement processing.
An example of the configuration of a neural network for imaging location estimation processing.
An example of the configuration of a neural network for image authenticity evaluation processing.
An example of the schematic configuration of an image processing device according to the first embodiment.
A flowchart showing an example of the flow of image processing according to the first embodiment.
A flowchart showing another example of the flow of image processing according to the first embodiment.
A flowchart showing an example of the flow of image processing according to the second embodiment.
A diagram for explaining image processing according to the fourth embodiment.
A flowchart showing an example of the flow of image quality improvement processing according to the fourth embodiment.
A diagram for explaining image processing according to the fifth embodiment.
A flowchart showing an example of the flow of image quality improvement processing according to the fifth embodiment.
A diagram for explaining image processing according to the sixth embodiment.
A flowchart showing an example of the flow of image quality improvement processing according to the sixth embodiment.
A diagram for explaining image processing according to the sixth embodiment.
An example of the schematic configuration of an image processing device according to the seventh embodiment.
A flowchart showing an example of the flow of image processing according to the seventh embodiment.
An example of a user interface according to the seventh embodiment.
An example of the schematic configuration of an image processing device according to the ninth embodiment.
A flowchart showing an example of the flow of image processing according to the ninth embodiment.
An example of the schematic configuration of an image processing device according to the twelfth embodiment.
A flowchart showing an example of the flow of image quality improvement processing according to the thirteenth embodiment.
A flowchart showing another example of the flow of image quality improvement processing according to the thirteenth embodiment.
An example of the schematic configuration of an image processing device according to the seventeenth embodiment.
A flowchart showing an example of the flow of image processing according to the seventeenth embodiment.
An example of the configuration of a neural network for image quality improvement processing.
An example of the schematic configuration of an image processing device according to the nineteenth embodiment.
A flowchart showing an example of the flow of image processing according to the nineteenth embodiment.
A flowchart showing an example of the flow of image processing according to the twenty-first embodiment.
An example of a teacher image for image quality improvement processing.
An example of an input image for image quality improvement processing.
An example of the schematic configuration of an image processing device according to the twenty-second embodiment.
A flowchart showing an example of the flow of image processing according to the twenty-second embodiment.
A diagram for explaining a wide-angle image according to the twenty-second embodiment.
A diagram for explaining image quality improvement processing according to the twenty-third embodiment.
An example of a user interface according to the twenty-fourth embodiment.
An example of the schematic configuration of an image processing device according to the twenty-fifth embodiment.
An example of the configuration of a neural network used as a machine learning engine according to Modification 6.
An example of the configuration of a neural network used as a machine learning engine according to Modification 6.
An example of a user interface according to the twenty-fourth embodiment.

Exemplary embodiments for carrying out the present invention are described below in detail with reference to the drawings. However, the dimensions, materials, shapes, relative positions of components, and the like described in the following embodiments are arbitrary and may be changed according to the configuration of the device to which the present invention is applied or to various conditions. The same reference numerals are used across the drawings to denote identical or functionally similar elements.

<Terminology>
First, the terms used in this specification are explained.

In a network as referred to in this specification, the devices may be connected by wired or wireless links. The links connecting the devices in the network include, for example, dedicated lines, local area network (LAN) lines, wireless LAN lines, Internet lines, Wi-Fi (registered trademark), and Bluetooth (registered trademark).

The medical image processing device may consist of two or more devices capable of communicating with each other, or of a single device. Each component of the medical image processing device may be implemented as a software module executed by a processor such as a CPU (Central Processing Unit) or MPU (Micro Processing Unit). Each component may instead be implemented as a circuit that performs a specific function, such as an ASIC, or as any other combination of hardware and software.

The medical images processed by the medical image processing device or medical image processing method according to the following embodiments include images acquired using any modality (imaging device or imaging method). They can include medical images acquired by any imaging device or the like, as well as images created by the medical image processing device or medical image processing method according to the following embodiments.

Furthermore, a medical image to be processed is an image of a predetermined part of a subject (an object under examination), and the image of the predetermined part includes at least a portion of that part. The medical image may also include other parts of the subject. The medical image may be a still image or a moving image, and may be a monochrome image or a color image. It may represent the structure (morphology) of the predetermined part or its function. Images representing function include, for example, images representing hemodynamics (blood flow volume, blood flow velocity, and the like), such as OCTA images, Doppler OCT images, fMRI images, and ultrasound Doppler images. The predetermined part of the subject may be determined according to the imaging target and includes any part, such as the human eye (the eye to be examined), organs such as the brain, lungs, intestines, heart, pancreas, kidneys, and liver, and the head, chest, legs, and arms.

The medical image may be a tomographic image of the subject or a frontal image. Frontal images include, for example, a frontal image of the fundus, a frontal image of the anterior segment, a fundus image acquired by fluorescence imaging, and an En-Face image generated from data acquired by OCT (three-dimensional OCT data) using the data in at least a partial range in the depth direction of the imaging target. The En-Face image may also be an OCTA En-Face image (motion contrast frontal image) generated from three-dimensional OCTA data (three-dimensional motion contrast data) using the data in at least a partial range in the depth direction of the imaging target. Three-dimensional OCT data and three-dimensional motion contrast data are examples of three-dimensional medical image data.
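As an illustration of generating a frontal image from a partial depth range of three-dimensional data, the sketch below projects a slab of a volume onto a two-dimensional image. Several choices here are assumptions for illustration and are not stated in this passage: the volume is indexed as `volume[z][y][x]`, the depth range is a fixed index range `[z_start, z_end)`, and the projection is a simple average along depth. The function name `en_face` is hypothetical.

```python
def en_face(volume, z_start, z_end):
    """Project volume[z][y][x] onto a 2D image by averaging over depth.

    Only the slices with z_start <= z < z_end contribute.
    Average projection is an assumption; other projections are possible.
    """
    if not 0 <= z_start < z_end <= len(volume):
        raise ValueError("depth range must lie inside the volume")
    slab = volume[z_start:z_end]
    depth = len(slab)
    height = len(slab[0])
    width = len(slab[0][0])
    # For each (y, x) position, average the values along the depth axis.
    return [
        [sum(slice_[y][x] for slice_ in slab) / depth for x in range(width)]
        for y in range(height)
    ]
```

In practice the range may follow anatomical boundaries rather than fixed indices; the fixed range simply keeps the sketch short.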

An imaging device is a device for capturing images used in diagnosis. Imaging devices include, for example, devices that obtain an image of a predetermined part of a subject by irradiating that part with light, radiation such as X-rays, electromagnetic waves, ultrasound, or the like, and devices that obtain an image of a predetermined part by detecting radiation emitted from the subject. More specifically, the imaging devices according to the following embodiments include at least X-ray imaging devices, CT devices, MRI devices, PET devices, SPECT devices, SLO devices, OCT devices, OCTA devices, fundus cameras, and endoscopes.

OCT devices may include time-domain OCT (TD-OCT) devices and Fourier-domain OCT (FD-OCT) devices. Fourier-domain OCT devices may include spectral-domain OCT (SD-OCT) devices and swept-source OCT (SS-OCT) devices. SLO and OCT devices may also include adaptive optics SLO (AO-SLO) devices and adaptive optics OCT (AO-OCT) devices, which use a wavefront-compensating optical system, as well as polarization-sensitive SLO (PS-SLO) devices and polarization-sensitive OCT (PS-OCT) devices, which visualize information on polarization phase difference and depolarization.

An image management system is a device or system that receives and stores images captured by an imaging device and images that have undergone image processing. It can also transmit images in response to requests from connected devices, perform image processing on stored images, and request image processing from other devices. An example of an image management system is a picture archiving and communication system (PACS). In particular, the image management system according to the following embodiments includes a database that can store, together with each received image, associated information such as subject information and the imaging time. The image management system is connected to a network and, in response to requests from other devices, can send and receive images, convert images, and send and receive the various information associated with stored images.

Imaging conditions are the various pieces of information relating to the capture of an image acquired by an imaging device. They include, for example, information about the imaging device, the facility where the imaging was performed, the examination to which the imaging belongs, the operator who performed the imaging, and the subject. They also include, for example, the imaging date and time, the name of the imaged site, the imaging region, the imaging angle of view, the imaging method, the resolution and gradation of the image, the image size, any image filter applied, information about the image data format, and information about the radiation dose. The imaging region can include a peripheral region offset from a specific imaged site, a region containing multiple imaged sites, and the like.

Imaging conditions may be stored within the data structure that constitutes the image, stored as imaging condition data separate from the image, or stored in a database or image management system associated with the imaging device. They can therefore be acquired by a procedure that matches the way the imaging device stores them. Specifically, imaging conditions are acquired, for example, by parsing the data structure of the image output by the imaging device, by acquiring the imaging condition data corresponding to the image, or by accessing an interface for retrieving imaging conditions from a database associated with the imaging device.

Depending on the imaging device, some imaging conditions cannot be acquired, for instance because they were never stored. This is the case, for example, when the imaging device has no function for acquiring or storing a particular imaging condition, or when that function is disabled. A condition may also go unstored because it is regarded as unrelated to the imaging device or to the imaging. Furthermore, imaging conditions may be hidden, encrypted, or unobtainable without the necessary permission. Even so, an imaging condition that was not stored can sometimes be recovered. For example, the name of the imaged site or the imaging region can be identified by performing image analysis.

A machine learning model is a model obtained by training an arbitrary machine learning algorithm in advance with appropriate training data (also called teacher data or learning data). The training data consists of one or more pairs of input data and output data (ground-truth data). The format and combination of the input and output data in these pairs may be whatever suits the desired configuration: one may be an image and the other a numerical value, one may be a group of images and the other a character string, or both may be images.

One specific example is training data consisting of pairs of an image acquired by OCT and the imaged-site label corresponding to that image (hereinafter, first training data). An imaged-site label is a unique numerical value or character string that identifies the site. Another example is training data consisting of pairs of a noisy, low-quality image acquired by normal OCT imaging and a high-quality image obtained by imaging multiple times with OCT and applying image quality improvement processing (hereinafter, second training data).

When input data is fed to a machine learning model, the model produces output data in accordance with its design. For example, the model outputs the output data most likely to correspond to the input data, following the tendencies it learned from the training data. The model can also, for example, output a numerical likelihood for each kind of output data, again following those learned tendencies. Specifically, when an image acquired by OCT is input to a machine learning model trained with the first training data, the model outputs the imaged-site label of the site shown in the image, or the probability of each imaged-site label. Likewise, when a noisy, low-quality image acquired by normal OCT imaging is input to a machine learning model trained with the second training data, the model outputs a high-quality image equivalent to one obtained by imaging multiple times with OCT and applying image quality improvement processing. To preserve quality, a machine learning model can be configured so that its own output data is not used as training data.
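To make the two output styles concrete (a single most likely label, or a probability for every label), the sketch below converts per-label scores into probabilities and picks the most probable label. The use of a softmax, the function names, and the example label names are all assumptions for illustration; the text does not say how the probabilities are computed.

```python
import math

def label_probabilities(scores):
    """Turn raw per-label scores into probabilities that sum to 1.

    Softmax normalization is an assumption made for this sketch.
    """
    highest = max(scores.values())
    # Subtracting the maximum keeps exp() numerically stable.
    exps = {label: math.exp(s - highest) for label, s in scores.items()}
    total = sum(exps.values())
    return {label: e / total for label, e in exps.items()}

def most_likely_label(scores):
    """Return the label with the highest probability."""
    probabilities = label_probabilities(scores)
    return max(probabilities, key=probabilities.get)

# Hypothetical scores for two hypothetical imaged-site labels.
example_scores = {"macula": 2.0, "optic_disc": 0.0}
example_probabilities = label_probabilities(example_scores)
```

Here `most_likely_label(example_scores)` selects `"macula"`, while `example_probabilities` holds one probability per label.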

Machine learning algorithms include deep learning techniques such as convolutional neural networks (CNNs). In deep learning techniques, different parameter settings for the layers and nodes that make up the neural network can change how faithfully the tendencies learned from the training data are reproduced in the output data. For example, in a deep learning model trained with the first training data, more appropriate parameter settings may raise the probability of outputting the correct imaged-site label. Similarly, in a deep learning model trained with the second training data, more appropriate parameter settings may yield higher-quality output images.

Specifically, the parameters of a CNN can include, for example, the filter kernel size, the number of filters, the stride, and the dilation set for each convolutional layer, as well as the number of output nodes of each fully connected layer. The parameter set and the number of training epochs can be set, based on the training data, to values suited to how the machine learning model will be used. For example, based on the training data, a parameter set and number of epochs can be chosen that output the correct imaged-site label with higher probability or that output higher-quality images.
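To show how the kernel size, stride, and dilation of a convolutional layer interact, the sketch below computes the output length along one axis using the standard convolution size relation. The `padding` argument does not appear in the text and is included as an assumption, and the function name `conv_output_size` is hypothetical.

```python
def conv_output_size(input_size, kernel_size, stride=1, dilation=1, padding=0):
    """Output length along one axis of a convolutional layer.

    padding is an assumption added for completeness; it is not
    one of the parameters listed in the specification.
    """
    # Dilation spreads the kernel taps apart, enlarging its footprint.
    effective_kernel = dilation * (kernel_size - 1) + 1
    if input_size + 2 * padding < effective_kernel:
        raise ValueError("kernel does not fit inside the padded input")
    return (input_size + 2 * padding - effective_kernel) // stride + 1
```

For a 64-pixel axis and a kernel size of 3, stride 1 with padding 1 keeps the length at 64, stride 2 with padding 1 halves it to 32, and dilation 2 without padding gives 60.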

One method for determining such a parameter set and number of epochs is as follows. First, 70% of the pairs constituting the training data are randomly assigned to training and the remaining 30% to evaluation. Next, the machine learning model is trained on the training pairs, and at the end of each training epoch a training evaluation value is calculated using the evaluation pairs. The training evaluation value is, for example, the mean of the values obtained by applying a loss function to the output produced when the input data of each pair is fed to the model under training and the output data corresponding to that input. Finally, the parameter set and epoch number at which the training evaluation value was smallest are adopted as the parameter set and number of epochs of the machine learning model. Dividing the pairs into training and evaluation sets and determining the number of epochs in this way helps prevent the machine learning model from overfitting to the training pairs.
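The selection procedure can be sketched as three small helpers: a random 70/30 split, a mean loss over the evaluation pairs, and a choice of the epoch with the smallest evaluation value. This is a simplified illustration, not the specification's implementation: the function names, the fixed random seed, and the representation of a model as a plain `predict` callable are assumptions, and the training loop itself is omitted.

```python
import random

def split_pairs(pairs, train_fraction=0.7, seed=0):
    """Randomly split (input, output) pairs into training and evaluation sets."""
    shuffled = list(pairs)
    random.Random(seed).shuffle(shuffled)  # seed fixed only for repeatability
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

def evaluation_value(predict, eval_pairs, loss_fn):
    """Mean loss between model outputs and ground-truth outputs."""
    losses = [loss_fn(predict(x), y) for x, y in eval_pairs]
    return sum(losses) / len(losses)

def best_epoch(evaluation_values):
    """Return (1-based epoch number, value) with the smallest evaluation value."""
    index = min(range(len(evaluation_values)), key=evaluation_values.__getitem__)
    return index + 1, evaluation_values[index]
```

A caller would record `evaluation_value(...)` after each epoch, keep the model parameters from each epoch, and then use `best_epoch` to decide which saved parameters to adopt.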

An image quality improvement engine (a trained model for image quality improvement) is a module that takes a low-quality image as input and outputs a high-quality image derived from it. In this specification, image quality improvement means converting an input image into an image whose quality is better suited to image diagnosis, and a high-quality image is an image that has been so converted. A low-quality image is, for example, a two-dimensional or three-dimensional image acquired by X-ray imaging, CT, MRI, OCT, PET, SPECT, or the like, or a three-dimensional CT moving image acquired by continuous imaging, that was captured without settings specifically chosen to yield high image quality. Specifically, low-quality images include, for example, images acquired by low-dose imaging with an X-ray imaging device or CT, by MRI without a contrast agent, or by short-duration OCT imaging, and OCTA images acquired with a small number of acquisitions.

また、画像診断に適した画質の内容は、各種の画像診断で何を診断したいのかということに依存する。そのため一概には言えないが、例えば、画像診断に適した画質は、ノイズが少なかったり、高コントラストであったり、撮影対象を観察しやすい色や階調で示していたり、画像サイズが大きかったり、高解像度であったりする画質を含む。また、画像生成の過程で描画されてしまった実際には存在しないオブジェクトやグラデーションが画像から除去されているような画質を含むことができる。 The content of image quality suitable for diagnostic imaging depends on what is being diagnosed with various types of diagnostic imaging. Therefore, it is difficult to generalize, but for example, image quality suitable for diagnostic imaging includes image quality with little noise, high contrast, colors and gradations that make it easy to observe the subject, large image size, and high resolution. It can also include image quality in which non-existent objects and gradations that are drawn during the image generation process are removed from the image.

また、ノイズが少なかったり、高コントラストであったりする高画質画像を、ＯＣＴＡ等の画像の血管解析処理や、ＣＴやＯＣＴ等の画像の領域セグメンテーション処理等の画像解析に利用すると、低画質画像を利用するよりも精度よく解析が行えることが多い。そのため、高画質化エンジンによって出力された高画質画像は、画像診断だけでなく、画像解析にも有用である場合がある。 In addition, when high-quality images with low noise and high contrast are used for image analysis such as vascular analysis processing of images such as OCTA, or area segmentation processing of images such as CT or OCT, the analysis can often be performed more accurately than when low-quality images are used. Therefore, high-quality images output by the image quality engine can be useful not only for image diagnosis, but also for image analysis.

下記の実施形態における高画質化手法を構成する画像処理手法では、ディープラーニング等の各種機械学習アルゴリズムを用いた処理を行う。なお、当該画像処理手法では、機械学習アルゴリズムを用いた処理に加えて、各種画像フィルタ処理、類似画像に対応する高画質画像のデータベースを用いたマッチング処理、及び知識ベース画像処理等の既存の任意の処理を行ってもよい。 The image processing method constituting the image quality improvement method in the following embodiment performs processing using various machine learning algorithms such as deep learning. Note that in addition to processing using machine learning algorithms, the image processing method may also perform any existing processing such as various image filter processing, matching processing using a database of high-quality images corresponding to similar images, and knowledge-based image processing.

特に、二次元画像を高画質化するＣＮＮの構成例として、図１に示す構成がある。当該ＣＮＮの構成には、複数の畳み込み処理ブロック１００群が含まれる。畳み込み処理ブロック１００は、畳み込み（Ｃｏｎｖｏｌｕｔｉｏｎ）層１０１と、バッチ正規化（ＢａｔｃｈＮｏｒｍａｌｉｚａｔｉｏｎ）層１０２と、正規化線形関数（ＲｅｃｔｉｆｉｅｒＬｉｎｅａｒＵｎｉｔ）を用いた活性化層１０３とを含む。また、当該ＣＮＮの構成には、合成（Ｍｅｒｇｅｒ）層１０４と、最後の畳み込み層１０５が含まれる。合成層１０４は、畳み込み処理ブロック１００の出力値群と画像を構成する画素値群とを連結したり、加算したりして合成する。最後の畳み込み層１０５は、合成層１０４で合成された、高画質画像Ｉｍ１２０を構成する画素値群を出力する。このような構成では、入力された画像Ｉｍ１１０を構成する画素値群が畳み込み処理ブロック１００群を経て出力された値群と、入力された画像Ｉｍ１１０を構成する画素値群とが、合成層１０４で合成される。その後、合成された画素値群は最後の畳み込み層１０５で高画質画像Ｉｍ１２０に成形される。 In particular, FIG. 1 shows an example of a CNN configuration for improving the image quality of a two-dimensional image. The CNN includes a group of convolution processing blocks 100. Each convolution processing block 100 includes a convolution layer 101, a batch normalization layer 102, and an activation layer 103 using a rectified linear unit (ReLU). The CNN also includes a merger layer 104 and a final convolution layer 105. The merger layer 104 merges the output values of the convolution processing blocks 100 with the pixel values constituting the image, by concatenation or by addition. The final convolution layer 105 takes the values merged in the merger layer 104 and outputs the pixel values constituting the high-quality image Im120. In this configuration, the values obtained by passing the pixel values of the input image Im110 through the group of convolution processing blocks 100 are merged with the pixel values of the input image Im110 itself in the merger layer 104. The merged values are then shaped into the high-quality image Im120 by the final convolution layer 105.

なお、例えば、畳み込み処理ブロック１００の数を１６とし、畳み込み層１０１群のパラメータとして、フィルタのカーネルサイズを幅３画素、高さ３画素、フィルタの数を６４とすることで、一定の高画質化の効果を得られる。しかしながら、実際には上記の機械学習モデルの説明において述べた通り、機械学習モデルの利用形態に応じた教師データを用いて、より良いパラメータ群を設定することができる。なお、三次元画像や四次元画像を処理する必要がある場合には、フィルタのカーネルサイズを三次元や四次元に拡張してもよい。 For example, a certain level of image quality improvement can be achieved by setting the number of convolution processing blocks 100 to 16, setting the parameters of the convolution layer 101 group to a filter kernel size of 3 pixels wide and 3 pixels high, and setting the number of filters to 64. However, in practice, as described in the above description of the machine learning model, a better parameter group can be set using training data according to the usage form of the machine learning model. Note that when three-dimensional or four-dimensional images need to be processed, the filter kernel size may be expanded to three or four dimensions.
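The data flow of the FIG. 1 configuration (a chain of convolution blocks, a merge with the input pixel values, then a final convolution) can be illustrated with a deliberately small sketch. It is a toy under stated assumptions, not the network of the embodiments: it uses a single channel instead of 64 filters, omits batch normalization, uses addition as the merge, and applies the kernel without flipping (cross-correlation, as most deep-learning frameworks do). The names `conv3x3`, `relu`, and `enhance` are hypothetical.

```python
def conv3x3(image, kernel, bias=0.0):
    """Single-channel 3x3 filter with zero padding (output size = input size)."""
    h, w = len(image), len(image[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            acc = bias
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += kernel[dy + 1][dx + 1] * image[yy][xx]
            row.append(acc)
        out.append(row)
    return out


def relu(image):
    """Rectified linear unit applied to every value."""
    return [[max(0.0, v) for v in row] for row in image]


def enhance(image, block_kernels, final_kernel):
    """Blocks (conv + ReLU), additive merge with the input, final convolution."""
    features = image
    for kernel in block_kernels:  # e.g. 16 blocks in the text
        features = relu(conv3x3(features, kernel))  # batch norm omitted here
    merged = [[f + p for f, p in zip(f_row, p_row)]
              for f_row, p_row in zip(features, image)]
    return conv3x3(merged, final_kernel)


# With all-zero block kernels the blocks contribute nothing, so an identity
# final kernel returns the input unchanged: the merge carries the image through.
ZERO = [[0.0] * 3 for _ in range(3)]
IDENTITY = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
image = [[1.0, 2.0], [3.0, 4.0]]
output = enhance(image, [ZERO] * 16, IDENTITY)
```

The additive merge means the blocks only need to learn a correction to the input image rather than the whole output, which is one common motivation for this kind of skip connection.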

なお、ＣＮＮを用いた画像処理等、一部の画像処理手法を利用する場合には画像サイズについて注意する必要がある。具体的には、高画質画像の周辺部が十分に高画質化されない問題等の対策のため、入力する低画質画像と出力する高画質画像とで異なる画像サイズを要する場合があることに留意すべきである。 Note that when using some image processing methods, such as image processing using CNN, attention must be paid to image size. Specifically, it should be noted that in order to address issues such as the issue of the peripheral areas of a high-quality image not being of sufficient quality, different image sizes may be required for the input low-quality image and the output high-quality image.

明瞭な説明のため、後述の実施形態において明記はしないが、高画質化エンジンに入力される画像と出力される画像とで異なる画像サイズを要する高画質化エンジンを採用した場合には、適宜画像サイズを調整しているものとする。具体的には、機械学習モデルをトレーニングするための教師データに用いる画像や、高画質化エンジンに入力される画像といった入力画像に対して、パディングを行ったり、該入力画像の周辺の撮影領域を結合したりして、画像サイズを調整する。なお、パディングを行う領域は、効果的に高画質化できるように高画質化手法の特性に合わせて、一定の画素値で埋めたり、近傍画素値で埋めたり、ミラーパディングしたりする。 For the sake of clarity, this will not be specified in the embodiments described below, but when a high-image-quality engine is adopted that requires different image sizes for the image input to the high-image-quality engine and the image output, the image size is adjusted appropriately. Specifically, the image size is adjusted for input images, such as images used as teacher data for training a machine learning model or images input to the high-image-quality engine, by padding or combining the surrounding shooting area of the input image. Note that the area to be padded is filled with a fixed pixel value, filled with nearby pixel values, or mirror padded according to the characteristics of the high-image-quality technique so that the image quality can be effectively improved.
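The three padding strategies named above (a fixed pixel value, neighboring pixel values, and mirror padding) can be sketched as follows. The helper names `pad_1d` and `pad_image` are assumptions made for this illustration, and the mirror mode shown here reflects about the edge pixel without repeating it, which is only one of several mirror conventions.

```python
def pad_1d(seq, pad, mode="constant", fill=0):
    """Pad a list on both sides by `pad` elements."""
    if mode == "constant":      # fill with a fixed value
        left, right = [fill] * pad, [fill] * pad
    elif mode == "edge":        # repeat the nearest (border) value
        left, right = [seq[0]] * pad, [seq[-1]] * pad
    elif mode == "mirror":      # reflect about the border; needs pad < len(seq)
        left = seq[pad:0:-1]
        right = seq[-2:-pad - 2:-1]
    else:
        raise ValueError(f"unknown mode: {mode}")
    return left + seq + right


def pad_image(image, pad, mode="constant", fill=0):
    """Pad a 2D image (list of rows) on all four sides."""
    rows = [pad_1d(list(row), pad, mode, fill) for row in image]
    fill_row = [fill] * len(rows[0])
    padded = pad_1d(rows, pad, mode, fill_row)
    return [list(row) for row in padded]  # copy so rows are independent


row = [1, 2, 3, 4]
constant = pad_1d(row, 2, "constant")
edge = pad_1d(row, 2, "edge")
mirror = pad_1d(row, 2, "mirror")
```

Which mode works best depends on the image quality improvement technique, as the paragraph notes; for example, constant padding can introduce an artificial edge at the border that mirror padding avoids.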

また、高画質化手法は、一つの画像処理手法だけで実施されることもあるし、二つ以上の画像処理手法を組み合わせて実施されることもある。また、複数の高画質化手法群を並列に実施し、複数の高画質画像群を生成した上で、最も高画質な高画質画像を最終的に高画質画像として選択することもある。なお、最も高画質な高画質画像の選択は、画質評価指数を用いて自動的に行われてもよいし、任意の表示部等に備えられたユーザーインターフェースに複数の高画質画像群を表示して、検者（ユーザー）の指示に応じて行われてもよい。 The image quality improvement method may be implemented using only one image processing method, or may be implemented using a combination of two or more image processing methods. In addition, multiple image quality improvement methods may be implemented in parallel to generate multiple high-quality image groups, and the highest quality image may be selected as the final high-quality image. The highest quality image may be selected automatically using an image quality assessment index, or may be selected according to the examiner's (user's) instructions by displaying multiple high-quality image groups on a user interface provided on an arbitrary display unit, etc.

なお、高画質化していない入力画像の方が、画像診断に適している場合もあるので、最終的な画像の選択の対象には入力画像を加えてよい。また、高画質化エンジンに対して、低画質画像とともにパラメータを入力してもよい。高画質化エンジンに対して、入力画像とともに、例えば、高画質化を行う程度を指定するパラメータや、画像処理手法に用いられる画像フィルタサイズを指定するパラメータを入力してもよい。 In some cases, an input image that has not been enhanced in image quality may be more suitable for image diagnosis, so the input image may be included in the final image selection. Parameters may also be input to the image enhancement engine along with the low-image quality image. For example, parameters specifying the degree of image enhancement or parameters specifying the image filter size used in the image processing method may be input to the image enhancement engine along with the input image.
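The automatic selection described above, with the unprocessed input image included among the candidates, can be sketched as follows. Everything concrete here is an assumption for illustration: images are reduced to one-dimensional lists of values, the quality index is a made-up roughness measure, and `smooth3` and `exaggerate` merely stand in for real image quality improvement techniques.

```python
def select_final_image(input_image, enhancers, quality_index):
    """Run every technique, then pick the candidate with the best index.

    The input image itself is a candidate, since it may already be the
    most suitable image for diagnosis.
    """
    candidates = [("input", input_image)]
    candidates += [(name, fn(input_image)) for name, fn in enhancers.items()]
    return max(candidates, key=lambda item: quality_index(item[1]))


# Hypothetical quality index: less neighbor-to-neighbor variation is better.
def quality_index(signal):
    diffs = [abs(b - a) for a, b in zip(signal, signal[1:])]
    return -sum(diffs) / len(diffs)


def smooth3(signal):
    """3-tap moving average with clamped borders (stand-in technique)."""
    n = len(signal)
    return [(signal[max(i - 1, 0)] + signal[i] + signal[min(i + 1, n - 1)]) / 3
            for i in range(n)]


def exaggerate(signal):
    """Doubles deviations from the mean (a technique that makes things worse)."""
    mean = sum(signal) / len(signal)
    return [mean + 2 * (v - mean) for v in signal]


noisy = [10.0, 12.0, 9.0, 11.0, 10.0, 13.0, 9.0]
best_name, best_image = select_final_image(
    noisy, {"smooth3": smooth3, "exaggerate": exaggerate}, quality_index)
fallback_name, _ = select_final_image(
    noisy, {"exaggerate": exaggerate}, quality_index)
```

In the second call the only available technique degrades the index, so the input image is returned, which mirrors the remark that the input image may be added to the selection targets.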

撮影箇所推定エンジンとは、入力された画像の撮影部位や撮影領域を推定するモジュールのことである。撮影箇所推定エンジンは、入力された画像に描画されている撮影部位や撮影領域がどこであるか、又は必要な詳細レベルの撮影部位ラベルや撮影領域ラベル毎に、該撮影部位や撮影領域である確率を出力することができる。 The imaging location estimation engine is a module that estimates the imaging part and imaging area of an input image. The imaging location estimation engine can output which imaging part or imaging area is depicted in the input image, or, for each imaging part label or imaging area label at the required level of detail, the probability that the image shows that imaging part or imaging area.

撮影部位や撮影領域は、撮影装置によっては撮影条件として保存していない、又は撮影装置が取得できず保存できていない場合がある。また、撮影部位や撮影領域が保存されていても、必要な詳細レベルの撮影部位や撮影領域が保存されていない場合もある。例えば、撮影部位として“後眼部”と保存されているだけで、詳細には“黄斑部”なのか、“視神経乳頭部”なのか、又は、“黄斑部及び視神経乳頭部”なのか、“その他”なのかが分からないことがある。また、別の例では、撮影部位として“乳房”と保存されているだけで、詳細には“右乳房”なのか、“左乳房”なのか、又は、“両方”なのかが分からないことがある。そのため、撮影箇所推定エンジンを用いることで、これらの場合に入力画像の撮影部位や撮影領域を推定することができる。 Depending on the imaging device, the imaging part and imaging area may not be saved as imaging conditions, or may not be acquired by the imaging device and saved. Even if the imaging part and imaging area are saved, the imaging part and imaging area may not be saved at the required level of detail. For example, the imaging part may only be saved as "posterior segment," but it is not clear whether it is the "macular area," "optic disc," "macular area and optic disc," or "other." In another example, the imaging part may only be saved as "breast," but it is not clear whether it is the "right breast," "left breast," or "both." Therefore, by using the imaging location estimation engine, it is possible to estimate the imaging part and imaging area of the input image in these cases.

撮影箇所推定エンジンの推定手法を構成する画像及びデータ処理手法では、ディープラーニング等の各種機械学習アルゴリズムを用いた処理を行う。なお、当該画像及びデータ処理手法では、機械学習アルゴリズムを用いた処理に加えて又は代えて、自然言語処理、類似画像及び類似データのデータベースを用いたマッチング処理、知識ベース処理等の既存の任意の推定処理を行ってもよい。なお、機械学習アルゴリズムを用いて構築した機械学習モデルをトレーニングする教師データは、撮影部位や撮影領域のラベルが付けられた画像とすることができる。この場合には、教師データの画像を入力データ、撮影部位や撮影領域のラベルを出力データとする。 The image and data processing method constituting the estimation method of the imaging location estimation engine performs processing using various machine learning algorithms such as deep learning. Note that the image and data processing method may perform any existing estimation processing such as natural language processing, matching processing using a database of similar images and similar data, knowledge-based processing, etc. in addition to or instead of processing using a machine learning algorithm. Note that the teacher data used to train the machine learning model constructed using the machine learning algorithm may be images labeled with the imaging site and imaging area. In this case, the images of the teacher data are used as input data, and the labels of the imaging site and imaging area are used as output data.

特に、二次元画像の撮影箇所を推定するＣＮＮの構成例として、図２に示す構成がある。当該ＣＮＮの構成には、畳み込み層２０１とバッチ正規化層２０２と正規化線形関数を用いた活性化層２０３とで構成された複数の畳み込み処理ブロック２００群が含まれる。また、当該ＣＮＮの構成には、最後の畳み込み層２０４と、全結合（ＦｕｌｌＣｏｎｎｅｃｔｉｏｎ）層２０５と、出力層２０６が含まれる。全結合層２０５は畳み込み処理ブロック２００の出力値群を全結合する。また、出力層２０６は、Ｓｏｆｔｍａｘ関数を利用して、入力画像Ｉｍ２１０に対する、想定される撮影部位ラベル毎の確率を推定結果（Ｒｅｓｕｌｔ）２０７として出力する。このような構成では、例えば、入力画像Ｉｍ２１０が“黄斑部”を撮影した画像であれば、“黄斑部”に対応する撮影部位ラベルについて最も高い確率が出力される。 In particular, FIG. 2 shows an example of a CNN configuration that estimates the imaging location of a two-dimensional image. The CNN includes a group of convolution processing blocks 200, each composed of a convolution layer 201, a batch normalization layer 202, and an activation layer 203 using a rectified linear unit (ReLU). The CNN also includes a final convolution layer 204, a fully connected layer 205, and an output layer 206. The fully connected layer 205 fully connects the output values of the convolution processing blocks 200. The output layer 206 uses a Softmax function to output, as an estimation result (Result) 207, the probability of each expected imaging part label for the input image Im210. In this configuration, for example, if the input image Im210 is an image of the "macular area", the highest probability is output for the imaging part label corresponding to the "macular area".
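The role of the Softmax output layer can be illustrated with a short sketch that turns raw scores into one probability per imaging part label. The label list reuses the examples given earlier in the text; the score values and the names `softmax` and `estimate_site` are hypothetical.

```python
import math

LABELS = ["macular area", "optic disc", "macular area and optic disc", "other"]


def softmax(scores):
    """Convert raw scores to probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def estimate_site(scores, labels=LABELS):
    """Return a probability for each label and the most probable label."""
    probabilities = dict(zip(labels, softmax(scores)))
    best_label = max(probabilities, key=probabilities.get)
    return probabilities, best_label


# Hypothetical scores from the fully connected layer for one input image.
probabilities, best_label = estimate_site([3.0, 1.0, 0.5, -1.0])
```

Because the full probability distribution is returned rather than only the top label, a downstream component can still decide to treat a low-confidence estimate as unknown.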

なお、例えば、畳み込み処理ブロック２００の数を１６、畳み込み層２０１群のパラメータとして、フィルタのカーネルサイズを幅３画素、高さ３画素、フィルタの数を６４とすることで、一定の精度で撮影部位を推定することができる。しかしながら、実際には上記の機械学習モデルの説明において述べた通り、機械学習モデルの利用形態に応じた教師データを用いて、より良いパラメータ群を設定することができる。なお、三次元画像や四次元画像を処理する必要がある場合には、フィルタのカーネルサイズを三次元や四次元に拡張してもよい。なお、推定手法は、一つの画像及びデータ処理手法だけで実施されることもあるし、二つ以上の画像及びデータ処理手法を組み合わせて実施されることもある。 For example, the imaging site can be estimated with a certain degree of accuracy by setting the number of convolution processing blocks 200 to 16, the parameters of the convolution layer group 201 to a filter kernel size of 3 pixels wide and 3 pixels high, and the number of filters to 64. However, in practice, as described in the above description of the machine learning model, a better parameter group can be set using training data according to the usage form of the machine learning model. If it is necessary to process three-dimensional or four-dimensional images, the filter kernel size may be expanded to three or four dimensions. The estimation method may be implemented using only one image and data processing method, or may be implemented using a combination of two or more image and data processing methods.

画質評価エンジンとは、入力された画像に対する画質評価指数を出力するモジュールのことである。画質評価指数を算出する画質評価処理手法では、ディープラーニング等の各種機械学習アルゴリズムを用いた処理を行う。なお、当該画質評価処理手法では、画像ノイズ計測アルゴリズム、及び類似画像や基底画像に対応する画質評価指数のデータベースを用いたマッチング処理等の既存の任意の評価処理を行ってもよい。なお、これらの評価処理は、機械学習アルゴリズムを用いた処理に加えて又は代えて行われてよい。 The image quality assessment engine is a module that outputs an image quality assessment index for an input image. The image quality assessment processing method for calculating the image quality assessment index performs processing using various machine learning algorithms such as deep learning. Note that the image quality assessment processing method may perform any existing evaluation processing such as an image noise measurement algorithm and a matching process using a database of image quality assessment indices corresponding to similar images and base images. Note that these evaluation processes may be performed in addition to or instead of processing using a machine learning algorithm.

例えば、画質評価指数は機械学習アルゴリズムを用いて構築した機械学習モデルより得ることができる。この場合、機械学習モデルをトレーニングする教師データを構成するペアの入力データは、事前に様々な撮影条件によって撮影された低画質画像群と高画質画像群とで構成される画像群である。また、機械学習モデルをトレーニングする教師データを構成するペアの出力データは、例えば、画像診断を行う検者が入力データの画像群のそれぞれについて設定した画質評価指数群である。 For example, the image quality assessment index can be obtained from a machine learning model constructed using a machine learning algorithm. In this case, the pair of input data constituting the teacher data for training the machine learning model is a group of images consisting of a group of low-quality images and a group of high-quality images captured in advance under various shooting conditions. In addition, the pair of output data constituting the teacher data for training the machine learning model is, for example, a group of image quality assessment indices set for each of the image groups of the input data by an examiner performing image diagnosis.

本発明の説明における真贋評価エンジンとは、入力された画像の描画を評価して、対象の撮影装置によって撮影され取得された画像か否かを、ある程度の精度で評価するモジュールである。真贋評価処理手法では、ディープラーニング等の各種機械学習アルゴリズムを用いた処理を行う。なお、真贋評価処理手法では、機械学習アルゴリズムを用いた処理に加えて又は代えて、知識ベース処理等の既存の任意の評価処理を行ってもよい。 The authenticity evaluation engine in the description of this invention is a module that evaluates the rendering of an input image and evaluates with a certain degree of accuracy whether the image was captured and acquired by the target imaging device. The authenticity evaluation processing method performs processing using various machine learning algorithms such as deep learning. Note that the authenticity evaluation processing method may perform any existing evaluation processing such as knowledge-based processing in addition to or instead of processing using machine learning algorithms.

例えば、真贋評価処理は機械学習アルゴリズムを用いて構築した機械学習モデルにより実施することができる。まず、機械学習モデルの教師データについて説明する。教師データには、事前に様々な撮影条件によって撮影された高画質画像群と対象の撮影装置によって撮影され取得されたことを表すラベル（以下、真作ラベル）とのペア群が含まれる。また、教師データには、高画質化エンジン（第１レベルの高画質化エンジン）に低画質画像を入力して生成した高画質画像群と対象の撮影装置によって撮影され取得されていないことを表すラベル（以下、贋作ラベル）とのペア群が含まれる。このような教師データを用いてトレーニングした機械学習モデルは、第１レベルの高画質化エンジンが生成する高画質画像が入力されると贋作ラベルを出力する。 For example, the authenticity evaluation process can be performed by a machine learning model constructed using a machine learning algorithm. First, the training data of the machine learning model will be described. The training data includes a pair group of high-quality images captured in advance under various shooting conditions and a label (hereinafter, genuine label) indicating that the images were captured and acquired by the target imaging device. The training data also includes a pair group of high-quality images generated by inputting low-quality images to an image quality improvement engine (first level image quality improvement engine) and a label (hereinafter, counterfeit label) indicating that the images were not captured and acquired by the target imaging device. A machine learning model trained using such training data outputs a counterfeit label when a high-quality image generated by the first level image quality improvement engine is input.

特に、二次元画像の真贋評価処理を行うＣＮＮの構成例として、図３に示す構成がある。当該ＣＮＮの構成には、畳み込み層３０１と、バッチ正規化層３０２と、正規化線形関数を用いた活性化層３０３とで構成された複数の畳み込み処理ブロック３００群が含まれる。また、当該ＣＮＮの構成には、最後の畳み込み層３０４と、全結合層３０５と、出力層３０６が含まれる。全結合層３０５は、畳み込み処理ブロック３００の出力値群を全結合する。また、出力層３０６は、Ｓｉｇｍｏｉｄ関数を利用して、入力画像Ｉｍ３１０に対して、真作ラベルを表す１の値（真）又は贋作ラベルを表す０の値（偽）を、真贋評価処理の結果（Ｒｅｓｕｌｔ）３０７として出力する。 In particular, FIG. 3 shows an example of a CNN configuration for performing authenticity evaluation processing of two-dimensional images. The CNN configuration includes a group of multiple convolution processing blocks 300, each of which is composed of a convolution layer 301, a batch normalization layer 302, and an activation layer 303 using a normalized linear function. The CNN configuration also includes a final convolution layer 304, a fully connected layer 305, and an output layer 306. The fully connected layer 305 fully connects the output value groups of the convolution processing blocks 300. The output layer 306 also uses a sigmoid function to output a value of 1 (true) representing an authentic label or a value of 0 (false) representing a counterfeit label as the result (Result) 307 of the authenticity evaluation processing for the input image Im 310.
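The Sigmoid output stage can be sketched as below: a single raw score is squashed into the range 0 to 1 and then mapped to 1 (genuine) or 0 (counterfeit). The 0.5 threshold and the function names are assumptions for this illustration; the text only states that the output layer produces 1 or 0 using a Sigmoid function.

```python
import math


def sigmoid(score):
    """Numerically stable logistic function."""
    if score >= 0:
        return 1.0 / (1.0 + math.exp(-score))
    e = math.exp(score)
    return e / (1.0 + e)


def authenticity_label(score, threshold=0.5):
    """Return 1 (genuine label) or 0 (counterfeit label)."""
    return 1 if sigmoid(score) >= threshold else 0


genuine = authenticity_label(2.0)       # strongly positive score
counterfeit = authenticity_label(-2.0)  # strongly negative score
```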

なお、畳み込み処理ブロック３００の数を１６、畳み込み層３０１群のパラメータとして、フィルタのカーネルサイズを幅３画素、高さ３画素、フィルタの数を６４とすることで、一定の精度で正しい真贋評価処理の結果を得られる。しかしながら、実際には上記の機械学習モデルの説明において述べた通り、機械学習モデルの利用形態に応じた教師データを用いて、より良いパラメータ群を設定することができる。なお、三次元画像や四次元画像を処理する必要がある場合には、フィルタのカーネルサイズを三次元や四次元に拡張してもよい。 Note that by setting the number of convolution processing blocks 300 to 16, the parameters of the convolution layer 301 group to a filter kernel size of 3 pixels wide and 3 pixels high, and the number of filters to 64, it is possible to obtain accurate authenticity evaluation results with a certain degree of precision. However, as described in the above explanation of the machine learning model, in practice, a better parameter group can be set using training data according to the usage form of the machine learning model. Note that when three-dimensional or four-dimensional images need to be processed, the filter kernel size may be expanded to three or four dimensions.

真贋評価エンジンは、第１レベルの高画質化エンジンよりも高度に高画質化する高画質化エンジン（第２レベルの高画質化エンジン）が生成する高画質画像が入力されると真作ラベルを出力することがある。つまり、真贋評価エンジンは入力された画像に対し、確実に撮影装置によって撮影され取得された画像か否かを評価できるわけではないが、撮影装置によって撮影され取得された画像らしさを持つ画像か否かを評価できる。この特性を利用して、真贋評価エンジンに高画質化エンジンが生成した高画質画像を入力することで、高画質化エンジンが生成した高画質画像が十分に高画質化されているか否かを評価できる。 The authenticity evaluation engine may output a genuine label when it receives a high-quality image generated by an image quality improvement engine (second-level image quality improvement engine) that improves image quality to a higher degree than the first-level image quality improvement engine. In other words, the authenticity evaluation engine cannot determine with certainty whether an input image was captured and acquired by an imaging device, but it can evaluate whether the image looks like one that was captured and acquired by an imaging device. Using this property, inputting a high-quality image generated by an image quality improvement engine into the authenticity evaluation engine makes it possible to evaluate whether that image has been sufficiently improved in quality.

また、高画質化エンジンの機械学習モデルと真贋評価エンジンの機械学習モデルとを協調させてトレーニングすることによって、双方のエンジンの効率や精度を向上させてもよい。この場合には、まず、高画質化エンジンが生成する高画質画像を真贋評価エンジンに評価させると真作ラベルが出力されるように、該高画質化エンジンの機械学習モデルをトレーニングする。また、並行して、高画質化エンジンが生成する画像を真贋評価エンジンに評価させると贋作ラベルを出力するように、該真贋評価エンジンの機械学習モデルをトレーニングさせる。さらに、並行して、撮影装置によって取得された画像を真贋評価エンジンに評価させると真作ラベルを出力するように、該真贋評価エンジンの機械学習モデルをトレーニングさせる。これによって、高画質化エンジンと真贋評価エンジンの効率や精度が向上する。 The efficiency and accuracy of both engines may be improved by training the machine learning model of the image quality improvement engine and the machine learning model of the authenticity evaluation engine in coordination with each other. In this case, first, the machine learning model of the image quality improvement engine is trained so that a genuine label is output when the authenticity evaluation engine evaluates a high-quality image generated by the image quality improvement engine. In parallel, the machine learning model of the authenticity evaluation engine is trained so that it outputs a counterfeit label when it evaluates an image generated by the image quality improvement engine. Also in parallel, the machine learning model of the authenticity evaluation engine is trained so that it outputs a genuine label when it evaluates an image acquired by an imaging device. This improves the efficiency and accuracy of both the image quality improvement engine and the authenticity evaluation engine.
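The three training targets described above resemble the objectives commonly used in adversarial training. The sketch below expresses them as loss values, assuming binary cross-entropy as the loss function; the source does not name a specific loss, so this choice and the function names are assumptions. Here `p` denotes the probability, output by the authenticity evaluation engine, that an image is genuine.

```python
import math


def bce(p, target, eps=1e-7):
    """Binary cross-entropy between a probability p and a 0/1 target."""
    p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
    return -(target * math.log(p) + (1 - target) * math.log(1.0 - p))


def evaluator_loss(p_acquired, p_generated):
    """Authenticity evaluation engine: acquired images should score 1
    (genuine label), generated images should score 0 (counterfeit label)."""
    return bce(p_acquired, 1) + bce(p_generated, 0)


def enhancer_loss(p_generated):
    """Image quality improvement engine: its output should be judged genuine."""
    return bce(p_generated, 1)


# The evaluator is rewarded for telling the two kinds of image apart ...
good_evaluator = evaluator_loss(p_acquired=0.9, p_generated=0.1)
undecided_evaluator = evaluator_loss(p_acquired=0.5, p_generated=0.5)
# ... while the enhancer is rewarded when its output passes as genuine.
convincing = enhancer_loss(p_generated=0.9)
unconvincing = enhancer_loss(p_generated=0.1)
```

The two objectives pull in opposite directions on `p_generated`, which is why training the two models in coordination can improve both.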

＜第１の実施形態＞ First Embodiment
以下、図４及び５を参照して、第１の実施形態による医用画像処理装置について説明する。図４は、本実施形態に係る画像処理装置の概略的な構成の一例を示す。 The medical image processing apparatus according to the first embodiment will be described below with reference to FIGS. 4 and 5. FIG. 4 shows an example of a schematic configuration of the image processing apparatus according to this embodiment.

画像処理装置４００は、撮影装置１０及び表示部２０に、回路やネットワークを介して接続されている。また、撮影装置１０及び表示部２０が直接接続されていてもよい。なお、これらの装置は本実施形態では別個の装置とされているが、これらの装置の一部又は全部を一体的に構成してもよい。また、これらの装置は、他の任意の装置と回路やネットワークを介して接続されてもよいし、他の任意の装置と一体的に構成されてもよい。 The image processing device 400 is connected to the image capture device 10 and the display unit 20 via a circuit or a network. The image capture device 10 and the display unit 20 may also be directly connected. Note that although these devices are separate devices in this embodiment, some or all of these devices may be configured as an integrated device. Furthermore, these devices may be connected to any other device via a circuit or a network, or may be configured as an integrated device with any other device.

画像処理装置４００には、取得部４０１と、撮影条件取得部４０２と、高画質化可否判定部４０３と、高画質化部４０４と、出力部４０５（表示制御部）とが設けられている。なお、画像処理装置４００は、これら構成要素のうちの一部が設けられた複数の装置で構成されてもよい。取得部４０１は、撮影装置１０や他の装置から各種データや画像を取得したり、不図示の入力装置を介して検者からの入力を取得したりすることができる。なお、入力装置としては、マウス、キーボード、タッチパネル及びその他任意の入力装置を採用してよい。また、表示部２０をタッチパネルディスプレイとして構成してもよい。 The image processing device 400 is provided with an acquisition unit 401, an imaging condition acquisition unit 402, an image quality improvement possibility determination unit 403, an image quality improvement unit 404, and an output unit 405 (display control unit). The image processing device 400 may be composed of multiple devices in which some of these components are provided. The acquisition unit 401 can acquire various data and images from the imaging device 10 or other devices, and can acquire input from the examiner via an input device (not shown). The input device may be a mouse, a keyboard, a touch panel, or any other input device. The display unit 20 may also be configured as a touch panel display.

撮影条件取得部４０２は、取得部４０１が取得した医用画像（入力画像）の撮影条件を取得する。具体的には、医用画像のデータ形式に応じて、医用画像を構成するデータ構造に保存された撮影条件群を取得する。なお、医用画像に撮影条件が保存されていない場合には、取得部４０１を介して、撮影装置１０や画像管理システムから撮影条件群を含む撮影情報群を取得することができる。 The imaging condition acquisition unit 402 acquires the imaging conditions of the medical image (input image) acquired by the acquisition unit 401. Specifically, the imaging condition group stored in the data structure constituting the medical image is acquired according to the data format of the medical image. Note that if the imaging conditions are not stored in the medical image, it is possible to acquire an imaging information group including the imaging condition group from the imaging device 10 or the image management system via the acquisition unit 401.

高画質化可否判定部４０３は、撮影条件取得部４０２によって取得された撮影条件群を用いて高画質化部４０４によって医用画像が対処可能であるか否かを判定する。高画質化部４０４は、対処可能である医用画像について高画質化を行い、画像診断に適した高画質画像を生成する。出力部４０５は、高画質化部４０４が生成した高画質画像や入力画像、各種情報等を表示部２０に表示させる。また、出力部４０５は、生成された高画質画像等を画像処理装置４００に接続される記憶装置（記憶部）に記憶させてもよい。 The image quality improvement possibility determination unit 403 determines whether the medical image can be handled by the image quality improvement unit 404 using the group of shooting conditions acquired by the shooting condition acquisition unit 402. The image quality improvement unit 404 improves the image quality of the medical image that can be handled, and generates a high-quality image suitable for image diagnosis. The output unit 405 displays the high-quality image generated by the image quality improvement unit 404, the input image, various information, etc. on the display unit 20. The output unit 405 may also store the generated high-quality image, etc. in a storage device (storage unit) connected to the image processing device 400.

次に、高画質化部４０４について詳細に説明する。高画質化部４０４には高画質化エンジンが備えられている。本実施形態に係る高画質化エンジンの備える高画質化手法では、機械学習アルゴリズムを用いた処理を行う。 Next, the image quality improvement unit 404 will be described in detail. The image quality improvement unit 404 is equipped with an image quality improvement engine. The image quality improvement method equipped with the image quality improvement engine according to this embodiment performs processing using a machine learning algorithm.

本実施形態では、機械学習アルゴリズムに係る機械学習モデルのトレーニングに、処理対象として想定される特定の撮影条件を持つ低画質画像である入力データと、入力データに対応する高画質画像である出力データのペア群で構成された教師データを用いる。なお、特定の撮影条件には、具体的には、予め決定された撮影部位、撮影方式、撮影画角、及び画像サイズ等が含まれる。 In this embodiment, training of a machine learning model related to a machine learning algorithm uses teacher data consisting of a pair group of input data, which is a low-quality image having specific shooting conditions assumed to be processed, and output data, which is a high-quality image corresponding to the input data. Note that the specific shooting conditions specifically include a predetermined shooting part, shooting method, shooting angle of view, image size, etc.
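Since the engine is trained only on images with specific imaging conditions, the check performed by the image quality improvement possibility determination unit 403 can be pictured as a comparison of the acquired conditions against the supported ones. The sketch below is an assumption-laden illustration: the dictionary keys, the example values, and the function name `can_enhance` are all hypothetical, and only the four kinds of condition (imaging part, imaging method, angle of view, image size) come from the text.

```python
# Hypothetical conditions that the trained engine is assumed to support.
SUPPORTED_CONDITIONS = {
    "imaging_part": {"macular area"},
    "imaging_method": {"OCTA"},
    "angle_of_view": {"3x3mm"},
    "image_size": {(300, 300)},
}


def can_enhance(conditions, supported=SUPPORTED_CONDITIONS):
    """Return True only if every required condition is present and supported.

    A missing condition yields None, which is in no allowed set, so the
    image is treated as not handleable.
    """
    return all(conditions.get(key) in allowed
               for key, allowed in supported.items())


matching = {
    "imaging_part": "macular area",
    "imaging_method": "OCTA",
    "angle_of_view": "3x3mm",
    "image_size": (300, 300),
}
other_part = dict(matching, imaging_part="optic disc")
incomplete = {"imaging_part": "macular area"}

results = [can_enhance(matching), can_enhance(other_part), can_enhance(incomplete)]
```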

本実施形態において、教師データの入力データは、撮影装置１０と同じ機種、撮影装置１０と同じ設定により取得された低画質画像である。また、教師データの出力データは、撮影装置１０と同じ機種が備える設定や画像処理により取得された高画質画像である。具体的には、出力データは、例えば、複数回撮影することにより取得した画像（元画像）群に対して加算平均等の重ね合わせ処理を行うことにより得られる高画質画像（重ね合わせ画像）である。ここで、高画質画像と低画質画像についてＯＣＴＡのモーションコントラストデータを例として説明をする。ここで、モーションコントラストデータとは、ＯＣＴＡ等で用いられる、撮影対象の同一箇所を繰り返し撮影し、その撮影間における撮影対象の時間的な変化を検出したデータである。このとき、算出したモーションコントラストデータ（３次元の医用画像データの一例）のうち、撮影対象の深さ方向における所望の範囲のデータを用いて正面画像を生成することで、ＯＣＴＡのＥｎ－Ｆａｃｅ画像（モーションコントラスト正面画像）を生成することができる。なお、以下では同一箇所におけるＯＣＴデータを繰り返し撮影することをＮＯＲ（ＮｕｍｂｅｒＯｆＲｅｐｅａｔ）と呼ぶ。 In this embodiment, the input data of the teacher data is a low-quality image acquired by the same model and the same settings as the imaging device 10. The output data of the teacher data is a high-quality image acquired by the settings and image processing of the same model as the imaging device 10. Specifically, the output data is, for example, a high-quality image (superimposed image) obtained by performing superimposition processing such as averaging on a group of images (original images) acquired by multiple shooting. Here, the high-quality image and the low-quality image are explained using the motion contrast data of OCTA as an example. Here, the motion contrast data is data used in OCTA etc., which is obtained by repeatedly shooting the same location of the shooting target and detecting the temporal change of the shooting target between the shootings. At this time, the En-Face image (motion contrast front image) of OCTA can be generated by generating a front image using data of a desired range in the depth direction of the shooting target from the calculated motion contrast data (an example of three-dimensional medical image data). In the following, repeatedly shooting OCT data at the same location is called NOR (Number Of Repeat).

本実施形態において、重ね合わせ処理による高画質画像と低画質画像の生成例として異なる２種類の方法について図２８を用いて説明をする。 In this embodiment, two different methods for generating high-quality images and low-quality images through overlay processing are described with reference to FIG. 28.

第一の方法は、高画質画像の例として、撮影対象の同一箇所を繰り返し撮影したＯＣＴデータから生成するモーションコントラストデータに関して、図２８（ａ）を用いて説明する。図２８（ａ）において、Ｉｍ２８１０は３次元のモーションコントラストデータ、Ｉｍ２８１１は３次元のモーションコントラストデータを構成する２次元のモーションコントラストデータを示す。そして、Ｉｍ２８１１－１～Ｉｍ２８１１－３は、Ｉｍ２８１１を生成するためのＯＣＴ断層画像（Ｂスキャン）を示している。ここで、ＮＯＲとは、図２８（ａ）においては、Ｉｍ２８１１－１～Ｉｍ２８１１－３におけるＯＣＴ断層画像の数の事を示し、図の例においてＮＯＲは３である。Ｉｍ２８１１－１～Ｉｍ２８１１－３は所定の時間間隔（Δｔ）で撮影される。なお、同一箇所とは被検眼の正面方向（Ｘ－Ｙ）において、１ラインの事を示し、図２８（ａ）においては、Ｉｍ２８１１の箇所に相当する。なお、正面方向は、深さ方向に対して交差する方向の一例である。モーションコントラストデータは時間的な変化を検出したデータであるため、このデータを生成するためには、少なくともＮＯＲは２回とする必要がある。例えば、ＮＯＲが２の場合には、１つのモーションコントラストデータが生成される。ＮＯＲが３の場合には、隣接する時間間隔（１回目と２回目、２回目と３回目）のＯＣＴのみでモーションコントラストデータを生成する場合には、２つのデータが生成される。離れた時間間隔（１回目と３回目）のＯＣＴデータも用いてモーションコントラストデータを生成する場合には、合計３つのデータが生成される。すなわち、ＮＯＲを３回、４回、・・・と増やしていくと、同一箇所におけるモーションコントラストのデータ数も増加する。同一箇所を繰り返し撮影して取得した複数のモーションコントラストデータを位置合わせして加算平均等の重ね合わせ処理をすることで、高画質なモーションコントラストデータを生成することが出来る。そのため、ＮＯＲを少なくとも３回以上とし、５回以上とするのが望ましい。一方、これに対応する低画質画像の例としては、加算平均等の重ね合わせ処理を行う前のモーションコントラストデータとする。この場合、低画質画像は加算平均等の重ね合わせ処理を行う際の基準画像とするのが望ましい。重ね合わせ処理をする際に、基準画像に対して対象画像の位置や形状を変形して位置合わせを行っておけば、基準画像と重ね合わせ処理後の画像とでは空間的な位置ずれがほとんどない。そのため、容易に低画質画像と高画質画像のペアとすることが出来る。なお、基準画像ではなく位置合わせの画像変形処理を行った対象画像を低画質画像としてもよい。元画像群（基準画像と対象画像）のそれぞれを入力データ、対応する重ね合わせ画像を出力データとすることで、複数のペア群を生成することができる。例えば、１５の元画像群から１の重ね合わせ画像を得る場合、元画像群のうちの一つ目の元画像と重ね合わせ画像とのペア、元画像群のうちの二つ目の元画像と重ね合わせ画像とのペアを生成することができる。このように、１５の元画像群から１の重ね合わせ画像を得る場合には、元画像群のうちの一つの画像と重ね合わせ画像による１５のペア群が生成可能である。なお、主走査（Ｘ）方向に同一箇所を繰り返し撮影し、それを副走査（Ｙ）方向にずらしながらスキャンをすることで３次元の高画質データを生成することが出来る。 The first method will be described with reference to FIG. 28(a) as an example of a high-quality image, in which motion contrast data is generated from OCT data obtained by repeatedly capturing images of the same location on the subject. In FIG. 28(a), Im2810 shows three-dimensional motion contrast data, and Im2811 shows two-dimensional motion contrast data that constitutes the three-dimensional motion contrast data. Im2811-1 to Im2811-3 show OCT tomographic images (B-scans) used to generate Im2811. 
Here, NOR refers to the number of OCT tomographic images in Im2811-1 to Im2811-3 in FIG. 28(a), and NOR is 3 in the example shown. Im2811-1 to Im2811-3 are captured at a predetermined time interval (Δt). The same location refers to one line in the front direction (X-Y) of the eye to be examined, and corresponds to the location of Im2811 in FIG. 28(a). The front direction is an example of a direction intersecting with the depth direction. Since the motion contrast data is data that detects a change over time, it is necessary to perform at least two NORs in order to generate this data. For example, when NOR is 2, one motion contrast data is generated. When NOR is 3, two data are generated when the motion contrast data is generated only by OCT of adjacent time intervals (first and second times, second and third times). When the motion contrast data is generated using OCT data of distant time intervals (first and third times), a total of three data are generated. That is, when NOR is increased to three times, four times, etc., the number of motion contrast data at the same location also increases. By aligning multiple motion contrast data obtained by repeatedly photographing the same location and performing superposition processing such as averaging, it is possible to generate high-quality motion contrast data. Therefore, NOR is set to at least three times, and preferably to five times or more. On the other hand, an example of a corresponding low-quality image is motion contrast data before performing superimposition processing such as averaging. In this case, it is preferable to use the low-quality image as a reference image when performing superimposition processing such as averaging. When performing superimposition processing, if the position and shape of the target image are deformed with respect to the reference image to perform positioning, there is almost no spatial positional deviation between the reference image and the image after superimposition processing. 
Therefore, a low-quality image can easily be paired with a high-quality image. Note that a target image that has undergone image deformation for alignment, rather than the reference image, may be used as the low-quality image. By using each image in the original image group (the reference image and the target images) as input data and the corresponding overlaid image as output data, multiple pairs can be generated. For example, when one overlaid image is obtained from a group of 15 original images, a pair of the first original image and the overlaid image, a pair of the second original image and the overlaid image, and so on can be generated, for a total of 15 pairs, each consisting of one original image and the overlaid image. High-quality three-dimensional data can also be generated by repeatedly imaging the same location in the main scanning (X) direction while shifting the scan position in the sub-scanning (Y) direction.
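The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, and counting every pair of repeats for the non-adjacent case is an assumption that generalizes the NOR = 3 example in the text.

```python
from math import comb


def motion_contrast_count(nor, adjacent_only=True):
    """Number of motion contrast data obtainable from `nor` repeated scans.

    adjacent_only=True uses only neighbouring repeats (1st-2nd, 2nd-3rd, ...).
    adjacent_only=False also uses non-adjacent repeats; counting every pair
    of repeats is an assumption extrapolated from the NOR = 3 example.
    """
    if nor < 2:
        return 0  # a change over time needs at least two scans
    return nor - 1 if adjacent_only else comb(nor, 2)


def make_training_pairs(originals, overlaid):
    """Pair each original (low-quality) image with the single overlaid image."""
    return [(original, overlaid) for original in originals]
```

With NOR = 3 this gives two pieces of data for adjacent intervals and three when the first and third scans are also compared, and a group of 15 original images yields 15 training pairs.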

第二の方法は、撮影対象の同一領域を複数回撮影したモーションコントラストデータを重ね合わせ処理することで高画質画像を生成する処理に関して、図２８（ｂ）を用いて説明する。なお、同一領域とは被検眼の正面方向（Ｘ－Ｙ）において、３×３ｍｍや１０×１０ｍｍのような領域の事を示し、断層画像の深さ方向を含めて３次元のモーションコントラストデータを取得することを意味する。同一領域を複数回撮影して重ね合わせ処理を行う際には、１回あたりの撮影を短くするため、ＮＯＲは２回か３回とすることが望ましい。また、高画質な３次元モーションコントラストデータを生成するために、同一領域の３次元データを少なくとも２データ以上取得する。図２８（ｂ）では、複数の３次元モーションコントラストデータの例を示している。Ｉｍ２８２０～Ｉｍ２８４０は、図２８（ａ）で説明したのと同様に３次元のモーションコントラストデータである。これら２データ以上の３次元モーションコントラストデータを用いて、正面方向（Ｘ－Ｙ）と深度方向（Ｚ）の位置合わせ処理を行い、それぞれのデータにおいてアーティファクトとなるデータを除外した後に、平均化処理を行う。それによりアーティファクトの除外された１つの高画質な３次元モーションコントラストデータを生成することが出来る。３次元モーションコントラストデータから任意の平面を生成することで高画質画像となる。一方、これに対応する低画質画像は加算平均等の重ね合わせ処理を行う際の基準データから生成する任意の平面とするのが望ましい。第一の方法で説明したように、基準画像と加算平均後の画像とでは空間的な位置ずれがほとんどないため、容易に低画質画像と高画質画像のペアとすることが出来る。なお、基準データではなく位置合わせの画像変形処理を行った対象データから生成した任意の平面を低画質画像としてもよい。 The second method is a process for generating a high-quality image by superimposing motion contrast data obtained by photographing the same area of the subject multiple times, and is described with reference to FIG. 28(b). The same area refers to an area of 3×3 mm or 10×10 mm in the front direction (X-Y) of the subject's eye, and means obtaining three-dimensional motion contrast data including the depth direction of the tomographic image. When photographing the same area multiple times and performing superimposing, it is desirable to perform NOR two or three times in order to shorten each photographing. In addition, in order to generate high-quality three-dimensional motion contrast data, at least two or more three-dimensional data of the same area are obtained. FIG. 28(b) shows an example of multiple three-dimensional motion contrast data. Im2820 to Im2840 are three-dimensional motion contrast data similar to those described in FIG. 28(a). Using these two or more three-dimensional motion contrast data, alignment processing is performed in the front direction (X-Y) and the depth direction (Z), and after removing data that becomes artifacts in each data, averaging processing is performed. 
This makes it possible to generate a single piece of high-quality three-dimensional motion contrast data from which artifacts have been removed. A high-quality image is obtained by generating an arbitrary plane from the three-dimensional motion contrast data. On the other hand, it is desirable for the corresponding low-quality image to be an arbitrary plane generated from reference data when performing superposition processing such as averaging. As explained in the first method, there is almost no spatial misalignment between the reference image and the image after averaging, so a low-quality image and a high-quality image can easily be paired. Note that the low-quality image may be an arbitrary plane generated from target data that has been subjected to image deformation processing for alignment, rather than the reference data.
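The artifact-excluding average of the second method might look like the following minimal sketch. It assumes the volumes are already aligned in X-Y and Z, that each volume is flattened to a list of voxel values, and that artifact voxels have been marked with `None` beforehand; these representation choices and the function name are illustrative assumptions, not part of the described device.

```python
def average_excluding_artifacts(volumes):
    """Average aligned volumes voxel by voxel, skipping artifact voxels.

    volumes: list of equally sized flat voxel lists; None marks a voxel
    that was flagged as an artifact (hypothetical convention).
    Returns None for a voxel that is an artifact in every volume.
    """
    averaged = []
    for voxels in zip(*volumes):
        valid = [v for v in voxels if v is not None]
        averaged.append(sum(valid) / len(valid) if valid else None)
    return averaged
```

A voxel that is corrupted in one acquisition is then filled from the remaining acquisitions, which is the benefit the second method relies on.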

第一の方法は、撮影自体が１回で終了するため被験者の負担は少ない。しかし、ＮＯＲの回数を増やすほど１回の撮影時間が長くなってしまう。また、撮影途中に目の混濁や睫毛などのアーティファクトが入った場合には必ずしも良い画像が得られるとは限らない。第二の方法は、複数回撮影を行うため被験者の負担は少し増えてしまう。しかし、１回の撮影時間が短く済むのと、１回の撮影でアーティファクトが入ったとしても、別の撮影でアーティファクトが写らなければ最終的にはアーティファクトの少ないきれいな画像を得ることが出来る。これらの特徴を鑑みて、データを集める際には被験者の状況に合わせて任意の方法を選択する。 The first method places less of a burden on the subject, as the shooting itself is completed in one go. However, the more NORs are used, the longer each shooting session takes. Also, if artifacts such as clouding of the eye or eyelashes appear during shooting, a good image may not necessarily be obtained. The second method requires multiple shots, which places a slightly greater burden on the subject. However, the time required for each shot is shorter, and even if an artifact appears in one shot, as long as the artifact is not visible in another shot, a clean image with fewer artifacts can ultimately be obtained. Taking these characteristics into consideration, an arbitrary method is selected according to the subject's situation when collecting data.

本実施形態では、モーションコントラストデータを例として説明をしたがこれに限らない。モーションコントラストデータを生成するためにＯＣＴデータを撮影しているため、ＯＣＴデータでも上記の方法で同じことが可能である。さらに、本実施形態においてトラッキング処理について説明を省略したが、被検眼の同一箇所や同一領域を撮影するため、被検眼のトラッキングを行いながら撮影を行うことが望ましい。 In this embodiment, the motion contrast data has been described as an example, but the present invention is not limited to this. Since OCT data is captured to generate motion contrast data, the same method can be used with OCT data as described above. Furthermore, although the description of the tracking process has been omitted in this embodiment, in order to capture images of the same location or area of the subject's eye, it is desirable to capture images while tracking the subject's eye.

本実施形態において、３次元の高画質データと低画質データのペアが出来ているため、ここから任意の２次元画像のペアを生成することが出来る。これに関して、図２９を用いて説明をする。例えば、対象画像をＯＣＴＡのＥｎ－Ｆａｃｅ画像とする場合、３次元データから所望の深度範囲でＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成する。所望の深度範囲とは、図２８においてＺ方向における範囲の事を示す。ここで生成するＯＣＴＡのＥｎ－Ｆａｃｅ画像の例を図２９（ａ）に示す。ＯＣＴＡのＥｎ－Ｆａｃｅ画像としては、表層（Ｉｍ２９１０）、深層（Ｉｍ２９２０）、外層（Ｉｍ２９３０）、脈絡膜血管網（Ｉｍ２９４０）など、異なる深度範囲で生成したＯＣＴＡのＥｎ－Ｆａｃｅ画像を用いて学習を行う。なお、ＯＣＴＡのＥｎ－Ｆａｃｅ画像の種類はこれに限らず、基準となる層とオフセットの値を変えて異なる深度範囲を設定したＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成して種類を増やしてもよい。学習を行う際には、異なる深さのＯＣＴＡのＥｎ－Ｆａｃｅ画像毎に別々に学習をしてもよいし、異なる深度範囲の画像を複数組み合わせて（例えば、表層側と深層側で分ける）学習してもよいし、全ての深度範囲のＯＣＴＡのＥｎ－Ｆａｃｅ画像を一緒に学習させるようにしてもよい。ＯＣＴデータから生成する輝度のＥｎ－Ｆａｃｅ画像の場合も、ＯＣＴＡのＥｎ－Ｆａｃｅと同様に、任意の深度範囲から生成した複数のＥｎ－Ｆａｃｅ画像を用いて学習を行う。例えば、高画質化エンジンが、被検眼の異なる深度範囲に対応する複数のモーションコントラスト正面画像を含む学習データを用いて得た機械学習エンジンを含む場合を考える。このとき、取得部は、異なる深度範囲を含む長い深度範囲のうち一部の深度範囲に対応するモーションコントラスト正面画像を第１の画像として取得することができる。すなわち、学習データに含まれる複数のモーションコントラスト正面画像に対応する複数の深度範囲とは異なる深度範囲に対応するモーションコントラスト正面画像を、高画質化時の入力画像とすることができる。もちろん、学習時と同じ深度範囲のモーションコントラスト正面画像を、高画質化時の入力画像としてもよい。また、一部の深度範囲は、検者がユーザーインターフェース上の任意のボタンを押す等に応じて設定されてもよいし、自動的に設定されてもよい。なお、上述した内容は、モーションコントラスト正面画像に限るものではなく、例えば、輝度のＥｎ－Ｆａｃｅ画像に対しても適用することができる。 In this embodiment, since a pair of three-dimensional high-quality data and low-quality data is created, any pair of two-dimensional images can be generated from the pair. This will be explained with reference to FIG. 29. For example, when the target image is an OCTA En-Face image, the OCTA En-Face image is generated in the desired depth range from the three-dimensional data. The desired depth range refers to the range in the Z direction in FIG. 28. An example of the OCTA En-Face image generated here is shown in FIG. 29(a). As the OCTA En-Face image, learning is performed using OCTA En-Face images generated in different depth ranges, such as the superficial layer (Im2910), deep layer (Im2920), outer layer (Im2930), and choroidal vascular network (Im2940). 
The types of OCTA En-Face images are not limited to these; more types may be added by generating OCTA En-Face images whose depth ranges are set by changing the reference layer and the offset value. Training may be performed separately for each OCTA En-Face image of a different depth, on combinations of images from several depth ranges (for example, split into a superficial side and a deep side), or on OCTA En-Face images of all depth ranges together. For a luminance En-Face image generated from OCT data, training likewise uses multiple En-Face images generated from arbitrary depth ranges, as with OCTA En-Face images. For example, consider a case where the image quality improvement engine includes a machine learning engine obtained using learning data that includes multiple motion contrast front images corresponding to different depth ranges of the subject's eye. In this case, the acquisition unit can acquire, as the first image, a motion contrast front image corresponding to a partial depth range within a longer depth range that includes the different depth ranges. That is, a motion contrast front image corresponding to a depth range different from the depth ranges of the motion contrast front images in the learning data can be used as an input image for image quality improvement. Of course, a motion contrast front image with the same depth range as one used during training may also be used as the input image. The partial depth range may be set in response to the examiner pressing a button on the user interface, or may be set automatically. Note that the above is not limited to motion contrast front images and can also be applied to, for example, luminance En-Face images.
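As a rough illustration of generating a front image from a chosen depth range, the sketch below projects a volume indexed as `volume[z][y][x]` over a fixed Z interval. The mean projection, the fixed (non-layer-following) depth range, and the function name are all assumptions made for brevity; the text itself only states that a reference layer and an offset define the range.

```python
def en_face(volume, z_start, z_end):
    """Mean-project volume[z][y][x] over the depth range z_start <= z < z_end.

    A flat depth range is assumed here; a real range would typically follow
    a segmented reference layer plus an offset.
    """
    slab = volume[z_start:z_end]
    if not slab:
        raise ValueError("empty depth range")
    height, width = len(slab[0]), len(slab[0][0])
    return [
        [sum(plane[y][x] for plane in slab) / len(slab) for x in range(width)]
        for y in range(height)
    ]
```

Calling the function with different `(z_start, z_end)` values on the paired high-quality and low-quality volumes yields matching two-dimensional training pairs for each depth range.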

なお、処理対象の画像が断層画像である場合、ＢスキャンであるＯＣＴ断層画像やモーションコントラストデータの断層画像を用いて学習を行う。これに関して、図２９（ｂ）を用いて説明をする。図２９（ｂ）において、Ｉｍ２９５１～Ｉｍ２９５３はＯＣＴの断層画像である。図２９（ｂ）において画像が異なるのは、副走査（Ｙ）方向の位置が異なる場所の断層画像を示しているからである。断層画像においては、副走査方向の位置の違いを気にせずに一緒に学習をするようにしてもよい。ただし、撮影部位（例えば、黄斑部中心、視神経乳頭部中心）が異なる場所を撮影した画像の場合には、部位ごとに別々に学習をするようにしてもよいし、撮影部位を気にせずに一緒に学習をするようにしてもよい。なお、ＯＣＴ断層画像と、モーションコントラストデータの断層画像においては画像特徴量が大きく異なるので別々に学習を行う方が良い。 When the image to be processed is a tomographic image, learning is performed using an OCT tomographic image, which is a B-scan, or a tomographic image of motion contrast data. This will be described with reference to FIG. 29(b). In FIG. 29(b), Im2951 to Im2953 are OCT tomographic images. The images are different in FIG. 29(b) because they show tomographic images at different positions in the sub-scanning (Y) direction. For tomographic images, learning may be performed together without considering the difference in position in the sub-scanning direction. However, in the case of images taken at different locations (for example, the center of the macula, the center of the optic disc), learning may be performed separately for each location, or learning may be performed together without considering the location. Note that the image features of OCT tomographic images and tomographic images of motion contrast data are significantly different, so it is better to learn them separately.

重ね合わせ処理を行った重ね合わせ画像は、元画像群で共通して描出された画素が強調されるため、画像診断に適した高画質画像になる。この場合には、生成される高画質画像は、共通して描出された画素が強調された結果、低輝度領域と高輝度領域との違いがはっきりした高コントラストな画像になる。また、例えば、重ね合わせ画像では、撮影毎に発生するランダムノイズが低減されたり、ある時点の元画像ではうまく描出されなかった領域が他の元画像群によって補間されたりすることができる。 The overlaid image created by the overlay process emphasizes pixels that are commonly depicted in the original images, resulting in a high-quality image suitable for image diagnosis. In this case, the high-quality image generated is a high-contrast image with a clear distinction between low-brightness and high-brightness areas, as a result of emphasizing the commonly depicted pixels. In addition, for example, in the overlaid image, random noise that occurs with each capture can be reduced, and areas that were not well depicted in the original images at a certain point in time can be interpolated using other original images.

また、機械学習モデルの入力データを複数の画像で構成する必要がある場合には、元画像群から必要な数の元画像群を選択し、入力データとすることができる。例えば、１５の元画像群から１の重ね合わせ画像を得る場合において、機械学習モデルの入力データとして２の画像が必要であれば、１０５（１５Ｃ２＝１０５）のペア群を生成可能である。 In addition, when the input data for a machine learning model needs to be composed of multiple images, the required number of original image groups can be selected from the original image group and used as input data. For example, when obtaining one overlaid image from a group of 15 original images, if two images are required as input data for the machine learning model, 105 (15C2=105) pair groups can be generated.
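The figure of 105 pairs follows from choosing 2 of the 15 original images without regard to order. A short sketch, with a hypothetical function name, that enumerates such multi-image inputs:

```python
from itertools import combinations


def make_multi_input_pairs(originals, overlaid, k):
    """Pair every k-image subset of the originals with the overlaid image."""
    return [(inputs, overlaid) for inputs in combinations(originals, k)]


pairs = make_multi_input_pairs(list(range(15)), "overlaid", 2)
print(len(pairs))  # 15C2 = 105
```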

なお、教師データを構成するペア群のうち、高画質化に寄与しないペアは教師データから取り除くことができる。例えば、教師データのペアを構成する出力データである高画質画像が画像診断に適さない画質である場合には、当該教師データを用いて学習した高画質化エンジンが出力する画像も画像診断に適さない画質になってしまう可能性がある。そのため、出力データが画像診断に適さない画質であるペアを教師データから取り除くことで、高画質化エンジンが画像診断に適さない画質の画像を生成する可能性を低減させることができる。 Note that, among the pairs that make up the training data, pairs that do not contribute to high image quality can be removed from the training data. For example, if the high-image-quality image that is the output data that makes up a pair of training data has image quality that is unsuitable for image diagnosis, the image output by the image quality improvement engine that has learned using the training data may also have image quality that is unsuitable for image diagnosis. Therefore, by removing pairs whose output data has image quality that is unsuitable for image diagnosis from the training data, the possibility that the image quality improvement engine will generate an image with image quality that is unsuitable for image diagnosis can be reduced.

また、ペアである画像群の平均輝度や輝度分布が大きく異なる場合には、当該教師データを用いて学習した高画質化エンジンが、低画質画像と大きく異なる輝度分布を持つ画像診断に適さない画像を出力する可能性がある。このため、平均輝度や輝度分布が大きく異なる入力データと出力データのペアを教師データから取り除くこともできる。 In addition, if the average luminance or luminance distribution of a pair of images differs significantly, the image quality improvement engine that has learned using the training data may output an image that is not suitable for image diagnosis, with a luminance distribution that differs significantly from the low-quality image. For this reason, it is possible to remove pairs of input data and output data with significantly different average luminance or luminance distribution from the training data.
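One simple way to screen pairs by average luminance is sketched below. The threshold `max_diff` and the function names are hypothetical; the text does not specify how large a difference counts as "significant", and a comparison of full luminance distributions would need a different measure.

```python
def mean_luminance(image):
    """Mean pixel value of a 2-D image given as a list of rows."""
    pixels = [p for row in image for p in row]
    return sum(pixels) / len(pixels)


def filter_pairs_by_luminance(pairs, max_diff):
    """Keep only (input, output) pairs whose mean luminances are close.

    max_diff is an assumed, application-specific threshold.
    """
    return [
        (low, high)
        for low, high in pairs
        if abs(mean_luminance(low) - mean_luminance(high)) <= max_diff
    ]
```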

さらに、ペアである画像群に描画される撮影対象の構造や位置が大きく異なる場合には、当該教師データを用いて学習した高画質化エンジンが、低画質画像と大きく異なる構造や位置に撮影対象を描画した画像診断に適さない画像を出力する可能性がある。このため、描画される撮影対象の構造や位置が大きく異なる入力データと出力データのペアを教師データから取り除くこともできる。また、高画質化エンジンについて、品質保持の観点から、自身が出力する高画質画像を教師データとして用いないように構成することができる。 Furthermore, if the structure or position of the subject depicted in the paired images differs significantly, the image quality improvement engine that has learned using the training data may output an image that is not suitable for image diagnosis, in which the subject is depicted in a structure or position that differs significantly from that of the low-quality image. For this reason, it is possible to remove from the training data pairs of input data and output data in which the structure or position of the depicted subject differs significantly. Furthermore, from the perspective of maintaining quality, the image quality improvement engine can be configured not to use the high-quality images it outputs as training data.

このように機械学習を行った高画質化エンジンを用いることで、高画質化部４０４は、一回の撮影で取得された医用画像が入力された場合に、重ね合わせ処理によって高コントラスト化やノイズ低減等が行われたような高画質画像を出力することができる。このため、高画質化部４０４は、入力画像である低画質画像に基づいて、画像診断に適した高画質画像を生成することができる。 By using an image quality improvement engine trained by machine learning in this way, when a medical image acquired in a single shot is input, the image quality improvement unit 404 can output a high-quality image that looks as if contrast enhancement, noise reduction, and the like had been applied by overlay processing. Therefore, the image quality improvement unit 404 can generate a high-quality image suitable for image diagnosis based on a low-quality image that is the input image.

次に、図５のフロー図を参照して、本実施形態に係る一連の画像処理について説明する。図５は本実施形態に係る一連の画像処理のフロー図である。まず、本実施形態に係る一連の画像処理が開始されると、処理はステップＳ５１０に移行する。 Next, a series of image processing steps according to this embodiment will be described with reference to the flow diagram in FIG. 5. FIG. 5 is a flow diagram of a series of image processing steps according to this embodiment. First, when a series of image processing steps according to this embodiment is started, the process proceeds to step S510.

ステップＳ５１０では、取得部４０１が、回路やネットワークを介して接続された撮影装置１０から、撮影装置１０が撮影した画像を入力画像として取得する。なお、取得部４０１は、撮影装置１０からの要求に応じて、入力画像を取得してもよい。このような要求は、例えば、撮影装置１０が画像を生成した時、撮影装置１０が生成した画像を撮影装置１０が備える記憶装置に保存する前や保存した後、保存された画像を表示部２０に表示する時、画像解析処理に高画質画像を利用する時等に発行されてよい。 In step S510, the acquisition unit 401 acquires an image captured by the imaging device 10 as an input image from the imaging device 10 connected via a circuit or network. The acquisition unit 401 may acquire the input image in response to a request from the imaging device 10. Such a request may be issued, for example, when the imaging device 10 generates an image, before or after the image generated by the imaging device 10 is saved in a storage device provided in the imaging device 10, when the saved image is displayed on the display unit 20, when a high-quality image is used for image analysis processing, etc.

なお、取得部４０１は、撮影装置１０から画像を生成するためのデータを取得し、画像処理装置４００が当該データに基づいて生成した画像を入力画像として取得してもよい。この場合、画像処理装置４００が各種画像を生成するための画像生成方法としては、既存の任意の画像生成方法を採用してよい。 The acquisition unit 401 may acquire data for generating an image from the imaging device 10, and acquire an image generated by the image processing device 400 based on the data as an input image. In this case, any existing image generation method may be adopted as the image generation method for the image processing device 400 to generate various images.

ステップＳ５２０では、撮影条件取得部４０２が、入力画像の撮影条件群を取得する。具体的には、入力画像のデータ形式に応じて、入力画像を構成するデータ構造に保存された撮影条件群を取得する。なお、上述のように、入力画像に撮影条件が保存されていない場合には、撮影条件取得部４０２は、撮影装置１０や不図示の画像管理システムから撮影条件群を含む撮影情報群を取得することができる。 In step S520, the shooting condition acquisition unit 402 acquires a group of shooting conditions for the input image. Specifically, the shooting condition group stored in the data structure constituting the input image is acquired according to the data format of the input image. Note that, as described above, if the shooting conditions are not stored in the input image, the shooting condition acquisition unit 402 can acquire a group of shooting information including the group of shooting conditions from the shooting device 10 or an image management system (not shown).

ステップＳ５３０においては、高画質化可否判定部４０３が、取得された撮影条件群を用いて、高画質化部４０４に備える高画質化エンジンによって入力画像を高画質化可能であるか否かを判定する。具体的には、高画質化可否判定部４０３は、入力画像の撮影部位、撮影方式、撮影画角、及び画像サイズが、高画質化エンジンによって対処可能な条件と一致するか否かを判定する。 In step S530, the image quality improvement feasibility determination unit 403 uses the acquired group of shooting conditions to determine whether the image quality improvement engine provided in the image quality improvement unit 404 can improve the image quality of the input image. Specifically, the image quality improvement feasibility determination unit 403 determines whether the imaging part, imaging method, imaging angle of view, and image size of the input image match the conditions that the image quality improvement engine can handle.

高画質化可否判定部４０３が、すべての撮影条件を判定し、対処可能と判定された場合には、処理はステップＳ５４０に移行する。一方、高画質化可否判定部４０３が、これら撮影条件に基づいて、高画質化エンジンが入力画像を対処不可能であると判定した場合には、処理はステップＳ５５０に移行する。 When the image quality improvement feasibility determination unit 403 has determined all of the shooting conditions and determined that they can be handled, the process proceeds to step S540. On the other hand, when the image quality improvement feasibility determination unit 403 has determined that the image quality improvement engine cannot handle the input image based on these shooting conditions, the process proceeds to step S550.

なお、画像処理装置４００の設定や実装形態によっては、撮影部位、撮影方式、撮影画角、及び画像サイズのうちの一部に基づいて入力画像が処理不可能であると判定されたとしても、ステップＳ５４０における高画質化処理が実施されてもよい。例えば、高画質化エンジンが、被検者のいずれの撮影部位に対しても網羅的に対応可能であると想定され、入力データに未知の撮影部位が含まれていたとしても対処可能であるように実装されている場合等には、このような処理を行ってもよい。また、高画質化可否判定部４０３は、所望の構成に応じて、入力画像の撮影部位、撮影方式、撮影画角、及び画像サイズのうちの少なくとも一つが高画質化エンジンによって対処可能な条件と一致するか否かを判定してもよい。 Depending on the settings and implementation of the image processing device 400, the image quality improvement process in step S540 may be performed even if it is determined that the input image cannot be processed based on some of the imaging part, imaging method, imaging angle of view, and image size. For example, such processing may be performed when the image quality improvement engine is assumed to be comprehensively compatible with any imaging part of the subject and is implemented to be able to handle even unknown imaging parts included in the input data. Furthermore, the image quality improvement feasibility determination unit 403 may determine whether or not at least one of the imaging part, imaging method, imaging angle of view, and image size of the input image matches a condition that can be handled by the image quality improvement engine, depending on the desired configuration.

ステップＳ５４０においては、高画質化部４０４が、高画質化エンジンを用いて、入力画像を高画質化し、入力画像よりも画像診断に適した高画質画像を生成する。具体的には、高画質化部４０４は、入力画像を高画質化エンジンに入力し、高画質化された高画質画像を生成させる。高画質化エンジンは、教師データを用いて機械学習を行った機械学習モデルに基づいて、入力画像を用いて重ね合わせ処理を行ったような高画質画像を生成する。このため、高画質化エンジンは、入力画像よりも、ノイズ低減されたり、コントラスト強調されたりした高画質画像を生成することができる。 In step S540, the image quality improvement unit 404 uses the image quality improvement engine to improve the image quality of the input image and generate a high-quality image that is more suitable for image diagnosis than the input image. Specifically, the image quality improvement unit 404 inputs the input image to the image quality improvement engine, which generates the high-quality image. Based on a machine learning model trained with the training data, the image quality improvement engine generates a high-quality image that looks as if overlay processing had been performed using the input image. Therefore, the image quality improvement engine can generate a high-quality image with reduced noise or enhanced contrast compared to the input image.

なお、画像処理装置４００の設定や実装形態によっては、高画質化部４０４が、撮影条件群に応じて、高画質化エンジンに入力画像とともにパラメータを入力して、高画質化の程度等を調節してもよい。また、高画質化部４０４は、検者の入力に応じたパラメータを高画質化エンジンに入力画像とともに入力して高画質化の程度等を調整してもよい。 Depending on the settings and implementation of the image processing device 400, the image quality improvement unit 404 may input parameters to the image quality improvement engine together with the input image according to the group of shooting conditions to adjust the degree of image quality improvement, etc. Also, the image quality improvement unit 404 may input parameters according to the examiner's input to the image quality improvement engine together with the input image to adjust the degree of image quality improvement, etc.

ステップＳ５５０では、出力部４０５が、ステップＳ５４０において高画質画像が生成されていれば、高画質画像を出力して、表示部２０に表示させる。一方、ステップＳ５３０において高画質化処理が不可能であるとされていた場合には、入力画像を出力し、表示部２０に表示させる。なお、出力部４０５は、表示部２０に出力画像を表示させるのに代えて、撮影装置１０や他の装置に出力画像を表示させたり、記憶させたりしてもよい。また、出力部４０５は、画像処理装置４００の設定や実装形態によっては、出力画像を撮影装置１０や他の装置が利用可能なように加工したり、画像管理システム等に送信可能なようにデータ形式を変換したりしてもよい。 In step S550, if a high-quality image was generated in step S540, the output unit 405 outputs the high-quality image and displays it on the display unit 20. On the other hand, if it is determined in step S530 that high-quality processing is not possible, the input image is output and displayed on the display unit 20. Note that instead of displaying the output image on the display unit 20, the output unit 405 may cause the image capture device 10 or another device to display or store the output image. Depending on the settings and implementation form of the image processing device 400, the output unit 405 may also process the output image so that it can be used by the image capture device 10 or another device, or convert the data format so that it can be transmitted to an image management system, etc.
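The branch between steps S530, S540, and S550 described above can be summarized in a short sketch. All names here (`ShootingConditions`, `can_improve`, `run_pipeline`, the `supported` dictionary, and the `engine` callable) are hypothetical stand-ins for the determination unit 403, the image quality improvement unit 404, and the output unit 405; the sketch implements the default case in which all four conditions must match.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ShootingConditions:
    part: str            # imaging part, e.g. "macula"
    method: str          # imaging method, e.g. "OCTA"
    angle_of_view: str   # imaging angle of view, e.g. "3x3mm"
    image_size: tuple    # (width, height) in pixels


def can_improve(cond, supported):
    """Step S530: True only if every condition is one the engine handles."""
    return (
        cond.part in supported["part"]
        and cond.method in supported["method"]
        and cond.angle_of_view in supported["angle_of_view"]
        and cond.image_size in supported["image_size"]
    )


def run_pipeline(image, cond, supported, engine):
    """Steps S530 to S550: return (output_image, was_improved)."""
    if can_improve(cond, supported):
        return engine(image), True   # S540, then output in S550
    return image, False              # S550 outputs the unchanged input
```

Returning the `was_improved` flag alongside the image lets the caller decide whether to show a notice that the displayed image was machine generated.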

上記のように、本実施形態に係る画像処理装置４００は、取得部４０１と、高画質化部４０４とを備える。取得部４０１は、被検者の所定部位の画像である入力画像（第１の画像）を取得する。高画質化部４０４は、機械学習エンジンを含む高画質化エンジンを用いて、入力画像から、入力画像と比べてノイズ低減及びコントラスト強調のうちの少なくとも一つがなされた高画質画像（第２の画像）を生成する。高画質化エンジンは、重ね合わせ処理により得られた画像を学習データとした機械学習エンジンを含む。 As described above, the image processing device 400 according to this embodiment includes an acquisition unit 401 and an image quality improvement unit 404. The acquisition unit 401 acquires an input image (first image) that is an image of a specific part of a subject. The image quality improvement unit 404 uses an image quality improvement engine including a machine learning engine to generate a high image quality image (second image) from the input image in which at least one of noise reduction and contrast enhancement has been performed compared to the input image. The image quality improvement engine includes a machine learning engine that uses the image obtained by the overlay process as learning data.

当該構成により、本実施形態に係る画像処理装置４００は、入力画像から、ノイズが低減されていたり、コントラストが強調されていたりする高画質画像を出力することができる。このため、画像処理装置４００は、より明瞭な画像や観察したい部位や病変が強調されている画像等の画像診断に適した画像を、従来と比べて、撮影者や被検者の侵襲性を高めたり、労力を増したりすることなく、より少ない代償で取得することができる。 With this configuration, the image processing device 400 according to this embodiment can output, from the input image, a high-quality image in which noise has been reduced or contrast has been enhanced. As a result, the image processing device 400 can obtain images suitable for image diagnosis, such as clearer images and images in which the area or lesion to be observed is emphasized, at a lower cost than before, without increasing invasiveness or effort for the photographer or the subject.

また、画像処理装置４００は、入力画像に対して、高画質化エンジンを用いて高画質画像を生成できる否かを判定する高画質化可否判定部４０３を更に備える。高画質化可否判定部４０３は、入力画像の撮影部位、撮影方式、撮影画角、及び画像サイズの少なくとも一つに基づいて当該判定を行う。 The image processing device 400 further includes an image quality improvement feasibility determination unit 403 that determines whether a high-quality image can be generated from an input image using the image quality improvement engine. The image quality improvement feasibility determination unit 403 makes this determination based on at least one of the imaging part, imaging method, imaging angle of view, and image size of the input image.

当該構成により、本実施形態に係る画像処理装置４００は、高画質化部４０４が処理できない入力画像を高画質化処理から省くことができ、画像処理装置４００の処理負荷やエラーの発生を低減させることができる。 With this configuration, the image processing device 400 according to this embodiment can exclude input images that cannot be processed by the image quality improvement unit 404 from the image quality improvement process, thereby reducing the processing load on the image processing device 400 and the occurrence of errors.

なお、本実施形態においては、出力部４０５（表示制御部）は、生成された高画質画像を表示部２０に表示させる構成としたが、出力部４０５の動作はこれに限られない。例えば、出力部４０５は、高画質画像を撮影装置１０や画像処理装置４００に接続される他の装置に出力することもできる。このため、高画質画像は、これらの装置のユーザーインターフェースに表示されたり、任意の記憶装置に保存されたり、任意の画像解析に利用されたり、画像管理システムに送信されたりすることができる。 In this embodiment, the output unit 405 (display control unit) is configured to display the generated high-quality image on the display unit 20, but the operation of the output unit 405 is not limited to this. For example, the output unit 405 can also output the high-quality image to the imaging device 10 or other devices connected to the image processing device 400. Therefore, the high-quality image can be displayed on the user interface of these devices, saved in any storage device, used for any image analysis, or sent to an image management system.

本実施形態においては、高画質化可否判定部４０３が、高画質化エンジンによって高画質化可能な入力画像であるか否かを判定して、高画質化可能な入力画像であれば高画質化部４０４が高画質化を行った。これに対し、撮影装置１０によって、高画質化可能な撮影条件でのみ撮影が行なわれる等の場合には、撮影装置１０から取得した画像を無条件に高画質化してもよい。この場合には、図６に示すように、ステップＳ５２０とステップＳ５３０の処理を省き、ステップＳ５１０の次にステップＳ５４０を実施することができる。 In this embodiment, the image quality improvement feasibility determination unit 403 determines whether the input image can be improved by the image quality improvement engine, and if it can, the image quality improvement unit 404 improves the image quality. In contrast, when, for example, the imaging device 10 captures images only under shooting conditions for which image quality improvement is possible, images acquired from the imaging device 10 may be improved unconditionally. In this case, as shown in FIG. 6, steps S520 and S530 can be omitted, and step S540 can be performed immediately after step S510.

なお、本実施形態においては、出力部４０５が、表示部２０に高画質画像を表示させる構成とした。しかしながら、出力部４０５は、検者からの指示に応じて、高画質画像を表示部２０に表示させてもよい。例えば、出力部４０５は、検者が表示部２０のユーザーインターフェース上の任意のボタンを押すことに応じて、高画質画像を表示部２０に表示させてもよい。この場合、出力部４０５は、入力画像と切り替えて高画質画像を表示させてもよいし、入力画像と並べて高画質画像を表示させてもよい。 In this embodiment, the output unit 405 is configured to display a high-quality image on the display unit 20. However, the output unit 405 may instead display the high-quality image on the display unit 20 in response to an instruction from the examiner. For example, the output unit 405 may display the high-quality image on the display unit 20 when the examiner presses a button on the user interface of the display unit 20. In this case, the output unit 405 may display the high-quality image in place of the input image, switching between the two, or may display it alongside the input image.

さらに、出力部４０５は、表示部２０に高画質画像を表示させる際に、表示されている画像が機械学習アルゴリズムを用いた処理により生成された高画質画像であることを示す表示を高画質画像とともに表示させてもよい。この場合には、ユーザーは、当該表示によって、表示された高画質画像が撮影によって取得した画像そのものではないことが容易に識別できるため、誤診断を低減させたり、診断効率を向上させたりすることができる。なお、機械学習アルゴリズムを用いた処理により生成された高画質画像であることを示す表示は、入力画像と当該処理により生成された高画質画像とを識別可能な表示であればどのような態様のものでもよい。 Furthermore, when the output unit 405 displays a high-quality image on the display unit 20, the output unit 405 may display, together with the high-quality image, a display indicating that the displayed image is a high-quality image generated by processing using a machine learning algorithm. In this case, the display allows the user to easily identify that the displayed high-quality image is not the image itself obtained by shooting, thereby reducing misdiagnosis and improving diagnostic efficiency. Note that the display indicating that the high-quality image is generated by processing using a machine learning algorithm may be in any form as long as it allows the user to distinguish between the input image and the high-quality image generated by the processing.

また、出力部４０５は、機械学習アルゴリズムを用いた処理により生成された高画質画像であることを示す表示について、機械学習アルゴリズムがどのような教師データによって学習を行ったものであるかを示す表示を表示部２０に表示させてもよい。当該表示としては、教師データの入力データと出力データの種類の説明や、入力データと出力データに含まれる撮影部位等の教師データに関する任意の表示を含んでよい。 The output unit 405 may also cause the display unit 20 to display, in addition to a display indicating that the image is a high-quality image generated by processing using a machine learning algorithm, a display indicating what kind of training data the machine learning algorithm used to learn. Such a display may include an explanation of the types of input data and output data of the training data, or any display regarding the training data, such as the imaging site included in the input data and output data.

本実施形態に係る高画質化エンジンでは、教師データの出力データとして、重ね合わせ画像を用いたが、教師データはこれに限られない。教師データの出力データとして、高画質画像を得る手段である、重ね合わせ処理や、後述する処理群、後述する撮影方法のうち、少なくとも一つを行うことで得られる高画質画像を用いてもよい。 In the image quality improvement engine according to this embodiment, a superimposed image is used as output data for the teacher data, but the teacher data is not limited to this. As output data for the teacher data, a high-quality image obtained by performing at least one of a superimposition process, a group of processes described below, or an imaging method described below, which are means for obtaining a high-quality image, may be used.

例えば、教師データの出力データとして、元画像群に対して最大事後確率推定処理（ＭＡＰ推定処理）を行うことで得られる高画質画像を用いてもよい。ＭＡＰ推定処理では、複数の低画質画像における各画素値の確率密度から尤度関数を求め、求めた尤度関数を用いて真の信号値（画素値）を推定する。 For example, high-quality images obtained by performing maximum a posteriori probability estimation processing (MAP estimation processing) on a group of original images may be used as output data for the training data. In the MAP estimation processing, a likelihood function is calculated from the probability density of each pixel value in multiple low-quality images, and the true signal value (pixel value) is estimated using the calculated likelihood function.

ＭＡＰ推定処理により得られた高画質画像は、真の信号値に近い画素値に基づいて高コントラストな画像となる。また、推定される信号値は、確率密度に基づいて求められるため、ＭＡＰ推定処理により得られた高画質画像では、ランダムに発生するノイズが低減される。このため、ＭＡＰ推定処理により得られた高画質画像を教師データとして用いることで、高画質化エンジンは、入力画像から、ノイズが低減されたり、高コントラストとなったりした、画像診断に適した高画質画像を生成することができる。なお、教師データの入力データと出力データのペアの生成方法は、重ね合わせ画像を教師データとした場合と同様の方法で行われてよい。 The high-quality image obtained by the MAP estimation process is a high-contrast image based on pixel values close to the true signal value. In addition, since the estimated signal value is calculated based on probability density, randomly occurring noise is reduced in the high-quality image obtained by the MAP estimation process. Therefore, by using the high-quality image obtained by the MAP estimation process as training data, the image quality improvement engine can generate a high-quality image suitable for image diagnosis, with reduced noise and high contrast, from the input image. Note that the method for generating pairs of input data and output data for the training data may be the same as when an overlaid image is used as training data.
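To make the MAP estimation step concrete, the sketch below works through one simple special case for a single pixel. It assumes independent Gaussian noise of known variance on each observation and a Gaussian prior on the true value, for which the MAP estimate has a closed form; the text does not specify the noise model or the prior, so these choices and the function name are illustrative assumptions only.

```python
def map_estimate(samples, noise_var, prior_mean, prior_var):
    """MAP estimate of one pixel's true value under a Gaussian model.

    samples:    values of the same pixel across the aligned original images
    noise_var:  assumed variance of the per-image Gaussian noise
    prior_mean, prior_var: assumed Gaussian prior on the true pixel value

    Maximizing prior times likelihood gives a precision-weighted average
    of the prior mean and the observations.
    """
    precision = 1.0 / prior_var + len(samples) / noise_var
    weighted = prior_mean / prior_var + sum(samples) / noise_var
    return weighted / precision
```

With a very broad prior the estimate approaches the plain average of the samples, which is consistent with the remark that random noise is reduced in much the same way as in the overlaid image.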

A high-quality image obtained by applying smoothing filter processing to the original image may also be used as the output data of the training data. In this case, the image quality improvement engine can generate, from an input image, a high-quality image in which random noise is reduced. Furthermore, an image obtained by applying tone conversion processing to the original image may be used as the output data of the training data. In this case, the image quality improvement engine can generate, from an input image, a contrast-enhanced high-quality image. Pairs of input data and output data for the training data may be generated in the same way as when a superimposed image is used as the training data.
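The two kinds of target image mentioned above can be sketched as follows. This is only an illustrative example: the 3x3 box filter stands in for the smoothing filter, and a linear contrast stretch stands in for the tone conversion. Both choices, and the function names, are assumptions that do not appear in the source.

```python
def box_filter(image):
    """Smooth a 2-D image with a 3x3 mean filter (windows are clipped at the borders)."""
    height, width = len(image), len(image[0])
    smoothed = []
    for y in range(height):
        row = []
        for x in range(width):
            window = [
                image[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
            ]
            row.append(sum(window) / len(window))
        smoothed.append(row)
    return smoothed


def stretch_contrast(image, out_min=0.0, out_max=255.0):
    """Linearly map the image's intensity range onto [out_min, out_max]."""
    values = [v for row in image for v in row]
    lo, hi = min(values), max(values)
    if hi == lo:
        # A flat image carries no contrast to stretch.
        return [[out_min for _ in row] for row in image]
    scale = (out_max - out_min) / (hi - lo)
    return [[out_min + (v - lo) * scale for v in row] for row in image]
```

Either output could be paired with the unprocessed original image to form one input/output pair of training data.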

The input data of the training data may be an image acquired from an imaging device having the same image quality tendency as the imaging device 10. The output data of the training data may be a high-quality image obtained by high-cost processing such as iterative reconstruction (successive approximation), or a high-quality image acquired by imaging the subject corresponding to the input data with an imaging device of higher performance than the imaging device 10. Furthermore, the output data may be a high-quality image obtained by rule-based noise reduction processing. Here, the noise reduction processing can include, for example, replacing a single isolated high-luminance pixel that appears in a low-luminance region and is clearly noise with the average of the neighboring low-luminance pixel values. Accordingly, the image quality improvement engine may use, as learning data, an image captured by an imaging device of higher performance than the imaging device used to capture the input image, or an image acquired by an imaging process requiring more man-hours than the imaging process of the input image. For example, when a motion contrast front image is used as the input image, the image quality improvement engine may use, as learning data, an image obtained by OCTA imaging with an OCT imaging device of higher performance than the OCT imaging device used for OCTA imaging of the input image, or an image acquired by an OCTA imaging process requiring more man-hours than the OCTA imaging process of the input image.
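The rule-based noise reduction given as an example above (replacing a single bright pixel inside a dark region with the mean of its dark neighbours) might look like the following sketch. The thresholds `low_max` and `high_min` and the 8-neighbour window are assumptions chosen for illustration; the source does not specify them.

```python
def remove_isolated_bright_pixels(image, low_max, high_min):
    """Replace a lone bright pixel surrounded only by dark pixels with their mean."""
    height, width = len(image), len(image[0])
    result = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            if image[y][x] < high_min:
                continue  # not a high-luminance pixel
            neighbours = [
                image[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            ]
            # Treat the pixel as noise only when every neighbour is low-luminance.
            if neighbours and all(v <= low_max for v in neighbours):
                result[y][x] = sum(neighbours) / len(neighbours)
    return result
```

Bright structures spanning two or more adjacent pixels are left untouched, because at least one neighbour then exceeds `low_max`.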

Although omitted from the description of this embodiment, a high-quality image that is generated from multiple images and used as the output data of the training data can be generated from multiple images that have already been aligned. As the alignment processing, for example, one of the multiple images may be selected as a template, the similarity with each of the other images may be computed while changing the position and angle of the template, the amount of misalignment relative to the template may be determined, and each image may be corrected based on that amount. Any other existing alignment processing may also be used.
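The template-based alignment described above can be sketched as an exhaustive search over candidate shifts. This simplified example handles translation only (the angle search mentioned in the text is omitted) and uses mean squared difference as the similarity measure; both simplifications and all function names are assumptions.

```python
def mean_squared_difference(template, image, dy, dx):
    """Mean squared difference over the overlap when the template is shifted by (dy, dx)."""
    height, width = len(template), len(template[0])
    total, count = 0.0, 0
    for y in range(height):
        for x in range(width):
            ty, tx = y - dy, x - dx
            if 0 <= ty < height and 0 <= tx < width:
                total += (image[y][x] - template[ty][tx]) ** 2
                count += 1
    return total / count if count else float("inf")


def estimate_shift(template, image, max_shift):
    """Return the (dy, dx) at which the shifted template best matches the image."""
    candidates = [
        (dy, dx)
        for dy in range(-max_shift, max_shift + 1)
        for dx in range(-max_shift, max_shift + 1)
    ]
    return min(candidates, key=lambda s: mean_squared_difference(template, image, *s))


def correct_shift(image, shift, fill=0.0):
    """Undo the estimated shift so that the image lines up with the template."""
    dy, dx = shift
    height, width = len(image), len(image[0])
    return [
        [
            image[y + dy][x + dx]
            if 0 <= y + dy < height and 0 <= x + dx < width
            else fill
            for x in range(width)
        ]
        for y in range(height)
    ]
```

Each non-template image would be passed through `estimate_shift` and `correct_shift` before the images are combined into a superimposed image.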

When three-dimensional images are aligned, each three-dimensional image may be decomposed into multiple two-dimensional images, and the three-dimensional alignment may be performed by aligning each two-dimensional image and then integrating the results. Similarly, a two-dimensional image may be decomposed into one-dimensional images, and the two-dimensional alignment may be performed by aligning each one-dimensional image and then integrating the results. These alignments may also be performed on the data used to generate the images rather than on the images themselves.

In this embodiment, when the image quality improvement possibility determination unit 403 determines that the input image can be handled by the image quality improvement unit 404, the process proceeds to step S540 and the image quality improvement unit 404 starts the image quality improvement processing. Alternatively, the output unit 405 may cause the display unit 20 to display the determination result of the image quality improvement possibility determination unit 403, and the image quality improvement unit 404 may start the image quality improvement processing in response to an instruction from the examiner. In this case, the output unit 405 can display, together with the determination result, the input image and the imaging conditions acquired for the input image, such as the imaging site. Because the image quality improvement processing is then performed only after the examiner has judged whether the determination result is correct, image quality improvement processing based on an erroneous determination can be reduced.

Alternatively, the determination by the image quality improvement possibility determination unit 403 may be skipped: the output unit 405 may cause the display unit 20 to display the input image and the imaging conditions acquired for the input image, such as the imaging site, and the image quality improvement unit 404 may start the image quality improvement processing in response to an instruction from the examiner.

＜Second Embodiment＞
Next, an image processing device according to a second embodiment will be described with reference to FIGS. 4 and 7. In the first embodiment, the image quality improvement unit 404 includes a single image quality improvement engine. In contrast, in this embodiment, the image quality improvement unit includes multiple image quality improvement engines that have undergone machine learning using different training data, and generates multiple high-quality images for an input image.

Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as those of the image processing device 400 according to the first embodiment. The following description therefore focuses on the differences from the image processing device according to the first embodiment. Because the configuration of the image processing device according to this embodiment is the same as that of the first embodiment, the components shown in FIG. 4 are denoted by the same reference numerals and their description is omitted.

The image quality improvement unit 404 according to this embodiment includes two or more image quality improvement engines, each of which has undergone machine learning using different training data. A method for creating the training data group according to this embodiment is as follows. First, a group of pairs is prepared, each pair consisting of an original image as input data and a superimposed image as output data, covering various imaging sites. Next, the training data group is created by grouping the pairs by imaging site. For example, the training data group consists of first training data made up of pairs acquired by imaging a first imaging site, second training data made up of pairs acquired by imaging a second imaging site, and so on.

Each set of training data is then used to train a separate image quality improvement engine. For example, a group of image quality improvement engines is prepared, such as a first image quality improvement engine corresponding to a machine learning model trained on the first training data and a second image quality improvement engine corresponding to a machine learning model trained on the second training data.
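The grouping and per-group training described above can be sketched as follows. The record layout `(site, low_quality, high_quality)` and the caller-supplied `train` function are hypothetical; the actual training procedure is outside the scope of this sketch.

```python
from collections import defaultdict


def group_pairs_by_site(records):
    """Group (site, low_quality, high_quality) records into per-site training data."""
    groups = defaultdict(list)
    for site, low_quality, high_quality in records:
        groups[site].append((low_quality, high_quality))
    return dict(groups)


def build_engines(training_sets, train):
    """Train one engine per training set; `train` is a caller-supplied function."""
    return {site: train(pairs) for site, pairs in training_sets.items()}
```

The same grouping key could be replaced by the imaging angle of view, the image resolution, or a tuple combining several conditions.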

Because the training data used to train the corresponding machine learning models differ, the degree to which each of these image quality improvement engines can improve an input image depends on the imaging conditions of that image. Specifically, the first image quality improvement engine achieves a high degree of improvement for input images acquired by imaging the first imaging site and a low degree of improvement for images acquired by imaging the second imaging site. Likewise, the second image quality improvement engine achieves a high degree of improvement for input images acquired by imaging the second imaging site and a low degree of improvement for images acquired by imaging the first imaging site.

Because each set of training data consists of pairs grouped by imaging site, the images that make up each set have similar image quality tendencies. For its corresponding imaging site, an image quality improvement engine can therefore improve image quality more effectively than the image quality improvement engine according to the first embodiment. The imaging condition used to group the training data pairs is not limited to the imaging site; it may be the imaging angle of view, the image resolution, or a combination of two or more of these.

The series of image processing operations according to this embodiment will now be described with reference to FIG. 7, which is a flow diagram of that series of operations. Steps S710 and S720 are the same as steps S510 and S520 of the first embodiment, so their description is omitted. If the image quality of the input image is to be improved unconditionally, step S730 may be skipped after step S720 and the process may proceed directly to step S740.

When the imaging conditions of the input image have been acquired in step S720, the process proceeds to step S730. In step S730, the image quality improvement possibility determination unit 403 uses the imaging condition group acquired in step S720 to determine whether any of the image quality improvement engines included in the image quality improvement unit 404 can handle the input image.

If the image quality improvement possibility determination unit 403 determines that none of the image quality improvement engines can handle the input image, the process proceeds to step S760. If it determines that at least one of the image quality improvement engines can handle the input image, the process proceeds to step S740. Depending on the settings and implementation of the image processing device 400, step S740 may be performed, as in the first embodiment, even if it is determined that some of the imaging conditions cannot be handled by the image quality improvement engines.

In step S740, the image quality improvement unit 404 selects, from the group of image quality improvement engines, the engine to be used for the image quality improvement processing, based on the imaging conditions of the input image acquired in step S720 and information on the training data of each engine. Specifically, for example, for the imaging site in the imaging condition group acquired in step S720, the unit selects an engine whose training data information covers the same imaging site or surrounding imaging sites and which therefore achieves a high degree of image quality improvement. In the example above, when the imaging site is the first imaging site, the image quality improvement unit 404 selects the first image quality improvement engine.

In step S750, the image quality improvement unit 404 uses the engine selected in step S740 to generate a high-quality image from the input image. Then, in step S760, if a high-quality image was generated in step S750, the output unit 405 outputs the high-quality image and causes the display unit 20 to display it. If, on the other hand, it was determined in step S730 that image quality improvement processing is not possible, the output unit 405 outputs the input image and causes the display unit 20 to display it. When displaying the high-quality image on the display unit 20, the output unit 405 may also indicate that it was generated using the image quality improvement engine selected by the image quality improvement unit 404.
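The flow of steps S730 to S760 can be summarized in the following sketch. It assumes, for illustration only, that engines are callables stored in a dictionary keyed by imaging site and that the imaging conditions arrive as a dictionary with a `"site"` entry; neither structure is specified in the source.

```python
def process_image(input_image, conditions, engines):
    """Sketch of steps S730 to S760: returns (image_to_display, engine_key_or_None)."""
    site = conditions.get("site")
    # S730: determine whether any engine in the group can handle the input image.
    if site not in engines:
        # S760 without enhancement: the input image itself is output.
        return input_image, None
    # S740: select the engine whose training data matches the imaging site.
    engine = engines[site]
    # S750: generate the high-quality image with the selected engine.
    high_quality = engine(input_image)
    # S760: output the high-quality image together with the engine that produced it.
    return high_quality, site
```

Returning the engine key alongside the image makes it straightforward to show on the display which engine generated the result.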

As described above, the image quality improvement unit 404 according to this embodiment includes multiple image quality improvement engines, each trained using different learning data. Each of the engines has been trained using learning data that differs in at least one of the imaging site, the imaging angle of view, front images at different depths, and the image resolution. The image quality improvement unit 404 generates a high-quality image using the engine that corresponds to at least one of the imaging site, the imaging angle of view, front images at different depths, and the image resolution of the input image.

With this configuration, the image processing device 400 according to this embodiment can generate more effective high-quality images.

In this embodiment, the image quality improvement unit 404 selects the engine used for the image quality improvement processing based on the imaging conditions of the input image, but the selection is not limited to this. For example, the output unit 405 may display the acquired imaging conditions of the input image and the group of image quality improvement engines on the user interface of the display unit 20, and the image quality improvement unit 404 may select the engine to be used in response to an instruction from the examiner. The output unit 405 may also display, together with the group of engines, information on the training data used to train each engine. This information may be displayed in any form; for example, each engine may be displayed under a name related to the training data used to train it.

The output unit 405 may also display the engine selected by the image quality improvement unit 404 on the user interface of the display unit 20 and accept an instruction from the examiner. In this case, the image quality improvement unit 404 may decide, according to the examiner's instruction, whether to finalize that engine as the one used for the image quality improvement processing.

As in the first embodiment, the output unit 405 may output the generated high-quality image to the imaging device 10 or to another device connected to the image processing device 400. Also as in the first embodiment, the output data of the training data for the image quality improvement engines is not limited to high-quality images obtained by superimposition processing. That is, a high-quality image obtained by at least one of the following processes and imaging methods may be used: superimposition processing, MAP estimation processing, smoothing filter processing, tone conversion processing, imaging with a high-performance imaging device, high-cost processing, and noise reduction processing.

＜Third Embodiment＞
Next, an image processing device according to a third embodiment will be described with reference to FIGS. 4 and 7. In the first and second embodiments, the imaging condition acquisition unit 402 acquires the imaging condition group from, for example, the data structure of the input image. In contrast, in this embodiment, the imaging condition acquisition unit uses an imaging location estimation engine to estimate the imaging site or imaging region of the input image from the input image itself.

Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as those of the image processing device 400 according to the second embodiment. The following description therefore focuses on the differences from the image processing device according to the second embodiment. Because the configuration of the image processing device according to this embodiment is the same as that of the first and second embodiments, the components shown in FIG. 4 are denoted by the same reference numerals and their description is omitted.

The imaging condition acquisition unit 402 according to this embodiment includes an imaging location estimation engine that estimates the imaging site or imaging region depicted in the input image acquired by the acquisition unit 401. The estimation method used by this engine performs estimation processing using a machine learning algorithm.

In this embodiment, the machine learning model used for imaging location estimation is trained with training data consisting of pairs of input data, which is an image, and output data, which is the imaging site label or imaging region label corresponding to that image. Here, the input data is an image having the specific imaging conditions assumed for the processing target (input image). The input data is preferably an image acquired from an imaging device having the same image quality tendency as the imaging device 10, and more preferably from the same model with the same settings as the imaging device 10. The types of imaging site labels and imaging region labels used as output data may be imaging sites and imaging regions that are at least partially included in the input data. For OCT, for example, the types of imaging site labels may be "macula", "optic nerve head", "macula and optic nerve head", and "other".

Having been trained with such training data, the imaging location estimation engine according to this embodiment can output which imaging site or imaging region is depicted in an input image. The engine can also output, for each imaging site label or imaging region label at the required level of detail, the probability that the image shows that site or region. By using the imaging location estimation engine, the imaging condition acquisition unit 402 can estimate the imaging site or imaging region of the input image from the input image and acquire it as an imaging condition for the input image. When the engine outputs a probability for each imaging site label or imaging region label, the imaging condition acquisition unit 402 acquires the imaging site or imaging region with the highest probability as the imaging condition of the input image.

Next, as in the second embodiment, the series of image processing operations according to this embodiment will be described with reference to the flow diagram of FIG. 7. Step S710 and steps S730 to S760 of this embodiment are the same as in the second embodiment, so their description is omitted. If the image quality of the input image is to be improved unconditionally, step S730 may be skipped after step S720 and the process may proceed directly to step S740.

When the input image has been acquired in step S710, the process proceeds to step S720. In step S720, the imaging condition acquisition unit 402 acquires the imaging condition group of the input image acquired in step S710.

Specifically, the imaging condition group stored in the data structure of the input image is acquired according to the data format of the input image. If the imaging condition group contains no information on the imaging site or imaging region, the imaging condition acquisition unit 402 inputs the input image to the imaging location estimation engine and estimates which imaging site was imaged to obtain the input image. More specifically, the imaging condition acquisition unit 402 inputs the input image to the imaging location estimation engine, evaluates the probability output for each label in the imaging site label group, and sets and acquires the imaging site with the highest probability as an imaging condition of the input image.
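The fallback from stored metadata to the estimation engine can be sketched as follows. The metadata key `"site"` and the callable `estimate_site_probabilities`, which is assumed to return a dictionary mapping each imaging site label to a probability, are hypothetical stand-ins for the data structure and the estimation engine.

```python
def acquire_imaging_site(metadata, image, estimate_site_probabilities):
    """Use the stored imaging site if present; otherwise take the most probable label."""
    site = metadata.get("site")
    if site is not None:
        return site
    # No site information was stored with the image, so fall back to the estimator.
    probabilities = estimate_site_probabilities(image)
    return max(probabilities, key=probabilities.get)
```

A variant that always runs the estimator and shows both the stored and the estimated site to the examiner would only need to drop the early return.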

If imaging conditions other than the imaging site and imaging region are not stored in the input image, the imaging condition acquisition unit 402 can acquire an imaging information group including the imaging condition group from the imaging device 10 or from an image management system (not shown).

The subsequent processing is the same as the series of image processing operations according to the second embodiment, so its description is omitted.

As described above, the imaging condition acquisition unit 402 according to this embodiment functions as an estimation unit that estimates at least one of the imaging site and the imaging region of the input image. The imaging condition acquisition unit 402 includes an imaging location estimation engine trained on images labeled with imaging sites and imaging regions, and estimates the imaging site or imaging region of the input image by inputting the input image to that engine.

As a result, the image processing device 400 according to this embodiment can acquire the imaging conditions concerning the imaging site and imaging region of the input image from the input image itself.

In this embodiment, the imaging condition acquisition unit 402 uses the imaging location estimation engine to estimate the imaging site or imaging region of the input image when the imaging condition group contains no information on them. However, the situations in which the engine is used are not limited to this. The imaging condition acquisition unit 402 may also use the imaging location estimation engine when the information on the imaging site or imaging region contained in the data structure of the input image lacks the required level of detail.

The imaging condition acquisition unit 402 may also use the imaging location estimation engine to estimate the imaging site or imaging region of the input image regardless of whether the data structure of the input image contains such information. In this case, the output unit 405 may cause the display unit 20 to display both the estimation result output by the engine and the information on the imaging site or imaging region contained in the data structure of the input image, and the imaging condition acquisition unit 402 may determine these imaging conditions according to an instruction from the examiner.

＜Fourth Embodiment＞
Next, an image processing device according to a fourth embodiment will be described with reference to FIGS. 4, 5, 8 and 9. In this embodiment, the image quality improvement unit enlarges or reduces the input image to an image size that the image quality improvement engine can handle. The image quality improvement unit then generates the high-quality image by reducing or enlarging the output image of the engine so that its image size matches that of the input image.
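The resize, enhance, and resize-back sequence outlined above can be sketched as follows. Nearest-neighbour interpolation is used here only to keep the example short; the source does not state which interpolation method is used, and the function names and the `engine_size` parameter are hypothetical.

```python
def resize_nearest(image, new_height, new_width):
    """Resize a 2-D image with nearest-neighbour sampling."""
    height, width = len(image), len(image[0])
    return [
        [image[y * height // new_height][x * width // new_width] for x in range(new_width)]
        for y in range(new_height)
    ]


def enhance_with_fixed_size(image, engine, engine_size):
    """Resize to the engine's fixed size, enhance, then restore the original size."""
    height, width = len(image), len(image[0])
    resized = resize_nearest(image, *engine_size)
    enhanced = engine(resized)
    return resize_nearest(enhanced, height, width)
```

Here `engine` is any callable that maps an image of size `engine_size` to an image of the same size.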

Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as those of the image processing device 400 according to the first embodiment. The following description therefore focuses on the differences from the image processing device according to the first embodiment. Because the configuration of the image processing device according to this embodiment is the same as that of the first embodiment, the components shown in FIG. 4 are denoted by the same reference numerals and their description is omitted.

本実施形態に係る高画質化部４０４には、第１の実施形態に係る高画質化エンジンと同様の、高画質化エンジンが備えられている。ただし、本実施形態では、高画質化エンジンの学習に用いる教師データとして、入力データの画像及び出力データの画像を一定の画像サイズになるように拡大又は縮小した画像群により構成した、入力データと出力データのペア群を用いている。 The image quality improvement unit 404 according to this embodiment is equipped with an image quality improvement engine similar to the image quality improvement engine according to the first embodiment. However, in this embodiment, as the teacher data used for learning the image quality improvement engine, a group of pairs of input data and output data, which are composed of a group of images in which the input data images and the output data images are enlarged or reduced to a certain image size, are used.

ここで、図８を参照して、本実施形態に係る高画質化エンジンの教師データについて説明する。図８に示すように、例えば、教師データについて設定された一定の画像サイズより小さな低画質画像Ｉｍ８１０と高画質画像Ｉｍ８２０とがある場合を考える。この場合、教師データについて設定された一定の画像サイズとなるように、低画質画像Ｉｍ８１０及び高画質画像Ｉｍ８２０のそれぞれを拡大する。そして、拡大した低画質画像Ｉｍ８１１と拡大した高画質画像Ｉｍ８２１とをペアとして、当該ペアを教師データの一つとして用いる。 Now, referring to FIG. 8, the teacher data of the image quality improvement engine according to this embodiment will be described. As shown in FIG. 8, for example, consider the case where there is a low-quality image Im810 and a high-quality image Im820 that are smaller than a certain image size set for the teacher data. In this case, the low-quality image Im810 and the high-quality image Im820 are each enlarged so that they become the certain image size set for the teacher data. Then, the enlarged low-quality image Im811 and the enlarged high-quality image Im821 are paired, and the pair is used as one piece of teacher data.

なお、第１の実施形態と同様に、教師データの入力データには、処理対象（入力画像）として想定される特定の撮影条件を持つ画像を用いるが、当該特定の撮影条件は、予め決定された撮影部位、撮影方式、及び撮影画角である。つまり、本実施形態に係る当該特定の撮影条件には、第１の実施形態と異なり、画像サイズは含まれない。 As in the first embodiment, the input data of the teacher data uses images having the specific imaging conditions assumed for the processing target (input image); here, the specific imaging conditions are a predetermined imaging part, imaging method, and imaging angle of view. In other words, unlike the first embodiment, the specific imaging conditions in this embodiment do not include image size.

本実施形態に係る高画質化部４０４は、このような教師データで学習が行われた高画質化エンジンを用いて、入力画像を高画質化して高画質画像を生成する。この際、高画質化部４０４は、入力画像を教師データについて設定された一定の画像サイズになるように拡大又は縮小した変形画像を生成し、変形画像を高画質化エンジン入力する。また、高画質化部４０４は、高画質化エンジンからの出力画像を入力画像の画像サイズになるように縮小又は拡大し、高画質画像を生成する。このため、本実施形態に係る高画質化部４０４は、第１の実施形態では対処できなかった画像サイズの入力画像であっても、高画質化エンジンによって高画質化して高画質画像を生成することができる。 The image quality improvement unit 404 according to this embodiment uses an image quality improvement engine that has been trained using such teacher data to improve the image quality of an input image and generate a high-image quality image. At this time, the image quality improvement unit 404 generates a deformed image by enlarging or reducing the input image so that it has a certain image size set for the teacher data, and inputs the deformed image to the image quality improvement engine. The image quality improvement unit 404 also reduces or enlarges the output image from the image quality improvement engine so that it has the image size of the input image, generating a high-image quality image. Therefore, the image quality improvement unit 404 according to this embodiment can use the image quality improvement engine to improve the image quality of even an input image of an image size that could not be handled in the first embodiment, and generate a high-image quality image.

次に、図５及び９を参照して、本実施形態に係る一連の画像処理について説明する。図９は、本実施形態に係る高画質化処理のフロー図である。なお、本実施形態に係るステップＳ５１０、ステップＳ５２０、及びステップＳ５５０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、画像サイズ以外の撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to Figs. 5 and 9. Fig. 9 is a flow diagram of the image quality improvement processing according to this embodiment. The processing of steps S510, S520, and S550 in this embodiment is the same as in the first embodiment, so its description is omitted. If the image quality of the input image is to be improved unconditionally with respect to imaging conditions other than image size, step S530 may be skipped after step S520 and the processing may proceed directly to step S540.

ステップＳ５２０において、第１の実施形態と同様に、撮影条件取得部４０２が入力画像の撮影条件群を取得したら処理はステップＳ５３０に移行する。ステップＳ５３０では、高画質化可否判定部４０３が、取得された撮影条件群を用いて、高画質化部４０４に備える高画質化エンジンが入力画像を対処可能であるか否かを判定する。具体的には、高画質化可否判定部４０３は、入力画像の撮影条件について、高画質化エンジンが対処可能な、撮影部位、撮影方式、及び撮影画角であるか否かを判定する。高画質化可否判定部４０３は、第１の実施形態と異なり、画像サイズは判定しない。 In step S520, as in the first embodiment, when the imaging condition acquisition unit 402 acquires a group of imaging conditions for the input image, the process proceeds to step S530. In step S530, the image quality improvement feasibility determination unit 403 uses the acquired group of imaging conditions to determine whether the image quality improvement engine provided in the image quality improvement unit 404 can handle the input image. Specifically, the image quality improvement feasibility determination unit 403 determines whether the imaging conditions of the input image are imaging body part, imaging method, and imaging angle of view that the image quality improvement engine can handle. Unlike the first embodiment, the image quality improvement feasibility determination unit 403 does not determine the image size.

高画質化可否判定部４０３が、撮影部位、撮影方式、及び撮影画角について判定し、入力画像が対処可能と判定された場合には、処理はステップＳ５４０に移行する。一方、高画質化可否判定部４０３が、これら撮影条件に基づいて、高画質化エンジンが入力画像を対処不可能であると判定した場合には、処理はステップＳ５５０に移行する。なお、画像処理装置４００の設定や実装形態によっては、撮影部位、撮影方式、及び撮影画角のうちの一部に基づいて入力画像が処理不可能であると判定されたとしても、ステップＳ５４０における高画質化処理が実施されてもよい。 If the image quality improvement feasibility determination unit 403 evaluates the imaging part, imaging method, and imaging angle of view and determines that the input image can be handled, the process proceeds to step S540. On the other hand, if the image quality improvement feasibility determination unit 403 determines, based on these imaging conditions, that the image quality improvement engine cannot handle the input image, the process proceeds to step S550. Note that, depending on the settings and implementation form of the image processing device 400, the image quality improvement process in step S540 may be performed even if the input image is determined to be unprocessable based on some of the imaging part, imaging method, and imaging angle of view.
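
The determination in step S530 can be pictured as a membership test over the imaging part, imaging method, and imaging angle of view, with the image size deliberately left out. The following is only a minimal sketch under stated assumptions: the `ImagingConditions` class, the `can_enhance` function, and the entries in `SUPPORTED` are hypothetical names and made-up values for illustration, not something the description specifies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ImagingConditions:
    """Hypothetical container for the imaging conditions acquired in S520."""
    part: str                 # imaging part, e.g. "macula"
    method: str               # imaging method, e.g. "OCT"
    angle_of_view_deg: float  # imaging angle of view
    image_size: tuple         # (height, width); not used by the check below


# Made-up (part, method, angle) combinations the engine is assumed to handle.
SUPPORTED = {("macula", "OCT", 30.0), ("optic_disc", "OCT", 30.0)}


def can_enhance(cond, supported=SUPPORTED):
    """S530 sketch: decide from part, method, and angle only; size is ignored."""
    return (cond.part, cond.method, cond.angle_of_view_deg) in supported
```

Two inputs that differ only in `image_size` receive the same answer, which mirrors the statement that the image size is not evaluated in this embodiment.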

処理がステップＳ５４０に移行すると、図９に示される本実施形態に係る高画質化処理が開始される。本実施形態に係る高画質化処理では、まず、ステップＳ９１０において、高画質化部４０４が、入力画像を教師データについて設定された一定の画像サイズに拡大又は縮小し、変形画像を生成する。 When the process proceeds to step S540, the image quality improvement process according to this embodiment shown in FIG. 9 is started. In the image quality improvement process according to this embodiment, first, in step S910, the image quality improvement unit 404 enlarges or reduces the input image to a certain image size set for the training data, and generates a deformed image.

次に、ステップＳ９２０において、高画質化部４０４は、生成した変形画像を高画質化エンジンに入力し高画質化された高画質な変形画像を取得する。 Next, in step S920, the image quality improvement unit 404 inputs the generated deformed image to an image quality improvement engine to obtain a high-image-quality deformed image.

その後、ステップＳ９３０において、高画質化部４０４は、高画質な変形画像を入力画像の画像サイズに縮小又は拡大し、高画質画像を生成する。高画質化部４０４がステップＳ９３０において高画質画像を生成したら、本実施形態に係る高画質化処理は終了し、処理はステップＳ５５０に移行する。ステップＳ５５０の処理は、第１の実施形態のステップＳ５５０と同様であるため説明を省略する。 Then, in step S930, the image quality improvement unit 404 reduces or enlarges the high-quality transformed image to the image size of the input image to generate a high-quality image. Once the image quality improvement unit 404 generates a high-quality image in step S930, the image quality improvement process according to this embodiment ends, and the process proceeds to step S550. The process of step S550 is similar to step S550 in the first embodiment, and therefore a description thereof will be omitted.

上記のように、本実施形態に係る高画質化部４０４は、入力画像の画像サイズを、高画質化エンジンが対処可能な画像サイズに調整して高画質化エンジンに入力する。また、高画質化部４０４は、高画質化エンジンからの出力画像を入力画像の元の画像サイズに調整することで高画質画像を生成する。これにより、本実施形態の画像処理装置４００は、高画質化エンジンを用いて、第１の実施形態では対処できなかった画像サイズの入力画像についても高画質化して、画像診断に適切な高画質画像を生成することができる。 As described above, the image quality improvement unit 404 according to this embodiment adjusts the image size of the input image to an image size that the image quality improvement engine can handle, and inputs the image to the image quality improvement engine. The image quality improvement unit 404 also generates a high-quality image by adjusting the output image from the image quality improvement engine to the original image size of the input image. As a result, the image processing device 400 according to this embodiment can use the image quality improvement engine to improve the image quality of input images of image sizes that could not be handled in the first embodiment, and generate high-quality images suitable for image diagnosis.
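
Steps S910 to S930 can be sketched as a resize, an engine call, and a resize back. The sketch below assumes a 2-D image held as a list of rows and uses nearest-neighbour interpolation purely for brevity; the description does not specify the interpolation method, and `resize_nearest`, `enhance_via_fixed_size`, and `identity_engine` are hypothetical names (the identity function merely stands in for a trained engine).

```python
def resize_nearest(image, out_h, out_w):
    """Nearest-neighbour resize of a 2-D image stored as a list of rows."""
    in_h, in_w = len(image), len(image[0])
    return [
        [image[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
        for r in range(out_h)
    ]


def enhance_via_fixed_size(image, engine, engine_size):
    """Resize to the engine's fixed size, enhance, then restore the input size."""
    in_h, in_w = len(image), len(image[0])
    engine_h, engine_w = engine_size
    deformed = resize_nearest(image, engine_h, engine_w)  # S910: deformed image
    enhanced = engine(deformed)                           # S920: engine output
    return resize_nearest(enhanced, in_h, in_w)           # S930: back to input size


def identity_engine(image):
    """Placeholder for the trained engine (an assumption for this sketch)."""
    return [row[:] for row in image]
```

For example, `enhance_via_fixed_size(img, identity_engine, (4, 4))` always hands the engine a 4 x 4 image and returns an image with the same height and width as `img`.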

なお、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置４００に接続される他の装置に出力してもよい。また、高画質化エンジンの教師データの出力データは、第１の実施形態と同様に、重ね合わせ処理を行った高画質画像に限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 The output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 400, as in the first embodiment. The output data of the teacher data of the image quality improvement engine is not limited to a high-quality image that has been subjected to overlay processing, as in the first embodiment. That is, a high-quality image obtained by performing at least one of a group of processes or imaging methods, such as overlay processing, MAP estimation processing, smoothing filter processing, tone conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing, may be used.

＜第５の実施形態＞ Fifth Embodiment
次に、図４、５、１０及び１１を参照して、第５の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が、高画質化エンジンによる一定の解像度を基準とした高画質化処理により高画質画像を生成する。 Next, an image processing device according to a fifth embodiment will be described with reference to Figures 4, 5, 10 and 11. In this embodiment, an image quality improvement unit generates a high-quality image by image quality improvement processing using an image quality improvement engine with a certain resolution as a reference.

本実施形態に係る高画質化部４０４には、第１の実施形態と同様の、高画質化エンジンが備えられている。ただし、本実施形態では、高画質化エンジンの学習に用いる教師データが第１の実施形態における教師データと異なる。具体的には、教師データの入力データと出力データとのペア群を構成する画像群の解像度が一定の解像度となるような画像サイズに当該画像群を拡大又は縮小した後、十分に大きい一定の画像サイズとなるようにパディングしている。ここで、画像群の解像度とは、例えば、撮影装置の空間分解能や撮影領域に対する解像度をいう。 The image quality improvement unit 404 according to this embodiment is equipped with an image quality improvement engine similar to that of the first embodiment. However, in this embodiment, the training data used for training the image quality improvement engine is different from the training data in the first embodiment. Specifically, the image group constituting the pair group of input data and output data of the training data is enlarged or reduced to an image size such that the resolution of the image group becomes a certain resolution, and then padding is performed to obtain a sufficiently large and constant image size. Here, the resolution of the image group refers to, for example, the spatial resolution of the imaging device or the resolution for the imaging area.

ここで、図１０を参照して、本実施形態に係る高画質化エンジンの教師データについて説明する。図１０に示すように、例えば、教師データについて設定された一定の解像度より低い解像度を持つ低画質画像Ｉｍ１０１０と高画質画像Ｉｍ１０２０とがある場合を考える。この場合、教師データについて設定された一定の解像度となるように、低画質画像Ｉｍ１０１０と高画質画像Ｉｍ１０２０のそれぞれを拡大する。さらに、拡大された低画質画像Ｉｍ１０１０と高画質画像Ｉｍ１０２０のそれぞれについて、教師データについて設定された一定の画像サイズとなるようにパディングする。そして、拡大及びパディングが行われた低画質画像Ｉｍ１０１１と高画質画像Ｉｍ１０２１とをペアとし、当該ペアを教師データの一つとして用いる。 Now, referring to FIG. 10, the teacher data of the image quality improvement engine according to this embodiment will be described. As shown in FIG. 10, for example, consider the case where there is a low-quality image Im1010 and a high-quality image Im1020, both of which have a resolution lower than the fixed resolution set for the teacher data. In this case, the low-quality image Im1010 and the high-quality image Im1020 are each enlarged so that they have the fixed resolution set for the teacher data. Furthermore, the enlarged low-quality image Im1010 and the high-quality image Im1020 are each padded so that they have the fixed image size set for the teacher data. Then, the low-quality image Im1011 and the high-quality image Im1021 that have been enlarged and padded are paired, and the pair is used as one piece of teacher data.

なお、教師データについて設定された一定の画像サイズとは、処理対象（入力画像）として想定される画像を一定の解像度となるように拡大又は縮小したときの最大となりうる画像サイズである。当該一定の画像サイズが十分に大きくない場合には、高画質化エンジンに入力された画像を拡大したときに、機械学習モデルが対処不可能な画像サイズとなる可能性がある。 The fixed image size set for the training data is the maximum possible image size when the image expected to be processed (input image) is enlarged or reduced to a fixed resolution. If the fixed image size is not large enough, when the image input to the image quality improvement engine is enlarged, the image size may become too large for the machine learning model to handle.

また、パディングが行われる領域は、効果的に高画質化できるように機械学習モデルの特性に合わせて、一定の画素値で埋めたり、近傍画素値で埋めたり、ミラーパディングしたりする。なお、第１の実施形態と同様に、入力データには、処理対象として想定される特定の撮影条件を持つ画像を用いるが、当該特定の撮影条件は、予め決定された撮影部位、撮影方式、撮影画角である。つまり、本実施形態に係る当該特定の撮影条件には、第１の実施形態と異なり、画像サイズは含まれない。 In addition, the area to be padded is filled with a fixed pixel value, filled with neighboring pixel values, or mirror-padded, according to the characteristics of the machine learning model, so that the image quality can be improved effectively. As in the first embodiment, the input data uses images having the specific imaging conditions assumed for the processing target; here, the specific imaging conditions are a predetermined imaging part, imaging method, and imaging angle of view. In other words, unlike the first embodiment, the specific imaging conditions in this embodiment do not include image size.
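
The three padding strategies named above (fixed value, neighboring value, mirror) can be illustrated with a small stdlib-only sketch. It assumes padding is added only on the right and bottom edges and that the image is a list of rows; `pad_1d` and `pad_2d` are hypothetical helper names, and where the padding is placed is an assumption, since the description does not fix it.

```python
def pad_1d(values, target_len, mode="constant", fill=0):
    """Pad a list on its right side up to target_len."""
    extra = target_len - len(values)
    if extra < 0:
        raise ValueError("target length is smaller than the input")
    n = len(values)
    if mode == "constant":      # fill with a fixed value
        tail = [fill] * extra
    elif mode == "edge":        # repeat the nearest (last) value
        tail = [values[-1]] * extra
    elif mode == "mirror":      # reflect about the last sample: 1,2,3 -> 1,2,3,2,1
        period = 2 * (n - 1) if n > 1 else 1
        tail = []
        for i in range(extra):
            k = (n + i) % period
            tail.append(values[k] if k < n else values[period - k])
    else:
        raise ValueError("unknown padding mode: " + mode)
    return list(values) + tail


def pad_2d(image, target_h, target_w, mode="constant", fill=0):
    """Pad each row to target_w, then pad the list of rows to target_h."""
    rows = [pad_1d(row, target_w, mode, fill) for row in image]
    rows = pad_1d(rows, target_h, mode, [fill] * target_w)
    return [list(row) for row in rows]  # copy so padded rows are independent
```

With an array library, the same three behaviours correspond to what NumPy's `numpy.pad` calls the `constant`, `edge`, and `reflect` modes.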

本実施形態に係る高画質化部４０４は、このような教師データで学習が行われた高画質化エンジンを用いて、入力画像を高画質化して高画質画像を生成する。この際、高画質化部４０４は、入力画像を教師データについて設定された一定の解像度になるように拡大又は縮小した変形画像を生成する。また、高画質化部４０４は、変形画像について、教師データについて設定された一定の画像サイズとなるようにパディングを行ってパディング画像を生成し、パディング画像を高画質化エンジン入力する。 The image quality improvement unit 404 according to this embodiment uses an image quality improvement engine that has been trained using such teacher data to improve the image quality of an input image to generate a high-quality image. At this time, the image quality improvement unit 404 generates a deformed image by enlarging or reducing the input image so that it has a certain resolution set for the teacher data. The image quality improvement unit 404 also pads the deformed image so that it has a certain image size set for the teacher data to generate a padded image, and inputs the padded image to the image quality improvement engine.

また、高画質化部４０４は、高画質化エンジンから出力された高画質なパディング画像について、パディングを行った領域分だけトリミングし、高画質な変形画像を生成する。その後、高画質化部４０４は、生成した高画質な変形画像を入力画像の画像サイズになるように縮小又は拡大し、高画質画像を生成する。 The image quality improvement unit 404 also trims the high-quality padded image output from the image quality improvement engine by the area where padding was performed, to generate a high-quality deformed image. The image quality improvement unit 404 then reduces or enlarges the generated high-quality deformed image to the image size of the input image, to generate a high-quality image.

このため、本実施形態に係る高画質化部４０４は、第１の実施形態では対処できなかった画像サイズの入力画像であっても、高画質化エンジンによって高画質化して高画質画像を生成することができる。 For this reason, the image quality improvement unit 404 according to this embodiment can generate a high-quality image by improving the image quality using the image quality improvement engine, even for input images with image sizes that could not be handled in the first embodiment.

次に、図５及び１１を参照して、本実施形態に係る一連の画像処理について説明する。図１１は、本実施形態に係る高画質化処理のフロー図である。なお、本実施形態に係るステップＳ５１０、ステップＳ５２０、及びステップＳ５５０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、画像サイズ以外の撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to Figs. 5 and 11. Fig. 11 is a flow diagram of the image quality improvement processing according to this embodiment. The processing of steps S510, S520, and S550 in this embodiment is the same as in the first embodiment, so its description is omitted. If the image quality of the input image is to be improved unconditionally with respect to imaging conditions other than image size, step S530 may be skipped after step S520 and the processing may proceed directly to step S540.

ステップＳ５２０において、第１の実施形態と同様に、撮影条件取得部４０２が入力画像の撮影条件群を取得したら、処理はステップＳ５３０に移行する。ステップＳ５３０では、高画質化可否判定部４０３が、取得された撮影条件群を用いて、高画質化部４０４に備える高画質化エンジンが入力画像を対処可能であるか否かを判定する。具体的には、高画質化可否判定部４０３は、入力画像の撮影条件について、高画質化エンジンが対処可能な、撮影部位、撮影方式、及び撮影画角であるか否かを判定する。高画質化可否判定部４０３は、第１の実施形態と異なり、画像サイズは判定しない。 In step S520, as in the first embodiment, when the imaging condition acquisition unit 402 acquires a group of imaging conditions for the input image, the process proceeds to step S530. In step S530, the image quality improvement feasibility determination unit 403 uses the acquired group of imaging conditions to determine whether the image quality improvement engine provided in the image quality improvement unit 404 can handle the input image. Specifically, the image quality improvement feasibility determination unit 403 determines whether the imaging conditions of the input image are imaging body part, imaging method, and imaging angle of view that the image quality improvement engine can handle. Unlike the first embodiment, the image quality improvement feasibility determination unit 403 does not determine the image size.

処理がステップＳ５４０に移行すると、図１１に示される本実施形態に係る高画質化処理が開始される。本実施形態に係る高画質化処理では、まず、ステップＳ１１１０において、高画質化部４０４が、入力画像を教師データについて設定された一定の解像度となるように拡大又は縮小し、変形画像を生成する。 When the process proceeds to step S540, the image quality improvement process according to this embodiment shown in FIG. 11 is started. In the image quality improvement process according to this embodiment, first, in step S1110, the image quality improvement unit 404 enlarges or reduces the input image to a certain resolution set for the training data, and generates a deformed image.

次に、ステップＳ１１２０において、高画質化部４０４は、生成した変形画像について、教師データについて設定された画像サイズとなるように、パディングを行ってパディング画像を生成する。この際、高画質化部４０４は、パディングを行う領域について、効果的に高画質化できるように機械学習モデルの特性に合わせて、一定の画素値で埋めたり、近傍画素値で埋めたり、ミラーパディングしたりする。 Next, in step S1120, the image quality improvement unit 404 performs padding on the generated transformed image so that the image size is the same as the image size set for the training data, to generate a padded image. At this time, the image quality improvement unit 404 fills the area to be padded with a fixed pixel value, fills with a neighboring pixel value, or performs mirror padding in accordance with the characteristics of the machine learning model so as to effectively improve the image quality.

ステップＳ１１３０では、高画質化部４０４がパディング画像を高画質化エンジンに入力し高画質化された高画質なパディング画像を取得する。 In step S1130, the image quality improvement unit 404 inputs the padded image to the image quality improvement engine and obtains a high-image-quality padded image.
次に、ステップＳ１１４０において、高画質化部４０４は、高画質なパディング画像について、ステップＳ１１２０でパディングを行った領域分だけトリミングを行い、高画質な変形画像を生成する。 Next, in step S1140, the image quality improvement unit 404 trims the high-image-quality padded image by the area that was padded in step S1120, and generates a high-image-quality deformed image.

その後、ステップＳ１１５０において、高画質化部４０４は、高画質な変形画像を入力画像の画像サイズに縮小又は拡大し、高画質画像を生成する。高画質化部４０４がステップＳ１１５０において高画質画像を生成したら、本実施形態に係る高画質化処理は終了し、処理はステップＳ５５０に移行する。ステップＳ５５０の処理は、第１の実施形態のステップＳ５５０と同様であるため説明を省略する。 Then, in step S1150, the image quality improvement unit 404 reduces or enlarges the high-quality deformed image to the image size of the input image to generate a high-quality image. Once the image quality improvement unit 404 generates a high-quality image in step S1150, the image quality improvement process according to this embodiment ends, and the process proceeds to step S550. The process of step S550 is similar to step S550 in the first embodiment, and therefore a description thereof will be omitted.

上記のように、本実施形態による高画質化部４０４は、入力画像の解像度が所定の解像度となるように、入力画像の画像サイズを調整する。また、高画質化部４０４は、画像サイズが調整された入力画像について、調整された画像サイズが高画質化エンジンによって対処可能な画像サイズとなるように、パディングを行ったパディング画像を生成し、パディング画像を高画質化エンジンに入力する。その後、高画質化部４０４は、高画質化エンジンからの出力画像について、パディングを行った領域分だけトリミングを行う。そして、高画質化部４０４は、トリミングが行われた画像の画像サイズを、入力画像の元の画像サイズに調整することで高画質画像を生成する。 As described above, the image quality improvement unit 404 according to this embodiment adjusts the image size of the input image so that the resolution of the input image becomes a predetermined resolution. Furthermore, for the input image whose image size has been adjusted, the image quality improvement unit 404 generates a padded image by padding so that the adjusted image size becomes an image size that can be handled by the image quality improvement engine, and inputs the padded image to the image quality improvement engine. The image quality improvement unit 404 then trims the output image from the image quality improvement engine by the amount of the padded area. The image quality improvement unit 404 then generates a high-quality image by adjusting the image size of the trimmed image to the original image size of the input image.
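
The sequence summarized above (rescale to the reference resolution, pad to the fixed size, enhance, trim, restore the original size) can be sketched as follows. This is a rough illustration under stated assumptions: resolution is modelled as a pixel spacing in arbitrary units, interpolation is nearest-neighbour, padding uses a fixed value on the right and bottom only, and all function and parameter names are hypothetical.

```python
def resize_nearest(image, out_h, out_w):
    """Nearest-neighbour resize of a 2-D image stored as a list of rows."""
    in_h, in_w = len(image), len(image[0])
    return [
        [image[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
        for r in range(out_h)
    ]


def enhance_at_fixed_resolution(image, engine, pixel_spacing, target_spacing,
                                fixed_size, fill=0):
    """Resolution-normalised enhancement following steps S1110 to S1150."""
    in_h, in_w = len(image), len(image[0])
    fixed_h, fixed_w = fixed_size
    scale = pixel_spacing / target_spacing  # > 1 means the image is enlarged
    d_h = max(1, round(in_h * scale))
    d_w = max(1, round(in_w * scale))
    if d_h > fixed_h or d_w > fixed_w:
        raise ValueError("fixed image size is too small for this input")
    deformed = resize_nearest(image, d_h, d_w)                    # S1110
    padded = [row + [fill] * (fixed_w - d_w) for row in deformed] # S1120
    padded += [[fill] * fixed_w for _ in range(fixed_h - d_h)]
    enhanced = engine(padded)                                     # S1130
    trimmed = [row[:d_w] for row in enhanced[:d_h]]               # S1140
    return resize_nearest(trimmed, in_h, in_w)                    # S1150
```

The `ValueError` branch corresponds to the remark that the fixed image size must be large enough for every expected input after rescaling.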

これにより、本実施形態の高画質化部４０４は、第１の実施形態では対処できなかった画像サイズの入力画像であっても、高画質化エンジンによって高画質化して高画質画像を生成することができる。また、解像度を基準とした教師データで学習した高画質化エンジンを用いることで、単純に同一な画像サイズの画像を処理する第４の実施形態に係る高画質化エンジンよりも、効率よく入力画像を高画質化できる場合がある。 As a result, the image quality improvement unit 404 of this embodiment can use the image quality improvement engine to improve the image quality of input images of image sizes that could not be handled in the first embodiment, and generate high-quality images. Furthermore, by using an image quality improvement engine that has learned using training data based on resolution, it may be possible to improve the image quality of input images more efficiently than the image quality improvement engine of the fourth embodiment, which simply processes images of the same image size.

＜第６の実施形態＞ Sixth Embodiment
次に、図４、５、１２及び１３を参照して、第６の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が、入力画像を一定の画像サイズの領域毎に高画質化することにより高画質画像を生成する。 Next, an image processing apparatus according to a sixth embodiment will be described with reference to Figures 4, 5, 12 and 13. In this embodiment, an image quality improving unit improves the image quality of an input image for each region of a certain image size, thereby generating a high-quality image.

本実施形態に係る高画質化部４０４には、第１の実施形態と同様の、高画質化エンジンが備えられている。ただし、本実施形態では、高画質化エンジンの学習に用いる教師データが第１の実施形態における教師データと異なる。具体的には、教師データを構成する、低画質画像である入力データと高画質画像である出力データとのペア群を、低画質画像及び高画質画像における、位置関係が対応する一定の画像サイズの矩形領域画像によって構成している。なお、矩形領域は、部分領域の一例であり、矩形である必要はなく、どのような形状であってもよい。 The image quality improvement unit 404 according to this embodiment is equipped with an image quality improvement engine similar to that of the first embodiment. However, in this embodiment, the teacher data used for training the image quality improvement engine is different from the teacher data in the first embodiment. Specifically, pairs of input data, which is a low-quality image, and output data, which is a high-quality image, constituting the teacher data are constituted by rectangular area images of a certain image size in which the positional relationship in the low-quality image and the high-quality image corresponds. Note that the rectangular area is an example of a partial area, and does not have to be rectangular and may be any shape.

ここで、図１２を参照して、本実施形態に係る高画質化エンジンの教師データについて説明する。図１２に示すように、教師データを構成するペア群の一つに、例えば、低画質画像である元画像Ｉｍ１２１０と、高画質画像である重ね合わせ画像Ｉｍ１２２０があるとした場合を考える。この場合、第１の実施形態においては、教師データの入力データをＩｍ１２１０、出力データをＩｍ１２２０とした。 Now, referring to FIG. 12, the teacher data of the image quality improvement engine according to this embodiment will be described. As shown in FIG. 12, consider a case where one of the pairs constituting the teacher data includes, for example, an original image Im1210, which is a low-image quality image, and an overlaid image Im1220, which is a high-image quality image. In this case, in the first embodiment, the input data of the teacher data is Im1210, and the output data is Im1220.

これに対し、本実施形態においては、元画像Ｉｍ１２１０のうちの矩形領域画像Ｒ１２１１を入力データとし、重ね合わせ画像Ｉｍ１２２０において矩形領域画像Ｒ１２１１と同じ撮影領域である矩形領域画像Ｒ１２２１を出力データとする。そして、入力データである矩形領域画像Ｒ１２１１と出力データである矩形領域画像Ｒ１２２１によって教師データのペア（以下、第１の矩形領域画像ペア）を構成する。ここで、矩形領域画像Ｒ１２１１と矩形領域画像Ｒ１２２１は、一定の画像サイズの画像とされる。なお、元画像Ｉｍ１２１０と重ね合わせ画像Ｉｍ１２２０は任意の方法により位置合わせされてよい。また、矩形領域画像Ｒ１２１１と矩形領域画像Ｒ１２２１の対応する位置関係はテンプレートマッチングなどの任意の方法によって特定されてよい。なお、高画質化エンジンの設計によっては、入力データと出力データの、それぞれの画像サイズや次元数は異なっていてもよい。例えば、処理対象がＯＣＴの画像である場合に、入力データがＢスキャン画像（二次元画像）の一部であるとき、出力データがＡスキャン画像（一次元画像）の一部であってもよい。 In contrast, in this embodiment, the rectangular area image R1211 of the original image Im1210 is used as input data, and the rectangular area image R1221, which is the same shooting area as the rectangular area image R1211 in the superimposed image Im1220, is used as output data. Then, the rectangular area image R1211, which is the input data, and the rectangular area image R1221, which is the output data, form a pair of teacher data (hereinafter, the first rectangular area image pair). Here, the rectangular area image R1211 and the rectangular area image R1221 are images of a certain image size. Note that the original image Im1210 and the superimposed image Im1220 may be aligned by any method. Also, the corresponding positional relationship between the rectangular area image R1211 and the rectangular area image R1221 may be specified by any method such as template matching. Note that, depending on the design of the image quality improvement engine, the image size and the number of dimensions of the input data and the output data may differ. For example, if the processing target is an OCT image, and the input data is part of a B-scan image (two-dimensional image), the output data may be part of an A-scan image (one-dimensional image).

矩形領域画像Ｒ１２１１，Ｒ１２２１に関する一定の画像サイズは、例えば、処理対象（入力画像）として想定される画像の画像サイズ群について、対応する各次元の画素数群の公約数から決定することができる。この場合には、高画質化エンジンが出力する矩形領域画像群の位置関係が重なることを防ぐことができる。具体的に、例えば、処理対象として想定される画像が二次元画像であり、画像サイズ群のうちの第１の画像サイズが幅５００画素、高さ５００画素であり、第２の画像サイズが幅１００画素、高さ１００画素である場合を考える。ここで、各辺の公約数から、矩形領域画像Ｒ１２１１，Ｒ１２２１に関する一定の画像サイズを選択する。この場合には、例えば、一定の画像サイズを、幅１００画素、高さ１００画素や、幅５０画素、高さ５０画素や、幅２５画素、高さ２５画素等から選択する。 The fixed image size for the rectangular area images R1211 and R1221 can be determined, for example, from the common divisor of the pixel counts of the corresponding dimensions for the image size group of the image assumed to be processed (input image). In this case, it is possible to prevent the positional relationship of the rectangular area images output by the image quality improvement engine from overlapping. Specifically, for example, consider a case where the image assumed to be processed is a two-dimensional image, and the first image size of the image size group is 500 pixels wide and 500 pixels high, and the second image size is 100 pixels wide and 100 pixels high. Here, the fixed image size for the rectangular area images R1211 and R1221 is selected from the common divisor of each side. In this case, for example, the fixed image size is selected from 100 pixels wide and 100 pixels high, 50 pixels wide and 50 pixels high, 25 pixels wide and 25 pixels high, etc.
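
The candidate sizes in the example above are the common divisors of the expected side lengths, which are exactly the divisors of their greatest common divisor. A minimal sketch (the function name `candidate_patch_sizes` is hypothetical):

```python
from functools import reduce
from math import gcd


def candidate_patch_sizes(side_lengths):
    """Return every common divisor of the given side lengths, largest first."""
    g = reduce(gcd, side_lengths)
    return [d for d in range(g, 0, -1) if g % d == 0]


# Side lengths from the example: 500 pixels and 100 pixels.
print(candidate_patch_sizes([500, 100]))
# prints [100, 50, 25, 20, 10, 5, 4, 2, 1]
```

The sizes 100, 50, and 25 mentioned in the text are the three largest candidates; any value in the list tiles both image sizes without remainder, which is why the output regions do not overlap.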

処理対象として想定される画像が三次元である場合には、幅、高さ、奥行きに関して画素数を決定する。なお、矩形領域は、入力データに対応する低画質画像と出力データに対応する高画質画像のペアの一つに対して、複数設定可能である。このため、例えば、元画像Ｉｍ１２１０のうちの矩形領域画像Ｒ１２１２を入力データ、重ね合わせ画像Ｉｍ１２２０において矩形領域画像Ｒ１２１２と同じ撮影領域である矩形領域画像Ｒ１２２２を出力データとする。そして、入力データである矩形領域画像Ｒ１２１２と出力データである矩形領域画像Ｒ１２２２によって教師データのペアを構成する。これにより、第１の矩形領域画像ペアとは別の矩形領域画像ペアを作成できる。 When the image to be processed is three-dimensional, the number of pixels is determined for width, height, and depth. Note that multiple rectangular areas can be set for one pair of a low-quality image corresponding to the input data and a high-quality image corresponding to the output data. For this reason, for example, rectangular area image R1212 of original image Im1210 is set as input data, and rectangular area image R1222, which is the same shooting area as rectangular area image R1212 in superimposed image Im1220, is set as output data. Then, a pair of teacher data is formed by rectangular area image R1212, which is the input data, and rectangular area image R1222, which is the output data. This makes it possible to create a rectangular area image pair other than the first rectangular area image pair.
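
Building several rectangular-region pairs from one low-quality/high-quality image pair amounts to cropping both images at the same coordinates. The sketch below assumes the two images are already aligned and have identical dimensions; `crop` and `make_patch_pairs` are hypothetical names, and the choice of origins is left to the caller.

```python
def crop(image, top, left, height, width):
    """Cut a height x width region whose upper-left corner is (top, left)."""
    return [row[left:left + width] for row in image[top:top + height]]


def make_patch_pairs(low, high, patch_size, origins):
    """Crop matching regions from aligned low- and high-quality images."""
    patch_h, patch_w = patch_size
    img_h, img_w = len(low), len(low[0])
    pairs = []
    for top, left in origins:
        if top < 0 or left < 0 or top + patch_h > img_h or left + patch_w > img_w:
            raise ValueError("patch origin lies outside the image")
        pairs.append((crop(low, top, left, patch_h, patch_w),    # input data
                      crop(high, top, left, patch_h, patch_w)))  # output data
    return pairs
```

Each returned tuple plays the role of one pair such as (R1211, R1221) or (R1212, R1222).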

なお、矩形領域の画像を異なる座標の画像に変えながら多数の矩形領域画像のペアを作成することで教師データを構成するペア群を充実させることができ、当該教師ペアを用いて学習を行った高画質化エンジンによって効率的な高画質化が期待できる。ただし、機械学習モデルの高画質化に寄与しないペアは教師データに加えないようにすることができる。例えば、ペアを構成する出力データである高画質画像から作成した矩形領域画像が診断に適さない画質である場合には、そのような教師データを用いて学習を行った高画質化エンジンが出力する画像も画像診断に適さない画質になってしまう可能性がある。そのため、そのような高画質画像を含むペアを教師データから取り除くことができる。 By creating a large number of pairs of rectangular area images while changing the images of rectangular areas to images with different coordinates, the group of pairs that make up the training data can be enriched, and efficient image quality improvement can be expected by an image quality improvement engine that has trained using the training pairs. However, pairs that do not contribute to improving the image quality of the machine learning model can be prevented from being added to the training data. For example, if a rectangular area image created from a high-quality image that is output data that makes up a pair has image quality that is not suitable for diagnosis, the image output by an image quality improvement engine that has trained using such training data may also have image quality that is not suitable for image diagnosis. Therefore, pairs that include such high-quality images can be removed from the training data.

また、例えば、ペアである、低画質画像から作成した矩形領域画像と高画質画像から作成した矩形領域画像の平均輝度や輝度分布が大きく異なる場合も、そのようなペアを教師データから取り除くことができる。そのような教師データを用いて学習を行うと、高画質化エンジンが入力画像と大きく異なる輝度分布を持つ画像診断に適さない画像を出力してしまう可能性がある。 In addition, for example, if the average luminance or luminance distribution of a pair of rectangular area images created from a low-quality image and a high-quality image differs significantly, such pairs can be removed from the training data. If learning is performed using such training data, the image quality improvement engine may output an image that is unsuitable for image diagnosis, with a luminance distribution that differs significantly from the input image.

さらに、例えば、ペアである、低画質画像から作成した矩形領域画像と高画質画像から作成した矩形領域画像とに描画される撮影対象の構造や位置が大きく異なる場合を考える。この場合には、そのような教師データを用いて学習を行った高画質化エンジンが入力画像と大きく異なる構造や位置に撮影対象を描画した画像診断に適さない画像を出力してしまう可能性がある。そのため、このようなペアを教師データから取り除くこともできる。 Furthermore, for example, consider a case where the structure and position of the object depicted in a pair of rectangular area images, one created from a low-image quality image and the other created from a high-image quality image, are significantly different. In this case, an image quality improvement engine that has learned using such training data may output an image that is not suitable for image diagnosis, in which the object is depicted in a structure and position significantly different from that in the input image. For this reason, such pairs can be removed from the training data.
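
One of the exclusion criteria above, a large difference in average luminance between the two regions of a pair, can be sketched as a simple filter. The threshold `max_mean_diff` and the function names are assumptions for illustration; the description does not give a numerical criterion, and the checks for diagnostic image quality and for structural or positional mismatch are not covered by this sketch.

```python
def mean_luminance(patch):
    """Average pixel value of a 2-D patch stored as a list of rows."""
    values = [v for row in patch for v in row]
    return sum(values) / len(values)


def filter_pairs(pairs, max_mean_diff):
    """Keep only pairs whose mean luminances differ by at most max_mean_diff."""
    return [
        (low, high)
        for low, high in pairs
        if abs(mean_luminance(low) - mean_luminance(high)) <= max_mean_diff
    ]
```

A comparison of luminance histograms could be added in the same way if the distribution, rather than only the mean, should be checked.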

なお、第１の実施形態と同様に、教師データの入力データには、処理対象として想定される特定の撮影条件を持つ画像を用いるが、当該特定の撮影条件は、予め決定された撮影部位、撮影方式、及び撮影画角である。つまり、本実施形態に係る当該特定の撮影条件には、第１の実施形態と異なり、画像サイズは含まれない。 As in the first embodiment, the input data of the teacher data uses images having the specific imaging conditions assumed for the processing target; here, the specific imaging conditions are a predetermined imaging part, imaging method, and imaging angle of view. In other words, unlike the first embodiment, the specific imaging conditions in this embodiment do not include image size.

本実施形態に係る高画質化部４０４は、このような教師データで学習が行われた高画質化エンジンを用いて、入力画像を高画質化して高画質画像を生成する。この際、高画質化部４０４は、入力された画像を、隙間なく連続する、教師データについて設定された一定の画像サイズの矩形領域画像群に分割する。高画質化部４０４は、分割した矩形領域画像群のそれぞれを高画質化エンジンにより高画質化し、高画質な矩形領域画像群を生成する。その後、高画質化部４０４は、生成した高画質な矩形領域画像群を、入力画像の位置関係に応じて配置して結合し、高画質画像を生成する。ここで、学習時には、ペア画像である入力データと出力データとの互いの位置関係が対応していれば、それぞれの矩形領域を低画質画像及び高画質画像における任意の場所から切り出して（抽出して）もよい。一方、高画質化時には、入力画像を隙間なく連続する矩形領域画像群に分割してもよい。また、学習時の各ペア画像の画像サイズと、高画質化時の各矩形領域画像の画像サイズとが互いが対応する（例えば、同一となる）ように設定されてもよい。これらにより、学習効率を上げつつ、無駄な計算や足りない所が出てくると画像にならないという問題が生じないようにすることができる。 The image quality improvement unit 404 according to the present embodiment uses an image quality improvement engine that has been trained using such teacher data to improve the image quality of an input image and generate a high-quality image. At this time, the image quality improvement unit 404 divides the input image into a group of rectangular area images of a certain image size set for the teacher data that are continuous without gaps. The image quality improvement unit 404 improves the image quality of each of the divided rectangular area image groups using the image quality improvement engine to generate a group of high-quality rectangular area images. After that, the image quality improvement unit 404 arranges and combines the generated high-quality rectangular area images according to the positional relationship of the input image to generate a high-quality image. Here, during learning, as long as the positional relationship between the input data and the output data, which are paired images, corresponds to each other, each rectangular area may be cut out (extracted) from any location in the low-quality image and the high-quality image. On the other hand, during image quality improvement, the input image may be divided into a group of continuous rectangular area images without gaps. 
In addition, the image size of each pair image used during learning and the image size of each rectangular area image used during image quality improvement may be set to correspond to each other (for example, to be identical). These measures improve learning efficiency while avoiding both wasted computation and gaps in coverage that would leave the output image incomplete.
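The extraction of paired rectangular areas for learning can be sketched as follows. This is a minimal illustration and not the implementation of the embodiment: the function name `sample_paired_patches`, its parameters, and the use of NumPy arrays of identical shape for the low-quality and high-quality images are all assumptions made for the example.

```python
import numpy as np

def sample_paired_patches(low, high, patch_h, patch_w, n, seed=0):
    """Cut n patches from the same coordinates of a low/high-quality pair.

    Hypothetical helper: assumes both images are arrays of the same shape.
    """
    if low.shape != high.shape:
        raise ValueError("paired images must have the same shape")
    rng = np.random.default_rng(seed)
    h, w = low.shape[:2]
    pairs = []
    for _ in range(n):
        # Any location may be used, provided both patches share it.
        y = int(rng.integers(0, h - patch_h + 1))
        x = int(rng.integers(0, w - patch_w + 1))
        pairs.append((low[y:y + patch_h, x:x + patch_w],
                      high[y:y + patch_h, x:x + patch_w]))
    return pairs
```

Because the crop position is free during learning, many training pairs can be drawn from a single image pair, whereas the division used during image quality improvement must cover the input image without gaps.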

このように、本実施形態の高画質化部４０４は、入力された画像を矩形領域単位で高画質化し、高画質化した画像を結合することで、第１の実施形態では対処できなかった画像サイズの画像をも高画質化して高画質画像を生成することができる。 In this way, the image quality improvement unit 404 of this embodiment improves the image quality of the input image in units of rectangular regions, and by combining the improved image quality images, it is possible to improve the image quality of images of image sizes that could not be handled in the first embodiment, thereby generating a high-image quality image.

次に、図５、１３及び１４を参照して、本実施形態に係る一連の画像処理について説明する。図１３は、本実施形態に係る高画質化処理のフロー図である。なお、本実施形態に係るステップＳ５１０、ステップＳ５２０、及びステップＳ５５０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、画像サイズ以外の撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to Figs. 5, 13, and 14. Fig. 13 is a flow diagram of the image quality improvement process according to this embodiment. Note that the processes of steps S510, S520, and S550 according to this embodiment are similar to those of the first embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to shooting conditions other than image size, after the process of step S520, the process of step S530 may be omitted and the process may proceed to step S540.

処理がステップＳ５４０に移行すると、図１３に示される本実施形態に係る高画質化処理が開始される。これについて図１４を用いて説明をする。本実施形態に係る高画質化処理では、まず、ステップＳ１３１０において、図１４（ａ）に示すように、入力画像を隙間なく連続する、教師データについて設定された一定の画像サイズ（Ｒ１４１１に示すサイズ）の矩形領域画像群に分割する。ここで、図１４（ａ）は、入力画像Ｉｍ１４１０を一定の画像サイズの矩形領域画像Ｒ１４１１～Ｒ１４２６群に分割した一例を示す。なお、上述のように、高画質化エンジンの設計によっては、高画質化エンジンの入力画像と出力画像の、それぞれの画像サイズや次元数が異なってもよい。この場合には、ステップＳ１３２０において生成される結合された高画質画像に欠損が無いように、入力画像の分割位置を重複させたり、分離させたりして、調整することができる。図１４（ｂ）には分割位置を重複させる例を示す。図１４（ｂ）において、Ｒ１４１１’、Ｒ１４１２’が重複した領域を示している。煩雑になるため図示はしないが、Ｒ１４１３～Ｒ１４２６においても同様な重複領域Ｒ１４１３’～Ｒ１４２６’を持つものとする。なお、図１４（ｂ）の場合の教師データについて設定される矩形領域サイズは、Ｒ１４１１’に示すサイズである。入力画像Ｉｍ１４１０の画像外部の周辺（上下左右端）においてはデータが存在しないため、一定の画素値で埋めたり、近傍画素値で埋めたり、ミラーパディングしたりする。また、高画質化エンジンによっては、フィルタ処理により画像内部の周辺（上下左右端）では、高画質化の精度が低下する場合がある。そのため、図１４（ｂ）のように分割位置を重複して矩形領域画像を設定し、最終的な画像としては矩形領域画像の一部をトリミングして合成するようにしてもよい。高画質化エンジンの特性に応じて、矩形領域のサイズを設定する。なお、図１４（ａ）、（ｂ）にはＯＣＴの断層画像を例示したが、図１４（ｃ）、（ｄ）に示すように入力画像（Ｉｍ１４５０）はＯＣＴＡのＥｎ－Ｆａｃｅ画像のような正面画像でもよく、同様の処理が可能である。なお、矩形領域画像のサイズは、対象とする画像や高画質化エンジンの種類に応じて適切に設定を行う。 When the process proceeds to step S540, the image quality improvement process according to the present embodiment shown in FIG. 13 is started. This will be described with reference to FIG. 14. In the image quality improvement process according to the present embodiment, first, in step S1310, as shown in FIG. 14(a), the input image is divided into a group of rectangular area images of a certain image size (size shown in R1411) set for the teacher data, which are continuous without gaps. Here, FIG. 14(a) shows an example in which the input image Im1410 is divided into a group of rectangular area images R1411 to R1426 of a certain image size. Note that, as described above, depending on the design of the image quality improvement engine, the image sizes and number of dimensions of the input image and the output image of the image quality improvement engine may differ. 
In this case, the division positions of the input image can be adjusted, by overlapping or separating them, so that the combined high-quality image generated in step S1320 has no missing parts. FIG. 14(b) shows an example in which the division positions overlap. In FIG. 14(b), R1411' and R1412' indicate overlapping areas. Although omitted from the figure to avoid clutter, R1413 to R1426 likewise have overlapping areas R1413' to R1426'. In the case of FIG. 14(b), the rectangular area size set for the teacher data is the size indicated by R1411'. Because no data exists beyond the periphery (top, bottom, left, and right edges) of the input image Im1410, that area is filled with a constant pixel value, filled with neighboring pixel values, or mirror-padded. In addition, depending on the image quality improvement engine, filter processing may reduce the accuracy of image quality improvement near the inner periphery (top, bottom, left, and right edges) of an image. Therefore, as shown in FIG. 14(b), the rectangular area images may be set with overlapping division positions, and part of each rectangular area image may be trimmed before the images are combined into the final image. The size of the rectangular area is set according to the characteristics of the image quality improvement engine. Although FIGS. 14(a) and 14(b) illustrate OCT tomographic images, the input image (Im1450) may instead be a front image such as an OCTA En-Face image, as shown in FIGS. 14(c) and 14(d), and the same processing can be applied. The size of the rectangular area images is set appropriately according to the target image and the type of image quality improvement engine.
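The overlapping division, per-area processing, trimming, and recombination described above can be sketched as follows. This is only an illustration under stated assumptions: the function `enhance_by_tiles`, the parameters `tile` and `margin`, and the callable `engine` are hypothetical names, the image is assumed to be a 2-D NumPy array, and the engine is assumed to return an output of the same size as its input. Mirror padding is used here, which is one of the three padding options mentioned in the text.

```python
import numpy as np

def enhance_by_tiles(image, engine, tile=64, margin=8):
    """Apply `engine` to overlapping tiles and stitch the trimmed cores.

    Hypothetical sketch: `engine` maps a 2-D array to an array of equal shape.
    """
    h, w = image.shape
    # Pad so whole tiles cover the image, plus a margin on every side.
    pad_h = (-h) % tile
    pad_w = (-w) % tile
    padded = np.pad(image,
                    ((margin, margin + pad_h), (margin, margin + pad_w)),
                    mode="reflect")
    out = np.empty((h + pad_h, w + pad_w), dtype=np.float64)
    for y in range(0, h + pad_h, tile):
        for x in range(0, w + pad_w, tile):
            # Enlarged area (like R1411'): the core plus its margin.
            patch = padded[y:y + tile + 2 * margin, x:x + tile + 2 * margin]
            result = engine(patch)
            # Keep only the core (like R1411); the border is trimmed away.
            out[y:y + tile, x:x + tile] = result[margin:margin + tile,
                                                 margin:margin + tile]
    return out[:h, :w]
```

With an identity engine the function returns the input unchanged, which is a convenient check that the division and recombination leave no gaps or misplaced areas.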

次に、ステップＳ１３２０において、高画質化部４０４は、矩形領域画像Ｒ１４１１～Ｒ１４２６群、あるいは重複領域を設定している場合は矩形領域画像Ｒ１４１１’～Ｒ１４２６’群のそれぞれを高画質化エンジンにより高画質化し、高画質な矩形領域画像群を生成する。 Next, in step S1320, the image quality improvement unit 404 uses an image quality improvement engine to improve the image quality of the rectangular area images R1411 to R1426, or, if an overlapping area is set, the rectangular area images R1411' to R1426', to generate a group of high-image-quality rectangular area images.

そして、ステップＳ１３３０において、高画質化部４０４は、生成した高画質な矩形領域画像群のそれぞれを、入力画像について分割した矩形領域画像Ｒ１４１１～Ｒ１４２６群のそれぞれと同様の位置関係に配置して結合し、高画質画像を生成する。重複領域を設定している場合には、矩形領域画像Ｒ１４１１’～Ｒ１４２６’それぞれと同様の位置関係に配置した後に矩形領域画像Ｒ１４１１～Ｒ１４２６を切り出して結合し、高画質画像を生成する。なお、重複領域を利用して矩形領域画像Ｒ１４１１’～Ｒ１４２６’の輝度値を補正するようにしてもよい。例えば、基準とする矩形領域画像を任意に設定する。そして、基準矩形画像と重複する領域のある隣接矩形画像において、同じ座標点の輝度値を計測することで、隣接画像間における輝度値の差（比率）が分かる。同様に、全ての画像においても重複領域における輝度値の差（比率）を求めることで、全体として輝度値のムラを無くすように補正を行うことが可能となる。なお、輝度値補正に重複領域を全て利用する必要はなく、重複領域の一部（周辺部数ピクセル）は使用しなくてもよい。 Then, in step S1330, the image quality improving unit 404 arranges each of the generated high-quality rectangular area images in the same positional relationship as each of the rectangular area images R1411 to R1426 divided from the input image, and combines them to generate a high-quality image. If an overlapping area is set, the rectangular area images R1411' to R1426' are arranged in the same positional relationship as each of them, and then the rectangular area images R1411 to R1426 are cut out and combined to generate a high-quality image. Note that the overlapping area may be used to correct the luminance values of the rectangular area images R1411' to R1426'. For example, a reference rectangular area image may be arbitrarily set. Then, by measuring the luminance values of the same coordinate points in adjacent rectangular images that have an overlapping area with the reference rectangular image, the difference (ratio) in luminance values between adjacent images can be found. Similarly, by calculating the difference (ratio) in luminance values in the overlapping areas in all images, correction can be performed to eliminate unevenness in luminance values overall. Note that it is not necessary to use the entire overlapping area for brightness value correction, and a portion of the overlapping area (a few peripheral pixels) may not be used.
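The luminance correction based on the overlapping area can be sketched for a single pair of horizontally adjacent rectangular area images. The sketch is an assumption-laden illustration rather than the method of the embodiment: the function `match_brightness`, the parameters `overlap` and `skip`, and the use of the ratio of mean luminance values are choices made for the example, and the overlap is assumed to have a nonzero mean and to be wider than twice `skip`.

```python
import numpy as np

def match_brightness(reference, neighbor, overlap, skip=2):
    """Scale `neighbor` (lying to the right of `reference`) via their overlap.

    Hypothetical sketch: both tiles are 2-D arrays sharing `overlap` columns.
    """
    # The same scene columns appear at the right edge of the reference
    # tile and at the left edge of the neighboring tile.
    ref_strip = reference[:, -overlap:]
    nb_strip = neighbor[:, :overlap]
    # Optionally ignore a few peripheral pixels of the overlapping area.
    if skip > 0:
        ref_strip = ref_strip[skip:-skip, skip:-skip]
        nb_strip = nb_strip[skip:-skip, skip:-skip]
    # Ratio of luminance values between the adjacent images.
    gain = ref_strip.mean() / nb_strip.mean()
    return neighbor * gain, gain
```

Repeating this from the reference image outward to each neighbor in turn would propagate a consistent luminance scale across all rectangular area images.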

上記のように、本実施形態に係る高画質化部４０４は、入力画像を所定の画像サイズの複数の矩形領域画像（第３の画像）Ｒ１４１１～Ｒ１４２６に分割する。その後、高画質化部４０４は、分割した複数の矩形領域画像Ｒ１４１１～Ｒ１４２６を高画質化エンジンに入力して複数の第４の画像を生成し、複数の第４の画像を統合することで、高画質画像を生成する。なお、統合時に矩形領域群間で位置関係が重なる場合には、該矩形領域群の画素値群を統合したり、上書きしたりすることができる。 As described above, the image quality improvement unit 404 according to this embodiment divides the input image into a plurality of rectangular area images (third images) R1411 to R1426 of a predetermined image size. The image quality improvement unit 404 then inputs the divided rectangular area images R1411 to R1426 to an image quality improvement engine to generate a plurality of fourth images, and integrates the plurality of fourth images to generate a high-quality image. Note that if the rectangular area groups overlap in positional relationship during integration, the pixel values of the rectangular area groups can be integrated or overwritten.

これにより、本実施形態の高画質化部４０４は、第１の実施形態では対処できなかった画像サイズの入力画像であっても、高画質化エンジンによって高画質化して高画質画像を生成することができる。また、教師データを、低画質画像及び高画質画像を所定の画像サイズに分割した複数の画像から作成すると、少ない画像から多くの教師データを作成することができる。そのため、この場合には、教師データを作成するための低画質画像及び高画質画像の数を少なくすることができる。 As a result, the image quality improvement unit 404 of this embodiment can use the image quality improvement engine to improve the image quality of input images of image sizes that could not be handled in the first embodiment, and generate high-quality images. Furthermore, if the teacher data is created from multiple images obtained by dividing low-quality images and high-quality images into a predetermined image size, a large amount of teacher data can be created from a small number of images. Therefore, in this case, the number of low-quality images and high-quality images used to create the teacher data can be reduced.

＜第７の実施形態＞ <Seventh Embodiment>
次に、図１５～１７を参照して、第７の実施形態に係る画像処理装置について説明する。本実施形態では、画質評価部が、検者の指示に応じて、複数の高画質化エンジンから出力された複数の高画質画像のうち最も高画質な画像を選択する。 Next, an image processing apparatus according to a seventh embodiment will be described with reference to Figures 15 to 17. In this embodiment, an image quality evaluation unit selects the image with the highest image quality from among multiple high-image-quality images output from multiple image quality improvement engines in response to an instruction from an examiner.

特に明記しない限り、本実施形態に係る画像処理装置の構成及び処理は、第１の実施形態に係る画像処理装置４００と同様である。そのため、以下では、本実施形態に係る画像処理装置について、第１の実施形態に係る画像処理装置との違いを中心として説明する。 Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as those of the image processing device 400 according to the first embodiment. Therefore, the following description of the image processing device according to this embodiment will focus on the differences from the image processing device according to the first embodiment.

図１５は、本実施形態に係る画像処理装置１５００の概略的な構成を示す。本実施形態に係る画像処理装置１５００には、取得部４０１、撮影条件取得部４０２、高画質化可否判定部４０３、高画質化部４０４、及び出力部４０５に加えて、画質評価部１５０６が設けられている。なお、画像処理装置１５００は、これら構成要素のうちの一部が設けられた複数の装置で構成されてもよい。ここで、取得部４０１、撮影条件取得部４０２、高画質化可否判定部４０３、高画質化部４０４、及び出力部４０５は、第１の実施形態に係る画像処理装置の構成と同様であるため、図４に示す構成について同一の参照符号を用いて示し、説明を省略する。 Figure 15 shows a schematic configuration of an image processing device 1500 according to this embodiment. In addition to the acquisition unit 401, the shooting condition acquisition unit 402, the image quality improvement possibility determination unit 403, the image quality improvement unit 404, and the output unit 405, the image quality evaluation unit 1506 is provided in the image processing device 1500 according to this embodiment. Note that the image processing device 1500 may be composed of multiple devices provided with some of these components. Here, the acquisition unit 401, the shooting condition acquisition unit 402, the image quality improvement possibility determination unit 403, the image quality improvement unit 404, and the output unit 405 are the same as the configuration of the image processing device according to the first embodiment, so the same reference numerals are used for the configuration shown in Figure 4 and the description will be omitted.

また、画像処理装置１５００は、第１の実施形態に係る画像処理装置４００と同様に撮影装置１０、表示部２０及び不図示の他の装置と、任意の回路やネットワークを介して接続されてよい。また、これらの装置は、他の任意の装置と回路やネットワークを介して接続されてもよいし、他の任意の装置と一体的に構成されてもよい。なお、これらの装置は本実施形態では別個の装置とされているが、これらの装置の一部又は全部を一体的に構成してもよい。 Furthermore, the image processing device 1500 may be connected to the imaging device 10, the display unit 20, and other devices (not shown) via any circuit or network, similar to the image processing device 400 according to the first embodiment. Furthermore, these devices may be connected to any other device via a circuit or network, or may be configured integrally with any other device. Note that, although these devices are separate devices in this embodiment, some or all of these devices may be configured integrally.

本実施形態に係る高画質化部４０４には、それぞれ異なる教師データを用いて機械学習が行われた二つ以上の高画質化エンジンが備えられている。ここで、本実施形態に係る教師データ群の作成方法について説明する。具体的には、まず、様々な撮影条件によって撮影された、低画質画像である入力データと高画質画像である出力データのペア群を用意する。次に、任意の撮影条件の組み合わせによってペア群をグルーピングすることで、教師データ群を作成する。例えば、第１の撮影条件の組み合わせによって取得されたペア群で構成される第１の教師データ、第２の撮影条件の組み合わせによって取得されたペア群で構成される第２の教師データというように、教師データ群として作成する。 The image quality improvement unit 404 according to this embodiment is equipped with two or more image quality improvement engines that have undergone machine learning using different teacher data. Here, a method for creating a teacher data group according to this embodiment will be described. Specifically, first, a group of pairs of input data, which is a low-quality image, and output data, which is a high-quality image, captured under various shooting conditions is prepared. Next, a teacher data group is created by grouping the pair groups according to any combination of shooting conditions. For example, a teacher data group is created in such a way that a first teacher data group is composed of a pair group acquired by a first combination of shooting conditions, and a second teacher data group is composed of a pair group acquired by a second combination of shooting conditions.
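The grouping of pairs by a combination of shooting conditions can be sketched as follows. This is a minimal illustration: the function `group_training_pairs`, the dictionary layout of each pair, and the condition names `"site"` and `"angle_of_view"` are assumptions introduced for the example.

```python
from collections import defaultdict

def group_training_pairs(pairs, keys=("site", "angle_of_view")):
    """Group (low, high) pairs by the chosen combination of conditions.

    Hypothetical layout: each pair is a dict with "low", "high" and a
    "conditions" dict holding the shooting conditions of that pair.
    """
    groups = defaultdict(list)
    for pair in pairs:
        # The tuple of condition values identifies one teacher data set.
        combo = tuple(pair["conditions"][k] for k in keys)
        groups[combo].append((pair["low"], pair["high"]))
    return dict(groups)
```

Each resulting group would then serve as the teacher data for one image quality improvement engine.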

その後、各教師データを用いて別々の高画質化エンジンに機械学習を行わせる。例えば、第１の教師データでトレーニングされた機械学習モデルに対応する第１の高画質化エンジン、第２の教師データでトレーニングされた機械学習モデルに対応する第２の高画質化エンジンというように高画質化エンジン群を用意する。 Then, each set of teacher data is used to train a separate image quality improvement engine by machine learning. For example, a group of image quality improvement engines is prepared, such as a first image quality improvement engine corresponding to a machine learning model trained with the first teacher data, and a second image quality improvement engine corresponding to a machine learning model trained with the second teacher data.

このような高画質化エンジンは、それぞれ対応する機械学習モデルのトレーニングに用いた教師データが異なるため、高画質化エンジンに入力される画像の撮影条件によって、入力画像を高画質化できる程度が異なる。具体的には、第１の高画質化エンジンは、第１の撮影条件の組み合わせで撮影して取得された入力画像に対しては高画質化の程度が高く、第２の撮影条件の組み合わせで撮影して取得された画像に対しては高画質化の程度が低い。同様に、第２の高画質化エンジンは、第２の撮影条件で撮影して取得された入力画像に対しては高画質化の程度が高く、第１の撮影条件で撮影して取得された画像に対しては高画質化の程度が低い。 Since each of these image quality improvement engines uses different teacher data to train the corresponding machine learning model, the degree to which the image quality of the input image can be improved varies depending on the shooting conditions of the image input to the image quality improvement engine. Specifically, the first image quality improvement engine provides a high degree of image quality improvement for input images captured and acquired under the first combination of shooting conditions, and a low degree of image quality improvement for images captured and acquired under the second combination of shooting conditions. Similarly, the second image quality improvement engine provides a high degree of image quality improvement for input images captured and acquired under the second shooting conditions, and a low degree of image quality improvement for images captured and acquired under the first shooting conditions.

教師データのそれぞれが撮影条件の組み合わせによってグルーピングされたペア群で構成されることにより、該ペア群を構成する画像群の画質傾向が似る。このため、高画質化エンジンは対応する撮影条件の組み合わせであれば、第１の実施形態に係る高画質化エンジンよりも効果的に高画質化を行うことができる。なお、教師データのペアをグルーピングするための撮影条件の組み合わせは、任意であってよく、例えば、撮影部位、撮影画角、及び画像の解像度のうちの二つ以上の組み合わせであってよい。また、教師データのグルーピングを、第２の実施形態と同様に、一つの撮影条件に基づいて行ってもよい。 Because each set of teacher data consists of pairs grouped by a combination of shooting conditions, the images in each pair group have similar image quality tendencies. Therefore, for the corresponding combination of shooting conditions, each image quality improvement engine can improve image quality more effectively than the image quality improvement engine according to the first embodiment. Note that the combination of shooting conditions used to group the teacher data pairs may be arbitrary, and may be, for example, a combination of two or more of the shooting part, the shooting angle of view, and the image resolution. The teacher data may also be grouped based on a single shooting condition, as in the second embodiment.

画質評価部１５０６は、高画質化部４０４が、複数の高画質化エンジンを用いて生成した複数の高画質画像について、検者の指示に応じて、最も画質の高い高画質画像を選択する。 The image quality evaluation unit 1506 selects, in accordance with the examiner's instructions, the high-quality image with the highest image quality from among the multiple high-quality images generated by the image quality improvement unit 404 using multiple image quality improvement engines.

出力部４０５は、画質評価部１５０６が選択した高画質画像を表示部２０に表示させたり、他の装置に出力したりすることができる。なお、出力部４０５は、高画質化部４０４が生成した複数の高画質画像を表示部２０に表示させることができ、画質評価部１５０６は、表示部２０を確認した検者からの指示に応じて最も画質の高い高画質画像を選択することができる。 The output unit 405 can display the high-quality image selected by the image quality evaluation unit 1506 on the display unit 20 or output it to another device. The output unit 405 can display multiple high-quality images generated by the image quality improvement unit 404 on the display unit 20, and the image quality evaluation unit 1506 can select the high-quality image with the highest image quality in response to instructions from the examiner who checks the display unit 20.

これにより、画像処理装置１５００は、複数の高画質化エンジンを用いて生成された複数の高画質画像のうち、検者の指示に応じた最も画質の高い高画質画像を出力することができる。 This allows the image processing device 1500 to output the highest quality image in accordance with the examiner's instructions from among multiple high quality images generated using multiple image quality improvement engines.

以下、図１６及び１７を参照して、本実施形態に係る一連の画像処理について説明する。図１６は、本実施形態に係る一連の画像処理のフロー図である。なお、本実施形態に係るステップＳ１６１０及びステップＳ１６２０の処理は、第１の実施形態におけるステップＳ５１０及びステップＳ５２０での処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ１６２０の処理の後に、ステップＳ１６３０の処理を省き、処理をステップＳ１６４０に移行してよい。 Below, a series of image processing according to this embodiment will be described with reference to Figs. 16 and 17. Fig. 16 is a flow diagram of a series of image processing according to this embodiment. Note that the processing of steps S1610 and S1620 according to this embodiment is similar to the processing of steps S510 and S520 in the first embodiment, and therefore a description thereof will be omitted. Note that, when improving the image quality of an input image unconditionally with respect to the shooting conditions, after the processing of step S1620, the processing of step S1630 may be omitted and the processing may proceed to step S1640.

ステップＳ１６２０において、第１の実施形態と同様に、撮影条件取得部４０２が入力画像の撮影条件群を取得したら、処理はステップＳ１６３０に移行する。ステップＳ１６３０では、高画質化可否判定部４０３が、第２の実施形態と同様に、取得された撮影条件群を用いて、高画質化部４０４に備える高画質化エンジンのいずれかが入力画像を対処可能であるか否かを判定する。 In step S1620, as in the first embodiment, the shooting condition acquisition unit 402 acquires a group of shooting conditions for the input image, and the process proceeds to step S1630. In step S1630, as in the second embodiment, the image quality improvement feasibility determination unit 403 uses the acquired group of shooting conditions to determine whether or not any of the image quality improvement engines included in the image quality improvement unit 404 can handle the input image.

高画質化可否判定部４０３が、高画質化エンジン群のいずれも入力画像を対処不可能であると判定した場合には、処理はステップＳ１６６０に移行する。一方で、高画質化可否判定部４０３が、高画質化エンジン群のいずれかが入力画像を対処可能であると判定した場合には、処理はステップＳ１６４０に移行する。なお、画像処理装置４００の設定や実装形態によっては、第１の実施形態と同様に、高画質化エンジンによって一部の撮影条件が対処不可能であると判定されたとしても、ステップＳ１６４０を実施してもよい。 If the image quality improvement capability determination unit 403 determines that none of the image quality improvement engines can handle the input image, the process proceeds to step S1660. On the other hand, if the image quality improvement capability determination unit 403 determines that any of the image quality improvement engines can handle the input image, the process proceeds to step S1640. Note that, depending on the settings and implementation form of the image processing device 400, step S1640 may be performed, as in the first embodiment, even if the image quality improvement engines have determined that some shooting conditions cannot be handled.

ステップＳ１６４０においては、高画質化部４０４が、高画質化エンジン群のそれぞれにステップＳ１６１０において取得した入力画像を入力し、高画質画像群を生成する。 In step S1640, the image quality improvement unit 404 inputs the input image acquired in step S1610 to each of the image quality improvement engines, and generates a group of high-image-quality images.

ステップＳ１６５０では、画質評価部１５０６が、ステップＳ１６４０において生成された高画質画像群のうち最も高画質な画像を選択する。具体的には、まず、出力部４０５が、ステップＳ１６４０で生成された高画質画像群を、表示部２０のユーザーインターフェースに表示させる。 In step S1650, the image quality evaluation unit 1506 selects the image with the highest image quality from the group of high-image-quality images generated in step S1640. Specifically, the output unit 405 first displays the group of high-image-quality images generated in step S1640 on the user interface of the display unit 20.

ここで、図１７に当該インターフェースの一例を示す。当該インターフェースには、入力画像Ｉｍ１７１０、及び高画質化エンジン群のそれぞれが出力した高画質画像Ｉｍ１７２０，Ｉｍ１７３０，Ｉｍ１７４０，Ｉｍ１７５０のそれぞれが表示される。検者は不図示の任意の入力装置を操作して、画像群（高画質画像Ｉｍ１７２０～Ｉｍ１７５０）のうち、最も高画質、つまり、最も画像診断に適した画像を指示する。なお、高画質化エンジンによって高画質化していない入力画像の方が、画像診断に適している可能性もあるので、検者による指示の対象となる画像群に入力画像を加えてもよい。 Here, FIG. 17 shows an example of the interface. The interface displays input image Im1710, and high-quality images Im1720, Im1730, Im1740, and Im1750 output by each of the image quality improvement engines. The examiner operates any input device (not shown) to specify the image with the highest image quality, that is, the image most suitable for image diagnosis, from the image group (high-quality images Im1720 to Im1750). Note that since an input image that has not been improved in image quality by the image quality improvement engine may be more suitable for image diagnosis, the input image may be added to the image group that is the subject of the examiner's specification.

その後、画質評価部１５０６は、検者によって指示された高画質画像を最も高画質な画像として選択する。 The image quality evaluation unit 1506 then selects the high-quality image indicated by the examiner as the image with the highest image quality.

ステップＳ１６６０においては、出力部４０５が、ステップＳ１６５０において選択された画像を表示部２０に表示させたり、他の装置に出力したりする。ただし、ステップＳ１６３０において、入力画像が処理不可能であると判定されている場合には、出力部４０５は、入力画像を出力画像として出力する。なお、出力部４０５は、検者によって入力画像が指示された場合や、入力画像が処理不可能であった場合には、表示部２０に出力画像が入力画像と同じであることを表示させてもよい。 In step S1660, the output unit 405 displays the image selected in step S1650 on the display unit 20 or outputs it to another device. However, if it is determined in step S1630 that the input image cannot be processed, the output unit 405 outputs the input image as the output image. Note that if an input image is specified by the examiner or if the input image cannot be processed, the output unit 405 may display on the display unit 20 that the output image is the same as the input image.
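The flow from step S1630 to step S1660 can be summarized in a short sketch. It is illustrative only: the function `process_image`, the representation of each engine as a `(can_handle, enhance)` pair of callables, and the callback `choose` standing in for the examiner's instruction are assumptions made for the example. The input image is included among the selectable images, as the text permits.

```python
def process_image(image, conditions, engines, choose):
    """Return (output_image, output_is_input) for one input image.

    Hypothetical sketch: `engines` is a list of (can_handle, enhance)
    callables and `choose` returns the index picked by the examiner.
    """
    # S1630: can any engine handle these shooting conditions?
    if not any(can_handle(conditions) for can_handle, _ in engines):
        # S1660: the input image itself becomes the output image.
        return image, True
    # S1640: feed the input image to every engine.
    candidates = [enhance(image) for _, enhance in engines]
    # S1650: the examiner picks among the input image and the candidates.
    options = [image] + candidates
    index = choose(options)
    # S1660: report whether the output is the same as the input.
    return options[index], index == 0
```

The boolean in the return value corresponds to the optional indication on the display unit 20 that the output image is the same as the input image.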

上記のように、本実施形態に係る高画質化部４０４は、複数の高画質化エンジンを用いて、入力画像から複数の高画質画像を生成し、画像処理装置１５００の出力部４０５は、検者の指示に応じて、複数の高画質画像のうち少なくとも一つの画像を出力する。特に、本実施形態では、出力部４０５は、検者の指示に応じて、最も高画質な画像を出力する。これにより、画像処理装置１５００は、複数の高画質化エンジンを用いて生成された複数の高画質画像のうち、検者の指示に応じた画質の高い高画質画像を出力することができる。 As described above, the image quality improvement unit 404 according to this embodiment generates multiple high-quality images from an input image using multiple image quality improvement engines, and the output unit 405 of the image processing device 1500 outputs at least one of the multiple high-quality images in response to the examiner's instructions. In particular, in this embodiment, the output unit 405 outputs the image with the highest image quality in response to the examiner's instructions. This allows the image processing device 1500 to output the high-quality image with the highest image quality in response to the examiner's instructions from among the multiple high-quality images generated using the multiple image quality improvement engines.

なお、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置１５００に接続される他の装置に出力してもよい。また、高画質化エンジンの教師データの出力データは、第１の実施形態と同様に、重ね合わせ処理を行った高画質画像に限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 The output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 1500, as in the first embodiment. The output data of the teacher data of the image quality improvement engine is not limited to a high-quality image that has been subjected to overlay processing, as in the first embodiment. That is, a high-quality image obtained by performing at least one of a group of processes or imaging methods, such as overlay processing, MAP estimation processing, smoothing filter processing, tone conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing, may be used.

＜第８の実施形態＞ <Eighth Embodiment>
次に、図１５及び１６を参照して、第８の実施形態に係る画像処理装置について説明する。本実施形態では、画質評価部が、画質評価エンジンを用いて、複数の高画質化エンジンから出力された複数の高画質画像のうち最も高画質な画像を選択する。 Next, an image processing device according to an eighth embodiment will be described with reference to Figures 15 and 16. In this embodiment, an image quality evaluation unit uses an image quality evaluation engine to select an image with the highest image quality from among a plurality of high-image-quality images output from a plurality of image quality improvement engines.

特に明記しない限り、本実施形態に係る画像処理装置の構成及び処理は、第７の実施形態に係る画像処理装置１５００と同様である。そのため、以下では、本実施形態に係る画像処理装置について、第７の実施形態に係る画像処理装置との違いを中心として説明する。なお、本実施形態に係る画像処理装置の構成は、第７の実施形態に係る画像処理装置の構成と同様であるため、図１５に示す構成について同一の参照符号を用いて示し、説明を省略する。 Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as the image processing device 1500 according to the seventh embodiment. Therefore, the image processing device according to this embodiment will be described below, focusing on the differences from the image processing device according to the seventh embodiment. Note that since the configuration of the image processing device according to this embodiment is the same as the configuration of the image processing device according to the seventh embodiment, the configuration shown in FIG. 15 is indicated using the same reference numerals and description will be omitted.

本実施形態に係る画質評価部１５０６には、入力された画像の画質を評価する画質評価エンジンが備えられている。画質評価エンジンは入力された画像に対する画質評価指数を出力する。本実施形態に係る画質評価エンジンにおいて画質評価指数を算出する画質評価処理手法は、機械学習アルゴリズムを用いて構築した機械学習モデルを用いる。機械学習モデルをトレーニングする教師データを構成するペアの入力データは、事前に様々な撮影条件によって撮影された低画質画像群と高画質画像群とで構成される画像群である。また、機械学習モデルをトレーニングする教師データを構成するペアの出力データは、例えば、画像診断を行う検者が入力データの画像群のそれぞれについて設定した画質評価指数群である。 The image quality evaluation unit 1506 according to this embodiment is equipped with an image quality evaluation engine that evaluates the image quality of an input image. The image quality evaluation engine outputs an image quality evaluation index for the input image. The image quality evaluation processing method for calculating the image quality evaluation index in the image quality evaluation engine according to this embodiment uses a machine learning model constructed using a machine learning algorithm. The input data of the pairs that constitute the teacher data for training the machine learning model are image groups consisting of a group of low-image-quality images and a group of high-image-quality images captured in advance under various shooting conditions. In addition, the output data of the pairs that constitute the teacher data for training the machine learning model are, for example, a group of image quality evaluation indices set for each of the image groups of the input data by an examiner performing image diagnosis.

次に図１６を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ１６１０、ステップＳ１６２０、ステップＳ１６３０、及びステップＳ１６６０の処理は、第７の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ１６２０の処理の後に、ステップＳ１６３０の処理を省き、処理をステップＳ１６４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 16. Note that the processing steps S1610, S1620, S1630, and S1660 according to this embodiment are similar to those steps in the seventh embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S1620, the processing step S1630 may be omitted and the processing may proceed to step S1640.

ステップＳ１６３０において、第７の実施形態と同様に、高画質化可否判定部４０３が、高画質化エンジン群のいずれかが入力画像を対処可能であると判定した場合には、処理はステップＳ１６４０に移行する。なお、画像処理装置４００の設定や実装形態によっては、第１の実施形態と同様に、高画質化エンジンによって一部の撮影条件が対処不可能であると判定されたとしても、ステップＳ１６４０を実施してもよい。 In step S1630, as in the seventh embodiment, if the image quality improvement capability determination unit 403 determines that one of the image quality improvement engines can handle the input image, the process proceeds to step S1640. Note that, as in the first embodiment, depending on the settings and implementation form of the image processing device 400, step S1640 may be performed even if the image quality improvement engine determines that some shooting conditions cannot be handled by the image quality improvement engine.

ステップＳ１６５０では、画質評価部１５０６が、ステップＳ１６４０において生成された高画質画像群のうち最も高画質な画像を選択する。具体的には、まず、画質評価部１５０６が、ステップＳ１６４０で生成された高画質画像群を、画質評価エンジンに入力する。画質評価エンジンは、入力された各高画質画像について、学習に基づいて、画質評価指数を算出する。画質評価部１５０６は、算出された画質評価指数のうち最も高い画質評価指数が算出された高画質画像を選択する。なお、高画質化エンジンによって高画質化していない入力画像の方が、画像診断に適している可能性もあるので、画質評価部１５０６は、画質評価エンジンに入力画像も入力し、入力画像に対する画質評価指数も選択に加えてもよい。ステップＳ１６６０は、第７の実施形態のステップＳ１６６０と同様であるため説明を省略する。 In step S1650, the image quality evaluation unit 1506 selects the image with the highest image quality from the group of high-quality images generated in step S1640. Specifically, the image quality evaluation unit 1506 first inputs the group of high-quality images generated in step S1640 to the image quality evaluation engine. The image quality evaluation engine calculates an image quality evaluation index for each input high-quality image based on learning. The image quality evaluation unit 1506 selects the high-quality image with the highest image quality evaluation index calculated from the calculated image quality evaluation indexes. Note that since an input image that has not been improved in quality by the image improvement engine may be more suitable for image diagnosis, the image quality evaluation unit 1506 may also input the input image to the image quality evaluation engine and add the image quality evaluation index for the input image to the selection. Step S1660 is similar to step S1660 in the seventh embodiment, and therefore a description thereof will be omitted.
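The selection in step S1650 of this embodiment amounts to taking the candidate with the largest image quality evaluation index. A minimal sketch follows, in which the function `select_best` and the callable `score`, standing in for the trained image quality evaluation engine, are hypothetical names.

```python
def select_best(candidates, score, input_image=None):
    """Return (image, index_value) for the highest-scoring image.

    Hypothetical sketch: `score` plays the role of the image quality
    evaluation engine and returns one number per image.
    """
    pool = list(candidates)
    if input_image is not None:
        # Optionally let the unprocessed input image compete as well.
        pool.append(input_image)
    scores = [score(img) for img in pool]
    # On ties, the earliest image in the pool is kept.
    best = max(range(len(pool)), key=scores.__getitem__)
    return pool[best], scores[best]
```

Passing `input_image` covers the case noted above in which the input image may be more suitable for image diagnosis than any of the processed images.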

上記のように、本実施形態に係る画像処理装置１５００は、高画質画像の画質を評価する画質評価部１５０６を更に備える。高画質化部４０４は、複数の高画質化エンジンを用いて、入力画像から複数の高画質画像を生成し、画像処理装置１５００の出力部４０５は、画質評価部１５０６による評価結果に応じて、複数の高画質画像のうち少なくとも一つの画像を出力する。特に、本実施形態に係る画質評価部１５０６は、所定の評価手法による評価値を学習データとした画質評価エンジンを含む。画質評価部１５０６は、複数の高画質画像のうち、画質評価部１５０６による画質評価エンジンを用いた評価の結果が最も高い高画質画像を選択する。出力部４０５は、画質評価部１５０６によって選択された最も評価値が高い高画質画像を出力する。 As described above, the image processing device 1500 according to this embodiment further includes an image quality evaluation unit 1506 that evaluates the image quality of a high-quality image. The image quality improvement unit 404 generates multiple high-quality images from an input image using multiple image quality improvement engines, and the output unit 405 of the image processing device 1500 outputs at least one of the multiple high-quality images according to the evaluation result of the image quality evaluation unit 1506. In particular, the image quality evaluation unit 1506 according to this embodiment includes an image quality evaluation engine trained using, as learning data, evaluation values obtained by a predetermined evaluation method. From among the multiple high-quality images, the image quality evaluation unit 1506 selects the high-quality image that receives the highest evaluation from the image quality evaluation engine. The output unit 405 outputs the high-quality image with the highest evaluation value selected by the image quality evaluation unit 1506.

これにより、本実施形態に係る画像処理装置１５００では、画質評価エンジンの出力に基づいて、複数の高画質画像から最も画像診断に適した高画質画像を容易に出力することができる。 As a result, the image processing device 1500 according to this embodiment can easily output the high-quality image most suitable for image diagnosis from among multiple high-quality images based on the output of the image quality evaluation engine.

なお、本実施形態では、画質評価部１５０６が画質評価エンジンによって出力される画質評価指数のうち最も高い画質評価指数の高画質画像を選択し、出力部４０５が選択された高画質画像を表示部２０に表示させた。しかしながら、画質評価部１５０６の構成はこれに限られない。例えば、画質評価部１５０６は画質評価エンジンによって出力される画質評価指数のうち上位いくつかの画質評価指数の高画質画像を選択し、出力部４０５が選択された高画質画像を表示部２０に表示させてもよい。また、出力部４０５が、画質評価エンジンによって出力された画質評価指数を対応する高画質画像とともに表示部２０に表示させ、画質評価部１５０６が検者の指示に応じて、最も高画質な画像を選択してもよい。 In this embodiment, the image quality evaluation unit 1506 selects the high-quality image with the highest image quality evaluation index among the indices output by the image quality evaluation engine, and the output unit 405 displays the selected high-quality image on the display unit 20. However, the configuration of the image quality evaluation unit 1506 is not limited to this. For example, the image quality evaluation unit 1506 may select the high-quality images with the top several image quality evaluation indices, and the output unit 405 may display the selected high-quality images on the display unit 20. Alternatively, the output unit 405 may display each image quality evaluation index output by the image quality evaluation engine together with the corresponding high-quality image on the display unit 20, and the image quality evaluation unit 1506 may select the image with the highest image quality in accordance with the examiner's instructions.
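The selection described for step S1650 and its top-several variation can be sketched as follows. This is a minimal illustration, not the patented implementation: `rank_by_quality` and `score_fn` are hypothetical names, and `score_fn` merely stands in for the learned image quality evaluation engine. Passing the unprocessed input image as one of the candidates corresponds to the option of including the input image in the selection.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

Image = TypeVar("Image")


def rank_by_quality(
    candidates: Sequence[Image],
    score_fn: Callable[[Image], float],
    top_k: int = 1,
) -> List[Tuple[int, float]]:
    """Return (candidate index, quality index) pairs for the top_k candidates.

    score_fn is a stand-in (assumption) for the image quality evaluation
    engine: it maps one image to one scalar image quality evaluation index.
    """
    if top_k < 1:
        raise ValueError("top_k must be at least 1")
    scored = [(score_fn(image), index) for index, image in enumerate(candidates)]
    # Highest index first; ties are broken by the original candidate order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(index, score) for score, index in scored[:top_k]]


def toy_score(image: Sequence[float]) -> float:
    """Toy quality index (assumption): less neighbor-to-neighbor variation scores higher."""
    return -sum(abs(a - b) for a, b in zip(image, image[1:]))


# Candidate 0 plays the role of the input image, the others of enhanced images.
candidates = [[0, 9, 0, 9], [4, 5, 4, 5], [3, 6, 3, 6]]
best = rank_by_quality(candidates, toy_score, top_k=1)
top_two = rank_by_quality(candidates, toy_score, top_k=2)
```

With `top_k=1` the function reproduces the behavior of this embodiment; a larger `top_k` yields the list of images that could be shown together with their indices so that the examiner makes the final choice.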

＜第９の実施形態＞ Ninth embodiment
次に、図１８及び１９を参照して、第９の実施形態に係る画像処理装置について説明する。本実施形態では、真贋評価部が、真贋評価エンジンを用いて、高画質化部４０４によって生成された高画質画像が十分に高画質化されたものであるか否かを評価する。 Next, an image processing device according to a ninth embodiment will be described with reference to Figures 18 and 19. In this embodiment, an authenticity evaluation unit uses an authenticity evaluation engine to evaluate whether or not the high-quality image generated by the image quality improvement unit 404 has been sufficiently improved in quality.

図１８は、本実施形態に係る画像処理装置１８００の概略的な構成を示す。本実施形態に係る画像処理装置１８００には、取得部４０１、撮影条件取得部４０２、高画質化可否判定部４０３、高画質化部４０４、及び出力部４０５に加えて、真贋評価部１８０７が設けられている。なお、画像処理装置１８００は、これら構成要素のうちの一部が設けられた複数の装置で構成されてもよい。ここで、取得部４０１、撮影条件取得部４０２、高画質化可否判定部４０３、高画質化部４０４、及び出力部４０５は、第１の実施形態に係る画像処理装置の構成と同様であるため、図４に示す構成について同一の参照符号を用いて示し、説明を省略する。 Figure 18 shows a schematic configuration of an image processing device 1800 according to this embodiment. In addition to the acquisition unit 401, the shooting condition acquisition unit 402, the image quality improvement possibility determination unit 403, the image quality improvement unit 404, and the output unit 405, the image processing device 1800 according to this embodiment is provided with an authenticity evaluation unit 1807. Note that the image processing device 1800 may be composed of multiple devices provided with some of these components. Here, the acquisition unit 401, the shooting condition acquisition unit 402, the image quality improvement possibility determination unit 403, the image quality improvement unit 404, and the output unit 405 are the same as the configuration of the image processing device according to the first embodiment, so the configuration shown in Figure 4 is indicated using the same reference numerals and description will be omitted.

また、画像処理装置１８００は、第１の実施形態に係る画像処理装置４００と同様に撮影装置１０、表示部２０及び不図示の他の装置と、任意の回路やネットワークを介して接続されてよい。また、これらの装置は、他の任意の装置と回路やネットワークを介して接続されてもよいし、他の任意の装置と一体的に構成されてもよい。なお、これらの装置は本実施形態では別個の装置とされているが、これらの装置の一部又は全部を一体的に構成してもよい。 Furthermore, the image processing device 1800 may be connected to the imaging device 10, the display unit 20, and other devices (not shown) via any circuit or network, similar to the image processing device 400 according to the first embodiment. Furthermore, these devices may be connected to any other device via a circuit or network, or may be configured integrally with any other device. Note that, although these devices are separate devices in this embodiment, some or all of these devices may be configured integrally.

真贋評価部１８０７には、真贋評価エンジンが備えられている。真贋評価部１８０７は、真贋評価エンジンを用いて、高画質化エンジンが生成した高画質画像が十分に高画質化されているか否かを評価する。本実施形態に係る真贋評価エンジンにおける真贋評価処理手法は、機械学習アルゴリズムを用いて構築した機械学習モデルを用いる。 The authenticity evaluation unit 1807 is equipped with an authenticity evaluation engine. The authenticity evaluation unit 1807 uses the authenticity evaluation engine to evaluate whether the high-quality image generated by the image quality improvement engine has been sufficiently improved in quality. The authenticity evaluation processing method in the authenticity evaluation engine according to this embodiment uses a machine learning model constructed using a machine learning algorithm.

機械学習モデルをトレーニングする教師データには、事前に様々な撮影条件によって撮影された高画質画像群と対象の撮影装置によって撮影され取得されたことを表すラベル（以下、真作ラベル）とのペア群が含まれる。また、教師データには、高画質化の精度の悪い高画質化エンジンに低画質画像を入力して生成した高画質画像群と対象の撮影装置によって撮影され取得されていないことを表すラベル（以下、贋作ラベル）とのペア群が含まれる。 The training data for training the machine learning model includes pairs of high-quality images taken in advance under various shooting conditions and labels indicating that the images were taken and acquired with the target imaging device (hereinafter, genuine labels). The training data also includes pairs of high-quality images generated by inputting low-quality images into an image-enhancing engine with poor image-enhancing accuracy and labels indicating that the images were not taken and acquired with the target imaging device (hereinafter, counterfeit labels).

このような教師データを用いて学習が行われた真贋評価エンジンは、入力された画像に対し、確実に撮影装置によって撮影され取得された画像か否かを評価できるわけではないが、撮影装置によって撮影され取得された画像らしさを持つ画像か否かを評価できる。この特性を利用して、真贋評価部１８０７は、真贋評価エンジンに高画質化部４０４が生成した高画質画像を入力することで、高画質化部４０４が生成した高画質画像が十分に高画質化されているか否かを評価できる。 An authenticity evaluation engine trained with such training data cannot determine with certainty whether an input image was actually captured and acquired by the imaging device, but it can evaluate whether the image looks like one captured and acquired by the imaging device. Using this property, the authenticity evaluation unit 1807 inputs the high-quality image generated by the image quality improvement unit 404 to the authenticity evaluation engine and can thereby evaluate whether that image has been sufficiently improved in quality.

出力部４０５は、真贋評価部１８０７によって高画質化部４０４が生成した高画質画像が十分に高画質化されていると判断されたら、当該高画質画像を表示部２０に表示させる。一方、出力部４０５は、真贋評価部１８０７によって、高画質化部４０４が生成した高画質画像が十分に高画質化されていないと判断されたら、入力画像を表示部２０に表示させる。なお、出力部４０５は、入力画像を表示させる際に、高画質化部４０４によって生成された高画質画像が十分に高画質化されなかったことや表示されている画像が入力画像であることを表示部２０に表示させることができる。 When the authenticity evaluation unit 1807 determines that the high-quality image generated by the image quality improvement unit 404 has been sufficiently improved, the output unit 405 displays the high-quality image on the display unit 20. On the other hand, when the authenticity evaluation unit 1807 determines that the high-quality image generated by the image quality improvement unit 404 has not been sufficiently improved, the output unit 405 displays the input image on the display unit 20. Note that, when displaying the input image, the output unit 405 can display on the display unit 20 that the high-quality image generated by the image quality improvement unit 404 has not been sufficiently improved or that the image being displayed is the input image.

以下、図１９を参照して、本実施形態に係る一連の画像処理について説明する。図１９は、本実施形態に係る一連の画像処理のフロー図である。なお、本実施形態に係るステップＳ１９１０～ステップＳ１９４０の処理は、第１の実施形態におけるステップＳ５１０～ステップＳ５４０での処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ１９２０の処理の後に、ステップＳ１９３０の処理を省き、処理をステップＳ１９４０に移行してよい。 A series of image processing steps according to this embodiment will be described below with reference to FIG. 19. FIG. 19 is a flow diagram of a series of image processing steps according to this embodiment. Note that the processing steps S1910 to S1940 according to this embodiment are similar to the processing steps S510 to S540 in the first embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S1920, the processing step S1930 may be omitted and the processing may proceed to step S1940.

ステップＳ１９４０において、高画質化部４０４が高画質画像群を生成したら、処理はステップＳ１９５０に移行する。ステップＳ１９５０では、真贋評価部１８０７が、ステップＳ１９４０において生成された高画質画像を真贋評価エンジンに入力し、真贋評価エンジンの出力に基づいて真贋評価を行う。具体的には、真贋評価部１８０７は、真贋評価エンジンから真作ラベル（真）が出力された場合には、生成された高画質画像が十分に高画質化されていると評価する。一方、真贋評価エンジンから贋作ラベル（偽）が出力された場合には、真贋評価部１８０７は、生成された高画質画像が十分に高画質化されていないと評価する。 In step S1940, when the image quality improvement unit 404 generates a group of high-quality images, the process proceeds to step S1950. In step S1950, the authenticity evaluation unit 1807 inputs the high-quality images generated in step S1940 to an authenticity evaluation engine, and performs an authenticity evaluation based on the output of the authenticity evaluation engine. Specifically, if the authenticity evaluation engine outputs a genuine label (genuine), the authenticity evaluation unit 1807 evaluates that the generated high-quality images have been sufficiently improved in image quality. On the other hand, if the authenticity evaluation engine outputs a counterfeit label (fake), the authenticity evaluation unit 1807 evaluates that the generated high-quality images have not been sufficiently improved in image quality.

ステップＳ１９６０においては、出力部４０５が、真贋評価部１８０７によって高画質化部４０４が生成した高画質画像が十分に高画質化されていると判断されたら、当該高画質画像を表示部２０に表示させる。一方、出力部４０５は、真贋評価部１８０７によって、高画質化部４０４が生成した高画質画像が十分に高画質化されていないと判断されたら、入力画像を表示部２０に表示させる。 In step S1960, if the authenticity evaluation unit 1807 determines that the high-quality image generated by the image quality improvement unit 404 has been sufficiently improved in quality, the output unit 405 causes the display unit 20 to display the high-quality image. On the other hand, if the authenticity evaluation unit 1807 determines that the high-quality image generated by the image quality improvement unit 404 has not been sufficiently improved in quality, the output unit 405 causes the display unit 20 to display the input image.
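The decision made in steps S1950 and S1960 can be sketched as below. This is only an illustration under stated assumptions: `choose_display_image`, `DisplayResult`, and `is_genuine_like` are hypothetical names, and `is_genuine_like` stands in for the trained authenticity evaluation engine, returning True for a genuine label and False for a counterfeit label.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class DisplayResult:
    image: Any          # the image handed to the display unit
    is_enhanced: bool   # True if the high-quality image was chosen
    notice: str         # message shown when falling back to the input image


def choose_display_image(
    input_image: Any,
    enhanced_image: Any,
    is_genuine_like: Callable[[Any], bool],
) -> DisplayResult:
    """Pick the image to display based on the authenticity evaluation.

    is_genuine_like is a stand-in (assumption) for the authenticity
    evaluation engine applied to the enhanced image (step S1950).
    """
    if is_genuine_like(enhanced_image):
        # Genuine label: treat the enhancement as sufficient (step S1960).
        return DisplayResult(enhanced_image, True, "")
    # Counterfeit label: fall back to the input image and say so.
    return DisplayResult(
        input_image,
        False,
        "Enhancement judged insufficient; displaying the input image.",
    )


# Toy usage with placeholder images and a toy evaluator (assumptions).
result = choose_display_image("input", "enhanced", lambda image: image == "enhanced")
```

Returning the notice together with the image keeps the fallback visible to the examiner, matching the option of indicating that the displayed image is the input image.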

上記のように、本実施形態に係る画像処理装置１８００は、高画質画像の画質を評価する真贋評価部１８０７を更に備え、真贋評価部１８０７は画像の真贋を評価する真贋評価エンジンを含む。真贋評価エンジンは、高画質化部４０４の高画質化エンジンよりも高画質化処理の精度が低い（悪い）高画質化エンジンによって生成された画像を学習データとした機械学習エンジンを含む。画像処理装置１８００の出力部４０５は、真贋評価部の真贋評価エンジンからの出力が真である場合に、高画質画像を出力する。 As described above, the image processing device 1800 according to this embodiment further includes an authenticity evaluation unit 1807 that evaluates the image quality of a high-quality image, and the authenticity evaluation unit 1807 includes an authenticity evaluation engine that evaluates the authenticity of an image. The authenticity evaluation engine includes a machine learning engine that uses as learning data images generated by an image quality improvement engine that has lower (worse) image quality improvement processing accuracy than the image quality improvement engine of the image quality improvement unit 404. The output unit 405 of the image processing device 1800 outputs a high-quality image when the output from the authenticity evaluation engine of the authenticity evaluation unit is true.

これにより、本実施形態に係る画像処理装置１８００では、検者は十分に高画質化された高画質画像を効率よく確認することができる。 As a result, with the image processing device 1800 according to this embodiment, the examiner can efficiently check high-quality images that have been sufficiently improved.

また、高画質化エンジンの機械学習モデルと真贋評価エンジンの機械学習モデルとを協調させてトレーニングすることによって、双方のエンジンの効率や精度を向上させてもよい。 In addition, the machine learning model of the image quality improvement engine and the machine learning model of the authenticity evaluation engine may be trained in coordination to improve the efficiency and accuracy of both engines.

なお、本実施形態では、高画質化部４０４が一つの高画質画像を生成し、真贋評価部１８０７が生成された一つの高画質画像について評価を行う構成としたが、真贋評価部１８０７の評価はこれに限られない。例えば、第２の実施形態のように、高画質化部４０４が複数の高画質化エンジンを用いて複数の高画質画像を生成する場合には、真贋評価部１８０７が生成された複数の高画質画像の少なくとも一つについて評価を行う構成としてもよい。この場合、例えば真贋評価部１８０７は、生成された複数の高画質画像の全てについて評価を行ってもよいし、複数の高画質画像のうち検者によって指示された画像のみについて評価を行ってもよい。 In this embodiment, the image quality improvement unit 404 generates one high-quality image, and the authenticity evaluation unit 1807 evaluates the generated high-quality image, but the evaluation by the authenticity evaluation unit 1807 is not limited to this. For example, as in the second embodiment, when the image quality improvement unit 404 generates multiple high-quality images using multiple image quality improvement engines, the authenticity evaluation unit 1807 may be configured to evaluate at least one of the multiple high-quality images generated. In this case, for example, the authenticity evaluation unit 1807 may evaluate all of the multiple high-quality images generated, or may evaluate only the image of the multiple high-quality images specified by the examiner.

さらに、出力部４０５は、真贋評価部１８０７による高画質画像が十分に高画質化されているか否かの判断結果を表示部２０に表示させ、検者の指示に応じて、高画質画像を出力してもよい。 Furthermore, the output unit 405 may display on the display unit 20 the result of the judgment made by the authenticity evaluation unit 1807 as to whether or not the high-quality image has been sufficiently improved in quality, and output the high-quality image according to the examiner's instructions.

なお、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置１８００に接続される他の装置に出力してもよい。また、高画質化エンジンの教師データの出力データは、第１の実施形態と同様に、重ね合わせ処理を行った高画質画像に限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 The output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 1800, as in the first embodiment. The output data of the teacher data of the image quality improvement engine is not limited to a high-quality image that has been subjected to overlay processing, as in the first embodiment. That is, a high-quality image obtained by performing at least one of a group of processes or imaging methods, such as overlay processing, MAP estimation processing, smoothing filter processing, tone conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing, may be used.

＜第１０の実施形態＞ Tenth embodiment
次に、図４及び５を参照して、第１０の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が三次元の入力画像を複数の二次元画像に分割して高画質化エンジンに入力し、高画質化エンジンからの出力画像を結合することで三次元の高画質画像を生成する。 Next, an image processing device according to a tenth embodiment will be described with reference to Figures 4 and 5. In this embodiment, an image quality improvement unit divides a three-dimensional input image into a plurality of two-dimensional images, inputs the images to an image quality improvement engine, and combines output images from the image quality improvement engine to generate a three-dimensional high-quality image.

本実施形態に係る取得部４０１は、構造的に連続する二次元画像群で構成された、三次元画像を取得する。具体的には、三次元画像は、例えば、ＯＣＴのＢスキャン像（断層画像）群で構成された三次元ＯＣＴボリューム画像である。また、例えば、アキシャル断層画像群で構成された三次元ＣＴボリューム画像である。 The acquisition unit 401 according to this embodiment acquires a three-dimensional image composed of a group of structurally continuous two-dimensional images. Specifically, the three-dimensional image is, for example, a three-dimensional OCT volume image composed of a group of OCT B-scan images (tomographic images). Also, for example, it is a three-dimensional CT volume image composed of a group of axial tomographic images.

高画質化部４０４には、第１の実施形態と同様に、高画質化エンジンが備えられている。なお、高画質化エンジンの教師データである入力データと出力データのペア群は二次元画像の画像群により構成されている。高画質化部４０４は、取得された三次元画像を複数の二次元画像に分割し、二次元画像毎に高画質化エンジンに入力する。これにより、高画質化部４０４は、複数の二次元の高画質画像を生成することができる。 The image quality improvement unit 404 is equipped with an image quality improvement engine, as in the first embodiment. Note that the pair group of input data and output data, which is the teacher data for the image quality improvement engine, is composed of a group of two-dimensional images. The image quality improvement unit 404 divides the acquired three-dimensional image into multiple two-dimensional images, and inputs each two-dimensional image to the image quality improvement engine. This allows the image quality improvement unit 404 to generate multiple two-dimensional high-quality images.

出力部４０５は、高画質化部４０４によって、三次元画像の各二次元画像について生成された複数の二次元の高画質画像を結合し、三次元の高画質画像を出力する。 The output unit 405 combines the multiple two-dimensional high-quality images generated by the image quality improvement unit 404 for each two-dimensional image of the three-dimensional image, and outputs a three-dimensional high-quality image.

次に、図５を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ５１０～ステップＳ５３０、及びステップＳ５５０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。ただし、ステップＳ５１０では、取得部４０１は三次元画像を取得する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 5. Note that the processing steps S510 to S530 and S550 according to this embodiment are similar to those in the first embodiment, and therefore will not be described. However, in step S510, the acquisition unit 401 acquires a three-dimensional image. Note that, if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S520, the processing step S530 may be omitted and the processing may proceed to step S540.

ステップＳ５３０において、高画質化可否判定部４０３が、高画質化エンジンによって入力画像を対処可能と判定した場合には、処理はステップＳ５４０に移行する。なお、高画質化可否判定部４０３は、三次元画像の撮影条件に基づいて当該判定を行ってもよいし、三次元画像を構成する複数の二次元画像に関する撮影条件に基づいて当該判定を行ってもよい。ステップＳ５４０では、高画質化部４０４が、取得された三次元画像を複数の二次元画像に分割する。高画質化部４０４は、分割した複数の二次元画像のそれぞれを高画質化エンジンに入力し、複数の二次元の高画質画像を生成する。高画質化部４０４は、取得した三次元画像に基づいて、生成した複数の二次元の高画質画像を結合し、三次元の高画質画像を生成する。 In step S530, if the image quality improvement possibility determination unit 403 determines that the input image can be handled by the image quality improvement engine, the process proceeds to step S540. The image quality improvement possibility determination unit 403 may make the determination based on the shooting conditions of the three-dimensional image, or may make the determination based on the shooting conditions for the multiple two-dimensional images that make up the three-dimensional image. In step S540, the image quality improvement unit 404 divides the acquired three-dimensional image into multiple two-dimensional images. The image quality improvement unit 404 inputs each of the multiple divided two-dimensional images to the image quality improvement engine to generate multiple two-dimensional high-quality images. The image quality improvement unit 404 combines the multiple generated two-dimensional high-quality images based on the acquired three-dimensional image to generate a three-dimensional high-quality image.

ステップＳ５５０では、出力部４０５は、生成された三次元の高画質画像を表示部２０に表示させる。なお、三次元の高画質画像の表示態様は任意であってよい。 In step S550, the output unit 405 displays the generated high-quality three-dimensional image on the display unit 20. Note that the display format of the high-quality three-dimensional image may be arbitrary.

上記のように、本実施形態に係る高画質化部４０４は、三次元の入力画像を複数の二次元の画像に分割して高画質化エンジンに入力する。高画質化部４０４は、高画質化エンジンから出力された複数の二次元の高画質画像を結合し、三次元の高画質画像を生成する。 As described above, the image quality improvement unit 404 according to this embodiment divides a three-dimensional input image into multiple two-dimensional images and inputs them to the image quality improvement engine. The image quality improvement unit 404 combines the multiple two-dimensional high-quality images output from the image quality improvement engine to generate a three-dimensional high-quality image.

これにより、本実施形態に係る高画質化部４０４は、二次元画像の教師データを用いて学習が行われた高画質化エンジンを用いて、三次元画像を高画質化することができる。 As a result, the image quality improvement unit 404 according to this embodiment can improve the image quality of a three-dimensional image using an image quality improvement engine that has been trained using teacher data of two-dimensional images.
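The divide, enhance, and recombine procedure of step S540 can be sketched as follows. This is a minimal sketch, assuming the volume is represented as a list of two-dimensional slices (each a list of rows); `enhance_volume` and `enhance_slice` are hypothetical names, and `enhance_slice` stands in for an image quality improvement engine trained on two-dimensional images.

```python
from typing import Callable, List, Sequence

Slice2D = Sequence[Sequence[float]]


def enhance_volume(
    volume: Sequence[Slice2D],
    enhance_slice: Callable[[Slice2D], Slice2D],
) -> List[Slice2D]:
    """Enhance a 3D volume slice by slice and restack in the original order."""
    enhanced_volume: List[Slice2D] = []
    for index, slice_2d in enumerate(volume):
        enhanced = enhance_slice(slice_2d)
        # The recombined volume must keep the geometry of the acquired volume,
        # so reject any slice whose shape was changed by the engine.
        same_shape = len(enhanced) == len(slice_2d) and all(
            len(out_row) == len(in_row)
            for out_row, in_row in zip(enhanced, slice_2d)
        )
        if not same_shape:
            raise ValueError(f"slice {index}: engine changed the slice shape")
        enhanced_volume.append(enhanced)
    return enhanced_volume


def toy_engine(slice_2d: Slice2D) -> Slice2D:
    """Toy engine (assumption): halves every pixel value."""
    return [[value * 0.5 for value in row] for row in slice_2d]


# Two 2x2 slices standing in for B-scans of an OCT volume.
volume = [[[2, 4], [6, 8]], [[10, 12], [14, 16]]]
enhanced = enhance_volume(volume, toy_engine)
```

Appending the results in iteration order is what corresponds to combining the slices "based on the acquired three-dimensional image": each enhanced slice returns to the position of the slice it came from.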

＜第１１の実施形態＞ Eleventh embodiment
次に、図４及び５を参照して、第１１の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が三次元の入力画像を複数の二次元画像に分割し、複数の二次元画像を複数の高画質化エンジンによって並列に高画質化し、高画質化エンジンからの出力画像を結合することで三次元の高画質画像を生成する。 Next, an image processing device according to an eleventh embodiment will be described with reference to Figures 4 and 5. In this embodiment, an image quality improvement unit divides a three-dimensional input image into a plurality of two-dimensional images, improves the image quality of the plurality of two-dimensional images in parallel by a plurality of image quality improvement engines, and generates a high-quality three-dimensional image by combining output images from the image quality improvement engines.

特に明記しない限り、本実施形態に係る画像処理装置の構成及び処理は、第１０の実施形態に係る画像処理装置４００と同様である。そのため、以下では、本実施形態に係る画像処理装置について、第１０の実施形態に係る画像処理装置との違いを中心として説明する。なお、本実施形態に係る画像処理装置の構成は、第１及び１０の実施形態に係る画像処理装置の構成と同様であるため、図４に示す構成について同一の参照符号を用いて示し、説明を省略する。 Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as the image processing device 400 according to the tenth embodiment. Therefore, the image processing device according to this embodiment will be described below, focusing on the differences from the image processing device according to the tenth embodiment. Note that since the configuration of the image processing device according to this embodiment is the same as the configuration of the image processing device according to the first and tenth embodiments, the same reference numerals are used for the configuration shown in FIG. 4, and description thereof will be omitted.

本実施形態に係る高画質化部４０４には、第１０の実施形態と同様の高画質化エンジンが、複数備えられている。なお、高画質化部４０４に備えられた複数の高画質化エンジン群は、回路やネットワークを介して、二つ以上の装置群に分散処理可能なように実装されていてもよいし、単一の装置に実装されていてもよい。 The image quality improvement unit 404 according to this embodiment is provided with a plurality of image quality improvement engines similar to those in the tenth embodiment. The plurality of image quality improvement engines provided in the image quality improvement unit 404 may be implemented in two or more device groups via a circuit or network so as to enable distributed processing, or may be implemented in a single device.

高画質化部４０４は、第１０の実施形態と同様に、取得された三次元画像を複数の二次元画像に分割する。高画質化部４０４は、複数の二次元画像を複数の高画質化エンジンを用いて、分担して（並列的に）高画質化を行い、複数の二次元の高画質画像を生成する。高画質化部４０４は、複数の高画質化エンジンから出力された複数の二次元の高画質画像を、処理対象である三次元画像に基づいて結合し、三次元の高画質画像を生成する。 As in the tenth embodiment, the image quality improvement unit 404 divides the acquired three-dimensional image into multiple two-dimensional images. The image quality improvement unit 404 uses multiple image quality improvement engines to improve the image quality of the multiple two-dimensional images in a shared manner (in parallel), generating multiple two-dimensional high-quality images. The image quality improvement unit 404 combines the multiple two-dimensional high-quality images output from the multiple image quality improvement engines based on the three-dimensional image to be processed, generating a three-dimensional high-quality image.

次に、図５を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ５１０～ステップＳ５３０、及びステップＳ５５０の処理は、第１０の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 5. Note that the processing steps S510 to S530 and S550 according to this embodiment are similar to those steps in the tenth embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S520, the processing step S530 may be omitted and the processing may proceed to step S540.

ステップＳ５３０において、高画質化可否判定部４０３が、高画質化エンジンによって入力画像を対処可能と判定した場合には、処理はステップＳ５４０に移行する。なお、高画質化可否判定部４０３は、三次元画像の撮影条件に基づいて当該判定を行ってもよいし、三次元画像を構成する複数の二次元画像に関する撮影条件に基づいて当該判定を行ってもよい。 In step S530, if the image quality improvement possibility determination unit 403 determines that the input image can be handled by the image quality improvement engine, the process proceeds to step S540. Note that the image quality improvement possibility determination unit 403 may make this determination based on the shooting conditions of the three-dimensional image, or may make this determination based on the shooting conditions for the multiple two-dimensional images that make up the three-dimensional image.

ステップＳ５４０では、高画質化部４０４が、取得された三次元画像を複数の二次元画像に分割する。高画質化部４０４は、分割した複数の二次元画像のそれぞれを複数の高画質化エンジンに入力し、並列的に高画質化処理して、複数の二次元の高画質画像を生成する。高画質化部４０４は、取得した三次元画像に基づいて、生成した複数の二次元の高画質画像を結合し、三次元の高画質画像を生成する。 In step S540, the image quality improvement unit 404 divides the acquired three-dimensional image into a plurality of two-dimensional images. The image quality improvement unit 404 inputs each of the divided two-dimensional images into a plurality of image quality improvement engines, and performs image quality improvement processing in parallel to generate a plurality of two-dimensional high-quality images. The image quality improvement unit 404 combines the generated two-dimensional high-quality images based on the acquired three-dimensional image to generate a three-dimensional high-quality image.

上記のように、本実施形態に係る高画質化部４０４は、複数の高画質化エンジンを含む。高画質化部４０４は、三次元の入力画像を複数の二次元の画像に分割し、複数の高画質化エンジンを並列的に用いて、複数の二次元の高画質画像を生成する。高画質化部４０４は複数の二次元の高画質画像を統合することで、三次元の高画質画像を生成する。 As described above, the image quality improvement unit 404 according to this embodiment includes multiple image quality improvement engines. The image quality improvement unit 404 divides a three-dimensional input image into multiple two-dimensional images, and generates multiple two-dimensional high-quality images using the multiple image quality improvement engines in parallel. The image quality improvement unit 404 generates a three-dimensional high-quality image by integrating the multiple two-dimensional high-quality images.

これにより、本実施形態に係る高画質化部４０４は、二次元画像の教師データを用いて学習が行われた高画質化エンジンを用いて、三次元画像を高画質化することができる。また、第１０の実施形態と比べて、より効率的に三次元画像を高画質化することができる。 As a result, the image quality improvement unit 404 according to this embodiment can improve the image quality of a three-dimensional image using an image quality improvement engine that has been trained using teacher data of two-dimensional images. Furthermore, compared to the tenth embodiment, it is possible to improve the image quality of a three-dimensional image more efficiently.
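The parallel variant can be sketched with one worker per engine. This is an illustration under assumptions, not the patented implementation: the round-robin assignment of slices to engines and the use of threads are choices made for the sketch, and `enhance_volume_parallel` is a hypothetical name. Each engine processes its own share sequentially, so no engine is called from two threads at once.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, List, Sequence, Tuple


def enhance_volume_parallel(
    volume: Sequence[Any],
    engines: Sequence[Callable[[Any], Any]],
) -> List[Any]:
    """Enhance slices with several engines in parallel, keeping slice order."""
    engine_count = len(engines)
    if engine_count == 0:
        raise ValueError("at least one engine is required")

    def run_share(engine_index: int) -> List[Tuple[int, Any]]:
        # Engine k handles slices k, k + n, k + 2n, ... (round-robin, assumption).
        engine = engines[engine_index]
        return [
            (slice_index, engine(volume[slice_index]))
            for slice_index in range(engine_index, len(volume), engine_count)
        ]

    result: List[Any] = [None] * len(volume)
    with ThreadPoolExecutor(max_workers=engine_count) as pool:
        for share in pool.map(run_share, range(engine_count)):
            for slice_index, enhanced in share:
                # Put each enhanced slice back at its original position.
                result[slice_index] = enhanced
    return result


# Toy engines (assumptions) that tag each slice with the engine that handled it.
engines = [lambda s: ("A", s), lambda s: ("B", s)]
combined = enhance_volume_parallel(["s0", "s1", "s2"], engines)
```

In a distributed deployment the thread pool would be replaced by requests to the devices hosting each engine; the index bookkeeping that restores the slice order stays the same.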

なお、複数の高画質化エンジンの教師データは、各高画質化エンジンで処理を行う処理対象に応じて異なる教師データであってもよい。例えば、第１の高画質化エンジンは第１の撮影領域についての教師データで学習を行い、第２の高画質化エンジンは第２の撮影領域についての教師データで学習を行ってもよい。この場合には、それぞれの高画質化エンジンが、より精度良く二次元画像の高画質化を行うことができる。 The training data for the multiple image quality improvement engines may be different training data depending on the processing target to be processed by each image quality improvement engine. For example, a first image quality improvement engine may learn from training data for a first imaging region, and a second image quality improvement engine may learn from training data for a second imaging region. In this case, each image quality improvement engine can improve the image quality of two-dimensional images with greater accuracy.
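When each engine is trained for a different imaging region, the slices have to be routed to the matching engine. A minimal sketch follows, assuming each two-dimensional image arrives with a region label; `enhance_by_region`, the label strings, and the toy engines are all hypothetical.

```python
from typing import Any, Callable, Dict, List, Sequence, Tuple


def enhance_by_region(
    labeled_slices: Sequence[Tuple[str, Any]],
    engines_by_region: Dict[str, Callable[[Any], Any]],
) -> List[Any]:
    """Send each (region, image) pair to the engine trained for that region."""
    enhanced_slices: List[Any] = []
    for region, image in labeled_slices:
        engine = engines_by_region.get(region)
        if engine is None:
            # No engine was trained for this region, so it cannot be handled.
            raise KeyError(f"no engine trained for region {region!r}")
        enhanced_slices.append(engine(image))
    return enhanced_slices


# Toy engines and region labels (assumptions for illustration only).
engines_by_region = {
    "region_1": lambda image: f"{image}+engine_1",
    "region_2": lambda image: f"{image}+engine_2",
}
routed = enhance_by_region(
    [("region_1", "a"), ("region_2", "b"), ("region_1", "c")],
    engines_by_region,
)
```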

また、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置４００に接続される他の装置に出力してもよい。また、高画質化エンジンの教師データの出力データは、第１の実施形態と同様に、重ね合わせ処理を行った高画質画像に限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 In addition, the output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 400, as in the first embodiment. In addition, the output data of the teacher data of the image quality improvement engine is not limited to a high-quality image that has been subjected to overlay processing, as in the first embodiment. In other words, a high-quality image obtained by performing at least one of a group of processes or imaging methods such as overlay processing, MAP estimation processing, smoothing filter processing, gradation conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing may be used.

＜第１２の実施形態＞ Twelfth embodiment
次に、図５及び２０を参照して、第１２の実施形態に係る画像処理装置について説明する。本実施形態では、取得部４０１が撮影装置ではなく画像管理システム２０００から入力画像を取得する。 Next, an image processing device according to a twelfth embodiment will be described with reference to Figures 5 and 20. In this embodiment, the acquisition unit 401 acquires an input image from an image management system 2000 rather than from an imaging device.

特に明記しない限り、本実施形態に係る画像処理装置の構成及び処理は、第１の実施形態に係る画像処理装置４００と同様である。そのため、以下では、本実施形態に係る画像処理装置について、第１の実施形態に係る画像処理装置との違いを中心として説明する。なお、本実施形態に係る画像処理装置の構成は第１の実施形態に係る画像処理装置４００の構成と同様であるため、図４に示す構成について同じ参照符号を用いて説明を省略する。 Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as the image processing device 400 according to the first embodiment. Therefore, the image processing device according to this embodiment will be described below, focusing on the differences from the image processing device according to the first embodiment. Note that since the configuration of the image processing device according to this embodiment is the same as the configuration of the image processing device 400 according to the first embodiment, the same reference symbols are used for the configuration shown in FIG. 4, and the description will be omitted.

図２０は、本実施形態に係る画像処理装置４００の概略的な構成を示す。本実施形態に係る画像処理装置４００は画像管理システム２０００、及び表示部２０と任意の回路やネットワークを介して接続されている。画像管理システム２０００は、任意の撮影装置によって撮影された画像や画像処理された画像を受信して保存する装置及びシステムである。また、画像管理システム２０００は、接続された装置の要求に応じて画像を送信したり、保存された画像に対して画像処理を行ったり、画像処理の要求を他の装置に要求したりすることができる。画像管理システムとしては、例えば、画像保存通信システム（ＰＡＣＳ）を含むことができる。 Figure 20 shows a schematic configuration of an image processing device 400 according to this embodiment. The image processing device 400 according to this embodiment is connected to an image management system 2000 and a display unit 20 via any circuit or network. The image management system 2000 is a device and system that receives and stores images captured by any imaging device or images that have been processed. The image management system 2000 can also transmit images in response to a request from a connected device, perform image processing on stored images, and request image processing from other devices. The image management system can include, for example, a picture archiving and communication system (PACS).

本実施形態に係る取得部４０１は、画像処理装置４００に接続される画像管理システム２０００から入力画像を取得することができる。また、出力部４０５は、高画質化部４０４によって生成された高画質画像を、画像管理システム２０００に出力することができる。 The acquisition unit 401 according to this embodiment can acquire an input image from the image management system 2000 connected to the image processing device 400. In addition, the output unit 405 can output a high-quality image generated by the image quality improvement unit 404 to the image management system 2000.

次に、図５を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ５２０～ステップＳ５４０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 5. Note that the processing steps S520 to S540 according to this embodiment are similar to those steps in the first embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S520, the processing step S530 may be omitted and the processing may proceed to step S540.

ステップＳ５１０において、取得部４０１は、回路やネットワークを介して接続された画像管理システム２０００から、画像管理システム２０００が保存している画像を入力画像として取得する。なお、取得部４０１は、画像管理システム２０００からの要求に応じて、入力画像を取得してもよい。このような要求は、例えば、画像管理システム２０００が画像を保存した時や、保存した画像を他の装置に送信する前、保存された画像を表示部２０に表示する時に発行されてよい。また、当該要求は、例えば、画像管理システム２０００を利用者が操作して高画質化処理の要求を行った時や、画像管理システム２０００が備える画像解析機能に高画質画像を利用する時等に発行されてよい。 In step S510, the acquisition unit 401 acquires an image stored in the image management system 2000 as an input image from the image management system 2000 connected via a circuit or network. The acquisition unit 401 may acquire the input image in response to a request from the image management system 2000. Such a request may be issued, for example, when the image management system 2000 saves an image, before transmitting the saved image to another device, or when displaying the saved image on the display unit 20. The request may also be issued, for example, when a user operates the image management system 2000 to request high image quality processing, or when a high image quality image is used for the image analysis function provided by the image management system 2000.

ステップＳ５２０～ステップＳ５４０の処理は、第１の実施形態における処理と同様である。ステップＳ５４０において高画質化部４０４が高画質画像を生成したら、処理はステップＳ５５０に移行する。ステップＳ５５０において、出力部４０５は、ステップＳ５４０において高画質画像が生成されていれば、該高画質画像を画像管理システム２０００に出力画像として出力する。ステップＳ５４０において高画質画像が生成されていなければ、上記入力画像を画像管理システム２０００に出力画像として出力する。なお、出力部４０５は、画像処理装置４００の設定や実装によっては、出力画像を画像管理システム２０００が利用可能なように加工したり、出力画像のデータ形式を変換したりしてもよい。 The processing of steps S520 to S540 is the same as that in the first embodiment. When the image quality improvement unit 404 generates a high-quality image in step S540, the processing proceeds to step S550. In step S550, if a high-quality image was generated in step S540, the output unit 405 outputs the high-quality image to the image management system 2000 as an output image. If a high-quality image was not generated in step S540, the input image is output to the image management system 2000 as an output image. Depending on the settings and implementation of the image processing device 400, the output unit 405 may process the output image so that it can be used by the image management system 2000, or convert the data format of the output image.
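
The fallback described for step S550 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Image` type, `select_output_image`, and the two stand-in engines are hypothetical names, and an engine that returns `None` is an assumed way of representing "no high-quality image was generated in step S540".

```python
from typing import Callable, List, Optional

# Hypothetical representation: a 2-D grayscale image as nested lists of floats.
Image = List[List[float]]


def select_output_image(
    input_image: Image,
    enhance: Callable[[Image], Optional[Image]],
) -> Image:
    """Return the enhanced image if one was generated, else the input image."""
    high_quality = enhance(input_image)  # stands in for step S540
    if high_quality is not None:
        return high_quality  # step S550: output the high-quality image
    return input_image  # step S550: fall back to the input image


def fake_engine(image: Image) -> Optional[Image]:
    """Stand-in engine (assumption): doubles pixel values, clipped to 1.0."""
    return [[min(1.0, 2.0 * p) for p in row] for row in image]


def unavailable_engine(image: Image) -> Optional[Image]:
    """Stand-in for the case where no high-quality image is generated."""
    return None


enhanced_output = select_output_image([[0.1, 0.6]], fake_engine)
fallback_output = select_output_image([[0.1, 0.6]], unavailable_engine)
```

Any format conversion needed by the image management system 2000 would be applied to the returned image afterwards.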

上記のように、本実施形態に係る取得部４０１は、画像管理システム２０００から入力画像を取得する。このため、本実施形態の画像処理装置４００は、画像管理システム２０００が保存している画像を元に、画像診断に適した高画質画像を、撮影者や被検者の侵襲性を高めたり、労力を増したりすることなく出力することができる。また、出力された高画質画像は画像管理システム２０００に保存されたり、画像管理システム２０００が備えるユーザーインターフェースに表示されたりすることができる。また、出力された高画質画像は、画像管理システム２０００が備える画像解析機能に利用されたり、画像管理システム２０００に接続された他の装置に画像管理システム２０００を介して送信されたりすることができる。 As described above, the acquisition unit 401 according to this embodiment acquires an input image from the image management system 2000. Therefore, the image processing device 400 according to this embodiment can output a high-quality image suitable for image diagnosis based on the image stored in the image management system 2000, without increasing the invasiveness or labor of the photographer or subject. The output high-quality image can be stored in the image management system 2000, or displayed on a user interface provided by the image management system 2000. The output high-quality image can be used for the image analysis function provided by the image management system 2000, or can be transmitted via the image management system 2000 to other devices connected to the image management system 2000.

なお、画像処理装置４００や画像管理システム２０００、表示部２０は、不図示の他の装置と回路やネットワークを介して接続されていてもよい。また、これらの装置は本実施形態では別個の装置とされているが、これらの装置の一部又は全部を一体的に構成してもよい。 The image processing device 400, the image management system 2000, and the display unit 20 may be connected to other devices (not shown) via circuits or networks. In addition, although these devices are separate devices in this embodiment, some or all of these devices may be configured as an integrated unit.

また、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を画像管理システム２０００や画像処理装置４００に接続される他の装置に出力してもよい。 Furthermore, as in the first embodiment, the output unit 405 may output the generated high-quality image to the image management system 2000 or another device connected to the image processing device 400.

＜第１３の実施形態＞ <Thirteenth embodiment>
次に、図４、５、２１Ａ、及び２１Ｂを参照して、第１３の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が複数の画像を入力画像とし、一枚の高画質画像を生成する。 Next, an image processing device according to a thirteenth embodiment will be described with reference to FIGS. 4, 5, 21A, and 21B. In this embodiment, the image quality improvement unit receives a plurality of images as input images and generates one high-quality image.

本実施形態に係る取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして、複数の画像を取得する。 The acquisition unit 401 in this embodiment acquires multiple images from the imaging device 10 or another device as input data to be processed.

本実施形態に係る高画質化部４０４には、第１の実施形態と同様の、高画質化エンジンが備えられている。また、教師データも第１の実施形態と同様であってよい。高画質化部４０４は、取得部４０１で取得された複数の画像のそれぞれを高画質化エンジンに入力し、出力された複数の高画質画像を重ね合わせ処理して、最終的な高画質画像を生成する。なお、高画質化部４０４は、複数の高画質画像を重ね合わせ処理する前に、任意の手法により複数の高画質画像を位置合わせしてよい。 The image quality improvement unit 404 according to this embodiment is equipped with an image quality improvement engine similar to that of the first embodiment. The teacher data may also be similar to that of the first embodiment. The image quality improvement unit 404 inputs each of the multiple images acquired by the acquisition unit 401 to the image quality improvement engine, and performs a process of overlaying the multiple high-quality images output to generate a final high-quality image. Note that the image quality improvement unit 404 may align the multiple high-quality images by any method before performing a process of overlaying the multiple high-quality images.

出力部４０５は、高画質化部４０４が生成した最終的な高画質画像を表示部２０に表示させる。なお、出力部４０５は、最終的な高画質画像とともに、複数の入力画像を表示部２０に表示させてもよい。また、出力部４０５は、生成された複数の高画質画像を最終的な高画質画像や入力画像とともに表示部２０に表示してもよい。 The output unit 405 displays the final high-quality image generated by the image quality improvement unit 404 on the display unit 20. The output unit 405 may display a plurality of input images on the display unit 20 together with the final high-quality image. The output unit 405 may display the generated plurality of high-quality images on the display unit 20 together with the final high-quality image and the input images.

次に、図５及び図２１Ａを参照して、本実施形態に係る一連の画像処理について説明する。図２１Ａは本実施形態に係る高画質化処理のフロー図である。なお、本実施形態に係るステップＳ５１０～ステップＳ５３０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。 Next, a series of image processing steps according to this embodiment will be described with reference to Fig. 5 and Fig. 21A. Fig. 21A is a flow diagram of the image quality improvement processing according to this embodiment. Note that the processing steps S510 to S530 according to this embodiment are similar to those steps in the first embodiment, and therefore will not be described.

ただし、ステップＳ５１０では、取得部４０１は複数の画像を取得し、ステップＳ５２０及びＳ５３０では、複数の画像のそれぞれについて、撮影条件が取得されるとともに、高画質化エンジンによって対処可能か否かが判断される。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。また、複数の画像の一部の画像が、高画質化エンジンによって対処不可能であると判断された場合には、当該画像を以降の処理から除外することができる。 However, in step S510, the acquisition unit 401 acquires multiple images, and in steps S520 and S530, the shooting conditions are acquired for each of the multiple images, and it is determined whether or not the image quality can be improved by the image quality improvement engine. If the image quality of the input image is to be improved unconditionally in terms of the shooting conditions, the process of step S530 may be omitted after the process of step S520, and the process may proceed to step S540. Furthermore, if it is determined that some of the multiple images cannot be handled by the image quality improvement engine, the image may be excluded from subsequent processing.
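
The per-image check in steps S520 and S530, including the exclusion of images the engine cannot handle, might look like the sketch below. The `Acquisition` record, the `"modality"` key, and the `oct_only` rule are all hypothetical; the source does not specify how imaging conditions are represented or which conditions an engine supports.

```python
from typing import Callable, Dict, List, NamedTuple

Image = List[List[float]]


class Acquisition(NamedTuple):
    """Hypothetical pairing of an image with its imaging conditions (S520)."""
    image: Image
    conditions: Dict[str, str]


def keep_processable(
    acquisitions: List[Acquisition],
    can_handle: Callable[[Dict[str, str]], bool],
) -> List[Acquisition]:
    """Drop images the engine cannot handle (S530); keep the rest in order."""
    return [a for a in acquisitions if can_handle(a.conditions)]


def oct_only(conditions: Dict[str, str]) -> bool:
    """Assumed example rule: the engine was trained on OCT images only."""
    return conditions.get("modality") == "OCT"


acquired = [
    Acquisition([[0.1]], {"modality": "OCT"}),
    Acquisition([[0.2]], {"modality": "SLO"}),
    Acquisition([[0.3]], {"modality": "OCT"}),
]
usable = keep_processable(acquired, oct_only)
```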

ステップＳ５３０において、高画質化可否判定部４０３が、複数の入力画像について高画質化エンジンによって対処可能と判定した場合には、処理はステップＳ５４０に移行する。処理がステップＳ５４０に移行すると、図２１Ａに示される本実施形態に係る高画質化処理が開始される。本実施形態に係る高画質化処理では、まず、ステップＳ２１１０において、高画質化部４０４が、複数の入力画像のそれぞれを高画質化エンジンに入力し、高画質画像群を生成する。 If the image quality improvement possibility determination unit 403 determines in step S530 that the multiple input images can be handled by the image quality improvement engine, the process proceeds to step S540. When the process proceeds to step S540, the image quality improvement process according to this embodiment shown in FIG. 21A is started. In the image quality improvement process according to this embodiment, first, in step S2110, the image quality improvement unit 404 inputs each of the multiple input images to the image quality improvement engine to generate a group of high-image-quality images.

次に、ステップＳ２１２０では、高画質化部４０４は、生成した高画質画像群を重ね合わせ処理して最終的な一枚の高画質画像を生成する。なお、重ね合わせ処理は加算平均等平均化の処理やその他の既存の任意の処理によって行われてよい。また、重ね合わせに際しては、高画質化部４０４は複数の高画質画像を任意の手法により位置合わせした上で重ね合わせしてよい。高画質化部４０４が最終的な高画質画像を生成したら、処理はステップＳ５５０に移行する。 Next, in step S2120, the image quality improvement unit 404 superimposes the generated group of high-quality images to generate a single final high-quality image. The superimposition may be performed by averaging, such as taking the arithmetic mean, or by any other existing process. When superimposing, the image quality improvement unit 404 may first align the multiple high-quality images by any method. Once the image quality improvement unit 404 has generated the final high-quality image, the process proceeds to step S550.
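
Steps S2110 and S2120 can be sketched as "enhance each frame, then average". The code below is an illustration under stated assumptions: images are nested lists of equal size that are already aligned, the superimposition is a plain arithmetic mean, and `identity_engine` is a hypothetical placeholder for the trained image quality improvement engine.

```python
from typing import Callable, List, Sequence

Image = List[List[float]]


def average_images(images: Sequence[Image]) -> Image:
    """Pixel-wise arithmetic mean of equally sized, already aligned images."""
    if not images:
        raise ValueError("at least one image is required")
    height, width = len(images[0]), len(images[0][0])
    for img in images:
        if len(img) != height or any(len(row) != width for row in img):
            raise ValueError("all images must have the same size")
    count = len(images)
    return [
        [sum(img[y][x] for img in images) / count for x in range(width)]
        for y in range(height)
    ]


def enhance_then_average(
    inputs: Sequence[Image], engine: Callable[[Image], Image]
) -> Image:
    """S2110: enhance every input image; S2120: superimpose the results."""
    enhanced = [engine(img) for img in inputs]
    return average_images(enhanced)


def identity_engine(image: Image) -> Image:
    """Placeholder engine (assumption): returns a copy of its input."""
    return [list(row) for row in image]


final_image = enhance_then_average(
    [[[0.0, 1.0]], [[1.0, 1.0]]], identity_engine
)
```

A pixel that is dark in one frame and bright in another ends up at an intermediate value, which is the compensation effect the embodiment relies on.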

ステップＳ５５０では、出力部４０５が生成された最終的な高画質画像を表示部２０に表示させる。 In step S550, the output unit 405 causes the display unit 20 to display the generated final high-quality image.

上記のように、本実施形態に係る高画質化部４０４は、複数の入力画像から一つの最終的な高画質画像を生成する。高画質化エンジンによる高画質化は入力画像に基づくため、例えば、病変部等が、ある入力画像において適切に表示されていない場合、当該入力画像を高画質化した高画質画像では低い画素値となってしまう。一方で、同一箇所を撮影した他の入力画像では病変部等が適切に表示されており、当該他の入力画像を高画質化した高画質画像では高い画素値となっている場合もある。そこで、これらの高画質画像を重ね合わせることで、当該低い又は高い画素値となっている箇所を適切に表示できるようになり、高コントラストな高画質画像を生成することができる。なお、入力画像の数は、従来の重ね合わせに必要な枚数よりも少ない数とすることで、従来のような撮影時間の長期化等の代償をより少なくすることができる。 As described above, the image quality improvement unit 404 according to this embodiment generates one final high-quality image from multiple input images. Since the image quality improvement by the image quality improvement engine is based on the input image, for example, if a lesion or the like is not displayed properly in a certain input image, the pixel value will be low in the high-quality image obtained by improving the image quality of the input image. On the other hand, there are cases where the lesion or the like is displayed properly in another input image capturing the same location, and the pixel value is high in the high-quality image obtained by improving the image quality of the other input image. Therefore, by superimposing these high-quality images, it becomes possible to properly display the areas with low or high pixel values, and a high-quality image with high contrast can be generated. In addition, by making the number of input images smaller than the number required for conventional superimposition, the cost of longer imaging time, etc., as in the conventional case, can be reduced.

なお、当該作用については、例えば、ＯＣＴＡ等のモーションコントラストデータを用いた入力画像を用いる場合に顕著となる。 This effect is particularly noticeable when using input images that use motion contrast data such as OCTA.

モーションコントラストデータは、撮影対象の同一箇所を繰り返し撮影した時間間隔における、撮影対象の時間的な変化を検出したものであるため、例えば、ある時間間隔では撮影対象の動きについて僅かな動きしか検出できない場合がある。これに対して、別の時間間隔撮影を行った場合には、撮影対象の動きをより大きな動きとして検出できる場合もある。そのため、それぞれの場合のモーションコントラスト画像を高画質化した画像を重ね合わせることで、特定のタイミングでは生じていなかった又は僅かにしか検出されていなかったモーションコントラストを補間することができる。そのため、このような処理によれば、撮影対象のより多くの動きについてコントラスト強調が行われたモーションコントラスト画像を生成することができ、検者は、撮影対象のより正確な状態を把握することができる。 Motion contrast data is the detection of temporal changes in the subject during time intervals when the same location on the subject is repeatedly photographed. For example, in some time intervals, only slight movement of the subject may be detected. In contrast, when photographs are taken at a different time interval, the movement of the subject may be detected as greater movement. Therefore, by overlaying high-quality images of the motion contrast images from each case, it is possible to interpolate motion contrast that did not occur or was only slightly detected at a particular time. Therefore, this type of processing makes it possible to generate a motion contrast image in which the contrast of more of the subject's movements is enhanced, allowing the examiner to grasp the condition of the subject more accurately.

従って、ＯＣＴＡ画像のように時間的に変化している箇所を描出する画像を入力画像として用いる場合には、異なる時間で取得した高画質画像を重ね合わせることによって、被検者の所定部位をより詳細に画像化することができる。 Therefore, when an image that depicts an area that changes over time, such as an OCTA image, is used as the input image, a specific area of the subject can be imaged in more detail by overlaying high-quality images taken at different times.

なお、本実施形態では、複数の入力画像からそれぞれ高画質画像を生成し、高画質画像を重ね合わせることで、最終的な一枚の高画質画像を生成したが、複数の入力画像から一枚の高画質画像を生成する方法はこれに限られない。例えば、図２１Ｂに示す本実施形態の高画質化処理の別例では、ステップＳ５４０において高画質化処理が開始されると、ステップＳ２１３０において、高画質化部４０４が入力画像群を重ね合わせし、一枚の重ね合わせされた入力画像を生成する。 In this embodiment, a high quality image is generated from each of a plurality of input images, and the high quality images are then superimposed to generate a single final high quality image; however, the method of generating a single high quality image from a plurality of input images is not limited to this. For example, in another example of the image quality improvement process of this embodiment shown in FIG. 21B, when the image quality improvement process is started in step S540, in step S2130, the image quality improvement unit 404 superimposes the input images to generate a single superimposed input image.

その後、ステップＳ２１４０において、高画質化部４０４が、一枚の重ね合わされた入力画像を高画質化エンジンに入力し、一枚の高画質画像を生成する。このような、高画質化処理であっても、上述の高画質化処理と同様に、複数の入力画像について低い又は高い画素値となっている箇所を適切に表示できるようになり、高コントラストな高画質画像を生成することができる。当該処理も、上記ＯＣＴＡ画像等のモーションコントラスト画像を入力画像とした場合に、顕著な作用を奏することができる。 Then, in step S2140, the image quality improvement unit 404 inputs one superimposed input image to an image quality improvement engine to generate one high-quality image. Even with this type of image quality improvement process, as with the image quality improvement process described above, it becomes possible to appropriately display areas with low or high pixel values in multiple input images, and to generate a high-quality image with high contrast. This process can also have a remarkable effect when a motion contrast image such as the OCTA image described above is used as the input image.

なお、当該高画質処理を行う場合には、高画質化エンジンの教師データの入力データとして、処理対象とされる複数の入力画像と同数の入力画像の重ね合わせ画像を用いる。これにより、高画質化エンジンにより適切な高画質化処理を行うことができる。 When this image quality improvement process is performed, the input data of the teacher data for the image quality improvement engine consists of superimposed images, each formed from the same number of input images as the plurality of input images to be processed. This allows the image quality improvement engine to perform appropriate image quality improvement processing.
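
The alternative flow of FIG. 21B (steps S2130 and S2140), together with the requirement that the engine be trained on superimpositions of the same number of frames, can be sketched as below. The `trained_frame_count` parameter and the explicit check are assumptions added for illustration; the source only states that the training inputs use the same number of images.

```python
from typing import Callable, List, Sequence

Image = List[List[float]]


def average_then_enhance(
    inputs: Sequence[Image],
    engine: Callable[[Image], Image],
    trained_frame_count: int,
) -> Image:
    """S2130: superimpose the input images; S2140: enhance the single result."""
    if trained_frame_count < 1:
        raise ValueError("trained_frame_count must be positive")
    if len(inputs) != trained_frame_count:
        # Assumed safeguard: the engine saw averages of exactly this many frames.
        raise ValueError("number of inputs differs from the training setup")
    height, width = len(inputs[0]), len(inputs[0][0])
    averaged = [
        [sum(img[y][x] for img in inputs) / len(inputs) for x in range(width)]
        for y in range(height)
    ]
    return engine(averaged)


def identity_engine(image: Image) -> Image:
    """Placeholder for the trained engine (assumption)."""
    return [list(row) for row in image]


frames = [[[0.0, 4.0]], [[2.0, 4.0]]]
single_result = average_then_enhance(frames, identity_engine, 2)
```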

また、本実施形態による高画質化処理及び上述の別の高画質化処理について、高画質画像群又は入力画像群を組み合わせる処理は、重ね合わせに限られない。例えば、これらの画像群にＭＡＰ推定処理を適用することで一枚の画像を生成してもよい。また、高画質画像群又は入力画像群を合成して一枚の画像を生成してもよい。 In addition, in the image quality improvement process according to this embodiment and the other image quality improvement processes described above, the process of combining the high-image-quality image group or the input image group is not limited to overlay. For example, a single image may be generated by applying a MAP estimation process to these image groups. Also, a single image may be generated by synthesizing the high-image-quality image group or the input image group.

高画質画像群又は入力画像群を合成して一枚の画像を生成する場合としては、例えば、入力画像として高輝度領域について広い階調を有する画像と低輝度領域に広い階調を有する画像を用いる場合がある。この場合には、例えば、高輝度領域について広い階調を有する画像を高画質化した画像と、低輝度領域について広い階調を有する画像を高画質化した画像とを合成する。これにより、より広い明るさの幅（ダイナミックレンジ）を表現できる画像を生成することができる。なお、この場合には、高画質化エンジンの教師データの入力データは、処理対象とされる、高輝度領域について広い階調を有する画像や低輝度領域について広い階調を有する低画質画像とすることができる。また、高画質化エンジンの教師データの出力データは、入力データに対応する高画質画像とすることができる。 One case in which a single image is generated by synthesizing a group of high-quality images or a group of input images is when an image having a wide gradation range in the high-luminance region and an image having a wide gradation range in the low-luminance region are used as the input images. In this case, for example, the image obtained by improving the quality of the image having a wide gradation range in the high-luminance region is synthesized with the image obtained by improving the quality of the image having a wide gradation range in the low-luminance region. This makes it possible to generate an image that can express a wider range of brightness (dynamic range). In this case, the input data of the training data for the image quality improvement engine can be the low-quality images to be processed, that is, an image having a wide gradation range in the high-luminance region or an image having a wide gradation range in the low-luminance region. The output data of the training data can be the high-quality image corresponding to each input.

また、高輝度領域について広い階調を有する画像と、低輝度領域について広い階調を有する画像とを合成し、合成した画像を高画質化エンジンによって高画質化してもよい。この場合にも、より広い明るさの幅を表現できる画像を生成することができる。なお、この場合には、高画質化エンジンの教師データの入力データは、処理対象とされる、高輝度領域について広い階調を有する低画質画像と低輝度領域について広い階調を有する低画質画像を合成した画像とすることができる。また、高画質化エンジンの教師データの出力データは、入力データに対応する高画質画像とすることができる。 Also, an image having a wide range of gradations in high-luminance areas and an image having a wide range of gradations in low-luminance areas may be synthesized, and the synthesized image may be enhanced in image quality by the image quality enhancement engine. In this case, an image capable of expressing a wider range of brightness may be generated. In this case, the input data of the training data for the image quality enhancement engine may be an image obtained by synthesizing a low-quality image having a wide range of gradations in high-luminance areas and a low-quality image having a wide range of gradations in low-luminance areas, which are to be processed. Furthermore, the output data of the training data for the image quality enhancement engine may be a high-quality image corresponding to the input data.

これらの場合には、高画質化エンジンを用いて、より広い明るさの幅を表現できる画像を高画質化することができ、従来と比べてより少ない枚数の画像等で処理を行うことができ、より少ない代償で、画像解析に適した画像を提供することができる。 In these cases, the image quality improvement engine can be used to improve the image quality of images that can express a wider range of brightness, making it possible to process fewer images than before and providing images suitable for image analysis at a lower cost.

なお、高輝度領域について広い階調を有する画像と、低輝度領域について広い階調を有する画像の撮影方法としては、撮影装置の露光時間をより短く又はより長くする等の、任意の方法を採用してよい。また、階調の幅の分け方は、低輝度領域及び高輝度領域に限られず、任意であってよい。 In addition, any method may be used to capture an image having a wide range of gradations in the high-luminance region and an image having a wide range of gradations in the low-luminance region, such as shortening or lengthening the exposure time of the imaging device. Furthermore, the method of dividing the range of gradations is not limited to low-luminance and high-luminance regions, and may be any method.
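
One way to synthesize a short-exposure image (wide gradation in bright regions) with a long-exposure image (wide gradation in dark regions) is sketched below. The source leaves the synthesis method open, so the rule used here is purely an assumption: take the long exposure wherever it is not saturated and rescale it by the exposure ratio, otherwise use the short exposure. The function name, the `saturation` threshold, and the normalized pixel range are all hypothetical.

```python
from typing import List

Image = List[List[float]]


def merge_exposures(
    short_exp: Image,
    long_exp: Image,
    exposure_ratio: float,
    saturation: float = 0.95,
) -> Image:
    """Merge two exposures into one image on the short-exposure scale.

    Assumes pixel values normalized to [0, 1] and that the long exposure
    collected `exposure_ratio` times more light than the short one.
    """
    if exposure_ratio <= 0:
        raise ValueError("exposure_ratio must be positive")
    merged = []
    for row_short, row_long in zip(short_exp, long_exp):
        merged.append([
            # Unsaturated long-exposure pixels carry the finer dark gradation.
            long_px / exposure_ratio if long_px < saturation else short_px
            for short_px, long_px in zip(row_short, row_long)
        ])
    return merged


short_image = [[0.125, 0.5]]
long_image = [[0.5, 1.0]]  # second pixel is saturated in the long exposure
hdr_image = merge_exposures(short_image, long_image, exposure_ratio=4.0)
```

The same function could be applied either to the two enhanced images or to the two raw inputs before a single enhancement pass, matching the two orders described above.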

また、本実施形態に係る高画質化処理において、複数の高画質化エンジンを用いて、複数の入力画像を並列的に処理してもよい。なお、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置４００に接続される他の装置に出力してもよい。また、高画質化エンジンの教師データの出力データは、第１の実施形態と同様に、重ね合わせ処理を行った高画質画像に限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 In addition, in the image quality improvement process according to this embodiment, multiple input images may be processed in parallel using multiple image quality improvement engines. Note that, as in the first embodiment, the output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 400. Also, as in the first embodiment, the output data of the teacher data of the image quality improvement engine is not limited to a high-quality image that has been subjected to overlay processing. In other words, a high-quality image obtained by performing at least one of the following processing groups or imaging methods may be used: overlay processing, MAP estimation processing, smoothing filter processing, tone conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing.

＜第１４の実施形態＞ <Fourteenth embodiment>
次に、図４及び５を参照して、第１４の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が複数の低画質画像から生成された中画質画像を入力画像とし、高画質画像を生成する。 Next, an image processing device according to a fourteenth embodiment will be described with reference to FIGS. 4 and 5. In this embodiment, the image quality improvement unit uses a medium-quality image generated from a plurality of low-quality images as an input image and generates a high-quality image.

本実施形態に係る取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして、複数の低画質画像を重ね合わせ処理した中画質画像を取得する。なお、低画質画像の重ね合わせに際しては、任意の位置合わせ処理が行われてよい。 The acquisition unit 401 according to this embodiment acquires a medium quality image obtained by superimposing multiple low quality images as input data to be processed from the image capture device 10 or another device. When superimposing the low quality images, any positioning process may be performed.

本実施形態に係る高画質化部４０４には、第１の実施形態と同様の、高画質化エンジンが備えられている。ただし、本実施形態の高画質化エンジンは、中程度の画質である中画質画像を入力し、高画質画像を出力するように設計されている。中画質画像とは複数の低画質画像群を重ね合わせして生成された重ね合わせ画像である。また、高画質画像は中画質画像よりも高画質な画像である。また、高画質化エンジンのトレーニングに用いられた教師データを構成するペア群についても、各ペアを構成する入力データは中画質画像と同様にして生成された中画質画像であり、出力データは高画質画像である。 The image quality improvement unit 404 according to this embodiment is equipped with an image quality improvement engine similar to that of the first embodiment. However, the image quality improvement engine of this embodiment is designed to input a medium image quality image, which is of medium image quality, and output a high image quality image. A medium image quality image is a composite image generated by superimposing a group of low image quality images. A high image quality image is an image of higher quality than a medium image quality image. Also, for the group of pairs that constitute the teacher data used to train the image quality improvement engine, the input data that constitutes each pair is a medium image quality image generated in the same manner as the medium image quality image, and the output data is a high image quality image.
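
A training pair for this engine could be assembled as sketched below: the medium-quality input is the average of a few low-quality frames, and the high-quality target is the average of all available frames. Using the all-frame average as the target is an assumption borrowed from the superimposed-image targets mentioned for the first embodiment; `build_training_pair` and `medium_count` are hypothetical names.

```python
from typing import List, Sequence, Tuple

Image = List[List[float]]


def mean_image(frames: Sequence[Image]) -> Image:
    """Pixel-wise mean of equally sized, already aligned frames."""
    height, width = len(frames[0]), len(frames[0][0])
    return [
        [sum(f[y][x] for f in frames) / len(frames) for x in range(width)]
        for y in range(height)
    ]


def build_training_pair(
    frames: Sequence[Image], medium_count: int
) -> Tuple[Image, Image]:
    """Return (medium-quality input, high-quality target) for one scene."""
    if not 0 < medium_count < len(frames):
        raise ValueError("medium_count must be between 1 and len(frames) - 1")
    medium = mean_image(frames[:medium_count])  # few frames: medium quality
    high = mean_image(frames)                   # all frames: assumed target
    return medium, high


scene_frames = [[[0.0]], [[2.0]], [[4.0]], [[6.0]]]
medium_input, high_target = build_training_pair(scene_frames, medium_count=2)
```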

出力部４０５は、高画質化部４０４が生成した高画質画像を表示部２０に表示させる。なお、出力部４０５は、高画質画像とともに、入力画像を表示部２０に表示させてもよく、この場合に、出力部４０５は、入力画像が複数の低画質画像から生成された画像であることを表示部２０に表示してもよい。 The output unit 405 causes the display unit 20 to display the high-quality image generated by the image quality improvement unit 404. The output unit 405 may cause the display unit 20 to display the input image together with the high-quality image, and in this case, the output unit 405 may display on the display unit 20 that the input image is an image generated from a plurality of low-quality images.

次に、図５を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ５２０～ステップＳ５５０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 5. Note that the processing steps S520 to S550 according to this embodiment are similar to those steps in the first embodiment, and therefore will not be described.

ステップＳ５１０においては、取得部４０１は、撮影装置１０や他の装置から、入力画像として中画質画像を取得する。なお、取得部４０１は撮影装置１０からの要求に応じて、撮影装置１０が生成した中画質画像を入力画像として取得してもよい。このような要求は、例えば、撮影装置１０が画像を生成した時、撮影装置１０が生成した画像を撮影装置１０が備える記憶装置に保存する前や保存した後、保存された画像を表示部２０に表示する時、画像解析処理に高画質画像を利用する時等に発行されてよい。 In step S510, the acquisition unit 401 acquires a medium quality image as an input image from the image capture device 10 or another device. The acquisition unit 401 may acquire a medium quality image generated by the image capture device 10 as an input image in response to a request from the image capture device 10. Such a request may be issued, for example, when the image capture device 10 generates an image, before or after the image generated by the image capture device 10 is saved in a storage device provided in the image capture device 10, when the saved image is displayed on the display unit 20, when a high quality image is used for image analysis processing, etc.

以降の処理は、第１の実施形態における処理と同様であるため、説明を省略する。 The subsequent processing is similar to that in the first embodiment, so a detailed explanation is omitted.

上記のように、本実施形態に係る取得部４０１は、被検者の所定部位の複数の画像を用いて生成された画像である中画質画像を入力画像として取得する。この場合、入力画像がより明瞭な画像となるため、高画質化エンジンは高画質画像をより精度良く生成することができる。なお、中画質画像を生成するために用いる低画質画像の枚数は、従来の重ね合わせ画像を生成するために用いられる画像の枚数より少なくてよい。 As described above, the acquisition unit 401 according to this embodiment acquires, as an input image, a medium quality image that is an image generated using multiple images of a specific part of the subject. In this case, since the input image is a clearer image, the high quality engine can generate a high quality image with higher accuracy. Note that the number of low quality images used to generate the medium quality image may be less than the number of images used to generate a conventional overlaid image.

なお、中画質画像は、複数の低画質画像を重ね合わせた画像に限られず、例えば、複数の低画質画像にＭＡＰ推定処理を適用した画像でもよいし、複数の低画質画像を合成した画像であってもよい。複数の低画質画像を合成する場合には、それぞれの画像の階調が異なっている画像同士を合成してもよい。 Note that the medium quality image is not limited to an image obtained by superimposing multiple low quality images, but may be, for example, an image obtained by applying a MAP estimation process to multiple low quality images, or an image obtained by synthesizing multiple low quality images. When synthesizing multiple low quality images, images with different gradations may be synthesized.

＜第１５の実施形態＞ <Fifteenth embodiment>
次に、図４及び５を参照して、第１５の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が第１の実施形態等に係る高画質化とともに入力画像の高画像サイズ化（高サイズ化）を行う。 Next, an image processing device according to a fifteenth embodiment will be described with reference to FIGS. 4 and 5. In this embodiment, the image quality improvement unit increases the image size of the input image (size increase) in addition to performing the image quality improvement according to the first embodiment and the like.

本実施形態に係る取得部４０１は、入力画像として低画像サイズの画像（低サイズ画像）を取得する。なお、低サイズ画像とは、後述する高画質化エンジンによって出力される高画像サイズの画像（高サイズ画像）よりも、画像を構成する画素数が少ない画像である。具体的には、例えば、高サイズ画像の画像サイズが幅１０２４画素、高さ１０２４画素、奥行き１０２４画素の場合に、低サイズ画像の画像サイズが５１２画素、高さ５１２画素、奥行き５１２画素である場合等である。これに関連して、本明細書における、高画像サイズ化とは、一画像あたりの画素数を増加させ、画像サイズを拡大する処理をいう。 The acquisition unit 401 according to this embodiment acquires an image of low image size (low-size image) as an input image. A low-size image is an image composed of fewer pixels than the image of high image size (high-size image) output by the image quality improvement engine described later. For example, when the high-size image is 1024 pixels wide, 1024 pixels high, and 1024 pixels deep, the low-size image may be 512 pixels wide, 512 pixels high, and 512 pixels deep. In this specification, "increasing the image size" refers to a process of increasing the number of pixels per image and thereby enlarging the image size.
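
To make the size example concrete, the short calculation below compares the number of pixels in the two volumes, taking the low-size image as 512 pixels along each of the three axes. The helper name `pixel_count` is only illustrative.

```python
def pixel_count(width: int, height: int, depth: int) -> int:
    """Total number of pixels (voxels) in a width x height x depth volume."""
    return width * height * depth


high_size = pixel_count(1024, 1024, 1024)  # 1,073,741,824 pixels
low_size = pixel_count(512, 512, 512)      # 134,217,728 pixels
size_ratio = high_size // low_size         # doubling each axis gives 2**3 = 8
```

In this example the engine therefore has to produce eight output pixels for every input pixel.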

本実施形態に係る高画質化部４０４には、第１の実施形態と同様に、高画質化エンジンが備えられている。ただし、本実施形態の高画質化エンジンは、入力画像についてノイズ低減やコントラスト強調するとともに、入力画像の画像サイズを高画像サイズ化するように構成されている。そのため、本実施形態の高画質化エンジンは、低サイズ画像を入力し、高サイズ画像を出力するように構成されている。 The image quality improvement unit 404 according to this embodiment is equipped with an image quality improvement engine, as in the first embodiment. However, the image quality improvement engine of this embodiment is configured to reduce noise and enhance contrast for an input image, and to increase the image size of the input image. Therefore, the image quality improvement engine of this embodiment is configured to input a low-size image and output a high-size image.

これに関連して、高画質化エンジンの教師データを構成するペア群について、各ペアを構成する入力データは低サイズ画像であり、出力データは高サイズ画像である。なお、出力データ用として用いる高サイズ画像は、低サイズ画像を取得した撮影装置よりも高性能な装置から取得したり、撮影装置の設定を変更することによって取得したりすることができる。また、高サイズ画像群が既にある場合には、当該高サイズ画像群を撮影装置１０からの取得が想定される画像の画像サイズに縮小することで、入力データとして用いる低サイズ画像群を取得してもよい。また、高サイズ画像については、第１の実施形態等と同様に低サイズ画像を重ね合わせたものが用いられる。 In relation to this, for the group of pairs that constitute the training data for the image quality improvement engine, the input data that constitutes each pair is a low-size image, and the output data is a high-size image. Note that the high-size images used for the output data can be obtained from a device with higher performance than the imaging device that obtained the low-size images, or can be obtained by changing the settings of the imaging device. Furthermore, if a group of high-size images already exists, the group of low-size images to be used as input data may be obtained by reducing the group of high-size images to the image size of the images expected to be obtained from the imaging device 10. Furthermore, for the high-size images, low-size images that are superimposed are used, as in the first embodiment, etc.
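
When a set of high-size images already exists, the low-size inputs can be derived by reduction, as the paragraph above describes. The sketch below uses 2x2 block averaging on a 2-D image; the reduction method and factor are assumptions, since the source only says the images are reduced to the size expected from the imaging device 10.

```python
from typing import List, Sequence, Tuple

Image = List[List[float]]


def downsample_2x(image: Image) -> Image:
    """Halve width and height by averaging each 2x2 block of pixels."""
    height, width = len(image), len(image[0])
    if height % 2 or width % 2:
        raise ValueError("image dimensions must be even")
    return [
        [
            (image[y][x] + image[y][x + 1]
             + image[y + 1][x] + image[y + 1][x + 1]) / 4.0
            for x in range(0, width, 2)
        ]
        for y in range(0, height, 2)
    ]


def make_training_pairs(
    high_size_images: Sequence[Image],
) -> List[Tuple[Image, Image]]:
    """Pair each reduced image (input data) with its original (output data)."""
    return [(downsample_2x(img), img) for img in high_size_images]


original = [[1.0, 3.0], [5.0, 7.0]]
pairs = make_training_pairs([original])
```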

なお、本実施形態に係る高画質化部４０４による入力画像の画像サイズの拡大については、教師データとして撮影装置１０よりも高性能な装置から取得したり、撮影装置１０の設定を変更したりすることで取得しているため、単純な画像の拡大とは異なる。具体的には、本実施形態に係る高画質化部４０４による入力画像の画像サイズの拡大処理は、単純に画像を拡大した場合と比べ、解像度の劣化を低減することができる。 Note that the image size enlargement of the input image by the image quality improvement unit 404 according to this embodiment is different from simple image enlargement, since it is obtained as teacher data from a device with higher performance than the image capture device 10 or by changing the settings of the image capture device 10. Specifically, the image size enlargement process of the input image by the image quality improvement unit 404 according to this embodiment can reduce degradation in resolution compared to when the image is simply enlarged.

このような構成により、本実施形態に係る高画質化部４０４は、入力画像に対して、ノイズ低減やコントラスト強調がなされるとともに高画像サイズ化された高画質画像を生成することができる。 With this configuration, the image quality improvement unit 404 according to this embodiment can generate, from the input image, a high-quality image in which noise has been reduced and contrast enhanced and whose image size has been increased.

次に、図５を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ５２０、ステップＳ５３０、及びステップＳ５５０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ５２０の処理の後に、ステップＳ５３０の処理を省き、処理をステップＳ５４０に移行してよい。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 5. Note that the processing steps S520, S530, and S550 according to this embodiment are similar to those in the first embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S520, the processing step S530 may be omitted and the processing may proceed to step S540.

ステップＳ５１０において、取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして、低サイズ画像を取得する。なお、取得部４０１は撮影装置１０からの要求に応じて、撮影装置１０が生成した低サイズ画像を入力画像として取得してもよい。このような要求は、例えば、撮影装置１０が画像を生成した時、撮影装置１０が生成した画像を撮影装置１０が備える記憶装置に保存する前や保存した後、保存された画像を表示部２０に表示する時、画像解析処理に高画質画像を利用する時等に発行されてよい。 In step S510, the acquisition unit 401 acquires a low-size image from the imaging device 10 or another device as input data to be processed. The acquisition unit 401 may acquire a low-size image generated by the imaging device 10 as an input image in response to a request from the imaging device 10. Such a request may be issued, for example, when the imaging device 10 generates an image, before or after the image generated by the imaging device 10 is saved in a storage device provided in the imaging device 10, when the saved image is displayed on the display unit 20, when a high-quality image is used for image analysis processing, etc.

ステップＳ５２０及びステップＳ５３０の処理は第１の実施形態での処理と同様であるため説明を省略する。ステップＳ５４０では、高画質化部４０４が、入力画像を高画質化エンジンに入力し、高画質画像としてノイズ低減やコントラスト強調がなされるとともに高画像サイズ化された画像を生成する。以降の処理は、第１の実施形態と同様であるため説明を省略する。 The processing of steps S520 and S530 is the same as that in the first embodiment, and therefore a description thereof will be omitted. In step S540, the image quality improvement unit 404 inputs the input image to an image quality improvement engine, and generates an image that has been subjected to noise reduction and contrast enhancement as a high-quality image and has a large image size. The processing thereafter is the same as that in the first embodiment, and therefore a description thereof will be omitted.

上記のように、本実施形態に係る高画質化部４０４は、入力画像と比べてノイズ低減及びコントラスト強調のうちの少なくとも一つがなされるとともに、画像サイズの拡大がなされた高画質画像を生成する。これにより、本実施形態に係る画像処理装置４００は、画像診断に適した高画質画像を、撮影者や被検者の侵襲性を高めたり、労力を増したりすることなく出力することができる。 As described above, the image quality improvement unit 404 according to this embodiment generates a high-quality image in which at least one of noise reduction and contrast enhancement has been performed relative to the input image and in which the image size has been enlarged. This allows the image processing device 400 according to this embodiment to output a high-quality image suitable for image diagnosis without making the examination more invasive for the subject or more laborious for the photographer or the subject.

なお、本実施形態では、一つの高画質化エンジンにより、第１の実施形態等による高画質化処理と高分解能化の処理を行った高画質画像を生成したが、これらの処理を行う構成はこれに限られない。例えば、高画質化部は、第１の実施形態等による高画質化処理を行う高画質化エンジン及び高画像サイズ化処理を行う別の高画質化エンジンを備えてもよい。 In this embodiment, a high-quality image is generated by performing high-quality processing and high-resolution processing according to the first embodiment, etc., using one high-quality engine, but the configuration for performing these processes is not limited to this. For example, the image quality improvement unit may include an image quality improvement engine that performs image quality processing according to the first embodiment, etc., and another image quality improvement engine that performs high image size processing.

この場合には、第１の実施形態等に係る高画質化処理を行う高画質化エンジンは第１の実施形態等に係る高画質化エンジンと同様に学習を行った機械学習モデルを用いることができる。また、高画像サイズ化処理を行う高画質化エンジンの教師データの入力データとしては、第１の実施形態等に係る高画質化エンジンが生成した高画質画像を用いる。また、当該高画質化エンジンの教師データの出力データとしては、高性能な撮影装置で取得された画像について第１の実施形態等に係る高画質化エンジンが生成した高画質画像を用いる。これにより、高画像サイズ化処理を行う高画質化エンジンは、第１の実施形態等に係る高画質化処理を行った高画質画像について高画像サイズ化した最終的な高画質画像を生成することができる。 In this case, the image quality improvement engine that performs the image quality improvement process according to the first embodiment, etc. can use a machine learning model that has been trained in the same way as the image quality improvement engine according to the first embodiment, etc. Furthermore, as input data for the teacher data of the image quality improvement engine that performs the high image size process, the high image quality image generated by the image quality improvement engine according to the first embodiment, etc. is used. Furthermore, as output data for the teacher data of the image quality improvement engine, the high image quality image generated by the image quality improvement engine according to the first embodiment, etc. for an image acquired by a high-performance imaging device is used. This allows the image quality improvement engine that performs the high image size process to generate a final high image quality image by increasing the image size of the high image quality image that has been subjected to the image quality improvement process according to the first embodiment, etc.

また、当該高画質化エンジンによる高画像サイズ化処理を、第１の実施形態等に係る高画化処理エンジンによる高画質化処理の前に行ってもよい。この場合には、高画像サイズ化処理を行う高画質化エンジンについての教師データは、撮影装置で取得した低サイズ画像である入力データと高サイズ画像である出力データのペア群により構成する。また、第１の実施形態等に係る高画質化処理を行う高画質化エンジンの教師データとしては、高サイズ画像を入力データと、高サイズ画像を重ね合わせした画像を出力データのペア群により構成する。 The high image size processing by that image quality improvement engine may instead be performed before the image quality improvement processing by the image quality improvement engine according to the first embodiment, etc. In this case, the training data for the image quality improvement engine that performs the high image size processing consists of a group of pairs whose input data is a low-size image acquired by the imaging device and whose output data is a high-size image. The training data for the image quality improvement engine that performs the image quality improvement processing according to the first embodiment, etc. consists of a group of pairs whose input data is a high-size image and whose output data is an image obtained by superimposing high-size images.

このような構成によっても、画像処理装置４００は、入力画像と比べてノイズ低減及びコントラスト強調のうちの少なくとも一つがなされるとともに、画像サイズの拡大がなされた画像を高画質画像として生成することができる。 Even with this configuration, the image processing device 400 can generate a high-quality image in which at least one of noise reduction and contrast enhancement has been performed compared to the input image, and the image size has been enlarged.

なお、本実施形態では、第１の実施形態等に係る高画質化処理について、重ね合わせ画像を教師データの出力データとして用いる構成について述べたが、第１の実施形態と同様に当該出力データはこれに限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 In this embodiment, the high image quality processing according to the first embodiment and the like has been described in terms of a configuration in which the superimposed image is used as output data of the teacher data, but as with the first embodiment, the output data is not limited to this. In other words, a high image quality image obtained by performing at least one of a group of processes or imaging methods, such as superimposition processing, MAP estimation processing, smoothing filter processing, tone conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing, may be used.

なお、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置４００に接続される他の装置に出力してもよい。 Note that, as in the first embodiment, the output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 400.

＜第１６の実施形態＞ <Sixteenth embodiment>
次に、図４及び５を参照して、第１６の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が第１の実施形態等に係る高画質化とともに高空間分解能化を行う。 Next, an image processing device according to the sixteenth embodiment will be described with reference to Figures 4 and 5. In this embodiment, the image quality improvement unit increases the spatial resolution in addition to performing the image quality improvement according to the first embodiment, etc.

本実施形態に係る取得部４０１は、入力画像として低空間分解能画像を取得する。なお、低空間分解能画像とは、高画質化部４０４が出力する高空間分解能画像よりも、空間分解能が低い画像である。 The acquisition unit 401 according to this embodiment acquires a low spatial resolution image as an input image. Note that a low spatial resolution image is an image with a lower spatial resolution than the high spatial resolution image output by the image quality improvement unit 404.

高画質化部４０４には、第１の実施形態と同様に、高画質化エンジンが備えられている。ただし、本実施形態の高画質化エンジンは、入力画像についてノイズ低減やコントラスト強調するとともに、入力画像の空間分解能を高空間分解能化するように構成されている。そのため、本実施形態に係る高画質化エンジンは、低空間分解能画像を入力し、高空間分解能画像を出力するように構成されている。 The image quality improvement unit 404 is equipped with an image quality improvement engine, as in the first embodiment. However, the image quality improvement engine of this embodiment is configured to reduce noise and enhance contrast of the input image, as well as to increase the spatial resolution of the input image. Therefore, the image quality improvement engine of this embodiment is configured to input a low spatial resolution image and output a high spatial resolution image.

これに関連して、高画質化エンジンの教師データを構成するペア群についても、各ペアを構成する入力データは低空間分解能画像であり、出力データは高空間分解能画像である。なお、高空間分解能画像は、低空間分解能画像を取得した撮影装置よりも高性能な装置から取得したり、撮影装置の設定を変更することによって取得したりすることができる。また、高空間分解能画像については、第１の実施形態等と同様に低空間分解能画像を重ね合わせたものが用いられる。 Accordingly, in the group of pairs that make up the training data for the image quality improvement engine, the input data of each pair is a low spatial resolution image and the output data is a high spatial resolution image. The high spatial resolution image can be acquired with a device of higher performance than the imaging device that acquired the low spatial resolution image, or by changing the settings of the imaging device. As in the first embodiment, etc., an image obtained by superimposing low spatial resolution images is used for the high spatial resolution image.

このような構成により、本実施形態に係る高画質化部４０４は、入力画像に対して、ノイズ低減やコントラスト強調がなされるとともに高空間分解能化された高画質画像を生成することができる。 With this configuration, the image quality improvement unit 404 according to this embodiment can generate, from the input image, a high-quality image to which noise reduction, contrast enhancement, or the like has been applied and whose spatial resolution has been increased.

ステップＳ５１０において、取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして、低空間分解能画像を取得する。なお、取得部４０１は撮影装置１０からの要求に応じて、撮影装置１０が生成した低空間分解能画像を入力画像として取得してもよい。このような要求は、例えば、撮影装置１０が画像を生成した時、撮影装置１０が生成した画像を撮影装置１０が備える記憶装置に保存する前や保存した後、保存された画像を表示部２０に表示する時、画像解析処理に高画質画像を利用する時等に発行されてよい。 In step S510, the acquisition unit 401 acquires a low spatial resolution image from the imaging device 10 or another device as input data to be processed. The acquisition unit 401 may acquire a low spatial resolution image generated by the imaging device 10 as an input image in response to a request from the imaging device 10. Such a request may be issued, for example, when the imaging device 10 generates an image, before or after the image generated by the imaging device 10 is saved in a storage device provided in the imaging device 10, when the saved image is displayed on the display unit 20, when a high-quality image is used for image analysis processing, etc.

ステップＳ５２０及びステップＳ５３０の処理は第１の実施形態での処理と同様であるため説明を省略する。ステップＳ５４０では、高画質化部４０４が、入力画像を高画質化エンジンに入力し、高画質画像としてノイズ低減やコントラスト強調がなされるとともに高空間分解能化された画像を生成する。以降の処理は、第１の実施形態と同様であるため説明を省略する。 The processing of steps S520 and S530 is the same as in the first embodiment, so its description is omitted. In step S540, the image quality improvement unit 404 inputs the input image to the image quality improvement engine and generates, as the high-quality image, an image to which noise reduction, contrast enhancement, or the like has been applied and whose spatial resolution has been increased. The subsequent processing is the same as in the first embodiment, so its description is omitted.

上記のように、本実施形態に係る高画質化部４０４は、入力画像と比べてノイズ低減及びコントラスト強調のうちの少なくとも一つがなされるとともに、空間分解能が向上された画像を高画質画像として生成する。これにより、本実施形態に係る画像処理装置４００は、画像診断に適した高画質画像を、撮影者や被検者の侵襲性を高めたり、労力を増したりすることなく出力することができる。 As described above, the image quality improvement unit 404 according to this embodiment generates, as the high-quality image, an image in which at least one of noise reduction and contrast enhancement has been performed compared to the input image and whose spatial resolution has been improved. This allows the image processing device 400 according to this embodiment to output a high-quality image suitable for image diagnosis without increasing the invasiveness for the subject or the workload of the operator and the subject.

なお、本実施形態では、一つの高画質化エンジンにより、第１の実施形態等による高画質化処理と高分解能化の処理を行った高画質画像を生成したが、これらの処理を行う構成はこれに限られない。例えば、高画質化部は、第１の実施形態等による高画質化処理を行う高画質化エンジン及び高分解能化処理を行う別の高画質化エンジンを備えてもよい。 In this embodiment, a high-quality image is generated by performing high-quality image processing and high-resolution processing according to the first embodiment, etc., using one high-quality image engine, but the configuration for performing these processes is not limited to this. For example, the image quality improvement unit may include an image quality improvement engine that performs high-quality image processing according to the first embodiment, etc., and another image quality improvement engine that performs high-resolution processing.

この場合には、第１の実施形態等に係る高画質化処理を行う高画質化エンジンは第１の実施形態等に係る高画質化エンジンと同様に学習を行った機械学習モデルを用いることができる。また、高分解能化処理を行う高画質化エンジンの教師データの入力データとしては、第１の実施形態等に係る高画質化エンジンが生成した高画質画像を用いる。また、当該高画質化エンジンの教師データの出力データとしては、高性能な撮影装置で取得された画像について第１の実施形態等に係る高画質化エンジンが生成した高画質画像を用いる。これにより、高空間分解能化処理を行う高画質化エンジンは、第１の実施形態等に係る高画質化処理を行った高画質画像について高空間分解能化した最終的な高画質画像を生成することができる。 In this case, the image quality improvement engine that performs the image quality improvement process according to the first embodiment, etc. can use a machine learning model that has been trained in the same way as the image quality improvement engine according to the first embodiment, etc. Furthermore, as input data of the teacher data of the image quality improvement engine that performs the high resolution process, the high quality image generated by the image quality improvement engine according to the first embodiment, etc. is used. Furthermore, as output data of the teacher data of the image quality improvement engine, the high quality image generated by the image quality improvement engine according to the first embodiment, etc. for an image acquired by a high performance imaging device is used. In this way, the image quality improvement engine that performs the high spatial resolution process can generate a final high quality image with high spatial resolution for the high quality image that has been subjected to the image quality improvement process according to the first embodiment, etc.

また、当該高画質化エンジンによる高空間分解能化処理を、第１の実施形態等に係る高画化処理エンジンによる高画質化処理の前に行ってもよい。この場合には、高空間分解能化処理を行う高画質化エンジンについての教師データは、撮影装置で取得した低空間分解能画像である入力データと高空間分解能画像である出力データのペア群により構成する。また、第１の実施形態等に係る高画質化処理を行う高画質化エンジンの教師データとしては、高空間分解能画像を入力データと、高空間分解能画像を重ね合わせした画像を出力データのペア群により構成する。 The high spatial resolution processing by the image quality improvement engine may be performed before the high image quality processing by the image quality improvement processing engine according to the first embodiment, etc. In this case, the training data for the image quality improvement engine performing the high spatial resolution processing is configured as a group of pairs of input data, which is a low spatial resolution image acquired by the imaging device, and output data, which is a high spatial resolution image. The training data for the image quality improvement engine performing the image quality improvement processing according to the first embodiment, etc. is configured as a group of pairs of input data, which is a high spatial resolution image, and output data, which is an image in which the high spatial resolution image is superimposed.

このような構成によっても、画像処理装置４００は、入力画像と比べてノイズ低減及びコントラスト強調のうちの少なくとも一つがなされるとともに、空間分解能が向上された画像を高画質画像として生成することができる。 Even with this configuration, the image processing device 400 can generate a high-quality image in which at least one of noise reduction and contrast enhancement is performed compared to the input image, and the spatial resolution is improved.

また、高画質化部４０４は、高画質化エンジンを用いて、高空間分解能化処理に加えて第１５の実施形態に係る高画質化処理を行ってもよい。この場合には、入力画像と比べてノイズ低減及びコントラスト強調のうちの少なくとも一つがなされるとともに、入力画像と比べて高画像サイズ化及び高空間分解能化された画像を高画質画像として生成することができる。これにより、本実施形態に係る画像処理装置４００は、画像診断に適した高画質画像を、撮影者や被検者の侵襲性を高めたり、労力を増したりすることなく出力することができる。 The image quality improvement unit 404 may also use the image quality improvement engine to perform the image quality improvement processing according to the fifteenth embodiment in addition to the high spatial resolution processing. In this case, an image in which at least one of noise reduction and contrast enhancement has been performed compared to the input image, and which has a larger image size and a higher spatial resolution than the input image, can be generated as the high-quality image. This allows the image processing device 400 according to this embodiment to output a high-quality image suitable for image diagnosis without increasing the invasiveness for the subject or the workload of the operator and the subject.

＜第１７の実施形態＞ <Seventeenth embodiment>
次に、図２２及び２３を参照して、第１７の実施形態に係る画像処理装置について説明する。本実施形態では、解析部が高画質化部によって生成された高画質画像を画像解析する。 Next, an image processing device according to the seventeenth embodiment will be described with reference to Figures 22 and 23. In this embodiment, an analysis unit performs image analysis on the high-quality image generated by the image quality improvement unit.

図２２は、本実施形態に係る画像処理装置２２００の概略的な構成を示す。本実施形態に係る画像処理装置２２００には、取得部４０１、撮影条件取得部４０２、高画質化可否判定部４０３、高画質化部４０４、及び出力部４０５に加えて、解析部２２０８が設けられている。なお、画像処理装置２２００は、これら構成要素のうちの一部が設けられた複数の装置で構成されてもよい。ここで、取得部４０１、撮影条件取得部４０２、高画質化可否判定部４０３、高画質化部４０４、及び出力部４０５は、第１の実施形態に係る画像処理装置の構成と同様であるため、図４に示す構成について同一の参照符号を用いて示し、説明を省略する。 Figure 22 shows a schematic configuration of an image processing device 2200 according to this embodiment. In addition to the acquisition unit 401, the shooting condition acquisition unit 402, the image quality improvement possibility determination unit 403, the image quality improvement unit 404, and the output unit 405, the image processing device 2200 according to this embodiment is provided with an analysis unit 2208. Note that the image processing device 2200 may be composed of multiple devices provided with some of these components. Here, the acquisition unit 401, the shooting condition acquisition unit 402, the image quality improvement possibility determination unit 403, the image quality improvement unit 404, and the output unit 405 are the same as the configuration of the image processing device according to the first embodiment, so the configuration shown in Figure 4 is indicated using the same reference numerals and description will be omitted.

解析部２２０８は、高画質化部４０４が生成した高画質画像に対して所定の画像解析処理を適用する。画像解析処理は、例えば、眼科分野では、ＯＣＴにより取得された画像に対する、網膜層のセグメンテーション、層厚計測、乳頭三次元形状解析、篩状板解析、ＯＣＴＡ画像の血管密度計測、及び角膜形状解析等の既存の任意の画像解析処理を含む。また、画像解析処理は眼科分野の解析処理に限られず、例えば、拡散テンソル解析やＶＢＬ（Ｖｏｘｅｌ－ｂａｓｅｄＭｏｒｐｈｏｍｅｔｒｙ）解析等の放射線分野における既存の任意の解析処理も含む。 The analysis unit 2208 applies a predetermined image analysis process to the high-quality image generated by the image quality improvement unit 404. In the field of ophthalmology, for example, the image analysis process includes any existing image analysis process for images acquired by OCT, such as retinal layer segmentation, layer thickness measurement, optic disc three-dimensional shape analysis, lamina cribrosa analysis, vessel density measurement of OCTA images, and corneal shape analysis. The image analysis process is not limited to the field of ophthalmology and also includes any existing analysis process in the field of radiology, such as diffusion tensor analysis and voxel-based morphometry (VBM) analysis.

出力部４０５は、高画質化部４０４によって生成された高画質画像を表示部２０に表示させるとともに、解析部２２０８による画像解析処理の解析結果を表示させることができる。なお、出力部４０５は解析部２２０８による画像解析結果のみを表示部２０に表示させてもよいし、当該画像解析結果を撮影装置１０や画像管理システム、その他の装置等に出力してもよい。なお、解析結果の表示形態は、解析部２２０８で行った画像解析処理に応じて任意であってよく、例えば、画像、数値又は文字として表示されてもよい。また、解析結果の表示形態は、高画質画像を解析処理して得た解析結果を、任意の透明度により高画質画像に重畳表示させたものであってもよい。すなわち、解析結果の表示形態は、高画質画像を解析処理して得た解析結果と高画質画像とを任意の透明度によりブレンド処理して得た画像（例えば、２次元マップ）であっても良い。 The output unit 405 can display the high-quality image generated by the image quality improvement unit 404 on the display unit 20, and can also display the analysis results of the image analysis process by the analysis unit 2208. The output unit 405 can display only the image analysis results by the analysis unit 2208 on the display unit 20, or can output the image analysis results to the imaging device 10, an image management system, or other devices. The display form of the analysis results can be any form depending on the image analysis process performed by the analysis unit 2208, and can be displayed as, for example, an image, a numerical value, or a character. The display form of the analysis results can be a form in which the analysis results obtained by analyzing the high-quality image are superimposed on the high-quality image with any transparency. In other words, the display form of the analysis results can be an image (for example, a two-dimensional map) obtained by blending the analysis results obtained by analyzing the high-quality image and the high-quality image with any transparency.
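The blending of an analysis result with the high-quality image at an arbitrary transparency can be illustrated with a minimal sketch. It assumes both images are same-sized 2D lists of grayscale values; the function name `blend` and the `alpha` parameter are hypothetical and do not appear in the specification.

```python
def blend(image, overlay, alpha):
    """Blend an analysis map over an image.

    alpha is the opacity of the overlay in [0, 1] (assumed convention):
    0 shows only the image, 1 shows only the analysis map.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return [
        [(1.0 - alpha) * p + alpha * q for p, q in zip(img_row, ovl_row)]
        for img_row, ovl_row in zip(image, overlay)
    ]


# Illustrative values only: a 2x2 high-quality image and a 2x2 analysis map.
high_quality = [[100.0, 200.0], [50.0, 0.0]]
analysis_map = [[0.0, 255.0], [255.0, 0.0]]
blended = blend(high_quality, analysis_map, 0.25)
```

A color-coded two-dimensional map would apply the same per-pixel weighting to each color channel.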

以下、図２３を参照して、本実施形態に係る一連の画像処理について、ＯＣＴＡのＥｎ－Ｆａｃｅ画像を例として説明する。図２３は、本実施形態に係る一連の画像処理のフロー図である。なお、本実施形態に係るステップＳ２３１０～ステップＳ２３４０の処理は、第１の実施形態におけるステップＳ５１０～ステップＳ５４０での処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ２３２０の処理の後に、ステップＳ２３３０の処理を省き、処理をステップＳ２３４０に移行してよい。 The series of image processing steps according to this embodiment will be described below with reference to FIG. 23, using an OCTA En-Face image as an example. FIG. 23 is a flow diagram of a series of image processing steps according to this embodiment. Note that the processing steps S2310 to S2340 according to this embodiment are similar to the processing steps S510 to S540 in the first embodiment, and therefore will not be described here. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S2320, the processing step S2330 may be omitted, and the processing may proceed to step S2340.

ステップＳ２３４０において、高画質化部４０４はＯＣＴＡのＥｎ－Ｆａｃｅ画像の高画質化を行い、処理はステップＳ２３５０に移行する。ステップＳ２３５０で、解析部２２０８が、ステップＳ２３４０において生成された高画質画像を画像解析する。高画質化したＯＣＴＡのＥｎ－Ｆａｃｅ画像における画像解析としては、任意の２値化処理を適用することで、画像から血管相当の箇所（血管領域）を検出することが出来る。検出した血管相当の箇所が画像に対して占める割合を求めることで面積密度を解析することが出来る。また、２値化処理した血管相当の箇所を細線化することで、線幅１画素の画像とし、太さに依存しない血管が占める割合（スケルトン密度ともいう）を求めることも出来る。これらの画像を用いて、無血管領域（ＦＡＺ）の面積や形状（円形度など）を解析するようにしてもよい。解析の方法として、画像全体から上述した数値を計算するようにしてもよいし、不図示のユーザーインターフェースを用いて、検者（ユーザー）の指示に基づいて、指定された関心領域（ＲＯＩ）に対して数値を計算するようにしてもよい。ＲＯＩの設定は必ずしも検者に指定されるだけではなく、自動的に所定の領域が指定されるものであってもよい。ここで、上述した各種パラメータは、血管に関する解析結果の一例であって、血管に関するパラメータであれば、何でも良い。なお、解析部２２０８は複数の画像解析処理を行ってもよい。すなわち、ここではＯＣＴＡのＥｎ－Ｆａｃｅ画像に関して解析する例を示したが、これだけではなく、同時にＯＣＴにより取得された画像に対する、網膜層のセグメンテーション、層厚計測、乳頭三次元形状解析、篩状板解析などを行ってもよい。これに関連して、解析部２２０８は、任意の入力装置を介した検者からの指示に応じて、複数の画像解析処理のうちの一部又は全部を行ってもよい。 In step S2340, the image quality improvement unit 404 improves the image quality of the OCTA En-Face image, and the process proceeds to step S2350. In step S2350, the analysis unit 2208 performs image analysis on the high-image-quality image generated in step S2340. As an image analysis of the high-image-quality OCTA En-Face image, any binarization process can be applied to detect blood vessel-equivalent areas (blood vessel regions) from the image. The area density can be analyzed by determining the proportion of the image occupied by the detected blood vessel-equivalent areas. In addition, by thinning the binarized blood vessel-equivalent areas to an image with a line width of 1 pixel, the proportion of blood vessels that do not depend on the thickness (also called skeleton density) can be determined. These images may be used to analyze the area and shape (circularity, etc.) of the avascular zone (FAZ). As a method of analysis, the above-mentioned numerical values may be calculated from the entire image, or a numerical value may be calculated for a specified region of interest (ROI) based on the instructions of the examiner (user) using a user interface (not shown). 
The ROI need not be specified by the examiner; a predetermined region may be specified automatically. The parameters described above are examples of analysis results relating to blood vessels, and any vessel-related parameter may be used. The analysis unit 2208 may perform multiple image analysis processes. That is, although an example of analyzing an OCTA En-Face image is shown here, retinal layer segmentation, layer thickness measurement, optic disc three-dimensional shape analysis, lamina cribrosa analysis, and the like may be performed at the same time on images acquired by OCT. In this connection, the analysis unit 2208 may perform some or all of the multiple image analysis processes in response to instructions from the examiner via an arbitrary input device.
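The vessel area density and ROI handling described for step S2350 can be sketched as follows. This is a minimal illustration, not the analysis unit's actual implementation: the fixed threshold, the `(top, left, height, width)` ROI convention, and the function names are all assumptions. Circularity is computed with the common definition 4πA/P², which the specification does not spell out. Thinning is not shown; skeleton density would be the same ratio computed on a mask thinned to one-pixel width.

```python
import math


def binarize(image, threshold):
    """Mark pixels at or above the threshold as vessel (1), others as 0."""
    return [[1 if v >= threshold else 0 for v in row] for row in image]


def area_density(mask, roi=None):
    """Fraction of vessel pixels in the whole mask or in a rectangular ROI.

    roi is (top, left, height, width); this convention is hypothetical.
    """
    if roi is None:
        top, left, height, width = 0, 0, len(mask), len(mask[0])
    else:
        top, left, height, width = roi
    rows = [row[left:left + width] for row in mask[top:top + height]]
    total = sum(len(r) for r in rows)
    if total == 0:
        raise ValueError("ROI contains no pixels")
    return sum(sum(r) for r in rows) / total


def circularity(area, perimeter):
    """4*pi*A / P^2: equals 1 for a circle and decreases for irregular shapes."""
    return 4.0 * math.pi * area / (perimeter ** 2)


# Illustrative 4x4 En-Face patch; bright pixels stand in for vessels.
enface = [
    [10, 200, 10, 10],
    [10, 200, 200, 10],
    [10, 10, 200, 10],
    [10, 10, 10, 10],
]
mask = binarize(enface, 128)
whole_image_density = area_density(mask)
roi_density = area_density(mask, roi=(0, 1, 2, 2))
```

Passing `roi=None` corresponds to computing the value from the entire image, while a rectangle corresponds to a region designated by the examiner or set automatically.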

ステップＳ２３６０では、出力部４０５が、高画質化部４０４によって生成された高画質画像及び解析部２２０８による解析結果を表示部２０に表示させる。なお、出力部４０５は高画質画像及び解析結果を別々の表示部や装置に出力してもよい。また、出力部４０５は、解析結果のみを表示部２０に表示させてもよい。さらに、解析部２２０８が複数の解析結果を出力する場合には、出力部４０５は、複数の解析結果の一部又は全部を表示部２０やその他の装置に出力してもよい。例えば、ＯＣＴＡのＥｎ－Ｆａｃｅ画像における血管に関する解析結果を２次元マップとして表示部２０に表示させてもよい。また、ＯＣＴＡのＥｎ－Ｆａｃｅ画像における血管に関する解析結果を示す値をＯＣＴＡのＥｎ－Ｆａｃｅ画像に重畳して表示部２０に表示させてもよい。 In step S2360, the output unit 405 displays the high-quality image generated by the image quality improvement unit 404 and the analysis results by the analysis unit 2208 on the display unit 20. The output unit 405 may output the high-quality image and the analysis results to separate display units or devices. The output unit 405 may also display only the analysis results on the display unit 20. Furthermore, when the analysis unit 2208 outputs multiple analysis results, the output unit 405 may output some or all of the multiple analysis results to the display unit 20 or another device. For example, the analysis results regarding blood vessels in the OCTA En-Face image may be displayed on the display unit 20 as a two-dimensional map. Furthermore, values indicating the analysis results regarding blood vessels in the OCTA En-Face image may be superimposed on the OCTA En-Face image and displayed on the display unit 20.

上記のように、本実施形態に係る画像処理装置２２００は、高画質画像を画像解析する解析部２２０８を更に備え、出力部４０５は解析部２２０８による解析結果を表示部２０に表示させる。このように、本実施形態に係る画像処理装置２２００では、画像解析に高画質画像を用いるため、解析の精度を向上させることができる。 As described above, the image processing device 2200 according to this embodiment further includes an analysis unit 2208 that performs image analysis on the high-quality image, and the output unit 405 causes the display unit 20 to display the analysis results by the analysis unit 2208. In this way, the image processing device 2200 according to this embodiment uses high-quality images for image analysis, thereby improving the accuracy of the analysis.

また、出力部４０５は、第１の実施形態と同様に、生成された高画質画像を撮影装置１０や画像処理装置２２００に接続される他の装置に出力してもよい。また、高画質化エンジンの教師データの出力データは、第１の実施形態と同様に、重ね合わせ処理を行った高画質画像に限られない。すなわち、重ね合わせ処理やＭＡＰ推定処理、平滑化フィルタ処理、階調変換処理、高性能な撮影装置を用いた撮影、高コストな処理、ノイズ低減処理といった処理群や撮影方法のうち、少なくとも一つを行うことによって得られた高画質画像を用いてもよい。 In addition, the output unit 405 may output the generated high-quality image to the imaging device 10 or another device connected to the image processing device 2200, as in the first embodiment. In addition, the output data of the teacher data of the image quality improvement engine is not limited to a high-quality image that has been subjected to overlay processing, as in the first embodiment. In other words, a high-quality image obtained by performing at least one of a group of processes or imaging methods such as overlay processing, MAP estimation processing, smoothing filter processing, gradation conversion processing, imaging using a high-performance imaging device, high-cost processing, and noise reduction processing may be used.

＜第１８の実施形態＞ <Eighteenth embodiment>
次に、図４を参照して、第１８の実施形態に係る画像処理装置について説明する。本実施形態では、学習時の画像にノイズを付加しノイズ成分を学習することで高画質化部が高画質画像を生成する例について説明をする。 Next, an image processing device according to the eighteenth embodiment will be described with reference to Figure 4. In this embodiment, an example will be described in which the image quality improvement unit generates a high-quality image by adding noise to the images used during training so that the noise component is learned.

本実施形態に係る取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして画像を取得する。本実施形態に係る高画質化部におけるＣＮＮの構成例として、図２４を用いて説明をする。図２４は、高画質化部４０４における機械学習モデル構成の一例を示している。図２４で示す構成は、入力値群を加工して出力する処理を担う、複数の層群によって構成される。なお、前記構成に含まれる層の種類としては、図２４に示すように、畳み込み（Ｃｏｎｖｏｌｕｔｉｏｎ）層、ダウンサンプリング（Ｄｏｗｎｓａｍｐｌｉｎｇ）層、アップサンプリング（Ｕｐｓａｍｐｌｉｎｇ）層、合成（Ｍｅｒｇｅｒ）層がある。畳み込み層は、設定されたフィルタのカーネルサイズ、フィルタの数、ストライドの値、ダイレーションの値等のパラメータに従い、入力値群に対して畳み込み処理を行う層である。なお、入力される画像の次元数に応じて、前記フィルタのカーネルサイズの次元数も変更してもよい。ダウンサンプリング層は、入力値群を間引いたり、合成したりすることによって、出力値群の数を入力値群の数よりも少なくする処理である。具体的には、例えば、ＭａｘＰｏｏｌｉｎｇ処理がある。アップサンプリング層は、入力値群を複製したり、入力値群から補間した値を追加したりすることによって、出力値群の数を入力値群の数よりも多くする処理である。具体的には、例えば、線形補間処理がある。合成層は、ある層の出力値群や画像を構成する画素値群といった値群を、複数のソースから入力し、それらを連結したり、加算したりして合成する処理を行う層である。このような構成では、入力された画像Ｉｍ２４１０を構成する画素値群が畳み込み処理ブロックを経て出力された値群と、入力された画像Ｉｍ２４１０を構成する画素値群が、合成層で合成される。その後、合成された画素値群は最後の畳み込み層で高画質画像Ｉｍ２４２０に成形される。なお、図示はしないが、ＣＮＮの構成の変更例として、例えば、畳み込み層の後にバッチ正規化（ＢａｔｃｈＮｏｒｍａｌｉｚａｔｉｏｎ）層や、正規化線形関数（ＲｅｃｔｉｆｉｅｒＬｉｎｅａｒＵｎｉｔ）を用いた活性化層を組み込む等をしても良い。 The acquisition unit 401 according to this embodiment acquires an image as input data to be processed from the image capture device 10 or another device. An example of the configuration of a CNN in the image quality improvement unit according to this embodiment will be described with reference to FIG. 24. FIG. 24 shows an example of a machine learning model configuration in the image quality improvement unit 404. The configuration shown in FIG. 24 is composed of a plurality of layers that process and output a group of input values. As shown in FIG. 24, the types of layers included in the configuration include a convolution layer, a downsampling layer, an upsampling layer, and a merger layer. The convolution layer is a layer that performs convolution processing on a group of input values according to parameters such as the kernel size of the set filter, the number of filters, the stride value, and the dilation value. The number of dimensions of the kernel size of the filter may also be changed according to the number of dimensions of the input image. 
The downsampling layer performs processing that makes the number of output values smaller than the number of input values by thinning out or combining the input values; Max Pooling is one specific example. The upsampling layer performs processing that makes the number of output values larger than the number of input values by duplicating input values or adding values interpolated from them; linear interpolation is one specific example. The merger layer takes groups of values, such as the output values of a layer or the pixel values of an image, from multiple sources and combines them by concatenation or addition. In this configuration, the values output after the pixel values of the input image Im2410 pass through the convolution processing block are combined in the merger layer with the pixel values of the input image Im2410 themselves. The combined values are then formed into the high-quality image Im2420 by the final convolution layer. Although not shown, the CNN configuration may be modified, for example, by incorporating a batch normalization layer or an activation layer using a rectified linear unit (ReLU) after a convolution layer.
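The layer types described above can be illustrated with a toy forward pass in plain Python. This is a sketch under stated assumptions, not the network of Figure 24: the layer count, the zero padding, the 2x2 pooling window, upsampling by duplication, merging by addition, and every function name are hypothetical choices made only to show how the pieces fit together.

```python
def conv2d_same(image, kernel):
    """Stride-1 convolution (cross-correlation) with zero padding."""
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for i in range(kh):
                for j in range(kw):
                    yy, xx = y + i - ph, x + j - pw
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += kernel[i][j] * image[yy][xx]
            out[y][x] = acc
    return out


def max_pool_2x2(image):
    """Downsampling: keep the maximum of each non-overlapping 2x2 block."""
    return [
        [max(image[y][x], image[y][x + 1], image[y + 1][x], image[y + 1][x + 1])
         for x in range(0, len(image[0]) - 1, 2)]
        for y in range(0, len(image) - 1, 2)
    ]


def upsample_2x(image):
    """Upsampling by duplicating each value into a 2x2 block."""
    out = []
    for row in image:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def merge_add(a, b):
    """Merger layer: combine two same-sized value groups by addition."""
    return [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]


def forward(image, block_kernel, final_kernel):
    """Convolution block output merged with the input, then a final convolution."""
    features = upsample_2x(max_pool_2x2(conv2d_same(image, block_kernel)))
    merged = merge_add(features, image)
    return conv2d_same(merged, final_kernel)


# Identity kernels make the data flow easy to follow by hand.
identity = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
im_in = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
im_out = forward(im_in, identity, identity)
```

Merging by concatenation instead of addition would stack the two value groups as separate channels, which requires the final convolution to accept a multi-channel input.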

本実施形態の高画質化エンジンは、撮影装置１０や他の装置から得た画像に第一のノイズ成分を付加した低画質画像を入力し、出力データとしては、撮影装置１０や他の装置から得た画像に第二のノイズ成分を付加した画像を高画質画像としてトレーニングしている。すなわち、本実施形態の学習時の教師画像は、低画質画像と高画質画像とが共通の画像を用いており、それぞれの画像におけるノイズ成分が異なるものとなる。画像としては同じものを用いているため、ペア画像とする際の位置合わせは不要である。 The image quality improvement engine of this embodiment is trained with, as input data, a low-quality image obtained by adding a first noise component to an image obtained from the imaging device 10 or another device and, as output data, a high-quality image obtained by adding a second noise component to an image obtained from the imaging device 10 or another device. That is, the training images in this embodiment use a common image for the low-quality image and the high-quality image, and the noise components in the two images differ. Because the same underlying image is used, no alignment is needed when forming the image pair.

ノイズ成分としては、ガウシアンノイズ、対象画像特有のノイズをモデル化したもの等をノイズとして付加する。ただし、第一と第二のノイズはそれぞれ異なるノイズとする。異なるノイズとは、ノイズを付加する空間的な場所（画素の位置）が異なる、あるいはノイズの値が異なるなどを意味する。対象画像特有のノイズとしては、例えばＯＣＴの場合、模型眼や被検眼を置かない状態で撮影したデータを基にノイズを推定し、それらをノイズモデルとして使用することが出来る。ＯＣＴＡの場合では、無血管領域（ＦＡＺ）の範囲に現れるノイズや、血液の流れを模式的に再現した模型眼を撮影した画像に現れるノイズを基に、ノイズモデルとして使用することが出来る。 Noise components that are added include Gaussian noise and modeled noise specific to the target image. However, the first and second noises are different noises. Different noises mean that the spatial locations (pixel positions) to which the noise is added are different, or that the noise values are different. For example, in the case of OCT, noise specific to the target image can be estimated based on data captured without a model eye or a test eye being placed, and used as a noise model. In the case of OCTA, noise that appears in the avascular zone (FAZ) or noise that appears in an image captured of a model eye that reproduces blood flow in a schematic manner can be used as a noise model.

ガウシアンノイズの場合は、ノイズの大きさとして標準偏差、あるいは分散値を定義し、それらの数値に基づいて画像にランダムにノイズを与える。ランダムノイズを与えた結果として、全体としての平均値は変わらないようにしてもよい。すなわち、１画像の各画素に付加されるノイズの平均値は０となるようにする。ここで、平均値は０となるようにする必要はなく、入力データと出力データとに対して互いに異なるパターンのノイズが付加できればよい。また、入力データと出力データとの両方にノイズを付加する必要はなく、いずれか一方にノイズを付加してもよい。ここで、ノイズを付加しない場合、例えば、高画質化後の画像では血管の偽像が生じる場合があったが、これは、高画質化前後の画像の差異が比較的大きい場合に生じると考えることも可能である。このため、高画質化前後の画像の差異が低減されるようにしてもよい。このとき、学習時において、低画質画像と高画質画像とに対して異なるパターンのノイズを付加して得た２つの画像をペア画像としてもよいし、また、高画質画像に対して異なるパターンのノイズを付加して得た２つの画像をペア画像としてもよい。 In the case of Gaussian noise, a standard deviation or variance is defined as the noise magnitude, and noise is added to the image at random based on that value. The random noise may be added so that the overall mean value does not change; that is, the mean of the noise added to the pixels of one image is set to 0. The mean does not have to be 0, however; it is sufficient that noise of mutually different patterns can be added to the input data and the output data. Nor is it necessary to add noise to both the input data and the output data; noise may be added to only one of them. When no noise is added, false images of blood vessels have, for example, sometimes appeared in the image after image quality improvement, and this can be considered to occur when the difference between the images before and after image quality improvement is relatively large. The difference between the images before and after image quality improvement may therefore be reduced. In this case, at training time, the image pair may be two images obtained by adding noise of different patterns to a low-quality image and a high-quality image, or two images obtained by adding noise of different patterns to a high-quality image.
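Building a training pair from one image with two different Gaussian noise patterns can be sketched as below. This is only an illustration under assumptions: images are 2D lists of floats, the same standard deviation is used for both noise components, and the names `add_gaussian_noise`, `make_training_pair`, and `zero_mean` are hypothetical. The optional re-centering step corresponds to keeping the mean of the added noise at 0, which the text notes is not required.

```python
import random


def add_gaussian_noise(image, sigma, rng, zero_mean=True):
    """Add Gaussian noise with standard deviation sigma to every pixel.

    With zero_mean=True the sampled noise is re-centered so that the noise
    added to this one image averages to 0 and the image mean is preserved.
    """
    noise = [[rng.gauss(0.0, sigma) for _ in row] for row in image]
    offset = 0.0
    if zero_mean:
        count = sum(len(r) for r in noise)
        offset = sum(sum(r) for r in noise) / count
    return [
        [p + (e - offset) for p, e in zip(img_row, noise_row)]
        for img_row, noise_row in zip(image, noise)
    ]


def make_training_pair(image, sigma, seed):
    """Return (input, target): the same image with two independent noise patterns."""
    rng = random.Random(seed)
    noisy_input = add_gaussian_noise(image, sigma, rng)
    noisy_target = add_gaussian_noise(image, sigma, rng)
    return noisy_input, noisy_target


# Illustrative flat 4x4 image; a real pair would use an acquired image.
source_image = [[100.0] * 4 for _ in range(4)]
pair_input, pair_target = make_training_pair(source_image, sigma=5.0, seed=0)
```

Because both images derive from the same source, the pair needs no registration step. To add noise to only one side, the untouched `source_image` can be used directly as the other member of the pair.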

出力部４０５は、高画質化部４０４が生成した高画質画像を表示部２０に表示させる。なお、出力部４０５は、高画質画像とともに、入力画像を表示部２０に表示させてもよい。 The output unit 405 displays the high-quality image generated by the image quality improvement unit 404 on the display unit 20. The output unit 405 may also display the input image on the display unit 20 together with the high-quality image.

なお、本実施形態では、撮影装置１０や他の装置から入手した低画質画像に第一のノイズ成分と第一のノイズ成分とは異なる第二のノイズ成分を付加した画像を用いて高画質画像を生成したが、これらの処理を行う構成はこれに限られない。例えば、ノイズを付加する画像は、第一の実施形態で示した重ね合わせ処理をした高画質画像に対して第一および第二のノイズ成分を付加するようにしてもよい。すなわち、重ね合わせ処理画像に第一のノイズ成分を付加した画像を低画質画像、重ね合わせ処理画像に第二のノイズ成分を付加した画像を高画質画像として学習する構成としてもよい。 In this embodiment, a high-quality image is generated using an image in which a first noise component and a second noise component different from the first noise component are added to a low-quality image obtained from the imaging device 10 or another device, but the configuration for performing these processes is not limited to this. For example, the image to which noise is added may be a high-quality image that has been subjected to the overlay process shown in the first embodiment, with the first and second noise components added. In other words, a configuration may be used in which an image in which the first noise component is added to the overlay process image is learned as a low-quality image, and an image in which the second noise component is added to the overlay process image is learned as a high-quality image.

さらには、本実施形態では、第一と第二のノイズ成分を用いて学習する例について説明したがこれに限らない。例えば、低画質画像とする方にのみ第一のノイズ成分を付加し、高画質画像とする方にはノイズ成分を付加せずに学習を行う構成としてもよい。その際の画像としては、撮影装置１０や他の装置から入手した画像でも良いし、その画像を重ね合わせ処理した画像を対象とするようにしてもよい。 Furthermore, in this embodiment, an example of learning using the first and second noise components has been described, but this is not limiting. For example, a configuration may be used in which the first noise component is added only to the image to be considered as a low-quality image, and learning is performed without adding any noise component to the image to be considered as a high-quality image. The image in this case may be an image obtained from the imaging device 10 or another device, or an image obtained by superimposing the image may be used as the target.

さらには、ノイズ成分の大きさを入力画像の種類、あるいは、学習する矩形領域画像毎に動的に変更するようにしても良い。具体的には、値の大きなノイズを付加するとノイズ除去の効果が大きくなり、値の小さなノイズを付加するとノイズ除去の効果は小さい。そのため、例えば、暗い画像の時には付加するノイズ成分の値を小さくして、明るい画像の時には付加するノイズ成分の値を大きくするなど、画像全体あるいは矩形領域画像の条件や種類に応じて付加するノイズを調整して学習をするようにしても良い。 Furthermore, the size of the noise component may be dynamically changed for each type of input image or rectangular area image being learned. Specifically, adding a large noise value increases the effect of noise removal, while adding a small noise value decreases the effect of noise removal. Therefore, for example, the noise component value to be added may be reduced for dark images and increased for bright images, and learning may be performed by adjusting the noise to be added according to the conditions and type of the entire image or rectangular area image.
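One possible way to adjust the noise per rectangular area image is sketched below. The linear mapping from mean brightness to noise standard deviation, the assumed pixel range of [0, 1], and the default bounds are all hypothetical choices, not values given in this description.

```python
def mean_intensity(patch):
    """Mean pixel value of a 2D patch given as a list of rows."""
    total = sum(sum(row) for row in patch)
    count = sum(len(row) for row in patch)
    return total / count


def noise_sigma_for_patch(patch, sigma_min=0.01, sigma_max=0.10):
    """Scale the noise standard deviation linearly with patch brightness.

    Pixel values are assumed to lie in [0, 1]; the mean is clamped to that
    range so that dark patches receive small noise and bright patches
    receive large noise.
    """
    brightness = min(1.0, max(0.0, mean_intensity(patch)))
    return sigma_min + (sigma_max - sigma_min) * brightness


dark_patch = [[0.1, 0.1], [0.1, 0.1]]
bright_patch = [[0.9, 0.9], [0.9, 0.9]]
dark_sigma = noise_sigma_for_patch(dark_patch)
bright_sigma = noise_sigma_for_patch(bright_patch)
```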

なお、本実施形態において、画像の撮影条件については明記しなかったが、様々な撮影範囲とスキャン数の異なる画像、異なる撮影部位や異なる深度の正面画像などを用いて学習をしておく。 In this embodiment, the image capture conditions are not specified, but learning is performed using images with various capture ranges and different numbers of scans, frontal images of different capture sites, and images of different depths.

上記では、撮影装置１０や他の装置から入手した画像、その画像にノイズを付加したノイズ画像、重ね合わせ処理画像、重ね合わせ処理画像にノイズを付加した画像について説明をした。しかし、これらの組み合わせは上述したものに限らず、どのように低画質画像と高画質画像とを組み合わせてもよい。 The above describes images obtained from the image capture device 10 or other devices, noise images obtained by adding noise to those images, superimposition processed images, and images obtained by adding noise to superimposition processed images. However, these combinations are not limited to those described above, and low-quality images and high-quality images may be combined in any manner.

＜第１９の実施形態＞ <Nineteenth embodiment>
次に、図２５、２６を参照して、第１９の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が複数の高画質化エンジンを備え、入力画像に対して複数の高画質画像を生成する。そして、合成部２５０５が複数の高画質化エンジンから出力される複数の高画質画像を合成する例について説明をする。 Next, an image processing device according to a nineteenth embodiment will be described with reference to Figs. 25 and 26. In this embodiment, an image quality improvement unit includes a plurality of image quality improvement engines, and generates a plurality of high-image-quality images for an input image. An example in which a synthesis unit 2505 synthesizes the plurality of high-image-quality images output from the plurality of image quality improvement engines will be described.

本実施形態に係る取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして画像を取得する。 The acquisition unit 401 in this embodiment acquires images as input data to be processed from the imaging device 10 or other devices.

本実施形態に係る高画質化部４０４には、第２の実施形態と同様に複数の高画質化エンジンが備えられている。ここで、複数の高画質化エンジンの各々は、それぞれ撮影部位、撮影画角、異なる深度の正面画像、ノイズ成分、及び画像の解像度のうちの少なくとも一つについての異なる学習データを用いて学習を行ったものである。高画質化部４０４は、入力画像の撮影部位、撮影画角、異なる深度の正面画像、ノイズ成分、及び画像の解像度のうちの少なくとも一つに応じた高画質化エンジンを複数用いて、高画質画像を生成する。 The image quality improvement unit 404 according to this embodiment is equipped with multiple image quality improvement engines, as in the second embodiment. Here, each of the multiple image quality improvement engines has been trained using different learning data for at least one of the imaging part, imaging angle of view, front images at different depths, noise components, and image resolution. The image quality improvement unit 404 generates a high-quality image using multiple image quality improvement engines that correspond to at least one of the imaging part, imaging angle of view, front images at different depths, noise components, and image resolution of the input image.

図２６は、本実施形態に係る一連の画像処理のフロー図である。なお、本実施形態に係るステップＳ２６１０及びステップＳ２６２０の処理は、第１の実施形態におけるステップＳ５１０及びステップＳ５２０での処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ２６２０の処理の後に、ステップＳ２６３０の処理を省き、処理をステップＳ２６４０に移行してよい。 Figure 26 is a flow diagram of a series of image processing steps according to this embodiment. Note that the processing steps S2610 and S2620 according to this embodiment are similar to the processing steps S510 and S520 in the first embodiment, and therefore will not be described. Note that if the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, after the processing step S2620, the processing step S2630 may be omitted and the processing may proceed to step S2640.

ステップＳ２６２０において、第１の実施形態と同様に、撮影条件取得部４０２が入力画像の撮影条件群を取得したら、処理はステップＳ２６３０に移行する。ステップＳ２６３０では、高画質化可否判定部４０３が、第２の実施形態と同様に、取得された撮影条件群を用いて、高画質化部４０４に備える高画質化エンジンのいずれかが入力画像を対処可能であるか否かを判定する。 In step S2620, as in the first embodiment, once the shooting condition acquisition unit 402 acquires a group of shooting conditions for the input image, the process proceeds to step S2630. In step S2630, as in the second embodiment, the image quality improvement feasibility determination unit 403 uses the acquired group of shooting conditions to determine whether or not any of the image quality improvement engines included in the image quality improvement unit 404 can handle the input image.

高画質化可否判定部４０３が、高画質化エンジン群のいずれも入力画像を対処不可能であると判定した場合には、処理はステップＳ２６６０に移行する。一方で、高画質化可否判定部４０３が、高画質化エンジン群のいずれかが入力画像を対処可能であると判定した場合には、処理はステップＳ２６４０に移行する。なお、画像処理装置４００の設定や実装形態によっては、第１の実施形態と同様に、高画質化エンジンによって一部の撮影条件が対処不可能であると判定されたとしても、ステップＳ２６４０を実施してもよい。 If the image quality improvement feasibility determination unit 403 determines that none of the image quality improvement engines can handle the input image, the process proceeds to step S2660. On the other hand, if the image quality improvement feasibility determination unit 403 determines that any of the image quality improvement engines can handle the input image, the process proceeds to step S2640. Note that, depending on the settings and implementation of the image processing device 400, step S2640 may be performed, as in the first embodiment, even if it is determined that some imaging conditions cannot be handled by the image quality improvement engines.

ステップＳ２６４０においては、高画質化部４０４が、高画質化エンジン群のそれぞれにステップＳ２６１０において取得した入力画像を入力し、高画質画像群を生成する。 In step S2640, the image quality improvement unit 404 inputs the input image acquired in step S2610 to each of the image quality improvement engines, and generates a group of high-image-quality images.

ステップＳ２６５０では、合成部２５０５が、ステップＳ２６４０において生成された高画質画像群のうちいくつかの高画質な画像を合成する。具体的には、例えば、第１の実施形態で示したように撮影装置１０から取得した低画質画像と、低画質画像を複数回撮影することにより取得した画像群に対して加算平均等の重ね合わせ処理をして得た高画質画像とのペア画像を用いて学習した第一の高画質化エンジンと、第１８の実施形態で示したような画像にノイズを付加したペア画像を用いて学習した第二の高画質化エンジンとの２つの高画質画像の結果を合成する。合成方法としては、加算平均や重み付き加算平均などを用いて合成することが出来る。 In step S2650, the synthesis unit 2505 combines some of the high-quality images in the group generated in step S2640. Specifically, for example, it combines the output images of two engines: a first image quality improvement engine trained on pairs consisting of a low-quality image acquired from the imaging device 10, as shown in the first embodiment, and a high-quality image obtained by applying superimposition processing such as averaging to a group of images acquired by capturing the low-quality image multiple times; and a second image quality improvement engine trained on pairs of images to which noise has been added, as shown in the 18th embodiment. The images can be combined by averaging, weighted averaging, or the like.
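The averaging and weighted averaging mentioned for step S2650 can be illustrated with a minimal sketch. The function name, the list-of-rows image representation, and the example weights are assumptions made for illustration.

```python
def blend_images(images, weights=None):
    """Pixel-wise weighted average of same-sized 2D images (lists of rows).

    With weights=None every image contributes equally (plain averaging).
    """
    if not images:
        raise ValueError("at least one image is required")
    if weights is None:
        weights = [1.0] * len(images)
    if len(weights) != len(images):
        raise ValueError("one weight per image is required")
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    height, width = len(images[0]), len(images[0][0])
    return [
        [
            sum(w * img[y][x] for w, img in zip(weights, images)) / total
            for x in range(width)
        ]
        for y in range(height)
    ]


first_engine_output = [[0.0, 2.0]]
second_engine_output = [[4.0, 6.0]]
plain = blend_images([first_engine_output, second_engine_output])
weighted = blend_images([first_engine_output, second_engine_output], [3.0, 1.0])
```

Here `plain` is the simple mean of the two engine outputs, while `weighted` favors the first engine three to one.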

ステップＳ２６６０においては、出力部４０５が、ステップＳ２６５０において合成された画像を表示部２０に表示させたり、他の装置に出力したりする。ただし、ステップＳ２６３０において、入力画像が処理不可能であると判定されている場合には、出力部４０５は、入力画像を出力画像として出力する。なお、出力部４０５は、検者によって入力画像が指示された場合や、入力画像が処理不可能であった場合には、表示部２０に出力画像が入力画像と同じであることを表示させてもよい。 In step S2660, the output unit 405 displays the image synthesized in step S2650 on the display unit 20 or outputs it to another device. However, if it is determined in step S2630 that the input image cannot be processed, the output unit 405 outputs the input image as the output image. Note that if an input image is specified by the examiner or if the input image cannot be processed, the output unit 405 may display on the display unit 20 that the output image is the same as the input image.
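The control flow of steps S2630 to S2660 may be summarized by the following sketch. The `(can_handle, enhance)` engine interface, the `combine` callable, and the choice to run only the engines judged able to handle the input are hypothetical simplifications, not the interface of the device itself.

```python
def run_pipeline(image, conditions, engines, combine):
    """Sketch of steps S2630 to S2660.

    engines: list of (can_handle, enhance) callables (hypothetical interface).
    combine: callable merging a list of enhanced images into one image.
    Returns (output_image, was_enhanced).
    """
    # S2630: keep the engines judged able to handle the input image.
    usable = [enhance for can_handle, enhance in engines if can_handle(conditions)]
    if not usable:
        # S2660 when no engine applies: the input image is the output image.
        return image, False
    # S2640: generate one enhanced image per usable engine.
    enhanced = [enhance(image) for enhance in usable]
    # S2650: combine the enhanced images; the result is output in S2660.
    return combine(enhanced), True


def mean_image(images):
    """Element-wise mean of 1D images, used here as a toy combiner."""
    return [sum(pixels) / len(pixels) for pixels in zip(*images)]


toy_engines = [
    (lambda c: c["site"] == "macula", lambda img: [p * 0.5 for p in img]),
    (lambda c: True, lambda img: [p * 2.0 for p in img]),
]
output, was_enhanced = run_pipeline([2.0, 4.0], {"site": "macula"}, toy_engines, mean_image)
```

The returned flag lets the caller indicate on the display that the output image is identical to the input image when no enhancement was performed.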

＜第２０の実施形態＞ <Twentieth embodiment>
次に、図４を参照して、第２０の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部が第一の高画質化エンジンの出力結果を用いて第二の高画質化エンジンが高画質画像を生成する例について説明をする。 Next, an image processing device according to a twentieth embodiment will be described with reference to Fig. 4. In this embodiment, an example will be described in which, in the image quality improvement unit, a second image quality improvement engine generates a high-quality image using the output result of a first image quality improvement engine.

本実施形態に係る高画質化部４０４には、第１の実施形態と同様の、高画質化エンジンが複数備えられている。本実施形態の高画質化部は、撮影装置１０や他の装置から入力データとして取得した低画質画像と、複数の低画質画像から生成された中画質画像を出力データとして学習した第一の高画質化エンジンを備える。さらに、第一の高画質化エンジンから出力された画像と、中画質画像よりも高画質な画像を出力データとして学習した第二の高画質化エンジンを備える。なお、中画質画像に関しては、第１４の実施形態と同様であるため、説明を省略する。 The image quality improvement unit 404 according to this embodiment is provided with a plurality of image quality improvement engines similar to that of the first embodiment. The image quality improvement unit of this embodiment includes a first image quality improvement engine trained with low-quality images acquired from the imaging device 10 or another device as input data and medium-quality images generated from a plurality of low-quality images as output data. It further includes a second image quality improvement engine trained with images output from the first image quality improvement engine as input data and images of higher quality than the medium-quality images as output data. The medium-quality images are the same as in the fourteenth embodiment, and their description is therefore omitted.

出力部４０５は、高画質化部４０４が生成した高画質画像を表示部２０に表示させる。なお、出力部４０５は、高画質画像とともに、入力画像を表示部２０に表示させてもよく、この場合に、出力部４０５は、入力画像が複数の低画質画像から生成された画像であることを表示部２０に表示してもよい。 The output unit 405 displays the high-quality image generated by the image quality improvement unit 404 on the display unit 20. The output unit 405 may display the input image on the display unit 20 together with the high-quality image, and in this case, the output unit 405 may display on the display unit 20 that the input image is an image generated from a plurality of low-quality images.

次に、図５を参照して、本実施形態に係る一連の画像処理について説明する。なお、本実施形態に係るステップＳ５１０～ステップＳ５３０の処理は、第１の実施形態におけるこれらの処理と同様であるため、説明を省略する。 Next, a series of image processing steps according to this embodiment will be described with reference to FIG. 5. Note that the processing steps S510 to S530 according to this embodiment are similar to those steps in the first embodiment, and therefore will not be described.

ステップＳ５４０においては、高画質化部４０４が、高画質化エンジンを用いて、入力画像を高画質化し、入力画像よりも画像診断に適した高画質画像を生成する。具体的には、高画質化部４０４は、入力画像を第一の高画質化エンジンに入力し、高画質化された第一の高画質画像を生成させる。さらに、第一の高画質画像を第二の高画質化エンジンに入力し、第二の高画質画像を得る。高画質化エンジンは、教師データを用いて機械学習を行った機械学習モデルに基づいて、入力画像を用いて重ね合わせ処理を行ったような高画質画像を生成する。このため、高画質化エンジンは、入力画像よりも、ノイズ低減されたり、コントラスト強調されたりした高画質画像を生成することができる。 In step S540, the image quality improvement unit 404 uses the image quality improvement engines to improve the image quality of the input image and generate a high-quality image that is more suitable for image diagnosis than the input image. Specifically, the image quality improvement unit 404 inputs the input image to the first image quality improvement engine, which generates a first high-quality image. The first high-quality image is then input to the second image quality improvement engine to obtain a second high-quality image. Based on a machine learning model trained using teacher data, each image quality improvement engine generates a high-quality image that looks as if superimposition processing had been performed using the input image. The image quality improvement engine can therefore generate a high-quality image in which noise is reduced or contrast is enhanced compared with the input image.

なお、本実施形態では、撮影装置１０や他の装置から入手した低画質画像と中画質画像とをペアで学習した第一の高画質化エンジンと第一の高画質画像と高画質画像とをペアで学習した第二の高画質エンジンを用いて高画質画像を生成したが、これらの処理を行う構成はこれに限られない。例えば、第一の高画質化エンジンで学習する画像のペアは、第１８の実施形態で説明をしたノイズを学習するエンジンとし、第二の高画質化エンジンは第一の高画質画像と高画質画像とをペアで学習するようにしてもよい。逆の構成として、低画質画像と中画質画像とをペアで学習した第一の高画質化エンジンと、第二の高画質化エンジンは第一の高画質画像に対してノイズを付加した画像を学習したエンジンとしてもよい。 In this embodiment, high-quality images are generated using a first high-quality engine that has learned a pair of low-quality images and medium-quality images obtained from the imaging device 10 or another device, and a second high-quality engine that has learned a pair of first high-quality images and high-quality images, but the configuration for performing these processes is not limited to this. For example, the image pair learned by the first high-quality engine may be an engine that learns noise as described in the 18th embodiment, and the second high-quality engine may learn a pair of the first high-quality image and a high-quality image. As an opposite configuration, the first high-quality engine may learn a pair of low-quality images and medium-quality images, and the second high-quality engine may learn an image with noise added to the first high-quality image.

さらに、第一の高画質化エンジンと第二の高画質化エンジン共に、第１８の実施形態で説明をしたノイズを学習するエンジンとしてもよい。この場合、例えば、第一の高画質化エンジンは、重ね合わせ処理画像により生成した高画質画像に第一および第二のノイズを付加した画像をペアで学習し、第二の高画質化エンジンは、第一の高画質化エンジンにより生成された第一の高画質画像に対して第一および第二のノイズを付加した画像をペアで学習する。なお、本実施形態では、二つの高画質化エンジンについて説明を行ったが、これに限らず、第三、第四と、さらに連結して処理をする構成としてもよい。学習に用いる画像をきれいにしていくことで、より滑らかでシャープな画像を生成しやすいネットワークが構成される。 Furthermore, both the first image quality improvement engine and the second image quality improvement engine may be engines that learn noise as described in the 18th embodiment. In this case, for example, the first image quality improvement engine learns a pair of images in which the first and second noises are added to a high-quality image generated by the superimposition processing image, and the second image quality improvement engine learns a pair of images in which the first and second noises are added to the first high-quality image generated by the first image quality improvement engine. Note that, although two image quality improvement engines have been described in this embodiment, this is not limiting, and a configuration in which a third and fourth are further linked and processed may be used. By cleaning up the images used for learning, a network that is more likely to generate smoother and sharper images is constructed.
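The chained use of engines in this embodiment, including the extension to a third or fourth engine, can be sketched as a simple sequential application. The `smooth` function below is only a stand-in for a trained engine, and all names are hypothetical.

```python
def run_cascade(image, engines):
    """Apply the engines in order and return the output of every stage.

    The last element corresponds to the final high-quality image; earlier
    elements correspond to the intermediate (first, second, ...) images.
    """
    outputs = []
    current = image
    for engine in engines:
        current = engine(current)
        outputs.append(current)
    return outputs


def smooth(signal):
    """Stand-in engine: 3-tap moving average over a 1D signal, edges clamped."""
    last = len(signal) - 1
    return [
        (signal[max(i - 1, 0)] + signal[i] + signal[min(i + 1, last)]) / 3.0
        for i in range(len(signal))
    ]


# Two chained stages; appending more callables extends the chain.
stages = run_cascade([0.0, 3.0, 0.0], [smooth, smooth])
```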

＜第２１の実施形態＞ <Twenty-first embodiment>
次に、図４及び２７を参照して、第２１の実施形態に係る画像処理装置について説明する。第１の実施形態では、高画質化部４０４は、一つの高画質化エンジンを備えていた。これに対して、本実施形態では、高画質化部が、異なる教師データを用いて機械学習を行った複数の高画質化エンジンを備え、入力画像に対して複数の高画質画像を生成する。 Next, an image processing device according to a twenty-first embodiment will be described with reference to Figs. 4 and 27. In the first embodiment, the image quality improvement unit 404 included one image quality improvement engine. In contrast, in this embodiment, the image quality improvement unit includes multiple image quality improvement engines that have undergone machine learning using different teacher data, and generates multiple high-quality images for an input image.

特に明記しない限り、本実施形態に係る画像処理装置の構成及び処理は、第２の実施形態に係る画像処理装置４００と同様である。そのため、以下では、本実施形態に係る画像処理装置について、第１、第２の実施形態に係る画像処理装置との違いを中心として説明する。なお、本実施形態に係る画像処理装置の構成は、第１、第２の実施形態に係る画像処理装置の構成と同様であるため、図４に示す構成について同一の参照符号を用いて示し、説明を省略する。 Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment is the same as the image processing device 400 according to the second embodiment. Therefore, the following description of the image processing device according to this embodiment will focus on the differences from the image processing devices according to the first and second embodiments. Note that since the configuration of the image processing device according to this embodiment is the same as the configuration of the image processing device according to the first and second embodiments, the configuration shown in FIG. 4 will be indicated using the same reference numerals and description will be omitted.

本実施形態に係る高画質化部４０４には、それぞれ異なる教師データを用いて機械学習が行われた二つ以上の高画質化エンジンが備えられている。ここで、本実施形態に係る教師データ群の作成方法について説明する。まず、様々な撮影範囲とスキャン数の異なる画像で撮影された、入力データとしての元画像と出力データとしての重ね合わせ画像のペア群を用意する。ＯＣＴやＯＣＴＡを例に説明すると、例えば、３×３ｍｍの範囲を３００本のＡスキャンと３００枚のＢスキャンで撮影した第一の画像群のペアと、１０×１０ｍｍの範囲を５００本のＡスキャンと５００枚のＢスキャンで撮影した第二の画像群のペアとする。この時、第一の画像群のペアと第二の画像群のペアとでは、スキャン密度が２倍異なる。そのため、これらの画像群は別としてグルーピングしておく。そして、６×６ｍｍの範囲を６００本のＡスキャンと６００枚のＢスキャンで撮影した画像群がある場合には、第一の画像群と同一のグループとする。すなわち、ここではスキャン密度が同じか、ほぼ同じ（１割程度の誤差）の画像群を同一のグループでグルーピングをする。 The image quality improvement unit 404 according to this embodiment is equipped with two or more image quality improvement engines, each of which has undergone machine learning using different teacher data. Here, a method for creating the teacher data group according to this embodiment will be described. First, a group of pairs is prepared, each consisting of an original image as input data and a superimposed image as output data, captured with various imaging ranges and different numbers of scans. Taking OCT and OCTA as an example, suppose there are pairs of a first image group, in which a 3×3 mm range is captured with 300 A-scans and 300 B-scans, and pairs of a second image group, in which a 10×10 mm range is captured with 500 A-scans and 500 B-scans. In this case, the scan density differs by a factor of two between the pairs of the first image group and the pairs of the second image group. These image groups are therefore placed in separate groups. If there is also an image group in which a 6×6 mm range is captured with 600 A-scans and 600 B-scans, it is placed in the same group as the first image group. In other words, image groups whose scan densities are the same or nearly the same (within an error of about 10%) are placed in the same group.

次に、スキャン密度毎にペア群をグルーピングすることで、教師データ群を作成する。例えば、第一のスキャン密度で撮影して取得されたペア群で構成される第一の教師データ、第二のスキャン密度で撮影して取得されたペア群で構成される第二の教師データというように、教師データ群を作成する。 Next, a training data group is created by grouping the pairs by scan density. For example, a training data group is created in such a way that a first training data group is made up of pairs acquired by shooting at a first scan density, and a second training data group is made up of pairs acquired by shooting at a second scan density.
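The grouping by scan density can be checked with a short worked example. It assumes that scan density is measured as the number of A-scans per millimetre of scan width (the B-scan density is equal in the examples given) and that a new acquisition joins the first group whose reference density lies within the 10% tolerance; both are illustrative assumptions.

```python
def scan_density(a_scans, width_mm):
    """A-scans per millimetre along the scan width."""
    return a_scans / width_mm


def group_by_scan_density(acquisitions, tolerance=0.10):
    """Group acquisitions whose scan densities agree within the tolerance.

    acquisitions: list of (label, a_scans, width_mm) tuples.
    Returns a list of (reference_density, [labels]) tuples, where the
    reference density is that of the first member of each group.
    """
    groups = []
    for label, a_scans, width_mm in acquisitions:
        density = scan_density(a_scans, width_mm)
        for reference, members in groups:
            if abs(density - reference) <= tolerance * reference:
                members.append(label)
                break
        else:
            # No existing group is close enough: start a new one.
            groups.append((density, [label]))
    return groups


acquisitions = [
    ("3x3 mm", 300, 3.0),    # 100 A-scans per mm
    ("10x10 mm", 500, 10.0),  # 50 A-scans per mm
    ("6x6 mm", 600, 6.0),    # 100 A-scans per mm
]
teacher_groups = group_by_scan_density(acquisitions)
```

With the three acquisitions from the example, the 3×3 mm and 6×6 mm data fall into one group at 100 A-scans per mm, and the 10×10 mm data form a second group at 50 A-scans per mm.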

その後、各教師データを用いて別々の高画質化エンジンに機械学習を行わせる。例えば、第一の教師データでトレーニングされた機械学習モデルに対応する第一の高画質化エンジン、第二の教師データでトレーニングされた機械学習モデルに対応する第二の高画質化エンジンというように高画質化エンジン群を用意する。 Then, separate image quality improvement engines are made to perform machine learning using each training data. For example, a group of image quality improvement engines is prepared, such as a first image quality improvement engine corresponding to a machine learning model trained with the first training data, and a second image quality improvement engine corresponding to a machine learning model trained with the second training data.

このような高画質化エンジンは、それぞれ対応する機械学習モデルのトレーニングに用いた教師データが異なるため、高画質化エンジンに入力される画像の撮影条件によって、入力画像を高画質化できる程度が異なる。具体的には、第一の高画質化エンジンは、第一のスキャン密度で取得された入力画像に対しては高画質化の程度が高く、第二のスキャン密度で取得された画像に対しては高画質化の程度が低い。同様に、第二の高画質化エンジンは、第二のスキャン密度で取得された入力画像に対しては高画質化の程度が高く、第一のスキャン密度で取得された画像に対しては高画質化の程度が低い。 Since each of these image quality improvement engines uses different teacher data to train the corresponding machine learning model, the degree to which the image quality of the input image can be improved varies depending on the shooting conditions of the image input to the image quality improvement engine. Specifically, the first image quality improvement engine provides a high degree of image quality improvement for input images acquired at a first scan density, but a low degree of image quality improvement for images acquired at a second scan density. Similarly, the second image quality improvement engine provides a high degree of image quality improvement for input images acquired at a second scan density, but a low degree of image quality improvement for images acquired at a first scan density.

一方、学習時に様々な撮影範囲とスキャン密度の異なる画像を教師データとして十分の数を集められない場合がある。その場合、それらの画像群に対しては、第１８の実施形態で示したように、ノイズ成分を学習した高画質化エンジンを用意する。 On the other hand, there are cases where it is not possible to collect a sufficient number of images with various shooting ranges and different scan densities as training data during learning. In such cases, for such image groups, an image quality improvement engine that has learned noise components is prepared, as shown in the 18th embodiment.

ノイズ成分を学習した高画質化エンジンは、撮影時のスキャン密度の影響を受けにくいため、学習していないスキャン密度の画像が入力された際には、こちらを適用する。 The image quality improvement engine, which has learned about noise components, is less affected by the scan density at the time of shooting, so it is applied when an image with a scan density that has not been learned is input.

教師データのそれぞれがスキャン密度によってグルーピングされたペア群で構成されることにより、該ペア群を構成する画像群の画質傾向が似る。このため、高画質化エンジンは対応するスキャン密度であれば、第一の実施形態に係る高画像化エンジンよりも効果的に高画質化を行うことができる。なお、教師データのペアをグルーピングするための撮影条件は、スキャン密度に限られず、撮影部位であったり、正面画像においては異なる深度の画像であったり、これらのうちの二つ以上の組み合わせであったりしてもよい。 Since each piece of training data is composed of pairs grouped by scan density, the image quality tendencies of the images constituting the pair groups are similar. Therefore, the image quality improvement engine can improve image quality more effectively than the image improvement engine according to the first embodiment, as long as the image quality is compatible with the corresponding scan density. Note that the shooting conditions for grouping pairs of training data are not limited to scan density, but may be the shooting site, images at different depths in frontal images, or a combination of two or more of these.

以下、図２７を参照して、本実施形態に係る一連の画像処理について説明する。図２７は、本実施形態に係る一連の画像処理のフロー図である。なお、ステップＳ２７１０及びステップＳ２７２０の処理は、第１の実施形態に係るステップＳ５１０及びステップＳ５２０と同様であるため、説明を省略する。 Below, a series of image processing according to this embodiment will be described with reference to FIG. 27. FIG. 27 is a flow diagram of a series of image processing according to this embodiment. Note that the processing of steps S2710 and S2720 is similar to steps S510 and S520 according to the first embodiment, and therefore description thereof will be omitted.

ステップＳ２７２０において入力画像の撮影条件が取得されると、処理はステップＳ２７３０に移行する。ステップＳ２７３０においては、高画質化可否判定部４０３が、ステップＳ２７２０において取得した撮影条件群を用いて、高画質化部４０４が備える高画質化エンジン群のいずれかが、入力画像を対処可能であるか否かを判定する。 When the shooting conditions of the input image are acquired in step S2720, the process proceeds to step S2730. In step S2730, the image quality improvement feasibility determination unit 403 uses the group of shooting conditions acquired in step S2720 to determine whether any of the image quality improvement engines included in the image quality improvement unit 404 can handle the input image.

高画質化可否判定部４０３が、撮影条件外であると判定した場合には、処理はステップＳ２７７０に移行する。一方で、高画質化可否判定部４０３が、撮影条件内であると判定した場合には、処理はステップＳ２７４０に移行する。 If the image quality improvement feasibility determination unit 403 determines that the input image falls outside the supported imaging conditions, the process proceeds to step S2770. On the other hand, if it determines that the input image falls within the supported imaging conditions, the process proceeds to step S2740.

ステップＳ２７４０においては、高画質化部４０４が、ステップＳ２７２０で取得した入力画像の撮影条件及び高画質化エンジン群の教師データの情報に基づいて、高画質化エンジン群から高画質化処理を行う高画質化エンジンを選択する。具体的には、例えば、ステップＳ２７２０において取得した撮影条件群のうちのスキャン密度に対して、スキャン密度に関する教師データの情報を有し、高画質化の程度が高い高画質化エンジンを選択する。上述の例では、スキャン密度が第一のスキャン密度である場合には、高画質化部４０４は第一の高画質化エンジンを選択する。 In step S2740, the image quality improvement unit 404 selects an image quality improvement engine from the group of image quality improvement engines to perform image quality improvement processing based on the shooting conditions of the input image acquired in step S2720 and the teacher data information of the group of image quality improvement engines. Specifically, for example, for the scan density among the group of shooting conditions acquired in step S2720, an image quality improvement engine that has teacher data information regarding the scan density and provides a high level of image quality improvement is selected. In the above example, when the scan density is the first scan density, the image quality improvement unit 404 selects the first image quality improvement engine.

一方、ステップＳ２７７０においては、高画質化部４０４は、ノイズ成分を学習した高画質化エンジンを選択する。 On the other hand, in step S2770, the image quality improvement unit 404 selects an image quality improvement engine that has learned the noise components.
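The selection in steps S2740 and S2770 can be sketched as a lookup with a fallback. The relative tolerance, the use of plain labels in place of trained engines, and the function name are assumptions made for illustration.

```python
def select_engine(input_density, density_engines, noise_engine, tolerance=0.10):
    """Pick the engine trained at the scan density closest to the input.

    density_engines: list of (trained_density, engine) tuples.
    If no trained density lies within the relative tolerance, the engine
    trained on noise components is returned instead (step S2770).
    """
    best = None
    for trained_density, engine in density_engines:
        relative_gap = abs(input_density - trained_density) / trained_density
        if relative_gap <= tolerance and (best is None or relative_gap < best[0]):
            best = (relative_gap, engine)
    return best[1] if best is not None else noise_engine


engines = [(100.0, "first engine"), (50.0, "second engine")]
chosen_known = select_engine(100.0, engines, "noise engine")
chosen_unknown = select_engine(75.0, engines, "noise engine")
```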

ステップＳ２７５０では、高画質化部４０４が、ステップＳ２７４０、ステップＳ２７７０において選択した高画質化エンジンを用いて、入力画像を高画質化した高画質画像を生成する。その後、ステップＳ２７６０において、出力部４０５は、ステップＳ２７５０において高画質画像を出力して、表示部２０に表示させる。なお、出力部４０５は、高画質画像を表示部２０に表示させる際、高画質化部４０４によって選択された高画質化エンジンを用いて生成された高画質画像であることを表示させてもよい。 In step S2750, the image quality improvement unit 404 uses the image quality improvement engine selected in step S2740 or step S2770 to generate a high-quality image by improving the image quality of the input image. Then, in step S2760, the output unit 405 outputs the high-quality image generated in step S2750 and displays it on the display unit 20. Note that when displaying the high-quality image on the display unit 20, the output unit 405 may indicate that it is a high-quality image generated using the image quality improvement engine selected by the image quality improvement unit 404.

上記のように、本実施形態に係る高画質化部４０４は、それぞれ異なる学習データを用いて学習を行った複数の高画質化エンジンを備える。ここで、複数の高画質化エンジンの各々は、それぞれ撮影部位、撮影画角、異なる深度の正面画像、及び画像の解像度のうちの少なくとも一つについての異なる学習データを用いて学習を行ったものである。さらに、正解データ（出力データ）を十分に集められなかったデータについては、ノイズ成分を用いて学習を行ったものである。高画質化部４０４は、これらのうちの少なくとも一つに応じた高画質化エンジンを用いて、高画質画像を生成する。 As described above, the image quality improvement unit 404 according to this embodiment includes a plurality of image quality improvement engines that have been trained using different learning data. Here, each of the plurality of image quality improvement engines has been trained using different learning data for at least one of the imaging region, imaging angle of view, front images at different depths, and image resolution. Furthermore, for data for which sufficient correct answer data (output data) could not be collected, learning was performed using noise components. The image quality improvement unit 404 generates a high-quality image using an image quality improvement engine corresponding to at least one of these.

＜第２２の実施形態＞ <Twenty-second embodiment>
次に、図３０から３２を参照して、第２２の実施形態に係る画像処理装置について説明する。本実施形態では、広画角画像生成部が高画質化部によって生成された複数の高画質画像を用いて広画角画像（パノラマ画像）を生成する。 Next, an image processing device according to a 22nd embodiment will be described with reference to Figs. 30 to 32. In this embodiment, a wide-angle image generating unit generates a wide-angle image (panoramic image) using a plurality of high-quality images generated by an image quality improving unit.

図３１（ａ）は、本実施形態に係る一連の画像処理のフロー図である。ステップＳ３１１０において、取得部４０１は撮影装置１０や他の装置から入力データとして複数の画像（少なくとも２枚）を取得する。複数の画像は、同一の被写体（被検眼など）の異なる位置を撮影した画像であり、被写体に対して完全には重複せずに、画像の一部が重複する場所を撮影した画像とする。被検眼を撮影する場合を例に説明すると、撮影時に固視灯の位置を変更し、被検眼がその固視灯に注視することで、同一の被検眼において異なる場所を撮影した画像を取得することが出来る。なお、画像撮影時には、隣接する画像同士の重複領域が少なくとも２割程度が同じ場所となるように固視灯の位置を変更して撮影しておくことが望ましい。図３２（ａ）に、隣接する画像の一部が重複するように固視灯の位置を変更して撮影したＯＣＴＡのＥｎ－Ｆａｃｅ画像の例を示す。図３２（ａ）では、固視灯の位置を変更して異なる場所を５回撮影する場合の例を示している。なお、図３２には例として５枚の画像を示しているが、５枚に限らず２枚以上であればよい。 Figure 31 (a) is a flow diagram of a series of image processing according to this embodiment. In step S3110, the acquisition unit 401 acquires multiple images (at least two) as input data from the imaging device 10 or another device. The multiple images are images of different positions of the same subject (such as the subject's eye), and are images of places where the images do not completely overlap with the subject, but where the images overlap partially. Taking the case of imaging the subject's eye as an example, images of different positions of the same subject's eye can be obtained by changing the position of the fixation light during imaging and having the subject's eye gaze at the fixation light. Note that when imaging an image, it is desirable to change the position of the fixation light so that at least about 20% of the overlapping areas between adjacent images are in the same place. Figure 32 (a) shows an example of an En-Face image of OCTA captured by changing the position of the fixation light so that adjacent images partially overlap. Figure 32 (a) shows an example of a case where the position of the fixation light is changed and different positions are captured five times. Note that while five images are shown as an example in Figure 32, the number is not limited to five and can be two or more.

なお、本実施形態に係るステップＳ３１２０の処理は、第１の実施形態におけるステップＳ５２０での処理と同様であるため、説明を省略する。なお、入力画像に対して、撮影条件について無条件で高画質化する場合には、ステップＳ３１２０の処理の後に、ステップＳ３１３０の処理を省き、処理をステップＳ３１４０に移行してよい。 The process of step S3120 in this embodiment is similar to the process of step S520 in the first embodiment, and therefore will not be described. If the image quality of the input image is to be improved unconditionally with respect to the shooting conditions, the process of step S3130 may be omitted after the process of step S3120, and the process may proceed to step S3140.

ステップＳ３１２０において、第１の実施形態と同様に、撮影条件取得部４０２が入力画像の撮影条件群を取得したら、処理はステップＳ３１３０に移行する。ステップＳ３１３０では、高画質化可否判定部４０３が、第１の実施形態と同様に、取得された撮影条件群を用いて、高画質化部４０４に備える高画質化エンジンが入力画像を対処可能であるか否かを判定する。 In step S3120, as in the first embodiment, once the shooting condition acquisition unit 402 acquires a group of shooting conditions for the input image, the process proceeds to step S3130. In step S3130, as in the first embodiment, the image quality improvement feasibility determination unit 403 uses the acquired group of shooting conditions to determine whether the image quality improvement engine provided in the image quality improvement unit 404 is capable of handling the input image.

高画質化可否判定部４０３が、高画質化エンジンが複数の入力画像を対処不可能であると判定した場合には、処理はステップＳ３１６０に移行する。一方で、高画質化可否判定部４０３が、高画質化エンジンが複数の入力画像を対処可能であると判定した場合には、処理はステップＳ３１４０に移行する。なお、画像処理装置４００の設定や実装形態によっては、第１の実施形態と同様に、高画質化エンジンによって一部の撮影条件が対処不可能であると判定されたとしても、ステップＳ３１４０を実施してもよい。 If the image quality improvement feasibility determination unit 403 determines that the image quality improvement engine cannot handle the multiple input images, the process proceeds to step S3160. On the other hand, if the image quality improvement feasibility determination unit 403 determines that the image quality improvement engine can handle the multiple input images, the process proceeds to step S3140. Note that, depending on the settings and implementation of the image processing device 400, step S3140 may be performed, as in the first embodiment, even if it is determined that some imaging conditions cannot be handled by the image quality improvement engine.

ステップＳ３１４０においては、高画質化部４０４が、ステップＳ３１１０において取得した複数の入力画像に対して処理を実行し複数の高画質画像を生成する。 In step S3140, the image quality improvement unit 404 performs processing on the multiple input images acquired in step S3110 to generate multiple high-image-quality images.

ステップＳ３１５０では、広画角画像生成部３００５が、ステップＳ３１４０において生成された高画質画像群のうちいくつかの高画質な画像を合成する。具体的には、ＯＣＴＡのＥｎ－Ｆａｃｅ画像を例に説明をする。複数の画像は完全には重複しないが、隣接する画像同士は一部の領域が互いに重複するように撮影されたＯＣＴＡのＥｎ－Ｆａｃｅ画像である。そのため、広画角画像生成部３００５は複数のＯＣＴＡのＥｎ－Ｆａｃｅ画像から重複した領域を検出し、重複領域を用いて位置合わせを実施する。位置合わせパラメータに基づいてＯＣＴＡのＥｎ－Ｆａｃｅ画像を変形して画像を合成することで、１枚のＯＣＴＡのＥｎ－Ｆａｃｅ画像よりも広範囲なＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成することが出来る。この時、入力となる複数のＯＣＴＡのＥｎ－Ｆａｃｅ画像はステップＳ３１４０において高画質化されているため、ステップＳ３１５０において出力される広画角なＯＣＴＡのＥｎ－Ｆａｃｅ画像は既に高画質化されている。図３２（ｂ）に広画角画像生成部３００５によって生成される広画角なＯＣＴＡのＥｎ－Ｆａｃｅ画像の例を示す。図３２（ｂ）は図３２（ａ）で示した５枚の画像を位置合わせして生成した例である。図３２（ｃ）には、図３２（ａ）と図３２（ｂ）との位置の対応関係を示す。図３２（ｃ）に示すように、Ｉｍ３２１０を中心に、その周辺にＩｍ３２２０～３２５０が配置される。なお、ＯＣＴＡのＥｎ－Ｆａｃｅ画像は、３次元のモーションコントラストデータから異なる深度範囲を設定することで、複数のＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成することが出来る。そのため、図３２には広画角の表層画像の例を示したが、これに限らない。例えば、図２９で示した表層のＯＣＴＡのＥｎ－Ｆａｃｅ画像（Ｉｍ２９１０）を用いて位置合わせをして、その他の深度範囲のＯＣＴＡのＥｎ－Ｆａｃｅ画像は、そこで求めたパラメータを用いて変形させるようにしてもよい。あるいは、位置合わせの入力画像をカラー画像とし、ＲＧＢ成分のＲＧ成分に表層のＯＣＴＡのＥｎ－Ｆａｃｅ、Ｂ成分に位置合わせの対象となるＯＣＴＡのＥｎ－Ｆａｃｅ画像とした合成カラー画像を生成する。そして、複数の深度範囲の層を１枚に合成した合成カラーＯＣＴＡのＥｎ－Ｆａｃｅ画像の位置合わせを実施してもよい。それにより、位置合わせ済みのカラーＯＣＴＡのＥｎ－Ｆａｃｅ画像からＢ成分のみを抽出すれば、対象となるＯＣＴＡのＥｎ－Ｆａｃｅ画像の位置合わせが済んだ広画角のＯＣＴＡのＥｎ－Ｆａｃｅ画像を得ることが出来る。なお、高画質化を行う対象として、２次元のＯＣＴＡのＥｎ－Ｆａｃｅ画像に限らず、３次元のＯＣＴ、３次元のモーションコントラストデータそのものでもよい。その場合、３次元データで位置合わせを行い、広範囲の３次元データを生成するようにしてもよい。広範囲の３次元データから任意の断面（ＸＹＺのどの面でも可能）や任意の深度範囲（Ｚ方向での範囲）を切り出すことで、高画質な広画角画像を生成することが出来る。 In step S3150, the wide-angle image generating unit 3005 synthesizes several high-quality images from the group of high-quality images generated in step S3140. Specifically, an OCTA En-Face image is used as an example for explanation. Although the multiple images do not completely overlap, adjacent images are OCTA En-Face images that are taken so that some areas of the images overlap with each other. Therefore, the wide-angle image generating unit 3005 detects overlapping areas from the multiple OCTA En-Face images and performs alignment using the overlapping areas. 
By transforming the OCTA En-Face images based on the alignment parameters and combining them, it is possible to generate an OCTA En-Face image covering a wider range than a single OCTA En-Face image. Because the multiple input OCTA En-Face images have already been improved in quality in step S3140, the wide-angle OCTA En-Face image output in step S3150 is already a high-quality image. FIG. 32(b) shows an example of the wide-angle OCTA En-Face image generated by the wide-angle image generating unit 3005. FIG. 32(b) is an example generated by aligning the five images shown in FIG. 32(a). FIG. 32(c) shows the positional correspondence between FIG. 32(a) and FIG. 32(b). As shown in FIG. 32(c), Im3220 to Im3250 are arranged around Im3210, which is at the center. Note that multiple OCTA En-Face images can be generated from three-dimensional motion contrast data by setting different depth ranges. Therefore, although FIG. 32 shows an example of a wide-angle surface layer image, the present invention is not limited to this. For example, alignment may be performed using the surface layer OCTA En-Face image (Im2910) shown in FIG. 29, and the OCTA En-Face images of other depth ranges may be deformed using the parameters obtained there. Alternatively, a color image may be used as the input image for alignment: a composite color image is generated in which the surface layer OCTA En-Face image is assigned to the R and G components of the RGB components and the OCTA En-Face image to be aligned is assigned to the B component. Then, the composite color OCTA En-Face images, each of which combines layers of multiple depth ranges into one image, may be aligned. By extracting only the B component from the aligned color OCTA En-Face image, a wide-angle OCTA En-Face image in which the target OCTA En-Face images have been aligned can be obtained. 
Note that the target of image quality improvement is not limited to two-dimensional OCTA En-Face images and may be three-dimensional OCT data or three-dimensional motion contrast data itself. In that case, alignment may be performed on the three-dimensional data to generate wide-range three-dimensional data. A high-quality wide-angle image can then be generated by extracting an arbitrary cross section (any plane in XYZ) or an arbitrary depth range (a range in the Z direction) from the wide-range three-dimensional data.
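The alignment described for step S3150 can be illustrated with a minimal sketch. It assumes a translation-only model and a brute-force search over integer shifts, neither of which is stated in the description; the function names `estimate_shift`, `mosaic`, and `crop` and the synthetic images are hypothetical. The sketch estimates the shift from the surface layer tiles and then reuses the same parameters to combine the tiles of another depth range.

```python
from typing import List, Tuple

Image = List[List[float]]


def estimate_shift(ref: Image, mov: Image, max_shift: int,
                   min_overlap: int) -> Tuple[int, int]:
    """Find (dy, dx) such that mov[y][x] best matches ref[y + dy][x + dx].

    Brute-force search: for every candidate shift, compute the mean squared
    difference over the overlapping pixels and keep the smallest.
    """
    best, best_cost = (0, 0), float("inf")
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            total, count = 0.0, 0
            for y, row in enumerate(mov):
                ry = y + dy
                if not 0 <= ry < len(ref):
                    continue
                for x, value in enumerate(row):
                    rx = x + dx
                    if 0 <= rx < len(ref[0]):
                        total += (ref[ry][rx] - value) ** 2
                        count += 1
            # Ignore shifts whose overlap is too small to be trustworthy.
            if count >= min_overlap and total / count < best_cost:
                best, best_cost = (dy, dx), total / count
    return best


def mosaic(ref: Image, mov: Image, shift: Tuple[int, int]) -> Image:
    """Paste both tiles onto one canvas, averaging where they overlap."""
    dy, dx = shift
    top, left = min(0, dy), min(0, dx)
    height = max(len(ref), dy + len(mov)) - top
    width = max(len(ref[0]), dx + len(mov[0])) - left
    total = [[0.0] * width for _ in range(height)]
    count = [[0] * width for _ in range(height)]
    for tile, oy, ox in ((ref, -top, -left), (mov, dy - top, dx - left)):
        for y, row in enumerate(tile):
            for x, value in enumerate(row):
                total[y + oy][x + ox] += value
                count[y + oy][x + ox] += 1
    return [[t / c if c else 0.0 for t, c in zip(trow, crow)]
            for trow, crow in zip(total, count)]


def crop(image: Image, x0: int, x1: int) -> Image:
    """Cut columns x0..x1-1 out of an image (simulates one acquired tile)."""
    return [row[x0:x1] for row in image]


# Two synthetic "depth ranges" of the same 6x10 scene (illustrative data).
surface = [[float(10 * y + x) for x in range(10)] for y in range(6)]
deep = [[float((3 * y + 7 * x) % 11) for x in range(10)] for y in range(6)]

# Register using the surface tiles only ...
shift = estimate_shift(crop(surface, 0, 6), crop(surface, 4, 10),
                       max_shift=5, min_overlap=6)
# ... then reuse the same parameters for the other depth range.
wide_surface = mosaic(crop(surface, 0, 6), crop(surface, 4, 10), shift)
wide_deep = mosaic(crop(deep, 0, 6), crop(deep, 4, 10), shift)
```

Pure-Python lists are used only to keep the sketch self-contained; a practical implementation would likely use an array library and a richer transformation model.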

ステップＳ３１６０においては、出力部４０５が、ステップＳ３１５０において複数の画像から合成された画像を表示部２０に表示させたり、他の装置に出力したりする。ただし、ステップＳ３１３０において、入力画像が処理不可能であると判定されている場合には、出力部４０５は、入力画像を出力画像として出力する。なお、出力部４０５は、検者によって入力画像が指示された場合や、入力画像が処理不可能であった場合には、表示部２０に出力画像が入力画像と同じであることを表示させてもよい。 In step S3160, the output unit 405 displays the image synthesized from the multiple images in step S3150 on the display unit 20 or outputs it to another device. However, if it is determined in step S3130 that the input image cannot be processed, the output unit 405 outputs the input image as the output image. Note that if an input image is specified by the examiner or if the input image cannot be processed, the output unit 405 may display on the display unit 20 that the output image is the same as the input image.

なお、本実施形態では、複数の入力画像からそれぞれ高画質画像を生成し、高画質画像を位置合わせすることで、最終的な一枚の高画質な広画角画像を生成したが、複数の入力画像から一枚の高画質画像を生成する方法はこれに限られない。例えば、図３１（ｂ）に示す本実施形態の高画質化処理の別例では、先に一枚の広画角画像を生成し、広画角画像に対して高画質化処理を実行して最終的に一枚の高画質な広画角画像を生成するようにしてもよい。 In this embodiment, a single high-quality wide-angle image is generated by generating high-quality images from multiple input images and aligning the high-quality images, but the method of generating a single high-quality image from multiple input images is not limited to this. For example, in another example of the image quality improvement process of this embodiment shown in FIG. 31(b), a single wide-angle image may be generated first, and image quality improvement process may be performed on the wide-angle image to finally generate a single high-quality wide-angle image.

この処理に関して、図３１（ｂ）を用いて説明を行うが、図３１（ａ）と同様な処理の部分に関しては説明を省略する。 This process will be described with reference to FIG. 31(b); description of the steps that are the same as in FIG. 31(a) is omitted.

ステップＳ３１２１では、広画角画像生成部３００５が、ステップＳ３１１０において取得した複数の画像を合成する。広画角画像生成に関しては、ステップＳ３１５０での説明と同様であるが、入力画像が撮影装置１０や他の装置から取得した画像であり、高画質化される前の画像である点が異なる。 In step S3121, the wide-angle image generating unit 3005 combines the multiple images acquired in step S3110. The wide-angle image generation is the same as described for step S3150, except that the input images are images acquired from the image capture device 10 or another device and have not yet undergone image quality improvement.

ステップＳ３１５１では、高画質化部４０４が、広画角画像生成部３００５が生成した広画角画像に対して処理を実行し一枚の高画質な広画角画像を生成する。 In step S3151, the image quality improvement unit 404 processes the wide-angle image generated by the wide-angle image generating unit 3005 to generate a single high-quality wide-angle image.

このような構成により、本実施形態に係る画像処理装置４００は、広画角な高画質画像を生成することができる。 With this configuration, the image processing device 400 according to this embodiment can generate high-quality images with a wide angle of view.

上記第１～２２の実施形態に関しては、出力部４０５による表示部２０への高画質画像の表示は基本的に高画質化部４０４による高画質画像の生成や解析部２２０８による解析結果の出力に応じて自動で行われる。しかしながら、高画質画像の表示は、検者からの指示に応じてなされてもよい。例えば、出力部４０５は、高画質化部４０４によって生成された高画質画像と入力画像のうち、検者からの指示に応じて選択された画像を表示部２０に表示させてもよい。また、出力部４０５は、検者からの指示に応じて、表示部２０上の表示を撮影画像（入力画像）から高画質画像に切り替えてもよい。すなわち、出力部４０５は、検者からの指示に応じて、低画質画像の表示を高画質画像の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、高画質画像の表示を低画質画像の表示に変更してもよい。さらに、高画質化部４０４が、高画質化エンジンによる高画質化処理の開始（高画質化エンジンへの画像の入力）を検者からの指示に応じて実行し、出力部４０５が、高画質化部４０４によって生成された高画質画像を表示部２０に表示させてもよい。これに対し、撮影装置１０によって入力画像が撮影されると、高画質化エンジンが自動的に入力画像に基づいて高画質画像を生成し、出力部４０５が、検者からの指示に応じて高画質画像を表示部２０に表示させてもよい。なお、これらの処理は解析結果の出力についても同様に行うことができる。すなわち、出力部４０５は、検者からの指示に応じて、低画質画像の解析結果の表示を高画質画像の解析結果の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、高画質画像の解析結果の表示を低画質画像の解析結果の表示に変更してもよい。もちろん、出力部４０５は、検者からの指示に応じて、低画質画像の解析結果の表示を低画質画像の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、低画質画像の表示を低画質画像の解析結果の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、高画質画像の解析結果の表示を高画質画像の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、高画質画像の表示を高画質画像の解析結果の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、低画質画像の解析結果の表示を低画質画像の他の種類の解析結果の表示に変更してもよい。また、出力部４０５は、検者からの指示に応じて、高画質画像の解析結果の表示を高画質画像の他の種類の解析結果の表示に変更してもよい。ここで、高画質画像の解析結果の表示は、高画質画像の解析結果を任意の透明度により高画質画像に重畳表示させたものであってもよい。また、低画質画像の解析結果の表示は、低画質画像の解析結果を任意の透明度により低画質画像に重畳表示させたものであってもよい。このとき、解析結果の表示への変更は、例えば、表示されている画像に対して任意の透明度により解析結果を重畳させた状態に変更したものであってもよい。また、解析結果の表示への変更は、例えば、解析結果と画像とを任意の透明度によりブレンド処理して得た画像（例えば、２次元マップ）の表示への変更であってもよい。さらに、撮影箇所推定エンジンや画質評価エンジン、真贋評価エンジン、評価部による処理を検者からの指示に応じて開始するように、画像処理装置が構成されてもよい。なお、上記第１～２２の実施形態に関し、出力部４０５が高画質画像を表示部２０に表示させる表示態様は任意であってよい。例えば、出力部４０５は、入力画像と高画質画像を並べて表示させてもよいし、切り替えて表示させてもよい。また、出力部４０５は、入力画像や高画質画像を、撮影部位や撮影日時、撮影が行われた施設等に応じて順番に表示させてもよい。同様に、出力部４０５は高画質画像を用いた画像解析結果等を、高画質画像や高画質画像に対応する入力画像の任意の撮影条件に応じて順番に表示させてもよい。さらに、出力部４０５は高画質画像を用いた画像解析結果を、解析項目ごとに順番に表示させてもよい。 Regarding the above-mentioned first to twenty-second embodiments, the display of the high-quality image on the display unit 20 by the output unit 405 is basically performed automatically in response to the generation of the high-quality image by the image quality improvement unit 
404 and the output of the analysis result by the analysis unit 2208. However, the high-quality image may instead be displayed in response to an instruction from the examiner. For example, the output unit 405 may display on the display unit 20 whichever of the high-quality image generated by the image quality improvement unit 404 and the input image is selected in response to an instruction from the examiner. In addition, the output unit 405 may switch the display on the display unit 20 from the captured image (input image) to the high-quality image in response to an instruction from the examiner. That is, the output unit 405 may change the display of the low-quality image to the display of the high-quality image in response to an instruction from the examiner. In addition, the output unit 405 may change the display of the high-quality image to the display of the low-quality image in response to an instruction from the examiner. Furthermore, the image quality improvement unit 404 may start the image quality improvement process by the image quality improvement engine (that is, input an image to the image quality improvement engine) in response to an instruction from the examiner, and the output unit 405 may display the high-quality image generated by the image quality improvement unit 404 on the display unit 20. Alternatively, when an input image is captured by the image capture device 10, the image quality improvement engine may automatically generate a high-quality image based on the input image, and the output unit 405 may display the high-quality image on the display unit 20 in response to an instruction from the examiner. Note that these processes can be performed in the same way for the output of analysis results. That is, the output unit 405 may change the display of the analysis result of the low-quality image to the display of the analysis result of the high-quality image in response to an instruction from the examiner. 
Also, the output unit 405 may change the display of the analysis result of the high-quality image to the display of the analysis result of the low-quality image in response to an instruction from the examiner. Of course, the output unit 405 may change the display of the analysis result of the low-quality image to the display of the low-quality image in response to an instruction from the examiner. The output unit 405 may change the display of the low-quality image to the display of the analysis result of the low-quality image in response to an instruction from the examiner. The output unit 405 may change the display of the analysis result of the high-quality image to the display of the high-quality image in response to an instruction from the examiner. The output unit 405 may change the display of the high-quality image to the display of the analysis result of the high-quality image in response to an instruction from the examiner. The output unit 405 may change the display of the analysis result of the low-quality image to the display of another type of analysis result of the low-quality image in response to an instruction from the examiner. The output unit 405 may change the display of the analysis result of the high-quality image to the display of another type of analysis result of the high-quality image in response to an instruction from the examiner. Here, the display of the analysis result of the high-quality image may be a display in which the analysis result of the high-quality image is superimposed on the high-quality image with an arbitrary transparency. Likewise, the display of the analysis result of the low-quality image may be a display in which the analysis result of the low-quality image is superimposed on the low-quality image with an arbitrary transparency. 
In this case, the change to the display of the analysis result may be, for example, a change to a state in which the analysis result is superimposed on the currently displayed image with an arbitrary transparency. Alternatively, the change to the display of the analysis result may be, for example, a change to the display of an image (for example, a two-dimensional map) obtained by blending the analysis result and the image with an arbitrary transparency. Furthermore, the image processing device may be configured to start the processing by the imaging location estimation engine, the image quality evaluation engine, the authenticity evaluation engine, and the evaluation unit in response to an instruction from the examiner. Note that, with respect to the above-mentioned first to twenty-second embodiments, the display mode in which the output unit 405 displays the high-quality image on the display unit 20 may be arbitrary. For example, the output unit 405 may display the input image and the high-quality image side by side, or may switch between them. Furthermore, the output unit 405 may display the input images and the high-quality images in order according to the imaged site, the imaging date and time, the facility where the imaging was performed, and the like. Similarly, the output unit 405 may display image analysis results obtained using the high-quality images in order according to any imaging condition of the high-quality images or of the input images corresponding to them. Furthermore, the output unit 405 may display the image analysis results obtained using the high-quality images in order for each analysis item.
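The superimposition of an analysis result on an image with an arbitrary transparency can be sketched as a per-pixel weighted sum. This is only an illustration under the assumption that the image and the analysis map have the same size and that transparency is a single value between 0 and 1; the function name `overlay` is hypothetical and the description does not prescribe a particular blending formula.

```python
from typing import List

Image = List[List[float]]


def overlay(image: Image, analysis_map: Image, transparency: float) -> Image:
    """Superimpose an analysis map on an image.

    transparency = 1.0 shows only the image, 0.0 shows only the analysis map.
    """
    if not 0.0 <= transparency <= 1.0:
        raise ValueError("transparency must be between 0 and 1")
    opacity = 1.0 - transparency  # weight given to the analysis map
    return [[transparency * p + opacity * a for p, a in zip(prow, arow)]
            for prow, arow in zip(image, analysis_map)]
```

For example, `overlay(image, analysis_map, 0.5)` gives equal weight to the displayed image and the analysis result.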

＜第２３の実施形態＞
次に、図４、図２９と図３３を参照して、第２３の実施形態に係る画像処理装置について説明する。本実施形態では、入力データに対応する高画質画像である出力データのペア群で構成される教師データを用いて学習を行う。その際、複数の高画質化エンジンによって生成する複数の高画質な出力データを用いて、１つの高画質化エンジンを生成する。 <Twenty-third embodiment>
Next, an image processing device according to a twenty-third embodiment will be described with reference to FIGS. 4, 29, and 33. In this embodiment, training is performed using teacher data consisting of pairs of input data and output data, the output data being a high-quality image corresponding to the input data. In doing so, a single image quality improvement engine is generated using multiple high-quality output data generated by multiple image quality improvement engines.

本実施形態に係る取得部４０１は、撮影装置１０や他の装置から、処理対象である入力データとして画像を取得する。本実施形態に係る高画質化部４０４における高画質化エンジンの生成に関して、図２９と図３３を用いて説明をする。まず、図３３（ａ）を用いて本実施形態における第１の学習について説明をする。図３３（ａ）は、複数の入力データと出力データのペア群と複数の高画質化エンジンの一例を示している。Ｉｍ３３１１とＩｍ３３１２は、入力データと出力データのペア群を示している。例えば、このペアは図２９で示した表層（Ｉｍ２９１０）のペア群とする。そして、３３１３はＩｍ３３１１とＩｍ３３１２のペア群を用いて学習を行った高画質化エンジンを示している。なお、図３３（ａ）での学習には、第１の実施形態で説明したような重ね合わせ処理により生成する高画質画像を用いる方法でもよいし、第１８の実施形態で説明したようなノイズ成分を学習する方法でもよい。あるいはそれらの組み合わせでもよい。Ｉｍ３３２１とＩｍ３３２２は、入力データと出力データのペア群で、例えば、図２９で示した深層（Ｉｍ２９２０）のペア群とする。そして、３３２３はＩｍ３３２１とＩｍ３３２２のペア群で学習を行った高画質化エンジンを示している。同様に、Ｉｍ３３３１とＩｍ３３３２は、入力データと出力データのペア群で、例えば、図２９で示した外層（Ｉｍ２９３０）のペア群とする。そして、３３３３はＩｍ３３３１とＩｍ３３３２のペア群で学習を行った高画質化エンジンを示している。すなわち、図３３（ａ）ではそれぞれの画像毎に学習を行う。そのため、例えば、第１８の実施形態で説明したノイズ成分の場合は、それぞれの画像に適したノイズパラメータで学習を行うことが出来る。このとき、高画質化エンジンは、医用画像の少なくとも一部の領域の状態に応じたノイズが該少なくとも一部の領域に付加された学習データを用いて得た機械学習エンジンを含むことができる。ここで、上記状態に応じたノイズとは、例えば、少なくとも一部の領域の画素値に応じた大きさのノイズであっても良い。また、上記状態に応じたノイズとは、例えば、少なくとも一部の領域における特徴が少ない（例えば、画素値が小さい、コントラストが低い等）場合には、小さいノイズであっても良い。また、上記状態に応じたノイズとは、例えば、少なくとも一部の領域における特徴が多い（例えば、画素値が大きい、コントラストが高い等）場合には、大きなノイズであっても良い。また、高画質化エンジンは、複数の深度範囲のうち少なくとも２つの深度範囲それぞれに対して異なる大きさのノイズが付加された複数の正面画像を含む学習データを用いて得た機械学習エンジンを含むことができる。このとき、例えば、特徴が少ない（例えば、画素値が小さい）正面画像に対応する深度範囲においては、小さいノイズが付加された正面画像を学習データとしても良い。また、例えば、特徴が多い（例えば、画素値が大きい）正面画像に対応する深度範囲においては、大きいノイズが付加された正面画像を学習データとしても良い。なお、特徴が中程度である正面画像に対応する深度範囲においては、中程度の大きさのノイズが付加された正面画像を学習データとしても良い。ここで、複数の深度範囲は、深さ方向において隣り合う２つの深度範囲の一部が互いに重複していても良い。 The acquisition unit 401 according to this embodiment acquires an image as input data to be processed from the image capture device 10 or another device. The generation of the image quality improvement engine in the image quality improvement unit 404 according to this embodiment will be described with reference to FIG. 29 and FIG. 33. First, the first learning in this embodiment will be described with reference to FIG. 33(a). FIG. 33(a) shows an example of a group of pairs of multiple input data and output data and multiple image quality improvement engines. 
Im3311 and Im3312 show a group of pairs of input data and output data. For example, these are the pairs for the surface layer (Im2910) shown in FIG. 29. Then, 3313 denotes an image quality improvement engine that has been trained using the pairs of Im3311 and Im3312. Note that the training in FIG. 33(a) may use high-quality images generated by superimposition processing as described in the first embodiment, may learn noise components as described in the 18th embodiment, or may combine the two. Im3321 and Im3322 are a group of pairs of input data and output data, for example, the pairs for the deep layer (Im2920) shown in FIG. 29. Then, 3323 denotes an image quality improvement engine that has been trained using the pairs of Im3321 and Im3322. Similarly, Im3331 and Im3332 are a group of pairs of input data and output data, for example, the pairs for the outer layer (Im2930) shown in FIG. 29. Then, 3333 denotes an image quality improvement engine that has been trained using the pairs of Im3331 and Im3332. That is, in FIG. 33(a), training is performed separately for each image. Therefore, for example, in the case of the noise component described in the 18th embodiment, training can be performed with noise parameters suited to each image. At this time, the image quality improvement engine can include a machine learning engine obtained using training data in which noise according to the state of at least a partial region of a medical image is added to that region. Here, the noise according to the state may be, for example, noise whose magnitude depends on the pixel values of the at least partial region. The noise according to the state may also be, for example, small noise when the at least partial region has few features (for example, small pixel values or low contrast). 
The noise according to the state may also be, for example, large noise when the at least partial region has many features (for example, large pixel values or high contrast). The image quality improvement engine may also include a machine learning engine obtained using training data including a plurality of front images in which noise of a different magnitude has been added for each of at least two of a plurality of depth ranges. In this case, for example, for a depth range corresponding to a front image with few features (for example, small pixel values), a front image to which small noise has been added may be used as training data. Likewise, for a depth range corresponding to a front image with many features (for example, large pixel values), a front image to which large noise has been added may be used as training data. For a depth range corresponding to a front image with a moderate amount of features, a front image to which noise of moderate magnitude has been added may be used as training data. Here, among the plurality of depth ranges, two depth ranges adjacent in the depth direction may partially overlap each other.
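As one possible reading of "noise according to the state", the following sketch adds zero-mean Gaussian noise whose standard deviation is proportional to the pixel value, and derives a single noise scale per front image from its mean pixel value. The Gaussian model, the proportionality rule, the 255 full-scale value, and the names `add_state_dependent_noise` and `noise_scale_for` are all assumptions made for illustration; the description does not fix them.

```python
import random
from typing import List

Image = List[List[float]]


def add_state_dependent_noise(image: Image, scale: float,
                              rng: random.Random) -> Image:
    """Add zero-mean Gaussian noise whose standard deviation is
    scale * pixel value, so brighter pixels receive larger noise."""
    return [[p + rng.gauss(0.0, scale * p) for p in row] for row in image]


def noise_scale_for(front_image: Image, base_scale: float,
                    full_range: float = 255.0) -> float:
    """Pick one noise scale per depth range from the mean pixel value:
    a dark front image (few features) gets a smaller scale than a bright one."""
    pixels = [p for row in front_image for p in row]
    return base_scale * (sum(pixels) / len(pixels)) / full_range
```

With this rule, a front image of a depth range with small pixel values is perturbed only slightly, while one with large pixel values receives correspondingly larger noise.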

次に、図３３（ｂ）を用いて本実施形態における画像の推論について説明をする。図３３（ｂ）は、図３３（ａ）で学習をした高画質化エンジン３３１３～３３３３を用いて画像を生成する。例えば、複数の表層画像を用いて学習を行った高画質化エンジン３３１３に対して、低画質な表層画像Ｉｍ３３１０を入力すると高画質な表層画像Ｉｍ３３１５を出力する。また、複数の深層画像を用いて学習を行った高画質化エンジン３３２３に対して、低画質な深層画像Ｉｍ３３２０を入力すると高画質な深層画像Ｉｍ３３２５を出力する。複数の外層画像を用いて学習を行った高画質化エンジン３３３３も同様に、低画質な外層画像Ｉｍ３３３０を入力すると高画質な外層画像Ｉｍ３３３５を出力する。 Next, image inference in this embodiment will be described with reference to FIG. 33(b). In FIG. 33(b), an image is generated using image quality improvement engines 3313-3333 that have been trained in FIG. 33(a). For example, when low-quality surface image Im3310 is input to image quality improvement engine 3313 that has been trained using multiple surface images, it outputs high-quality surface image Im3315. Also, when low-quality deep image Im3320 is input to image quality improvement engine 3323 that has been trained using multiple deep images, it outputs high-quality deep image Im3325. Similarly, when low-quality outer layer image Im3330 is input to image quality improvement engine 3333 that has been trained using multiple outer layer images, it outputs high-quality outer layer image Im3335.

次に、図３３（ｃ）を用いて本実施形態における第２の学習について説明をする。図３３（ｃ）は、異なる種類の複数の画像ペア群を用いて、１つの高画質化エンジン３３００を学習する様子を示している。Ｉｍ３３１０は低画質な表層画像、Ｉｍ３３１５は高画質な表層画像のペア群、Ｉｍ３３２０は低画質な深層画像、Ｉｍ３３２５は高画質な深層画像のペア群、Ｉｍ３３３０は低画質な外層画像、Ｉｍ３３３５は高画質な外層画像のペア群を示す。すなわち、第１の学習で学習した高画質化エンジンを用いて生成した高画質画像である出力データと低画質な入力データとのペア群で構成された教師データを用いて高画質化エンジン３３００を生成する。以上により、高画質化エンジン３３００は、様々な種類の入力画像から、ノイズが低減されたり、高コントラストとなったりした、画像診断に適した高画質画像を生成することができる。 Next, the second learning in this embodiment will be described with reference to FIG. 33(c). FIG. 33(c) shows how one image quality improvement engine 3300 is trained using a plurality of image pair groups of different types. Im3310 shows a low-quality surface image, Im3315 shows a pair group of high-quality surface images, Im3320 shows a low-quality deep image, Im3325 shows a pair group of high-quality deep images, Im3330 shows a low-quality outer layer image, and Im3335 shows a pair group of high-quality outer layer images. That is, the image quality improvement engine 3300 is generated using teacher data consisting of a pair group of output data, which is a high-quality image generated using the image quality improvement engine learned in the first learning, and low-quality input data. As a result, the image quality improvement engine 3300 can generate high-quality images suitable for image diagnosis, in which noise has been reduced and contrast has been increased, from various types of input images.
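The construction of teacher data for the second learning can be sketched as follows. The sketch assumes each first-stage engine is available as a callable that maps an image to an image; `build_second_stage_pairs`, `scale_by`, and the toy data are hypothetical stand-ins, not the engines 3313, 3323, and 3333 themselves.

```python
from typing import Callable, Dict, List, Tuple

Image = List[List[float]]
Engine = Callable[[Image], Image]


def build_second_stage_pairs(
    low_quality: Dict[str, List[Image]],
    first_stage_engines: Dict[str, Engine],
) -> List[Tuple[Image, Image]]:
    """Return (low-quality input, first-stage output) pairs for all types."""
    pairs: List[Tuple[Image, Image]] = []
    for image_type, images in low_quality.items():
        engine = first_stage_engines[image_type]  # engine trained for this type
        for image in images:
            pairs.append((image, engine(image)))
    return pairs


# Hypothetical stand-ins for the per-type engines trained in the first learning.
def scale_by(factor: float) -> Engine:
    return lambda image: [[factor * p for p in row] for row in image]


engines = {"surface": scale_by(2.0), "deep": scale_by(3.0), "outer": scale_by(4.0)}
inputs = {
    "surface": [[[1.0, 2.0]]],
    "deep": [[[1.0, 1.0]], [[2.0, 2.0]]],
    "outer": [[[5.0]]],
}
pairs = build_second_stage_pairs(inputs, engines)
```

The resulting list mixes pairs of all image types, so a single engine trained on it sees surface, deep, and outer layer examples together.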

出力部４０５は高画質化部４０４が生成した高画質画像を表示部２０に表示させる。なお、出力部４０５は高画質画像とともに、入力画像を表示部２０に表示させてもよい。 The output unit 405 displays the high-quality image generated by the image quality improvement unit 404 on the display unit 20. The output unit 405 may also display the input image on the display unit 20 together with the high-quality image.

なお、本実施形態では、ＯＣＴＡのＥｎ－Ｆａｃｅ画像は異なる深さの３層を用いて説明をしたが、画像の種類はこれに限らず、基準となる層とオフセットの値を変えて異なる深度範囲を設定したＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成して種類を増やしてもよい。画像の種類は深さ方向の違いに限らず、部位毎の違いでもよい。例えば、前眼部と後眼部など、異なる撮影場所であってもよい。さらに画像は、ＯＣＴＡのＥｎ－Ｆａｃｅ画像に限らず、ＯＣＴデータから生成する輝度のＥｎ－Ｆａｃｅ画像であってもよい。そして、第１の学習では別々に学習を行っておき、第２の学習において、これらＯＣＴＡのＥｎ－Ｆａｃｅ画像と輝度のＥｎ－Ｆａｃｅ画像とをまとめて学習してもよい。さらには、Ｅｎ－Ｆａｃｅ画像だけではなく、断層画像やＳＬＯ画像、眼底写真、蛍光眼底写真など、異なる撮影装置であっても構わない。 In this embodiment, the En-Face image of OCTA is described using three layers of different depths, but the type of image is not limited to this, and the type may be increased by generating En-Face images of OCTA with different depth ranges set by changing the reference layer and offset value. The type of image is not limited to differences in the depth direction, but may be different for each part. For example, it may be different shooting locations such as the anterior eye and the posterior eye. Furthermore, the image is not limited to the En-Face image of OCTA, but may be an En-Face image of brightness generated from OCT data. Then, in the first learning, learning may be performed separately, and in the second learning, these En-Face images of OCTA and the En-Face image of brightness may be learned together. Furthermore, in addition to En-Face images, different shooting devices such as tomographic images, SLO images, fundus photos, and fluorescent fundus photos may be used.

なお、第２の学習によって高画質化エンジンは１つとなる例を説明したが、必ずしも１つである必要はない。第１の学習で生成する高画質化エンジンの出力データと低画質な入力データとのペア群で学習をする高画質化エンジンの構成であればよい。さらに、第２の学習において、図３３（ｃ）では、異なる種類の複数の画像ペア群を用いて同時に学習をする例を示したが、これに限らず転移学習でもよい。例えば、Ｉｍ３３１０とＩｍ３３１５の表層画像のペア群で学習した後に、そのネットワークを使ってＩｍ３３２０とＩｍ３３２５の深層画像のペア群を学習するというようにして、最終的に高画質化エンジン３３００を生成するようにしても良い。 Although an example has been described in which the second learning yields a single image quality improvement engine, there need not be only one. Any configuration is acceptable in which an image quality improvement engine is trained using pairs of low-quality input data and output data of the image quality improvement engines generated in the first learning. Furthermore, for the second learning, FIG. 33(c) shows an example in which training is performed simultaneously using multiple groups of image pairs of different types, but the training is not limited to this and may use transfer learning. For example, after training on the pairs of surface images Im3310 and Im3315, that network may be used to train on the pairs of deep images Im3320 and Im3325, and so on, to finally generate the image quality improvement engine 3300.

このような構成により、本実施形態に係る高画質化部４０４は様々な種類の画像に対して、より効果的な高画質画像を生成することができる。 With this configuration, the image quality improvement unit 404 according to this embodiment can generate high-quality images more effectively for various types of images.

＜第２４の実施形態＞
次に、図３４を参照して、第２４の実施形態に係る画像処理装置について説明する。本実施形態では、高画質化部４０４での処理結果を出力部４０５が表示部２０に表示を行う例について説明を行う。なお、本実施形態では、図３４を用いて説明を行うが表示画面はこれに限らない。経過観察のように、異なる日時で得た複数の画像を並べて表示する表示画面においても同様に高画質化処理は適用可能である。また、撮影確認画面のように、検者が撮影直後に撮影成否を確認する表示画面においても同様に高画質化処理は適用可能である。 <Twenty-fourth embodiment>
Next, an image processing device according to the 24th embodiment will be described with reference to FIG. 34. In this embodiment, an example will be described in which the output unit 405 displays the processing result of the image quality improvement unit 404 on the display unit 20. Note that, although this embodiment is described using FIG. 34, the display screen is not limited to this. The image quality improvement process can likewise be applied to a display screen that shows multiple images obtained at different dates and times side by side, as in follow-up observation. It can also be applied to a display screen, such as an imaging confirmation screen, on which the examiner checks whether imaging was successful immediately after imaging.

出力部４０５は、高画質化部４０４が生成した複数の高画質画像や高画質化を行っていない低画質画像を表示部２０に表示させることができる。これにより、検者の指示に応じて低画質画像、高画質画像をそれぞれ出力することができる。 The output unit 405 can display on the display unit 20 multiple high-quality images generated by the image quality improvement unit 404 and low-quality images that have not been improved in quality. This makes it possible to output low-quality images and high-quality images according to the examiner's instructions.

以下、図３４を参照して、当該インターフェース３４００の一例を示す。３４００は画面全体、３４０１は患者タブ、３４０２は撮影タブ、３４０３はレポートタブ、３４０４は設定タブを表し、３４０３のレポートタブにおける斜線は、レポート画面のアクティブ状態を表している。本実施形態においては、レポート画面を表示する例について説明をする。Ｉｍ３４０５はＳＬＯ画像、Ｉｍ３４０６は、Ｉｍ３４０７に示すＯＣＴＡのＥｎ－Ｆａｃｅ画像をＳＬＯ画像Ｉｍ３４０５に重畳表示している。ここでＳＬＯ画像とは、不図示のＳＬＯ（ＳｃａｎｎｉｎｇＬａｓｅｒＯｐｈｔｈａｌｍｏｓｃｏｐｅ：走査型検眼鏡）光学系によって取得した眼底の正面画像である。Ｉｍ３４０７とＩｍ３４０８はＯＣＴＡのＥｎ－Ｆａｃｅ画像、Ｉｍ３４０９は輝度のＥｎ－Ｆａｃｅ画像、Ｉｍ３４１１とＩｍ３４１２は断層画像を示している。３４１３と３４１４は、それぞれＩｍ３４０７とＩｍ３４０８に示したＯＣＴＡのＥｎ－Ｆａｃｅ画像の上下範囲の境界線を断層画像に重畳表示している。ボタン３４２０は、高画質化処理の実行を指定するためのボタンである。もちろん、後述するように、ボタン３４２０は、高画質画像の表示を指示するためのボタンであってもよい。 An example of the interface 3400 is shown below with reference to FIG. 34. 3400 indicates the entire screen, 3401 indicates a patient tab, 3402 indicates a photography tab, 3403 indicates a report tab, and 3404 indicates a setting tab, and the diagonal line in the report tab 3403 indicates the active state of the report screen. In this embodiment, an example of displaying a report screen is described. Im3405 is an SLO image, and Im3406 displays an OCTA En-Face image shown in Im3407 superimposed on the SLO image Im3405. Here, the SLO image is a front image of the fundus acquired by an SLO (Scanning Laser Ophthalmoscope) optical system (not shown). Im3407 and Im3408 show OCTA En-Face images, Im3409 shows a luminance En-Face image, and Im3411 and Im3412 show tomographic images. 3413 and 3414 show the boundary lines of the upper and lower ranges of the OCTA En-Face images shown in Im3407 and Im3408, respectively, superimposed on the tomographic image. Button 3420 is a button for specifying the execution of image quality improvement processing. Of course, as described below, button 3420 may also be a button for instructing the display of a high-quality image.

本実施形態において、高画質化処理の実行はボタン３４２０を指定して行うか、データベースに保存（記憶）されている情報に基づいて実行の有無を判断する。初めに、検者からの指示に応じてボタン３４２０を指定することで高画質画像の表示と低画質画像の表示を切り替える例について説明をする。なお、高画質化処理の対象画像はＯＣＴＡのＥｎ－Ｆａｃｅ画像として説明する。検者がレポートタブ３４０３を指定してレポート画面に遷移した際には、低画質なＯＣＴＡのＥｎ－Ｆａｃｅ画像Ｉｍ３４０７とＩｍ３４０８を表示する。その後、検者がボタン３４２０を指定することで、高画質化部４０４は画面に表示している画像Ｉｍ３４０７とＩｍ３４０８に対して高画質化処理を実行する。高画質化処理が完了後、出力部４０５は高画質化部４０４が生成した高画質画像をレポート画面に表示する。なお、Ｉｍ３４０６は、Ｉｍ３４０７をＳＬＯ画像Ｉｍ３４０５に重畳表示しているものであるため、Ｉｍ３４０６も高画質化処理した画像を表示する。そして、ボタン３４２０の表示をアクティブ状態に変更し、高画質化処理を実行したことが分かるような表示をする。ここで、高画質化部４０４における処理の実行は、検者がボタン３４２０を指定したタイミングに限る必要はない。レポート画面を開く際に表示するＯＣＴＡのＥｎ－Ｆａｃｅ画像Ｉｍ３４０７とＩｍ３４０８の種類は事前に分かっているため、レポート画面に遷移する際に高画質化処理を実行してもよい。そして、ボタン３４２０が押下されたタイミングで、出力部４０５が高画質画像をレポート画面に表示するようにしてもよい。さらに、検者からの指示に応じて、又はレポート画面に遷移する際に高画質化処理を行う画像の種類は２種類である必要はない。表示する可能性の高い画像、例えば、図２９で示したような表層（Ｉｍ２９１０）、深層（Ｉｍ２９２０）、外層（Ｉｍ２９３０）、脈絡膜血管網（Ｉｍ２９４０）などの複数のＯＣＴＡのＥｎ－Ｆａｃｅ画像に対して処理を行うようにしてもよい。この場合、高画質化処理をして得た画像を一時的にメモリに記憶、あるいはデータベースに記憶しておくようにしてもよい。 In this embodiment, the image quality improvement process is performed by specifying the button 3420, or the execution is determined based on information saved (stored) in the database. First, an example in which the display of a high-quality image and a low-quality image are switched by specifying the button 3420 in response to an instruction from the examiner will be described. Note that the image to be subjected to the image quality improvement process will be described as an OCTA En-Face image. When the examiner specifies the report tab 3403 to transition to the report screen, the low-quality OCTA En-Face images Im3407 and Im3408 are displayed. After that, when the examiner specifies the button 3420, the image quality improvement unit 404 performs image quality improvement process on the images Im3407 and Im3408 displayed on the screen. After the image quality improvement process is completed, the output unit 405 displays the high-quality image generated by the image quality improvement unit 404 on the report screen. 
In addition, since Im3406 is Im3407 superimposed on the SLO image Im3405, Im3406 also displays the image that has undergone image quality improvement processing. Then, the display of the button 3420 is changed to an active state to indicate that the image quality improvement processing has been performed. Here, the execution of processing in the image quality improvement unit 404 need not be limited to the moment when the examiner selects the button 3420. Since the types of the OCTA En-Face images Im3407 and Im3408 displayed when the report screen is opened are known in advance, the image quality improvement processing may be performed at the transition to the report screen. Then, the output unit 405 may display the high-quality image on the report screen when the button 3420 is pressed. Furthermore, the number of image types subjected to image quality improvement processing, whether in response to an instruction from the examiner or at the transition to the report screen, need not be two. Processing may be performed on images that are likely to be displayed, for example multiple OCTA En-Face images such as the surface layer (Im2910), deep layer (Im2920), outer layer (Im2930), and choroidal vascular network (Im2940) shown in FIG. 29. In this case, the images obtained by the image quality improvement processing may be temporarily stored in memory or stored in a database.
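Processing likely-to-be-displayed images ahead of time and keeping the results in memory could be organized as in the following sketch. It assumes an in-memory dictionary keyed by image type and an enhancement function passed in as a callable; the class `EnhancedImageCache` and its method names are hypothetical, and a real system would also need to key the cache by examination.

```python
from typing import Callable, Dict, List

Image = List[List[float]]


class EnhancedImageCache:
    """Runs the (slow) enhancement at most once per image type."""

    def __init__(self, enhance: Callable[[Image], Image]) -> None:
        self._enhance = enhance
        self._store: Dict[str, Image] = {}

    def prefetch(self, images: Dict[str, Image]) -> None:
        """Call on transition to the report screen."""
        for image_type, image in images.items():
            self.get(image_type, image)

    def get(self, image_type: str, image: Image) -> Image:
        """Call when the examiner asks for the high-quality image."""
        if image_type not in self._store:
            self._store[image_type] = self._enhance(image)
        return self._store[image_type]
```

With this arrangement, pressing the button only looks up an already generated image when the type was prefetched, and falls back to running the enhancement otherwise.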

次に、データベースに保存（記憶）されている情報に基づいて高画質化処理を実行する場合について説明をする。データベースに高画質化処理の実行を行う状態が保存されている場合、レポート画面に遷移した際に、高画質化処理を実行して得た高画質画像をデフォルトで表示する。そして、ボタン３４２０がアクティブ状態としてデフォルトで表示されることで、検者に対しては高画質化処理を実行して得た高画質画像が表示されていることが分かるように構成することができる。検者は、高画質化処理前の低画質画像を表示したい場合には、ボタン３４２０を指定してアクティブ状態を解除することで、低画質画像を表示することが出来る。高画質画像に戻したい場合、検者はボタン３４２０を指定する。データベースへの高画質化処理の実行有無は、データベースに保存されているデータ全体に対して共通、及び撮影データ毎（検査毎）など、階層別に指定するものとする。例えば、データベース全体に対して高画質化処理を実行する状態を保存してある場合において、個別の撮影データ（個別の検査）に対して、検者が高画質化処理を実行しない状態を保存した場合、その撮影データを次回表示する際には高画質化処理を実行しない状態で表示を行う。撮影データ毎（検査毎）に高画質化処理の実行状態を保存するために、不図示のユーザーインターフェース（例えば、保存ボタン）を用いてもよい。また、他の撮影データ（他の検査）や他の患者データに遷移（例えば、検者からの指示に応じてレポート画面以外の表示画面に変更）する際に、表示状態（例えば、ボタン３４２０の状態）に基づいて、高画質化処理の実行を行う状態が保存されるようにしてもよい。これにより、撮影データ単位（検査単位）で高画質化処理実行の有無が指定されていない場合、データベース全体に対して指定されている情報に基づいて処理を行い、撮影データ単位（検査単位）で指定されている場合には、その情報に基づいて個別に処理を実行することが出来る。 Next, a case where the image quality improvement process is performed based on information saved (stored) in the database will be described. If the state of performing the image quality improvement process is saved in the database, when the screen transitions to the report screen, the high-quality image obtained by performing the image quality improvement process is displayed by default. The button 3420 is displayed in an active state by default, so that the examiner can understand that the high-quality image obtained by performing the image quality improvement process is displayed. If the examiner wants to display the low-quality image before the image quality improvement process, the examiner can display the low-quality image by selecting the button 3420 to cancel the active state. If the examiner wants to return to the high-quality image, the examiner selects the button 3420. The execution or non-execution of the image quality improvement process on the database is specified by hierarchy, such as common to all data saved in the database, and for each shooting data (each examination). 
For example, if a state in which the image quality improvement process is performed is saved for the entire database, and the examiner saves a state in which the process is not performed for individual imaging data (an individual examination), that imaging data is displayed without the image quality improvement process the next time it is displayed. A user interface not shown (e.g., a save button) may be used to save the execution state of the image quality improvement process for each piece of imaging data (each examination). Furthermore, when transitioning to other imaging data (another examination) or other patient data (e.g., changing to a display screen other than the report screen in response to an instruction from the examiner), the execution state of the image quality improvement process may be saved based on the display state (e.g., the state of the button 3420). In this way, if execution or non-execution of the image quality improvement process is not specified per imaging data (per examination), processing is performed based on the information specified for the entire database, and if it is specified per imaging data (per examination), processing is performed individually based on that information.
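The hierarchical lookup described above can be illustrated with a minimal Python sketch. The class name `EnhancementSettings`, its methods, and the use of string examination IDs are assumptions made for illustration only; the embodiment does not specify how the database stores these states.

```python
from typing import Dict, Optional


class EnhancementSettings:
    """Hypothetical store for the execution state of image quality improvement."""

    def __init__(self, database_default: bool) -> None:
        # State saved for the entire database.
        self.database_default = database_default
        # Examination ID -> state saved explicitly for that examination.
        self.per_exam: Dict[str, bool] = {}

    def save_for_exam(self, exam_id: str, enabled: bool) -> None:
        """Save the state for one piece of imaging data (one examination)."""
        self.per_exam[exam_id] = enabled

    def resolve(self, exam_id: str) -> bool:
        """Return the examination-level state if saved, else the database-wide state."""
        override: Optional[bool] = self.per_exam.get(exam_id)
        return self.database_default if override is None else override


settings = EnhancementSettings(database_default=True)
settings.save_for_exam("exam-001", False)   # examiner turned the process off here
print(settings.resolve("exam-001"))         # examination-level state is used
print(settings.resolve("exam-002"))         # falls back to the database-wide state
```

Keeping the per-examination entries separate from the database-wide value means that an examination with no saved state always follows the database-wide setting, which matches the fallback behavior described in the text.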

本実施形態におけるＯＣＴＡのＥｎ－Ｆａｃｅ画像として、Ｉｍ３４０７とＩｍ３４０８を表示する例を示しているが、表示するＯＣＴＡのＥｎ－Ｆａｃｅ画像は検者の指定により変更することが可能である。そのため、高画質化処理の実行が指定されている時（ボタン３４２０がアクティブ状態）における画像の変更について説明をする。 In this embodiment, an example is shown in which Im3407 and Im3408 are displayed as OCTA En-Face images, but the OCTA En-Face image to be displayed can be changed according to the examiner's specifications. Therefore, the following describes how to change the image when the execution of image quality improvement processing is specified (button 3420 is active).

画像の変更は、不図示のユーザーインターフェース（例えば、コンボボックス）を用いて変更を行う。例えば、検者が画像の種類を表層から脈絡膜血管網に変更した時に、高画質化部４０４は脈絡膜血管網画像に対して高画質化処理を実行し、出力部４０５は高画質化部４０４が生成した高画質な画像をレポート画面に表示する。すなわち、出力部４０５は、検者からの指示に応じて、第１の深度範囲の高画質画像の表示を、第１の深度範囲とは少なくとも一部が異なる第２の深度範囲の高画質画像の表示に変更してもよい。このとき、出力部４０５は、検者からの指示に応じて第１の深度範囲が第２の深度範囲に変更されることにより、第１の深度範囲の高画質画像の表示を、第２の深度範囲の高画質画像の表示に変更してもよい。なお、上述したようにレポート画面遷移時に表示する可能性の高い画像に対しては、既に高画質画像が生成済みである場合、出力部４０５は生成済みの高画質な画像を表示すればよい。なお、画像の種類の変更方法は上記したものに限らず、基準となる層とオフセットの値を変えて異なる深度範囲を設定したＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成することも可能である。その場合、基準となる層、あるいはオフセット値が変更された時に、高画質化部４０４は任意のＯＣＴＡのＥｎ－Ｆａｃｅ画像に対して高画質化処理を実行し、出力部４０５は高画質な画像をレポート画面に表示する。基準となる層、オフセット値の変更は、不図示のユーザーインターフェース（例えば、コンボボックスやテキストボックス）を用いて行われることができる。また、断層画像Ｉｍ３４１１とＩｍ３４１２に重畳表示している境界線３４１３と３４１４のいずれかをドラッグ（層境界を移動）することで、ＯＣＴＡのＥｎ－Ｆａｃｅ画像の生成範囲を変更することが出来る。境界線をドラッグによって変更する場合、高画質化処理の実行命令が連続的に実施される。そのため、高画質化部４０４は実行命令に対して常に処理を行ってもよいし、ドラッグによる層境界の変更後に実行するようにしてもよい。あるいは、高画質化処理の実行は連続的に命令されるが、次の命令が来た時点で前回の命令をキャンセルし、最新の命令を実行するようにしてもよい。なお、高画質化処理には比較的時間がかかる場合がある。このため、上述したどのようなタイミングで命令が実行されたとしても、高画質画像が表示されるまでに比較的時間がかかる場合がある。そこで、検者からの指示に応じてＯＣＴＡのＥｎ－Ｆａｃｅ画像を生成するための深度範囲が設定されてから、高画質画像が表示されるまでの間、該設定された深度範囲に対応するＯＣＴＡのＥｎ－Ｆａｃｅ画像（低画質画像）が表示されてもよい。すなわち、上記深度範囲が設定されると、該設定された深度範囲に対応するＯＣＴＡのＥｎ－Ｆａｃｅ画像（低画質画像）が表示され、高画質化処理が終了すると、該ＯＣＴＡのＥｎ－Ｆａｃｅ画像（該低画質画像）の表示が高画質画像の表示に変更されるように構成されてもよい。また、上記深度範囲が設定されてから、高画質画像が表示されるまでの間、高画質化処理が実行されていることを示す情報が表示されてもよい。なお、これらは、高画質化処理の実行が既に指定されている状態（ボタン３４２０がアクティブ状態）を前提とする場合だけでなく、例えば、検者からの指示に応じて高画質化処理の実行が指示された際に、高画質画像が表示されるまでの間においても、適用することが可能である。 The image is changed using a user interface (e.g., a combo box) not shown. For example, when the examiner changes the type of image from the superficial layer to the choroidal vascular network, the image quality improvement unit 404 performs image quality improvement processing on the choroidal vascular network image, and the output unit 405 displays the high-quality image generated by the image quality improvement unit 404 on the report screen. 
That is, in response to an instruction from the examiner, the output unit 405 may change the display of the high-quality image of a first depth range to the display of a high-quality image of a second depth range that differs at least in part from the first depth range. At this time, the output unit 405 may make this change as a result of the first depth range being changed to the second depth range in response to the instruction from the examiner. Note that, as described above, if a high-quality image has already been generated for an image that is likely to be displayed at the time of transition to the report screen, the output unit 405 need only display the already generated high-quality image. The method of changing the image type is not limited to the above; an OCTA En-Face image with a different depth range can also be generated by changing the reference layer and the offset value. In that case, when the reference layer or the offset value is changed, the image quality improvement unit 404 executes image quality improvement processing on an arbitrary OCTA En-Face image, and the output unit 405 displays the high-quality image on the report screen. The reference layer and the offset value can be changed using a user interface not shown (e.g., a combo box or a text box). The generation range of the OCTA En-Face image can also be changed by dragging either of the boundary lines 3413 and 3414 superimposed on the tomographic images Im3411 and Im3412 (that is, by moving a layer boundary). When a boundary line is changed by dragging, execution commands for the image quality improvement processing are issued continuously. The image quality improvement unit 404 may therefore process every execution command, or may execute the processing only after the layer boundary has been changed by the drag.
Alternatively, although execution of the image quality improvement processing is commanded continuously, the previous command may be canceled when the next command arrives so that only the latest command is executed. Note that the image quality improvement processing may take a relatively long time. For this reason, whichever of the above timings is used to execute the command, it may take a relatively long time until the high-quality image is displayed. Accordingly, from the time the depth range for generating the OCTA En-Face image is set in response to an instruction from the examiner until the high-quality image is displayed, the OCTA En-Face image (low-quality image) corresponding to the set depth range may be displayed. That is, the configuration may be such that, when the depth range is set, the OCTA En-Face image (low-quality image) corresponding to the set depth range is displayed, and when the image quality improvement processing finishes, the display of that OCTA En-Face image (the low-quality image) is changed to the display of the high-quality image. Information indicating that the image quality improvement processing is in progress may also be displayed from the time the depth range is set until the high-quality image is displayed. Note that these measures are applicable not only when execution of the image quality improvement processing has already been specified (the button 3420 is active), but also, for example, during the period from when execution of the image quality improvement processing is instructed by the examiner until the high-quality image is displayed.
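The option of keeping only the latest command while showing the low-quality image in the meantime can be sketched as follows. This is a simplified illustration, not the embodiment's implementation: the class `LatestOnlyEnhancer` and its methods are hypothetical, and cancellation is modeled by discarding the result of any superseded command rather than by interrupting a running computation.

```python
from typing import Callable, Optional, Tuple

DepthRange = Tuple[float, float]


class LatestOnlyEnhancer:
    """Hypothetical helper: only the most recent enhancement command takes effect."""

    def __init__(self, enhance: Callable[[DepthRange], str]) -> None:
        self._enhance = enhance
        self._latest_id = 0
        self.displayed: Optional[str] = None

    def request(self, depth_range: DepthRange) -> int:
        """Register a new command and show the low-quality image right away."""
        self._latest_id += 1
        self.displayed = f"low-quality {depth_range}"
        return self._latest_id

    def complete(self, request_id: int, depth_range: DepthRange) -> bool:
        """Apply a finished enhancement unless a newer command superseded it."""
        if request_id != self._latest_id:
            return False  # stale command: treated as canceled
        self.displayed = self._enhance(depth_range)
        return True


ui = LatestOnlyEnhancer(lambda r: f"high-quality {r}")
first = ui.request((0.0, 10.0))    # drag starts
second = ui.request((0.0, 12.0))   # drag continues; first command is superseded
ui.complete(first, (0.0, 10.0))    # ignored
ui.complete(second, (0.0, 12.0))   # the display switches to the high-quality image
print(ui.displayed)
```

In a real application the enhancement would run asynchronously; the request counter is one simple way to decide whether a finished result is still wanted.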

本実施形態では、ＯＣＴＡのＥｎ－Ｆａｃｅ画像として、Ｉｍ３４０７とＩｍ３４０８に異なる層を表示し、低画質と高画質な画像は切り替えて表示する例を示したが、これに限らない。例えば、Ｉｍ３４０７には低画質なＯＣＴＡのＥｎ－Ｆａｃｅ画像、Ｉｍ３４０８には高画質なＯＣＴＡのＥｎ－Ｆａｃｅ画像を並べて表示するようにしてもよい。画像を切り替えて表示する場合には、同じ場所で画像を切り替えるので変化がある部分の比較を行いやすく、並べて表示する場合には、同時に画像を表示することが出来るので画像全体を比較しやすい。 In this embodiment, an example has been shown in which different layers are displayed as OCTA En-Face images in Im3407 and Im3408, and the low-quality and high-quality images are switched for display, but this is not limiting. For example, a low-quality OCTA En-Face image may be displayed in Im3407 and a high-quality OCTA En-Face image in Im3408, so that the two are shown side by side. When the images are switched, they are swapped at the same location, which makes it easy to compare the parts that change; when they are displayed side by side, both are visible at the same time, which makes it easy to compare the images as a whole.

次に、図３４（ａ）と（ｂ）を用いて、画面遷移における高画質化処理の実行について説明を行う。図３４（ｂ）は、図３４（ａ）におけるＯＣＴＡのＥｎ－Ｆａｃｅ画像Ｉｍ３４０７を拡大表示した画面例である。図３４（ｂ）においても、図３４（ａ）と同様にボタン３４２０を表示する。図３４（ａ）から図３４（ｂ）への画面遷移は、例えば、ＯＣＴＡのＥｎ－Ｆａｃｅ画像Ｉｍ３４０７をダブルクリックすることで遷移し、図３４（ｂ）から図３４（ａ）へは閉じるボタン３４３０で遷移する。なお、画面遷移に関しては、ここで示した方法に限らず、不図示のユーザーインターフェースを用いてもよい。画面遷移の際に高画質化処理の実行が指定されている場合（ボタン３４２０がアクティブ）、画面遷移時においてもその状態を保つ。すなわち、図３４（ａ）の画面で高画質画像を表示している状態で図３４（ｂ）の画面に遷移する場合、図３４（ｂ）の画面においても高画質画像を表示する。そして、ボタン３４２０はアクティブ状態にする。図３４（ｂ）から図３４（ａ）へ遷移する場合にも同様である。図３４（ｂ）において、ボタン３４２０を指定して低画質画像に表示を切り替えることも出来る。画面遷移に関して、ここで示した画面に限らず、経過観察用の表示画面、又はパノラマ画像用の表示画面など同じ撮影データを表示する画面への遷移であれば、高画質画像の表示状態を保ったまま遷移を行う。すなわち、遷移後の表示画面において、遷移前の表示画面におけるボタン３４２０の状態に対応する画像が表示される。例えば、遷移前の表示画面におけるボタン３４２０がアクティブ状態であれば、遷移後の表示画面において高画質画像が表示される。また、例えば、遷移前の表示画面におけるボタン３４２０のアクティブ状態が解除されていれば、遷移後の表示画面において低画質画像が表示される。なお、経過観察用の表示画面におけるボタン３４２０がアクティブ状態になると、経過観察用の表示画面に並べて表示される異なる日時（異なる検査日）で得た複数の画像が高画質画像に切り換わるようにしてもよい。すなわち、経過観察用の表示画面におけるボタン３４２０がアクティブ状態になると、異なる日時で得た複数の画像に対して一括で反映されるように構成してもよい。なお、経過観察用の表示画面の例を、図３８に示す。検者からの指示に応じてタブ３８０１が選択されると、図３８のように、経過観察用の表示画面が表示される。このとき、Ｅｎ－Ｆａｃｅ画像の深度範囲を、リストボックスに表示された既定の深度範囲セット（３８０２及び３８０３）から検者が選択することで変更できる。例えば、リストボックス３８０２では網膜表層が選択され、また、リストボックス３８０３では網膜深層が選択されている。上側の表示領域には網膜表層のＥｎ－Ｆａｃｅ画像の解析結果が表示され、また、下側の表示領域には網膜深層のＥｎ－Ｆａｃｅ画像の解析結果が表示されている。すなわち、深度範囲が選択されると、異なる日時の複数の画像について、選択された深度範囲の複数のＥｎ－Ｆａｃｅ画像の解析結果の並列表示に一括して変更される。このとき、解析結果の表示を非選択状態にすると、異なる日時の複数のＥｎ－Ｆａｃｅ画像の並列表示に一括して変更されてもよい。そして、検者からの指示に応じてボタン３４２０が指定されると、複数のＥｎ－Ｆａｃｅ画像の表示が複数の高画質画像の表示に一括して変更される。また、解析結果の表示が選択状態である場合には、検者からの指示に応じてボタン３４２０が指定されると、複数のＥｎ－Ｆａｃｅ画像の解析結果の表示が複数の高画質画像の解析結果の表示に一括して変更される。ここで、解析結果の表示は、解析結果を任意の透明度により画像に重畳表示させたものであってもよい。このとき、解析結果の表示への変更は、例えば、表示されている画像に対して任意の透明度により解析結果を重畳させた状態に変更したものであってもよい。また、解析結果の表示への変更は、例えば、解析結果と画像とを任意の透明度によりブレンド処理して得た画像（例えば、２次元マップ）の表示への変更であってもよい。また、深度範囲の指定に用いる層境界の種類とオフセット位置をそれぞれ、３８０５、３８０６のようなユーザーインターフェースから一括して変更することができる。なお、断層画像も一緒に表示させ、断層画像上に重畳された層境界データを検者からの指示に応じて移動させることにより、異なる日時の複数のＥｎ－Ｆａｃｅ画像の深度範囲を一括して変更されてもよい。このとき、異なる日時の複数の断層画像を並べて表示し、１つの断層画像上で上記移動が行われると、他の断層画像上でも同様に層境界データが移動されてもよい。また、画像投影法やプロジェクションアーティファクト抑制処理
の有無を例えばコンテキストメニューのようなユーザーインターフェースから選択することにより変更してもよい。また、選択ボタン３８０７を選択して選択画面を表示させ、該選択画面上に表示された画像リストから選択された画像が表示されてもよい。なお、図３８の上部に表示されている矢印３８０４は現在選択されている検査であることを示す印であり、基準検査（Ｂａｓｅｌｉｎｅ）はＦｏｌｌｏｗ－ｕｐ撮影の際に選択した検査（図３８の一番左側の画像）である。もちろん、基準検査を示すマークを表示部に表示させてもよい。また、「ＳｈｏｗＤｉｆｆｅｒｅｎｃｅ」チェックボックス３８０８が指定された場合には、基準画像上に基準画像に対する計測値分布（マップもしくはセクタマップ）を表示する。さらに、この場合には、それ以外の検査日に対応する領域に基準画像に対して算出した計測値分布と当該領域に表示される画像に対して算出した計測分布との差分計測値マップを表示する。計測結果としてはレポート画面上にトレンドグラフ（経時変化計測によって得られた各検査日の画像に対する計測値のグラフ）を表示させてもよい。すなわち、異なる日時の複数の画像に対応する複数の解析結果の時系列データ（例えば、時系列グラフ）が表示されてもよい。このとき、表示されている複数の画像に対応する複数の日時以外の日時に関する解析結果についても、表示されている複数の画像に対応する複数の解析結果と判別可能な状態で（例えば、時系列グラフ上の各点の色が画像の表示の有無で異なる）時系列データとして表示させてもよい。また、該トレンドグラフの回帰直線（曲線）や対応する数式をレポート画面に表示させてもよい。 Next, the execution of image quality improvement processing during screen transitions will be described using FIGS. 34(a) and 34(b). FIG. 34(b) is an example of a screen in which the OCTA En-Face image Im3407 in FIG. 34(a) is displayed enlarged. The button 3420 is displayed in FIG. 34(b) in the same manner as in FIG. 34(a). The screen transitions from FIG. 34(a) to FIG. 34(b) when, for example, the OCTA En-Face image Im3407 is double-clicked, and from FIG. 34(b) to FIG. 34(a) when the close button 3430 is pressed. Note that the screen transition is not limited to the methods shown here, and a user interface not shown may be used. If execution of the image quality improvement processing is specified at the time of the screen transition (the button 3420 is active), that state is maintained across the transition. That is, when the screen transitions to FIG. 34(b) while a high-quality image is displayed on the screen of FIG. 34(a), the high-quality image is also displayed on the screen of FIG. 34(b), and the button 3420 is set to the active state. The same applies when transitioning from FIG. 34(b) to FIG. 34(a).
In FIG. 34(b), the button 3420 can also be designated to switch the display to a low-quality image. Screen transitions are not limited to the screens shown here: for a transition to any screen that displays the same imaging data, such as a display screen for follow-up observation or a display screen for panoramic images, the transition is performed while maintaining the display state of the high-quality image. That is, the display screen after the transition shows an image corresponding to the state of the button 3420 on the display screen before the transition. For example, if the button 3420 was active on the display screen before the transition, the high-quality image is displayed after the transition. Likewise, if the active state of the button 3420 had been released before the transition, the low-quality image is displayed after the transition. When the button 3420 on the display screen for follow-up observation becomes active, the multiple images obtained at different dates and times (different examination dates) and displayed side by side on that screen may be switched to high-quality images. That is, the configuration may be such that activating the button 3420 on the display screen for follow-up observation is reflected collectively on the multiple images obtained at different dates and times. An example of the display screen for follow-up observation is shown in FIG. 38. When the tab 3801 is selected in response to an instruction from the examiner, the display screen for follow-up observation is displayed as shown in FIG. 38. At this time, the examiner can change the depth range of the En-Face images by selecting from the predefined depth range sets displayed in the list boxes (3802 and 3803). For example, the superficial layer of the retina is selected in the list box 3802, and the deep layer of the retina is selected in the list box 3803.
The upper display area shows the analysis results of the En-Face images of the superficial layer of the retina, and the lower display area shows the analysis results of the En-Face images of the deep layer of the retina. That is, when a depth range is selected, the display for the multiple images of different dates and times is collectively changed to a side-by-side display of the analysis results of the multiple En-Face images of the selected depth range. At this time, if the display of the analysis results is deselected, the display may be collectively changed to a side-by-side display of the multiple En-Face images of different dates and times. Then, when the button 3420 is designated in response to an instruction from the examiner, the display of the multiple En-Face images is collectively changed to a display of multiple high-quality images. When the display of the analysis results is in a selected state and the button 3420 is designated in response to an instruction from the examiner, the display of the analysis results of the multiple En-Face images is collectively changed to a display of the analysis results of the multiple high-quality images. Here, the display of the analysis results may superimpose the analysis results on the image with an arbitrary transparency. In this case, the change to the display of the analysis results may be, for example, a change to a state in which the analysis results are superimposed on the currently displayed image with an arbitrary transparency. Alternatively, the change to the display of the analysis results may be, for example, a change to the display of an image (e.g., a two-dimensional map) obtained by blending the analysis results and the image with an arbitrary transparency. In addition, the type of layer boundary and the offset position used to specify the depth range can each be changed collectively from user interfaces such as 3805 and 3806.
The depth ranges of the multiple En-Face images of different dates and times may also be changed collectively by displaying tomographic images together with them and moving the layer boundary data superimposed on a tomographic image in response to an instruction from the examiner. In this case, multiple tomographic images of different dates and times may be displayed side by side, and when the layer boundary data is moved on one tomographic image, it may be moved in the same way on the other tomographic images. The image projection method and whether projection artifact suppression processing is applied may be changed by selection from a user interface such as a context menu. The selection button 3807 may be selected to display a selection screen, and an image selected from the image list shown on that screen may then be displayed. The arrow 3804 displayed at the top of FIG. 38 is a mark indicating the currently selected examination, and the reference examination (Baseline) is the examination selected at the time of follow-up imaging (the leftmost image in FIG. 38). Of course, a mark indicating the reference examination may be displayed on the display unit. When the "Show Difference" checkbox 3808 is designated, the measurement value distribution (map or sector map) for the reference image is displayed on the reference image. Furthermore, in this case, each area corresponding to another examination date displays a difference measurement value map between the measurement value distribution calculated for the reference image and the measurement value distribution calculated for the image displayed in that area. As a measurement result, a trend graph (a graph of the measurement values for the images of the respective examination dates, obtained by measuring change over time) may be displayed on the report screen.
That is, time series data (e.g., a time series graph) of multiple analysis results corresponding to multiple images at different dates and times may be displayed. In this case, analysis results for dates and times other than the multiple dates and times corresponding to the multiple images displayed may also be displayed as time series data in a state that can be distinguished from the multiple analysis results corresponding to the multiple images displayed (e.g., the color of each point on the time series graph differs depending on whether the image is displayed or not). Also, the regression line (curve) of the trend graph and the corresponding formula may be displayed on the report screen.
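As an illustration of the regression line of the trend graph, the following Python sketch fits a straight line to per-examination measurement values by ordinary least squares. The function name `fit_trend_line`, the use of days since the baseline examination as the x-axis, and the sample values are assumptions for illustration; the embodiment does not specify the fitting method, and a curve could be used instead.

```python
from typing import List, Tuple


def fit_trend_line(days: List[float], values: List[float]) -> Tuple[float, float]:
    """Least-squares fit of value = slope * day + intercept."""
    n = len(days)
    if n < 2 or n != len(values):
        raise ValueError("need at least two points and equal-length inputs")
    mean_x = sum(days) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in days)
    if sxx == 0:
        raise ValueError("all examinations fall on the same day")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(days, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical measurement values (e.g., a vessel density in percent) at three examinations.
slope, intercept = fit_trend_line([0.0, 10.0, 20.0], [50.0, 48.0, 46.0])
print(f"value = {slope:.3f} * day + {intercept:.3f}")
```

The returned slope and intercept correspond to the "corresponding formula" that may be displayed next to the regression line on the report screen.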

本実施形態においては、ＯＣＴＡのＥｎ－Ｆａｃｅ画像に関して説明を行ったが、これに限らない。本実施形態に係る表示、高画質化、及び画像解析等の処理に関する画像は、輝度のＥｎ－Ｆａｃｅ画像でもよい。さらには、Ｅｎ－Ｆａｃｅ画像だけではなく、断層画像やＳＬＯ画像、眼底写真、又は蛍光眼底写真など、異なる画像であっても構わない。その場合、高画質化処理を実行するためのユーザーインターフェースは、種類の異なる複数の画像に対して高画質化処理の実行を指示するもの、種類の異なる複数の画像から任意の画像を選択して高画質化処理の実行を指示するものがあってもよい。 In this embodiment, the En-Face image of OCTA has been described, but this is not limiting. The image related to the display, image quality improvement, image analysis, and other processes according to this embodiment may be an En-Face image of luminance. Furthermore, it may be a different image such as a tomographic image, an SLO image, a fundus photograph, or a fluorescent fundus photograph, in addition to the En-Face image. In this case, the user interface for executing the image quality improvement process may be one that instructs the execution of image quality improvement process on multiple images of different types, or one that instructs the execution of image quality improvement process by selecting any image from multiple images of different types.

このような構成により、本実施形態に係る高画質化部４０４が処理した画像を出力部４０５が表示部２０に表示することができる。このとき、上述したように、高画質画像の表示、解析結果の表示、表示される正面画像の深度範囲等に関する複数の条件のうち少なくとも１つが選択された状態である場合には、表示画面が遷移されても、選択された状態が維持されてもよい。また、上述したように、複数の条件のうち少なくとも１つが選択された状態である場合には、他の条件が選択された状態に変更されても、該少なくとも１つが選択された状態が維持されてもよい。例えば、出力部４０５は、解析結果の表示が選択状態である場合に、検者からの指示に応じて（例えば、ボタン３４２０が指定されると）、低画質画像の解析結果の表示を高画質画像の解析結果の表示に変更してもよい。また、出力部４０５は、解析結果の表示が選択状態である場合に、検者からの指示に応じて（例えば、ボタン３４２０の指定が解除されると）、高画質画像の解析結果の表示を低画質画像の解析結果の表示に変更してもよい。また、出力部４０５は、高画質画像の表示が非選択状態である場合に、検者からの指示に応じて（例えば、解析結果の表示の指定が解除されると）、低画質画像の解析結果の表示を低画質画像の表示に変更してもよい。また、出力部４０５は、高画質画像の表示が非選択状態である場合に、検者からの指示に応じて（例えば、解析結果の表示が指定されると）、低画質画像の表示を低画質画像の解析結果の表示に変更してもよい。また、出力部４０５は、高画質画像の表示が選択状態である場合に、検者からの指示に応じて（例えば、解析結果の表示の指定が解除されると）、高画質画像の解析結果の表示を高画質画像の表示に変更してもよい。また、出力部４０５は、高画質画像の表示が選択状態である場合に、検者からの指示に応じて（例えば、解析結果の表示が指定されると）、高画質画像の表示を高画質画像の解析結果の表示に変更してもよい。また、高画質画像の表示が非選択状態で且つ第１の種類の解析結果の表示が選択状態である場合を考える。この場合には、出力部４０５は、検者からの指示に応じて（例えば、第２の種類の解析結果の表示が指定されると）、低画質画像の第１の種類の解析結果の表示を低画質画像の第２の種類の解析結果の表示に変更してもよい。また、高画質画像の表示が選択状態で且つ第１の種類の解析結果の表示が選択状態である場合を考える。この場合には、出力部４０５は、検者からの指示に応じて（例えば、第２の種類の解析結果の表示が指定されると）、高画質画像の第１の種類の解析結果の表示を高画質画像の第２の種類の解析結果の表示に変更してもよい。なお、経過観察用の表示画面においては、上述したように、これらの表示の変更が、異なる日時で得た複数の画像に対して一括で反映されるように構成してもよい。ここで、解析結果の表示は、解析結果を任意の透明度により画像に重畳表示させたものであってもよい。このとき、解析結果の表示への変更は、例えば、表示されている画像に対して任意の透明度により解析結果を重畳させた状態に変更したものであってもよい。また、解析結果の表示への変更は、例えば、解析結果と画像とを任意の透明度によりブレンド処理して得た画像（例えば、２次元マップ）の表示への変更であってもよい。 With this configuration, the output unit 405 can display the image processed by the image quality improvement unit 404 according to this embodiment on the display unit 20. At this time, as described above, when at least one of the multiple conditions related to the display of the high-quality image, the display of the analysis result, the depth range of the displayed front image, etc. is selected, the selected state may be maintained even if the display screen is changed. 
Also, as described above, when at least one of the multiple conditions is in a selected state, the selected state of that at least one condition may be maintained even if another condition is changed to a selected state. For example, when the display of the analysis result is in a selected state, the output unit 405 may change the display of the analysis result of the low-quality image to the display of the analysis result of the high-quality image in response to an instruction from the examiner (for example, when the button 3420 is designated). Also, when the display of the analysis result is in a selected state, the output unit 405 may change the display of the analysis result of the high-quality image to the display of the analysis result of the low-quality image in response to an instruction from the examiner (for example, when the designation of the button 3420 is released). In addition, when the display of the high-quality image is in a non-selected state, the output unit 405 may change the display of the analysis result of the low-quality image to the display of the low-quality image in response to an instruction from the examiner (for example, when the designation of the display of the analysis result is released). In addition, when the display of the high-quality image is in a non-selected state, the output unit 405 may change the display of the low-quality image to the display of the analysis result of the low-quality image in response to an instruction from the examiner (for example, when the display of the analysis result is designated). In addition, when the display of the high-quality image is in a selected state, the output unit 405 may change the display of the analysis result of the high-quality image to the display of the high-quality image in response to an instruction from the examiner (for example, when the designation of the display of the analysis result is released).
In addition, when the display of the high-quality image is in a selected state, the output unit 405 may change the display of the high-quality image to the display of the analysis result of the high-quality image in response to an instruction from the examiner (for example, when the display of the analysis result is designated). Next, consider a case where the display of the high-quality image is in a non-selected state and the display of a first type of analysis result is in a selected state. In this case, the output unit 405 may change the display of the first type of analysis result of the low-quality image to the display of a second type of analysis result of the low-quality image in response to an instruction from the examiner (for example, when the display of the second type of analysis result is designated). Also, consider a case where the display of the high-quality image is in a selected state and the display of the first type of analysis result is in a selected state. In this case, the output unit 405 may change the display of the first type of analysis result of the high-quality image to the display of the second type of analysis result of the high-quality image in response to an instruction from the examiner (for example, when the display of the second type of analysis result is designated). In addition, on the display screen for follow-up observation, as described above, these display changes may be reflected collectively on multiple images obtained at different dates and times. Here, the display of the analysis result may superimpose the analysis result on the image with an arbitrary transparency. At this time, the change to the display of the analysis result may be, for example, a change to a state in which the analysis result is superimposed on the displayed image with an arbitrary transparency.
Additionally, the change to the display of the analysis results may be, for example, a change to the display of an image (e.g., a two-dimensional map) obtained by blending the analysis results and the image with an arbitrary transparency.
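The display changes enumerated above amount to two independent selections (high-quality image on or off, analysis result on or off), each of which is kept when the other is toggled. A minimal Python sketch of that idea follows; the names `DisplayState`, `toggle_high_quality`, and `toggle_analysis` are hypothetical and are not part of the described apparatus.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class DisplayState:
    """Hypothetical display conditions held by the output unit."""
    high_quality: bool = False    # corresponds to the button 3420 being active
    show_analysis: bool = False   # display of the analysis result is selected

    def describe(self) -> str:
        image = "high-quality image" if self.high_quality else "low-quality image"
        return f"analysis result of {image}" if self.show_analysis else image


def toggle_high_quality(state: DisplayState) -> DisplayState:
    # Only the image quality condition changes; the analysis selection is kept.
    return replace(state, high_quality=not state.high_quality)


def toggle_analysis(state: DisplayState) -> DisplayState:
    # Only the analysis condition changes; the image quality selection is kept.
    return replace(state, show_analysis=not state.show_analysis)


state = DisplayState(high_quality=False, show_analysis=True)
print(state.describe())            # analysis result of low-quality image
state = toggle_high_quality(state)
print(state.describe())            # analysis result of high-quality image
state = toggle_analysis(state)
print(state.describe())            # high-quality image
```

On a follow-up observation screen, the same state object could be applied to every displayed examination date so that a single toggle is reflected collectively.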

＜第２５の実施形態＞ <Twenty-fifth embodiment>
次に、図３５を参照して、第２５の実施形態に係る画像処理装置について説明する。本実施形態では、処理判定部３５０６ついて説明を行う。 Next, an image processing device according to a twenty-fifth embodiment will be described with reference to FIG. 35. In this embodiment, the processing determination unit 3506 will be described.

特に明記しない限り、本実施形態に係る画像処理装置の構成及び処理は、第１の実施形態に係る画像処理装置４００と同様である。そのため、以下では、本実施形態に係る処理判定部３５０６について説明する。 Unless otherwise specified, the configuration and processing of the image processing device according to this embodiment are the same as those of the image processing device 400 according to the first embodiment. Therefore, the processing determination unit 3506 according to this embodiment will be described below.

処理判定部３５０６は、高画質化部４０４における高画質化処理をＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）で処理をするか、ＣＰＵで処理をするか判定を行う。 The processing determination unit 3506 determines whether the image quality improvement process in the image quality improvement unit 404 should be performed by the GPU (Graphics Processing Unit) or the CPU.

処理判定部３５０６は高画質化部４０４の処理を実行する装置に搭載されているＧＰＵのＧＰＵ名、ＧＰＵドライバ、ＧＰＵ搭載のメモリサイズなど、機械学習を用いる高画質化処理を実行するのに十分な環境であるか否かを判定する。処理判定部３５０６により、ＧＰＵを使用可能であると判定された場合、高画質化部４０４はＧＰＵを用いて処理を行う。処理判定部３５０６により、ＧＰＵを使用不可能であると判定された場合、高画質化部４０４はＣＰＵを用いて処理を行う。処理判定部３５０６によりＧＰＵを使用不可能であると判定された場合、ＧＰＵと比較してＣＰＵの方が処理に時間がかかるため、出力部４０５はＧＰＵではなくＣＰＵで処理を行うことを表示部２０に表示する。なお、ＧＰＵを使用可能な場合にＧＰＵを用いて処理する表示をしても良い。表示部２０への表示の仕方として、メッセージを表示してもよいし、ＧＰＵ、ＣＰＵなど単語だけを表示するようにしても良い。なお、高画質化部４０４の処理をＣＰＵとすることで処理に時間がかかる場合（例えば、数１０秒～数分以上）、高画質化処理の実行を不可として、例えば、図３４で示したボタン３４２０を非表示としても良い。高画質化処理を実行するためのユーザーインターフェースを非表示とすることで、その機能を使用することが出来ない。使用を不可とする場合は、使用不可であることを表示部２０へ表示してもよい。 The processing determination unit 3506 checks the GPU installed in the device that executes the processing of the image quality improvement unit 404, for example its GPU name, GPU driver, and on-board memory size, and determines whether the environment is sufficient to execute image quality improvement processing that uses machine learning. If the processing determination unit 3506 determines that the GPU can be used, the image quality improvement unit 404 performs the processing using the GPU. If the processing determination unit 3506 determines that the GPU cannot be used, the image quality improvement unit 404 performs the processing using the CPU. In the latter case, because the CPU takes longer to process than the GPU, the output unit 405 displays on the display unit 20 that the processing will be performed by the CPU rather than the GPU. Note that when the GPU can be used, an indication that the processing will be performed by the GPU may also be displayed. The indication on the display unit 20 may be a message, or may be only a word such as "GPU" or "CPU". Note that if performing the processing of the image quality improvement unit 404 on the CPU takes a long time (for example, several tens of seconds to several minutes or more), execution of the image quality improvement processing may be disabled and, for example, the button 3420 shown in FIG. 34 may be hidden.
Hiding the user interface for executing the image quality improvement processing makes the function unavailable. When the function is disabled, an indication that it is unavailable may be displayed on the display unit 20.

処理判定部３５０６はＧＰＵとＣＰＵの判定を行うだけに限らず、高画質化処理自体の実行判定も行うことが出来る。例えば、高画質化処理を実行するためにライセンス登録が必要である場合について説明をする。処理判定部３５０６はライセンス登録がされているか否かの判定を行い、ライセンス登録がされている場合には高画質化処理の実行可として、例えば、図３４で示したボタン３４２０を表示する。ライセンス登録がされていない場合には、図３４で示したボタン３４２０を非表示とすることで使用が出来ないものとする。なお、高画質化処理を実行するためにライセンス登録が必要である場合は、上述したＧＰＵ処理とＣＰＵ処理判定の前に実行をする。 The processing determination unit 3506 is not limited to deciding between the GPU and the CPU; it can also determine whether the image quality improvement processing itself may be executed. As an example, a case where license registration is required to execute the image quality improvement processing will be described. The processing determination unit 3506 determines whether a license has been registered, and if so, treats the image quality improvement processing as executable and, for example, displays the button 3420 shown in FIG. 34. If a license has not been registered, the button 3420 shown in FIG. 34 is hidden so that the function cannot be used. Note that when license registration is required to execute the image quality improvement processing, this license determination is performed before the GPU/CPU determination described above.

処理判定部３５０６は処理の実行判定を自動的に行うだけではなく、検者の指示に基づいて行うようにしても良い。例えば、不図示のユーザーインターフェースを用いて、検者からＣＰＵ実行を指定されている場合、ＧＰＵではなくＣＰＵで処理をするように判定する。その場合、処理判定部３５０６は装置に搭載されているＧＰＵを調べる必要はなく、高画質化部４０４はＣＰＵを用いて処理を行う。 The processing determination unit 3506 may not only automatically determine whether to execute processing, but may also determine processing based on instructions from the examiner. For example, if the examiner uses a user interface (not shown) to specify CPU execution, the processing determination unit 3506 determines that processing should be performed by the CPU rather than the GPU. In that case, the processing determination unit 3506 does not need to check the GPU installed in the device, and the image quality improvement unit 404 performs processing using the CPU.
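The order of determinations described for the processing determination unit 3506 (license first, then an examiner-specified CPU override, then the GPU environment check) can be sketched in Python as follows. All names and threshold values here (`GpuInfo`, `SUPPORTED_GPU_NAMES`, `MIN_DRIVER_VERSION`, `MIN_MEMORY_MB`, `determine_processing`) are hypothetical placeholders; the embodiment does not state concrete GPU models, driver versions, or memory sizes.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class GpuInfo:
    name: str
    driver_version: Tuple[int, int]
    memory_mb: int


# Hypothetical requirements for running the machine learning model on a GPU.
SUPPORTED_GPU_NAMES = {"ExampleGPU-A", "ExampleGPU-B"}
MIN_DRIVER_VERSION = (450, 0)
MIN_MEMORY_MB = 4096


def determine_processing(licensed: bool,
                         gpu: Optional[GpuInfo],
                         force_cpu: bool = False) -> str:
    """Return "disabled", "cpu", or "gpu" for the image quality improvement process."""
    if not licensed:
        return "disabled"      # license check comes before the GPU/CPU decision
    if force_cpu:
        return "cpu"           # examiner specified CPU; no need to inspect the GPU
    if gpu is None:
        return "cpu"
    gpu_ok = (gpu.name in SUPPORTED_GPU_NAMES
              and gpu.driver_version >= MIN_DRIVER_VERSION
              and gpu.memory_mb >= MIN_MEMORY_MB)
    return "gpu" if gpu_ok else "cpu"


print(determine_processing(True, GpuInfo("ExampleGPU-A", (470, 0), 8192)))
print(determine_processing(True, GpuInfo("ExampleGPU-A", (470, 0), 2048)))
```

The returned value could drive both the choice of processor and the on-screen indication (for example, showing "CPU", or hiding the button 3420 when the result is "disabled"). The result could also be cached at startup rather than recomputed before every enhancement.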

上述した処理判定部３５０６の処理は、高画質化部４０４の処理を行う度に毎回実施する必要はなく、画像処理装置を起動時に行えばよい。あるいは、定期的（例えば、１日に１度）に判定を行うようにしても良い。 The processing by the processing determination unit 3506 described above does not need to be performed every time the image quality improvement unit 404 performs processing, but may be performed when the image processing device is started. Alternatively, the determination may be performed periodically (for example, once a day).

このような構成により、本実施形態に係る処理判定部３５０６が高画質化処理を実行可能か否か判定する。そして、適切な環境を選択して機械学習の処理を実行することが可能となる。 With this configuration, the processing determination unit 3506 according to this embodiment determines whether or not image quality improvement processing can be performed. Then, it becomes possible to select an appropriate environment and perform machine learning processing.

（変形例１）
上述した様々な実施形態において、高画質化エンジンの機械学習モデルと真贋評価エンジンの機械学習モデルとが競合するようにトレーニングすることで、高画質化エンジンと真贋評価エンジンとの効率や精度を向上させるようにしても良い。ここで、複数のモデルが競合するようにトレーニングするネットワークとは、例えば、敵対的生成ネットワーク（ＧＡＮ：ＧｅｎｅｒａｔｉｖｅＡｄｖｅｒｓａｒｉａｌＮｅｔｗｏｒｋｓ）である。このとき、高画質化エンジンの機械学習モデルは、画像を生成する生成モデル（Ｇｅｎｅｒａｔｏｒ）に相当する。また、真贋評価エンジンの機械学習モデルは、生成された画像が本物か否かを識別する識別モデル（Ｄｉｓｃｒｉｍｉｎａｔｏｒ）に相当する。例えば、高画質化の正解となる画像を真贋評価エンジンに評価させると真作ラベルが出力されるように、該高画質化エンジンの機械学習モデルをトレーニングする。そして、高画質化エンジンが生成する画像を真贋評価エンジンに評価させると贋作ラベルを出力するように、該真贋評価エンジンの機械学習モデルをトレーニングさせる。結果的に、高画質化エンジンが生成する画像と高画質化の正解となる画像との区別がつかなくなるように、繰り返しトレーニングをする。これによって、高画質化エンジンと真贋評価エンジンの効率や精度が向上する。 (Variation 1)
In the various embodiments described above, the machine learning model of the image quality improvement engine and the machine learning model of the authenticity evaluation engine may be trained so as to compete with each other, thereby improving the efficiency and accuracy of both engines. A network in which multiple models are trained to compete is, for example, a generative adversarial network (GAN). In this case, the machine learning model of the image quality improvement engine corresponds to the generative model (Generator) that generates images, and the machine learning model of the authenticity evaluation engine corresponds to the discriminative model (Discriminator) that identifies whether a generated image is genuine. For example, the machine learning model of the image quality improvement engine is trained so that a genuine label is output when an image that is the correct answer for image quality improvement is evaluated by the authenticity evaluation engine. The machine learning model of the authenticity evaluation engine is trained so that a counterfeit label is output when an image generated by the image quality improvement engine is evaluated by the authenticity evaluation engine. Training is repeated so that, eventually, the images generated by the image quality improvement engine become indistinguishable from the correct-answer images. This improves the efficiency and accuracy of the image quality improvement engine and the authenticity evaluation engine.
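As an illustration of the competing objectives, the following sketch computes the two losses of a conventional GAN from the discriminator's scores. It assumes the standard binary cross-entropy formulation, which the text does not specify; the function names and the convention that a score near 1 means "genuine" are assumptions made here.

```python
import math


def bce(p: float, label: int, eps: float = 1e-7) -> float:
    """Binary cross-entropy for one probability p against label 0 or 1."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


def discriminator_loss(d_real: float, d_fake: float) -> float:
    """Loss for the authenticity evaluation model (Discriminator).

    d_real: score given to a correct-answer image (target label: genuine = 1)
    d_fake: score given to a generated image (target label: counterfeit = 0)
    """
    return bce(d_real, 1) + bce(d_fake, 0)


def generator_loss(d_fake: float) -> float:
    """Loss for the image quality improvement model (Generator).

    It decreases as the discriminator scores the generated image as genuine.
    """
    return bce(d_fake, 1)
```

Alternating updates that lower each loss in turn drive the generated images toward being indistinguishable from the correct-answer images.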

なお、高画質化エンジンは、敵対的生成ネットワークにより生成された少なくとも１つの画像を含む学習データを学習して得た学習済モデルであっても良い。このとき、敵対的生成ネットワークにより生成された少なくとも１つの画像を学習用の学習データとして用いるか否かを、検者からの指示により選択可能に構成されても良い。 The image quality improvement engine may be a trained model obtained by learning from training data that includes at least one image generated by a generative adversarial network. In this case, the configuration may allow the examiner to select, by instruction, whether to use the at least one image generated by the generative adversarial network as training data.

（変形例２）
上述した様々な実施形態及び変形例において、高画質化エンジンが生成した高画質画像と入力画像とを合成して出力しても良い。例えば、入力画像の画素値が低い（画像として暗い）場合など、高画質化エンジンがノイズ成分として画素値を低減してしまうことが考えられる。そのため、入力画像の明るさに基づいて、高画質化エンジンが生成した画像と入力画像との合成の割合を変更して出力するようにしても良い。すなわち、出力部４０５（表示制御部）は、入力画像（第１の画像）の少なくとも一部の領域に関する情報を用いて得た割合により入力画像と高画質画像（第２の画像）とを合成することにより得た合成画像を出力しても良い。このとき、２つの画像の合成の割合は、入力画像の少なくとも一部の領域における画素値（少なくとも一部の領域の明るさ）を上記情報として用いることにより決定されても良い。このとき、例えば、入力画像における画素値が低い（暗い）ほど、高画質画像に対する入力画像を合成する割合を高くする。また、例えば、入力画像における画素値が高い（明るい）ほど、高画質画像に対する入力画像を合成する割合を低くする。具体的には、画像全体の画素値の統計値（平均値、中央値、最頻値、最小値、最大値、分散、標準偏差など）に基づいて、合成する割合を変える。例えば、入力画像の画素値の統計値が第一の閾値よりも低い場合、高画質化エンジンが生成した画像と入力画像とを０．５：０．５の割合で合成（２つの画像の平均）して得た合成画像を出力する。あるいは、入力画像の画素値の統計値が第二の閾値よりも高い場合、高画質化エンジンが生成した画像と入力画像とを０．９：０．１の割合で合成（２つの画像の重み付き平均）して得た合成画像を出力する。なお、第一の閾値と第二の閾値の間の合成する割合は滑らかに変化するものとする。入力画像から計算する統計値は、画像全体で求めても良いし、いくつかの領域に分割して局所的な統計値を求めるようにしても良い。画像をいくつかの領域に分割する場合、隣接領域においては合成する割合が急激に変化しないように、滑らかな値になるように割合値を補正するようにしても良い。さらに、領域分割をするのではなく、ガウシアンフィルタのような平滑化フィルタを用いて画像をぼかすことにより、ピクセル単位での値を第一、第二の閾値と比較することで、ピクセル毎に合成する割合を求めても良い。なお、画素値の統計値を計算するための画像として、入力画像に限らない。例えば、入力画像がＯＣＴＡの場合、輝度のＥｎｆａｃｅやＰｒｏｊｅｃｔｉｏｎ画像を用いて画素値の統計値を計算するようにしても良い。 (Variation 2)
In the various embodiments and variations described above, the high-quality image generated by the image quality improvement engine may be combined with the input image and output. For example, when the pixel values of the input image are low (the image is dark), the image quality improvement engine may treat them as a noise component and reduce them. Therefore, the ratio at which the image generated by the image quality improvement engine and the input image are combined may be changed based on the brightness of the input image. That is, the output unit 405 (display control unit) may output a composite image obtained by combining the input image (first image) and the high-quality image (second image) at a ratio obtained using information on at least a partial region of the input image. The ratio may be determined by using, as that information, the pixel values in at least a partial region of the input image (the brightness of that region). For example, the lower (darker) the pixel values of the input image, the higher the ratio of the input image relative to the high-quality image; the higher (brighter) the pixel values, the lower that ratio. Specifically, the ratio is changed based on a statistic of the pixel values of the entire image (mean, median, mode, minimum, maximum, variance, standard deviation, etc.). For example, when the statistic of the pixel values of the input image is lower than a first threshold, the image generated by the image quality improvement engine and the input image are combined at a ratio of 0.5:0.5 (the average of the two images) and the resulting composite image is output. Alternatively, when the statistic is higher than a second threshold, the two images are combined at a ratio of 0.9:0.1 (a weighted average of the two images) and the resulting composite image is output. Between the first and second thresholds, the ratio changes smoothly. The statistic may be calculated over the entire image, or the image may be divided into several regions and local statistics calculated. When the image is divided into regions, the ratio values may be corrected so that the ratio does not change abruptly between adjacent regions. Furthermore, instead of dividing the image into regions, the image may be blurred with a smoothing filter such as a Gaussian filter, and the resulting per-pixel values compared with the first and second thresholds to determine a ratio for each pixel. The image used to calculate the statistic is not limited to the input image. For example, when the input image is an OCTA image, the statistic may be calculated using a luminance En-Face image or projection image.
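The threshold-based ratio described above can be sketched as follows. The 0.5 and 0.9 weights come from the text; the use of the mean as the statistic, the smoothstep interpolation between the thresholds (the text only requires a smooth change), the threshold values in the usage example, and all function names are assumptions for illustration.

```python
from statistics import mean


def enhanced_weight(stat, t1, t2, w_low=0.5, w_high=0.9):
    """Weight of the engine-generated image; the input image gets 1 - weight."""
    if t2 <= t1:
        raise ValueError("second threshold must exceed first threshold")
    if stat <= t1:
        return w_low
    if stat >= t2:
        return w_high
    u = (stat - t1) / (t2 - t1)
    s = u * u * (3.0 - 2.0 * u)  # smoothstep: assumed form of "smooth" change
    return w_low + (w_high - w_low) * s


def blend(input_img, enhanced_img, w):
    """Per-pixel weighted average of two same-sized images (lists of rows)."""
    return [[w * e + (1.0 - w) * i for i, e in zip(row_in, row_en)]
            for row_in, row_en in zip(input_img, enhanced_img)]


def blend_by_brightness(input_img, enhanced_img, t1, t2):
    """Blend using the mean pixel value of the whole input image."""
    stat = mean(p for row in input_img for p in row)
    return blend(input_img, enhanced_img, enhanced_weight(stat, t1, t2))
```

For a dark input such as `[[0, 0]]` with hypothetical thresholds 50 and 150, the two images are simply averaged; a local or per-pixel variant would apply `enhanced_weight` to regional or blurred values instead of a single global statistic.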

さらには、入力画像の画素値の統計値ではなく、入力画像と高画質化エンジンが生成した高画質画像との差分により、２つの画像の合成の割合を変更するようにしても良い。すなわち、２つの画像の合成の割合は、２つの画像の互いに対応する少なくとも一部の領域における画素値の差分値を上記情報として用いることにより決定されても良い。具体的には、入力画像と高画質画像との間に差が大きい場合に入力画像の割合を大きくするようにしても良い。すなわち、高画質画像がノイズ除去をしすぎている場合に、入力画像の比率を大きくして合成することで、自然な高画質画像を生成する。なお、差分値を求める際には、単純な差分情報だけではなく、構造的な差分情報により判断するようにしても良い。例えば、Ｈｅｓｓｉａｎフィルタのようなフィルタを用いて線状構造のみを抽出するようにしても良い。それによりランダムなノイズは差分として検出されず、血管のようなある程度連続性のあるノイズだけを抽出することが出来る。さらには、単純にノイズ成分をラベリング処理して、ある程度の大きさを持つノイズだけを抽出するようにしても良い。差分によって合成の割合を変更する場合においても同様に、画像全体で求めても良いし、いくつかの領域に分割して局所的な差分値を求めるようにしても良い。 Furthermore, the ratio of synthesis of two images may be changed not by the statistical value of pixel values of the input image but by the difference between the input image and the high-quality image generated by the high-quality engine. That is, the ratio of synthesis of two images may be determined by using the difference value of pixel values in at least some areas of the two images corresponding to each other as the above information. Specifically, when the difference between the input image and the high-quality image is large, the ratio of the input image may be increased. That is, when the high-quality image has been overly denoised, the ratio of the input image is increased and synthesis is performed to generate a natural high-quality image. Note that when calculating the difference value, it is possible to judge based on not only simple difference information but also structural difference information. For example, a filter such as a Hessian filter may be used to extract only linear structures. This makes it possible to extract only noise with a certain degree of continuity, such as blood vessels, without detecting random noise as a difference. Furthermore, it is possible to simply label the noise components and extract only noise with a certain degree of size. 
Similarly, when the ratio is changed based on the difference, the difference may be calculated over the entire image, or the image may be divided into several regions and local difference values calculated.
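A minimal sketch of the difference-based variant follows. It assumes the simple (non-structural) case: the mean absolute pixel difference over the whole image, mapped linearly to the input-image weight. The bounds `d_low` and `d_high` and the weight range `w_min` to `w_max` are hypothetical parameters, not values from the text.

```python
def input_weight_from_difference(input_img, enhanced_img,
                                 d_low, d_high, w_min=0.1, w_max=0.5):
    """Return the input-image weight; it grows as the two images differ more."""
    if d_high <= d_low:
        raise ValueError("d_high must be greater than d_low")
    diffs = [abs(i - e)
             for row_in, row_en in zip(input_img, enhanced_img)
             for i, e in zip(row_in, row_en)]
    mean_diff = sum(diffs) / len(diffs)
    # Clamp the normalized difference to [0, 1] before interpolating.
    u = min(1.0, max(0.0, (mean_diff - d_low) / (d_high - d_low)))
    return w_min + (w_max - w_min) * u
```

A structural variant would replace the raw difference with the output of a line-enhancing filter or a labeling step, as described above, before computing the weight.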

さらに、部位や画像を認識して合成する割合を求めても良い。これに関して、例えば表層のＯＣＴＡ画像で説明をする。表層のＯＣＴＡ画像において、ＦＡＺ（中心窩の無血管領域）には、血管が存在しないため、ＯＣＴＡ画像においてＦＡＺは暗くなって良い。このため、ＦＡＺに関しては、入力画像に対する高画質画像の割合を高くすることが考えられる。すなわち、ノイズがより低減されている画像の方の割合を高くする。一方、ＦＡＺ以外の位置に暗い領域がある場合、その領域が無血管領域（ＮＰＡ：ＮｏｎｐｅｒｆｕｓｉｏｎＡｒｅａ）であるのか、本当は血管が存在するのに、影等によって輝度が低下した領域であるのか等の判断が難しい。そこで、入力画像に対する高画質画像の割合を低くすることが考えられる。すなわち、本来存在する低輝度の領域が画像から消えてしまっている可能性がある方の画像の割合を低くする。このように、画像の明るさや差分変化だけではなく、部位を認識して合成する割合を変化させても良い。次に画像を認識する場合について説明をする。ＯＣＴＡ画像は、表層、深層、外層では、深さに応じて画像の見え方や明るさが変わる。そのため、対象画像の種類がどの層かを認識し、層の種類に応じて割合を変化させても良い。画像の認識は、層を生成する際の境界線の位置情報を用いて行っても良いし、画像から自動的に認識するようにしても良い。すなわち、画像の明るさだけで判断するのではなく、どの深度から生成されたＯＣＴＡ画像かによって合成する割合を変更してもよい。例えば、表層のＯＣＴＡ画像は全体的に明るく、外層のＯＣＴＡ画像では全体的に暗くなる。そのため、表層のＯＣＴＡ画像と外層のＯＣＴＡ画像とにおいて、画素値の統計値によって合成する割合の第一、第二の閾値と、それに対応する割合はそれぞれ違う値としてもよい。例えば、表層において第一の閾値よりも低い場合、高画質化エンジンが生成した画像と入力画像とを０．５：０．５の割合で合成するが、外層においては第一の閾値よりも低い場合、高画質化エンジンが生成した画像と入力画像とを０．７：０．３の割合で合成するというようにしてもよい。 Furthermore, the combining ratio may be determined by recognizing the site or the image. This is explained using a superficial-layer OCTA image as an example. In a superficial-layer OCTA image, no blood vessels exist in the FAZ (foveal avascular zone), so the FAZ may appear dark. For the FAZ, therefore, the ratio of the high-quality image relative to the input image may be made higher. That is, the image in which noise is more reduced is given the larger share. On the other hand, when there is a dark region at a position other than the FAZ, it is difficult to determine whether that region is an avascular region (NPA: nonperfusion area) or a region where blood vessels actually exist but the luminance has dropped because of a shadow or the like. In that case, the ratio of the high-quality image relative to the input image may be made lower. That is, the image from which a genuinely existing low-luminance region may have disappeared is given the smaller share. In this way, the ratio may be changed by recognizing the site, not only according to the brightness of the image or the difference.
Next, the case of recognizing the image is described. In OCTA images, appearance and brightness change with depth among the superficial, deep, and outer layers. Therefore, the layer to which the target image belongs may be recognized, and the ratio changed according to the layer type. The image may be recognized using the position information of the boundary lines used when generating the layer, or it may be recognized automatically from the image. In other words, the ratio may be changed according to the depth from which the OCTA image was generated, rather than being determined by brightness alone. For example, a superficial-layer OCTA image is bright overall, and an outer-layer OCTA image is dark overall. Therefore, the first and second thresholds applied to the pixel-value statistic, and the ratios corresponding to them, may differ between superficial-layer and outer-layer OCTA images. For example, when the statistic is lower than the first threshold in the superficial layer, the image generated by the image quality improvement engine and the input image are combined at a ratio of 0.5:0.5, whereas when the statistic is lower than the first threshold in the outer layer, they may be combined at a ratio of 0.7:0.3.
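The layer-dependent parameters can be held in a small lookup table, sketched below. Only the low-brightness weights 0.5 (superficial layer) and 0.7 (outer layer) come from the text; the threshold values, the high-brightness weight, and the names `BlendParams` and `LAYER_PARAMS` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlendParams:
    t1: float      # first threshold on the pixel-value statistic (hypothetical)
    t2: float      # second threshold (hypothetical)
    w_low: float   # engine-image weight when the statistic is below t1
    w_high: float  # engine-image weight when the statistic is above t2 (assumed)


LAYER_PARAMS = {
    "superficial": BlendParams(t1=60.0, t2=160.0, w_low=0.5, w_high=0.9),
    "outer": BlendParams(t1=30.0, t2=100.0, w_low=0.7, w_high=0.9),
}


def params_for_layer(layer: str) -> BlendParams:
    """Look up blending parameters for the recognized OCTA layer type."""
    try:
        return LAYER_PARAMS[layer]
    except KeyError:
        raise ValueError(f"no blending parameters for layer {layer!r}")
```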

なお、上述した画像合成は、画素値自体を合成する処理について説明をしているが、画像の不透明度を変更するようにしてもよい。すなわち、合成の割合をアルファブレンドの値としても良い。そのため、例えば、入力画像の割合が０．３とする場合、高画質化エンジンが生成した画像のアルファ値は１、入力画像のアルファ値は０．３とした画像を表示するようにしても良い。この場合、高画質化エンジンが生成した画像は必ず表示するようにし、入力画像のアルファ値を変更して半透明で表示する方が望ましい。 Note that while the image synthesis described above is a process of synthesizing pixel values themselves, it is also possible to change the opacity of the images. In other words, the synthesis ratio may be an alpha blend value. For example, if the input image ratio is 0.3, an image may be displayed in which the alpha value of the image generated by the high image quality engine is 1 and the alpha value of the input image is 0.3. In this case, it is preferable to always display the image generated by the high image quality engine, and to change the alpha value of the input image to display it semi-transparently.
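The alpha-blend interpretation can be illustrated with the usual "over" compositing rule, assumed here: a semi-transparent input image is drawn on top of a fully opaque engine-generated image. The function name and the list-of-rows image representation are illustrative choices.

```python
def composite_over(top, top_alpha, bottom):
    """Draw `top` with opacity `top_alpha` over an opaque `bottom` image."""
    if not 0.0 <= top_alpha <= 1.0:
        raise ValueError("alpha must be within [0, 1]")
    return [[top_alpha * t + (1.0 - top_alpha) * b
             for t, b in zip(row_top, row_bottom)]
            for row_top, row_bottom in zip(top, bottom)]


# Input image at alpha 0.3 over the engine-generated image at alpha 1.
shown = composite_over([[100.0]], 0.3, [[200.0]])
```

With these values the displayed pixel is 0.3 × 100 + 0.7 × 200, which matches a pixel-value blend with an input-image ratio of 0.3.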

また、高画質化エンジンが生成した画像と入力画像とを合成する画像を出力する場合、上述したように高画質化エンジンが自動的に割合を決めた画像を出力するようにしても良い。また、２つの画像の合成の割合は、不図示のユーザーインターフェースを用いて、検者からの指示に応じて変更可能に構成されても良い。このとき、ユーザーインターフェースとしては、スライダーバーやテキストボックスへの数値入力などで割合を変更できるようにしても良いし、割合を変えた画像を複数提示して選択出来るようにしても良い。 When outputting an image that combines an image generated by the image quality improvement engine with an input image, the image quality improvement engine may output an image with a ratio that is automatically determined as described above. The ratio of combining the two images may be configured to be changeable in response to instructions from the examiner using a user interface (not shown). In this case, the user interface may be configured to allow the ratio to be changed using a slider bar or by inputting a number into a text box, or multiple images with different ratios may be presented for selection.

また、入力画像と高画質画像とを合成する割合は、医用画像を入力データとし、該医用画像と該医用画像を高画質化して得た高画質医用画像とを合成する割合に関する情報を正解データ（出力データ）とする学習データにより学習して得た学習済モデルを用いて、入力画像の少なくとも一部の領域に関する情報から決定されても良い。このとき、割合に関する情報は、例えば、検者からの指示に応じて設定（変更）された割合の値であっても良い。また、学習済モデルは、例えば、医用画像と、該医用画像を高画質化して得た高画質医用画像とをセットとする入力データを含む学習データにより学習して得たものであっても良い。このとき、学習済モデルは、上記学習データを用いた機械学習により得ることができる。 The ratio of combining the input image and the high-quality image may be determined from information about at least a portion of the region of the input image using a trained model obtained by training with training data in which a medical image is used as input data and information about the ratio of combining the medical image and a high-quality medical image obtained by improving the quality of the medical image is used as correct answer data (output data). In this case, the information about the ratio may be, for example, a ratio value set (changed) in response to instructions from the examiner. In addition, the trained model may be, for example, one obtained by training with training data including input data that is a set of a medical image and a high-quality medical image obtained by improving the quality of the medical image. In this case, the trained model can be obtained by machine learning using the training data.

ここで、機械学習には、例えば、多階層のニューラルネットワークから成る深層学習（ＤｅｅｐＬｅａｒｎｉｎｇ）がある。また、多階層のニューラルネットワークの少なくとも一部には、例えば、畳み込みニューラルネットワーク（ＣＮＮ：ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）を用いることができる。また、多階層のニューラルネットワークの少なくとも一部には、オートエンコーダ（自己符号化器）に関する技術が用いられてもよい。また、学習には、バックプロパゲーション（誤差逆伝搬法）に関する技術が用いられてもよい。ただし、機械学習としては、深層学習に限らず、画像等の学習データの特徴量を学習によって自ら抽出（表現）可能なモデルであれば何でも良い。また、機械学習は、このようなモデルにも限らず、学習前に予め医用画像を用いて得た特徴量を学習データとして学習するものであっても良い。例えば、機械学習は、サポートベクターマシン、アダブースト、ランダムフォレスト、ベイジアンネットワーク等であっても良い。また、学習済モデルは、検者からの指示に応じて設定（変更）された割合の値を学習データとする追加学習により更新されても良い。例えば、入力画像が比較的暗いときに、高画質画像に対する入力画像の割合を検者が高く設定する傾向にあれば、学習済モデルはそのような傾向となるように追加学習することになる。これにより、例えば、検者の好みに合った合成の割合を得ることができる学習済モデルとしてカスタマイズすることができる。このとき、設定（変更）された割合の値を追加学習の学習データとして用いるか否かを、検者からの指示に応じて決定するためのボタンが表示画面に表示されていても良い。また、学習済モデルを用いて決定された割合をデフォルトの値とし、その後、検者からの指示に応じて割合の値をデフォルトの値から変更可能となるように構成されても良い。また、高画質化エンジンは、高画質化エンジンにより生成された少なくとも１つの高画質画像を含む学習データを追加学習して得た学習済モデルであっても良い。このとき、高画質画像を追加学習用の学習データとして用いるか否かを、検者からの指示により選択可能に構成されても良い。 Here, the machine learning includes, for example, deep learning consisting of a multi-layered neural network. In addition, for example, a convolutional neural network (CNN) can be used for at least a part of the multi-layered neural network. In addition, a technology related to an autoencoder (autoencoder) may be used for at least a part of the multi-layered neural network. In addition, a technology related to backpropagation (backpropagation method) may be used for learning. However, the machine learning is not limited to deep learning, and any model that can extract (express) the feature amount of the learning data such as an image by itself by learning may be used. In addition, the machine learning is not limited to such a model, and may be a model that learns the feature amount obtained in advance using a medical image before learning as the learning data. For example, the machine learning may be a support vector machine, an adaboost, a random forest, a Bayesian network, etc. 
In addition, the trained model may be updated by additional learning in which ratio values set (changed) according to instructions from the examiner are used as training data. For example, if the examiner tends to set the ratio of the input image relative to the high-quality image high when the input image is relatively dark, the trained model is additionally trained to reflect that tendency. This allows the trained model to be customized, for example, so that it yields a ratio matching the examiner's preferences. A button may be displayed on the display screen for deciding, according to an instruction from the examiner, whether to use the set (changed) ratio value as training data for additional learning. The ratio determined using the trained model may also serve as a default value that can subsequently be changed according to an instruction from the examiner. In addition, the image quality improvement engine may be a trained model obtained by additional learning with training data that includes at least one high-quality image generated by the image quality improvement engine. In this case, the configuration may allow the examiner to select, by instruction, whether to use the high-quality image as training data for additional learning.
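The additional-learning idea can be illustrated with a deliberately small stand-in model. The sketch below assumes a linear model that maps a normalized brightness value (0 to 1) to the input-image ratio and updates it by stochastic gradient descent on each ratio the examiner sets; the text describes a trained model operating on medical images, so the scalar feature, the `RatioModel` class, and the learning rate are all simplifying assumptions.

```python
class RatioModel:
    """Toy model: input-image ratio = slope * brightness + intercept."""

    def __init__(self, slope=0.0, intercept=0.5, lr=0.1):
        self.slope = slope
        self.intercept = intercept
        self.lr = lr

    def predict(self, brightness):
        """Default ratio proposed to the examiner, clamped to [0, 1]."""
        raw = self.slope * brightness + self.intercept
        return min(1.0, max(0.0, raw))

    def update(self, brightness, examiner_ratio):
        """One gradient step on squared error toward the examiner's setting."""
        error = (self.slope * brightness + self.intercept) - examiner_ratio
        self.slope -= self.lr * error * brightness
        self.intercept -= self.lr * error


model = RatioModel()
for _ in range(200):
    # The examiner repeatedly raises the input-image ratio for a dark image.
    model.update(brightness=0.2, examiner_ratio=0.8)
```

After these updates the model proposes a ratio close to 0.8 for similarly dark images, which the examiner could still override.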

なお、本変形例に示した高画質化エンジンが生成した画像と入力画像とを合成して出力する合成画像は、上述した様々な実施形態で説明をした高画質画像の代わりとして出力することが可能であり、例えば、経過観察やパノラマ画像などの表示画面においても同様である。すなわち、本変形例における合成画像を複数の位置で取得し、複数の合成画像を用いて広画角画像を生成しても良い。また、複数の合成画像を用いて生成して得た広画角画像をパノラマ画像用の表示画面に表示させても良い。また、本変形例における合成画像を異なる日時で取得し、複数の合成画像を経過観察用の表示画面に並べて表示させても良い。また、本変形例における合成画像に対して血管解析処理等のような解析処理をすることも可能である。 The composite image output by combining the image generated by the image quality improvement engine shown in this modified example with the input image can be output instead of the high quality image described in the various embodiments above, and the same is true for display screens such as follow-up observation and panoramic images. That is, the composite images in this modified example may be acquired at multiple positions, and a wide-angle image may be generated using the multiple composite images. The wide-angle image generated using the multiple composite images may be displayed on a display screen for panoramic images. The composite images in this modified example may be acquired at different dates and times, and the multiple composite images may be displayed side by side on a display screen for follow-up observation. It is also possible to perform analysis processing such as vascular analysis processing on the composite image in this modified example.

（変形例３）
上述した様々な実施形態及び変形例におけるレポート画面において、所望の層の層厚や各種の血管密度等の解析結果を表示させても良い。また、視神経乳頭部、黄斑部、血管領域、神経線維束、硝子体領域、黄斑領域、脈絡膜領域、強膜領域、篩状板領域、網膜層境界、網膜層境界端部、視細胞、血球、血管壁、血管内壁境界、血管外側境界、神経節細胞、角膜領域、隅角領域、シュレム管等の少なくとも１つを含む注目部位に関するパラメータの値（分布）を解析結果として表示させてもよい。このとき、例えば、各種のアーティファクトの低減処理が適用された医用画像を解析することで、精度の良い解析結果を表示させることができる。なお、アーティファクトは、例えば、血管領域等による光吸収により生じる偽像領域、プロジェクションアーティファクト、被検眼の状態（動きや瞬き等）によって測定光の主走査方向に生じる正面画像における帯状のアーティファクト等であっても良い。また、アーティファクトは、例えば、被検者の所定部位の医用画像上に撮影毎にランダムに生じるような写損領域であれば、何でも良い。また、上述したような様々なアーティファクト（写損領域）の少なくとも１つを含む領域に関するパラメータの値（分布）を解析結果として表示させてもよい。また、ドルーゼン、新生血管、白斑（硬性白斑）、シュードドルーゼン等の異常部位等の少なくとも１つを含む領域に関するパラメータの値（分布）を解析結果として表示させてもよい。また、解析結果は、解析マップや、各分割領域に対応する統計値を示すセクター等で表示されても良い。なお、解析結果は、医用画像の解析結果を学習データとして学習して得た学習済モデル（解析結果生成エンジン、解析結果生成用の学習済モデル）を用いて生成されたものであっても良い。このとき、学習済モデルは、医用画像とその医用画像の解析結果とを含む学習データや、医用画像とその医用画像とは異なる種類の医用画像の解析結果とを含む学習データ等を用いた学習により得たものであっても良い。また、学習済モデルは、輝度正面画像及びモーションコントラスト正面画像のように、所定部位の異なる種類の複数の医用画像をセットとする入力データを含む学習データを用いた学習により得たものであっても良い。ここで、輝度正面画像は輝度のＥｎ－Ｆａｃｅ画像に対応し、モーションコントラスト正面画像はＯＣＴＡのＥｎ－Ｆａｃｅ画像に対応する。また、高画質化エンジンにより生成された高画質画像を用いて得た解析結果が表示されるように構成されても良い。また、学習データに含まれる入力データとしては、高画質化エンジンにより生成された高画質画像であっても良いし、低画質画像と高画質画像とのセットであっても良い。また、学習データは、例えば、解析領域を解析して得た解析値（例えば、平均値や中央値等）、解析値を含む表、解析マップ、画像におけるセクター等の解析領域の位置等の少なくとも１つを含む情報を（教師あり学習の）正解データとして、入力データにラベル付けしたデータであってもよい。なお、検者からの指示に応じて、解析結果生成用の学習済モデルにより得た解析結果が表示されるように構成されてもよい。 (Variation 3)
On the report screen in the various embodiments and variations described above, analysis results such as the thickness of a desired layer and various blood vessel densities may be displayed. In addition, values (distributions) of parameters relating to a site of interest including at least one of the optic nerve head, macula, blood vessel region, nerve fiber bundle, vitreous region, macular region, choroid region, sclera region, lamina cribrosa region, retinal layer boundary, retinal layer boundary edge, photoreceptor cells, blood cells, blood vessel wall, blood vessel inner wall boundary, blood vessel outer boundary, ganglion cells, corneal region, anterior chamber angle region, Schlemm's canal, and the like may be displayed as analysis results. In this case, for example, analyzing a medical image to which various artifact reduction processes have been applied makes it possible to display accurate analysis results. The artifacts may be, for example, a false image region caused by light absorption by a blood vessel region or the like, a projection artifact, or a band-shaped artifact in a front image that arises in the main scanning direction of the measurement light depending on the state of the subject's eye (movement, blinking, etc.). The artifact may be anything, as long as it is a poorly imaged region that occurs randomly at each capture on a medical image of a predetermined site of the subject. Values (distributions) of parameters relating to a region including at least one of the various artifacts (poorly imaged regions) described above may also be displayed as analysis results. Likewise, values (distributions) of parameters relating to a region including at least one abnormal site such as drusen, neovascularization, exudates (hard exudates), or pseudodrusen may be displayed as analysis results. The analysis results may be displayed as an analysis map, as sectors showing the statistic corresponding to each divided region, or the like.
The analysis results may be generated using a trained model (analysis result generation engine, trained model for generating analysis results) obtained by learning with analysis results of medical images as training data. In this case, the trained model may be obtained by learning using training data that includes a medical image and the analysis result of that medical image, training data that includes a medical image and the analysis result of a medical image of a different type, or the like. The trained model may also be obtained by learning using training data whose input data is a set of multiple medical images of different types of a predetermined site, such as a luminance front image and a motion contrast front image. Here, the luminance front image corresponds to a luminance En-Face image, and the motion contrast front image corresponds to an OCTA En-Face image. The configuration may also be such that analysis results obtained using a high-quality image generated by the image quality improvement engine are displayed. The input data included in the training data may be a high-quality image generated by the image quality improvement engine, or a set of a low-quality image and a high-quality image. The training data may be data in which the input data is labeled, as correct answer data (for supervised learning), with information including at least one of an analysis value (e.g., mean or median) obtained by analyzing an analysis region, a table including analysis values, an analysis map, the position of an analysis region such as a sector in the image, and the like. The configuration may also be such that analysis results obtained by the trained model for generating analysis results are displayed in response to an instruction from the examiner.

また、上述した様々な実施形態及び変形例におけるレポート画面において、緑内障や加齢黄斑変性等の種々の診断結果を表示させても良い。このとき、例えば、上述したような各種のアーティファクトの低減処理が適用された医用画像を解析することで、精度の良い診断結果を表示させることができる。また、診断結果は、特定された異常部位の位置を画像上に表示されても良いし、また、異常部位の状態等を文字等によって表示されても良い。また、異常部位等の分類結果（例えば、カーティン分類）を診断結果として表示させてもよい。なお、診断結果は、医用画像の診断結果を学習データとして学習して得た学習済モデル（診断結果生成エンジン、診断結果生成用の学習済モデル）を用いて生成されたものであっても良い。このとき、学習済モデルは、医用画像とその医用画像の診断結果とを含む学習データや、医用画像とその医用画像とは異なる種類の医用画像の診断結果とを含む学習データ等を用いた学習により得たものであっても良い。また、高画質化エンジンにより生成された高画質画像を用いて得た診断結果が表示されるように構成されても良い。また、学習データに含まれる入力データとしては、高画質化エンジンにより生成された高画質画像であっても良いし、低画質画像と高画質画像とのセットであっても良い。また、学習データは、例えば、診断名、病変（異常部位）の種類や状態（程度）、画像における病変の位置、注目領域に対する病変の位置、所見（読影所見等）、診断名の根拠（肯定的な医用支援情報等）、診断名を否定する根拠（否定的な医用支援情報）等の少なくとも１つを含む情報を（教師あり学習の）正解データとして、入力データにラベル付けしたデータであってもよい。なお、検者からの指示に応じて、診断結果生成用の学習済モデルにより得た診断結果が表示されるように構成されてもよい。 In addition, various diagnostic results such as glaucoma and age-related macular degeneration may be displayed on the report screen in the various embodiments and modified examples described above. At this time, for example, by analyzing a medical image to which various artifact reduction processes as described above have been applied, a highly accurate diagnostic result can be displayed. In addition, the diagnostic result may be displayed by displaying the position of the identified abnormal part on the image, or the state of the abnormal part by characters or the like. In addition, the classification result of the abnormal part, etc. (for example, Curtin classification) may be displayed as the diagnostic result. In addition, the diagnostic result may be generated using a trained model (diagnosis result generation engine, trained model for generating diagnostic result) obtained by learning the diagnostic result of the medical image as learning data. 
In this case, the trained model may be obtained by learning using training data including a medical image and a diagnostic result of the medical image, or training data including a medical image and a diagnostic result of a type of medical image different from the medical image. In addition, the diagnostic result obtained by using a high-quality image generated by the high-quality engine may be displayed. The input data included in the learning data may be a high-quality image generated by a high-quality image engine, or a set of a low-quality image and a high-quality image. The learning data may be data in which the input data is labeled with information including at least one of the following: diagnosis, type and condition (degree) of the lesion (abnormal site), position of the lesion in the image, position of the lesion relative to the region of interest, findings (image findings, etc.), grounds for the diagnosis (positive medical support information, etc.), and grounds for denying the diagnosis (negative medical support information). The system may be configured to display the diagnosis result obtained by the trained model for generating the diagnosis result in response to an instruction from the examiner.

また、上述した様々な実施例及び変形例におけるレポート画面において、上述したような注目部位、アーティファクト、異常部位等の物体認識結果（物体検出結果）やセグメンテーション結果を表示させても良い。このとき、例えば、画像上の物体の周辺に矩形の枠等を重畳して表示させてもよい。また、例えば、画像における物体上に色等を重畳して表示させてもよい。なお、物体認識結果やセグメンテーション結果は、物体認識やセグメンテーションを示す情報を正解データとして医用画像にラベル付けした学習データを学習して得た学習済モデルを用いて生成されたものであってもよい。なお、上述した解析結果生成や診断結果生成は、上述した物体認識結果やセグメンテーション結果を利用することで得られたものであってもよい。例えば、物体認識やセグメンテーションの処理により得た注目部位に対して解析結果生成や診断結果生成の処理を行ってもよい。 In addition, in the report screen in the various embodiments and modifications described above, object recognition results (object detection results) and segmentation results such as the above-mentioned areas of interest, artifacts, and abnormal areas may be displayed. In this case, for example, a rectangular frame or the like may be superimposed around the object on the image. Also, for example, a color or the like may be superimposed on the object in the image. Note that the object recognition results and segmentation results may be generated using a trained model obtained by learning training data in which information indicating object recognition or segmentation is used as correct answer data and labeled on a medical image. Note that the above-mentioned analysis result generation and diagnosis result generation may be obtained by using the above-mentioned object recognition result and segmentation result. For example, the analysis result generation and diagnosis result generation process may be performed on the area of interest obtained by the object recognition or segmentation process.

また、上述した学習済モデルは、被検者の所定部位の異なる種類の複数の医用画像をセットとする入力データを含む学習データにより学習して得た学習済モデルであっても良い。このとき、学習データに含まれる入力データとして、例えば、眼底のモーションコントラスト正面画像及び輝度正面画像（あるいは輝度断層画像）をセットとするデータ等が考えられる。また、学習データに含まれる入力データとして、例えば、眼底の断層画像（Ｂスキャン画像）及びカラー眼底画像（あるいは蛍光眼底画像）をセットとするデータ等も考えられる。また、異なる種類の複数の医療画像は、異なるモダリティ、異なる光学系、又は異なる原理等により取得されたものであれば何でも良い。また、上述した学習済モデルは、被検者の異なる部位の複数の医用画像をセットとする入力データを含む学習データにより学習して得た学習済モデルであっても良い。このとき、学習データに含まれる入力データとして、例えば、眼底の断層画像（Ｂスキャン画像）と前眼部の断層画像（Ｂスキャン画像）とをセットとするデータ等が考えられる。また、学習データに含まれる入力データとして、例えば、眼底の黄斑の３次元ＯＣＴ画像（３次元断層画像）と眼底の視神経乳頭のサークルスキャン（またはラスタスキャン）断層画像とをセットとするデータ等も考えられる。なお、学習データに含まれる入力データは、被検者の異なる部位及び異なる種類の複数の医用画像であっても良い。このとき、学習データに含まれる入力データは、例えば、前眼部の断層画像とカラー眼底画像とをセットとする入力データ等が考えられる。また、上述した学習済モデルは、被検者の所定部位の異なる撮影画角の複数の医用画像をセットとする入力データを含む学習データにより学習して得た学習済モデルであっても良い。また、学習データに含まれる入力データは、パノラマ画像のように、所定部位を複数領域に時分割して得た複数の医用画像を貼り合わせたものであっても良い。このとき、パノラマ画像のような広画角画像を学習データとして用いることにより、狭画角画像よりも情報量が多い等の理由から画像の特徴量を精度良く取得できる可能性があるため、各処理の結果を向上することができる。また、学習データに含まれる入力データは、被検者の所定部位の異なる日時の複数の医用画像をセットとする入力データであっても良い。 The above-mentioned trained model may be a trained model obtained by training with training data including input data that includes a set of multiple medical images of different types of a specific part of the subject. In this case, the input data included in the training data may be, for example, data that includes a set of a motion contrast front image and a luminance front image (or a luminance tomographic image) of the fundus. In addition, the input data included in the training data may be, for example, data that includes a set of a tomographic image (B-scan image) of the fundus and a color fundus image (or a fluorescent fundus image). In addition, the multiple medical images of different types may be anything obtained by different modalities, different optical systems, or different principles. 
In addition, the trained model described above may be obtained by learning from training data whose input data is a set of multiple medical images of different sites of the subject. In this case, the input data may be, for example, a set of a tomographic image (B-scan image) of the fundus and a tomographic image (B-scan image) of the anterior eye segment. The input data may also be, for example, a set of a three-dimensional OCT image (three-dimensional tomographic image) of the macula of the fundus and a circle-scan (or raster-scan) tomographic image of the optic nerve head of the fundus. The input data included in the training data may also be multiple medical images of different sites and different types of the subject. In this case, the input data may be, for example, a set of a tomographic image of the anterior eye segment and a color fundus image. The trained model described above may also be obtained by learning from training data whose input data is a set of multiple medical images of a predetermined site of the subject captured at different angles of view. The input data may also be an image, such as a panoramic image, obtained by stitching together multiple medical images acquired by time-dividing a predetermined site into multiple regions. Using a wide-angle image such as a panoramic image as training data may allow image features to be acquired more accurately, for reasons such as its containing more information than a narrow-angle image, and so may improve the results of each process. Furthermore, the input data included in the training data may be a set of multiple medical images of a predetermined site of the subject taken at different dates and times.

また、上述した解析結果と診断結果と物体認識結果とセグメンテーション結果とのうち少なくとも１つの結果が表示される表示画面は、レポート画面に限らない。このような表示画面は、例えば、撮影確認画面、経過観察用の表示画面、及び撮影前の各種調整用のプレビュー画面（各種のライブ動画像が表示される表示画面）等の少なくとも１つの表示画面に表示されても良い。例えば、上述した学習済モデルを用いて得た上記少なくとも１つの結果を撮影確認画面に表示させることにより、検者は、撮影直後であっても精度の良い結果を確認することができる。また、上述した低画質画像と高画質画像との表示の変更は、例えば、低画質画像の解析結果と高画質画像の解析結果との表示の変更であっても良い。 In addition, the display screen on which at least one of the above-mentioned analysis results, diagnosis results, object recognition results, and segmentation results is displayed is not limited to a report screen. Such a display screen may be displayed on at least one of the following display screens: an image capture confirmation screen, a display screen for follow-up observation, and a preview screen for various adjustments before image capture (a display screen on which various live moving images are displayed). For example, by displaying at least one of the above results obtained using the above-mentioned trained model on the image capture confirmation screen, the examiner can confirm the results with high accuracy even immediately after image capture. In addition, the change in display between the low-quality image and the high-quality image may be, for example, a change in display between the analysis results of the low-quality image and the analysis results of the high-quality image.

（変形例４）
上述した様々な実施形態及び変形例におけるプレビュー画面において、ライブ動画像の少なくとも１つのフレーム毎に上述した学習済モデルが用いられるように構成されても良い。このとき、プレビュー画面において、異なる部位や異なる種類の複数のライブ動画像が表示されている場合には、各ライブ動画像に対応する学習済モデルが用いられるように構成されても良い。これにより、例えば、ライブ動画像であっても、処理時間を短縮することができるため、検者は撮影開始前に精度の高い情報を得ることができる。このため、例えば、再撮影の失敗等を低減することができるため、診断の精度や効率を向上させることができる。なお、複数のライブ動画像は、例えば、ＸＹＺ方向のアライメントのための前眼部の動画像、眼底観察光学系のフォーカス調整やＯＣＴフォーカス調整のための眼底の正面動画像、及びＯＣＴのコヒーレンスゲート調整（測定光路長と参照光路長との光路長差の調整）のための眼底の断層動画像等の少なくとも１つの動画像であってもよい。 (Variation 4)
In the preview screen in the various embodiments and modifications described above, the trained model described above may be used for each unit of at least one frame of the live moving image. In this case, when multiple live moving images of different parts or different types are displayed on the preview screen, the trained model corresponding to each live moving image may be used. As a result, the processing time can be shortened even for live moving images, for example, so that the examiner can obtain highly accurate information before starting image capture. This can reduce, for example, failures in re-imaging and the like, and can therefore improve the accuracy and efficiency of diagnosis. Note that the multiple live moving images may be at least one of, for example, a moving image of the anterior segment for alignment in the XYZ directions, a front moving image of the fundus for focus adjustment of the fundus observation optical system or OCT focus adjustment, and a tomographic moving image of the fundus for OCT coherence gate adjustment (adjustment of the optical path length difference between the measurement optical path length and the reference optical path length).

また、上述した学習済モデルを適用可能な動画像は、ライブ動画像に限らず、例えば、記憶部に記憶（保存）された動画像であっても良い。このとき、例えば、記憶部に記憶（保存）された眼底の断層動画像の少なくとも１つのフレーム毎に位置合わせして得た動画像が表示画面に表示されても良い。例えば、硝子体を好適に観察したい場合には、まず、フレーム上に硝子体ができるだけ存在する等の条件を基準とする基準フレームを選択してもよい。このとき、各フレームは、ＸＺ方向の断層画像（Ｂスキャン像）である。そして、選択された基準フレームに対して他のフレームがＸＺ方向に位置合わせされた動画像が表示画面に表示されても良い。このとき、例えば、動画像の少なくとも１つのフレーム毎に高画質化エンジンにより順次生成された高画質画像（高画質フレーム）を連続表示させるように構成させても良い。ここで、各種の調整中では、被検眼の網膜等の撮影対象がまだ上手く撮像できていない可能性がある。このため、学習済モデルに入力される医用画像と学習データとして用いられた医用画像との違いが大きいために、精度良く高画質画像が得られない可能性がある。そこで、断層画像（Ｂスキャン）の画質評価等の評価値が閾値を超えたら、高画質動画像の表示（高画質フレームの連続表示）を自動的に開始するように構成しても良い。また、断層画像（Ｂスキャン）の画質評価等の評価値が閾値を超えたら、高画質化ボタンを検者が指定可能な状態（アクティブ状態）に変更するように構成されても良い。また、走査パターン等が異なる撮影モード毎に異なる高画質化エンジンを用意して、選択された撮影モードに対応する高画質化エンジンが選択されるように構成されても良い。また、異なる撮影モードで得た様々な医用画像を含む学習データを学習して得た１つの高画質化エンジンが用いられても良い。なお、上述したフレーム間の位置合わせの手法としては、Ｘ方向の位置合わせの手法とＺ方向（深度方向）の位置合わせの手法とは、同じ手法が適用されてもよいし、全て異なる手法が適用されてもよい。また、同一方向の位置合わせは、異なる手法で複数回行われても良く、例えば、粗い位置合わせを行った後に、精密な位置合わせが行われてもよい。また、位置合わせの手法としては、例えば、断層画像（Ｂスキャン像）をセグメンテーション処理して得た網膜層境界を用いた（Ｚ方向の粗い）位置合わせ、断層画像を分割して得た複数の領域と基準画像との相関情報（類似度）を用いた（Ｘ方向やＺ方向の精密な）位置合わせ、断層画像（Ｂスキャン像）毎に生成した１次元投影像を用いた（Ｘ方向の）位置合わせ、２次元正面画像を用いた（Ｘ方向の）位置合わせ等がある。また、ピクセル単位で粗く位置合わせが行われてから、サブピクセル単位で精密な位置合わせが行われるように構成されてもよい。 In addition, the moving images to which the learned model can be applied are not limited to live moving images, but may be, for example, moving images stored (saved) in a storage unit. At this time, for example, a moving image obtained by aligning at least one frame of a tomographic moving image of the fundus stored (saved) in the storage unit may be displayed on the display screen. For example, when it is desired to preferably observe the vitreous body, a reference frame may first be selected based on a condition such that the vitreous body is present as much as possible on the frame. At this time, each frame is a tomographic image (B-scan image) in the XZ direction. 
Then, a moving image in which the other frames are aligned in the XZ direction with respect to the selected reference frame may be displayed on the display screen. In this case, for example, high-quality images (high-quality frames) generated sequentially by the image quality improvement engine for each unit of at least one frame of the moving image may be displayed continuously. Here, during the various adjustments, the imaging target, such as the retina of the subject's eye, may not yet be imaged well. In that case, the medical image input to the trained model differs greatly from the medical images used as training data, so a high-quality image may not be obtained accurately. Therefore, the display of the high-quality moving image (continuous display of high-quality frames) may be started automatically once an evaluation value, such as an image quality evaluation of the tomographic image (B-scan), exceeds a threshold. Alternatively, once such an evaluation value exceeds the threshold, the image quality improvement button may be changed to a state in which the examiner can select it (active state). A different image quality improvement engine may also be prepared for each imaging mode having a different scanning pattern or the like, and the image quality improvement engine corresponding to the selected imaging mode may be selected. Alternatively, a single image quality improvement engine obtained by training with training data including various medical images obtained in different imaging modes may be used. As for the method of alignment between the above-mentioned frames, the same method may be applied to alignment in the X direction and alignment in the Z direction (depth direction), or entirely different methods may be applied. 
Alignment in the same direction may also be performed multiple times using different methods; for example, coarse alignment may be followed by precise alignment. Examples of alignment methods include (coarse, Z-direction) alignment using retinal layer boundaries obtained by segmenting a tomographic image (B-scan image); (precise, X-direction or Z-direction) alignment using correlation information (similarity) between a reference image and multiple regions obtained by dividing a tomographic image; (X-direction) alignment using a one-dimensional projection image generated for each tomographic image (B-scan image); and (X-direction) alignment using a two-dimensional front image. Coarse alignment may also be performed in units of pixels, followed by precise alignment in units of sub-pixels.
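
The coarse-then-precise alignment described above can be illustrated with a minimal sketch. The sketch assumes that each frame has already been reduced to a one-dimensional intensity profile (for example, a one-dimensional projection of a B-scan) and uses a plain unnormalized cross-correlation followed by a three-point parabolic fit. The function names `xcorr`, `coarse_shift`, and `fine_shift` and the `max_shift` parameter are hypothetical and do not appear in the specification, which does not prescribe a particular similarity measure or interpolation.

```python
def xcorr(ref, mov, d):
    """Unnormalized cross-correlation, assuming mov is ref displaced by d samples."""
    total = 0.0
    for i, m in enumerate(mov):
        j = i - d
        if 0 <= j < len(ref):
            total += ref[j] * m
    return total


def coarse_shift(ref, mov, max_shift):
    """Pixel-level estimate: integer displacement with the highest correlation."""
    return max(range(-max_shift, max_shift + 1), key=lambda d: xcorr(ref, mov, d))


def fine_shift(ref, mov, max_shift):
    """Sub-pixel estimate: parabola fitted through the correlation peak and its neighbors."""
    d0 = coarse_shift(ref, mov, max_shift)
    s_prev, s_0, s_next = (xcorr(ref, mov, d0 + k) for k in (-1, 0, 1))
    denom = s_prev - 2.0 * s_0 + s_next
    if denom >= 0:  # no strict local maximum, so keep the pixel-level estimate
        return float(d0)
    return d0 + 0.5 * (s_prev - s_next) / denom


reference = [0, 0, 1, 4, 1, 0, 0, 0, 0, 0]
moving = [0, 0, 0, 0, 1, 4, 1, 0, 0, 0]  # reference displaced by two samples
print(coarse_shift(reference, moving, 4), fine_shift(reference, moving, 4))
```

In practice the same two-stage idea would be applied per direction (X or Z) and per region of the tomographic image; a normalized similarity measure would likely be preferable when frame brightness varies.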

（変形例５）
上述した様々な実施形態及び変形例においては、学習済モデルが追加学習中である場合、追加学習中の学習済モデル自体を用いて出力（推論・予測）することが難しい可能性がある。このため、追加学習中の学習済モデルに対する医用画像の入力を禁止することが良い。また、追加学習中の学習済モデルと同じ学習済モデルをもう一つ予備の学習済モデルとして用意しても良い。このとき、追加学習中には、予備の学習済モデルに対して医用画像の入力が実行できるようにすることが良い。そして、追加学習が完了した後に、追加学習後の学習済モデルを評価し、問題なければ、予備の学習済モデルから追加学習後の学習済モデルに置き換えれば良い。また、問題があれば、予備の学習済モデルが用いられるようにしても良い。 (Variation 5)
In the various embodiments and modifications described above, when a trained model is undergoing additional training, it may be difficult to produce output (inference or prediction) using that same trained model. For this reason, it is preferable to prohibit the input of medical images to a trained model that is undergoing additional training. In addition, another trained model identical to the trained model undergoing additional training may be prepared as a spare trained model. In this case, it is preferable that medical images can be input to the spare trained model during the additional training. Then, after the additional training is completed, the trained model after the additional training is evaluated, and if there is no problem, the spare trained model may be replaced with the trained model after the additional training. If there is a problem, the spare trained model may continue to be used.
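
The handling of a spare trained model during additional training can be sketched as follows. This is only an illustrative outline under stated assumptions: a model is represented as any Python callable, and the class `ModelSlot`, the `copy_fn` and `evaluate` callbacks, and the `min_score` acceptance threshold are hypothetical names introduced here, not part of the specification.

```python
class ModelSlot:
    """Keeps inference on a stable model while a copy undergoes additional training."""

    def __init__(self, model):
        self.active = model      # model that answers inference requests
        self.candidate = None    # copy currently undergoing additional training

    def begin_additional_training(self, copy_fn):
        # The copy is trained; the active (spare) model keeps serving requests.
        self.candidate = copy_fn(self.active)
        return self.candidate

    def infer(self, image):
        # Medical images are never routed to the model under additional training.
        return self.active(image)

    def finish_additional_training(self, evaluate, min_score):
        # Swap in the retrained model only if its evaluation is acceptable.
        candidate, self.candidate = self.candidate, None
        if candidate is not None and evaluate(candidate) >= min_score:
            self.active = candidate
            return True
        return False  # a problem was found: the previous model stays in use
```

A caller would invoke `begin_additional_training`, run the training on the returned copy, and then call `finish_additional_training` with whatever evaluation procedure is appropriate for the device.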

また、撮影部位毎に学習して得た学習済モデルを選択的に利用できるようにしても良い。具体的には、第１の撮影部位（肺、被検眼等）を含む学習データを用いて得た第１の学習済モデルと、第１の撮影部位とは異なる第２の撮影部位を含む学習データを用いて得た第２の学習済モデルと、を含む複数の学習済モデルを用意することができる。そして、これら複数の学習済モデルのいずれかを選択するように構成されても良い。このとき、検者からの指示に応じて、選択された学習済モデルに対応する撮影部位（ヘッダの情報や、検者により手動入力されたもの）と該撮影部位の撮影画像とがペアとなるデータを（例えば、病院や研究所等の外部施設のサーバ等からネットワークを介して）検索し、検索して得たデータを学習データとする学習を、選択された学習済モデルに対して追加学習として実行する制御手段と、を有しても良い。これにより、学習済モデルに対応する撮影部位の撮影画像を用いて、撮影部位毎に効率的に追加学習することができる。 In addition, a trained model obtained by training for each imaging site may be selectively used. Specifically, a plurality of trained models including a first trained model obtained using training data including a first imaging site (lungs, examined eye, etc.) and a second trained model obtained using training data including a second imaging site different from the first imaging site may be prepared. Then, one of the plurality of trained models may be selected. In this case, a control means may be provided that searches for data (for example, via a network from a server of an external facility such as a hospital or research institute) in which the imaging site corresponding to the selected trained model (header information or information manually input by the examiner) and the image of the imaging site are paired in response to an instruction from the examiner, and performs training using the searched data as training data for the selected trained model as additional learning. This allows efficient additional learning for each imaging site using the image of the imaging site corresponding to the trained model.

また、追加学習用の学習データを、病院や研究所等の外部施設のサーバ等からネットワークを介して取得する際には、改ざんや、追加学習時のシステムトラブル等による信頼性低下を低減したい。そこで、デジタル署名やハッシュ化による一致性の確認を行うことで、追加学習用の学習データの正当性を検出しても良い。これにより、追加学習用の学習データを保護することができる。このとき、デジタル署名やハッシュ化による一致性の確認した結果として、追加学習用の学習データの正当性が検出できなかった場合には、その旨の警告を行い、その学習データによる追加学習を行わない。なお、サーバは、その設置場所を問わず、例えば、クラウドサーバ、フォグサーバ、エッジサーバ等のどのような形態でもよい。 In addition, when acquiring training data for additional learning via a network from a server at an external facility such as a hospital or research institute, it is desirable to reduce deterioration in reliability due to tampering or system trouble during additional learning. Therefore, the validity of the training data for additional learning may be detected by checking the consistency using a digital signature or hashing. This makes it possible to protect the training data for additional learning. In this case, if the validity of the training data for additional learning cannot be detected as a result of checking the consistency using a digital signature or hashing, a warning to that effect is issued and additional learning using that training data is not performed. The server may be installed anywhere and may take any form, such as a cloud server, fog server, edge server, etc.
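
As one possible illustration of the consistency check by hashing, the following sketch compares the SHA-256 digest of received training data with a digest obtained through a trusted channel, and skips additional training with a warning on mismatch. The function names and the `warn` callback are hypothetical. Verification of a digital signature is not shown; it would additionally require a signature scheme and key management, which the specification does not detail.

```python
import hashlib
import hmac


def sha256_hex(data: bytes) -> str:
    """Hex-encoded SHA-256 digest of the raw training-data bytes."""
    return hashlib.sha256(data).hexdigest()


def verify_training_data(payload: bytes, expected_digest: str) -> bool:
    """True when the received bytes match the digest published by the sender."""
    # compare_digest avoids leaking the position of the first mismatch via timing.
    return hmac.compare_digest(sha256_hex(payload), expected_digest)


def accept_for_additional_training(payload, expected_digest, warn=print):
    """Warn and refuse the data when its validity cannot be confirmed."""
    if not verify_training_data(payload, expected_digest):
        warn("Training data failed the integrity check; additional training skipped.")
        return False
    return True
```

A hash alone only detects that the data differs from what the digest describes; it does not by itself prove who produced the data, which is the role of the digital signature mentioned above.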

（変形例６）
上述した様々な実施形態及び変形例において、検者からの指示は、手動による指示（例えば、ユーザーインターフェース等を用いた指示）以外にも、音声等による指示であっても良い。このとき、例えば、機械学習により得た音声認識エンジン（音声認識モデル、文字認識用の学習済モデル）を含む機械学習エンジンが用いられても良い。また、手動による指示は、キーボードやタッチパネル等を用いた文字入力等による指示であっても良い。このとき、例えば、機械学習により得た文字認識エンジン（文字認識モデル、文字認識用の学習済モデル）を含む機械学習エンジンが用いられても良い。また、検者からの指示は、ジェスチャー等による指示であっても良い。このとき、機械学習により得たジェスチャー認識エンジン（ジェスチャー認識モデル、ジェスチャー認識用の学習済モデル）を含む機械学習エンジンが用いられても良い。また、検者からの指示は、モニタ上の検者の視線検出結果等であってもよい。視線検出結果は、例えば、モニタ周辺から撮影して得た検者の動画像を用いた瞳孔検出結果であってもよい。このとき、動画像からの瞳孔検出は、上述したような物体認識エンジンを用いてもよい。また、検者からの指示は、脳波、体を流れる微弱な電気信号等による指示であってもよい。このような場合、例えば、学習データとしては、上述したような種々の学習済モデルの処理による結果の表示の指示を示す文字データまたは音声データ（波形データ）等を入力データとし、種々の学習済モデルの処理による結果等を実際に表示部に表示させるための実行命令を正解データとする学習データであってもよい。また、学習データとしては、例えば、高画質化用の学習済モデルで得た高画質画像の表示の指示を示す文字データまたは音声データ等を入力データとし、高画質画像の表示の実行命令及びボタン３４２０をアクティブ状態に変更するための実行命令を正解データとする学習データであってもよい。もちろん、学習データとしては、例えば、文字データまたは音声データ等が示す指示内容と実行命令内容とが互いに対応するものであれば何でも良い。また、音響モデルや言語モデル等を用いて、音声データから文字データに変換してもよい。また、複数のマイクで得た波形データを用いて、音声データに重畳しているノイズデータを低減する処理を行ってもよい。また、文字または音声等による指示と、マウス、タッチパネル等による指示とを、検者からの指示に応じて選択可能に構成されてもよい。また、文字または音声等による指示のオン・オフを、検者からの指示に応じて選択可能に構成されてもよい。 (Variation 6)
In the various embodiments and modifications described above, the instruction from the examiner may be an instruction by voice or the like, in addition to a manual instruction (e.g., an instruction using a user interface or the like). In this case, for example, a machine learning engine including a speech recognition engine (speech recognition model, trained model for character recognition) obtained by machine learning may be used. The manual instruction may be an instruction by character input or the like using a keyboard, touch panel, or the like. In this case, for example, a machine learning engine including a character recognition engine (character recognition model, trained model for character recognition) obtained by machine learning may be used. The instruction from the examiner may also be an instruction by gesture or the like. In this case, a machine learning engine including a gesture recognition engine (gesture recognition model, trained model for gesture recognition) obtained by machine learning may be used. The instruction from the examiner may also be a result of detecting the examiner's line of sight on the monitor, or the like. The line-of-sight detection result may be, for example, a pupil detection result obtained using a moving image of the examiner captured from around the monitor. In this case, an object recognition engine as described above may be used to detect the pupil from the moving image. The instruction from the examiner may also be an instruction by brain waves, weak electrical signals flowing through the body, or the like. In such cases, the training data may be, for example, training data in which character data or voice data (waveform data) indicating an instruction to display the results of processing by the various trained models described above is the input data, and an execution command for actually displaying those results on the display unit is the correct answer data. 
The training data may also be, for example, training data in which character data or voice data indicating an instruction to display a high-quality image obtained by a trained model for image quality improvement is the input data, and an execution command for displaying the high-quality image and an execution command for changing the button 3420 to an active state are the correct answer data. Of course, the training data may be anything as long as the instruction content indicated by the character data, voice data, or the like corresponds to the content of the execution command. The voice data may be converted into character data using an acoustic model, a language model, or the like. Processing for reducing noise data superimposed on the voice data may also be performed using waveform data obtained by multiple microphones. The device may be configured so that an instruction by characters, voice, or the like and an instruction by a mouse, touch panel, or the like can be selected according to an instruction from the examiner. Furthermore, the device may be configured so that instructions by characters, voice, or the like can be turned on or off according to an instruction from the examiner.

ここで、機械学習には、上述したような深層学習があり、また、多階層のニューラルネットワークの少なくとも一部には、例えば、再帰型ニューラルネットワーク（ＲＮＮ：ＲｅｃｕｒｒｅｒｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）を用いることができる。ここで、本変形例に係る機械学習エンジンの一例として、時系列情報を扱うニューラルネットワークであるＲＮＮに関して、図３６を参照して説明する。また、ＲＮＮの一種であるＬｏｎｇｓｈｏｒｔ－ｔｅｒｍｍｅｍｏｒｙ（以下、ＬＳＴＭ）に関して、図３７を参照して説明する。図３６（ａ）は、機械学習エンジンであるＲＮＮの構造を示す。ＲＮＮ３５２０は、ネットワークにループ構造を持ち、時刻ｔにおいてデータｘ^ｔ３５１０を入力し、データｈ^ｔ３５３０を出力する。ＲＮＮ３５２０はネットワークにループ機能を持つため、現時刻の状態を次の状態に引き継ぐことが可能であるため、時系列情報を扱うことができる。図３６（ｂ）には時刻ｔにおけるパラメータベクトルの入出力の一例を示す。データｘ^ｔ３５１０にはＮ個（Ｐａｒａｍｓ１～ＰａｒａｍｓＮ）のデータが含まれる。また、ＲＮＮ３５２０より出力されるデータｈ^ｔ３５３０には入力データに対応するＮ個（Ｐａｒａｍｓ１～ＰａｒａｍｓＮ）のデータが含まれる。しかし、ＲＮＮでは誤差逆伝搬時に長期時間の情報を扱うことができないため、ＬＳＴＭが用いられることがある。ＬＳＴＭは、忘却ゲート、入力ゲート、出力ゲートを備えることで長期時間の情報を学習することができる。ここで、図３７（ａ）にＬＳＴＭの構造を示す。ＬＳＴＭ３５４０において、ネットワークが次の時刻ｔに引き継ぐ情報は、セルと呼ばれるネットワークの内部状態ｃ^ｔ－１と出力データｈ^ｔ－１である。なお、図の小文字（ｃ、ｈ、ｘ）はベクトルを表している。次に、図３７（ｂ）にＬＳＴＭ３５４０の詳細を示す。図３７（ｂ）において、ＦＧは忘却ゲートネットワーク、ＩＧは入力ゲートネットワーク、ＯＧは出力ゲートネットワークを示し、それぞれはシグモイド層である。そのため、各要素が０から１の値となるベクトルを出力する。忘却ゲートネットワークＦＧは過去の情報をどれだけ保持するかを決め、入力ゲートネットワークＩＧはどの値を更新するかを判定するものである。ＣＵは、セル更新候補ネットワークであり、活性化関数ｔａｎｈ層である。これは、セルに加えられる新たな候補値のベクトルを作成する。出力ゲートネットワークＯＧは、セル候補の要素を選択し次の時刻にどの程度の情報を伝えるか選択する。なお、上述したＬＳＴＭのモデルは基本形であるため、ここで示したネットワークに限らない。ネットワーク間の結合を変更してもよい。ＬＳＴＭではなく、ＱＲＮＮ（ＱｕａｓｉＲｅｃｕｒｒｅｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）を用いてもよい。さらに、機械学習エンジンは、ニューラルネットワークに限定されるものではなく、ブースティングやサポートベクターマシン等が用いられてもよい。また、検者からの指示が文字または音声等による入力の場合には、自然言語処理に関する技術（例えば、ＳｅｑｕｅｎｃｅｔｏＳｅｑｕｅｎｃｅ）が適用されてもよい。また、検者に対して文字または音声等による出力で応答する対話エンジン（対話モデル、対話用の学習済モデル）が適用されてもよい。 Here, the machine learning includes deep learning as described above, and at least a part of the multi-layered neural network can be, for example, a recurrent neural network (RNN). Here, as an example of the machine learning engine according to this modification, an RNN, which is a neural network that handles time-series information, will be described with reference to FIG. 36. Also, a long short-term memory (hereinafter, LSTM), which is a type of RNN, will be described with reference to FIG. 37. FIG. 36(a) shows the structure of an RNN, which is a machine learning engine. 
The RNN 3520 has a loop structure in the network; at time t it receives data x^t 3510 as input and outputs data h^t 3530. Because the RNN 3520 has this loop function, it can carry the state at the current time over to the next state, and can therefore handle time-series information. FIG. 36(b) shows an example of the input and output of parameter vectors at time t. The data x^t 3510 includes N pieces of data (Params1 to ParamsN). The data h^t 3530 output from the RNN 3520 includes N pieces of data (Params1 to ParamsN) corresponding to the input data. However, since an RNN cannot handle long-term information during error backpropagation, an LSTM may be used. The LSTM can learn long-term information by including a forget gate, an input gate, and an output gate. The structure of the LSTM is shown in FIG. 37(a). In the LSTM 3540, the information that the network carries over to the next time t is the internal state c^(t-1) of the network, called the cell, and the output data h^(t-1). The lowercase letters (c, h, x) in the figure represent vectors. Next, the details of the LSTM 3540 are shown in FIG. 37(b). In FIG. 37(b), FG denotes a forget gate network, IG denotes an input gate network, and OG denotes an output gate network, each of which is a sigmoid layer. Each therefore outputs a vector whose elements take values between 0 and 1. The forget gate network FG determines how much past information is retained, and the input gate network IG determines which values are updated. CU is a cell update candidate network and is a tanh activation layer. It creates a vector of new candidate values to be added to the cell. The output gate network OG selects elements of the cell candidates and determines how much information is passed on to the next time. Note that the LSTM model described above is a basic form, and the network is not limited to the one shown here. The connections between the networks may be changed. 
Instead of LSTM, QRNN (Quasi Recurrent Neural Network) may be used. Furthermore, the machine learning engine is not limited to a neural network, and boosting, support vector machines, etc. may be used. In addition, when the examiner's instructions are input by characters or voice, a technology related to natural language processing (e.g., sequence to sequence) may be applied. In addition, a dialogue engine (dialogue model, trained model for dialogue) that responds to the examiner by outputting characters or voice may be applied.
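
The gate structure described for the LSTM can be written out as a single time step. The sketch below follows the commonly used basic LSTM formulation and assumes that FIG. 37(b) corresponds to it; the figure itself is not reproduced here, so the correspondence is an assumption. The dictionary keys reuse the labels FG, IG, OG, and CU from the text, while `lstm_step`, `affine`, and the parameter layout (a weight matrix and bias per gate, applied to the concatenation of the previous output and the current input) are hypothetical choices made for illustration.

```python
import math


def sigmoid(v):
    """Element-wise logistic function; every output lies between 0 and 1."""
    return [1.0 / (1.0 + math.exp(-a)) for a in v]


def affine(W, b, v):
    """Matrix-vector product plus bias, with W given as a list of rows."""
    return [sum(w * a for w, a in zip(row, v)) + bias for row, bias in zip(W, b)]


def lstm_step(x_t, h_prev, c_prev, params):
    """One LSTM time step; params maps 'FG', 'IG', 'OG', 'CU' to (W, b) pairs."""
    z = h_prev + x_t                                      # concatenate previous output and input
    f = sigmoid(affine(*params["FG"], z))                 # forget gate: how much of the cell to keep
    i = sigmoid(affine(*params["IG"], z))                 # input gate: which values to update
    o = sigmoid(affine(*params["OG"], z))                 # output gate: how much to pass on
    g = [math.tanh(a) for a in affine(*params["CU"], z)]  # cell update candidates
    c_t = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h_t = [ok * math.tanh(ck) for ok, ck in zip(o, c_t)]
    return h_t, c_t
```

Calling `lstm_step` repeatedly, feeding each returned pair back in as `h_prev` and `c_prev`, processes a time series such as a sequence of voice or character features.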

（変形例７）
上述した様々な実施形態及び変形例は、以下の各々については少なくとも含むものであり、また、以下の各々の様々な組み合わせを技術的に矛盾のない範囲で少なくとも含むものである。なお、以下における機械学習は、例えば、上述したような様々な学習が適用可能である。また、以下における少なくとも一部の領域は、例えば、上述した部分領域であり、矩形領域等である。 (Variation 7)
The various embodiments and modifications described above include at least each of the following, and also include at least various combinations of the following to the extent that there is no technical contradiction. Note that, for example, the various kinds of learning described above are applicable as the machine learning referred to below. Also, the "at least a part of the region" referred to below is, for example, a partial region as described above, such as a rectangular region.

まず、高画質化エンジンは、被検者の所定部位の２次元の医用画像を用いて２次元の高画質画像を生成するために機械学習を行う機械学習エンジンを含むものであっても良い。このとき、高画質化エンジンは、２次元の医用画像の少なくとも一部の領域を含む学習データを学習して得た学習済モデルであっても良い。例えば、高画質化エンジンは、被検眼の第１の深度範囲の第１の正面画像の少なくとも一部の領域を含む学習データを学習して得た学習済モデルであっても良い。また、他の高画質化エンジンとして、被検眼の第２の深度範囲であって、第１の深度範囲とは少なくとも一部の範囲が異なる第２の深度範囲の第２の正面画像の少なくとも一部の領域を含む学習データを学習して得た他の学習済モデルが生成されても良い。すなわち、第２の正面画像の特徴量と第１の正面画像の特徴量とが比較的異なる場合には、第１の学習済モデルが生成されるだけでなく、第２の学習済モデルも生成されても良い。これにより、例えば、複数の学習済モデルが複数の医用画像に応じて選択的に用いることができる。このため、特徴量が互いに比較的異なる複数の医用画像を精度良く高画質化することができる。なお、これらの特徴量が比較的類似する場合には、第２の学習済モデルが生成されなくても良く、第１の正面画像と第２の正面画像とを学習データとして学習して得た共通の学習済モデルが生成されれば良い。 First, the high image quality engine may include a machine learning engine that performs machine learning to generate a two-dimensional high-image quality image using a two-dimensional medical image of a predetermined part of the subject. In this case, the high image quality engine may be a trained model obtained by training training data including at least a part of the area of the two-dimensional medical image. For example, the high image quality engine may be a trained model obtained by training training data including at least a part of the area of a first front image in a first depth range of the subject's eye. In addition, as another high image quality engine, another trained model obtained by training training data including at least a part of the area of a second front image in a second depth range of the subject's eye, which is at least a part of the area of a second front image in a second depth range that is different from the first depth range, may be generated. That is, when the feature amount of the second front image and the feature amount of the first front image are relatively different, not only the first trained model may be generated, but also the second trained model may be generated. As a result, for example, multiple trained models can be selectively used according to multiple medical images. 
Therefore, the image quality of multiple medical images whose feature amounts are relatively different from each other can be improved accurately. If these feature amounts are relatively similar, a second trained model need not be generated; it is sufficient to generate a common trained model obtained by training with the first front image and the second front image as training data.

また、高画質化エンジンは、被検者の所定部位の３次元の医用画像を用いて３次元の高画質画像を生成するために機械学習を行う機械学習エンジンを含むものであっても良い。このとき、高画質化エンジンは、３次元の医用画像の少なくとも一部の領域を含む学習データを学習して得た学習済モデルであっても良い。ここで、３次元の医用画像が、異なる位置の複数の２次元の医用画像により構成される場合を考える。このとき、例えば、Ｂスキャン画像は、ＸＺ平面の断層画像であり、異なる位置はＹ方向になる。この場合、学習データや学習済モデルに入力されるデータは、ＸＺ方向における位置ずれが補正（位置合わせ）された複数の２次元の医用画像により構成された３次元の医用画像であっても良い。また、学習済モデルを用いて３次元の医用画像から３次元の高画質画像を生成する場合、２次元の医用画像よりも処理時間がかかるため、例えば、高速処理が可能なサーバで処理するように構成されても良い。この場合には、撮影装置で得た医用画像データをクライアントからサーバに送信し、サーバで学習済モデルを用いた処理後に、処理後のデータをサーバからクライアントに送信するように構成されても良い。なお、サーバは、その設置場所を問わず、例えば、クラウドサーバ、フォグサーバ、エッジサーバ等のどのような形態でもよい。また、上述した複数の２次元の医用画像の位置合わせの手法としては、Ｘ方向の位置合わせの手法とＺ方向（深度方向）の位置合わせの手法とは、同じ手法が適用されても良いし、全て異なる手法が適用されても良い。また、同一方向の位置合わせは、異なる手法で複数回行われても良く、例えば、粗い位置合わせを行った後に、精密な位置合わせが行われても良い。また、位置合わせの手法としては、例えば、断層画像（Ｂスキャン像）をセグメンテーション処理して得た網膜層境界を用いた（Ｚ方向の粗い）位置合わせ、断層画像を分割して得た複数の領域と基準画像との相関情報（類似度）を用いた（Ｘ方向やＺ方向の精密な）位置合わせ、断層画像（Ｂスキャン像）毎に生成した１次元投影像を用いた（Ｘ方向の）位置合わせ、２次元正面画像を用いた（Ｘ方向の）位置合わせ等がある。また、ピクセル単位で粗く位置合わせが行われてから、サブピクセル単位で精密な位置合わせが行われるように構成されても良い。 The high image quality engine may also include a machine learning engine that performs machine learning to generate a high-quality three-dimensional image using a three-dimensional medical image of a specific part of the subject. In this case, the high image quality engine may be a trained model obtained by training training data including at least a part of the three-dimensional medical image. Here, consider a case where the three-dimensional medical image is composed of a plurality of two-dimensional medical images at different positions. In this case, for example, the B-scan image is a tomographic image of the XZ plane, and the different positions are in the Y direction. In this case, the data input to the training data or trained model may be a three-dimensional medical image composed of a plurality of two-dimensional medical images whose positional deviations in the XZ direction have been corrected (aligned). 
In addition, when a high-quality three-dimensional image is generated from a three-dimensional medical image using a trained model, processing takes longer than for a two-dimensional medical image, so the processing may be performed, for example, by a server capable of high-speed processing. In this case, the medical image data obtained by the imaging device may be transmitted from the client to the server, processed on the server using the trained model, and the processed data may then be transmitted from the server to the client. The server may be installed anywhere and may take any form, such as a cloud server, a fog server, or an edge server. As for the method of aligning the above-mentioned multiple two-dimensional medical images, the same method may be applied to alignment in the X direction and alignment in the Z direction (depth direction), or entirely different methods may be applied. Alignment in the same direction may also be performed multiple times using different methods; for example, coarse alignment may be followed by precise alignment. Examples of alignment methods include (coarse, Z-direction) alignment using retinal layer boundaries obtained by segmenting a tomographic image (B-scan image); (precise, X-direction or Z-direction) alignment using correlation information (similarity) between a reference image and multiple regions obtained by dividing a tomographic image; (X-direction) alignment using a one-dimensional projection image generated for each tomographic image (B-scan image); and (X-direction) alignment using a two-dimensional front image. Coarse alignment may also be performed in units of pixels, followed by precise alignment in units of sub-pixels.

また、高画質化エンジンは、被検者の所定部位の３次元の医用画像データにおける少なくとも一部の範囲が異なる複数の範囲の２次元の医用画像を含む学習済データを学習して得た学習済モデルであっても良い。例えば、高画質化エンジンは、被検眼の第１の深度範囲の第１の正面画像の少なくとも一部の領域と、第１の深度範囲とは少なくとも一部の範囲が異なる第２の深度範囲の第２の正面画像の少なくとも一部の領域とを含む学習データを学習して得た学習済モデルであっても良い。すなわち、高画質化エンジンは、被検者の所定部位の３次元の医用画像データを用いて得た複数の医用画像であって、特徴量が互いに異なる複数の医用画像を含む学習データを学習して得た学習済モデルであっても良い。これにより、高画質化エンジンは、例えば、互いに異なる複数の特徴量に対して抽象度の高い特徴量を学習結果として得ることができる。このため、例えば、複数の特徴量とは異なる特徴量の医用画像であっても、抽出された抽象度の高い特徴量が適用可能な範囲内であれば、比較的精度良く高画質化することができる。例えば、第１の深度範囲の第１の正面画像の少なくとも一部の領域と、第２の深度範囲の第２の正面画像の少なくとも一部の領域とを含む学習データを学習して得た学習済モデルを用いて、第１の深度範囲及び第２の深度範囲とは少なくとも一部の範囲が異なる第３の深度範囲の第３の正面画像の少なくとも一部の領域から、高画質画像を精度良く生成することができる。このとき、例えば、第１の深度範囲は、比較的太い血管が分布している表層であり、また、第２の深度範囲は、比較的細い血管が分布している（あるいは血管が分布していない）深層である。このため、第１の正面画像の特徴量と第２の正面画像の特徴量とは、互いに異なる。 The high image quality engine may also be a trained model obtained by training trained data including two-dimensional medical images of a plurality of ranges in which at least a portion of the range is different in three-dimensional medical image data of a predetermined part of the subject. For example, the high image quality engine may be a trained model obtained by training trained data including at least a portion of a first front image of a first depth range of the subject's eye and at least a portion of a second front image of a second depth range in which at least a portion of the range is different from the first depth range. That is, the high image quality engine may be a trained model obtained by training trained data including a plurality of medical images obtained using three-dimensional medical image data of a predetermined part of the subject, the plurality of medical images having different feature amounts. As a result, the high image quality engine can obtain, for example, a feature amount having a high degree of abstraction as a training result for a plurality of different feature amounts. 
Therefore, for example, even for a medical image whose feature amount differs from the plurality of feature amounts, the image quality can be improved with relatively high accuracy as long as the extracted feature amount with a high degree of abstraction is within its applicable range. For example, a trained model obtained by training with training data including at least a part of the region of a first front image in a first depth range and at least a part of the region of a second front image in a second depth range can be used to accurately generate a high-quality image from at least a part of the region of a third front image in a third depth range that differs at least in part from the first and second depth ranges. In this case, for example, the first depth range is a superficial layer in which relatively thick blood vessels are distributed, and the second depth range is a deep layer in which relatively thin blood vessels are distributed (or in which no blood vessels are distributed). The feature amount of the first front image and the feature amount of the second front image therefore differ from each other.

また、学習データに含まれる入力データと正解データ（出力データ）とのセットとしては、低画質画像と高画質画像とのセットであっても良い。例えば、高画質画像は、複数の低画質画像を重ね合わせることにより得たものであっても良い。このとき、重ね合わせることで、高画質画像には、複数の低画質画像において共通しては撮像されなかったが、いずれかには撮像されるような部位が可視化される場合がある。すなわち、低画質画像には存在しない部位が高画質画像には登場する場合がある。このような場合には、高画質化エンジンが、高画質化の特徴量を学習結果として得るだけでなく、存在しない部位を新たに生成するような特徴量も合わせて得てしまう可能性があり、例えば、高画質画像において血管が本当は存在しない領域に偽血管を生成してしまう可能性がある。 The set of input data and correct answer data (output data) included in the learning data may be a set of low-quality images and high-quality images. For example, the high-quality image may be obtained by superimposing multiple low-quality images. In this case, by superimposing, a part that is not commonly captured in multiple low-quality images but is captured in one of them may be visualized in the high-quality image. In other words, a part that does not exist in the low-quality image may appear in the high-quality image. In such a case, the image quality improvement engine may not only obtain the image quality improvement feature as the learning result, but may also obtain a feature that newly generates a non-existent part, for example, there is a possibility that false blood vessels will be generated in an area in the high-quality image where blood vessels do not actually exist.

Therefore, the set of input data and correct answer data included in the learning data may consist of a plurality of medical images whose differences in the parts present on the images are relatively small. For example, it may be a set of a high-quality image to which noise has been added and the high-quality image itself, or a set of a plurality of high-quality images to which mutually different noise has been added. The noise may be of a magnitude at which features such as the parts present on the image are not lost. The noise pattern, in which whether to add noise is determined at random for each pixel, may differ from image to image. The magnitude of the added noise may also differ from image to image. The set may also be a set of a plurality of medical images that differ in the number of superimposed images. This allows the image quality improvement engine to learn, for example, features for improving image quality with high accuracy. Consequently, a high-quality image can be generated accurately from an input low-quality image by using the image quality improvement engine. The input low-quality image may be a medical image to which the various artifact reduction processes described above have been applied. The reduction processes may also be applied to the high-quality image, and they may be configured to be selectable in response to an instruction from the examiner.
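
As a rough illustration of the noise-based learning pairs described above, the following Python sketch builds two differently noised copies of one high-quality image. The function name `make_noisy_pair`, the Gaussian noise model, and the parameters `sigma` and `pixel_prob` are assumptions made for this example; the description does not specify a noise distribution or any particular values.

```python
import numpy as np

def make_noisy_pair(clean, sigma=0.05, pixel_prob=0.5, rng=None):
    """Return two differently noised copies of `clean` (float image in [0, 1]).

    Hypothetical helper: Gaussian noise and the default values are assumptions.
    """
    rng = np.random.default_rng() if rng is None else rng

    def add_noise(img):
        # Decide at random, pixel by pixel, whether noise is added at all,
        # so the noise pattern differs for each generated image.
        mask = rng.random(img.shape) < pixel_prob
        noise = rng.normal(0.0, sigma, size=img.shape)
        return np.clip(img + mask * noise, 0.0, 1.0)

    return add_noise(clean), add_noise(clean)

rng = np.random.default_rng(0)
clean = rng.random((64, 64))  # stand-in for a high-quality image
noisy_input, noisy_target = make_noisy_pair(clean, rng=rng)
```

Keeping `sigma` small relative to the image contrast corresponds to the condition that the parts present on the image should not be lost.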

Here, mutually different noise may be added to at least partial regions of a plurality of medical images whose feature amounts differ from each other. For example, if noise of a magnitude that is appropriate for a relatively bright medical image is added to a relatively dark medical image, parts present in the relatively dark medical image may be lost. Therefore, for example, the magnitude of the noise added to at least a partial region of a relatively dark medical image may be made smaller than that added to at least a partial region of a relatively bright image. The high-quality images used as learning data may also be high-quality images generated by another image quality improvement engine.
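
One way to picture the brightness-dependent noise magnitude is the minimal Python sketch below, which scales the noise standard deviation linearly with the mean pixel value. The linear rule, the Gaussian noise, and the names `brightness_scaled_sigma`, `add_scaled_noise`, and `base_sigma` are assumptions for illustration only; the description requires only that darker images receive smaller noise.

```python
import numpy as np

def brightness_scaled_sigma(img, base_sigma=0.05):
    """Noise standard deviation proportional to mean brightness (img in [0, 1]).

    Hypothetical rule: the linear scaling is an assumption of this sketch.
    """
    return base_sigma * float(np.mean(img))

def add_scaled_noise(img, base_sigma=0.05, rng=None):
    """Add Gaussian noise whose magnitude shrinks for darker images."""
    rng = np.random.default_rng() if rng is None else rng
    sigma = brightness_scaled_sigma(img, base_sigma)
    return np.clip(img + rng.normal(0.0, sigma, size=img.shape), 0.0, 1.0)

bright = np.full((32, 32), 0.8)  # stand-in for a relatively bright image
dark = np.full((32, 32), 0.2)    # stand-in for a relatively dark image
noisy_dark = add_scaled_noise(dark, rng=np.random.default_rng(0))
```

The same function could be applied to a cropped region instead of the whole image to cover the case where noise is added to only part of an image.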

Even when the various techniques described above are used, if the low-quality image is a relatively dark medical image, for example, parts that were present in the low-quality image may be partially lost in the high-quality image, possibly because the image quality improvement engine regards them as noise or the like. Therefore, for example, a composite image may be obtained by combining the low-quality image and the high-quality image at a ratio that depends on the brightness of the image or the like. Because a part lost in the high-quality image is still present in the low-quality image, such a part can be restored in the high-quality image in this way.
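
The combination described above can be sketched as a per-pixel weighted sum. In the Python example below, the function name `blend_by_brightness`, the linear mapping from mean brightness to the blending weight, and the bounds `min_alpha` and `max_alpha` are assumptions; the description states only that the ratio depends on the brightness of the image or the like.

```python
import numpy as np

def blend_by_brightness(low, high, min_alpha=0.5, max_alpha=1.0):
    """Blend a low-quality image with its enhanced version (floats in [0, 1]).

    `alpha` is the weight of the enhanced image. Darker inputs get a smaller
    alpha, so more of the original low-quality image is retained.
    Hypothetical mapping: the linear rule is an assumption of this sketch.
    """
    if low.shape != high.shape:
        raise ValueError("images must have the same shape")
    brightness = float(np.mean(low))
    alpha = min_alpha + (max_alpha - min_alpha) * brightness
    # Combine the pixel values of corresponding pixels at the ratio alpha.
    return alpha * high + (1.0 - alpha) * low, alpha

low = np.full((4, 4), 0.2)   # stand-in for a dark low-quality image
high = np.full((4, 4), 0.6)  # stand-in for the engine's output
composite, alpha = blend_by_brightness(low, high)
```

In a device where the ratio can also be changed by the examiner, `alpha` could instead be taken directly from a user interface control rather than computed from brightness.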

The high-quality image, the composite image, or the like may be stored in the storage unit in response to an instruction from the examiner. In this case, after the examiner gives the instruction to save the high-quality image, composite image, or the like, a recommended file name may be displayed at the time of file name registration, in a state editable in response to an instruction from the examiner, that includes, at some position in the file name (for example, the beginning or the end), information (for example, characters) indicating that the image was generated by processing using a trained model for image quality improvement (image quality improvement processing). Also, on various display screens such as the report screen, when a high-quality image is displayed on the display unit as described above, an indication that the displayed image is a high-quality image generated by processing using a trained model for image quality improvement may be displayed together with the high-quality image. The indication lets the user easily recognize that the displayed high-quality image is not the image as acquired by imaging, which can reduce misdiagnosis and improve diagnostic efficiency. The indication may take any form as long as it allows the input image and the high-quality image generated by the processing to be distinguished. Not only for processing using the trained model for image quality improvement but also for processing using the various trained models described above, an indication that a result was generated by processing using that type of trained model may be displayed together with the result. A display screen such as the report screen may be stored in the storage unit in response to an instruction from the examiner. For example, the report screen may be stored in the storage unit as a single image in which the high-quality image, composite image, or the like is arranged alongside an indication that these images are high-quality images generated by processing using a trained model for image quality improvement. In connection with that indication, an indication of what kind of learning data the trained model for image quality improvement was trained with may be displayed on the display unit. This may include a description of the types of input data and correct answer data in the learning data, and any indication concerning the correct answer data, such as the imaged site included in the input data and the correct answer data. Likewise, for processing using the various trained models described above, an indication of what kind of learning data that type of trained model was trained with may be displayed on the display unit. Information (for example, characters) indicating that the image was generated by processing using a trained model for image quality improvement may also be displayed or saved superimposed on the high-quality image, composite image, or the like. The information may be superimposed anywhere on the image as long as the region (for example, an edge of the image) does not overlap the region in which the site of interest being imaged is displayed. A non-overlapping region may also be determined, and the information superimposed on the determined region.
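
The recommended file name described above could be produced along the lines of the following Python sketch, which places a marker at the beginning or the end of the name. The function name `recommended_filename`, the marker text `AI-enhanced`, and the underscore separator are assumptions for illustration; the description says only that the name contains information such as characters, and that the examiner can edit it.

```python
from pathlib import Path

def recommended_filename(original, marker="AI-enhanced", position="suffix"):
    """Suggest a file name that flags the image as model-generated.

    Hypothetical helper: the marker text and separator are assumptions.
    The result is only a suggestion that the examiner may edit before saving.
    """
    path = Path(original)
    if position == "prefix":
        stem = f"{marker}_{path.stem}"
    elif position == "suffix":
        stem = f"{path.stem}_{marker}"
    else:
        raise ValueError("position must be 'prefix' or 'suffix'")
    # Keep the original directory and extension, and change only the stem.
    return str(path.with_name(stem + path.suffix))

suggested = recommended_filename("octa_scan.png")
```

The returned string would be shown in an editable field so that the examiner can accept or change it.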

When the button 3420 is set by default to be in the active state (image quality improvement processing on) on the initial display screen of the report screen, a report image corresponding to the report screen including the high-quality image, composite image, or the like may be transmitted to the server in response to an instruction from the examiner. Also, when the button 3420 is set by default to be in the active state, the report image corresponding to the report screen including the high-quality image, composite image, or the like may be transmitted (automatically) to the server at the end of the examination (for example, when the display is changed from the imaging confirmation screen or the preview screen to the report screen in response to an instruction from the examiner). In this case, the report image transmitted to the server may be one generated on the basis of the various default settings (for example, settings concerning at least one of the depth range for generating the En-Face image on the initial display screen of the report screen, whether an analysis map is superimposed, whether the image is a high-quality image, and whether the screen is a display screen for follow-up observation).

Among the various trained models described above, an image obtained with a first type of trained model (for example, a high-quality image, an image showing an analysis result such as an analysis map, an image showing an object recognition result, or an image showing a segmentation result) may be input to a second type of trained model different from the first type. A result of processing by the second type of trained model (for example, an analysis result, a diagnosis result, an object recognition result, or a segmentation result) may then be generated. Also, the result of processing by the first type of trained model (for example, an analysis result, a diagnosis result, an object recognition result, or a segmentation result) may be used to generate, from the image input to the first type of trained model, an image to be input to a second type of trained model different from the first type. The generated image is likely to be suitable for processing by the second type of trained model. This can improve the accuracy of the image (for example, a high-quality image, an image showing an analysis result such as an analysis map, an image showing an object recognition result, or an image showing a segmentation result) obtained by inputting the generated image to the second type of trained model. A similar image search using an external database stored on a server or the like may also be performed with the analysis results, diagnosis results, and the like from the processing of the trained models described above as search keys. When the plurality of images stored in the database are already managed with their respective feature amounts attached as supplementary information by machine learning or the like, a similar image search engine (a similar image inspection model, a trained model for similar image search) that uses the image itself as a search key may be used.

(Other Embodiments)
The present invention can also be realized by a process in which a program that implements one or more functions of the above-described embodiments is supplied to a system or device via a network or a storage medium, and one or more processors in a computer of the system or device read and execute the program. It can also be realized by a circuit (for example, an ASIC) that implements one or more functions. Although the present invention has been described above with reference to the embodiments, the present invention is not limited to those embodiments. Inventions modified without departing from the spirit of the present invention, and inventions equivalent to the present invention, are also included in the present invention. The above-described embodiments can also be combined as appropriate without departing from the spirit of the present invention.

Claims

1. A medical image processing device comprising:
a designation means for designating, in response to an instruction from an examiner, a partial depth range of a predetermined part of a subject in three-dimensional medical image data of the predetermined part;
an acquisition means for acquiring, using the three-dimensional medical image data, a first image that is a medical image of the predetermined part corresponding to the specified partial depth range; and
an image quality improvement unit that generates, from the first image, a second image having higher image quality than the first image by using an image quality improvement engine including a machine learning engine obtained using learning data including a plurality of medical images corresponding to a plurality of depth ranges of a predetermined part of a subject, and generates a composite image by combining pixel values of corresponding pixels in the first image and the second image at a ratio that can be changed in response to an instruction from an examiner.

2. The medical image processing device according to claim 1, wherein the image quality improvement engine includes a machine learning engine obtained using learning data including a plurality of medical images obtained by adding noise of different magnitudes to at least a portion of each of a plurality of medical images corresponding to the plurality of depth ranges according to information regarding pixel values of at least a portion of each of the plurality of depth ranges.

3. The medical image processing device according to claim 1, further comprising a wide-angle image generation unit that generates a wide-angle image using a plurality of composite images, wherein a plurality of the first images are obtained by imaging different positions of the predetermined part in a direction intersecting a depth direction of the predetermined part so that partial areas of adjacent medical images corresponding to the specified partial depth range overlap, and the plurality of composite images are obtained by combining, at the ratio, pixel values of corresponding pixels in the plurality of first images and in a plurality of the second images obtained from the plurality of first images using the image quality improvement engine.

4. The medical image processing device according to claim 1, wherein the image quality improvement engine includes a machine learning engine obtained using learning data including an image obtained by OCTA imaging using an OCT imaging device with higher performance than an OCT imaging device used for OCTA imaging of the first image, or an image obtained by an OCTA imaging process that requires more labor than the OCTA imaging process of the first image.

5. The medical image processing device according to claim 1, wherein the image quality improvement unit divides the first image into a plurality of two-dimensional images, inputs the images to the image quality improvement engine, and generates the second image by integrating the plurality of output images from the image quality improvement engine.

6. The medical image processing device according to claim 5, wherein the image quality improvement engine includes a machine learning engine obtained using learning data including, as paired images, a plurality of medical images having a corresponding positional relationship, and
the image quality improvement unit divides the first image into the plurality of two-dimensional images with an image size corresponding to the image size of the paired images and inputs the divided images to the image quality improvement engine.

7. The medical image processing device according to claim 5 or 6, wherein the image quality improvement engine includes a machine learning engine obtained using learning data including images of a plurality of partial regions that are set, for an area including a medical image and its outer periphery, so that adjacent partial regions partially overlap each other.

8. The medical image processing device according to claim 1, wherein the image quality improvement engine includes a machine learning engine obtained using learning data including medical images obtained by superimposition processing.

9. A medical image processing method comprising:
specifying, in response to an instruction from an examiner, a partial depth range of a predetermined part of a subject in three-dimensional medical image data of the predetermined part;
acquiring, using the three-dimensional medical image data, a first image that is a medical image of the predetermined part corresponding to the specified partial depth range; and
generating, from the first image, a second image having higher image quality than the first image by using an image quality improvement engine including a machine learning engine obtained using learning data including a plurality of medical images corresponding to a plurality of depth ranges of a predetermined part of a subject, and generating a composite image by combining pixel values of corresponding pixels in the first image and the second image at a ratio that can be changed in response to an instruction from an examiner.

10. A program that, when executed by a processor, causes the processor to perform each step of the medical image processing method according to claim 9.