JP7187557B2

JP7187557B2 - MEDICAL IMAGE LEARNING APPARATUS, METHOD AND PROGRAM

Info

Publication number: JP7187557B2
Application number: JP2020527358A
Authority: JP
Inventors: 正明大酒
Original assignee: Fujifilm Corp
Current assignee: Fujifilm Corp
Priority date: 2018-06-28
Filing date: 2019-06-10
Publication date: 2022-12-12
Anticipated expiration: 2039-06-10
Also published as: JPWO2020003991A1; WO2020003991A1

Description

本発明は医療画像学習装置、方法及びプログラムに係り、特に比較的少ないデータ数で精度のよい画像認識のための学習を行う技術に関する。 The present invention relates to a medical image learning apparatus, method, and program, and more particularly to a technique of performing learning for accurate image recognition with a relatively small amount of data.

医療分野においては、内視鏡装置を用いた検査が行われている。近年においては、画像解析によって画像に含まれる病変を検出する、病変を種別ごとに分類する等の認識を行い、報知することで検査を支援することが知られている。 In the medical field, examinations using endoscope apparatuses are performed. In recent years, it has been known to detect lesions contained in an image by image analysis, to classify lesions by type, or the like, and to support examinations by performing recognition and reporting.

認識のための画像解析においては、深層学習（Deep Learning）をはじめとする画像の機械学習が広く使用されている（例えば、非特許文献１）。 Machine learning of images including deep learning is widely used in image analysis for recognition (for example, Non-Patent Document 1).

深層学習では問題に応じた画像を大量に学習させることで分類、検出といった画像認識が可能となる。 Deep learning enables image recognition such as classification and detection by learning a large amount of images according to the problem.

ところで、医療画像には、通常光と呼ばれる光源により撮像される通常光画像の他に、特殊光と呼ばれる光源により撮像される特殊光画像がある。 By the way, medical images include special light images captured by a light source called special light in addition to normal light images captured by a light source called normal light.

目視による検査では、主に通常光画像が使用されるため、通常光画像は比較的多く集めることができる。 Since visual inspection primarily uses normal light images, a relatively large number of normal light images can be collected.

一方、特殊光画像は、観察目的に合わせて選択的に使用されるため、通常光画像に比べて、あまりデータ数が多くない。そのため、特殊光画像による学習を十分に行うことができず、学習済み認識器による特殊光画像に対する認識精度を上げることが困難であった。 On the other hand, since the special light image is selectively used according to the purpose of observation, the amount of data is not so large compared to the normal light image. Therefore, it is difficult to sufficiently perform the learning using the special light image, and it is difficult to improve the recognition accuracy of the special light image by the learned recognizer.

これに対し、特許文献１には、学習対象の対象画像群に写る被写体の形状、対象画像群に写る被写体の組織構造及び対象画像群を撮像した機器の撮像系の少なくとも１つの特性が類似する類似画像群に基づいて事前学習が行われ、事前学習の結果及び対象画像群に基づいて本学習が行われた本学習結果に基づいて、識別対象の画像群を識別した識別結果を出力する識別部を備える画像処理装置が提案されている。これにより、本学習に使用する対象画像群のデータ数が少量であっても、高精度な学習を行うことができるようにしている。 On the other hand, in Patent Document 1, at least one characteristic of the shape of the subject captured in the target image group of the learning object, the tissue structure of the subject captured in the target image group, and the imaging system of the device that captured the target image group is similar. Pre-learning is performed based on a group of similar images, and main learning is performed based on the result of pre-learning and a group of target images. An image processing apparatus has been proposed that includes a unit. As a result, highly accurate learning can be performed even if the amount of data in the target image group used for main learning is small.

国際公開第２０１７／２２１４１２号WO2017/221412

A. Krizhevsky, I. Sutskever, and G. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012A. Krizhevsky, I. Sutskever, and G. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012

特許文献１に記載の事前学習に使用する画像群は、学習対象の対象画像群と被写体の形状、被写体の組織構造及び撮像した機器の撮像系の少なくとも１つの特性が類似する類似画像群である。即ち、特許文献１には、学習対象の対象画像群と類似画像群とが、特殊光の観察光で撮像された特殊光画像群と通常光の観察光で撮像された通常光画像群であること（異なる観察光で撮像された画像群同士であること）が記載されていない。 The image group used for pre-learning described in Patent Document 1 is a similar image group similar in at least one characteristic of the target image group to be learned, the shape of the subject, the tissue structure of the subject, and the imaging system of the imaging device. . That is, in Patent Document 1, a group of target images to be learned and a group of similar images are a group of special light images captured with special observation light and a group of normal light images captured with normal observation light. It does not describe that (the images are groups of images captured with different observation lights).

本発明はこのような事情に鑑みてなされたもので、特殊光で撮像された医療画像に対して高精度な画像認識を行うモデルを生成することができる医療画像学習装置、方法及びプログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and provides a medical image learning device, method, and program capable of generating a model for highly accurate image recognition of medical images captured with special light. intended to

上記目的を達成するために本発明の一の態様に係る医療画像学習装置は、通常光で撮像された複数の第１医療画像からなる第１画像群を少なくとも用いて学習することにより画像認識用の第１モデルを生成する第１学習部と、第１モデルを元に、特殊光で撮像された複数の第２医療画像からなる第２画像群を用いて学習することにより、第２医療画像に対する画像認識を行う第２モデルを生成する第２学習部と、を備える。 In order to achieve the above object, a medical image learning apparatus according to one aspect of the present invention performs learning using at least a first image group composed of a plurality of first medical images captured with normal light, thereby learning images for image recognition. and a second image group consisting of a plurality of second medical images captured with special light based on the first learning unit that generates the first model of the second medical image. and a second learning unit that generates a second model that performs image recognition for the.

本発明の一の態様によれば、第１学習部は、比較的多く集めることができる通常光で撮像された第１医療画像からなる第１画像群を用いて学習（事前学習）を行うため、画像認識用の第１モデルを良好に生成することができる。そして、第２学習部は、第１モデルを元に、複数の第２医療画像からなる第２画像群を用いて学習（「ファインチューニング」ともいう）を行うため、比較的データ数の少ない第２画像群からでも第２医療画像に対して適切な画像認識を行う第２モデルを生成することができる。 According to one aspect of the present invention, the first learning unit performs learning (pre-learning) using a first image group composed of first medical images captured with normal light, which can be collected in relatively large numbers. , can well generate the first model for image recognition. Then, based on the first model, the second learning unit performs learning (also referred to as “fine tuning”) using a second image group consisting of a plurality of second medical images. It is possible to generate a second model that performs appropriate image recognition on the second medical image even from two image groups.

本発明の他の態様に係る医療画像学習装置において、第１学習部は、第２医療画像も用いて第１モデルを生成することが好ましい。これにより、第１モデルの生成に使用する医療画像をより多くすることができる。 In the medical image learning device according to another aspect of the present invention, it is preferable that the first learning unit also use the second medical image to generate the first model. As a result, more medical images can be used to generate the first model.

本発明の更に他の態様に係る医療画像学習装置において、複数の色チャンネルを有する第１医療画像を、１つの色チャンネルを有する第１医療画像に変換する画像処理部を備え、第１学習部は、変換された１つの色チャンネルを有する第１画像群を用いて学習することが好ましい。第１医療画像を、１つの色チャンネルを有する第１医療画像に変換することで、第１医療画像の特徴量を第２医療画像の特徴量に近づけることができ、第１モデルの生成をより適切に行うことができる。 A medical image learning device according to still another aspect of the present invention includes an image processing unit that converts a first medical image having a plurality of color channels into a first medical image having a single color channel; is preferably learned using a first set of images with one transformed color channel. By converting the first medical image into a first medical image having one color channel, the feature amount of the first medical image can be brought closer to the feature amount of the second medical image, and the first model can be generated more easily. can be done properly.

本発明の更に他の態様に係る医療画像学習装置において、画像処理部は、複数の色チャンネルを有する第１医療画像を、輝度信号のみの第１医療画像に変換することで、１つの色チャンネルを有する第１医療画像とすることが好ましい。これにより、第１学習部は、複数の色チャンネルからなる色情報の影響を受けないエッジ等の基本的な特徴量、識別規則を抽出することができ、適切な第１モデルを生成することができる。 In the medical image learning device according to still another aspect of the present invention, the image processing unit converts a first medical image having a plurality of color channels into a first medical image having only luminance signals, thereby obtaining a single color channel Preferably, the first medical image has As a result, the first learning unit can extract basic feature amounts such as edges and identification rules that are not affected by color information consisting of a plurality of color channels, and can generate an appropriate first model. can.

本発明の更に他の態様に係る医療画像学習装置において、複数の色チャンネルを有する第１医療画像及び第２医療画像を、それぞれ１つの色チャンネルを有する第１医療画像及び第２医療画像に変換する画像処理部を備え、第１学習部は、変換された１つの色チャンネルを有する第１画像群及び第２画像群を用いて学習することが好ましい。 A medical image learning apparatus according to still another aspect of the present invention converts a first medical image and a second medical image having a plurality of color channels into a first medical image and a second medical image each having a single color channel. Preferably, the first training unit trains using the first and second sets of images having one transformed color channel.

第１学習部での学習に第１画像群及び第２画像群を使用する場合、複数の色チャンネルからなる第１画像群及び第２画像群を、それぞれ１つの色チャンネルを有する画像群に変換されたものを使用することで、第１モデルをより適正に生成できるようにしている。 When the first group of images and the second group of images are used for training in the first training unit, the first group of images and the second group of images, which consist of multiple color channels, are converted into groups of images each having one color channel. By using the obtained one, it is possible to generate the first model more properly.

本発明の更に他の態様に係る医療画像学習装置において、画像処理部は、複数の色チャンネルを有する第１医療画像及び第２医療画像を、輝度信号のみの第１医療画像及び第２医療画像に変換することで、１つの色チャンネルを有する第１医療画像及び第２医療画像とすることが好ましい。これにより、第１学習部は、複数の色チャンネルからなる色情報の影響を受けないエッジ等の基本的な特徴量を抽出することができ、適切な第１モデルを生成することができる。 In the medical image learning device according to still another aspect of the present invention, the image processing unit converts the first medical image and the second medical image having a plurality of color channels into the first medical image and the second medical image having only luminance signals. resulting in a first medical image and a second medical image having one color channel. As a result, the first learning unit can extract basic feature amounts such as edges that are not affected by color information made up of a plurality of color channels, and can generate an appropriate first model.

本発明の更に他の態様に係る医療画像学習装置において、複数の色チャンネルを有する第１医療画像から、１つの色チャンネルを有する第１医療画像を抽出する抽出部を備え、第１学習部は、抽出された１つの色チャンネルを有する第１画像群を用いて学習することが好ましい。複数の色チャンネルからなる第１画像群から１つの色チャンネルを有する第１医療画像を抽出し、抽出した１つの色チャンネルを有する第１画像群を用いて学習することで、複数の色チャンネルからなる色情報の影響を受けないエッジ等の基本的な特徴量を抽出することができ、適切な第１モデルを生成することができる。 A medical image learning device according to still another aspect of the present invention comprises an extraction unit for extracting a first medical image having one color channel from a first medical image having a plurality of color channels, the first learning unit comprising , preferably using a first set of images with one color channel extracted. By extracting a first medical image having one color channel from a first image group consisting of a plurality of color channels and learning using the extracted first image group having one color channel, It is possible to extract basic feature amounts such as edges that are not affected by color information, and to generate an appropriate first model.

本発明の更に他の態様に係る医療画像学習装置において、複数の色チャンネルを有する第１医療画像及び第２医療画像から、１つの色チャンネルを有する第１医療画像及び第２医療画像をそれぞれ抽出する抽出部を備え、第１学習部は、抽出された１つの色チャンネルを有する第１画像群及び第２画像群を用いて学習することが好ましい。 In a medical image learning device according to still another aspect of the present invention, a first medical image and a second medical image each having a single color channel are extracted from a first medical image and a second medical image having a plurality of color channels. and the first learning unit learns using the first group of images and the second group of images having one extracted color channel.

第１学習部での学習に第１画像群及び第２画像群を使用する場合、複数の色チャンネルからなる第１画像群及び第２画像群から、それぞれ１つの色チャンネルを有する第１医療画像及び第２医療画像を抽出し、抽出した１つの色チャンネルの画像群を学習に使用することで、第１モデルをより適正に生成できるようにしている。 When using the first group of images and the second group of images for training in the first learning unit, the first medical image having one color channel from each of the first group of images and the second group of images each having a plurality of color channels. and a second medical image, and the extracted image group of one color channel is used for learning, so that the first model can be generated more appropriately.

本発明の更に他の態様に係る医療画像学習装置において、複数の色チャンネルは、３原色の３チャンネル、又は輝度信号及び２つの色差信号の３チャンネルであることが好ましい。 In the medical image learning device according to still another aspect of the present invention, the plurality of color channels are preferably three channels of three primary colors, or three channels of a luminance signal and two color difference signals.

本発明の更に他の態様に係る医療画像学習装置において、第１モデルを元に、第１画像群を用いて学習することにより、第１医療画像に対する画像認識を行う第３モデルを生成する第３学習部と、を備えることが好ましい。第１モデルは、第１医療画像に対する画像認識を行う第３モデルを生成する場合にも使用することができる。 In the medical image learning device according to still another aspect of the present invention, a third model for performing image recognition on the first medical image is generated by learning using the first image group based on the first model. 3 learning units. The first model can also be used to generate a third model for performing image recognition on the first medical image.

本発明の更に他の態様に係る医療画像学習装置において、第１医療画像及び第２医療画像は、それぞれ内視鏡装置により撮像された画像であることが好ましい。 In the medical image learning device according to still another aspect of the present invention, the first medical image and the second medical image are preferably images captured by an endoscope device.

本発明の更に他の態様に係る医療画像学習装置において、第１モデル及び第２モデルが、畳み込みニューラルネットワークで構成されることが好ましい。 In the medical image learning device according to still another aspect of the present invention, it is preferable that the first model and the second model are configured by convolutional neural networks.

本発明の更に他の態様に係る医療画像学習方法は、通常光で撮像された複数の第１医療画像からなる第１画像群及び特殊光で撮像された複数の第２医療画像からなる第２画像群を準備するステップと、第１学習部が、第１画像群を少なくとも用いて学習することにより画像認識用の第１モデルを生成するステップと、第２学習部が、第１モデルを元に第２画像群を用いて学習することにより、第２医療画像に対する画像認識を行う第２モデルを生成するステップと、を含む。 A medical image learning method according to still another aspect of the present invention provides a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light. preparing a group of images; generating a first model for image recognition by a first learning unit by learning using at least the first group of images; generating a second model for performing image recognition on the second medical image by training using the second set of images in the step of .

本発明の更に他の態様に係る医療画像学習方法において、画像処理部が、複数の色チャンネルを有する第１医療画像を、１つの色チャンネルを有する第１医療画像に変換するステップを含み、第１モデルを生成するステップは、変換された１つの色チャンネルを有する第１画像群を用いて学習することが好ましい。 In a medical image learning method according to still another aspect of the present invention, the image processing unit converts a first medical image having a plurality of color channels into a first medical image having a single color channel, The step of generating one model preferably trains using a first set of images having one transformed color channel.

本発明の更に他の態様に係る医療画像学習方法において、画像処理部が、複数の色チャンネルを有する第１医療画像及び第２医療画像を、１つの色チャンネルを有する第１医療画像及び第２医療画像に変換するステップを含み、第１モデルを生成するステップは、変換された１つの色チャンネルを有する第１画像群及び第２画像群を用いて学習することが好ましい。 In the medical image learning method according to still another aspect of the present invention, the image processing unit converts the first medical image and the second medical image having a plurality of color channels into the first medical image and the second medical image having one color channel. Preferably, the step of generating the first model includes transforming to medical images and training using a first set of images and a second set of images having one color channel that has been transformed.

本発明の更に他の態様に係る医療画像学習方法において、抽出部が、複数の色チャンネルを有する第１医療画像から、１つの色チャンネルを有する第１医療画像を抽出するステップを含み、第１モデルを生成するステップは、抽出された１つの色チャンネルを有する第１画像群を用いて学習することが好ましい。 In a medical image learning method according to still another aspect of the present invention, the extracting unit extracts a first medical image having one color channel from a first medical image having a plurality of color channels, The step of generating the model is preferably trained using a first set of images having one color channel extracted.

本発明の更に他の態様に係る医療画像学習方法において、抽出部が、複数の色チャンネルを有する第１医療画像及び第２医療画像から、１つの色チャンネルを有する第１医療画像及び第２医療画像をそれぞれ抽出するステップを含み、第１モデルを生成するステップは、抽出された１つの色チャンネルを有する第１画像群及び第２画像群を用いて学習することが好ましい。 In the medical image learning method according to still another aspect of the present invention, the extracting unit extracts the first medical image and the second medical image having one color channel from the first medical image and the second medical image having a plurality of color channels. Preferably, the step of generating the first model comprises extracting each of the images, and training using the first and second sets of images with one extracted color channel.

本発明の更に他の態様に係る医療画像学習プログラムは、通常光で撮像された複数の第１医療画像からなる第１画像群及び特殊光で撮像された複数の第２医療画像からなる第２画像群をそれぞれ取得する機能と、第１画像群を少なくとも用いて学習することにより画像認識用の第１モデルを生成する機能と、第１モデルを元に第２画像群を用いて学習することにより、第２医療画像に対する画像認識を行う第２モデルを生成する機能と、をコンピュータに実現させる。 A medical image learning program according to still another aspect of the present invention provides a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light. A function of acquiring each image group, a function of generating a first model for image recognition by learning using at least the first image group, and learning using a second image group based on the first model. causes the computer to realize a function of generating a second model for performing image recognition on the second medical image.

本発明によれば、比較的多く集めることができる通常光で撮像された第１医療画像からなる第１画像群を少なくとも用いて学習を行うことで、良好な第１モデルを生成し、この第１モデルを元に、特殊光で撮像された複数の第２医療画像からなる第２画像群を用いて学習を行うため、比較的データ数の少ない第２画像群からでも第２医療画像に対して適切な画像認識を行う第２モデルを生成することができる。 According to the present invention, a good first model is generated by performing learning using at least the first image group composed of the first medical images captured with normal light, which can be collected in relatively large numbers, and this first model Based on one model, learning is performed using the second image group consisting of a plurality of second medical images captured with special light, so even from the second image group with a relatively small amount of data, can be used to generate a second model that performs appropriate image recognition.

図１は、本発明に係る医療画像学習装置のハードウエア構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of the hardware configuration of a medical image learning device according to the present invention. 図２は、本発明に係る医療画像学習装置１０－１の第１実施形態を示すブロック図である。FIG. 2 is a block diagram showing the first embodiment of the medical image learning device 10-1 according to the present invention. 図３は、第１学習部３０の実施形態を示す機能ブロック図である。FIG. 3 is a functional block diagram showing an embodiment of the first learning section 30. As shown in FIG. 図４は、本発明に係る医療画像学習装置１０－２の第２実施形態を示すブロック図である。FIG. 4 is a block diagram showing a second embodiment of the medical image learning device 10-2 according to the present invention. 図５は、本発明に係る医療画像学習装置１０－３の第３実施形態を示すブロック図である。FIG. 5 is a block diagram showing a third embodiment of the medical image learning device 10-3 according to the present invention. 図６は、本発明に係る医療画像学習装置１０－４の第４実施形態を示すブロック図である。FIG. 6 is a block diagram showing a fourth embodiment of the medical image learning device 10-4 according to the present invention. 図７は、本発明に係る医療画像学習方法の実施形態を示すフローチャートである。FIG. 7 is a flowchart illustrating an embodiment of a medical image learning method according to the present invention. 図８は、図７に示したステップＳ１４の第１変形例を示すステップＳ１４－１を示す図である。FIG. 8 is a diagram showing step S14-1 showing a first modification of step S14 shown in FIG. 図９は、図７に示したステップＳ１４の第２変形例を示すステップＳ１４－２を示す図である。FIG. 9 is a diagram showing step S14-2 showing a second modification of step S14 shown in FIG.

以下、添付図面に従って本発明に係る医療画像学習装置及び方法の好ましい実施形態について説明する。 Preferred embodiments of the medical image learning apparatus and method according to the present invention will be described below with reference to the accompanying drawings.

［医療画像学習装置のハードウエア構成］
図１は、本発明に係る医療画像学習装置のハードウエア構成の一例を示すブロック図である。[Hardware configuration of medical image learning device]
FIG. 1 is a block diagram showing an example of the hardware configuration of a medical image learning device according to the present invention.

図１に示す医療画像学習装置１０としては、パーソナルコンピュータ又はワークステーションを使用することができ、本例の医療画像学習装置１０は、主として通信部１２と、大容量のストレージもしくは第１データベース１４、第２データベース１６と、操作部１８と、ＣＰＵ（Central Processing Unit）２０と、ＲＡＭ(Random Access Memory)２２と、ＲＯＭ(Read Only Memory)２４と、表示部２６とから構成されている。 A personal computer or a workstation can be used as the medical image learning device 10 shown in FIG. It comprises a second database 16 , an operation section 18 , a CPU (Central Processing Unit) 20 , a RAM (Random Access Memory) 22 , a ROM (Read Only Memory) 24 and a display section 26 .

通信部１２は、有線又は無線により外部装置との通信処理を行い、外部装置との間で情報のやり取りを行う部分である。 The communication unit 12 is a part that performs wired or wireless communication processing with an external device and exchanges information with the external device.

第１データベース１４は、通常光で撮像された複数の第１医療画像（通常光画像）からなる第１画像群（通常光画像群）と、各通常光画像の正しい画像認識結果を示す正解データとからなる画像認識用（学習用の）第１データセットを保存し、第２データベース１６は、特殊光で撮像された複数の第２医療画像（特殊光画像）からなる第２画像群（特殊光画像群）と、各特殊光画像の正しい画像認識結果を示す正解データとからなる画像認識用（学習用の）第２データセットを保存している。 The first database 14 includes a first image group (normal light image group) composed of a plurality of first medical images (normal light images) captured with normal light, and correct data indicating correct image recognition results of each normal light image. The second database 16 stores a second image group (special A second data set for image recognition (for learning) consisting of correct data indicating the correct image recognition result of each special light image is stored.

ここで、通常光画像（第１医療画像）及び特殊光画像（第２医療画像）とは、内視鏡装置によりそれぞれ異なる光源下で撮像されたカラー画像である。 Here, the normal light image (first medical image) and the special light image (second medical image) are color images captured by the endoscope apparatus under different light sources.

通常光は、可視光の全ての波長帯域の光がほぼ均等に混ざった光（白色光）であり、通常光画像は、通常観察に使用される。したがって、通常光画像群は、比較的多く集めることができる。 Ordinary light is light (white light) in which light in all wavelength bands of visible light is substantially evenly mixed, and ordinary light images are used for ordinary observation. Therefore, a relatively large number of normal light images can be collected.

一方、特殊光は、１つの特定の波長帯域の光、又は複数の特定の波長帯域の光の組み合わせた、観察目的に応じた各種の波長帯域の光であり、白色の波長帯域よりも狭い帯域を有し、狭帯域観察（NBI(Narrow band imaging)、FICE(Flexible spectral imaging color enhancement)）に使用される。 On the other hand, the special light is light in one specific wavelength band or light in various wavelength bands in accordance with the purpose of observation, which is a combination of light in a plurality of specific wavelength bands, and has a narrower band than the white wavelength band. and is used for narrow band imaging (NBI (Narrow band imaging), FICE (Flexible spectral imaging color enhancement)).

特定の波長帯域の第１例は、例えば可視域の青色帯域又は緑色帯域である。この第１例の波長帯域は、３９０ｎｍ以上４５０ｎｍ以下又は５３０ｎｍ以上５５０ｎｍ以下の波長帯域を含み、且つ第１例の光は、３９０ｎｍ以上４５０ｎｍ以下又は５３０ｎｍ以上５５０ｎｍ以下の波長帯域内にピーク波長を有する。 A first example of a specific wavelength band is, for example, the blue band or the green band in the visible range. The wavelength band of this first example includes a wavelength band of 390 nm or more and 450 nm or less or 530 nm or more and 550 nm or less, and the light of the first example has a peak wavelength within the wavelength band of 390 nm or more and 450 nm or less or 530 nm or more and 550 nm or less. .

特定の波長帯域の第２例は、例えば可視域の赤色帯域である。この第２例の波長帯域は、５８５ｎｍ以上６１５ｎｍ以下又は６１０ｎｍ以上７３０ｎｍ以下の波長帯域を含み、且つ第２例の光は、５８５ｎｍ以上６１５ｎｍ以下又は６１０ｎｍ以上７３０ｎｍ以下の波長帯域内にピーク波長を有する。 A second example of a specific wavelength band is, for example, the visible red band. The wavelength band of this second example includes a wavelength band of 585 nm or more and 615 nm or less or 610 nm or more and 730 nm or less, and the light of the second example has a peak wavelength within the wavelength band of 585 nm or more and 615 nm or less or 610 nm or more and 730 nm or less. .

特定の波長帯域の第３例は、酸化ヘモグロビンと還元ヘモグロビンとで吸光係数が異なる波長帯域を含み、且つ第３例の光は、酸化ヘモグロビンと還元ヘモグロビンとで吸光係数が異なる波長帯域にピーク波長を有する。この第３例の波長帯域は、４００±１０ｎｍ、４４０±１０ｎｍ、４７０±１０ｎｍ、又は６００ｎｍ以上７５０ｎｍ以下の波長帯域を含み、且つ第３例の光は、上記４００±１０ｎｍ、４４０±１０ｎｍ、４７０±１０ｎｍ、又は６００ｎｍ以上７５０ｎｍ以下の波長帯域にピーク波長を有する。 A third example of the specific wavelength band includes a wavelength band in which oxygenated hemoglobin and reduced hemoglobin have different absorption coefficients, and the light in the third example has a peak wavelength in the wavelength band in which oxygenated hemoglobin and reduced hemoglobin have different absorption coefficients. have The wavelength band of this third example includes a wavelength band of 400±10 nm, 440±10 nm, 470±10 nm, or a wavelength band of 600 nm or more and 750 nm or less, and the light of the third example is the above 400±10 nm, 440±10 nm, 470 It has a peak wavelength in the wavelength band of ±10 nm or from 600 nm to 750 nm.

特定の波長帯域の第４例は、生体内の蛍光物質が発する蛍光の観察（蛍光観察）に用いられ且つこの蛍光物質を励起させる励起光の波長帯域（３９０ｎｍから４７０ｎｍ）である。 A fourth example of the specific wavelength band is the wavelength band (390 nm to 470 nm) of excitation light used for observation of fluorescence emitted by a fluorescent substance in the living body (fluorescence observation) and for exciting this fluorescent substance.

特定の波長帯域の第５例は、赤外光の波長帯域である。この第５例の波長帯域は、７９０ｎｍ以上８２０ｎｍ以下又は９０５ｎｍ以上９７０ｎｍ以下の波長帯域を含み、且つ第５例の光は、７９０ｎｍ以上８２０ｎｍ以下又は９０５ｎｍ以上９７０ｎｍ以下の波長帯域にピーク波長を有する。 A fifth example of the specific wavelength band is the wavelength band of infrared light. The wavelength band of the fifth example includes a wavelength band of 790 nm to 820 nm or 905 nm to 970 nm, and the light of the fifth example has a peak wavelength in the wavelength band of 790 nm to 820 nm or 905 nm to 970 nm.

このような特定の波長帯域を有する特殊光下で撮像された特殊光画像は病変が見にくいため、表面構造を観察したい等の観察目的に応じた場合にしか使用されず、データ数が多くない。 Since lesions are difficult to see in special light images captured under special light having such a specific wavelength band, they are only used for observation purposes such as observing surface structures, and the number of data is not large.

本例では、第１データベース１４に保存されている通常光画像群の第１データセットは、第２データベース１６に保存されている特殊光画像群の第２データセットよりも多く準備されているものとする。 In this example, the first data set of the normal light image group stored in the first database 14 is prepared more than the second data set of the special light image group stored in the second database 16. and

また、第１データベース１４及び第２データベース１６において、各通常光画像及び各特殊光画像に関連付けて保存されている正解データは、例えば通常光画像及び特殊光画像内に写っている病変の種類、病変の位置を示したデータ、症例固有の識別情報などが考えられる。病変の分類においては、腫瘍性、非腫瘍性の２分類、NICE分類などが挙げられる。病変の位置を示すデータは、病変を囲む矩形の情報や、病変を覆い隠すようなマスクデータなどが考えられる。 Further, in the first database 14 and the second database 16, the correct data stored in association with each normal light image and each special light image are, for example, the types of lesions shown in the normal light image and the special light image, Data indicating the position of a lesion, case-specific identification information, and the like are conceivable. Classification of lesions includes two classifications of neoplastic and non-neoplastic, NICE classification, and the like. Data indicating the position of a lesion may be information on a rectangle surrounding the lesion, mask data that hides the lesion, or the like.

本例では、第１データベース１４、第２データベース１６は、医療画像学習装置１０が備えているが、外部に設けられたものでもよい。この場合、通信部１２を介して外部のデータベースから学習用のデータセットを取得することができる。 In this example, the first database 14 and the second database 16 are provided in the medical image learning device 10, but they may be provided outside. In this case, a data set for learning can be acquired from an external database via the communication unit 12 .

操作部１８は、コンピュータに有線接続又は無線接続されるキーボード及びマウス等が用いられ、機械学習に当たって各種の操作入力を受け付ける。 The operation unit 18 uses a keyboard and a mouse that are wired or wirelessly connected to the computer, and receives various operation inputs for machine learning.

ＣＰＵ２０は、ＲＯＭ２４又は図示しないハードディスク装置等に記憶された各種のプログラム（本発明に係る医療画像学習プログラムを含む）を読み出し、各種の処理を実行する。ＲＡＭ２２は、ＣＰＵ２０の作業領域として使用され、読み出されたプログラムや各種のデータを一時的に記憶する記憶部として用いられる。 The CPU 20 reads various programs (including a medical image learning program according to the present invention) stored in the ROM 24 or a hard disk device (not shown) or the like, and executes various processes. The RAM 22 is used as a work area for the CPU 20 and is used as a storage unit for temporarily storing read programs and various data.

表示部２６は、コンピュータに接続可能な液晶モニタ等の各種モニタが用いられ、操作部１８とともに、ユーザインターフェースの一部として使用される。 Various monitors such as a liquid crystal monitor that can be connected to a computer are used as the display unit 26 , and the display unit 26 is used as part of the user interface together with the operation unit 18 .

上記構成の医療画像学習装置１０は、操作部１８により指示入力によりＣＰＵ２０が、ＲＯＭ２４やハードディスク装置等に記憶されている医療画像学習プログラムを読み出し、医療画像学習プログラムを実行することにより、後述するように医療画像学習装置として機能する。 In the medical image learning apparatus 10 configured as described above, the CPU 20 reads out a medical image learning program stored in the ROM 24, hard disk device, or the like in response to an instruction input from the operation unit 18, and executes the medical image learning program, thereby performing the operation as described later. function as a medical image learning device.

［医療画像学習装置の第１実施形態］
図２は、本発明に係る医療画像学習装置１０－１の第１実施形態を示すブロック図であり、図１に示した医療画像学習装置１０の主要な機能を示す機能ブロック図である。[First embodiment of medical image learning device]
FIG. 2 is a block diagram showing the first embodiment of the medical image learning device 10-1 according to the present invention, and is a functional block diagram showing main functions of the medical image learning device 10 shown in FIG.

図２に示す医療画像学習装置１０－１は、第１学習部３０と第２学習部４０とを備えている。 The medical image learning device 10-1 shown in FIG. 2 includes a first learning unit 30 and a second learning unit 40.

第１学習部３０は、第１データベース１４に保存された通常光画像のデータセットと、第２データベース１６に保存された特殊光画像のデータセットとを用いて学習することにより画像認識用の学習モデル（第１モデル）を生成する。本例では、学習モデルの一つである畳み込みニューラルネットワーク（ＣＮＮ：Convolution Neural Network）を構築する。 The first learning unit 30 learns for image recognition by learning using a data set of normal light images stored in the first database 14 and a data set of special light images stored in the second database 16. Generate a model (first model). In this example, a convolution neural network (CNN), which is one of learning models, is constructed.

図３は、第１学習部３０の実施形態を示す機能ブロック図である。 FIG. 3 is a functional block diagram showing an embodiment of the first learning section 30. As shown in FIG.

図３に示す第１学習部３０は、主としてＣＮＮ３２と、誤差算出部３４と、パラメータ更新部３６とから構成される。 The first learning unit 30 shown in FIG. 3 is mainly composed of a CNN 32, an error calculation unit 34, and a parameter updating unit 36.

ＣＮＮ３２は、例えば、医療画像に写っている病変の種類を画像認識する認識器に対応する部分であり、複数のレイヤー構造を有し、複数の重みパラメータを保持している。ＣＮＮ３２は、重みパラメータが初期値から最適値に更新されることで、未学習モデルから学習済みモデルに変化しうる。 The CNN 32 is, for example, a part corresponding to a recognizer that recognizes the types of lesions appearing in medical images, has a multiple layer structure, and holds multiple weight parameters. The CNN 32 can change from an unlearned model to a learned model by updating the weight parameter from the initial value to the optimum value.

このＣＮＮ３２は、入力層３２Ａと、畳み込み層とプーリング層から構成された複数セット、及び全結合層を有する中間層３２Ｂと、出力層３２Ｃとを備え、各層は複数の「ノード」が「エッジ」で結ばれる構造となっている。 This CNN 32 comprises an input layer 32A, a hidden layer 32B having multiple sets of convolutional and pooling layers, and a fully connected layer, and an output layer 32C, each layer having a plurality of "nodes" as "edges". It has a structure connected by

入力層３２Ａには、学習対象である通常光画像１４Ａが入力される。 The normal light image 14A to be learned is input to the input layer 32A.

中間層３２Ｂは、畳み込み層とプーリング層とを１セットとする複数セットと、全結合層とを有し、入力層から入力した画像から特徴を抽出する部分である。畳み込み層は、前の層で近くにあるノードにフィルタ処理し（フィルタを使用した畳み込み演算を行い）、「特徴マップ」を取得する。プーリング層は、畳み込み層から出力された特徴マップを縮小して新たな特徴マップとする。「畳み込み層」は、画像からのエッジ抽出等の特徴抽出の役割を担い、「プーリング層」は抽出された特徴が、平行移動などによる影響を受けないようにロバスト性を与える役割を担う。尚、中間層３２Ｂには、畳み込み層とプーリング層とを１セットとする場合に限らず、畳み込み層が連続する場合や正規化層も含まれる。 The intermediate layer 32B has multiple sets of convolutional layers and pooling layers, and a fully connected layer, and is a part for extracting features from an image input from the input layer. The convolutional layer filters (convolves with filters) the nodes that are nearby in the previous layer to get a "feature map". The pooling layer reduces the feature map output from the convolution layer to a new feature map. The “convolution layer” plays a role of extracting features such as edge extraction from an image, and the “pooling layer” plays a role of providing robustness so that the extracted features are not affected by translation or the like. Note that the intermediate layer 32B is not limited to the case where the convolutional layer and the pooling layer are set as one set, but also includes the case where the convolutional layers are continuous and the normalization layer.

出力層３２Ｃは、中間層３２Ｂにより抽出された特徴に基づき医療画像に写っている病変の種類を分類する認識結果を出力する部分である。学習済みＣＮＮ３２では、例えば、医療画像を、「腫瘍性」、「非腫瘍性」、「その他」の３つのカテゴリに分類し、認識結果は、「腫瘍性」、「非腫瘍性」及び「その他」に対応する３つのスコア（３つのスコアの合計は１００％）として出力する。 The output layer 32C is a portion that outputs recognition results for classifying the types of lesions appearing in medical images based on the features extracted by the intermediate layer 32B. In the trained CNN 32, for example, medical images are classified into three categories of "neoplastic", "non-neoplastic", and "other", and the recognition results are classified into "neoplastic", "non-neoplastic" and "other ” (the sum of the three scores is 100%).

学習前のＣＮＮ３２の各畳み込み層に適用されるフィルタの係数やオフセット値、及び全結合層における次の層との接続の重みは、任意の初期値がセットされる。 Arbitrary initial values are set for the filter coefficients and offset values applied to each convolutional layer of the CNN 32 before learning, and the weight of the connection with the next layer in the fully connected layer.

誤差算出部３４は、ＣＮＮ３２の出力層３２Ｃから出力される認識結果と、通常光画像１４Ａに対する正解データとを取得し、両者間の誤差を算出する。誤差の算出方法は、例えばソフトマックスクロスエントロピー、シグモイドなどが考えられる。 The error calculator 34 acquires the recognition result output from the output layer 32C of the CNN 32 and the correct data for the normal light image 14A, and calculates the error between them. Examples of error calculation methods include softmax cross entropy and sigmoid.

パラメータ更新部３６は、誤差算出部３４により算出された誤差を元に、誤差逆伝播法によりＣＮＮ３０の重みパラメータを調整する。 The parameter updating unit 36 adjusts the weight parameters of the CNN 30 by error backpropagation based on the error calculated by the error calculating unit 34 .

このパラメータの調整処理を繰り返し行い、ＣＮＮ３２の出力と正解データとの差が小さくなるまで繰り返し学習を行う。 This parameter adjustment processing is repeated, and learning is repeated until the difference between the output of the CNN 32 and the correct data becomes small.

第１学習部３０は、第１データベース１４に保存された通常光画像群の全てのデータセットを使用し、ＣＮＮ３２の各パラメータを最適化する学習を行うことで、学習済みモデル（第１モデル）を生成する。 The first learning unit 30 uses all data sets of the normal light image group stored in the first database 14, and performs learning to optimize each parameter of the CNN 32, a learned model (first model) to generate

図２に戻って、第２学習部４０は、第１学習部３０により学習した学習済みモデルを元に（学習済みＣＮＮ３２のパラメータを初期値として使用し）、第２データベース１６に保存された特殊光画像のデータセットのみを用いて再度学習することにより、特殊光画像に対する画像認識を行う学習モデル（第２モデル）を生成する。 Returning to FIG. 2 , the second learning unit 40 uses the trained model trained by the first learning unit 30 (using the parameters of the trained CNN 32 as initial values), the special model stored in the second database 16 A learning model (second model) for performing image recognition on the special light image is generated by re-learning using only the data set of the light image.

第２学習部４０は、図３に示した第１学習部３０と同様に構成されるため、その詳細な説明は省略する。尚、第２学習部４０は、第１学習部３０と同一の学習部により構成されていてもよい。 Since the second learning section 40 is configured in the same manner as the first learning section 30 shown in FIG. 3, detailed description thereof will be omitted. Note that the second learning section 40 may be configured by the same learning section as the first learning section 30 .

第１実施形態の医療画像学習装置１０－１によれば、第１学習部３０により比較的多く集めることができる通常光画像のデータセットを用いて学習（事前学習）を行うため、画像認識用の学習モデルを良好に生成することができる。そして、第２学習部４０は、第１学習部３０により生成された学習モデルを元に、特殊光画像群のみを用いて学習（「ファインチューニング」ともいう）を行うため、比較的データ数の少ない特殊光画像群からでも特殊光画像に対して適切な画像認識を行う学習モデルを生成することができる。 According to the medical image learning device 10-1 of the first embodiment, learning (pre-learning) is performed using a data set of normal light images that can be collected in relatively large numbers by the first learning unit 30. can successfully generate a learning model of Then, based on the learning model generated by the first learning unit 30, the second learning unit 40 performs learning (also referred to as “fine tuning”) using only the special light image group. It is possible to generate a learning model that performs appropriate image recognition for special light images even from a small group of special light images.

尚、第１実施形態の医療画像学習装置１０－１の第１学習部３０は、通常光画像のデータセットと特殊光画像のデータセットとを用いて学習するが、通常光画像のデータセットのみを用いて学習してもよい。 The first learning unit 30 of the medical image learning device 10-1 of the first embodiment learns using the normal light image data set and the special light image data set, but only the normal light image data set may be learned using

［医療画像学習装置の第２実施形態］
図４は、本発明に係る医療画像学習装置１０－２の第２実施形態を示すブロック図である。尚、図４に示す医療画像学習装置１０－２において、図２に示した第１実施形態の医療画像学習装置１０－１と共通する部分には同一の符号を付し、その詳細な説明は省略する。[Second Embodiment of Medical Image Learning Apparatus]
FIG. 4 is a block diagram showing a second embodiment of the medical image learning device 10-2 according to the present invention. In the medical image learning device 10-2 shown in FIG. 4, the parts common to the medical image learning device 10-1 of the first embodiment shown in FIG. omitted.

図４に示す第２実施形態の医療画像学習装置１０－２は、主として画像処理部５０が追加されている点で、図２に示した第１実施形態の医療画像学習装置１０－１と相違する。 The medical image learning device 10-2 of the second embodiment shown in FIG. 4 differs from the medical image learning device 10-1 of the first embodiment shown in FIG. 2 mainly in that an image processing unit 50 is added. do.

第１データベース１４に保存されている通常光画像及び第２データベース１６に保存されている特殊光画像は、それぞれ赤（Ｒ）、緑（Ｇ）、青（Ｂ）の３つの色チャンネルからなるカラー画像とする。 The normal light image stored in the first database 14 and the special light image stored in the second database 16 are each color images consisting of three color channels, red (R), green (G) and blue (B). be an image.

画像処理部５０は、第１データベース１４に保存されているＲＧＢの３つの色チャンネルを有する通常光画像、及び第２データベース１６に保存されているＲＧＢの３つの色チャンネルを有する特殊光画像を、それぞれ１つの色チャンネルを有する通常光画像及び特殊光画像に変換し、変換後の１つの色チャンネルを有する通常光画像及び特殊光画像を、学習用のデータセットとして第１学習部３０に出力する。 The image processing unit 50 processes a normal light image having three RGB color channels stored in the first database 14 and a special light image having three RGB color channels stored in the second database 16, It converts into a normal light image and a special light image each having one color channel, and outputs the normal light image and the special light image having one color channel after conversion to the first learning unit 30 as a data set for learning. .

画像処理部５０は、ＲＧＢのカラー画像をモノクロ処理することで、１つの色チャンネルを有する通常光画像及び特殊光画像に変換する。 The image processing unit 50 performs monochrome processing on the RGB color image to convert it into a normal light image and a special light image having one color channel.

ここで、モノクロ処理の一例としては、例えば、ＲＧＢの色信号（Ｒ信号、Ｇ信号、Ｂ信号）から、次式により輝度信号（Ｙ信号）を生成する処理が考えられる。 Here, as an example of monochrome processing, for example, a process of generating a luminance signal (Y signal) from RGB color signals (R signal, G signal, B signal) according to the following equation can be considered.

［数１］
Ｙ＝０．３Ｒ＋０．５９Ｇ＋０．１１Ｂ
第１学習部３０は、画像処理部５０により変換された１つの色チャンネルを有する通常光画像及び特殊光画像（モノクロ画像）を、学習用のデータセットとして取得し、取得したデータセットにより画像認識用の学習モデル（第１モデル）を生成する。[Number 1]
Y=0.3R+0.59G+0.11B
The first learning unit 30 acquires the normal light image and the special light image (monochrome image) having one color channel converted by the image processing unit 50 as a learning data set, and uses the acquired data set for image recognition. Generate a learning model (first model) for

第２学習部４０は、第１学習部３０により学習した学習済みＣＮＮのパラメータを初期値として使用し、第２データベース１６に保存されたＲＧＢの特殊光画像のデータセットのみを用いて再度学習することにより、特殊光画像に対する画像認識を行う学習モデルを生成する。 The second learning unit 40 uses the parameters of the learned CNN learned by the first learning unit 30 as initial values, and re-learns using only the RGB special light image data set stored in the second database 16. By doing so, a learning model for performing image recognition on the special light image is generated.

ここで、第１学習部３０での学習に用いるデータセットは、１チャンネルの画像（モノクロ画像）であるため、少なくともＣＮＮの最初の畳み込み層で使用されるフィルタも１チャンネルになる。一方、第２学習部４０での学習に用いるデータセットは、複数の色チャンネル（ＲＧＢ）の特殊光画像であるため、少なくともＣＮＮの最初の畳み込み層で使用されるフィルタも複数チャンネル（３チャンネル）になる。したがって、第２学習部４０のＣＮＮの最初の畳み込み層で使用される３チャンネルのフィルタには、それぞれ同じパラメータを初期値として使用する。 Here, since the data set used for learning in the first learning unit 30 is a 1-channel image (monochrome image), the filter used at least in the first convolution layer of the CNN is also 1-channel. On the other hand, the data set used for learning in the second learning unit 40 is a special light image of multiple color channels (RGB), so at least the filters used in the first convolution layer of the CNN are also multiple channels (3 channels) become. Therefore, the same parameters are used as initial values for the three-channel filters used in the first convolutional layer of the CNN of the second learning unit 40 .

第２実施形態の医療画像学習装置１０－２によれば、通常光画像の特徴量を特殊光画像の特徴量に近づけて事前学習することができ、第２学習部４０での特殊光画像のデータセットのみによる再学習をより適切に行うことができる。 According to the medical image learning device 10-2 of the second embodiment, pre-learning can be performed by bringing the feature amount of the normal light image closer to the feature amount of the special light image. Re-learning with only the dataset can be performed more appropriately.

尚、第２実施形態の医療画像学習装置１０－２の画像処理部５０は、通常光画像及び特殊光画像をそれぞれモノクロ処理しているが、通常光画像のみをモノクロ処理し、第１学習部３０は、モノクロ処理された通常光画像のデータセットのみを用いて学習してもよい。 Note that the image processing unit 50 of the medical image learning device 10-2 of the second embodiment performs monochrome processing on the normal light image and the special light image, respectively. 30 may be trained using only monochrome processed normal light image data sets.

また、画像処理部５０は、輝度信号からなる「白黒画像」を生成する場合に限らず、単一色（１チャンネル）の画像を生成するものであればよく、特殊光画像の色相に近似したモノクロ画像を生成することが好ましい。 Further, the image processing unit 50 is not limited to generating a "black and white image" made up of luminance signals, and may generate a single color (one channel) image. An image is preferably generated.

［医療画像学習装置の第３実施形態］
図５は、本発明に係る医療画像学習装置１０－３の第３実施形態を示すブロック図である。尚、図５に示す医療画像学習装置１０－３において、図２に示した第１実施形態の医療画像学習装置１０－１と共通する部分には同一の符号を付し、その詳細な説明は省略する。[Third Embodiment of Medical Image Learning Apparatus]
FIG. 5 is a block diagram showing a third embodiment of the medical image learning device 10-3 according to the present invention. In the medical image learning device 10-3 shown in FIG. 5, the parts common to the medical image learning device 10-1 of the first embodiment shown in FIG. omitted.

図５に示す第３実施形態の医療画像学習装置１０－３は、主として抽出部６０が追加されている点で、図２に示した第１実施形態の医療画像学習装置１０－１と相違する。 The medical image learning device 10-3 of the third embodiment shown in FIG. 5 differs from the medical image learning device 10-1 of the first embodiment shown in FIG. 2 mainly in that an extraction unit 60 is added. .

第１データベース１４に保存されている通常光画像及び第２データベース１６に保存されている特殊光画像は、それぞれＲＧＢの３つの色チャンネルからなるカラー画像とする。 The normal light image stored in the first database 14 and the special light image stored in the second database 16 are assumed to be color images each consisting of three color channels of RGB.

抽出部６０は、第１データベース１４に保存されているＲＧＢの３つの色チャンネルを有する通常光画像、及び第２データベース１６に保存されているＲＧＢの３つの色チャンネルを有する特殊光画像から、それぞれ１つの色チャンネルを有する通常光画像及び特殊光画像を抽出する。 The extraction unit 60 extracts from the normal light image having three color channels of RGB stored in the first database 14 and the special light image having three color channels of RGB stored in the second database 16, respectively. A normal light image and a special light image with one color channel are extracted.

ここで、第２データベース１６に保存されている特殊光画像が、青色又は青紫の波長帯域にピーク波長を有する特殊光により撮像された内視鏡画像の場合、抽出部６０は、Ｂの１チャンネルのみの通常光画像及び特殊光画像を抽出することが好ましい。また、第２データベース１６に保存されている特殊光画像が、赤色又は近赤外の波長帯域にピーク波長を有する特殊光により撮像された内視鏡画像の場合、抽出部６０は、Ｒの１チャンネルのみの通常光画像及び特殊光画像を抽出することが好ましい。即ち、抽出部６０は、ＲＧＢの３つの色チャンネルを有する通常光画像及び特殊光画像から、特殊光画像の色味に近い１つの色チャンネルの画像を抽出することが好ましい。 Here, if the special light image stored in the second database 16 is an endoscopic image captured with special light having a peak wavelength in the blue or blue-violet wavelength band, the extraction unit 60 extracts one channel of B It is preferable to extract only normal light images and special light images. Further, when the special light image stored in the second database 16 is an endoscopic image captured with special light having a peak wavelength in the red or near-infrared wavelength band, the extraction unit 60 selects 1 of R It is preferable to extract the channel-only normal and special light images. That is, the extraction unit 60 preferably extracts an image of one color channel that is close to the color of the special light image from the normal light image and the special light image having three color channels of RGB.

第１学習部３０は、抽出部６０により抽出された１つの色チャンネルを有する通常光画像及び特殊光画像を、学習用のデータセットとして取得し、取得したデータセットにより画像認識用の学習モデル（第１モデル）を生成する。 The first learning unit 30 acquires the normal light image and the special light image having one color channel extracted by the extraction unit 60 as a data set for learning, and uses the acquired data set as a learning model for image recognition ( 1st model).

第３実施形態の医療画像学習装置１０－３によれば、通常光画像の特徴量を特殊光画像の特徴量に近づけて事前学習することができ、第２学習部４０での特殊光画像のデータセットのみによる再学習をより適切に行うことができる。 According to the medical image learning device 10-3 of the third embodiment, pre-learning can be performed by bringing the feature amount of the normal light image closer to the feature amount of the special light image. Re-learning with only the dataset can be performed more appropriately.

尚、第３実施形態の医療画像学習装置１０－３の抽出部６０は、ＲＧＢの通常光画像及び特殊光画像から１つの色チャンネルを有する通常光画像及び特殊光画像を抽出しているが、ＲＧＢの通常光画像から１つの色チャンネルを有する通常光画像のみを抽出し、第１学習部３０は、抽出された１つの色チャンネルを有する通常光画像のデータセットのみを用いて学習してもよい。 The extraction unit 60 of the medical image learning device 10-3 of the third embodiment extracts normal light images and special light images having one color channel from RGB normal light images and special light images. Only the normal light image having one color channel is extracted from the RGB normal light image, and the first learning unit 30 may learn using only the extracted data set of the normal light image having one color channel. good.

［医療画像学習装置の第４実施形態］
図６は、本発明に係る医療画像学習装置１０－４の第４実施形態を示すブロック図である。尚、図６に示す医療画像学習装置１０－４において、図４に示した第２実施形態の医療画像学習装置１０－２と共通する部分には同一の符号を付し、その詳細な説明は省略する。[Fourth Embodiment of Medical Image Learning Apparatus]
FIG. 6 is a block diagram showing a fourth embodiment of the medical image learning device 10-4 according to the present invention. In the medical image learning device 10-4 shown in FIG. 6, the parts common to the medical image learning device 10-2 of the second embodiment shown in FIG. omitted.

図６に示す第４実施形態の医療画像学習装置１０－４は、主として第３学習部７０が追加されている点で、図４に示した第２実施形態の医療画像学習装置１０－２と相違する。 The medical image learning device 10-4 of the fourth embodiment shown in FIG. 6 is different from the medical image learning device 10-2 of the second embodiment shown in FIG. 4 mainly in that a third learning unit 70 is added. differ.

第３学習部７０は、第１学習部３０により学習した学習済みＣＮＮのパラメータを初期値として使用し、第１データベース１４に保存されたＲＧＢの通常光画像のデータセットのみを用いて再度学習することにより、通常光画像に対する画像認識を行う学習モデル（第３モデル）を生成する。 The third learning unit 70 uses the learned CNN parameters learned by the first learning unit 30 as initial values, and re-learns using only the RGB normal light image data set stored in the first database 14. Thus, a learning model (third model) for performing image recognition on a normal light image is generated.

第４実施形態の医療画像学習装置１０－４によれば、第１学習部３０により学習した学習済みＣＮＮのパラメータを初期値として使用するため、第３学習部７０は、任意の初期値から通常光画像のデータセットのみを用いて学習する場合に比べて学習時間の短縮化を図ることができる。 According to the medical image learning device 10-4 of the fourth embodiment, the parameters of the learned CNN learned by the first learning unit 30 are used as initial values. The learning time can be shortened compared to the case of learning using only the data set of optical images.

尚、第３学習部７０は、図５に示した第３実施形態の医療画像学習装置１０－３の第１学習部３０により学習した学習済みＣＮＮのパラメータを初期値として使用するようにしてもよい。 The third learning unit 70 may use the learned CNN parameters learned by the first learning unit 30 of the medical image learning device 10-3 of the third embodiment shown in FIG. 5 as initial values. good.

本実施形態の第１データベース１４及び第２データベース１６に保存されている通常光画像及び特殊光画像は、それぞれＲＧＢの「光の３原色」のカラー画像としたが、これに限らず、ＲＧＢの補色の関係にあるシアン（Ｃ）、マゼンタ（Ｍ）、イエロー（Ｙ）の「色の３原色」のカラー画像でもよい。 The normal light image and the special light image stored in the first database 14 and the second database 16 of the present embodiment are RGB "three primary colors of light", respectively. A color image of "three primary colors" of cyan (C), magenta (M), and yellow (Y), which are complementary colors, may also be used.

また、第１データベース１４及び第２データベース１６に保存されている通常光画像及び特殊光画像は、ＲＧＢの色信号から生成される輝度信号（Ｙ）と２つの色差信号（Ｃｒ，Ｃｂ）からなる３チャンネルのカラー画像でもよい。 The normal light image and the special light image stored in the first database 14 and the second database 16 are composed of a luminance signal (Y) generated from RGB color signals and two color difference signals (Cr, Cb). A 3-channel color image may also be used.

尚、ＲＧＢ、ＣＭＹ、ＹＣｒＣｂのカラー画像は、相互に変換できることは言うまでもない。 Needless to say, RGB, CMY, and YCrCb color images can be mutually converted.

［医療画像学習方法］
図７は、本発明に係る医療画像学習方法の実施形態を示すフローチャートであり、図２に示した第１実施形態の医療画像学習装置１０－１の各部の処理手順に関して示している。[Medical image learning method]
FIG. 7 is a flow chart showing an embodiment of the medical image learning method according to the present invention, and shows processing procedures of each part of the medical image learning apparatus 10-1 of the first embodiment shown in FIG.

第１データベース１４及び第２データベース１６には、学習用の通常光画像群のデータセット及び特殊光画像群のデータセットが事前に格納される（データセットを準備するステップ）。 In the first database 14 and the second database 16, a data set of normal light image groups and a data set of special light image groups for learning are stored in advance (step of preparing data sets).

第１学習部３０は、第１データベース１４及び第２データベース１６から通常光画像群のデータセット及び特殊光画像群のデータセットを取得する（ステップＳ１０、Ｓ１２）。第１学習部３０は、１０～１００前後のミニバッチ単位でデータセットを取得してもよいし、１枚ずつ取得するようにしてもよい。 The first learning unit 30 acquires the data set of the normal light image group and the data set of the special light image group from the first database 14 and the second database 16 (steps S10 and S12). The first learning unit 30 may acquire the data set in units of about 10 to 100 mini-batches, or may acquire the data sets one by one.

第１学習部３０は、取得した通常光画像及び特殊光画像のデータセットを用いて学習することにより画像認識用の学習モデル（第１モデル）を生成する（ステップＳ１４）。本例では、学習モデルの一つである学習済みＣＮＮを構築する。 The first learning unit 30 generates a learning model (first model) for image recognition by learning using the acquired data sets of the normal light image and the special light image (step S14). In this example, a trained CNN, which is one of the learning models, is constructed.

続いて、第２学習部４０は、第１学習部３０により学習した学習済みモデルを元に（学習済みＣＮＮのパラメータを初期値として使用し）、第２データベース１６に保存された特殊光画像のデータセットのみを用いて再度学習することにより、特殊光画像に対する画像認識を行う学習モデル（第２モデル）を生成する（ステップＳ１６）。 Subsequently, the second learning unit 40, based on the learned model learned by the first learning unit 30 (using the parameters of the learned CNN as initial values), the special light image stored in the second database 16 By learning again using only the data set, a learning model (second model) for performing image recognition on the special light image is generated (step S16).

この実施形態の医療画像学習方法によれば、第１学習部３０により比較的多く集めることができる通常光画像のデータセットを用いて学習（事前学習）を行うため、画像認識用の学習モデルを良好に生成することができる。そして、第２学習部４０は、第１学習部３０により生成された学習モデルを元に、特殊光画像群のみを用いて再学習を行うため、比較的データ数の少ない特殊光画像群からでも特殊光画像に対して適切な画像認識を行う学習モデルを生成することができる。 According to the medical image learning method of this embodiment, since learning (pre-learning) is performed using a data set of normal light images that can be collected in relatively large numbers by the first learning unit 30, a learning model for image recognition is prepared. can be produced well. Then, based on the learning model generated by the first learning unit 30, the second learning unit 40 re-learns using only the special light image group. It is possible to generate a learning model that performs appropriate image recognition for special light images.

尚、ステップＳ１４では、第１学習部３０は、通常光画像及び特殊光画像のデータセットを用いて学習することにより画像認識用の学習モデルを生成するが、通常光画像のデータセットのみを用いて学習してもよい。 In step S14, the first learning unit 30 generates a learning model for image recognition by learning using the data sets of the normal light image and the special light image, but uses only the data set of the normal light image. You can learn

＜第１変形例＞
図８は、図７に示したステップＳ１４の第１変形例を示すステップＳ１４－１を示す図である。<First modification>
FIG. 8 is a diagram showing step S14-1 showing a first modification of step S14 shown in FIG.

図８において、画像処理部５０（図４）は、第１データベース１４に保存されているＲＧＢの３つの色チャンネルを有する通常光画像、及び第２データベース１６に保存されているＲＧＢの３つの色チャンネルを有する特殊光画像を、それぞれ１つの色チャンネルを有する通常光画像及び特殊光画像（モノクロ画像）に変換する（ステップＳ２０）。 In FIG. 8, the image processing unit 50 (FIG. 4) processes a normal light image having three color channels of RGB stored in the first database 14 and a three color channel of RGB stored in the second database 16. The special light image having channels is converted into a normal light image and a special light image (monochrome image) each having one color channel (step S20).

第１学習部３０は、画像処理部５０により変換されたモノクロ画像を、学習用のデータセットとして取得し、取得したデータセットにより画像認識用の学習モデルの一つであるＣＮＮの学習モデル（第１モデル）を生成する（ステップＳ２２）。尚、ステップＳ２２により学習された学習済みＣＮＮのパラメータは、図７に示すステップＳ１６において、第２学習部４０のＣＮＮの初期値として使用される。 The first learning unit 30 acquires the monochrome image converted by the image processing unit 50 as a learning data set, and uses the acquired data set to acquire a CNN learning model (first learning model), which is one of the learning models for image recognition. 1 model) is generated (step S22). The parameters of the learned CNN learned in step S22 are used as the initial values of the CNN of the second learning unit 40 in step S16 shown in FIG.

＜第２変形例＞
図９は、図７に示したステップＳ１４の第２変形例を示すステップＳ１４－２を示す図である。<Second modification>
FIG. 9 is a diagram showing step S14-2 showing a second modification of step S14 shown in FIG.

図９において、抽出部６０（図５）は、第１データベース１４に保存されているＲＧＢの３つの色チャンネルを有する通常光画像、及び第２データベース１６に保存されているＲＧＢの３つの色チャンネルを有する特殊光画像から、それぞれ１つの色チャンネルを有する通常光画像及び特殊光画像を抽出する（ステップＳ３０）。抽出部６０は、ＲＧＢの３つの色チャンネルを有する通常光画像及び特殊光画像から、特殊光画像の色味に近い１つの色チャンネルの画像を抽出することが好ましい。 9, the extractor 60 (FIG. 5) extracts a normal light image with three RGB color channels stored in the first database 14 and a three RGB color channel image stored in the second database 16. A normal light image and a special light image each having one color channel are extracted from the special light image having , (step S30). The extraction unit 60 preferably extracts an image of one color channel that is close to the color of the special light image from the normal light image and the special light image having three color channels of RGB.

第１学習部３０は、抽出部６０により抽出された１つの色チャンネルを有する通常光画像及び特殊光画像を、学習用のデータセットとして取得し、取得したデータセットにより画像認識用の学習モデルの一つであるＣＮＮの学習モデル（第１モデル）を生成する（ステップＳ３２）。尚、ステップＳ３２により学習された学習済みＣＮＮのパラメータは、図７に示すステップＳ１６において、第２学習部４０のＣＮＮの初期値として使用される。 The first learning unit 30 acquires the normal light image and the special light image having one color channel extracted by the extraction unit 60 as a learning data set, and uses the acquired data set to develop a learning model for image recognition. A learning model (first model) of one CNN is generated (step S32). The parameters of the learned CNN learned in step S32 are used as the initial values of the CNN of the second learning unit 40 in step S16 shown in FIG.

［その他］
図３に示したＣＮＮ３２は、医療画像に写っている病変の種類を画像認識する学習モデルであるが、医療画像に写っている病変の位置（病変領域）を認識するセグメンテーションを行う学習モデルでもよい。この場合のＣＮＮは、ＣＮＮの一種である全層畳み込みネットワーク（ＦＣＮ：Fully Convolution Network）を適用し、医療画像に写っている病変の位置を画素レベルで把握できるものが好ましい。[others]
The CNN 32 shown in FIG. 3 is a learning model for image recognition of the types of lesions shown in medical images, but it may be a learning model that performs segmentation to recognize the position (lesion area) of lesions shown in medical images. . The CNN in this case preferably applies a fully convolutional network (FCN), which is a type of CNN, and can grasp the position of a lesion in a medical image at the pixel level.

また、本発明は、例えばＤＢＮ（Deep Belief Network）、ＳＶＭ（Support Vector Machine）などのＣＮＮ以外の機械学習のモデルにも適用できる。 The present invention can also be applied to machine learning models other than CNN, such as DBN (Deep Belief Network) and SVM (Support Vector Machine).

更に本実施形態の医療画像学習装置１０の各種制御を実行するハードウエア的な構造は、次に示すような各種のプロセッサ（processor）である。各種のプロセッサには、ソフトウェア（プログラム）を実行して各種の制御部として機能する汎用的なプロセッサであるＣＰＵ（Central Processing Unit）、ＦＰＧＡ（Field Programmable Gate Array）などの製造後に回路構成を変更可能なプロセッサであるプログラマブルロジックデバイス（Programmable Logic Device：ＰＬＤ）、ＡＳＩＣ（Application Specific Integrated Circuit）などの特定の処理を実行させるために専用に設計された回路構成を有するプロセッサである専用電気回路などが含まれる。 Further, hardware structures for executing various controls of the medical image learning apparatus 10 of this embodiment are various processors as shown below. For various processors, the circuit configuration can be changed after manufacturing such as CPU (Central Processing Unit), FPGA (Field Programmable Gate Array), which is a general-purpose processor that executes software (program) and functions as various control units. Programmable Logic Device (PLD), which is a processor, ASIC (Application Specific Integrated Circuit), etc. be

１つの処理部は、これら各種のプロセッサのうちの１つで構成されていてもよいし、同種又は異種の２つ以上のプロセッサ（例えば、複数のＦＰＧＡ、あるいはＣＰＵとＦＰＧＡの組み合わせ）で構成されてもよい。また、複数の制御部を１つのプロセッサで構成してもよい。複数の制御部を１つのプロセッサで構成する例としては、第１に、クライアントやサーバなどのコンピュータに代表されるように、１つ以上のＣＰＵとソフトウェアの組合せで１つのプロセッサを構成し、このプロセッサが複数の制御部として機能する形態がある。第２に、システムオンチップ（System On Chip：ＳｏＣ）などに代表されるように、複数の制御部を含むシステム全体の機能を１つのＩＣ（Integrated Circuit）チップで実現するプロセッサを使用する形態がある。このように、各種の制御部は、ハードウエア的な構造として、上記各種のプロセッサを１つ以上用いて構成される。 One processing unit may be composed of one of these various processors, or composed of two or more processors of the same type or different types (for example, a plurality of FPGAs, or a combination of a CPU and an FPGA). may Also, a plurality of control units may be configured by one processor. As an example of configuring a plurality of control units with one processor, first, one processor is configured with a combination of one or more CPUs and software, as typified by a computer such as a client or a server. There is a form in which a processor functions as multiple controllers. Secondly, as typified by System On Chip (SoC), etc., there is a form of using a processor that realizes the functions of the entire system including multiple control units with a single IC (Integrated Circuit) chip. be. In this way, various control units are configured using one or more of the above various processors as a hardware structure.

更にまた、これらの各種のプロセッサのハードウエア的な構造は、より具体的には、半導体素子などの回路素子を組み合わせた電気回路（circuitry）である。 Furthermore, the hardware structure of these various processors is, more specifically, an electric circuit combining circuit elements such as semiconductor elements.

また、本発明は、コンピュータにインストールされることにより、本発明に係る医療画像学習装置として機能させる医療画像学習プログラム、及びこの医療画像学習プログラムが記録された記録媒体を含む。 The present invention also includes a medical image learning program that is installed in a computer to function as the medical image learning device according to the present invention, and a recording medium in which the medical image learning program is recorded.

更に、本発明は上述した実施形態に限定されず、本発明の精神を逸脱しない範囲で種々の変形が可能であることは言うまでもない。 Furthermore, the present invention is not limited to the embodiments described above, and it goes without saying that various modifications are possible without departing from the spirit of the present invention.

１０、１０－１、１０－２、１０－３、１０－４医療画像学習装置
１２通信部
１４第１データベース
１４Ａ通常光画像
１６第２データベース
１８操作部
２０ＣＰＵ
２２ＲＡＭ
２４ＲＯＭ
２６表示部
３０第１学習部
３２Ａ入力層
３２Ｂ中間層
３２Ｃ出力層
３４誤差算出部
３６パラメータ更新部
４０第２学習部
５０画像処理部
６０抽出部
７０第３学習部
Ｓ１０、Ｓ１２、Ｓ１４、Ｓ１４－１、Ｓ１４－２、Ｓ１６、Ｓ２０、Ｓ２２、Ｓ３０、Ｓ３２ステップ10, 10-1, 10-2, 10-3, 10-4 Medical image learning device 12 Communication unit 14 First database 14A Normal light image 16 Second database 18 Operation unit 20 CPU
22 RAMs
24 ROMs
26 display unit 30 first learning unit 32A input layer 32B intermediate layer 32C output layer 34 error calculation unit 36 parameter updating unit 40 second learning unit 50 image processing unit 60 extraction unit 70 third learning unit S10, S12, S14, S14- 1, S14-2, S16, S20, S22, S30, S32 Step

Claims

a first learning unit that generates a first model for image recognition by learning using at least a first image group composed of a plurality of first medical images captured with normal light;
Based on the first model, learning is performed using a second image group consisting of a plurality of second medical images captured with special light to generate a second model for image recognition of the second medical image. a second learning unit;
an image processor that transforms the first medical image having multiple color channels into the first medical image having a single color channel;
The image processing unit converts the first medical image having the plurality of color channels into a monochrome image that approximates the hue of the second medical image, thereby obtaining the first medical image having the one color channel. year,
The first learning unit learns using the first image group having the transformed one color channel.
Medical image learning device.

a first learning unit that generates a first model for image recognition by learning using at least a first image group composed of a plurality of first medical images captured with normal light;
Based on the first model, learning is performed using a second image group consisting of a plurality of second medical images captured with special light to generate a second model for image recognition of the second medical image. a second learning unit;
an image processor that transforms the first medical image having multiple color channels into the first medical image having a single color channel;
The image processing unit converts the first medical image having the plurality of color channels into the first medical image having only a luminance signal to obtain the first medical image having the one color channel;
The first learning unit learns using the first image group having the transformed one color channel.
Medical image learning device.

a first learning unit that generates a first model for image recognition by learning using at least a first image group composed of a plurality of first medical images captured with normal light;
Based on the first model, learning is performed using a second image group consisting of a plurality of second medical images captured with special light to generate a second model for image recognition of the second medical image. a second learning unit;
an image processing unit that converts the first medical image and the second medical image having multiple color channels into the first medical image and the second medical image each having one color channel;
The image processing unit converts the first medical image and the second medical image having the plurality of color channels into a monochrome image that approximates the hue of the second medical image, thereby converting the one color channel The first medical image and the second medical image having
The first learning unit learns using the first group of images and the second group of images having one converted color channel.
Medical image learning device.

a first learning unit that generates a first model for image recognition by learning using at least a first image group composed of a plurality of first medical images captured with normal light;
Based on the first model, learning is performed using a second image group consisting of a plurality of second medical images captured with special light to generate a second model for image recognition of the second medical image. a second learning unit;
an image processing unit that converts the first medical image and the second medical image having multiple color channels into the first medical image and the second medical image each having one color channel;
The image processing unit converts the first medical image and the second medical image having the plurality of color channels into the first medical image and the second medical image having only luminance signals, thereby performing the first medical image and the second medical image. the first medical image and the second medical image having two color channels;
The first learning unit learns using the first group of images and the second group of images having one converted color channel.
Medical image learning device.

a first learning unit that generates a first model for image recognition by learning using at least a first image group composed of a plurality of first medical images captured with normal light;
Based on the first model, learning is performed using a second image group consisting of a plurality of second medical images captured with special light to generate a second model for image recognition of the second medical image. a second learning unit;
an extraction unit for extracting the first medical image having one color channel from the first medical image having multiple color channels;
The extracting unit extracts an image having a color close to that of the second medical image from the first medical image having the plurality of color channels to obtain the first medical image having the one color channel,
The first learning unit learns using the first image group having the extracted one color channel.
Medical image learning device.

a first learning unit that generates a first model for image recognition by learning using at least a first image group composed of a plurality of first medical images captured with normal light;
Based on the first model, learning is performed using a second image group consisting of a plurality of second medical images captured with special light to generate a second model for image recognition of the second medical image. a second learning unit;
an extraction unit for extracting the first medical image and the second medical image having one color channel from the first medical image and the second medical image having multiple color channels, respectively;
The extraction unit extracts an image having a color close to that of the second medical image from the first medical image and the second medical image having the plurality of color channels, thereby obtaining the image having the one color channel. As a first medical image and the second medical image,
The first learning unit learns using the first image group and the second image group having the extracted one color channel.
Medical image learning device.

The medical image learning apparatus according to any one of claims 1 to 6 , wherein the plurality of color channels are three channels of three primary colors, or three channels of a luminance signal and two color difference signals.

a third learning unit that generates a third model for image recognition of the first medical image by learning using the first image group based on the first model;
The medical image learning device according to any one of claims 1 to 7 , comprising:

The medical image learning device according to any one of claims 1 to 8 , wherein the first medical image and the second medical image are images captured by an endoscope apparatus.

10. The medical image learning device according to any one of claims 1 to 9 , wherein the first model and the second model are configured by convolutional neural networks.

preparing a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light;
a first learning unit generating a first model for image recognition by learning using at least the first image group;
a second learning unit learning using the second image group based on the first model to generate a second model for performing image recognition on the second medical image;
an image processor transforming the first medical image having multiple color channels into the first medical image having a single color channel;
The converting step converts the first medical image having the plurality of color channels into a monochrome image that approximates the hue of the second medical image, thereby obtaining the first medical image having the one color channel. year,
generating the first model trains using the first set of images having the transformed one color channel;
Medical image learning device.

preparing a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light;
a first learning unit generating a first model for image recognition by learning using at least the first image group;
a second learning unit learning using the second image group based on the first model to generate a second model for performing image recognition on the second medical image;
an image processor transforming the first medical image having multiple color channels into the first medical image having a single color channel;
the converting step includes converting the first medical image having the plurality of color channels into the first medical image having only a luminance signal, thereby forming the first medical image having the one color channel;
generating the first model trains using the first set of images having the transformed one color channel;
Medical image learning method.

preparing a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light;
a first learning unit generating a first model for image recognition by learning using at least the first image group;
a second learning unit learning using the second image group based on the first model to generate a second model for performing image recognition on the second medical image;
an image processor transforming the first medical image and the second medical image having multiple color channels into the first medical image and the second medical image having a single color channel;
The converting step converts the first medical image and the second medical image having the plurality of color channels into a monochrome image that approximates the hue of the second medical image, thereby converting the one color channel. The first medical image and the second medical image having
generating the first model trains using the first set of images and the second set of images having the transformed one color channel;
Medical image learning method.

preparing a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light;
a first learning unit generating a first model for image recognition by learning using at least the first image group;
a second learning unit learning using the second image group based on the first model to generate a second model for performing image recognition on the second medical image;
an image processor transforming the first medical image and the second medical image having multiple color channels into the first medical image and the second medical image having a single color channel;
The converting step converts the first medical image and the second medical image having the plurality of color channels into the first medical image and the second medical image having only luminance signals, The first medical image and the second medical image having color channels;
generating the first model trains using the first set of images and the second set of images having the transformed one color channel;
Medical image learning method.

preparing a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light;
a first learning unit generating a first model for image recognition by learning using at least the first image group;
a second learning unit learning using the second image group based on the first model to generate a second model for performing image recognition on the second medical image;
an extractor extracting the first medical image having one color channel from the first medical image having multiple color channels;
The extracting step extracts an image having a color close to that of the second medical image from the first medical image having the plurality of color channels, so as to obtain the first medical image having the one color channel. ,
generating the first model trains using the first set of images having the extracted one color channel;
Medical image learning method.

preparing a first image group consisting of a plurality of first medical images captured with normal light and a second image group consisting of a plurality of second medical images captured with special light;
a first learning unit generating a first model for image recognition by learning using at least the first image group;
a second learning unit learning using the second image group based on the first model to generate a second model for performing image recognition on the second medical image;
an extractor extracting the first medical image and the second medical image having one color channel from the first medical image and the second medical image having multiple color channels, respectively;
The extracting step includes extracting an image close to the color tone of the second medical image from the first medical image and the second medical image having the plurality of color channels, and having the one color channel. The first medical image and the second medical image,
generating the first model trains using the first set of images and the second set of images having the extracted one color channel;
Medical image learning method.

A medical image learning program that causes a computer to implement the medical image learning method according to any one of claims 12 to 16 .

A non-transitory computer-readable recording medium, in which the medical image learning program according to claim 17 is recorded.