JP7106741B2

JP7106741B2 - Learning method, learning device, generative model and program

Info

Publication number: JP7106741B2
Application number: JP2021502250A
Authority: JP
Inventors: 彰工藤; 嘉郎北村
Original assignee: Fujifilm Corp
Current assignee: Fujifilm Corp
Priority date: 2019-02-28
Filing date: 2020-02-25
Publication date: 2022-07-26
Anticipated expiration: 2040-02-25
Also published as: WO2020175445A1; EP3932319A1; JPWO2020175445A1; EP3932319A4; US20210374483A1; US11948349B2

Description

本発明は学習方法、学習装置、生成モデル及びプログラムに係り、特に高解像度の画像を生成する技術に関する。 The present invention relates to a learning method, a learning device, a generation model, and a program, and more particularly to technology for generating high-resolution images.

データを作り出す「Generator」（ジェネレータ）と、データを識別する「Discriminator」（ディスクリミネータ）とを交互に学習する敵対的生成ネットワーク（Generative Adversarial Networks：GAN）が知られている。非特許文献１には、ＧＡＮに関する研究が記載されている。ＧＡＮによれば、学習データの特徴に沿った精度の高いデータを生成する生成モデルを学習することができる。 Generative Adversarial Networks (GANs) are known that alternately learn a "Generator" that creates data and a "Discriminator" that identifies data. Non-Patent Document 1 describes research on GANs. According to GAN, it is possible to learn a generative model that generates highly accurate data along the characteristics of learning data.

また、ＧＡＮを応用した技術が研究されている。非特許文献２には、ＧＡＮを用いて入力画像と出力画像とのペアを学習する手法が記載されている。非特許文献２に記載の手法によれば、パラメータの調整の負担の少ない生成モデルを学習することができる。 In addition, research is being conducted on techniques to which GANs are applied. Non-Patent Literature 2 describes a method of learning a pair of an input image and an output image using a GAN. According to the technique described in Non-Patent Document 2, it is possible to learn a generative model with less burden of parameter adjustment.

ＧＡＮの画像生成では、入力データに多様性がある場合に学習が困難であるという問題点がある。例えば、医療画像において、入力とする画像の部位及びスライス条件が多岐にわたる場合、生成画像の特徴が平均化されてしまう。非特許文献２における多様な入力データへの対応においても、学習データの特徴が平均化されてしまう。 GAN image generation has a problem that learning is difficult when there is diversity in the input data. For example, in a medical image, if the input image has a wide variety of parts and slice conditions, the features of the generated image are averaged. Even in dealing with various input data in Non-Patent Document 2, the features of learning data are averaged.

これに対し、非特許文献３には、ＧＡＮの学習にデータのカテゴリ情報を用いる手法が記載されている。非特許文献３に記載の手法によれば、生成モデルをカテゴリ情報に基づいて調整することができる。 On the other hand, Non-Patent Document 3 describes a method of using data category information for GAN learning. According to the method described in Non-Patent Document 3, the generative model can be adjusted based on the category information.

Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio “Generative Adversarial Nets”, arXiv:1406.2661Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio “Generative Adversarial Nets”, arXiv:1406.2661 Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros “Image-to-Image Translation with Conditional Adversarial Networks”,CVPR2016Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros “Image-to-Image Translation with Conditional Adversarial Networks”,CVPR2016 Mehdi Mirza, Simon Osindero “Conditional Generative Adversarial Nets”, arXiv:1411.1784Mehdi Mirza, Simon Osindero “Conditional Generative Adversarial Nets”, arXiv:1411.1784

しかしながら、非特許文献３に記載されたＧＡＮの構成では、Generatorにカテゴリ情報を入力する必要があるため、ネットワークアーキテクチャの大幅な修正が必要となり、Generatorのインターフェースが複雑化するという問題点があった。 However, in the GAN configuration described in Non-Patent Document 3, it is necessary to input category information to the Generator, so there is a problem that the network architecture needs to be significantly modified and the Generator interface becomes complicated. .

また、ＧＡＮでは一部の学習データの影響を強く受けて、Mode collapseと呼ばれる、生成する画像が限定的になってしまい多様性に対応できない状態に陥ることがあるという問題点があった。 In addition, GAN is strongly affected by some learning data, and there is a problem that it may fall into a state called Mode collapse, in which images to be generated become limited and cannot cope with diversity.

本発明はこのような事情に鑑みてなされたもので、パラメータの調整及びネットワークアーキテクチャの大幅な修正をすることなく、多様性を持った学習を行い、高解像情報を含む画像を生成する学習方法、学習装置、生成モデル及びプログラムを提供することを目的とする。 The present invention has been devised in view of such circumstances, and is capable of performing diverse learning and generating images containing high-resolution information without adjusting parameters or significantly modifying the network architecture. An object is to provide a method, a learning device, a generative model and a program.

上記目的を達成するための学習方法の一の態様は、解像度が相対的に低い低解像情報を含む第１画像から解像度が相対的に高い高解像情報を含む第２画像を推定する生成モデルの学習方法であって、第１画像を用いて仮想第２画像を生成する生成モデルであるジェネレータと、学習用第２画像と仮想第２画像とを識別するディスクリミネータと、を備える敵対的生成ネットワークのジェネレータに第１画像のみを入力するジェネレータ入力工程と、ディスクリミネータに学習用第２画像と学習用第２画像の部位情報と、を入力する第１ディスクリミネータ入力工程と、ディスクリミネータに仮想第２画像と仮想第２画像の部位情報と、を入力する第２ディスクリミネータ入力工程と、を備えた学習方法である。 One aspect of the learning method for achieving the above object is generation of estimating a second image containing high-resolution information with relatively high resolution from a first image containing low-resolution information with relatively low resolution. A method of training a model, comprising: a generator that is a generative model that uses a first image to generate a virtual second image; and a discriminator that distinguishes between the training second image and the virtual second image. a generator input step of inputting only the first image to the generator of the target generation network; a first discriminator input step of inputting the second learning image and part information of the second learning image to the discriminator; a second discriminator input step of inputting a second virtual image and part information of the second virtual image to a discriminator.

本態様によれば、敵対的生成ネットワークのジェネレータに第１画像のみを入力し、ディスクリミネータに部位情報を入力し、部位情報を入力に加えたことによって条件ごとに学習を行うようにしたので、パラメータの調整及びネットワークアーキテクチャの大幅な修正をすることなく、多様性を持った学習を行い、高解像情報を含む画像を生成することができる。 According to this aspect, only the first image is input to the generator of the hostile generation network, part information is input to the discriminator, and learning is performed for each condition by adding the part information to the input. , it can learn with diversity and generate images containing high-resolution information without parameter tuning and significant modification of the network architecture.

解像度が相対的に低い低解像情報を含む第１画像は、例えば低解像度である。解像度が相対的に高い高解像情報を含む第２画像は、例えば高解像度画像である。また第２画像は、解像度が相対的に高い高解像情報を含む高周波成分画像であってもよい。 The first image containing low-resolution information with relatively low resolution is, for example, low-resolution. The second image containing high-resolution information with relatively high resolution is, for example, a high-resolution image. Also, the second image may be a high-frequency component image containing high-resolution information with relatively high resolution.

部位情報は、人体の頭部、胸部、腹部、及び脚部を含むことが好ましい。これにより、多様な人体の部位の低解像情報を含む画像から高解像情報を含む画像を生成することができる。 The part information preferably includes the head, chest, abdomen, and legs of the human body. As a result, an image containing high-resolution information can be generated from an image containing low-resolution information of various parts of the human body.

第１画像は３次元断層画像であり、解像度はスライス厚方向の解像度であることが好ましい。これにより、スライス厚方向に高解像情報を含む３次元断層画像を生成することができる。 The first image is a three-dimensional tomographic image, and the resolution is preferably the resolution in the slice thickness direction. Thereby, a three-dimensional tomographic image including high-resolution information in the slice thickness direction can be generated.

第１ディスクリミネータ入力工程及び第２ディスクリミネータ入力工程は、第１画像のスライス情報をディスクリミネータに入力することが好ましい。これにより、入力とする画像のスライス情報に多様性がある場合であっても、高解像度の画像を生成することができる。 Preferably, the first discriminator input step and the second discriminator input step input slice information of the first image to the discriminator. As a result, a high-resolution image can be generated even when slice information of an input image is diverse.

スライス情報は、スライス間隔であることが好ましい。これにより、入力とする画像のスライス間隔に多様性がある場合であっても、高解像度の画像を生成することができる。 The slice information is preferably a slice interval. As a result, a high-resolution image can be generated even when the input image has a variety of slice intervals.

スライス情報は、スライス厚であることが好ましい。これにより、入力とする画像のスライス厚に多様性がある場合であっても、高解像度の画像を生成することができる。 The slice information is preferably slice thickness. As a result, a high-resolution image can be generated even when the slice thickness of the input image is diverse.

第１ディスクリミネータ入力工程及び第２ディスクリミネータ入力工程は、第１画像をディスクリミネータに入力することが好ましい。これにより、ディスクリミネータは、学習用第２画像及び仮想第２画像を適切に識別することができる。 Preferably, the first discriminator input step and the second discriminator input step input the first image to the discriminator. Thereby, the discriminator can appropriately identify the second learning image and the virtual second image.

学習用第２画像から第１画像を生成する第１画像生成工程を備え、ジェネレータ入力工程は、ジェネレータに生成した第１画像のみを入力することが好ましい。これにより、ジェネレータに入力するための低解像情報を含む画像を適切に取得することができる。 Preferably, the first image generating step generates the first image from the second learning image, and the generator input step inputs only the generated first image to the generator. Thereby, an image including low-resolution information for inputting to the generator can be properly acquired.

上記目的を達成するためのコンピュータに実行させるためのプログラムの一の態様は、上記に記載の学習方法をコンピュータに実行させるためのプログラムである。 One aspect of a program to be executed by a computer to achieve the above object is a program for causing a computer to execute the learning method described above.

上記目的を達成するための生成モデルの一の態様は、上記に記載の学習方法によって学習された、解像度が相対的に低い低解像情報を含む第１画像から解像度が相対的に高い高解像情報を含む第２画像を推定する生成モデルである。 One aspect of the generative model for achieving the above object is a high-resolution relatively high-resolution first image containing relatively low-resolution low-resolution information learned by the learning method described above. A generative model for estimating a second image containing image information.

上記目的を達成するための学習装置の一の態様は、解像度が相対的に低い低解像情報を含む第１画像から解像度が相対的に高い高解像情報を含む第２画像を推定する生成モデルの学習装置であって、第１画像を用いて仮想第２画像を生成する生成モデルであるジェネレータと、学習用第２画像と仮想第２画像とを識別するディスクリミネータと、を備える敵対的生成ネットワークと、ジェネレータに第１画像のみを入力するジェネレータ入力部と、ディスクリミネータに学習用第２画像と学習用第２画像の部位情報と、を入力する第１ディスクリミネータ入力部と、ディスクリミネータに仮想第２画像と仮想第２画像の部位情報と、を入力する第２ディスクリミネータ入力部と、を備えた学習装置である。 One aspect of the learning device for achieving the above object is generation of estimating a second image containing high-resolution information with relatively high resolution from a first image containing low-resolution information with relatively low resolution. Apparatus for learning a model, comprising: a generator, a generative model that uses a first image to generate a virtual second image; and a discriminator that distinguishes between the training second image and the virtual second image. a generator input unit that inputs only the first image to the generator; and a first discriminator input unit that inputs the second learning image and part information of the second learning image to the discriminator; and a second discriminator input unit for inputting a second virtual image and part information of the second virtual image to the discriminator.

本発明によれば、パラメータの調整及びネットワークアーキテクチャの大幅な修正をすることなく、多様性を持った学習を行い、高解像情報を含む画像を生成することができる。 According to the present invention, it is possible to perform diverse learning and generate images containing high-resolution information without adjusting parameters and modifying network architecture significantly.

図１は、本発明に係る医療画像学習装置のハードウェア構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of the hardware configuration of a medical image learning device according to the present invention. 図２は、ＣＴ画像のスライス間隔、及びスライス厚を説明するための図である。FIG. 2 is a diagram for explaining the slice interval and slice thickness of a CT image. 図３は、ＣＴ画像のスライス間隔、及びスライス厚を説明するための図である。FIG. 3 is a diagram for explaining the slice interval and slice thickness of a CT image. 図４は、ＣＴ画像のスライス間隔、及びスライス厚を説明するための図である。FIG. 4 is a diagram for explaining the slice interval and slice thickness of a CT image. 図５は、ＣＴ画像のスライス間隔、及びスライス厚を説明するための図である。FIG. 5 is a diagram for explaining the slice interval and slice thickness of a CT image. 図６は、第１の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。FIG. 6 is a functional block diagram showing learning processing of the medical image learning device 10 according to the first embodiment. 図７は、学習時の識別部４４による識別を説明するための図である。FIG. 7 is a diagram for explaining identification by the identification unit 44 during learning. 図８は、医療画像学習装置１０の医療画像学習方法の一例を示すフローチャートである。FIG. 8 is a flow chart showing an example of the medical image learning method of the medical image learning device 10. As shown in FIG. 図９は、メタ情報Ｄ１を与えずに全身の画像を学習した画像生成を説明するための図である。FIG. 9 is a diagram for explaining image generation by learning a whole-body image without providing the meta-information D1. 図１０は、メタ情報Ｄ１を与えずに全身の画像を学習した画像生成を説明するための図である。FIG. 10 is a diagram for explaining image generation by learning whole-body images without providing the meta information D1. 図１１は、脚部の画像を示す図である。FIG. 11 is a diagram showing an image of a leg. 図１２は、実施形態の効果を説明するための図である。FIG. 12 is a diagram for explaining the effect of the embodiment; 図１３は、実施形態の効果を説明するための図である。FIG. 13 is a diagram for explaining the effect of the embodiment; 図１４は、実施形態の効果を説明するための図である。FIG. 14 is a diagram for explaining the effect of the embodiment; 図１５は、第２の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。FIG. 15 is a functional block diagram showing learning processing of the medical image learning device 10 according to the second embodiment. 図１６は、第３の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。FIG. 16 is a functional block diagram showing learning processing of the medical image learning device 10 according to the third embodiment. 図１７は、第４の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。FIG. 17 is a functional block diagram showing learning processing of the medical image learning device 10 according to the fourth embodiment.

以下、添付図面に従って本発明の好ましい実施形態について詳説する。 Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

＜医療画像学習装置のハードウェア構成＞
図１は、本発明に係る医療画像学習装置のハードウェア構成の一例を示すブロック図である。医療画像学習装置１０は、パーソナルコンピュータ又はワークステーションによって構成される。医療画像学習装置１０は、主として通信部１２、画像データベース１４、操作部１６、ＣＰＵ（Central Processing Unit）１８、ＲＡＭ（Random Access Memory）２０、ＲＯＭ（Read Only Memory）２２、及び表示部２４から構成される。<Hardware configuration of medical image learning device>
FIG. 1 is a block diagram showing an example of the hardware configuration of a medical image learning device according to the present invention. The medical image learning device 10 is configured by a personal computer or workstation. The medical image learning device 10 mainly includes a communication unit 12, an image database 14, an operation unit 16, a CPU (Central Processing Unit) 18, a RAM (Random Access Memory) 20, a ROM (Read Only Memory) 22, and a display unit 24. be done.

通信部１２は、有線又は無線により外部装置との通信処理を行い、外部装置との間で情報のやり取りを行うインターフェースである。 The communication unit 12 is an interface that performs wired or wireless communication processing with an external device and exchanges information with the external device.

画像データベース１４は、医療用Ｘ線ＣＴ（Computed Tomography）装置によって撮影されたＣＴ再構成画像（ＣＴ画像）を保存する大容量ストレージ装置である。画像データベース１４は、医療画像学習装置１０の外部に設けてもよい。この場合、医療画像学習装置１０は、通信部１２を介して画像データベース１４からＣＴ画像を取得する。 The image database 14 is a large-capacity storage device that stores CT reconstructed images (CT images) captured by a medical X-ray CT (Computed Tomography) apparatus. The image database 14 may be provided outside the medical image learning device 10 . In this case, the medical image learning device 10 acquires CT images from the image database 14 via the communication unit 12 .

操作部１６は、医療画像学習装置１０に対する各種の操作入力を受け付ける入力インターフェースである。操作部１６は、コンピュータに有線接続又は無線接続される不図示のキーボード又はマウス等が用いられる。 The operation unit 16 is an input interface that receives various operation inputs to the medical image learning device 10 . The operation unit 16 uses a keyboard, mouse, or the like (not shown) that is wired or wirelessly connected to the computer.

ＣＰＵ１８は、ＲＯＭ２２又は不図示のハードディスク装置等に記憶された各種のプログラムを読み出し、各種の処理を実行する。ＲＡＭ２０は、ＣＰＵ１８の作業領域として使用される。また、ＲＡＭ２０は、読み出されたプログラム及び各種のデータを一時的に記憶する記憶部として用いられる。医療画像学習装置１０は、ＧＰＵ（Graphics Processing Unit）を備えていてもよい。 The CPU 18 reads various programs stored in the ROM 22 or a hard disk device (not shown) and executes various processes. RAM 20 is used as a work area for CPU 18 . Also, the RAM 20 is used as a storage unit that temporarily stores read programs and various data. The medical image learning device 10 may include a GPU (Graphics Processing Unit).

表示部２４は、医療画像学習装置１０の必要な情報が表示される出力インターフェースである。表示部２４は、コンピュータに接続可能な不図示の液晶モニタ等が用いられる。 The display unit 24 is an output interface that displays information necessary for the medical image learning device 10 . A liquid crystal monitor (not shown) or the like connectable to a computer is used as the display unit 24 .

医療画像学習装置１０は、操作部１６からの指示入力により、ＣＰＵ１８がＲＯＭ２２又はハードディスク装置等に記憶されている医療画像学習プログラムを読み出し、医療画像学習プログラムを実行する。これにより、後述する医療画像学習方法が実施され、ＣＴ画像における解像度が相対的に低い低解像情報を含む第１画像から解像度が相対的に高い高解像情報を含む第２画像を推定する生成モデルの学習が行われる。 In the medical image learning apparatus 10, the CPU 18 reads out a medical image learning program stored in the ROM 22, a hard disk device, or the like, and executes the medical image learning program according to an instruction input from the operation unit 16. FIG. As a result, a medical image learning method, which will be described later, is performed to estimate a second image containing high-resolution information with a relatively high resolution from a first image containing low-resolution information with a relatively low resolution in the CT image. A generative model is trained.

医療画像学習方法をコンピュータに実行させる医療画像学習プログラムは、コンピュータの読取可能な非一時的な記録媒体に記憶させて提供されてもよい。 A medical image learning program that causes a computer to execute the medical image learning method may be stored in a computer-readable non-temporary recording medium and provided.

＜ＣＴ画像の部位情報及びスライス情報＞
画像データベース１４に保存されるＣＴ画像は、人体（被検体）を撮影した医療画像であり、複数のスライス画像（アキシャル断面画像）を含む３次元断層画像である。ここでは、各スライス画像は互いに直交するＸ方向及びＹ方向に平行な画像である。Ｘ方向及びＹ方向に直交するＺ方向は、被検体の体軸方向であり、スライス厚方向ともいう。 <Part information and slice information of CT image>
The CT images stored in the image database 14 are medical images obtained by imaging a human body (subject), and are three-dimensional tomographic images including a plurality of slice images (axial cross-sectional images). Here, each slice image is an image parallel to the mutually orthogonal X and Y directions. The Z direction perpendicular to the X and Y directions is the body axis direction of the subject and is also called the slice thickness direction.

画像データベース１４に保存されるＣＴ画像は、人体の部位毎の画像である。ＣＴ画像は、部位毎に撮影された画像であってもよいし、全身を撮影した画像から部位毎に切り出された画像であってもよい。画像データベース１４に保存される各ＣＴ画像には、それぞれ部位情報とスライス情報とが関連付けて保存されている。 The CT images stored in the image database 14 are images of parts of the human body. The CT image may be an image captured for each part, or may be an image cut out for each part from an image obtained by capturing the whole body. In each CT image stored in the image database 14, part information and slice information are stored in association with each other.

部位情報は、ＣＴ画像に写った部位を示す情報である。部位情報は、頭部、胸部、腹部、及び脚部を含む。部位情報は、腰部を含んでもよい。 The part information is information indicating the part shown in the CT image. The part information includes the head, chest, abdomen, and legs. The part information may include the waist.

また、スライス情報は、ＣＴ画像のスライス間隔、及びスライス厚の少なくとも一方を含む。スライス間隔とは、互いに隣り合うスライス画像同士のＺ方向の位置の差であり、スライススペーシング（Slice Spacing）ともいう。また、スライス厚とは、１つのスライス画像のＺ方向の幅（厚み）であり、スライスシックネス（Slice Thickness）ともいう。 Also, the slice information includes at least one of the slice interval and slice thickness of the CT image. The slice interval is the difference in Z-direction position between adjacent slice images, and is also called slice spacing. The slice thickness is the width (thickness) of one slice image in the Z direction, and is also called slice thickness.

図２～図５は、ＣＴ画像のスライス間隔、及びスライス厚を説明するための図である。図２に示すＣＴ画像Ｇ１は、スライス画像Ｇ１１、Ｇ１２、及びＧ１３を含んで構成される。ＣＴ画像Ｇ１は、スライス間隔Ｐが４ｍｍ、スライス厚Ｔが４ｍｍである。即ち、スライス画像Ｇ１１とスライス画像Ｇ１２とのＺ方向の位置の差、及びスライス画像Ｇ１２とスライス画像Ｇ１３とのＺ方向の位置の差は、それぞれ４ｍｍである。また、スライス画像Ｇ１１、Ｇ１２、及びＧ１３のＺ方向の厚みは、それぞれ４ｍｍである。ＣＴ画像Ｇ１は、スライス間隔Ｐとスライス厚Ｔとが等しいため、スライス画像Ｇ１１とスライス画像Ｇ１２、及びスライス画像Ｇ１２とスライス画像Ｇ１３は、Ｚ方向に接する。 2 to 5 are diagrams for explaining the slice interval and slice thickness of CT images. A CT image G1 shown in FIG. 2 includes slice images G11, G12, and G13. The CT image G1 has a slice interval P of 4 mm and a slice thickness T of 4 mm. That is, the difference in Z-direction position between slice image G11 and slice image G12 and the difference in Z-direction position between slice image G12 and slice image G13 are each 4 mm. Also, the thickness in the Z direction of the slice images G11, G12, and G13 is 4 mm. Since the CT image G1 has the same slice interval P and slice thickness T, the slice image G11 and the slice image G12, and the slice image G12 and the slice image G13 are in contact with each other in the Z direction.

図３に示すＣＴ画像Ｇ２は、スライス画像Ｇ２１、Ｇ２２、及びＧ２３を含んで構成される。ＣＴ画像Ｇ２は、スライス間隔Ｐが４ｍｍ、スライス厚Ｔが６ｍｍである。ＣＴ画像Ｇ２は、スライス間隔Ｐよりもスライス厚Ｔの方が大きいため、スライス画像Ｇ２１とスライス画像Ｇ２２、及びスライス画像Ｇ２２とスライス画像Ｇ２３は、Ｚ方向にオーバーラップする。 A CT image G2 shown in FIG. 3 includes slice images G21, G22, and G23. The CT image G2 has a slice interval P of 4 mm and a slice thickness T of 6 mm. Since the slice thickness T of the CT image G2 is larger than the slice interval P, the slice images G21 and G22, and the slice images G22 and G23 overlap in the Z direction. do.

図４に示すＣＴ画像Ｇ３は、スライス画像Ｇ３１、Ｇ３２、及びＧ３３を含んで構成される。ＣＴ画像Ｇ３は、スライス間隔Ｐが８ｍｍ、スライス厚Ｔが４ｍｍである。ＣＴ画像Ｇ３は、スライス厚Ｔよりもスライス間隔Ｐの方が大きいため、スライス画像Ｇ３１とスライス画像Ｇ３２、及びスライス画像Ｇ３２とスライス画像Ｇ３３は、Ｚ方向に離間する。 A CT image G3 shown in FIG. 4 includes slice images G31, G32, and G33. The CT image G3 has a slice interval P of 8 mm and a slice thickness T of 4 mm. Since the slice interval P of the CT image G3 is larger than the slice thickness T , the slice images G31 and G32 , and the slice images G32 and G33 are separated in the Z direction. .

図５に示すＣＴ画像Ｇ４は、スライス画像Ｇ４１、Ｇ４２、及びＧ４３を含んで構成される。ＣＴ画像Ｇ４は、スライス間隔Ｐが１ｍｍ、スライス厚Ｔが１ｍｍである。ＣＴ画像Ｇ４は、ＣＴ画像Ｇ１、Ｇ２、及びＧ３よりもＺ方向の情報量が多い。即ち、ＣＴ画像Ｇ４は、ＣＴ画像Ｇ１、Ｇ２、及びＧ３よりもＺ方向の解像度が相対的に高い。 A CT image G4 shown in FIG. 5 includes slice images G41, G42, and G43. The CT image G4 has a slice interval P of 1 mm and a slice thickness T of 1 mm. CT image G4 has more information in the Z direction than CT images G1, G2, and G3. That is, the CT image G4 has relatively higher resolution in the Z direction than the CT images G1, G2, and G3.

ＣＴ画像のスライス間隔及びスライス厚は、ＣＴ装置を使用する施設、医師等の好みに応じて様々な条件で設定される。ＣＴ画像は、診断のためには高解像度であることが好ましいが、被検体への被ばく量が増えてしまうという問題点がある。また、ＣＴ画像はデータ量が大きいため、容量削減のために低解像度化して保存される場合もある。 The slice intervals and slice thicknesses of CT images are set under various conditions according to the preferences of facilities, doctors, etc., using the CT apparatus. A CT image preferably has a high resolution for diagnosis, but there is a problem that the dose of exposure to the subject increases. In addition, since the CT image has a large amount of data, it may be saved at a low resolution in order to reduce the capacity.

医療画像学習装置１０は、図２～図４に示すような様々な条件のスライス間隔及びスライス厚の低解像度のＣＴ画像から、図５に示すスライス間隔が１ｍｍ、スライス厚が１ｍｍの高解像度のＣＴ画像を推定する生成モデルの学習を行う。 The medical image learning apparatus 10 uses low-resolution CT images with slice intervals and slice thicknesses under various conditions as shown in FIGS. A generative model for estimating CT images is trained.

＜第１の実施形態＞
〔医療画像学習装置の機能ブロック〕
図６は、第１の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。医療画像学習装置１０は、入力部３０及び学習部４０を備えている。入力部３０及び学習部４０の機能は、図１に示した医療画像学習装置１０の各部によって実現される。<First embodiment>
[Functional block of medical image learning device]
FIG. 6 is a functional block diagram showing learning processing of the medical image learning device 10 according to the first embodiment. The medical image learning device 10 includes an input section 30 and a learning section 40 . The functions of the input unit 30 and the learning unit 40 are realized by each unit of the medical image learning device 10 shown in FIG.

入力部３０は、学習部４０が学習するために必要なデータ（学習データ）を生成し、生成した学習データを学習部４０に入力する。入力部３０は、画像データベース１４（図１参照）から不図示のＣＴ装置によって撮影されたオリジナル高解像度画像ＧＲ（Real Thin画像）、及びオリジナル高解像度画像ＧＲに関連付けられた部位情報及びスライス情報を取得する。 The input unit 30 generates data (learning data) necessary for the learning unit 40 to learn, and inputs the generated learning data to the learning unit 40 . The input unit 30 receives an original high-resolution image GR (Real Thin image) captured by a CT apparatus (not shown) from the image database 14 (see FIG. 1), and part information and slice information associated with the original high-resolution image GR. get.

入力部３０は、低解像度変換部３２を備える。低解像度変換部３２は、入力されたオリジナル高解像度画像ＧＲのデータを元に、多様な低解像度画像ＧＬ１（Thick画像）を人工的に生成する。低解像度変換部３２は、例えば、１ｍｍに等方化されたスライス画像に対して姿勢変換を施し、ランダムに領域を切り出し、スライス間隔及びスライス厚が４ｍｍ又は８ｍｍ等の低解像度画像ＧＬ１を仮想的に生成する。 The input unit 30 has a low resolution conversion unit 32 . The low-resolution conversion unit 32 artificially generates various low-resolution images GL1 (thick images) based on the input data of the original high-resolution image GR. For example, the low-resolution conversion unit 32 performs attitude conversion on a slice image isotropic to 1 mm, cuts out regions at random, and converts a low-resolution image GL1 with a slice interval and a slice thickness of 4 mm or 8 mm, etc., into a virtual image. to generate

また、低解像度変換部３２は、入力されたオリジナル高解像度画像ＧＲを単純に間引いて減らすダウンサンプリング処理によって低解像度画像ＧＬ１を生成してもよい。 Alternatively, the low-resolution conversion unit 32 may generate the low-resolution image GL1 by down-sampling processing that simply thins out and reduces the input original high-resolution image GR.

入力部３０は、オリジナル高解像度画像ＧＲ及び低解像度画像ＧＬ１を学習部４０に入力する。さらに、入力部３０は、メタ情報Ｄ１（Condition）を学習部４０に入力する。 The input unit 30 inputs the original high resolution image GR and the low resolution image GL1 to the learning unit 40 . Furthermore, the input unit 30 inputs meta information D1 (Condition) to the learning unit 40 .

メタ情報Ｄ１は、部位情報とスライス情報を含む。部位情報は、オリジナル高解像度画像ＧＲ、及び低解像度画像ＧＬ１に共通の情報である。スライス情報は、低解像度画像ＧＬ１のスライス情報を用いる。低解像度画像ＧＬ１のスライス情報は、オリジナル高解像度画像ＧＲに関連付けられたスライス情報、及び低解像度変換部３２が行った低解像度変換処理の内容から取得することができる。 The meta information D1 includes part information and slice information. The part information is information common to the original high-resolution image GR and the low-resolution image GL1. As the slice information, the slice information of the low-resolution image GL1 is used. The slice information of the low-resolution image GL1 can be obtained from the slice information associated with the original high-resolution image GR and the contents of the low-resolution conversion processing performed by the low-resolution conversion unit 32.

学習部４０は、非特許文献３に記載のアーキテクチャを２次元から３次元データへ拡張した構造をベースとしている。学習部４０は、敵対的生成ネットワーク（Generative Adversarial Networks：GAN）を構築する。ＧＡＮは、データを作り出す「Generator」と、データを識別する「Discriminator」とを備える。学習部４０は、「Generator」に相当する生成部４２と、「Discriminator」に相当する識別部４４と、を備える。学習部４０は、入力部３０から入力されたデータに基づいて、識別部４４を用いた敵対的な学習を繰り返すことで、生成部４２を学習する。生成部４２は、解像度が相対的に低い低解像情報を含む第１画像から解像度が相対的に高い高解像情報を含む第２画像を推定する生成モデルに相当する。 The learning unit 40 is based on a structure obtained by extending the architecture described in Non-Patent Document 3 from two-dimensional data to three-dimensional data. The learning unit 40 builds Generative Adversarial Networks (GAN). A GAN has a "Generator" that creates data and a "Discriminator" that identifies data. The learning unit 40 includes a generating unit 42 corresponding to "Generator" and a discriminating unit 44 corresponding to "Discriminator". The learning unit 40 learns the generation unit 42 by repeating hostile learning using the identification unit 44 based on the data input from the input unit 30 . The generation unit 42 corresponds to a generation model for estimating a second image containing high-resolution information with relatively high resolution from a first image containing low-resolution information with relatively low resolution.

生成部４２には、入力部３０（ジェネレータ入力部の一例）から低解像度画像ＧＬ１（第１画像の一例）のみが入力される。生成部４２は、入力された低解像度画像ＧＬ１から、オリジナル高解像度画像ＧＲと同様の解像度を有する仮想高解像度画像ＧＶを生成する。 Only the low-resolution image GL1 (an example of the first image) is input to the generation unit 42 from the input unit 30 (an example of the generator input unit). The generator 42 generates a virtual high-resolution image GV having the same resolution as the original high-resolution image GR from the input low-resolution image GL1.

識別部４４には、入力部３０から入力されるオリジナル高解像度画像ＧＲ（学習用第２画像の一例）と低解像度画像ＧＬ１とのペア、又は生成部４２（ディスクリミネータ入力部の一例）から入力される仮想高解像度画像ＧＶ（仮想第２画像の一例）と低解像度画像ＧＬ１とのペアが入力される。 The identifying unit 44 receives a pair of an original high-resolution image GR (an example of a second image for learning) and a low-resolution image GL1 input from the input unit 30, or a pair from the generating unit 42 (an example of a discriminator input unit). A pair of an input virtual high-resolution image GV (an example of a second virtual image) and a low-resolution image GL1 is input.

さらに、識別部４４には、新しいチャンネルが確保される。識別部４４の新しいチャンネルには、メタ情報Ｄ１が入力される。識別部４４は、低解像度画像ＧＬ１及びメタ情報Ｄ１を用いて、入力された高解像度画像がオリジナル高解像度画像ＧＲ、及び仮想高解像度画像ＧＶのいずれであるかを識別する。識別部４４の識別結果は、生成部４２に入力される。 Furthermore, a new channel is reserved for the identification unit 44 . Meta information D1 is input to the new channel of the identification unit 44 . Using the low-resolution image GL1 and the meta information D1, the identification unit 44 identifies whether the input high-resolution image is the original high-resolution image GR or the virtual high-resolution image GV. The identification result of the identification unit 44 is input to the generation unit 42 .

図７は、学習時の識別部４４による識別を説明するための図である。図７に示すＦ７Ａの場合は、識別部４４に、低解像度画像ＧＬ１、メタ情報Ｄ１、及びオリジナル高解像度画像ＧＲが入力されている。したがって、識別部４４が入力された高解像度画像をオリジナル高解像度画像ＧＲであると識別した場合は正解であり、仮想高解像度画像ＧＶであると識別した場合は不正解である。 FIG. 7 is a diagram for explaining identification by the identification unit 44 during learning. In the case of F7A shown in FIG. 7, the low-resolution image GL1, the meta information D1, and the original high-resolution image GR are input to the identification unit 44. FIG. Therefore, if the identifying unit 44 identifies the input high-resolution image as the original high-resolution image GR, it is correct, and if it identifies it as the virtual high-resolution image GV, it is incorrect.

一方、図７に示すＦ７Ｂの場合は、識別部４４に、低解像度画像ＧＬ１、メタ情報Ｄ１、及び生成部４２によって生成された仮想高解像度画像ＧＶが入力されている。したがって、識別部４４が入力された高解像度画像をオリジナル高解像度画像ＧＲであると識別した場合は不正解であり、仮想高解像度画像ＧＶであると識別した場合は正解である。 On the other hand, in the case of F7B shown in FIG. 7, the low-resolution image GL1, the meta information D1, and the virtual high-resolution image GV generated by the generation unit 42 are input to the identification unit 44. FIG. Therefore, if the identifying unit 44 identifies the input high-resolution image as the original high-resolution image GR, it is incorrect, and if it identifies it as the virtual high-resolution image GV, it is correct.

識別部４４は、入力された高解像度画像が不図示のＣＴ装置によって撮影された本物のＣＴ画像であるか、又は生成部４２によって生成された仮想のＣＴ画像であるか、の識別を正解するように学習される。一方、生成部４２は、不図示のＣＴ装置によって撮影されたリアルなＣＴ画像に似せた仮想のＣＴ画像を生成し、識別部４４が識別を不正解とするように学習される。 The identification unit 44 correctly identifies whether the input high-resolution image is a real CT image captured by a CT device (not shown) or a virtual CT image generated by the generation unit 42. is learned as On the other hand, the generation unit 42 generates a virtual CT image that resembles a real CT image taken by a CT device (not shown), and the identification unit 44 learns to make the identification incorrect.

学習が進行すると、生成部４２と識別部４４とが精度を高めあい、生成部４２は識別部４４に識別されない、より本物のＣＴ画像に近い仮想高解像度画像ＧＶを生成できるようになる。この学習によって獲得された生成部４２が生成モデルである。生成部４２及び識別部４４は、例えば畳み込みニューラルネットワークを用いて学習される。 As the learning progresses, the generation unit 42 and the identification unit 44 mutually improve their accuracy, and the generation unit 42 can generate a virtual high-resolution image GV that is closer to the real CT image and is not identified by the identification unit 44 . The generator 42 acquired by this learning is the generative model. The generation unit 42 and the identification unit 44 are trained using, for example, a convolutional neural network.

〔医療画像学習方法〕
図８は、医療画像学習装置１０の医療画像学習方法の一例を示すフローチャートである。図８に示すように、医療画像学習方法は、オリジナル画像取得工程（ステップＳ１）、低解像度画像生成工程（ステップＳ２）、低解像度画像入力工程（ステップＳ３）、仮想高解像度画像生成工程（ステップＳ４）、メタ情報入力工程（ステップＳ５）、高解像度画像入力工程（ステップＳ６）、及び学習工程（ステップＳ７）を備える。[Medical image learning method]
FIG. 8 is a flow chart showing an example of the medical image learning method of the medical image learning device 10. As shown in FIG. As shown in FIG. 8, the medical image learning method includes an original image acquisition step (step S1), a low resolution image generation step (step S2), a low resolution image input step (step S3), a virtual high resolution image generation step (step S4), a meta information input step (step S5), a high resolution image input step (step S6), and a learning step (step S7).

ステップＳ１では、入力部３０は、画像データベース１４からオリジナル高解像度画像ＧＲを取得する。ここでは、スライス間隔が１ｍｍ、スライス厚が１ｍｍのオリジナル高解像度画像ＧＲを取得する。 At step S1 , the input unit 30 acquires the original high-resolution image GR from the image database 14 . Here, an original high-resolution image GR with a slice interval of 1 mm and a slice thickness of 1 mm is acquired.

ステップＳ２（第１画像生成工程の一例）では、低解像度変換部３２は、入力されたオリジナル高解像度画像ＧＲから低解像度画像ＧＬ１を生成する。ここでは、低解像度変換部３２は、スライス間隔が５ｍｍ、スライス厚が１ｍｍの低解像度画像ＧＬ１を生成する。即ち、オリジナル高解像度画像ＧＲと低解像度画像ＧＬ１とは、Ｚ方向の解像度が異なり、Ｘ方向及びＹ方向は同じ解像度を有する。 In step S2 (an example of the first image generating step), the low-resolution converter 32 generates a low-resolution image GL1 from the input original high-resolution image GR. Here, the low-resolution converter 32 generates a low-resolution image GL1 with a slice interval of 5 mm and a slice thickness of 1 mm. That is, the original high-resolution image GR and the low-resolution image GL1 have different resolutions in the Z direction, but have the same resolution in the X and Y directions.

ステップＳ３（ジェネレータ入力工程の一例）では、入力部３０は、ステップＳ２で生成した低解像度画像ＧＬ１を学習部４０の生成部４２に入力する。生成部４２には、低解像度画像ＧＬ１のみが入力される。 In step S3 (an example of a generator input step), the input unit 30 inputs the low-resolution image GL1 generated in step S2 to the generation unit 42 of the learning unit 40 . Only the low-resolution image GL1 is input to the generation unit 42 .

ステップＳ４では、生成部４２は、入力された低解像度画像ＧＬ１から、仮想高解像度画像ＧＶを生成する。ここでは、生成部４２は、スライス間隔が１ｍｍ、スライス厚が１ｍｍの仮想高解像度画像ＧＶを生成する。 In step S4, the generator 42 generates a virtual high-resolution image GV from the input low-resolution image GL1. Here, the generator 42 generates a virtual high-resolution image GV with a slice interval of 1 mm and a slice thickness of 1 mm.

ステップＳ５（第１ディスクリミネータ入力工程の一例、第２ディスクリミネータ入力工程の一例）では、入力部３０（第１ディスクリミネータ入力部の一例、第２ディスクリミネータ入力部の一例）は、メタ情報Ｄ１を学習部４０の識別部４４に入力する。メタ情報Ｄ１は、ステップＳ６で識別部４４に入力される高解像度画像の部位情報と低解像度画像ＧＬ１のスライス情報である。なお、オリジナル高解像度画像ＧＲ及び仮想高解像度画像ＧＶの部位情報は、低解像度画像ＧＬ１の部位情報と共通である。したがって、入力部３０は、メタ情報Ｄ１の部位情報として、低解像度画像ＧＬ１の部位情報を識別部４４に入力してもよい。 In step S5 (an example of a first discriminator input step, an example of a second discriminator input step), the input unit 30 (an example of a first discriminator input unit, an example of a second discriminator input unit) , the meta information D1 is input to the identification unit 44 of the learning unit 40 . The meta information D1 is the part information of the high-resolution image and the slice information of the low-resolution image GL1 that are input to the identification unit 44 in step S6. The part information of the original high-resolution image GR and the virtual high-resolution image GV is common to the part information of the low-resolution image GL1. Therefore, the input unit 30 may input the part information of the low-resolution image GL1 to the identifying part 44 as the part information of the meta information D1.

ステップＳ６では、入力部３０は、高解像度画像を学習部４０の識別部４４に入力する。高解像度画像は、オリジナル高解像度画像ＧＲ又は仮想高解像度画像ＧＶである。即ち、入力部３０（第１ディスクリミネータ入力部の一例）は識別部４４にオリジナル高解像度画像ＧＲを入力する（第１ディスクリミネータ入力工程の一例）、又は入力部３０（第２ディスクリミネータ入力部の一例）は識別部４４に仮想高解像度画像ＧＶを入力する（第２ディスクリミネータ入力工程の一例）。さらに、入力部３０は、識別部４４に低解像度画像ＧＬ１を入力する（第１ディスクリミネータ入力部の一例、第２ディスクリミネータ入力部の一例）。 In step S6 , the input unit 30 inputs the high-resolution image to the identification unit 44 of the learning unit 40 . The high resolution image is the original high resolution image GR or the virtual high resolution image GV. That is, the input unit 30 (an example of a first discriminator input unit) inputs the original high-resolution image GR to the identification unit 44 (an example of a first discriminator input step), or the input unit 30 (an example of a second discriminator input step). An example of a discriminator input unit) inputs the virtual high-resolution image GV to the identification unit 44 (an example of a second discriminator input step). Furthermore, the input unit 30 inputs the low-resolution image GL1 to the identification unit 44 (an example of the first discriminator input unit, an example of the second discriminator input unit).

ステップＳ７では、識別部４４は、入力された低解像度画像ＧＬ１及びメタ情報Ｄ１を用いて、入力された高解像度画像がオリジナル高解像度画像ＧＲ、及び仮想高解像度画像ＧＶのいずれであるかを識別する。識別部４４の識別結果は、生成部４２に入力される。これにより、生成部４２及び識別部４４が学習される。 In step S7, the identification unit 44 identifies whether the input high-resolution image is the original high-resolution image GR or the virtual high-resolution image GV using the input low-resolution image GL1 and meta information D1. do. The identification result of the identification unit 44 is input to the generation unit 42 . Thereby, the generation unit 42 and the identification unit 44 are learned.

以上の学習方法を繰り返すことで、識別部４４は、入力された高解像度画像がオリジナル高解像度画像ＧＲ、及び仮想高解像度画像ＧＶのいずれであるかをより好適に識別できるように学習する。一方、生成部４２は、オリジナル高解像度画像ＧＲに近い仮想高解像度画像ＧＶを生成するように学習する。これにより、生成部４２は、解像度が相対的に低い低解像度画像ＧＬ１から解像度が相対的に高い仮想高解像度画像ＧＶを推定することができる。 By repeating the learning method described above, the identification unit 44 learns so as to be able to more preferably identify whether the input high-resolution image is the original high-resolution image GR or the virtual high-resolution image GV. On the other hand, the generator 42 learns to generate a virtual high-resolution image GV close to the original high-resolution image GR. Thereby, the generator 42 can estimate the virtual high-resolution image GV with relatively high resolution from the low-resolution image GL1 with relatively low resolution.

図９及び図１０は、識別部４４にメタ情報Ｄ１を与えずに全身の画像を学習した生成部４２における画像生成を説明するための図である。図９は、頭部のＣＴ画像のサジタル断面画像を示している。図９に示すＦ９Ａは、生成部４２に入力された低解像度画像である。図９に示すＦ９Ｂは、生成部４２から出力された仮想高解像度画像である。また、図９に示すＦ９Ｃは、Ｆ９Ｂの正解データ（Ground truth）であり、Ｆ９Ａの元データとなったオリジナル高解像度画像である。図１０は、腰部のＣＴ画像のサジタル断面画像を示している。図１０に示すＦ１０Ａ、Ｆ１０Ｂ、及びＦ１０Ｃは、それぞれ入力された低解像度画像、出力された仮想高解像度画像、及び正解データである。
9 and 10 are diagrams for explaining image generation in the generation unit 42 that has learned the whole body image without providing the identification unit 44 with the meta information D1. FIG. 9 shows a sagittal cross-sectional image of a CT image of the head. F9A shown in FIG. 9 is the low-resolution image input to the generation unit 42. FIG. F9B shown in FIG. 9 is the virtual high-resolution image output from the generator 42. FIG. F9C shown in FIG. 9 is the ground truth of F9B and the original high-resolution image used as the original data of F9A. FIG. 10 shows a sagittal cross-sectional image of a CT image of the waist. F10A, F10B , and F10C shown in FIG. 10 are an input low-resolution image, an output virtual high-resolution image, and correct data, respectively.

生成部４２は、頭部から脚部まで部位に関係なく、低解像度画像を入力すると正解データに近い高解像度画像を出力することが理想である。しかしながら、図９及び図１０に示すように、識別部４４にメタ情報Ｄ１を与えずに学習した場合、生成部４２の生成した仮想高解像度画像は、他部位の学習の影響を受けて特徴が平均化され、不明瞭な画像となった。この原因は、入力データの多様性であると考えられる。 Ideally, the generating unit 42 outputs a high-resolution image close to the correct data when a low-resolution image is input, regardless of the body part from the head to the legs. However, as shown in FIGS. 9 and 10, when learning is performed without providing the meta-information D1 to the identifying unit 44, the virtual high-resolution image generated by the generating unit 42 is affected by learning of other parts, and the features are different. Averaging resulted in a blurred image. The reason for this is thought to be the diversity of input data.

図１１は、脚部のＣＴ画像のサジタル断面画像を示している。図１１に示すＦ１１Ａは、脚部の低解像度画像である。図１１に示すＦ１１Ｂは、脚部の高解像度画像である。図１１に示すように、脚部の画像は背景及び垂直方向の情報が多く、低解像度画像と高解像度画像との変化が比較的小さい。このため、生成部４２は、脚部の高解像度画像の生成については「ほぼ何もしない」という学習を行う。しかしながら、この学習は、頭部、胸部、及び腹部の高解像度画像の生成の学習に悪影響を与えてしまうという問題点があった。 FIG. 11 shows a sagittal section image of a CT image of the leg. F11A shown in FIG. 11 is a low-resolution image of the leg. F11B shown in FIG. 11 is a high-resolution image of the leg. As shown in FIG. 11, the leg image has a lot of background and vertical information, and the change between the low resolution image and the high resolution image is relatively small. For this reason, the generation unit 42 learns to “almost do nothing” for generation of high-resolution images of the legs. However, there is a problem that this learning adversely affects the learning for generating high-resolution images of the head, chest, and abdomen.

また、識別部４４は、画像の部位の多様性により混乱して精度が向上しないという問題点があった。 In addition, the identification unit 44 has a problem that the accuracy is not improved due to confusion due to the diversity of the parts of the image.

このような課題に対し、１つの方法として、部位別に識別（超解像度画像出力モデル）を学習及び作成することも考えられる。しかしながら、この方法では、部位別のモデル作成が必要になる、推論時のデータの部位を判断する部分が必要となる、部位をまたがったデータの場合の処理が必要、等の新たな課題がある。 As one method for solving such a problem, it is conceivable to learn and create a classification (super-resolution image output model) for each part. However, with this method, there are new problems such as the need to create models for each part, the need to determine the part of the data at the time of inference, and the need to process data that spans parts. .

図１２～図１４は、本実施形態の効果を説明するための図である。図１２は、頭部のＣＴ画像のサジタル断面画像を示している。図１２に示すＦ１２Ａは、学習された生成モデルである生成部４２に入力された低解像度画像である。図１２に示すＦ１２Ｂは、生成部４２から出力された仮想高解像度画像である。また、図１２に示すＦ１２Ｃは、Ｆ１２Ｂの正解データ（Ground truth）であり、Ｆ１２Ａの元データとなったオリジナル高解像度画像である。図１３は腹部のＣＴ画像のサジタル断面画像を示している。図１３に示すＦ１３Ａ、Ｆ１３Ｂ、及びＦ１３Ｃは、それぞれ入力された低解像度画像、出力された仮想高解像度画像、及び正解データである。図１４は腰部のＣＴ画像のサジタル断面画像を示している。図１４に示すＦ１４Ａ、Ｆ１４Ｂ、及びＦ１４Ｃは腰部の画像であり、それぞれ入力された低解像度画像、出力された仮想高解像度画像、及び正解データである。 12 to 14 are diagrams for explaining the effect of this embodiment. FIG. 12 shows a sagittal cross-sectional image of a CT image of the head. F12A shown in FIG. 12 is a low-resolution image input to the generator 42, which is a learned generative model. F12B shown in FIG. 12 is the virtual high-resolution image output from the generation unit 42 . F12C shown in FIG. 12 is ground truth data of F12B, and is an original high-resolution image used as original data of F12A. FIG. 13 shows a sagittal cross-sectional image of a CT image of the abdomen. F13A, F13B, and F13C shown in FIG. 13 are an input low-resolution image, an output virtual high-resolution image, and correct data, respectively. FIG. 14 shows a sagittal cross-sectional image of a CT image of the waist. F14A, F14B, and F14C shown in FIG. 14 are images of the waist, which are an input low-resolution image, an output virtual high-resolution image, and correct data, respectively.

図１２～図１４に示すように、本実施形態によって学習された生成部４２によれば、識別部４４にメタ情報Ｄ１を入力して学習したことで、部位やスライス間隔等によらず高精度で画像生成が可能になった。この生成部４２は、ＶＴＳ（Virtual Thin Slice）装置に適用することができる。 As shown in FIGS. 12 to 14, according to the generating unit 42 learned according to the present embodiment, by inputting the meta information D1 into the identifying unit 44 and learning, high-precision images can be obtained regardless of the site, slice interval, or the like. image generation is possible. This generator 42 can be applied to a VTS (Virtual Thin Slice) device.

本実施形態によれば、部位別の生成モデルを用意する必要がない。本実施形態では、生成部４２には低解像度画像のみが入力されるため、ネットワークアーキテクチャの大幅な修正が不要である。また、識別部４４は、特徴量として部位情報を使用して学習することができる。これにより、学習された生成モデルは、入力される低解像度画像がどの部位の画像であるかの情報が不要であり、どの部位の画像が入力されても高解像の画像を生成することができる。したがって、過去のＣＴ画像の資産を有効利用することができる。なお、この生成モデルは、複数の部位にまたがった画像が入力された場合であっても、高解像の画像の生成が可能である。 According to this embodiment, there is no need to prepare a generative model for each part. In this embodiment, since only low-resolution images are input to the generation unit 42, significant modification of the network architecture is unnecessary. Further, the identification unit 44 can learn using part information as a feature amount. As a result, the trained generative model does not need information about which part of the image the input low-resolution image is, and can generate a high-resolution image regardless of which part of the image is input. can. Therefore, assets of past CT images can be effectively used. Note that this generation model can generate a high-resolution image even when an image spanning a plurality of parts is input.

ここでは、入力部３０はメタ情報Ｄ１として部位情報及びスライス情報を識別部４４に入力したが、メタ情報Ｄ１は少なくとも部位情報を含めばよい。また、メタ情報Ｄ１に、低解像度画像のぼかしの強弱情報を含めてもよい。ぼかしの強弱情報は、「ぼかし強」、「ぼかし弱」等の分類であってもよい。 Here, the input unit 30 inputs the part information and the slice information to the identifying part 44 as the meta information D1, but the meta information D1 may include at least the part information. Further, the meta-information D1 may include information on the degree of blurring of the low-resolution image. Blur strength information may be a classification such as “strong blur” or “weak blur”.

また、本実施形態では、識別部４４にオリジナル高解像度画像ＧＲと低解像度画像ＧＬ１とのペア、又は生成部４２から入力される仮想高解像度画像ＧＶと低解像度画像ＧＬ１とのペアが入力されているが、低解像度画像ＧＬ１の入力は必須ではない。即ち、識別部４４には、オリジナル高解像度画像ＧＲ又は仮想高解像度画像ＧＶが入力されればよい。 Further, in this embodiment, a pair of the original high-resolution image GR and the low-resolution image GL1 or a pair of the virtual high-resolution image GV and the low-resolution image GL1 input from the generation unit 42 are input to the identifying unit 44. However, it is not essential to input the low-resolution image GL1. In other words, the original high resolution image GR or the virtual high resolution image GV may be input to the identification unit 44 .

なお、低解像度画像ＧＬ１を識別部４４に入力しない場合であっても、識別部４４に仮想高解像度画像ＧＶが入力される場合、メタ情報Ｄ１のスライス情報は低解像度画像ＧＬ１のスライス情報を使用する。これは、識別部４４に入力された仮想高解像度画像ＧＶがどのような条件の低解像度画像ＧＬ１を元に生成されたのかを考慮するためである。 Even if the low-resolution image GL1 is not input to the identification unit 44, if the virtual high-resolution image GV is input to the identification unit 44, the slice information of the low-resolution image GL1 is used as the slice information of the meta information D1. do. This is to consider under what conditions the virtual high-resolution image GV input to the identification unit 44 was generated based on the low-resolution image GL1.

識別部４４にオリジナル高解像度画像ＧＲが入力される場合は、スライス情報は意味を持たないため、スライス情報にはランダムな値を与えて学習させる。これにより、識別部４４は、「スライス間隔４ｍｍの低解像度画像から生成された偽物の可能性がある高解像度画像の真偽を判別する」、「スライス間隔８ｍｍの低解像度画像から生成された偽物の可能性がある高解像度画像の真偽を判別する」等のような条件別に識別を学習することができる。 When the original high-resolution image GR is input to the identification unit 44, the slice information is given a random value for learning because the slice information has no meaning. As a result, the identifying unit 44 determines whether a high-resolution image generated from a low-resolution image with a slice interval of 4 mm and possibly a fake is true, Discrimination can be learned according to conditions such as "determine the authenticity of a high-resolution image that has the possibility of

＜第２の実施形態＞
図１５は、第２の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。なお、図６に示す機能ブロック図と共通する部分には同一の符号を付し、その詳細な説明は省略する。<Second embodiment>
FIG. 15 is a functional block diagram showing learning processing of the medical image learning device 10 according to the second embodiment. The same reference numerals are given to the same parts as those in the functional block diagram shown in FIG. 6, and detailed description thereof will be omitted.

医療画像学習装置１０は、入力部３０に補間部３４を備えている。補間部３４は、低解像度変換部３２が生成した低解像度画像ＧＬ１から、低解像度画像ＧＬ２を生成する。補間部３４は、低解像度画像ＧＬ１のＺ方向にスプライン補間処理を施す。また、補間部３４は、スプライン補間処理が施された低解像度画像ＧＬ１にガウシアンフィルタ処理を施す。スプライン補間処理及びガウシアンフィルタ処理により生成された画像が、低解像度画像ＧＬ２である。 The medical image learning device 10 includes an interpolation section 34 in the input section 30 . The interpolation unit 34 generates a low-resolution image GL2 from the low-resolution image GL1 generated by the low-resolution conversion unit 32. The interpolation unit 34 performs spline interpolation processing on the low-resolution image GL1 in the Z direction. Further, the interpolation unit 34 performs Gaussian filter processing on the low-resolution image GL1 that has been subjected to the spline interpolation processing. An image generated by spline interpolation processing and Gaussian filtering processing is the low-resolution image GL2.

例えば、低解像度画像ＧＬ１は、スライス間隔が５ｍｍ、スライス厚が１ｍｍであり、補間部３４が生成する低解像度画像ＧＬ２は、スライス間隔が１ｍｍ、スライス厚が１ｍｍである。低解像度画像ＧＬ２のスライス間隔及びスライス厚はオリジナル高解像度画像ＧＲのスライス間隔及びスライス厚と同様であるが、低解像度画像ＧＬ２は、オリジナル高解像度画像ＧＲと比較してＺ方向にボケた画像となる。 For example, the low-resolution image GL1 has a slice interval of 5 mm and a slice thickness of 1 mm, and the low-resolution image GL2 generated by the interpolation unit 34 has a slice interval of 1 mm and a slice thickness of 1 mm. The slice interval and slice thickness of the low-resolution image GL2 are the same as those of the original high-resolution image GR, but the low-resolution image GL2 is blurred in the Z direction compared to the original high-resolution image GR. Become.

補間部３４が生成した低解像度画像ＧＬ２は、学習部４０の生成部４２に入力される。即ち、本実施形態では、生成部４２は低解像度画像ＧＬ２から仮想高解像度画像ＧＶを生成する。なお、仮想高解像度画像ＧＶのスライス間隔及びスライス厚は低解像度画像ＧＬ２のスライス間隔及びスライス厚と同様であるが、仮想高解像度画像ＧＶは、低解像度画像ＧＬ２と比較してＺ方向にシャープな画像となる。 The low-resolution image GL2 generated by the interpolating section 34 is input to the generating section 42 of the learning section 40 . That is, in this embodiment, the generator 42 generates the virtual high-resolution image GV from the low-resolution image GL2. Note that the slice interval and slice thickness of the virtual high-resolution image GV are the same as those of the low-resolution image GL2, but the virtual high-resolution image GV is sharper in the Z direction than the low-resolution image GL2. becomes an image.

また、補間部３４が生成した低解像度画像ＧＬ２は、高解像度画像とペアで識別部４４に入力される。識別部４４は、低解像度画像ＧＬ２及びメタ情報Ｄ１を用いて、入力された高解像度画像がオリジナル高解像度画像ＧＲ、及び仮想高解像度画像ＧＶのいずれであるかを識別する。なお、識別部４４には、低解像度画像ＧＬ２は入力されなくてもよい。 Also, the low-resolution image GL2 generated by the interpolation unit 34 is input to the identification unit 44 in a pair with the high-resolution image. Using the low-resolution image GL2 and the meta information D1, the identification unit 44 identifies whether the input high-resolution image is the original high-resolution image GR or the virtual high-resolution image GV. Note that the low-resolution image GL2 does not have to be input to the identification unit 44 .

本実施形態によれば、生成部４２は、オリジナル高解像度画像ＧＲから低解像度変換され、さらにスプライン補間処理された低解像度画像ＧＬ２から高解像度画像を生成する学習をすることができる。したがって、低解像度画像ＧＬ１から高解像度画像を生成する学習と合せることで、多様な入力の低解像度画像に適応した生成モデルを得ることができる。 According to the present embodiment, the generating unit 42 can learn to generate a high-resolution image from the low-resolution image GL2 obtained by low-resolution conversion from the original high-resolution image GR and spline interpolation processing. Therefore, by combining learning for generating a high-resolution image from the low-resolution image GL1, it is possible to obtain a generative model adapted to various input low-resolution images.

＜第３の実施形態＞
図１６は、第３の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。なお、図６に示す機能ブロック図と共通する部分には同一の符号を付し、その詳細な説明は省略する。本実施形態に係る医療画像学習装置１０は、高解像度画像に代えて、解像度が相対的に高い高解像情報を含む高周波成分画像を生成し、識別する。<Third Embodiment>
FIG. 16 is a functional block diagram showing learning processing of the medical image learning device 10 according to the third embodiment. The same reference numerals are assigned to the same parts as those in the functional block diagram shown in FIG. 6, and detailed description thereof will be omitted. The medical image learning apparatus 10 according to the present embodiment generates and identifies high-frequency component images containing high-resolution information with relatively high resolution instead of high-resolution images.

医療画像学習装置１０は、入力部３０に高周波成分抽出部３６を備えている。高周波成分抽出部３６は、オリジナル高解像度画像ＧＲから高周波成分を抽出し、オリジナル高周波成分画像ＣＲ（学習用高周波成分画像の一例）を生成する。高周波成分の抽出は、ハイパスフィルタを用いて行われる。オリジナル高周波成分画像ＣＲは、オリジナル高解像度画像ＧＲと同様に、スライス間隔が１ｍｍ、スライス厚が１ｍｍである。 The medical image learning device 10 includes a high-frequency component extraction section 36 in the input section 30 . The high-frequency component extraction unit 36 extracts high-frequency components from the original high-resolution image GR and generates an original high-frequency component image CR (an example of a high-frequency component image for learning). High-frequency components are extracted using a high-pass filter. Similar to the original high-resolution image GR, the original high-frequency component image CR has a slice interval of 1 mm and a slice thickness of 1 mm.

高周波成分抽出部３６が生成したオリジナル高周波成分画像ＣＲは、学習部４０の識別部４４に入力される。 The original high frequency component image CR generated by the high frequency component extraction unit 36 is input to the identification unit 44 of the learning unit 40 .

また、学習部４０の生成部４２は、入力部３０の低解像度変換部３２から入力された低解像度画像ＧＬ１から、オリジナル高周波成分画像ＣＲと同様の解像度を有する仮想高周波成分画像ＣＶを生成する。ここでは、生成部４２は、スライス間隔が１ｍｍ、スライス厚が１ｍｍの仮想高周波成分画像ＣＶを生成する。 Also, the generation unit 42 of the learning unit 40 generates a virtual high-frequency component image CV having the same resolution as the original high-frequency component image CR from the low-resolution image GL1 input from the low-resolution conversion unit 32 of the input unit 30 . Here, the generator 42 generates a virtual high-frequency component image CV with a slice interval of 1 mm and a slice thickness of 1 mm.

識別部４４には、入力部３０から入力されるオリジナル高周波成分画像ＣＲ（学習用第２画像の一例）と低解像度画像ＧＬ１とのペア、又は生成部４２（ディスクリミネータ入力部の一例）から入力される仮想高解像度画像ＧＶ（仮想第２画像の一例）と低解像度画像ＧＬ１とのペアが入力される。 In the identification unit 44, a pair of the original high-frequency component image CR (an example of the second image for learning) and the low-resolution image GL1 input from the input unit 30, or from the generation unit 42 (an example of the discriminator input unit) A pair of an input virtual high-resolution image GV (an example of a second virtual image) and a low-resolution image GL1 is input.

識別部４４は、入力部３０から入力された低解像度画像ＧＬ１及びメタ情報Ｄ１を用いて、入力された高周波成分画像がオリジナル高周波成分画像ＣＲ、及び仮想高周波成分画像ＣＶのいずれであるかを識別する。 The identification unit 44 identifies whether the input high-frequency component image is the original high-frequency component image CR or the virtual high-frequency component image CV using the low-resolution image GL1 and the meta information D1 input from the input unit 30. do.

本実施形態によれば、生成部４２は、低解像度画像ＧＬ１から高周波成分画像を生成する学習をすることができる。生成部４２は、生成部４２が生成した高周波成分画像と生成部４２の入力である低解像度画像ＧＬ１とを加算処理することで、高解像度画像を得ることができる。 According to this embodiment, the generator 42 can learn to generate a high-frequency component image from the low-resolution image GL1. The generating unit 42 can obtain a high-resolution image by adding the high-frequency component image generated by the generating unit 42 and the low-resolution image GL1 input to the generating unit 42 .

＜第４の実施形態＞
図１７は、第４の実施形態に係る医療画像学習装置１０の学習処理を示す機能ブロック図である。なお、図１５及び図１６に示す機能ブロック図と共通する部分には同一の符号を付し、その詳細な説明は省略する。<Fourth Embodiment>
FIG. 17 is a functional block diagram showing learning processing of the medical image learning device 10 according to the fourth embodiment. 15 and 16 are denoted by the same reference numerals, and detailed description thereof will be omitted.

入力部３０の低解像度変換部３２は、オリジナル高解像度画像ＧＲから低解像度画像ＧＬ１を生成する。補間部３４は、低解像度画像ＧＬ１から、低解像度画像ＧＬ２を生成する。 A low-resolution conversion unit 32 of the input unit 30 generates a low-resolution image GL1 from the original high-resolution image GR. The interpolation unit 34 generates a low resolution image GL2 from the low resolution image GL1.

学習部４０の生成部４２は、入力部３０の補間部３４から入力された低解像度画像ＧＬ２から、オリジナル高解像度画像ＧＲと同様の解像度を有する仮想高周波成分画像ＣＶを生成する。 The generation unit 42 of the learning unit 40 generates a virtual high-frequency component image CV having the same resolution as the original high-resolution image GR from the low-resolution image GL2 input from the interpolation unit 34 of the input unit 30 .

医療画像学習装置１０は、学習部４０に加算部４６を備えている。加算部４６は、生成部４２が生成した仮想高周波成分画像ＣＶと補間部３４が生成した低解像度画像ＧＬ２とを加算する。これにより、加算部４６は、オリジナル高解像度画像ＧＲと同様の解像度を有する仮想高解像度画像ＧＶを生成する。 The medical image learning apparatus 10 includes an addition section 46 in the learning section 40 . The adder 46 adds the virtual high-frequency component image CV generated by the generator 42 and the low-resolution image GL2 generated by the interpolator 34 . Thereby, the adder 46 generates a virtual high-resolution image GV having the same resolution as the original high-resolution image GR.

識別部４４には、入力部３０から入力されるオリジナル高解像度画像ＧＲと低解像度画像ＧＬ２ペア、又は加算部４６から入力される仮想高解像度画像ＧＶと低解像度画像ＧＬ２のペアが入力される。 The identifying unit 44 receives an original high-resolution image GR and low-resolution image GL2 pair input from the input unit 30 or a virtual high-resolution image GV and low-resolution image GL2 pair input from the adding unit 46 .

識別部４４は、低解像度画像ＧＬ１及びメタ情報Ｄ１を用いて、入力された高解像度画像がオリジナル高解像度画像ＧＲ、及び仮想高解像度画像ＧＶのいずれであるかを識別する。識別部４４の識別結果は、生成部４２に入力される。これにより、生成部４２及び識別部４４が学習される。 Using the low-resolution image GL1 and the meta information D1, the identification unit 44 identifies whether the input high-resolution image is the original high-resolution image GR or the virtual high-resolution image GV. The identification result of the identification unit 44 is input to the generation unit 42 . Thereby, the generation unit 42 and the identification unit 44 are learned.

本実施形態によれば、生成部４２は、低解像度画像ＧＬ１から高周波成分画像を生成する学習をすることができる。また、識別部４４は、高解像度画像の識別の学習をすることができる。 According to this embodiment, the generator 42 can learn to generate a high-frequency component image from the low-resolution image GL1. Further, the identification unit 44 can learn to identify high-resolution images.

〔その他〕
ここでは、医療画像学習装置１０は、１つのパーソナルコンピュータ又はワークステーションによって構成される例を説明したが、複数のパーソナルコンピュータ等で構成してもよい。例えば、入力部３０と学習部４０とを異なるパーソナルコンピュータ等で構成してもよい。このように構成することで、学習データの生成と生成モデルの学習とを物理的にも時間的にも互いに束縛されることなく実施することができる。〔others〕
Although the medical image learning apparatus 10 is configured by one personal computer or workstation here, it may be configured by a plurality of personal computers or the like. For example, the input unit 30 and the learning unit 40 may be configured by different personal computers or the like. With this configuration, the generation of learning data and the learning of the generative model can be performed without being physically or temporally bound to each other.

ここではＣＴ画像の超解像の生成モデルの学習方法を説明したが、本実施形態に係る生成モデルの学習方法は、部位情報を有する３次元断層画像に適用することができる。例えば、ＭＲＩ（Magnetic Resonance Imaging）装置により取得されるＭＲ画像、ＰＥＴ（Positron Emission Tomography）装置により取得されるＰＥＴ画像、ＯＣＴ（Optical Coherence Tomography）装置により取得されるＯＣＴ画像、３次元超音波撮影装置により取得される３次元超音波画像等であってもよい。 Although the method of learning a super-resolution generative model for CT images has been described here, the method of learning a generative model according to this embodiment can be applied to three-dimensional tomographic images having part information. For example, MR images acquired by MRI (Magnetic Resonance Imaging) devices, PET images acquired by PET (Positron Emission Tomography) devices, OCT images acquired by OCT (Optical Coherence Tomography) devices, and three-dimensional ultrasound imaging devices. It may be a three-dimensional ultrasound image or the like acquired by.

また、本実施形態に係る生成モデルの学習方法は、部位情報を有する２次元画像に適用することができる。例えば、Ｘ線画像であってもよい。また、医療画像に限定されず、通常のカメラ画像に適用することができる。２次元画像に適用する場合は、スライス情報のスライス間隔に代えて、シャープネス（再構成関数）情報を使用すればよい。また、スライス情報のスライス厚に代えて、ピクセルスペーシング情報、即ち画素の大きさの情報を使用すればよい。 In addition, the generative model learning method according to the present embodiment can be applied to a two-dimensional image having part information. For example, it may be an X-ray image. Moreover, it is not limited to medical images, and can be applied to normal camera images. When applied to a two-dimensional image, sharpness (reconstruction function) information may be used in place of the slice interval of the slice information. Pixel spacing information, that is, pixel size information may be used instead of the slice thickness of the slice information.

ここまで説明した実施形態において、医療画像学習装置１０の各種の処理を実行する処理部（processing unit）のハードウェア的な構造は、次に示すような各種のプロセッサ（processor）である。各種のプロセッサには、ソフトウェア（プログラム）を実行して各種の処理部として機能する汎用的なプロセッサであるＣＰＵ（Central Processing Unit）、画像処理に特化したプロセッサであるＧＰＵ（Graphics Processing Unit）、ＦＰＧＡ（Field Programmable Gate Array）等の製造後に回路構成を変更可能なプロセッサであるプログラマブルロジックデバイス（Programmable Logic Device：ＰＬＤ）、ＡＳＩＣ（Application Specific Integrated Circuit）等の特定の処理を実行させるために専用に設計された回路構成を有するプロセッサである専用電気回路等が含まれる。 In the embodiments described so far, the hardware structure of the processing unit that executes various processes of the medical image learning apparatus 10 is various processors as shown below. Various processors include a CPU (Central Processing Unit), which is a general-purpose processor that executes software (programs) and functions as various processing units, a GPU (Graphics Processing Unit), which is a processor specialized for image processing, Programmable Logic Device (PLD), which is a processor whose circuit configuration can be changed after manufacturing such as FPGA (Field Programmable Gate Array), ASIC (Application Specific Integrated Circuit), etc. Dedicated to execute specific processing It includes a dedicated electrical circuit, which is a processor with designed circuitry, and the like.

１つの処理部は、これら各種のプロセッサのうちの１つで構成されていてもよいし、同種又は異種の２つ以上のプロセッサ（例えば、複数のＦＰＧＡ、或いはＣＰＵとＦＰＧＡの組み合わせ、又はＣＰＵとＧＰＵの組み合わせ）で構成されてもよい。また、複数の処理部を１つのプロセッサで構成してもよい。複数の処理部を１つのプロセッサで構成する例としては、第１に、サーバ及びクライアント等のコンピュータに代表されるように、１つ以上のＣＰＵとソフトウェアの組合せで１つのプロセッサを構成し、このプロセッサが複数の処理部として機能する形態がある。第２に、システムオンチップ（System On Chip：ＳｏＣ）等に代表されるように、複数の処理部を含むシステム全体の機能を１つのＩＣ（Integrated Circuit）チップで実現するプロセッサを使用する形態がある。このように、各種の処理部は、ハードウェア的な構造として、各種のプロセッサを１つ以上用いて構成される。 One processing unit may be composed of one of these various processors, or two or more processors of the same or different type (for example, a plurality of FPGAs, a combination of CPU and FPGA, or a combination of CPU and GPU). Also, a plurality of processing units may be configured by one processor. As an example of configuring a plurality of processing units with a single processor, first, as represented by computers such as servers and clients, a single processor is configured with a combination of one or more CPUs and software. There is a form in which a processor functions as multiple processing units. Secondly, as typified by System On Chip (SoC), etc., there is a form of using a processor that realizes the functions of the entire system including a plurality of processing units with a single IC (Integrated Circuit) chip. be. In this way, various processing units are configured using one or more various processors as a hardware structure.

さらに、これらの各種のプロセッサのハードウェア的な構造は、より具体的には、半導体素子等の回路素子を組み合わせた電気回路（circuitry）である。 Further, the hardware structure of these various processors is, more specifically, an electric circuit combining circuit elements such as semiconductor elements.

本発明の技術的範囲は、上記の実施形態に記載の範囲には限定されない。各実施形態における構成等は、本発明の趣旨を逸脱しない範囲で、各実施形態間で適宜組み合わせることができる。 The technical scope of the present invention is not limited to the scope described in the above embodiments. Configurations and the like in each embodiment can be appropriately combined between each embodiment without departing from the gist of the present invention.

１０…医療画像学習装置
１２…通信部
１４…画像データベース
１６…操作部
１８…ＣＰＵ
２０…ＲＡＭ
２２…ＲＯＭ
２４…表示部
３０…入力部
３２…低解像度変換部
３４…補間部
３６…高周波成分抽出部
４０…学習部
４２…生成部
４４…識別部
４６…加算部
ＣＲ…オリジナル高周波成分画像
ＣＶ…仮想高周波成分画像
Ｄ１…メタ情報
Ｇ１，Ｇ２，Ｇ３，Ｇ４…ＣＴ画像
Ｇ１１，Ｇ１２，Ｇ１３…スライス画像
Ｇ２１，Ｇ２２，Ｇ２３…スライス画像
Ｇ３１，Ｇ３２，Ｇ３３…スライス画像
Ｇ４１，Ｇ４２，Ｇ４３…スライス画像
ＧＬ１，ＧＬ２…低解像度画像
ＧＲ…オリジナル高解像度画像
ＧＶ…仮想高解像度画像
Ｓ１～Ｓ７医療画像学習方法の各ステップDESCRIPTION OF SYMBOLS 10... Medical image learning apparatus 12... Communication part 14... Image database 16... Operation part 18... CPU
20 RAM
22 ROM
24 Display unit 30 Input unit 32 Low resolution conversion unit 34 Interpolation unit 36 High frequency component extraction unit 40 Learning unit 42 Generation unit 44 Identification unit 46 Addition unit CR Original high frequency component image CV Virtual high frequency Component image D1 Meta information G1, G2, G3, G4 CT images G11, G12, G13 Slice images G21, G22, G23 Slice images G31, G32, G33 Slice images G41, G42, G43 Slice image GL1, GL2: Low-resolution image GR: Original high-resolution image GV: Virtual high-resolution image S1 to S7 Each step of the medical image learning method

Claims

A generative model learning method for estimating a second image containing high-resolution information with relatively high resolution from a first image containing low-resolution information with relatively low resolution, comprising:
to the generator of a generative adversarial network comprising: a generator that is a generative model that generates a virtual second image using a first image; and a discriminator that distinguishes between a training second image and the virtual second image: a generator input step of inputting only the first image;
a first discriminator input step of inputting the second learning image and part information of the second learning image to the discriminator;
a second discriminator input step of inputting the virtual second image and part information of the virtual second image into the discriminator;
A learning method with

2. The learning method according to claim 1, wherein the part information includes the head, chest, abdomen, and legs of the human body.

The first image is a three-dimensional tomographic image,
3. The learning method according to claim 1, wherein said resolution is a resolution in a slice thickness direction.

4. The learning method according to claim 3, wherein said first discriminator input step and said second discriminator input step input slice information of said first image to said discriminator.

5. The learning method according to claim 4, wherein the slice information is a slice interval.

6. The learning method according to claim 4, wherein the slice information is slice thickness.

7. The learning method according to claim 1, wherein said first discriminator input step and said second discriminator input step input said first image into said discriminator.

A first image generating step of generating the first image from the second learning image,
7. The learning method according to any one of claims 1 to 6, wherein the generator input step inputs only the generated first image to the generator.

A program for causing a computer to execute the learning method according to any one of claims 1 to 8.

A non-transitory computer-readable recording medium that causes a computer to execute the program according to claim 9 when instructions stored in the recording medium are read by the computer.

A second image containing high-resolution information having a relatively high resolution from a first image containing low-resolution information having a relatively low resolution learned by the learning method according to any one of claims 1 to 8 A generative model for a computer to realize the function of estimating an image.

A generative model learning device for estimating a second image containing high-resolution information having a relatively high resolution from a first image containing low-resolution information having a relatively low resolution,
a generative adversarial network comprising: a generator that is a generative model that generates a virtual second image using a first image; and a discriminator that distinguishes between a training second image and the virtual second image;
a generator input unit that inputs only the first image to the generator;
a first discriminator input unit for inputting the second learning image and part information of the second learning image to the discriminator;
a second discriminator input unit for inputting the second virtual image and part information of the second virtual image to the discriminator;
A learning device with