JP7769732B2

JP7769732B2 - Image analysis method, image analysis device, and program

Info

Publication number: JP7769732B2
Application number: JP2024004675A
Authority: JP
Inventors: 陽介関口; 和巳袴田
Original assignee: Sysmex Corp
Current assignee: Sysmex Corp
Priority date: 2018-02-27
Filing date: 2024-01-16
Publication date: 2025-11-13
Anticipated expiration: 2038-02-27
Also published as: JP7458519B2; JP2024038396A; JP2023065352A; JP7217589B2; JP2019148473A

Description

本明細書には、画像解析方法、画像解析装置、プログラム、学習済み深層学習アルゴリズムの製造方法および学習済み深層学習アルゴリズムが開示される。より詳細には、解析対象の組織の画像について、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成することを含む画像解析方法、画像解析装置、プログラム、学習済み深層学習アルゴリズムの製造方法および学習済み深層学習アルゴリズム等が開示される。 This specification discloses an image analysis method, an image analysis device, a program, a method for manufacturing a trained deep learning algorithm, and a trained deep learning algorithm. More specifically, this specification discloses an image analysis method, an image analysis device, a program, a method for manufacturing a trained deep learning algorithm, a trained deep learning algorithm, etc., which include generating data indicating whether an area included in analysis data for an image of tissue to be analyzed is a tumorous area.

特許文献１には、病理組織画像において組織像を、正常、良性腫瘍、前癌状態および癌状態の４つのグループに分類し、判定する画像診断支援装置が開示されている。画像分類手段は、画像データから注視領域を抽出し、注視領域の特徴を示す特徴量を算出し、算出した特徴量に基づいてグループの分類を行う。特徴量は、細胞核における単位面積あたりの塊の密度、塊面積の密度、塊の面積、塊の太さ、および塊の長さなどである。画像判定手段は、このような特徴量と判定結果との関係を学習し、学習済みの学習パラメータに基づいて判定を行う。学習は、サポートベクターマシンなどの学習アルゴリズムを用いて、機械学習を実行する。 Patent Document 1 discloses an image diagnosis support device that classifies and assesses tissue images in pathological tissue images into four groups: normal, benign tumor, precancerous state, and cancerous state. The image classification means extracts a region of interest from the image data, calculates feature quantities that indicate the characteristics of the region of interest, and classifies the group based on the calculated feature quantities. The feature quantities include the density of clumps per unit area in the cell nucleus, the density of the clump area, the clump area, the clump thickness, and the clump length. The image assessment means learns the relationship between these feature quantities and assessment results, and makes assessments based on the learned learning parameters. Learning is performed using machine learning algorithms such as support vector machines.

特開２０１０－２０３９４９号公報JP 2010-203949 A

悪性腫瘍であるか否かを確定診断する際に、病理組織標本を用いた病理組織診断が行われる。また病理組織診断は、手術中に悪性腫瘍を含む組織の切除部位を決定するための術中迅速診断として行われることも少なくない。術中迅速診断は、手術中に患者の患部を切開した状態で待機させ、腫瘍が悪性であるか、切除した組織の断端に腫瘍が残っていないか、リンパ節転移があるか等の判断を病理組織診断により行うものである。術中迅速診断の結果により、待機している患者のその後の手術の方向性が決められる。 When making a definitive diagnosis of whether a tumor is malignant, a histopathological diagnosis is performed using a pathological tissue specimen. Histopathological diagnosis is also often performed as intraoperative rapid diagnosis to determine the area of tissue containing a malignant tumor to be removed during surgery. Intraoperative rapid diagnosis involves having the patient wait with the affected area incised during surgery, and using histopathological diagnosis to determine whether the tumor is malignant, whether any tumor remains at the margins of the resected tissue, whether there is lymph node metastasis, and so on. The results of the intraoperative rapid diagnosis determine the direction of subsequent surgery for the patient.

病理組織診断は、医師、特に病理医が顕微鏡等により組織標本を観察して診断するが、組織標本の観察により正確な確定診断を行えるようになるためには、長い期間、熟練した病理専門医の元で、様々な症例の組織標本の観察を繰り返す必要があり、病理医の育成にも膨大な時間を要している。 Pathological tissue diagnoses are made by doctors, particularly pathologists, who observe tissue specimens using a microscope or other device. However, to be able to make an accurate, definitive diagnosis through the observation of tissue specimens, a long period of time is required for repeated observation of tissue specimens from various cases under the supervision of an experienced pathologist, and the training of pathologists also takes an enormous amount of time.

病理医の不足は深刻であり、病理医が不足している結果、患者の悪性腫瘍の確定診断が遅れ、治療開始が遅れる、あるいは確定診断を待たずに治療を開始するという状態も危惧されている。また、通常の組織診断と術中迅速診断との両方が、少ない病理医に集中するために、一人の病理医の業務量が膨大となり、病理医自身の労務状態も問題となっている。しかし、現在のところこの問題の解消策は見つかっていない。 The shortage of pathologists is serious, and as a result, there are concerns that a definitive diagnosis of a patient's malignant tumor will be delayed, leading to delayed treatment, or that treatment will begin without waiting for a definitive diagnosis. Furthermore, because both regular tissue diagnosis and rapid intraoperative diagnosis are concentrated among a small number of pathologists, the workload for each pathologist is enormous, and the working conditions of the pathologists themselves are also becoming an issue. However, no solution to this problem has yet been found.

したがって、病理組織診断を装置がサポートできるようになることは、特にその診断が人の目による判断に近いほど、病理医不足の解消、および病理医の労務状態の改善に大きく貢献すると考えられる。 Therefore, the ability of devices to support pathological tissue diagnosis, especially to the extent that the diagnosis is close to that made by the human eye, is expected to make a significant contribution to resolving the shortage of pathologists and improving their working conditions.

装置が病理組織診断をサポートするという点において、上述の特許文献１に記載の発明では、機械学習による画像解析に基づいて標本組織の病理判定を行っている。この方法では、特徴量を人の手で作成する必要がある。特徴量を人の手で作成する方法には、その人の力量が画像解析の性能に大きく影響を与えるという問題がある。さらに、特許文献１に記載の発明は、顕微鏡で強拡大された組織標本の画像内の細胞核１つ１つに対して癌細胞の核であるか否かの判定を行う。したがって、標本１枚の全体について解析しようとすると長い解析時間を要するという問題がある。 In terms of devices that support pathological tissue diagnosis, the invention described in the aforementioned Patent Document 1 performs pathological judgment of specimen tissue based on image analysis using machine learning. This method requires that features be created manually. The problem with manually creating features is that the skill of the person performing the manual creation of features has a significant impact on the performance of the image analysis. Furthermore, the invention described in Patent Document 1 determines whether each cell nucleus in an image of a tissue specimen magnified under a microscope is a cancer cell nucleus. Therefore, there is a problem in that analyzing the entire specimen requires a long analysis time.

したがって、個々の細胞について癌細胞の核であるか否かを病理医や機械学習アルゴリズムによって判定する前に、１枚の標本に含まれる組織全体を観察し、その標本のどの部位に腫瘍組織が含まれるかをスクリーニングすることは、１枚の標本に対する病理診断にかかる時間を短縮する一助となる。 Therefore, before a pathologist or machine learning algorithm determines whether an individual cell is a cancer cell nucleus, observing the entire tissue contained in a single specimen and screening which parts of the specimen contain tumor tissue can help shorten the time required for a pathological diagnosis of a single specimen.

本発明は、解析対象の組織が解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成する画像解析方法、画像解析装置、およびプログラムを提供することを課題とする。 The present invention aims to provide an image analysis method, image analysis device, and program that generate data indicating whether an area of the tissue to be analyzed, included in the analysis data, is a tumor area.

上記の課題を解決するために、本発明の一態様に係る画像解析方法は、ニューラルネットワーク構造の深層学習アルゴリズム（６０）を用いて、組織の画像を解析する画像解析方法であって、組織の種類に応じて生成された複数の深層学習アルゴリズム（６０）から、解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）を選択し、前記解析対象の組織を含む解析対象画像（７８）から解析用データ（８０）を生成し、前記解析用データ（８０）を、選択された前記深層学習アルゴリズム（６０）に入力し、前記解析用データ（８０）が入力された前記深層学習アルゴリズム（６０）から出力される解析結果に基づき、腫瘍領域とそれ以外の領域とを識別可能に前記解析対象の組織の画像を表示する、ことを含む。 In order to solve the above-mentioned problems, one embodiment of the image analysis method of the present invention is an image analysis method that analyzes a tissue image using a deep learning algorithm (60) with a neural network structure, and includes selecting a deep learning algorithm (60) corresponding to the type of tissue to be analyzed from a plurality of deep learning algorithms (60) generated according to the type of tissue, generating analysis data (80) from an analysis target image (78) including the tissue to be analyzed, inputting the analysis data (80) into the selected deep learning algorithm (60), and displaying an image of the tissue to be analyzed in a manner that allows differentiation between tumor regions and other regions based on analysis results output from the deep learning algorithm (60) to which the analysis data (80) has been input.

データベース（１０５）に前記複数の深層学習アルゴリズム（６０）が記憶されており、前記解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）が、前記データベース（１０５）に記憶された前記複数の深層学習アルゴリズム（６０）から選択されてもよい。 The plurality of deep learning algorithms (60) may be stored in a database (105), and the deep learning algorithm (60) corresponding to the type of tissue to be analyzed may be selected from the plurality of deep learning algorithms (60) stored in the database (105).

前記複数の深層学習アルゴリズム（６０）のそれぞれは、組織の種類と対応付けられて前記データベース（１０５）に記憶されていてもよい。 Each of the plurality of deep learning algorithms (60) may be associated with a tissue type and stored in the database (105).

前記複数の深層学習アルゴリズム（６０）の少なくとも１つは、１つの組織の種類と対応付けられて前記データベース（１０５）に記憶されていてもよい。 At least one of the plurality of deep learning algorithms (60) may be stored in the database (105) in association with one tissue type.

ユーザによる入力部（２６）を介した操作に基づいて、前記解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）が選択されてもよい。 The deep learning algorithm (60) corresponding to the type of tissue to be analyzed may be selected based on user operation via the input unit (26).

前記画像解析方法は、前記深層学習アルゴリズム（６０）から出力される解析結果に基づき、腫瘍領域とそれ以外の領域とを識別可能に表示するための描画処理を前記解析対象画像（７８）に施すことをさらに含んでもよい。 The image analysis method may further include performing a rendering process on the analysis target image (78) to distinguish between tumor areas and other areas based on the analysis results output from the deep learning algorithm (60).

前記描画処理は、前記腫瘍領域とそれ以外の領域とを異なる色で描画する処理を含んでもよい。 The drawing process may include drawing the tumor area and other areas in different colors.

前記解析対象画像（７８）が、組織診断用標本の画像であり、前記解析対象画像（７８）が２以上の原色を組み合わせた色相を含んでもよい。 The image to be analyzed (78) may be an image of a tissue diagnostic specimen, and the image to be analyzed (78) may include hues that combine two or more primary colors.

前記解析対象画像（７８）が、前記解析対象の組織を３倍から２０倍に拡大して撮像された画像であってもよい。 The image (78) to be analyzed may be an image of the tissue to be analyzed captured at a magnification of 3 to 20 times.

前記解析用データ（８０）は、前記解析対象画像から切り出された画像に対応し、前記解析対象画像（７８）から複数生成されてもよい。 The analysis data (80) corresponds to an image cut out from the analysis target image, and multiple pieces of the analysis data (80) may be generated from the analysis target image (78).

前記解析対象画像（７８）が、組織診断用標本の画像であり、前記組織診断用標本が染色された標本であり、前記解析対象画像（７８）は、前記染色された標本を顕微鏡の明視野下で撮像した画像であってもよい。 The image to be analyzed (78) may be an image of a tissue diagnostic specimen, which may be a stained specimen, and the image to be analyzed (78) may be an image of the stained specimen captured under a bright field microscope.

前記深層学習アルゴリズム（６０）の学習に用いられる訓練データが、個体から採取された腫瘍領域を含む組織の標本に対して明視野観察用染色を施して作製された標本の染色像を顕微鏡の明視野下で撮像した明視野画像に基づいて生成されていてもよい。 The training data used to train the deep learning algorithm (60) may be generated based on bright-field images of a tissue specimen containing a tumor area taken from an individual, stained for bright-field observation, and then captured under a bright-field microscope.

前記ニューラルネットワーク（５０）の出力層（５０ｂ）がソフトマックス関数を活性化関数とするノードであってもよい。 The output layer (50b) of the neural network (50) may be a node that uses a softmax function as an activation function.

上記の課題を解決するために、本発明の一態様に係る画像解析装置（２００）は、ニューラルネットワーク構造の深層学習アルゴリズム（６０）を用いて、組織の画像を解析する画像解析装置であって、表示部（２７）と、組織の種類に応じて生成された複数の深層学習アルゴリズム（６０）から、解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）を選択し、前記解析対象の組織を含む解析対象画像（７８）から解析用データ（８０）を生成し、前記解析用データ（８０）を、選択された前記深層学習アルゴリズム（６０）に入力し、前記解析用データ（８０）が入力された前記深層学習アルゴリズム（６０）から出力される解析結果に基づき、腫瘍領域とそれ以外の領域とを識別可能に前記解析対象の組織の画像を前記表示部（２７）に表示する、制御部（２０）を備える。 In order to solve the above problems, an image analysis device (200) according to one embodiment of the present invention is an image analysis device that analyzes an image of tissue using a deep learning algorithm (60) with a neural network structure, and includes a display unit (27) and a control unit (20) that selects a deep learning algorithm (60) corresponding to the type of tissue to be analyzed from a plurality of deep learning algorithms (60) generated according to the type of tissue, generates analysis data (80) from an analysis target image (78) containing the tissue to be analyzed, inputs the analysis data (80) to the selected deep learning algorithm (60), and displays an image of the tissue to be analyzed on the display unit (27) so that tumor areas and other areas can be distinguished based on the analysis results output from the deep learning algorithm (60) to which the analysis data (80) has been input.

前記画像解析装置（２００）は、前記複数の深層学習アルゴリズム（６０）を記憶する記憶部（２３）をさらに備え、前記制御部（２０）は、前記解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）を、前記記憶部（２３）に記憶された前記複数の深層学習アルゴリズム（６０）から選択してもよい。 The image analysis device (200) may further include a memory unit (23) that stores the plurality of deep learning algorithms (60), and the control unit (20) may select the deep learning algorithm (60) corresponding to the type of tissue to be analyzed from the plurality of deep learning algorithms (60) stored in the memory unit (23).

前記記憶部（２３）は、前記複数の深層学習アルゴリズム（６０）のそれぞれを、組織の種類と対応付けて記憶してもよい。 The memory unit (23) may store each of the plurality of deep learning algorithms (60) in association with a tissue type.

前記記憶部（２３）は、前記複数の深層学習アルゴリズム（６０）の少なくとも１つを、１つの組織の種類と対応付けて記憶してもよい。 The memory unit (23) may store at least one of the plurality of deep learning algorithms (60) in association with one tissue type.

前記画像解析装置（２００）は、ユーザにより操作される入力部（２６）をさらに備え、前記制御部（２０）は、前記入力部（２６）を介した操作に基づいて、前記解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）を選択してもよい。 The image analysis device (200) may further include an input unit (26) operated by a user, and the control unit (20) may select the deep learning algorithm (60) corresponding to the type of tissue to be analyzed based on operation via the input unit (26).

前記制御部（２０）は、生成された前記解析用データに基づき、腫瘍領域とそれ以外の領域とを識別可能に表示するための描画処理を前記解析対象画像（７８）に施してもよい。 The control unit (20) may perform a drawing process on the analysis target image (78) based on the generated analysis data to display the tumor area and other areas in a distinguishable manner.

上記の課題を解決するために、本発明の一態様に係るコンピュータプログラムは、ニューラルネットワーク構造の深層学習アルゴリズム（６０）を用いて、組織の画像を解析するコンピュータプログラムであって、コンピュータに、組織の種類に応じて生成された複数の深層学習アルゴリズム（６０）から、解析対象の組織の種類に対応する前記深層学習アルゴリズム（６０）を選択する処理と、前記解析対象の組織を含む解析対象画像（７８）から解析用データ（８０）を生成する処理と、前記解析用データ（８０）を、選択された前記深層学習アルゴリズム（６０）に入力する処理と、前記解析用データ（８０）が入力された前記深層学習アルゴリズム（６０）から出力される解析結果に基づき、腫瘍領域とそれ以外の領域とを識別可能に前記解析対象の組織の画像を表示する処理と、を実行させる。 In order to solve the above problem, one embodiment of the present invention provides a computer program that analyzes tissue images using a deep learning algorithm (60) with a neural network structure, and causes a computer to execute the following processes: selecting a deep learning algorithm (60) corresponding to the type of tissue to be analyzed from multiple deep learning algorithms (60) generated according to the type of tissue; generating analysis data (80) from an analysis target image (78) containing the tissue to be analyzed; inputting the analysis data (80) into the selected deep learning algorithm (60); and displaying an image of the tissue to be analyzed in a manner that allows differentiation between tumor regions and other regions based on the analysis results output from the deep learning algorithm (60) to which the analysis data (80) has been input.

標本に含まれる組織全体を観察し、その標本のどの部位に腫瘍組織が含まれるかスクリーニングするためのデータを生成することができる。 It is possible to observe the entire tissue contained in a specimen and generate data to screen which parts of the specimen contain tumor tissue.

深層学習方法の概要を説明するための模式図である。FIG. 1 is a schematic diagram for explaining an overview of a deep learning method. 深層学習方法の概要を説明するための模式図である。FIG. 1 is a schematic diagram for explaining an overview of a deep learning method. 深層学習方法の概要を説明するための模式図である。FIG. 1 is a schematic diagram for explaining an overview of a deep learning method. 訓練データの詳細を説明するための模式図である。FIG. 10 is a schematic diagram for explaining details of training data. 訓練データの詳細を説明するための模式図である。FIG. 10 is a schematic diagram for explaining details of training data. 画像解析方法の概要を説明するための模式図である。FIG. 1 is a schematic diagram for explaining an outline of an image analysis method. 第１の実施形態に係る画像解析システムの概略構成図である。1 is a schematic configuration diagram of an image analysis system according to a first embodiment. ベンダ側装置１００のハードウェア構成を示すブロック図である。FIG. 2 is a block diagram showing the hardware configuration of a vendor-side device 100. ユーザ側装置２００のハードウェア構成を示すブロック図である。FIG. 2 is a block diagram showing the hardware configuration of a user device 200. 第１の実施形態に係る深層学習装置１００Ａの機能を説明するためのブロック図である。FIG. 2 is a block diagram for explaining the functions of the deep learning device 100A according to the first embodiment. 深層学習処理の手順を示すフローチャートである。1 is a flowchart showing the procedure of deep learning processing. ニューラルネットワークによる学習の詳細を説明するための模式図である。FIG. 10 is a schematic diagram for explaining details of learning by a neural network. 第１の実施形態に係る画像解析装置２００Ａの機能を説明するためのブロック図である。FIG. 2 is a block diagram for explaining functions of an image analysis device 200A according to the first embodiment. 画像解析処理の手順を示すフローチャートである。10 is a flowchart showing the procedure of an image analysis process. 第２の実施形態に係る画像解析システムの概略構成図である。FIG. 10 is a schematic configuration diagram of an image analysis system according to a second embodiment. 第２の実施形態に係る統合型の画像解析装置２００Ｂの機能を説明するためのブロック図である。FIG. 10 is a block diagram for explaining functions of an integrated image analyzing device 200B according to a second embodiment. 第３の実施形態に係る画像解析システムの概略構成図である。FIG. 10 is a schematic configuration diagram of an image analysis system according to a third embodiment. 第３の実施形態に係る統合型の画像解析装置１００Ｂの機能を説明するためのブロック図である。FIG. 10 is a block diagram for explaining functions of an integrated image analyzing device 100B according to a third embodiment. 胃がん組織から取得された組織標本の解析結果である。（ａ）は、胃がん組織をＨＥ染色して撮像した明視野画像のホールスライドイメージであり、解析対象の画像である。（ｂ）は、ホールスライドイメージの腫瘍領域を病理医が指定し実線で囲った像である。（ｃ）は、拡大倍率１倍の訓練用画像と解析用画像を用いた解析結果を示す。（ｄ）は、拡大倍率５倍の訓練用画像と解析用画像を用いた解析結果を示す。（ｅ）は、拡大倍率４０倍の訓練用画像と解析用画像を用いた解析結果を示す。図１９（ｃ）から（ｅ）において解析処理により得られた腫瘍領域を白で表示し、非腫瘍領域をグレーで示し、非組織領域を黒で示す。19 shows the analysis results of a tissue specimen obtained from gastric cancer tissue. (a) is a whole slide image of a bright-field image of gastric cancer tissue stained with HE, and is the image to be analyzed. (b) is an image of the whole slide image in which the pathologist has designated the tumor region and enclosed it with a solid line. (c) shows the analysis results using training images and analysis images magnified by 1x. (d) shows the analysis results using training images and analysis images magnified by 5x. (e) shows the analysis results using training images and analysis images magnified by 40x. In Figures 19(c) to (e), the tumor region obtained by the analysis process is displayed in white, the non-tumor region is displayed in gray, and the non-tissue region is displayed in black. 胃がん組織から取得された組織標本の解析結果である。（ａ）は、胃がん組織をＨＥ染色して撮像した明視野画像のホールスライドイメージであり、解析対象の画像である。（ｂ）は、ホールスライドイメージの腫瘍領域を病理医が指定し実線で囲った像である。（ｃ）は、拡大倍率５倍の訓練用画像と解析用画像を用いた解析結果を示す。（ｄ）は、拡大倍率１６倍の訓練用画像と解析用画像を用いた解析結果を示す。図１９（ｃ）、（ｄ）において解析処理により得られた腫瘍領域を白で表示し、非腫瘍領域をグレーで示し、非組織領域を黒で示す。19 shows the analysis results of a tissue specimen obtained from gastric cancer tissue. (a) is a whole slide image of a bright-field image of gastric cancer tissue stained with HE, and is the image to be analyzed. (b) is an image of the whole slide image in which the tumor region is designated by a pathologist and surrounded by a solid line. (c) shows the analysis results using training images and analysis images magnified 5x. (d) shows the analysis results using training images and analysis images magnified 16x. In Figures 19(c) and (d), the tumor region obtained by the analysis process is displayed in white, the non-tumor region is displayed in gray, and the non-tissue region is displayed in black. 組織標本の拡大倍率とウィンドウサイズの画素数による判別精度の違いを示す図である。（ａ）は各拡大倍率の感度、（ｂ）は、各拡大倍率の陽性的中率を示す。（ｃ）は、ウィンドウサイズの画素数の違いによる感度、（ｄ）は陽性的中率の変化を示す。1 shows the difference in discrimination accuracy depending on the magnification of the tissue specimen and the number of pixels in the window size. (a) shows the sensitivity at each magnification, (b) shows the positive predictive value at each magnification, (c) shows the change in sensitivity due to differences in the number of pixels in the window size, and (d) shows the change in the positive predictive value.

以下、本発明の概要および実施の形態を、添付の図面を参照して詳細に説明する。なお、以下の説明および図面において、同じ符号は同じまたは類似の構成要素を示すこととし、よって、同じまたは類似の構成要素に関する説明を省略する。 The following provides a detailed description of the overview and embodiments of the present invention, with reference to the accompanying drawings. Note that in the following description and drawings, the same reference numerals will indicate the same or similar components, and therefore, descriptions of the same or similar components will be omitted.

画像解析方法は、組織の画像を解析する。前記画像解析方法は、ニューラルネットワーク構造の、好ましくは、畳み込みニューラルネットワーク構造の深層学習アルゴリズムを用いる。前記画像解析方法は、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成することを含む。 The image analysis method analyzes an image of tissue. The image analysis method uses a deep learning algorithm with a neural network structure, preferably a convolutional neural network structure. The image analysis method includes generating data indicating whether an area included in the analysis data is a tumor area.

本発明において、組織または細胞の画像は、組織試料の標本または細胞を含む試料の標本から取得される画像である。組織試料の標本または細胞を含む試料の標本は、個体から採取される。個体は、特に制限されないが、好ましくは哺乳類であり、より好ましくはヒトである。前記個体から試料が採取される際に、個体が生きているか死亡しているかは問わない。前記組織は、個体内に存在するものである限り、制限されない。前記個体から採取された組織としては、外科的な切除組織、生検組織等を挙げることができる。前記腫瘍は上皮性および非上皮性のいずれであってもよい。前記腫瘍として好ましくは悪性上皮性腫瘍である。悪性腫瘍は、特に制限されないが、悪性腫瘍としては、例えば、気管、気管支又は肺等から発生する呼吸器系悪性腫瘍；上咽頭、食道、胃、十二指腸、空腸、回腸、盲腸、虫垂、上行結腸、横行結腸、Ｓ状結腸、直腸又は肛門部等から発生する消化管系悪性腫瘍；肝臓癌；膵臓癌；膀胱、尿管又は腎臓から発生する泌尿器系悪性腫瘍；卵巣、卵管および子宮等から発生する女性生殖器系悪性腫瘍；乳癌；前立腺癌；皮膚癌；視床下部、下垂体、甲状腺、副甲状腺、副腎等の内分泌系悪性腫瘍；中枢神経系悪性腫瘍；骨軟部組織から発生する悪性腫瘍等の固形腫瘍が挙げられる。より好ましくは、肺癌（扁平上皮癌、小細胞癌、大細胞癌、腺癌）等の呼吸器系上皮性悪性腫瘍；胃癌、十二指腸癌、大腸癌（Ｓ状結腸癌、直腸癌等）等の消化管系上皮性悪性腫瘍；肝臓癌；膵臓癌；膀胱癌；甲状腺癌；卵巣癌；乳癌；前立腺癌を挙げることができる。最も好ましくは、胃癌である。 In the present invention, a tissue or cell image is an image obtained from a tissue sample specimen or a cell-containing sample specimen. The tissue sample specimen or cell-containing sample specimen is collected from an individual. The individual is not particularly limited, but is preferably a mammal, more preferably a human. The individual may be alive or dead when the sample is collected from the individual. There are no limitations on the tissue, as long as it is present within the individual. Examples of tissue collected from the individual include surgically resected tissue and biopsy tissue. The tumor may be either epithelial or non-epithelial. The tumor is preferably a malignant epithelial tumor. The malignant tumor is not particularly limited, but examples of the malignant tumor include respiratory system malignant tumors arising from the trachea, bronchi, lungs, etc.; digestive system malignant tumors arising from the nasopharynx, esophagus, stomach, duodenum, jejunum, ileum, cecum, appendix, ascending colon, transverse colon, sigmoid colon, rectum, anus, etc.; liver cancer; pancreatic cancer; urinary system malignant tumors arising from the bladder, ureter, or kidney; female reproductive system malignant tumors arising from the ovaries, fallopian tubes, uterus, etc.; breast cancer; prostate cancer; skin cancer; endocrine system malignant tumors such as the hypothalamus, pituitary gland, thyroid gland, parathyroid gland, and adrenal gland; central nervous system malignant tumors; and solid tumors such as malignant tumors arising from bone and soft tissue. More preferred are respiratory epithelial malignant tumors such as lung cancer (squamous cell carcinoma, small cell carcinoma, large cell carcinoma, adenocarcinoma); digestive epithelial malignant tumors such as gastric cancer, duodenal cancer, and colon cancer (sigmoid colon cancer, rectal cancer, etc.); liver cancer; pancreatic cancer; bladder cancer; thyroid cancer; ovarian cancer; breast cancer; and prostate cancer. Gastric cancer is the most preferred.

前記標本は、前記組織を顕微鏡等で観察ができるように加工した状態のもの、例えばプレパラートを意図する。前記標本は、公知の方法に従って作製することができる。例えば、組織標本の場合には、前記個体から組織を採取した後に、所定の固定液（ホルマリン固定等）で組織を固定し、その固定組織をパラフィン包埋し、パラフィン包埋組織を薄切する。薄切切片をスライドグラスにのせる。切片がのったスライドグラスに対して光学顕微鏡での観察のため、すなわち明視野観察のための染色を施し、所定の封入処理をして標本が完成する。組織標本の典型例は、組織診断用標本（病理標本）であり、染色は、へマトキシリン・エオジン（ＨＥ）染色である。 The specimen is intended to be a state in which the tissue has been processed so that it can be observed under a microscope, for example, such as a slide. The specimen can be prepared according to known methods. For example, in the case of a tissue specimen, after collecting the tissue from the individual, the tissue is fixed in a specified fixative (formalin fixation, etc.), the fixed tissue is embedded in paraffin, and the paraffin-embedded tissue is thinly sliced. The thin slices are placed on a glass slide. The slide with the slices on it is stained for observation under an optical microscope, i.e., for brightfield observation, and the specimen is completed after a specified mounting process. A typical example of a tissue specimen is a tissue diagnostic specimen (pathological specimen), and the stain is hematoxylin-eosin (HE) staining.

ＨＥ染色の核染色はヘマトキシリンである。ヘマトキシリンは、組織細胞染色において核染色剤として幅広く使用されている（例えば、免疫染色、レクチン染色、糖染色、脂肪染色、膠原線維染色等）。このため、本発明は、このようなヘマトキシリンを核染色に使う標本全般において、適用が可能である。 Hematoxylin is the nuclear stain used in HE staining. Hematoxylin is widely used as a nuclear stain in tissue and cell staining (e.g., immunostaining, lectin staining, sugar staining, fat staining, collagen fiber staining, etc.). Therefore, the present invention can be applied to all specimens in which hematoxylin is used as a nuclear stain.

前記画像解析には、訓練用画像を用いて訓練された深層学習アルゴリズムを用いる。前記画像解析は、前記標本から取得した、解析対象の組織を含む解析対象画像から解析データを生成する。前記解析データを、前記深層学習アルゴリズムに入力し、前記解析対象画像内に存在する組織が解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成する。解析用データに含まれる領域が腫瘍領域であるか否かを示すデータとは、判別対象にある組織に含まれる複数の注目画素を含む領域が、腫瘍組織を含む可能性および／または非腫瘍領域である可能性を示すデータである。解析用データに含まれる領域が腫瘍領域であるか否かを示すデータは、腫瘍領域を含むことと非腫瘍領域であることを区別できるラベル値、表示等であり得る。 The image analysis uses a deep learning algorithm trained using training images. The image analysis generates analysis data from an analysis target image obtained from the specimen and containing the tissue to be analyzed. The analysis data is input into the deep learning algorithm, which generates data indicating whether a region of tissue present in the analysis target image included in the analysis data is a tumorous region. The data indicating whether a region included in the analysis data is a tumorous region is data indicating the possibility that a region including multiple pixels of interest contained in the tissue to be distinguished contains tumor tissue and/or is a non-tumorous region. The data indicating whether a region included in the analysis data is a tumorous region may be a label value, display, etc. that can distinguish between including a tumorous region and a non-tumorous region.

前記訓練用画像は、学習対象となる標本の複数の領域から取得される。好ましくは、前記訓練用画像は、学習対象となる組織に含まれる複数の層構造から層構造ごとに腫瘍領域を含む領域、非腫瘍領域を含む領域、および組織を含まない領域より取得される。 The training images are obtained from multiple regions of the specimen to be trained. Preferably, the training images are obtained from a region containing a tumor region, a region containing a non-tumor region, and a region containing no tissue for each of the multiple layer structures contained in the tissue to be trained.

例えば、第１の訓練用画像７０Ｒ１は、個体から採取された組織に含まれる、腫瘍領域を含む領域から取得された画像である。この画像は、顕微鏡観察により、組織構造が認識できるように染色された標本から取得される。前記染色は、組織構造が認識できる限り制限されないが、好ましくは明視野観察用の染色である。前記明視野観察用染色は、少なくとも細胞核と細胞核以外の部位が、色相で区別可能に染色できる限り制限されない。前記標本が哺乳類の組織標本である場合には、例えば、ＨＥ染色を挙げることができる。 For example, the first training image 70R1 is an image acquired from a region containing a tumor region in tissue collected from an individual. This image is acquired from a specimen stained so that the tissue structure can be recognized by microscopic observation. The staining is not limited as long as the tissue structure can be recognized, but is preferably a staining for bright-field observation. The staining for bright-field observation is not limited as long as it can stain at least cell nuclei and areas other than cell nuclei in a distinguishable hue. If the specimen is a mammalian tissue specimen, for example, HE staining can be used.

第１の訓練用画像７０Ｒ１からは、第１の訓練データ７４Ｒ１と、第１の予備訓練データ７０Ｒ１Ｌが生成される。第１の訓練データ７４Ｒ１は、第１の訓練用画像７０Ｒ１に含まれる色相を原色毎に分離した単一色画像に関する情報である。第１の予備訓練データ７０Ｒ１Ｌは、第１の訓練用画像７０Ｒ１に含まれている組織の領域が、腫瘍領域を含むことを示す二値化データ（ラベル値）として生成される。第１の訓練用画像７０Ｒ１に含まれる組織の領域が腫瘍領域を含むか否かは、例えば病理医等による標本観察により判定される。ここで、前記訓練用画像に含まれる組織の領域において、腫瘍細胞と、非腫瘍細胞と、細胞以外の組織の領域とが混在する場合がある。この場合、前記訓練用画像に含まれる組織の領域が、例えば前記訓練用画像に含まれる組織の領域全体の面積を１００％とした際に、腫瘍細胞がその全体の面積の約１０％以上、約２０％以上、約３０％以上、約４０％以上、約５０％以上、約６０％以上、約７０％以上、約８０％以上、または約９０％以上を占める場合に、腫瘍領域を含むと判定することができる。好ましくは、約４０％以上、約５０％以上、約６０％以上、約７０％以上、約８０％以上、または約９０％以上を占める場合に、腫瘍領域を含むと判定することができる。前記訓練用画像に含まれる組織の領域全体の面積（１００％）から非腫瘍細胞および細胞以外の組織（非腫瘍領域）が占める割合を減じた値を前記腫瘍細胞が組織全体の面積を占める割合として決定してもよい。 First training data 74R1 and first preliminary training data 70R1L are generated from the first training image 70R1. The first training data 74R1 is information regarding single-color images in which the hues contained in the first training image 70R1 are separated into primary colors. The first preliminary training data 70R1L is generated as binarized data (label values) indicating that the tissue region contained in the first training image 70R1 includes a tumor region. Whether the tissue region contained in the first training image 70R1 includes a tumor region is determined, for example, by specimen observation by a pathologist or the like. Here, the tissue region contained in the training image may contain a mixture of tumor cells, non-tumor cells, and tissue regions other than cells. In this case, when the area of the entire tissue region included in the training image is taken as 100%, the tissue region can be determined to contain a tumor region if tumor cells occupy at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, or at least about 90% of the total area. Preferably, the tissue region can be determined to contain a tumor region if it occupies at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, or at least about 90%. The percentage of the total tissue area occupied by tumor cells may be determined by subtracting the percentage of non-tumor cells and tissue other than cells (non-tumor region) from the total area (100%) of the tissue region included in the training image.

前記訓練用画像に含まれる第２の訓練用画像７０Ｒ２は、非腫瘍領域を含む組織の領域（好ましくは、非腫瘍細胞と細胞以外の組織の領域とを含む領域）から取得された画像である。この画像は、顕微鏡観察により、組織構造が認識できるように染色された標本から取得される。前記染色は、組織構造が認識できる限り制限されないが、好ましくは明視野観察用の染色である。前記明視野観察用染色は、少なくとも細胞核と細胞核以外の部位が、色相で区別可能に染色できる限り制限されない。前記標本が哺乳類の組織標本である場合には、例えば、ＨＥ染色を挙げることができる。前記染色は、第１の訓練用画像７０Ｒ１を取得した標本と同じものであることが好ましい。 The second training image 70R2 included in the training images is an image acquired from a tissue region including a non-tumor region (preferably a region including non-tumor cells and tissue regions other than cells). This image is acquired from a specimen stained so that the tissue structure can be recognized by microscopic observation. The staining is not limited as long as the tissue structure can be recognized, but is preferably a staining for brightfield observation. The staining for brightfield observation is not limited as long as it can stain at least cell nuclei and regions other than cell nuclei in a distinguishable hue. When the specimen is a mammalian tissue specimen, for example, HE staining can be used. The staining is preferably the same as that used for the specimen from which the first training image 70R1 was acquired.

第２の訓練用画像７０Ｒ２からは、第２の訓練データ７４Ｒ２と、第２の予備訓練データ７０Ｒ２Ｌが生成される。第２の訓練データ７４Ｒ２は、第２の訓練用画像７０Ｒ２に含まれる色相を原色毎に分離した単一色画像に関する情報である。第２の予備訓練データ７０Ｒ２Ｌは、第２の訓練用画像７０Ｒ２に含まれている組織の領域が、非腫瘍領域を含むことを示す二値化データ（ラベル値）である。第２の訓練用画像７０Ｒ２に含まれる組織の領域が非腫瘍領域を含むか否かは、例えば病理医等による標本観察により判定される。ここで、前記訓練用画像に含まれる組織の領域において、腫瘍細胞と、非腫瘍細胞と、細胞以外の組織の領域が混在する場合がある。この場合、前記訓練用画像に含まれる組織の領域が、例えば前記訓練用画像に含まれる組織の領域全体の面積を１００％とした際に、非腫瘍細胞と細胞以外の組織の領域（非腫瘍領域）とが組織全体の面積の約１０％以上、約２０％以上、約３０％以上、約４０％以上、約５０％以上、約６０％以上、約７０％以上、約８０％以上、約９０％以上、または１００％を占める場合に、非腫瘍領域を含むと判定することができる。好ましくは、約４０％以上、約５０％以上、約６０％以上、約７０％以上、約８０％以上、約９０％以上、または１００％を占める場合に、非腫瘍領域を含むと判定することができる。前記第２の訓練用画像７０Ｒ２に含まれる組織の領域全体の面積（１００％）から腫瘍領域が占める割合を減じた値を前記非腫瘍細胞と細胞以外の組織の領域とが組織全体の面積を占める割合として決定してもよい。 Second training data 74R2 and second preliminary training data 70R2L are generated from the second training image 70R2. The second training data 74R2 is information regarding single-color images in which the hues contained in the second training image 70R2 are separated into primary colors. The second preliminary training data 70R2L is binarized data (label values) indicating that the tissue region contained in the second training image 70R2 includes a non-tumor region. Whether the tissue region contained in the second training image 70R2 includes a non-tumor region is determined, for example, by specimen observation by a pathologist, etc. Here, the tissue region contained in the training image may contain a mixture of tumor cells, non-tumor cells, and tissue regions other than cells. In this case, the tissue area included in the training image can be determined to contain a non-tumorous region if, for example, the area of the entire tissue area included in the training image is taken as 100%, and the area of non-tumor cells and tissue other than cells (non-tumor region) occupies approximately 10% or more, approximately 20% or more, approximately 30% or more, approximately 40% or more, approximately 50% or more, approximately 60% or more, approximately 70% or more, approximately 80% or more, approximately 90% or more, or 100% of the area of the entire tissue. Preferably, the tissue area can be determined to contain a non-tumorous region if the area of non-tumor cells and tissue other than cells occupies approximately 40% or more, approximately 50% or more, approximately 60% or more, approximately 70% or more, approximately 80% or more, approximately 90% or more, or 100%. The percentage of the area of the entire tissue area included in the second training image 70R2 (100%) minus the percentage of the tumorous region may be determined as the percentage of the area of the entire tissue area.

前記訓練用画像に含まれる第３の訓練用画像７０Ｒ３は、第１の訓練用画像７０Ｒ１、および／または第２の訓練用画像７０Ｒ２を取得した標本における組織を含まない領域（非組織領域ともいう）から取得された画像である。 The third training image 70R3 included in the training images is an image obtained from a region that does not contain tissue (also called a non-tissue region) in the specimen from which the first training image 70R1 and/or the second training image 70R2 were obtained.

第３訓練用画像からは、第３の訓練データ７４Ｒ３と、第３の予備訓練データ７０Ｒ３Ｌが生成される。第３の訓練データ７４Ｒ３は、第３の訓練用画像７０Ｒ３に含まれる色相を原色毎に分離した単一色画像に関する情報である。第３の予備訓練データ７０Ｒ２Ｌは、第３の訓練用画像７０Ｒ３に含まれている領域が、組織を含まない領域であることであることを示す二値化データ（ラベル値）である。第３の訓練用画像７０Ｒ３に含まれる領域は、例えば病理医等による標本観察により判定される。ここで、前記訓練用画像には、組織の領域と背景（例えば、プレパラートのガラス部分）の領域とが混在する場合がある。この場合、前記訓練用画像に含まれる背景の領域が、例えば前記１枚の訓練用画像に含まれる領域全体の面積を１００％とした際に、背景の領域が約９０％以上、約９１％以上、約９２％以上、約９３％以上、約９４％以上、約９５％以上、約９６％以上、約９７％以上、約９８％以上、または約９９％以上を占める場合、あるいは背景の領域が１００％を占める場合に、組織を含まない領域であると判定することができる。好ましくは、背景の領域が約９５％以上、約９６％以上、約９７％以上、約９８％以上、または約９９％以上を占める場合、あるいは背景の領域が１００％を占める場合に、組織を含まない領域であると判定することができる。 From the third training image, third training data 74R3 and third preliminary training data 70R3L are generated. The third training data 74R3 is information relating to a single-color image in which the hues contained in the third training image 70R3 are separated into primary colors. The third preliminary training data 70R2L is binarized data (label values) indicating that the areas contained in the third training image 70R3 are areas that do not contain tissue. The areas contained in the third training image 70R3 are determined, for example, by specimen observation by a pathologist or the like. Here, the training image may contain a mixture of tissue areas and background areas (e.g., the glass portion of a slide slide). In this case, when the area of the background region included in the training image is, for example, 100% when the area of the entire area included in one training image is taken as 100%, if the background region occupies approximately 90% or more, approximately 91% or more, approximately 92% or more, approximately 93% or more, approximately 94% or more, approximately 95% or more, approximately 96% or more, approximately 97% or more, approximately 98% or more, or approximately 99% or more, or if the background region occupies 100%, the region can be determined to be tissue-free. Preferably, if the background region occupies approximately 95% or more, approximately 96% or more, approximately 97% or more, approximately 98% or more, or approximately 99% or more, or if the background region occupies 100%, the region can be determined to be tissue-free.

各訓練用画像は、訓練用の組織標本を、例えば５倍から２０倍に拡大した画像として取得されることが好ましい。判別精度の点から、３０倍以上、特に４０倍以上に拡大した画像は、訓練用画像としては不向きである。 Each training image is preferably acquired as an image of the training tissue specimen enlarged, for example, by 5 to 20 times. From the standpoint of discrimination accuracy, images enlarged by 30 times or more, and especially 40 times or more, are unsuitable as training images.

第１の訓練データ７４Ｒ１および第１の予備訓練データ７０Ｒ１Ｌからは腫瘍領域訓練データが生成され、第２の訓練データ７４Ｒ２および第２の予備訓練データ７０Ｒ２Ｌからは非腫瘍領域訓練データが生成され、第３の訓練データ７４Ｒ３および第３の予備訓練データ７０Ｒ２Ｌからは非組織領域訓練データが生成される。腫瘍領域訓練データ、非腫瘍領域訓練データおよび非組織領域訓練データを併せて訓練データと呼ぶことがある。 Tumor region training data is generated from the first training data 74R1 and the first preliminary training data 70R1L, non-tumor region training data is generated from the second training data 74R2 and the second preliminary training data 70R2L, and non-tissue region training data is generated from the third training data 74R3 and the third preliminary training data 70R2L. The tumor region training data, non-tumor region training data, and non-tissue region training data are sometimes collectively referred to as training data.

本発明の概要および実施の形態では、ＨＥ染色された組織標本を撮像した画像内に含まれる組織が腫瘍領域、非腫瘍領域および非組織領域を含むことを、深層学習アルゴリズムによって判別する場合を一例として説明する。 In the overview and embodiments of the present invention, we will explain, as an example, a case in which a deep learning algorithm is used to determine whether the tissue contained in an image of an HE-stained tissue specimen includes tumorous regions, non-tumorous regions, and non-tissue regions.

［深層学習方法および画像解析方法の概要］
はじめに、深層学習方法について説明する。 [Overview of deep learning and image analysis methods]
First, we explain the deep learning method.

・深層学習方法の概要
図１を用いて、第１の訓練データ７４Ｒ１、第１の予備訓練データ７０Ｒ１Ｌおよび腫瘍領域訓練データ７５Ｒ１の概要を説明する。図１には、スライドイメージスキャナにより取得した組織標本のホールスライドイメージ７０Ｗ１を使ったニューラルネットワークへの訓練データの入力例を示す。ホールスライドイメージ７０Ｗ１は明視野観察用染色としてＨＥ染色を施して作製された標本を明視野で撮像した組織標本のスライドイメージを示す。ホールスライドイメージ７０Ｗ２は、病理医によりホールスライドイメージ７０Ｗ１において腫瘍領域を含む部分を病理医が実線で囲って指定した画像である。ホールスライドイメージ７０Ｗ３は、ホールスライドイメージ７０Ｗ２を訓練用画像として分割（例えば、５１２分割）した像を示す。ホールスライドイメージ７０Ｗ３内の記号Ｒ１で示された四角枠で囲まれた領域は、図１に示す第１の訓練用画像７０Ｒ１として使用する領域を示す。記号Ｒ２で示された四角枠で囲まれた領域は、図２に示す第２の訓練用画像７０Ｒ２として使用する領域を示す。記号Ｒ３で示された四角枠で囲まれた領域は、図３に示す第３の訓練用画像７０Ｒ３として使用する領域を示す。第１の訓練用画像７０Ｒ１が腫瘍領域を含むこと、第２の訓練用画像７０Ｒ２が非腫瘍領域を含むこと、および第３の訓練用画像７０Ｒ３が組織を含まない領域であることは、それぞれの訓練画像を取得する前に判定していてもよく、それぞれの訓練画像を取得してから判定してもよい。 Overview of the Deep Learning Method: An overview of the first training data 74R1, the first preliminary training data 70R1L, and the tumor region training data 75R1 will be described using Figure 1. Figure 1 shows an example of training data input to a neural network using a whole slide image 70W1 of a tissue specimen acquired by a slide image scanner. The whole slide image 70W1 is a slide image of a tissue specimen prepared by HE staining for bright-field observation and captured in bright-field. The whole slide image 70W2 is an image designated by a pathologist by enclosing a portion of the whole slide image 70W1 containing the tumor region with a solid line. The whole slide image 70W3 is an image obtained by dividing the whole slide image 70W2 (e.g., 512 divisions) as a training image. The area enclosed by a square frame marked with the symbol R1 in the whole slide image 70W3 indicates the area used as the first training image 70R1 shown in Figure 1. The rectangular area indicated by the symbol R2 indicates the area used as the second training image 70R2 shown in Figure 2. The rectangular area indicated by the symbol R3 indicates the area used as the third training image 70R3 shown in Figure 3. The fact that the first training image 70R1 contains a tumor region, the second training image 70R2 contains a non-tumor region, and the third training image 70R3 is a region that does not contain tissue may be determined before or after each training image is acquired.

図１において、第１の訓練用画像７０Ｒ１は、ＨＥ染色した標本を明視野観察下で例えばカラー画像として撮像しているため、第１の訓練用画像７０Ｒ１には複数の色相が含まれる。 In Figure 1, the first training image 70R1 is an HE-stained specimen captured under bright-field observation, for example as a color image, and therefore includes multiple hues.

第１の訓練用画像７０Ｒ１は、例えば公知の光学顕微鏡、蛍光顕微鏡またはバーチャルスライドスキャナ等の画像取得装置を用いて、予め取得することができる。例示的には、本実施形態において画像取得装置から取得されるカラー撮像は、カラースペースがＲＧＢの２４ビットカラーであることが好ましい。ＲＧＢの２４ビットカラーでは、赤色、緑色および青色のそれぞれの濃さ（色濃度）を、８ビット（２５６段階）の階調で表すことが好ましい。第１の訓練用画像７０Ｒ１は、１以上の原色を含む画像であればよい。第１の訓練用画像７０Ｒ１に対応して、第１の訓練データ７４Ｒ１が生成される。 The first training image 70R1 can be acquired in advance using an image acquisition device such as a known optical microscope, fluorescent microscope, or virtual slide scanner. For example, in this embodiment, the color image acquired from the image acquisition device preferably has a color space of 24-bit RGB color. In 24-bit RGB color, the intensity (color density) of each of the red, green, and blue colors is preferably represented by 8-bit (256 levels) gradation. The first training image 70R1 may be any image that includes one or more primary colors. First training data 74R1 is generated corresponding to the first training image 70R1.

本発明において、色相は、例示的には、光の３原色の組み合わせ、または色の３原色の組み合わせで規定される。第１の訓練データ７４Ｒ１は、第１の訓練用画像７０Ｒ１から生成される、第１の訓練用画像７０Ｒ１に現れる色相を個々の原色に分離して原色毎に生成し、その濃度に応じた符号で表されたデータである。図１では光の３原色である赤（Ｒ）、緑（Ｇ）、青（Ｂ）の原色毎に分離した単一色の濃淡で示された画像（以下、「単一色画像」ともいう）７２Ｒ１Ｒ，７２Ｒ１Ｇ，７２Ｒ１Ｂを得る。 In the present invention, hue is illustratively defined as a combination of the three primary colors of light or the three primary colors of color. The first training data 74R1 is generated from the first training image 70R1 by separating the hues appearing in the first training image 70R1 into individual primary colors, and data is generated for each primary color, expressed as a code corresponding to its density. In Figure 1, images 72R1R, 72R1G, and 72R1B are obtained, which are shown in shades of a single color separated into the three primary colors of light: red (R), green (G), and blue (B) (hereinafter also referred to as "single-color images").

単一色画像７２Ｒ１Ｒ，７２Ｒ１Ｇ，７２Ｒ１Ｂ上の各画素について各色の色濃度を符号化し、画像全体をＲ、Ｇ、Ｂ毎の各画像について、画素毎の色濃度に対応した符号化図（以下、「色濃度符号化図」ともいう）７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂを生成する。色濃度は、各色２５６段階を示す数値で符号化しても良い。また、色濃度は、各色２５６段階を示す数値に対して、さらに前処理を行って、各画素における色濃度を例えば、値０から値７の８段階で示す数字で符号化してもよい。図１に例示的に示すＲ、Ｇ、Ｂ各色の単一色画像から生成された色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂは、説明の便宜上、各画素における色濃度を値０から値７の８段階の符号で表している。図１に示す色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂから画素毎にＲ、Ｇ、Ｂの色濃度値が組み合わされた第１の訓練データ７４Ｒ１が生成される。色濃度を示す符号は、本明細書において色濃度値ともいう。また、第１の訓練データ７４Ｒ１として、色濃度符号化図に替えて、各画素に対応した色濃度値の行列を生成してもよい。 The color density of each pixel in the single-color images 72R1R, 72R1G, and 72R1B is encoded, and coded maps (hereinafter referred to as "color density coded maps") 72R1r, 72R1g, and 72R1b corresponding to the color density of each pixel are generated for each R, G, and B image. Color density may be coded using a numerical value representing 256 levels of each color. Furthermore, the numerical values representing 256 levels of color density may be further preprocessed to code the color density of each pixel using a number representing eight levels, for example, from 0 to 7. For ease of explanation, the color density coded maps 72R1r, 72R1g, and 72R1b generated from the R, G, and B single-color images shown in Figure 1 represent the color density of each pixel using eight levels of codes, from 0 to 7. First training data 74R1 is generated from the color density encoding charts 72R1r, 72R1g, and 72R1b shown in FIG. 1, combining R, G, and B color density values for each pixel. Codes indicating color density are also referred to as color density values in this specification. Alternatively, a matrix of color density values corresponding to each pixel may be generated as the first training data 74R1 instead of the color density encoding chart.

図１において、第１の予備訓練データ７０Ｒ１Ｌは、第１の訓練用画像７０Ｒ１に含まれている組織が、腫瘍領域を含むことを示すデータである。例えば、第１の訓練用画像７０Ｒ１は、全領域が病理医の判定により腫瘍領域であると判定されているため第１の訓練用画像７０Ｒ１には、後述する所定の画素数の領域毎に腫瘍領域を含むことを示すラベル値として同じ数値、例えば「１」が付され、第１の予備訓練データ７０Ｒ１Ｌとなる。 In FIG. 1, the first preliminary training data 70R1L is data indicating that the tissue contained in the first training image 70R1 includes a tumor region. For example, since the entire area of the first training image 70R1 has been determined by a pathologist to be a tumor region, the first training image 70R1 is assigned the same numerical value, for example, "1," as a label value indicating that each area of a predetermined number of pixels described below contains a tumor region, and this becomes the first preliminary training data 70R1L.

深層学習方法では、図１に示す第１の訓練データ７４Ｒ１および第１の予備訓練データ７０Ｒ１Ｌから腫瘍領域訓練データ７５Ｒ１を生成する。そして、腫瘍領域訓練データ７５Ｒ１の各画素の色濃度値データ７６（第１の訓練データ７４Ｒ１から得られる）を入力層５０ａとし、ラベル値７７Ｒ１（第１の予備訓練データ７０Ｒ１Ｌから得られる）を出力層５０ｂとするニューラルネットワーク５０に学習させる。 In the deep learning method, tumor region training data 75R1 is generated from the first training data 74R1 and first preliminary training data 70R1L shown in Figure 1. Then, a neural network 50 is trained using color density value data 76 (obtained from the first training data 74R1) of each pixel in the tumor region training data 75R1 as the input layer 50a and label values 77R1 (obtained from the first preliminary training data 70R1L) as the output layer 50b.

次に、図２を用いて、第２の訓練データ７４Ｒ２、第２の予備訓練データ７０Ｒ２Ｌおよび非腫瘍領域訓練データ７５Ｒ２の概要を説明する。図２において、ホールスライドイメージ７０Ｗ１、ホールスライドイメージ７０Ｗ２、およびホールスライドイメージ７０Ｗ３は図１と同様であるため、第２の訓練用画像７０Ｒ２には、第１の訓練用画像７０Ｒ１と同様に複数の色相が含まれる。 Next, an overview of the second training data 74R2, the second preliminary training data 70R2L, and the non-tumor region training data 75R2 will be provided using Figure 2. In Figure 2, hole slide image 70W1, hole slide image 70W2, and hole slide image 70W3 are the same as those in Figure 1, so the second training image 70R2 contains multiple hues, just like the first training image 70R1.

第２の訓練用画像７０Ｒ２は、腫瘍領域を含む組織に替えて非腫瘍領域含む組織を用いる以外は、第１の訓練用画像７０Ｒ１と同様の方法で取得される。第２の訓練用画像７０Ｒ２に対応して、第２の訓練データ７４Ｒ２が生成される。 The second training images 70R2 are acquired in the same manner as the first training images 70R1, except that tissue containing non-tumorous regions is used instead of tissue containing tumorous regions. Second training data 74R2 is generated corresponding to the second training images 70R2.

第２の訓練データ７４Ｒ２は、腫瘍領域を含む組織に替えて非腫瘍領域含む組織を用いる以外は、第１の訓練データ７４Ｒ１と同様の方法で生成される。図２に示す色濃度符号化図７２Ｒ２ｒ，７２Ｒ２ｇ，７２Ｒ２ｂから画素毎にＲ、Ｇ、Ｂの色濃度値が組み合わされた第２の訓練データ７４Ｒ２が生成される。 The second training data 74R2 is generated in the same manner as the first training data 74R1, except that tissue containing non-tumorous regions is used instead of tissue containing tumorous regions. The second training data 74R2 is generated by combining R, G, and B color density values for each pixel from the color density encoding diagrams 72R2r, 72R2g, and 72R2b shown in Figure 2.

図２において、第２の予備訓練データ７０Ｒ２Ｌは、第２の訓練用画像７０Ｒ２が、非腫瘍領域を含むことを示すデータである。第２の訓練用画像７０Ｒ２は全領域が病理医の判定により非腫瘍領域であると判定されているため、第２の訓練用画像７０Ｒ２には、後述する所定の画素数の領域毎に非腫瘍領域を含むことを示すラベル値として同じ数値、例えば「２」が付され、第２の予備訓練データ７０Ｒ２Ｌとなる。非腫瘍領域を含む領域であることを示すラベル値は、腫瘍領域を含む領域および組織を含まない領域を示すラベル値と区別できる限り制限されない。 In Figure 2, second preliminary training data 70R2L is data indicating that second training image 70R2 contains a non-tumorous region. Since all regions of second training image 70R2 have been determined by a pathologist to be non-tumorous, the second training image 70R2 is assigned the same numerical value, for example, "2," as a label value indicating that it contains a non-tumorous region for each region of a predetermined number of pixels (described below), thereby becoming second preliminary training data 70R2L. The label value indicating that a region contains a non-tumorous region is not limited as long as it can be distinguished from the label values indicating regions containing tumorous regions and regions not containing tissue.

深層学習方法では、図２に示す第２の訓練データ７４Ｒ２および第２の予備訓練データ７０Ｒ２Ｌから非腫瘍領域訓練データ７５Ｒ２を生成する。そして、非腫瘍領域訓練データ７５Ｒ２の各画素の色濃度値データ７６（第２の訓練データ７４Ｒ２から得られる）を入力層５０ａとし、ラベル値７７Ｒ２（第２の予備訓練データ７０Ｒ２Ｌから得られる）を出力層５０ｂとするニューラルネットワーク５０に学習させる。 In the deep learning method, non-tumor region training data 75R2 is generated from the second training data 74R2 and second preliminary training data 70R2L shown in Figure 2. Then, a neural network 50 is trained using color density value data 76 (obtained from the second training data 74R2) of each pixel in the non-tumor region training data 75R2 as the input layer 50a and label values 77R2 (obtained from the second preliminary training data 70R2L) as the output layer 50b.

次に、図３を用いて、第３の訓練データ７４Ｒ３、第３の予備訓練データ７０Ｒ３Ｌおよび非組織領域訓練データ７５Ｒ３の概要を説明する。図３において、ホールスライドイメージ７０Ｗ１、ホールスライドイメージ７０Ｗ２、およびホールスライドイメージ７０Ｗ３は図１と同様であるため、第３の訓練用画像７０Ｒ３には、第１の訓練用画像７０Ｒ１と同様に複数の色相が含まれる。 Next, an overview of the third training data 74R3, the third preliminary training data 70R3L, and the non-tissue region training data 75R3 will be provided using Figure 3. In Figure 3, hole slide image 70W1, hole slide image 70W2, and hole slide image 70W3 are the same as those in Figure 1, so the third training image 70R3 contains multiple hues, just like the first training image 70R1.

第３の訓練用画像７０Ｒ３は、腫瘍領域を含む領域に替えて組織を含まない領域を用いる以外は、第１の訓練用画像７０Ｒ１と同様の方法で取得される。第３の訓練用画像７０Ｒ３に対応して、第３の訓練データ７４Ｒ３が生成される。 The third training image 70R3 is acquired in the same manner as the first training image 70R1, except that a tissue-free region is used instead of the region containing the tumor region. Third training data 74R3 is generated corresponding to the third training image 70R3.

第３の訓練データ７４Ｒ３は、腫瘍領域を含む領域に替えて組織を含まない領域を用いる以外は、第１の訓練データ７４Ｒ１と同様の方法で生成される。図３に示す色濃度符号化図７２Ｒ３ｒ，７２Ｒ３ｇ，７２Ｒ３ｂから画素毎にＲ、Ｇ、Ｂの色濃度値が組み合わされた第３の訓練データ７４Ｒ３が生成される。 The third training data 74R3 is generated in the same manner as the first training data 74R1, except that a tissue-free region is used instead of a region containing a tumor region. The third training data 74R3 is generated by combining R, G, and B color density values for each pixel from the color density encoding diagrams 72R3r, 72R3g, and 72R3b shown in Figure 3.

図３において、第３の予備訓練データ７０Ｒ３Ｌは、第３の訓練用画像７０Ｒ３に含まれている領域が、組織を含まない領域であることを示すデータである。第３の訓練用画像７０Ｒ３は全領域が病理医の判定により非組織領域であると判定されているため、第３の訓練用画像７０Ｒ３には、後述する所定の画素数の領域毎に組織を含まない領域であることを示すラベル値として同じ数値、例えば「０」が付され、第３の予備訓練データ７０Ｒ３Ｌとなる。組織を含まない領域であることを示すラベル値は、腫瘍領域を含む領域および非腫瘍領域を含む領域を示す数値とは区別される。 In Figure 3, the third preliminary training data 70R3L is data indicating that the areas included in the third training image 70R3 are areas that do not contain tissue. Since all areas of the third training image 70R3 have been determined by a pathologist to be non-tissue areas, the third training image 70R3 is assigned the same numerical value, for example "0," as a label value indicating that each area contains no tissue for each region of a predetermined number of pixels (described below), thereby forming the third preliminary training data 70R3L. The label value indicating that each area does not contain tissue is distinguished from the numerical value indicating an area containing a tumorous region and an area containing a non-tumorous region.

深層学習方法では、図３に示す第３の訓練データ７４Ｒ３および第３の予備訓練データ７０Ｒ３Ｌから非組織領域訓練データ７５Ｒ３を生成するそして、非組織領域訓練データ７５Ｒ３の各画素の色濃度値データ７６（第３の訓練データ７４Ｒ３から得られる）を入力層５０ａとし、ラベル値７７Ｒ３（第３の予備訓練データ７０Ｒ３Ｌから得られる）を出力層５０ｂとするニューラルネットワーク５０に学習させる。 In the deep learning method, non-tissue region training data 75R3 is generated from the third training data 74R3 and third preliminary training data 70R3L shown in Figure 3. Then, the non-tissue region training data 75R3 is trained on a neural network 50, which uses color density value data 76 of each pixel of the non-tissue region training data 75R3 (obtained from the third training data 74R3) as the input layer 50a and label values 77R3 (obtained from the third preliminary training data 70R3L) as the output layer 50b.

図４（ａ）、（ｂ）を参照して、腫瘍領域訓練データ７５Ｒ１の生成方法を説明する。図４（ａ）に示す第１の訓練データ７４Ｒ１は、所定の画素数の腫瘍領域の色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂのそれぞれの色濃度値を組み合わせたデータである。本明細書において、第１の訓練データ７４Ｒ１の各画素の位置は、便宜上列を左からｌ_１、ｌ_２・・・ｌ_i、行を上からｍ_１、ｍ_２・・・ｍ_ｊで表す。第１の訓練データ７４Ｒ１は、図４（ａ）では、その画像サイズ（訓練データ１つあたりの大きさ）が説明の便宜のために簡素化されており、第１の訓練データ７４Ｒ１が、縦方向９画素および横方向９画素の合計８１画素を有する。すなわち、各画素の位置は、列を左からｌ_１、ｌ_２・・・ｌ_９、行を上からｍ_１、ｍ_２・・・ｍ_９で表す。図４（ａ）に示す３つの値が、各画素におけるＲ、Ｇ、Ｂ各色の色濃度値である。例示的には、３つの値は、左から、赤（Ｒ）、緑（Ｇ）および青（Ｂ）の順序で格納されている。第１の訓練データ７４Ｒ１の各画素の色濃度値は便宜的に値０から値７の８段階で示されている。これは、画像の前処理の一例として、撮像された際に２５６段階で表されている各色の画像７２Ｒ１Ｒ，７２Ｒ１Ｇ，７２Ｒ１Ｂの明るさを、８段階の色濃度値にそれぞれ変換する処理である。色濃度値は、例えば最も低い明るさ（ＲＧＢカラー２５６段階で表した時の輝度値が低い階調群）を色濃度値０とし、明るさの程度が高くなるに従い徐々に高い値を割り当ててゆき、最も高い明るさ（ＲＧＢカラー２５６段階で表した時の輝度値が高い階調群）を色濃度値７とする。 A method for generating tumor region training data 75R1 will be described with reference to Figures 4(a) and (b). The first training data 74R1 shown in Figure 4(a) is data combining the color density values of the color density encoding maps 72R1r, 72R1g, and 72R1b of tumor regions with a predetermined number of pixels. For convenience, the position of each pixel in the first training data 74R1 is represented by columns _l1 , _l2 ... _l1 from left to right, and rows _m1 , _m2 ... _mj from top to bottom. In Figure 4(a), the image size (size per training data item) of the first training data 74R1 is simplified for ease of explanation, and the first training data 74R1 has a total of 81 pixels, nine vertically and nine horizontally. That is, the position of each pixel is represented by columns _l1 , _l2 ... _l9 from left to right, and rows _m1 , _m2 ... _m9 from top to bottom. The three values shown in FIG. 4A are the color density values of the R, G, and B colors at each pixel. For example, the three values are stored in the order of red (R), green (G), and blue (B) from left to right. For convenience, the color density values of each pixel in the first training data 74R1 are shown in eight levels, from 0 to 7. This is an example of image preprocessing, in which the brightness of each color image 72R1R, 72R1G, and 72R1B, which is represented in 256 levels when captured, is converted into eight levels of color density values. For example, the color density value is assigned such that the lowest brightness (a group of gradations with low brightness values when expressed in the 256-level RGB color system) is assigned a color density value of 0, and gradually higher values are assigned as the brightness increases, with the highest brightness (a group of gradations with high brightness values when expressed in the 256-level RGB color system) being assigned a color density value of 7.

図１および図４（ｂ）に示す腫瘍領域訓練データ７５Ｒ１は、図４（ａ）に示す第１の訓練データ７４Ｒ１を所定数の画素からなる領域（以下、「ウィンドウサイズ」と記載する）で切り出したデータに第１の予備訓練データ７０Ｒ１Ｌのラベル値に対応するラベル値７７Ｒ１を付したものである。ウィンドウサイズの腫瘍領域訓練データ７５Ｒ１も、説明の便宜のために３×３画素（ｌ_１：ｌ_３，ｍ_１：ｍ_３）に簡素化して示すが、各画素の位置は、便宜上列を左からｌ_１、ｌ_２・・・ｌ_i、行を上からｍ_１、ｍ_２・・・ｍ_ｊで表す。実際の好ましいウィンドウサイズは、例示的には１２５×１２５画素（ｉ＝１２５、ｊ＝１２５）、１５０×１５０画素（ｉ＝１５０、ｊ＝１５０）、１６６×１６６画素（ｉ＝１６６、ｊ＝１６６）、１８０×１８０画素（ｉ＝１８０、ｊ＝１８０）、２００×２００画素（ｉ＝２００、ｊ＝２００）、２２０×２２０画素（ｉ＝２２０、ｊ＝２２０）、２５０×２５０画素（ｉ＝２５０、ｊ＝２５０）程度が、学習効率の点から好ましい。より好ましくは、ウィンドウサイズは２００×２００画素程度である。ウィンドウサイズは、たとえば６０μｍ×６０μｍ以上５００μｍ×５００μｍ以下の範囲である。好ましくは、６０μｍ×６０μｍ、１００μｍ×１００μｍ、１５０μｍ×１５０μｍ、２００μｍ×２００μｍ、２５０μｍ×２５０μｍ、３００μｍ×３００μｍ、３５０μｍ×３５０μｍ、４００μｍ×４００μｍ、４５０μｍ×４５０μｍ、５００μｍ×５００μｍである。より好ましくは、２００μｍ×２００μｍ～４００μｍ×４００μｍである。例えば、図４（ｂ）に示すように、第１の訓練データ７４Ｒ１の左端から３×３画素のウィンドウＷ１（ｌ_１：ｌ_３，ｍ_１：ｍ_３）を設定し、第１の訓練データ７４Ｒ１に対して左から右にウィンドウＷ１を移動させる。ウィンドウＷ１は（ｌ_１：ｌ_３，ｍ_１：ｍ_３）の腫瘍領域訓練データ７５Ｒ１を切り出した後、第１の予備訓練データ７０Ｒ１Ｌのラベル値に対応するラベル値７７Ｒ１を腫瘍領域訓練データ７５Ｒ１に付す。ウィンドウＷ１は点線で示す腫瘍領域訓練データ７４Ｒ１の次のウィンドウ（ｌ_４：ｌ_６，ｍ_４：ｍ_６）に移動し、（ｌ_４：ｌ_６，ｍ_４：ｍ_６）をウィンドウサイズの腫瘍領域訓練データ７５Ｒ１として切り出す。新たに切り出された腫瘍領域訓練データ７５Ｒ１に第１の予備訓練データ７０Ｒ１Ｌのラベル値に対応するラベル値７７Ｒ１を付す。この操作を、繰り返し、複数のウィンドウサイズの腫瘍領域訓練データ７５Ｒ１を第１の訓練データ７４Ｒ１から切り出す。切り出したウィンドウサイズの腫瘍領域訓練データ７５Ｒ１は第１の予備訓練データ７０Ｒ１Ｌのラベル値に対応するラベル値７７Ｒ１が付され図１に示すニューラルネットワーク５０の学習に用いられる。 1 and 4(b), the tumor region training data 75R1 is obtained by extracting a region consisting of a predetermined number of pixels (hereinafter referred to as the "window size") from the first training data 74R1 shown in Fig. 4(a ₎ and assigning a label value 77R1 corresponding to the label value of the first preliminary training data 70R1L to the extracted data. For ease of explanation, the window-sized tumor region training data 75R1 is also shown simplified to 3 x 3 pixels (l1: _l3 , _m1 : _m3 ), but for convenience, the position of each pixel is represented by _l1 , _l2 ... _l1 from left to right for columns and _m1 , _m2 ... _mj for rows from top to bottom. In terms of learning efficiency, preferred window sizes are, for example, approximately 125 x 125 pixels (i = 125, j = 125), 150 x 150 pixels (i = 150, j = 150), 166 x 166 pixels (i = 166, j = 166), 180 x 180 pixels (i = 180, j = 180), 200 x 200 pixels (i = 200, j = 200), 220 x 220 pixels (i = 220, j = 220), and 250 x 250 pixels (i = 250, j = 250). More preferably, the window size is approximately 200 x 200 pixels. The window size is, for example, in the range of 60 μm x 60 μm to 500 μm x 500 μm. Preferably, the window size is 60 μm × 60 μm, 100 μm × 100 μm, 150 μm × 150 μm, 200 μm × 200 μm, 250 μm × 250 μm, 300 μm × 300 μm, 350 μm × 350 μm, 400 μm × 400 μm, 450 μm × 450 μm, or 500 μm × 500 μm. More preferably, the window size is 200 μm × 200 μm to 400 μm × 400 μm. For example, as shown in FIG. 4B, a 3 × 3 pixel window W1 (l ₁ :l ₃ , m ₁ :m ₃ ) is set from the left end of the first training data 74R1, and the window W1 is moved from left to right relative to the first training data 74R1. After extracting tumor region training data 75R1 of ( _l1 : _l3 , _m1 : _m3 ) from window W1, label values 77R1 corresponding to the label values of first preliminary training data 70R1L are assigned to the tumor region training data 75R1. Window W1 then moves to the next window ( _l4 : _l6 , _m4 : _m6 ) of tumor region training data 74R1, indicated by the dotted line, and extracts ( _l4 : _l6 , _m4 : _m6 ) as window-sized tumor region training data 75R1. Label values 77R1 corresponding to the label values of first preliminary training data 70R1L are assigned to the newly extracted tumor region training data 75R1. This operation is repeated to extract tumor region training data 75R1 of multiple window sizes from the first training data 74R1. The tumor region training data 75R1 of the extracted window size is assigned a label value 77R1 corresponding to the label value of the first preliminary training data 70R1L and is used for training the neural network 50 shown in FIG.

図１に示すように、ニューラルネットワーク５０の入力層５０ａのノード数は、入力されるウィンドウサイズの腫瘍領域訓練データ７５Ｒ１の画素数と画像に含まれる原色の数（例えば光の三原色であれば、Ｒ、Ｇ、Ｂの３つ）との積に対応している。ウィンドウサイズの腫瘍領域訓練データ７５Ｒ１の各画素の色濃度値データ７６をニューラルネットワークの入力層５０ａとし、第１の予備訓練データ７０Ｒ１Ｌに対応するラベル値７７Ｒ１をニューラルネットワークの出力層５０ｂとして、ニューラルネットワーク５０に学習させる。各画素の色濃度値データ７６は、腫瘍領域訓練データ７５Ｒ１の各画素のＲ、Ｇ、Ｂ各色の色濃度値の集合データである。例示として、ウィンドウサイズの腫瘍領域訓練データ７５Ｒ１が３×３画素である場合には、各画素についてＲ、Ｇ、Ｂ毎に１つずつの色濃度値が与えられるので、色濃度値データ７６の色濃度値数は「２７」（３×３×３＝２７）となり、ニューラルネットワーク５０の入力層５０ａのノード数も「２７」となる。 As shown in Figure 1, the number of nodes in the input layer 50a of the neural network 50 corresponds to the product of the number of pixels in the input window-sized tumor region training data 75R1 and the number of primary colors contained in the image (for example, R, G, and B for the three primary colors of light). The color density value data 76 of each pixel of the window-sized tumor region training data 75R1 is used as the input layer 50a of the neural network, and the label value 77R1 corresponding to the first preliminary training data 70R1L is used as the output layer 50b of the neural network, and the neural network 50 is trained. The color density value data 76 of each pixel is a collection of color density values of the R, G, and B colors of each pixel of the tumor region training data 75R1. For example, if the window size of the tumor region training data 75R1 is 3 x 3 pixels, each pixel is assigned one color density value for each of R, G, and B, so the number of color density values in the color density value data 76 is 27 (3 x 3 x 3 = 27), and the number of nodes in the input layer 50a of the neural network 50 is also 27.

このように、ニューラルネットワーク５０に入力するウィンドウサイズの腫瘍領域訓練データ７５Ｒ１は、ユーザが作成することなく、コンピュータが自動的に作成することができる。これにより、ニューラルネットワーク５０の効率的な深層学習が促進される。 In this way, the window-sized tumor region training data 75R1 to be input to the neural network 50 can be automatically created by the computer without the user having to create it. This promotes efficient deep learning of the neural network 50.

図４（ｂ）に示すように、初期状態において、ウィンドウＷ１は、第１の訓練データ７４Ｒ１の左上角に位置している。以後、ウィンドウＷ１によってウィンドウサイズの腫瘍領域訓練データ７５Ｒ１を切り出し、ニューラルネットワーク５０の学習を行う度に、ウィンドウＷ１の位置を移動させる。具体的には、ウィンドウＷ１が第１の訓練データ７４Ｒ１の例えば全ての画素を走査するように、ウィンドウＷ１を移動させる。これにより、第１の訓練データ７４Ｒ１の全ての画素から切り出されたウィンドウサイズの腫瘍領域訓練データ７５Ｒ１が、ニューラルネットワーク５０の学習に用いられる。よってニューラルネットワーク５０の学習の程度を向上でき、深層学習の結果、図６に示すニューラルネットワーク６０構造を有する深層学習アルゴリズムが得られる。 As shown in Figure 4(b), in the initial state, window W1 is located in the upper left corner of first training data 74R1. Thereafter, window-sized tumor region training data 75R1 is extracted using window W1, and the position of window W1 is moved each time neural network 50 is trained. Specifically, window W1 is moved so that it scans, for example, all pixels of first training data 74R1. As a result, window-sized tumor region training data 75R1 extracted from all pixels of first training data 74R1 is used for training neural network 50. This improves the level of learning of neural network 50, and as a result of deep learning, a deep learning algorithm having the neural network 60 structure shown in Figure 6 is obtained.

図５（ａ）には、第２の訓練データ７４Ｒ２を示す。第２の訓練データ７４Ｒ２から非腫瘍領域訓練データ７５Ｒ２を生成する方法は、第１の訓練データ７４Ｒ１および第１の予備訓練データ７０Ｒ１Ｌに替えて、第２の訓練データ７４Ｒ２および第２の予備訓練データ７０Ｒ２Ｌを使用する点を除き、腫瘍領域訓練データ７５Ｒ１と同様である。非腫瘍領域訓練データ７５Ｒ２に付される第２の予備訓練データ７０Ｒ２Ｌのラベル値に対応するラベル値７７Ｒ２が、図２に示す出力層５０ｂに入力される。 Figure 5(a) shows second training data 74R2. The method for generating non-tumor region training data 75R2 from second training data 74R2 is similar to that for tumor region training data 75R1, except that second training data 74R2 and second preliminary training data 70R2L are used instead of first training data 74R1 and first preliminary training data 70R1L. Label values 77R2 corresponding to the label values of second preliminary training data 70R2L assigned to non-tumor region training data 75R2 are input to output layer 50b shown in Figure 2.

図５（ｂ）には、第３の訓練データ７４Ｒ３を示す。第３の訓練データ７４Ｒ３から非組織領域訓練データ７５Ｒ３を生成する方法は、第１の訓練データ７４Ｒ１および第１の予備訓練データ７０Ｒ１Ｌに替えて、第３の訓練データ７４Ｒ３および第３の予備訓練データ７０Ｒ３Ｌを使用する点を除き、腫瘍領域訓練データ７５Ｒ１と同様である。非組織領域訓練データ７５Ｒ３に付される第３の予備訓練データ７０Ｒ３Ｌのラベル値に対応するラベル値７７Ｒ３が、図３に示す出力層５０ｂに入力される。 Figure 5(b) shows third training data 74R3. The method for generating non-tissue region training data 75R3 from third training data 74R3 is similar to that for tumor region training data 75R1, except that third training data 74R3 and third preliminary training data 70R3L are used instead of first training data 74R1 and first preliminary training data 70R1L. Label value 77R3, which corresponds to the label value of third preliminary training data 70R3L assigned to non-tissue region training data 75R3, is input to output layer 50b shown in Figure 3.

各領域訓練データは、１０枚以上、２０枚以上、５０枚以上、１００枚以上の各領域の訓練用画像を使って生成することが好ましい。 It is preferable that training data for each region be generated using 10 or more, 20 or more, 50 or more, or 100 or more training images for each region.

・画像解析方法の概要図６に示すように、画像解析方法では、解析対象の組織を含む標本を撮像した解析対象画像（明視野画像）７８から、解析用データ８０を生成する。前記標本は、第１の訓練用画像と同じ染色が施されていることが好ましい。解析対象画像７８も、例えば公知の顕微鏡またはバーチャルスライドスキャナ等を用いて、例えばカラー画像として取得することができる。解析対象画像７８は、訓練用画像と同程度または同じ拡大倍率で取得されることが好ましい。解析対象画像（明視野画像）７８は、１以上の原色を含む画像であればよい。カラーの解析対象画像７８を、各画素についてＲ、Ｇ、Ｂ各色の色濃度値で符号化すると、画像全体をＲ、Ｇ、Ｂ毎に各画素における色濃度値の符号化図として表すことができる（解析用色濃度符号化図７９ｒ，７９ｇ，７９ｂ）。解析用色濃度符号化図７９ｒ，７９ｇ，７９ｂから画素毎にＲ、Ｇ、Ｂの色濃度値が組み合わされた解析対象データ（図示せず）が生成される。図６に例示的に示すＲ、Ｇ、Ｂ各色の単一色画像における色濃度の符号を示す色濃度符号化図７９ｒ，７９ｇ，７９ｂは、３原色の各画像７９Ｒ，７９Ｇ，７９Ｂに代えて、値０から値７の８段階の符号で表された色濃度値を表示している。本明細書において、解析対象データの各画素の位置は、便宜上列を左からｌ_１、ｌ_２・・・ｌ_i、行を上からｍ_１、ｍ_２・・・ｍ_ｊで表す。 Overview of Image Analysis Method: As shown in FIG. 6 , in the image analysis method, analysis data 80 is generated from an analysis target image (bright-field image) 78, which is an image of a specimen containing the target tissue. The specimen is preferably stained in the same manner as the first training image. The analysis target image 78 can also be acquired, for example, as a color image using, for example, a known microscope or virtual slide scanner. The analysis target image 78 is preferably acquired at the same magnification as the training image. The analysis target image (bright-field image) 78 may be any image containing one or more primary colors. By encoding the color density values of R, G, and B for each pixel of the color analysis target image 78, the entire image can be represented as an encoded diagram of the color density values for each pixel of R, G, and B (analysis color density encoded diagrams 79r, 79g, and 79b). Analysis target data (not shown) is generated by combining the R, G, and B color density values for each pixel from the analysis color density encoded diagrams 79r, 79g, and 79b. 6, color density coding charts 79r, 79g, and 79b showing the codes of color density in the single-color images of R, G, and B, respectively, display color density values expressed in eight levels of codes from value 0 to value 7, instead of the images 79R, 79G, and 79B of the three primary colors. For convenience, in this specification, the position of each pixel in the data to be analyzed will be represented by _l1 , _l2 , ... _li from left to right for columns and _m1 , _m2 , ... _mj for rows from top to bottom.

解析用データ８０は、解析対象データを所定の画素数の領域（すなわち、上記各領域訓練データと同じウィンドウサイズ）で切り出したデータであり、解析対象画像７８に含まれている組織の色濃度値を含むデータである。図６では、ウィンドウサイズの解析用データ８０も、腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３と同様に、説明の便宜のために３×３画素に簡素化して示すが、実際の好ましいウィンドウサイズは、腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３と同様である。例えば、３×３画素のウィンドウＷ２を設定し、解析対象データに対してウィンドウＷ２を移動させる。ウィンドウＷ２は、解析対象データ中の所定数の画素を含んでおり、解析対象データを、各領域訓練データと同様の方法で例えば３×３画素の黒枠で示すウィンドウＷ２によって切り出すと、ウィンドウサイズの解析用データ８０が得られる。各領域訓練データと同様の方法で、複数の解析用データ８０が、解析対象データから、ウィンドウサイズに対応する所定の画素数の領域毎に生成される。解析対象データ、ウィンドウサイズの解析用データ８０においても、腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３と同様に、各画素について、色濃度値が赤（Ｒ）、緑（Ｇ）および青（Ｂ）の順序で格納されている。 The analysis data 80 is data obtained by cutting the analysis target data into a region of a predetermined number of pixels (i.e., the same window size as the above-mentioned region training data), and includes the color density values of the tissues contained in the analysis target image 78. In Figure 6, the window-sized analysis data 80 is simplified to 3 x 3 pixels for ease of explanation, as with the tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3. However, the actual preferred window size is the same as that of the tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3. For example, a 3 x 3 pixel window W2 is set and moved relative to the analysis target data. Window W2 includes a predetermined number of pixels in the analysis target data. By cutting the analysis target data using the window W2, for example, indicated by a 3 x 3 pixel black frame, in the same manner as with the region training data, the window-sized analysis data 80 is obtained. In a similar manner to the region training data, multiple analysis data 80 are generated from the analysis target data for each region of a predetermined number of pixels corresponding to the window size. In the analysis target data and window size analysis data 80, color density values for each pixel are stored in the order of red (R), green (G), and blue (B), just like in tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3.

画像解析方法では、図１から図３に示すウィンドウサイズの腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３を用いて学習させたニューラルネットワークを有する深層学習アルゴリズム６０を用いて、解析用データ８０を処理する。解析用データ８０を処理することによって、解析対象の組織における解析用データに含まれる領域が腫瘍領域であるか否かを示すデータ８３を生成する。 In the image analysis method, analysis data 80 is processed using a deep learning algorithm 60 having a neural network trained using tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3, each having the window sizes shown in Figures 1 to 3. By processing the analysis data 80, data 83 is generated that indicates whether an area included in the analysis data in the tissue to be analyzed is a tumor region.

再び図６を参照し、Ｒ、Ｇ、Ｂ各色の色濃度符号化図７９ｒ，７９ｇ，７９ｂに基づき生成された解析対象データから切り出された解析用データ８０が深層学習アルゴリズムを構成するニューラルネットワーク６０に入力される。ニューラルネットワーク６０の入力層６０ａのノード数は、入力される画素数と画像に含まれる原色の数との積に対応している。解析用データ８０の各画素の色濃度値データ８１を、ニューラルネットワーク６０に入力すると、出力層６０ｂからは、解析用データ８０の推定値８２（３値）が出力される。例えば推定値が１の場合は解析用データ８０を構成する所定画素数の領域（画素領域）が腫瘍領域を含むことを示し、推定値が２の場合は前記画素領域が非腫瘍領域を含むことを示し、推定値が０の場合は前記画素領域が非組織領域であることを示す。すなわち、ニューラルネットワーク６０の出力層６０ｂから出力される推定値８２は、解析対象画像の前記画素領域毎に生成されるラベル値であり、解析対象画像における解析用データに含まれる領域が腫瘍領域であるか否かを示すデータである。推定値８２は、ニューラルネットワークに関する後述する説明ではクラスとも呼ばれる。ニューラルネットワーク６０は、入力された解析用データ８０に対し、解析用データ８０に含まれる所定数の画素の色濃度値データ８１に基づいて、腫瘍領域を含むことを示すラベル値、非腫瘍領域を含むことを示すラベル値、または非組織領域を示すラベル値を生成する。言い替えると、ニューラルネットワーク６０は、解析用データ８０を、解析対象画像に含まれる注目画素領域の状態（腫瘍化しているか、腫瘍化していないか、組織ではないか）を示すクラスに分類する。ここで、各画素の色濃度値データ８１は、解析用データ８０の各画素のＲ、Ｇ、Ｂ各色の色濃度値の集合データである。 Referring back to Figure 6, analysis data 80 extracted from the analysis target data generated based on the R, G, and B color density coding maps 79r, 79g, and 79b are input to a neural network 60, which constitutes a deep learning algorithm. The number of nodes in the input layer 60a of the neural network 60 corresponds to the product of the number of input pixels and the number of primary colors contained in the image. When color density value data 81 for each pixel of the analysis data 80 is input to the neural network 60, an estimated value 82 (ternary value) of the analysis data 80 is output from the output layer 60b. For example, an estimated value of 1 indicates that a region of a predetermined number of pixels (pixel region) constituting the analysis data 80 contains a tumor region; an estimated value of 2 indicates that the pixel region contains a non-tumor region; and an estimated value of 0 indicates that the pixel region is a non-tissue region. In other words, the estimated value 82 output from the output layer 60b of the neural network 60 is a label value generated for each pixel region of the analysis target image and indicates whether the region of the analysis target image contained in the analysis data is a tumor region. The estimated values 82 are also referred to as classes in the description of neural networks below. For the input analysis data 80, the neural network 60 generates a label value indicating the inclusion of a tumorous region, a label value indicating the inclusion of a non-tumorous region, or a label value indicating a non-tissue region based on the color density value data 81 of a predetermined number of pixels contained in the analysis data 80. In other words, the neural network 60 classifies the analysis data 80 into classes indicating the state of the pixel region of interest contained in the image to be analyzed (tumorous, non-tumorous, or non-tissue). Here, the color density value data 81 of each pixel is a collection of color density values of the R, G, and B colors of each pixel in the analysis data 80.

以後、ウィンドウＷ２が解析対象データの全ての画素を走査するように、ウィンドウＷ２を所定画素数の領域単位で移動させながら、解析用データ８０をウィンドウサイズで切り出す。切り出された解析用データ８０を、ニューラルネットワーク６０に入力する。これにより、解析対象画像における組織の解析用データに含まれる領域が腫瘍領域であるか否かを示すデータに基づいて、ラベル値８３を得る。図６に示す例では、ラベル値８３についてさらに領域検出処理を行うことにより、領域を示す領域強調画像８４を得る。領域検出処理は、具体的には、例えば推定値８２に応じて各領域を検出する処理となり、実際に推定値８２が「１」の画素領域を腫瘍領域と、推定値８２が「２」の画素領域を非腫瘍領域と、推定値８２が０の画素領域を非組織領域と判別する処理となる。領域強調画像８４は、画像解析処理により検出された領域を、ラベル値８３を色（腫瘍領域を白、非腫瘍領域をグレー、非組織領域を黒）で表した図である。また、各領域を判別した後に、腫瘍領域と、それ以外の領域（すなわち、非腫瘍領域および／又は非組織領域）とを識別可能に表示装置に表示させる処理を行ってもよい。例えば、腫瘍領域を色で塗りつぶす、腫瘍領域とそれ以外の領域との間に線を描画する等の処理を行い、これらを表示装置に識別可能に表示する。また、非腫瘍領域および／又は非組織領域についても識別可能に表示装置に表示させる処理を行ってもよい。 Subsequently, the analysis data 80 is extracted by the window size while moving the window W2 in units of a predetermined number of regions so that the window W2 scans all pixels of the analysis target data. The extracted analysis data 80 is input to the neural network 60. This obtains a label value 83 based on data indicating whether a region included in the analysis data of tissue in the analysis target image is a tumor region. In the example shown in Figure 6, a region-enhanced image 84 indicating the region is obtained by further performing region detection processing on the label value 83. Specifically, the region detection processing is a process of detecting each region according to the estimated value 82, for example, and actually classifying pixel regions with an estimated value 82 of "1" as tumor regions, pixel regions with an estimated value 82 of "2" as non-tumor regions, and pixel regions with an estimated value 82 of "0" as non-tissue regions. The region-enhanced image 84 is a diagram of the regions detected by the image analysis processing, with the label value 83 represented by color (tumor regions in white, non-tumor regions in gray, and non-tissue regions in black). Furthermore, after each region is identified, a process may be performed to display the tumor region and other regions (i.e., non-tumor regions and/or non-tissue regions) in a distinguishable manner on the display device. For example, the tumor region may be filled in with color, or a line may be drawn between the tumor region and other regions, and these may be displayed in a distinguishable manner on the display device. A process may also be performed to display the non-tumor region and/or non-tissue region in a distinguishable manner on the display device.

＜第１の実施形態＞
第１の実施形態では、上述の概要で説明した深層学習方法および画像解析方法を実施するシステムの構成について、具体的に説明する。 First Embodiment
In the first embodiment, a specific description will be given of the configuration of a system that implements the deep learning method and image analysis method outlined above.

［構成の概要］
図７を参照すると、第１の実施形態に係る画像解析システムは、深層学習装置１００Ａと、画像解析装置２００Ａとを備える。ベンダ側装置１００は深層学習装置１００Ａとして動作し、ユーザ側装置２００は画像解析装置２００Ａとして動作する。深層学習装置１００Ａは、ニューラルネットワーク５０に訓練データを使って学習させ、訓練データによって訓練された深層学習アルゴリズム６０をユーザに提供する。学習済みのニューラルネットワーク６０から構成される深層学習アルゴリズムは、記憶媒体９８またはネットワーク９９を通じて、深層学習装置１００Ａから画像解析装置２００Ａに提供される。画像解析装置２００Ａは、学習済みのニューラルネットワーク６０から構成される深層学習アルゴリズムを用いて解析対象の画像の解析を行う。 [Configuration Overview]
Referring to FIG. 7 , the image analysis system according to the first embodiment includes a deep learning device 100A and an image analysis device 200A. The vendor-side device 100 operates as the deep learning device 100A, and the user-side device 200 operates as the image analysis device 200A. The deep learning device 100A trains a neural network 50 using training data and provides the deep learning algorithm 60 trained by the training data to the user. The deep learning algorithm configured from the trained neural network 60 is provided from the deep learning device 100A to the image analysis device 200A via a storage medium 98 or a network 99. The image analysis device 200A analyzes the image to be analyzed using the deep learning algorithm configured from the trained neural network 60.

深層学習装置１００Ａは、例えば汎用コンピュータで構成されており、後述するフローチャートに基づいて、深層学習処理を行う。画像解析装置２００Ａは、例えば汎用コンピュータで構成されており、後述するフローチャートに基づいて、画像解析処理を行う。記憶媒体９８は、例えばＤＶＤ－ＲＯＭやＵＳＢメモリ等の、コンピュータ読み取り可能であって非一時的な有形の記憶媒体である。 Deep learning device 100A is configured, for example, by a general-purpose computer, and performs deep learning processing based on the flowchart described below. Image analysis device 200A is configured, for example, by a general-purpose computer, and performs image analysis processing based on the flowchart described below. Storage medium 98 is a computer-readable, non-transitory, tangible storage medium, such as a DVD-ROM or USB memory.

深層学習装置１００Ａは撮像装置３００に接続されている。撮像装置３００は、撮像素子３０１と、蛍光顕微鏡３０２とを備え、ステージ３０９上にセットされた学習用の標本３０８の、明視野画像および蛍光画像を撮像する。学習用の標本３０８は、上述の染色が施されている。深層学習装置１００Ａは、撮像装置３００によって撮像された訓練用画像を取得する。 The deep learning device 100A is connected to the imaging device 300. The imaging device 300 includes an image sensor 301 and a fluorescence microscope 302, and captures bright-field images and fluorescent images of a training specimen 308 set on a stage 309. The training specimen 308 has been stained as described above. The deep learning device 100A acquires the training images captured by the imaging device 300.

画像解析装置２００Ａは撮像装置４００に接続されている。撮像装置４００は、撮像素子４０１と、蛍光顕微鏡４０２とを備え、ステージ４０９上にセットされた解析対象の標本４０８の、明視野画像を撮像する。解析対象の標本４０８は、上述の通り予め染色されている。画像解析装置２００Ａは、撮像装置４００によって撮像された解析対象画像７８を取得する。 The image analysis device 200A is connected to the imaging device 400. The imaging device 400 includes an imaging element 401 and a fluorescence microscope 402, and captures a bright-field image of a specimen 408 to be analyzed that is set on a stage 409. The specimen 408 to be analyzed has been stained in advance as described above. The image analysis device 200A acquires the image 78 to be analyzed captured by the imaging device 400.

撮像装置３００，４００には、標本を撮像する機能を有する、公知の蛍光顕微鏡またはバーチャルスライドスキャナ等を用いることができる。撮像装置４００は、標本を撮像する機能を有する限り、光学顕微鏡であっても良い。 The imaging devices 300 and 400 can be well-known fluorescent microscopes or virtual slide scanners that have the function of capturing images of specimens. The imaging device 400 can also be an optical microscope, as long as it has the function of capturing images of specimens.

［ハードウェア構成］
図８を参照すると、ベンダ側装置１００（１００Ａ，１００Ｂ）は、処理部１０（１０Ａ，１０Ｂ）と、入力部１６と、出力部１７とを備える。 [Hardware configuration]
Referring to FIG. 8, the vendor-side device 100 (100A, 100B) includes a processing unit 10 (10A, 10B), an input unit 16, and an output unit 17.

処理部１０は、後述するデータ処理を行うＣＰＵ（Central Processing Unit）１１と、データ処理の作業領域に使用するメモリ１２と、後述するプログラムおよび処理データを記憶する記憶部１３と、各部の間でデータを伝送するバス１４と、外部機器とのデータの入出力を行うインタフェース部１５と、ＧＰＵ（Graphics Processing Unit）１９とを備えている。入力部１６および出力部１７は、処理部１０に接続されている。例示的には、入力部１６はキーボードまたはマウス等の入力装置であり、出力部１７は液晶ディスプレイ等の表示装置である。ＧＰＵ１９は、ＣＰＵ１１が行う演算処理（例えば、並列演算処理）を補助するアクセラレータとして機能する。すなわち以下の説明においてＣＰＵ１１が行う処理とは、ＣＰＵ１１がＧＰＵ１９をアクセラレータとして用いて行う処理も含むことを意味する。 The processing unit 10 includes a CPU (Central Processing Unit) 11 that performs data processing (described below), a memory 12 used as a work area for data processing, a storage unit 13 that stores programs and processing data (described below), a bus 14 that transmits data between each unit, an interface unit 15 that inputs and outputs data to and from external devices, and a GPU (Graphics Processing Unit) 19. An input unit 16 and an output unit 17 are connected to the processing unit 10. Illustratively, the input unit 16 is an input device such as a keyboard or mouse, and the output unit 17 is a display device such as an LCD display. The GPU 19 functions as an accelerator that assists the arithmetic processing (e.g., parallel arithmetic processing) performed by the CPU 11. In other words, in the following description, processing performed by the CPU 11 also includes processing performed by the CPU 11 using the GPU 19 as an accelerator.

また、処理部１０は、以下の図１０で説明する各ステップの処理を行うために、本発明に係るプログラムおよび学習前のニューラルネットワーク５０を、例えば実行形式で記憶部１３に予め記憶している。実行形式は、例えばプログラミング言語からコンパイラにより変換されて生成される形式である。処理部１０は、記憶部１３に記憶したプログラムおよび学習前のニューラルネットワーク５０を使用して処理を行う。 In order to perform the processing of each step described below in FIG. 10, the processing unit 10 pre-stores the program according to the present invention and the pre-trained neural network 50 in, for example, an executable format in the storage unit 13. The executable format is, for example, a format generated by conversion from a programming language using a compiler. The processing unit 10 performs processing using the program and pre-trained neural network 50 stored in the storage unit 13.

以下の説明においては、特に断らない限り、処理部１０が行う処理は、記憶部１３またはメモリ１２に格納されたプログラムおよびニューラルネットワーク５０に基づいて、ＣＰＵ１１が行う処理を意味する。ＣＰＵ１１はメモリ１２を作業領域として必要なデータ（処理途中の中間データ等）を一時記憶し、記憶部１３に演算結果等の長期保存するデータを適宜記憶する。 In the following description, unless otherwise specified, the processing performed by the processing unit 10 refers to the processing performed by the CPU 11 based on the programs and neural network 50 stored in the storage unit 13 or memory 12. The CPU 11 uses the memory 12 as a working area to temporarily store necessary data (such as intermediate data during processing), and stores calculation results and other data to be stored long-term in the storage unit 13 as appropriate.

図９を参照すると、ユーザ側装置２００（２００Ａ，２００Ｂ，２００Ｃ）は、処理部２０（２０Ａ，２０Ｂ，２０Ｃ）と、入力部２６と、出力部２７とを備える。 Referring to FIG. 9, the user device 200 (200A, 200B, 200C) includes a processing unit 20 (20A, 20B, 20C), an input unit 26, and an output unit 27.

処理部２０は、後述するデータ処理を行うＣＰＵ（Central Processing Unit）２１と、データ処理の作業領域に使用するメモリ２２と、後述するプログラムおよび処理データを記憶する記憶部２３と、各部の間でデータを伝送するバス２４と、外部機器とのデータの入出力を行うインタフェース部２５と、ＧＰＵ（Graphics Processing Unit）２９とを備えている。入力部２６および出力部２７は、処理部２０に接続されている。例示的には、入力部２６はキーボードまたはマウス等の入力装置であり、出力部２７は液晶ディスプレイ等の表示装置である。ＧＰＵ２９は、ＣＰＵ２１が行う演算処理（例えば、並列演算処理）を補助するアクセラレータとして機能する。すなわち以下の説明においてＣＰＵ２１が行う処理とは、ＣＰＵ２１がＧＰＵ２９をアクセラレータとして用いて行う処理も含むことを意味する。 The processing unit 20 includes a CPU (Central Processing Unit) 21 that performs data processing (described below), a memory 22 used as a work area for data processing, a storage unit 23 that stores programs and processing data (described below), a bus 24 that transmits data between each unit, an interface unit 25 that inputs and outputs data to and from external devices, and a GPU (Graphics Processing Unit) 29. An input unit 26 and an output unit 27 are connected to the processing unit 20. Illustratively, the input unit 26 is an input device such as a keyboard or mouse, and the output unit 27 is a display device such as an LCD display. The GPU 29 functions as an accelerator that assists the arithmetic processing (e.g., parallel arithmetic processing) performed by the CPU 21. In other words, in the following description, processing performed by the CPU 21 also includes processing performed by the CPU 21 using the GPU 29 as an accelerator.

また、処理部２０は、以下の図１４で説明する各ステップの処理を行うために、本発明に係るプログラムおよび学習済みのニューラルネットワーク構造の深層学習アルゴリズム６０を、例えば実行形式で記憶部２３に予め記憶している。実行形式は、例えばプログラミング言語からコンパイラにより変換されて生成される形式である。処理部２０は、記憶部２３に記憶したプログラムおよび深層学習アルゴリズム６０を使用して処理を行う。 In addition, in order to perform the processing of each step described below in Figure 14, the processing unit 20 pre-stores the program of the present invention and a deep learning algorithm 60 with a trained neural network structure in the memory unit 23, for example, in executable format. The executable format is, for example, a format generated by conversion from a programming language using a compiler. The processing unit 20 performs processing using the program and deep learning algorithm 60 stored in the memory unit 23.

以下の説明においては、特に断らない限り、処理部２０が行う処理は、記憶部２３またはメモリ２２に格納されたプログラムおよび深層学習アルゴリズム６０に基づいて、実際には処理部２０のＣＰＵ２１が行う処理を意味する。ＣＰＵ２１はメモリ２２を作業領域として必要なデータ（処理途中の中間データ等）を一時記憶し、記憶部２３に演算結果等の長期保存するデータを適宜記憶する。 In the following description, unless otherwise specified, the processing performed by the processing unit 20 actually means processing performed by the CPU 21 of the processing unit 20 based on the program and deep learning algorithm 60 stored in the storage unit 23 or memory 22. The CPU 21 uses the memory 22 as a working area to temporarily store necessary data (such as intermediate data during processing), and stores calculation results and other data to be stored long-term in the storage unit 23 as appropriate.

［機能ブロックおよび処理手順］
・深層学習処理
図１０を参照すると、第１の実施形態に係る深層学習装置１００Ａの処理部１０Ａは、訓練データ生成部１０１と、訓練データ入力部１０２と、アルゴリズム更新部１０３とを備える。これらの機能ブロックは、コンピュータに深層学習処理を実行させるプログラムを、処理部１０Ａの記憶部１３またはメモリ１２にインストールし、このプログラムをＣＰＵ１１が実行することにより実現される。ウィンドウサイズデータベース１０４と、アルゴリズムデータベース１０５とは、処理部１０Ａの記憶部１３またはメモリ１２に記憶される。 [Function blocks and processing procedures]
10 , the processing unit 10A of the deep learning device 100A according to the first embodiment includes a training data generation unit 101, a training data input unit 102, and an algorithm update unit 103. These functional blocks are implemented by installing a program that causes a computer to execute deep learning processing in the storage unit 13 or memory 12 of the processing unit 10A and executing this program by the CPU 11. A window size database 104 and an algorithm database 105 are stored in the storage unit 13 or memory 12 of the processing unit 10A.

学習用の標本の第１の訓練用画像７０Ｒ１、第２の訓練用画像７０Ｒ２、第３の訓練用画像７０Ｒ３は、撮像装置３００によって予め撮像され、処理部１０Ａの記憶部１３またはメモリ１２に予め記憶されていることとする。ニューラルネットワーク５０は、例えば解析対象の標本が由来する組織の種別（例えば組織名）と対応付けられて、アルゴリズムデータベース１０５に予め格納されている。 The first training image 70R1, second training image 70R2, and third training image 70R3 of the learning sample are assumed to have been captured in advance by the imaging device 300 and stored in advance in the storage unit 13 or memory 12 of the processing unit 10A. The neural network 50 is pre-stored in the algorithm database 105, for example, in association with the type of tissue (e.g., tissue name) from which the sample to be analyzed is derived.

深層学習装置１００Ａの処理部１０Ａは、図１１に示す処理を行う。図１０に示す各機能ブロックを用いて説明すると、ステップＳ１０からＳ１２、Ｓ１７およびＳ１８の処理は、訓練データ生成部１０１が行う。ステップＳ１４の処理は、訓練データ入力部１０２が行う。ステップＳ１５およびＳ１６の処理は、アルゴリズム更新部１０３が行う。 The processing unit 10A of the deep learning device 100A performs the processing shown in Figure 11. Explaining this using the functional blocks shown in Figure 10, the processing of steps S10 to S12, S17, and S18 is performed by the training data generation unit 101. The processing of step S14 is performed by the training data input unit 102. The processing of steps S15 and S16 is performed by the algorithm update unit 103.

以下に説明するステップＳ１０からＳ１８では、第１の訓練用画像７０Ｒ１と、第２の訓練用画像７０Ｒ２と、第３の訓練用画像７０Ｒ３とを用いた深層学習処理を説明する。 Steps S10 to S18 described below describe deep learning processing using the first training image 70R1, the second training image 70R2, and the third training image 70R3.

処理部１０Ａは、深層学習方法の概要に記載の方法に準じて、腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３を生成するための深層学習処理を行う。処理部１０Ａは、例えば、病理医等による入力部１６からの操作により、第１の訓練用画像７０Ｒ１の領域を含む広い領域の画像（ホールスライドイメージ７０Ｗ１）を、出力部１７に表示する。判定を行う病理医等は、出力部１７に表示されたホールスライドイメージ７０Ｗ１の画像を目視で確認する。病理医等は、腫瘍領域を含むと判定したホールスライドイメージ７０Ｗ１中の領域を、例えば入力部１６を介して指定し、ホールスライドイメージ７０Ｗ１に例えば赤色等の実線で囲む。病理医等は、非腫瘍領域および非組織領域についても、同様に、ホールスライドイメージ７０Ｗ１中の領域について、例えば赤色とは異なる青色、緑色等の実線で囲む。処理部１０Ａは、ホールスライドイメージ７０Ｗ１を出力部１７に表示して病理医により判断してもらうことに代えて、判定済みのホールスライドイメージ７０Ｗ２を、Ｉ／Ｆ部１５を介して例えばネットワーク９９を介して取得してもよい。病理医等は、訓練用画像として好ましい画素にホールスライドイメージ７０Ｗ２を分割してもよい。 Processing unit 10A performs deep learning processing to generate tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3 in accordance with the method described in the Overview of Deep Learning Methods. Processing unit 10A displays an image of a large region (whole slide image 70W1) including the region of first training image 70R1 on output unit 17, for example, in response to operation by a pathologist or other user via input unit 16. The pathologist or other user making the assessment visually confirms the image of whole slide image 70W1 displayed on output unit 17. The pathologist or other user specifies, via input unit 16, an area in whole slide image 70W1 determined to contain a tumor region, and encircles the area in whole slide image 70W1 with a solid line, for example, red. Similarly, the pathologist or other user encircles the non-tumor region and non-tissue region in whole slide image 70W1 with a solid line, for example, blue or green, different from red. Instead of displaying the whole slide image 70W1 on the output unit 17 and having the pathologist make a judgment, the processing unit 10A may acquire the judged whole slide image 70W2 via the I/F unit 15, for example, via the network 99. The pathologist or the like may divide the whole slide image 70W2 into pixels that are suitable for use as a training image.

処理部１０Ａは、病理医等の入力部１６からの指定により、ホールスライドイメージ７０Ｗ２中に赤色の実線で囲まれている領域から取得された第１の訓練用画像７０Ｒ１を所定の画素数を含むように切り出す。同様に、非腫瘍領域および非組織領域と判定された部分からも、第２の訓練用画像７０Ｒ２、第３の訓練用画像７０Ｒ３のそれぞれを、所定の画素の領域を含むように切り出す。 The processing unit 10A, based on instructions from the pathologist or other person using the input unit 16, cuts out a first training image 70R1 obtained from the area surrounded by a solid red line in the whole slide image 70W2, so that it contains a predetermined number of pixels. Similarly, it cuts out a second training image 70R2 and a third training image 70R3 from the areas determined to be non-tumorous and non-tissue regions, so that they each contain a predetermined number of pixel areas.

ステップＳ１０において、処理部１０Ａは、切り出された第１の訓練用画像７０Ｒ１からＲ、Ｇ、Ｂ各色の色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂを生成する。色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂは、第１の訓練用画像７０Ｒ１の各画素のＲ、Ｇ、Ｂ各色の色濃度値を段階的に表す符号を各画素に付すことにより生成する。本実施形態では、色濃度値を値０から値２５５の２５６階調で表すとして各Ｒ、Ｇ、Ｂ階調画像について色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂを生成する。色濃度値の割り当ては、例えば最も低い明るさを色濃度値０とし、明るさの程度が高くなるに従い徐々に高い値を割り当ててゆき、最も高い明るさを色濃度値２５５とする。処理部１０Ａは、色濃度符号化図７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂから画素毎にＲ、Ｇ，Ｂの色濃度値を付した第１の訓練データ７４Ｒ１を生成する。処理部１０Ａは、第１の訓練用画像７０Ｒ１の各画素に腫瘍領域であることを示すラベル値である「１」を対応させた第１の予備訓練データ７０Ｒ１Ｌを生成する。 In step S10, the processing unit 10A generates color density coding maps 72R1r, 72R1g, and 72R1b for each of the R, G, and B colors from the extracted first training image 70R1. The color density coding maps 72R1r, 72R1g, and 72R1b are generated by assigning to each pixel of the first training image 70R1 a code that represents the color density value of each of the R, G, and B colors of that pixel in a stepwise manner. In this embodiment, the color density values are represented in 256 gradations from 0 to 255, and color density coding maps 72R1r, 72R1g, and 72R1b are generated for each of the R, G, and B gradation images. The color density values are assigned, for example, by assigning a color density value of 0 to the lowest brightness and gradually increasing values as the brightness level increases, with the highest brightness being a color density value of 255. The processing unit 10A generates first training data 74R1 by assigning R, G, and B color density values to each pixel from the color density encoding charts 72R1r, 72R1g, and 72R1b. The processing unit 10A generates first preliminary training data 70R1L by associating each pixel of the first training image 70R1 with a label value of "1" indicating a tumorous region.

ステップＳ１１において、処理部１０Ａは、第２の訓練用画像７０Ｒ２からステップＳ１０と同様の方法により第２の訓練データ７４Ｒ２を生成する。処理部１０Ａは、第２の訓練用画像７０Ｒ２の各画素に非腫瘍領域であることを示すラベル値である「２」を対応させた第２の予備訓練データ７０Ｒ２Ｌを生成する。 In step S11, the processing unit 10A generates second training data 74R2 from the second training image 70R2 using a method similar to step S10. The processing unit 10A generates second preliminary training data 70R2L in which each pixel of the second training image 70R2 is assigned a label value of "2," indicating a non-tumorous region.

ステップＳ１２において、処理部１０Ａは、第３の訓練用画像７０Ｒ３からステップＳ１０と同様の方法により第３の訓練データ７４Ｒ３を生成する。処理部１０Ａは、第３の訓練用画像７０Ｒ３の各画素に非組織領域であることを示すラベル値である「０」を対応させた第３の予備訓練データ７０Ｒ３Ｌを生成する。 In step S12, the processing unit 10A generates third training data 74R3 from the third training image 70R3 using a method similar to step S10. The processing unit 10A generates third preliminary training data 70R3L in which each pixel of the third training image 70R3 is assigned a label value of "0," indicating a non-tissue region.

ステップＳ１０からステップＳ１２は同時に行ってもよく、順不同であってもよい。 Steps S10 to S12 may be performed simultaneously or in any order.

ステップＳ１３において、処理部１０Ａは、入力部１６を通じて、深層学習装置１００Ａ側のユーザからの、学習用の組織の種別の入力を受け付ける。処理部１０Ａは、入力された組織の種別に基づき、ウィンドウサイズデータベース１０４（ウィンドウサイズＤＢ１０４）を参照して、ウィンドウサイズを設定し、アルゴリズムデータベース１０５（アルゴリズムＤＢ１０５）を参照して、学習に用いるニューラルネットワーク５０を設定する。ウィンドウサイズは例えば２００×２００画素とする。ウィンドウサイズは、１回の入力時にニューラルネットワーク５０に入力する訓練データの単位であり、ウィンドウサイズの各領域に対応する訓練データの画素数と画像に含まれる色の原色の数との積が、入力層５０ａのノード数に対応している。ウィンドウサイズは組織の種別と対応付けられて、ウィンドウサイズデータベース１０４内に予め記憶されている。 In step S13, processing unit 10A receives input of the type of tissue to be used for learning from the user of deep learning device 100A via input unit 16. Based on the input type of tissue, processing unit 10A sets the window size by referencing window size database 104 (window size DB104) and sets the neural network 50 to be used for learning by referencing algorithm database 105 (algorithm DB105). The window size is, for example, 200 x 200 pixels. The window size is the unit of training data input to the neural network 50 in one input, and the product of the number of pixels of the training data corresponding to each region of the window size and the number of primary colors included in the image corresponds to the number of nodes in the input layer 50a. The window size is associated with the type of tissue and pre-stored in window size database 104.

ステップＳ１４において、処理部１０Ａは、各領域に対応する訓練データから、ウィンドウサイズの腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３を生成する。具体的には、上述の「深層学習方法の概要」において、図４（ａ）、（ｂ）を参照して説明したように、第１の訓練データ７４Ｒ１および第１の予備訓練データ７０Ｒ１Ｌから、ウィンドウＷ１によってウィンドウサイズの腫瘍領域訓練データ７５Ｒ１を作成する。処理部１０Ａは、第２の訓練データ７４Ｒ２および第２の予備訓練データ７０Ｒ２Ｌから、ウィンドウサイズの非腫瘍領域訓練データ７５Ｒ２を生成する。具体的には、上述の「深層学習方法の概要」において、図４（ａ）、（ｂ）および図５（ａ）を参照して説明したように、第２の訓練データ７４Ｒ２および第２の予備訓練データ７０Ｒ２Ｌから、ウィンドウＷ１によってウィンドウサイズの非腫瘍領域訓練データ７５Ｒ２を作成する。処理部１０Ａは、第３の訓練データ７４Ｒ３および第３の予備訓練データ７０Ｒ３Ｌから、ウィンドウサイズの非組織領域訓練データ７５Ｒ３を生成する。具体的には、上述の「深層学習方法の概要」において、図４（ａ）、（ｂ）および図５（ｂ）を参照して説明したように、第３の訓練データ７４Ｒ３および第３の予備訓練データ７０Ｒ３Ｌから、ウィンドウＷ１によってウィンドウサイズの非組織領域訓練データ７５Ｒ３を作成する。 In step S14, processing unit 10A generates window-sized tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3 from the training data corresponding to each region. Specifically, as described above in the "Outline of the Deep Learning Method" with reference to Figures 4(a) and (b), processing unit 10A generates window-sized tumor region training data 75R1 using window W1 from first training data 74R1 and first preliminary training data 70R1L. Processing unit 10A generates window-sized non-tumor region training data 75R2 using second training data 74R2 and second preliminary training data 70R2L. Specifically, as described above in the "Outline of the Deep Learning Method" with reference to Figures 4(a), (b), and 5(a), processing unit 10A generates window-sized non-tumor region training data 75R2 using window W1 from second training data 74R2 and second preliminary training data 70R2L. Processing unit 10A generates window-sized non-tissue region training data 75R3 from third training data 74R3 and third preliminary training data 70R3L. Specifically, as described above in the "Overview of Deep Learning Method" with reference to Figures 4(a), (b), and 5(b), processing unit 10A creates window-sized non-tissue region training data 75R3 using window W1 from third training data 74R3 and third preliminary training data 70R3L.

図１１に示すステップＳ１５において、処理部１０Ａは、ウィンドウサイズの腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３を用いて、ニューラルネットワーク５０を学習させる。ニューラルネットワーク５０の学習結果は、ウィンドウサイズの腫瘍領域訓練データ７５Ｒ１、非腫瘍領域訓練データ７５Ｒ２および非組織領域訓練データ７５Ｒ３を用いてニューラルネットワーク５０を学習させる度に蓄積される。 In step S15 shown in FIG. 11, processing unit 10A trains neural network 50 using window-sized tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3. The training results of neural network 50 are accumulated each time neural network 50 is trained using window-sized tumor region training data 75R1, non-tumor region training data 75R2, and non-tissue region training data 75R3.

実施形態に係る画像解析方法では、畳み込みニューラルネットワークを使用しており、確率的勾配降下法を用いるため、ステップＳ１６において、処理部１０Ａは、予め定められた所定の試行回数分の学習結果が蓄積されているか否かを判断する。学習結果が所定の試行回数分蓄積されている場合、処理部１０ＡはステップＳ１７の処理を行い、学習結果が所定の試行回数分蓄積されていない場合、処理部１０ＡはステップＳ１８の処理を行う。 The image analysis method according to the embodiment uses a convolutional neural network and employs the stochastic gradient descent method. Therefore, in step S16, the processing unit 10A determines whether learning results for a predetermined number of trials have been accumulated. If learning results for the predetermined number of trials have been accumulated, the processing unit 10A performs the processing of step S17. If learning results for the predetermined number of trials have not been accumulated, the processing unit 10A performs the processing of step S18.

学習結果が所定の試行回数分蓄積されている場合、ステップＳ１７において、処理部１０Ａは、ステップＳ１５において蓄積しておいた学習結果を用いて、ニューラルネットワーク５０の結合重みｗを更新する。実施形態に係る画像解析方法では、確率的勾配降下法を用いるため、所定の試行回数分の学習結果が蓄積した段階で、ニューラルネットワーク５０の結合重みｗを更新する。結合重みｗを更新する処理は、具体的には、後述の（式１１）および（式１２）に示される、勾配降下法による計算を実施する処理である。 If learning results have been accumulated for a predetermined number of trials, in step S17, processing unit 10A updates the connection weights w of neural network 50 using the learning results accumulated in step S15. Since the image analysis method according to the embodiment uses stochastic gradient descent, the connection weights w of neural network 50 are updated once learning results for a predetermined number of trials have been accumulated. Specifically, the process of updating the connection weights w is a process of performing calculations using the gradient descent method shown in (Equation 11) and (Equation 12) described below.

ステップＳ１７において、処理部１０Ａは、第１の訓練データ７４Ｒ１、第２の訓練データ７４Ｒ２および第３の訓練データ７４Ｒ３のそれぞれについて各領域訓練データ内の規定数の画素を処理したか否かを判断する。各領域訓練データの規定数の画素について、ステップＳ１５からステップＳ１７の一連の処理がなされている場合は、深層学習処理を終了する。ニューラルネットワークの学習は、必ずしも各訓練データ内の全ての画素に対して行う必要は無く、処理部１０Ａは、各訓練データ内の一部の画素、すなわち規定数の画素に対して処理をし学習を行うことができる。規定数の画素は、各領域訓練データ内の全ての画素であってもよい。 In step S17, the processing unit 10A determines whether a specified number of pixels in each area training data for each of the first training data 74R1, second training data 74R2, and third training data 74R3 have been processed. If the series of processes from step S15 to step S17 have been performed for the specified number of pixels in each area training data, the deep learning process is terminated. Neural network learning does not necessarily have to be performed on all pixels in each training data; the processing unit 10A can process and learn only a portion of the pixels in each training data, i.e., a specified number of pixels. The specified number of pixels may be all pixels in each area training data.

各領域訓練データ内の規定数の画素が処理されていない場合は、処理部１０Ａは、ステップＳ１９において、図４（ｂ）に示すように、第１の訓練データ７４Ｒ１内、第２の訓練データ７４Ｒ２内または第３の訓練データ７４Ｒ３内において、ウィンドウの位置を移動させる。その後、処理部１０Ａは、移動後の新たなウィンドウ位置において、ステップＳ１４からステップＳ１６の一連の処理を行う。すなわち、処理部１０Ａは、ステップＳ１４において、移動後の新たなウィンドウ位置において第１の訓練データ７４Ｒ１、第２の訓練データ７４Ｒ２および第３の訓練データ７４Ｒ３のそれぞれについてウィンドウサイズの各領域訓練データ７５Ｒ１，７５Ｒ２，７５Ｒ３を切り出し、各領域を示すラベル値を付す。引き続き、処理部１０Ａは、ステップＳ１５において、新たに切り出したウィンドウサイズの各領域訓練データ７５Ｒ１，７５Ｒ２，７５Ｒ３を用いて、ニューラルネットワーク５０を学習させる。ステップＳ１６において、所定の試行回数分の学習結果が蓄積されている場合は、処理部１０Ａは、ステップＳ１７において、ニューラルネットワーク５０の結合重みｗを更新する。このようなウィンドウサイズ毎のニューラルネットワーク５０の学習を、第１の訓練データ７４Ｒ１、第２の訓練データ７４Ｒ２および第３の訓練データ７４Ｒ３のそれぞれについて規定数の画素に対して行う。 If the specified number of pixels in each region training data has not been processed, in step S19, the processing unit 10A moves the window position within the first training data 74R1, the second training data 74R2, or the third training data 74R3, as shown in FIG. 4(b). The processing unit 10A then performs a series of processes from step S14 to step S16 at the new window position after the movement. That is, in step S14, the processing unit 10A extracts region training data 75R1, 75R2, and 75R3 of the window size for each of the first training data 74R1, the second training data 74R2, and the third training data 74R3 at the new window position after the movement, and assigns a label value indicating each region. Subsequently, in step S15, the processing unit 10A trains the neural network 50 using the region training data 75R1, 75R2, and 75R3 of the newly extracted window size. In step S16, if learning results for a predetermined number of trials have been accumulated, processing unit 10A updates the connection weight w of neural network 50 in step S17. This type of learning of neural network 50 for each window size is performed on a specified number of pixels for each of the first training data 74R1, second training data 74R2, and third training data 74R3.

以上説明した、ステップＳ１０からＳ１９の深層学習処理を、さらに別途取得された第１の訓練用画像７０Ｒ１と、第２の訓練用画像７０Ｒ２と、第３の訓練用画像７０Ｒ３に対して繰り返し行うことにより、ニューラルネットワーク５０の学習の程度を向上させる。これにより、図５に示すニューラルネットワーク構造の深層学習アルゴリズム６０を得る。 The deep learning process of steps S10 to S19 described above is repeated for the first training image 70R1, the second training image 70R2, and the third training image 70R3, which are acquired separately, thereby improving the level of learning of the neural network 50. This results in the deep learning algorithm 60 with the neural network structure shown in Figure 5.

・ニューラルネットワークの構造
図１２（ａ）に示すように、第１の実施形態では、深層学習タイプのニューラルネットワークを用いる。深層学習タイプのニューラルネットワークは、図１２に示すニューラルネットワーク５０のように、入力層５０ａと、出力層５０ｂと、入力層５０ａおよび出力層５０ｂの間の中間層５０ｃとを備え、中間層５０ｃが複数の層で構成されている。中間層５０ｃを構成する層の数は、例えば５層以上とすることができる。 Neural Network Structure As shown in Fig. 12(a), the first embodiment uses a deep learning type neural network. The deep learning type neural network includes an input layer 50a, an output layer 50b, and an intermediate layer 50c between the input layer 50a and the output layer 50b, as in the neural network 50 shown in Fig. 12, and the intermediate layer 50c is composed of multiple layers. The number of layers constituting the intermediate layer 50c can be, for example, five or more.

ニューラルネットワーク５０では、層状に配置された複数のノード８９が、層間において結合されている。これにより、情報が入力側の層５０ａから出力側の層５０ｂに、図中矢印Ｄに示す一方向のみに伝播する。本実施形態では、入力層５０ａのノード数は、入力される画像の画素数すなわち図３（ｃ）に示すウィンドウＷ１の画素数と各画素に含まれる色の原色の数との積に対応している。入力層５０ａに画像の画素データ（色濃度値）を入力することができるので、ユーザは入力画像から特徴量を別途算出することなく、入力画像を入力層５０ａに入力することができる。 In the neural network 50, multiple nodes 89 are arranged in layers and connected between layers. This allows information to propagate in only one direction, as indicated by arrow D in the figure, from the input layer 50a to the output layer 50b. In this embodiment, the number of nodes in the input layer 50a corresponds to the product of the number of pixels in the input image, i.e., the number of pixels in the window W1 shown in Figure 3(c), and the number of primary colors contained in each pixel. Since image pixel data (color density values) can be input to the input layer 50a, the user can input an input image to the input layer 50a without having to separately calculate feature values from the input image.

・各ノードにおける演算
図１２（ｂ）は、各ノードにおける演算を示す模式図である。各ノード８９では、複数の入力を受け取り、１つの出力（ｚ）を計算する。図１２（ｂ）に示す例の場合、ノード８９は４つの入力を受け取る。ノード８９が受け取る総入力（ｕ）は、以下の（式１）で表される。 12(b) is a schematic diagram showing the operations at each node. Each node 89 receives multiple inputs and calculates one output (z). In the example shown in FIG. 12(b), node 89 receives four inputs. The total input (u) received by node 89 is expressed by the following (Equation 1):

各入力には、それぞれ異なる重みが掛けられる。（式１）中、ｂはバイアスと呼ばれる値である。ノードの出力（ｚ）は、（式１）で表される総入力（ｕ）に対する所定の関数ｆの出力となり、以下の（式２）で表される。関数ｆは活性化関数と呼ばれる。 Each input is multiplied by a different weight. In (Equation 1), b is a value called the bias. The output (z) of the node is the output of a predetermined function f for the total input (u) expressed in (Equation 1), and is expressed in (Equation 2) below. The function f is called the activation function.

図１２（ｃ）は、ノード間の演算を示す模式図である。ニューラルネットワーク５０では、（式１）で表される総入力（ｕ）に対して、（式２）で表される結果（ｚ）を出力するノードが層状に並べられている。前の層のノードの出力が、次の層のノードの入力となる。図１２（ｃ）に示す例では、図中左側の層のノード８９ａの出力が、図中右側の層のノード８９ｂの入力となる。右側の層の各ノード８９ｂは、それぞれ、左側の層のノード８９ａからの出力を受け取る。左側の層の各ノード８９ａと右側の層の各ノード８９ｂとの間の各結合には、異なる重みが掛けられる。左側の層の複数のノード８９ａのそれぞれの出力をｘ_１～ｘ_４とすると、右側の層の３つのノード８９ｂのそれぞれに対する入力は、以下の（式３－１）～（式３－３）で表される。 FIG. 12(c) is a schematic diagram showing the operations between nodes. In the neural network 50, nodes that output a result (z) expressed by (Equation 2) in response to a total input (u) expressed by (Equation 1) are arranged in layers. The output of a node in the previous layer becomes the input of a node in the next layer. In the example shown in FIG. 12(c), the output of node 89a in the left layer becomes the input of node 89b in the right layer. Each node 89b in the right layer receives the output from node 89a in the left layer. A different weight is applied to each connection between each node 89a in the left layer and each node 89b in the right layer. If the outputs of each of the multiple nodes 89a in the left layer are _x1 to _x4 , the inputs to each of the three nodes 89b in the right layer are expressed by the following (Equation 3-1) to (Equation 3-3).

これら（式３－１）～（式３－３）を一般化すると、（式３－４）となる。ここで、ｉ＝１，・・・Ｉ、ｊ＝１，・・・Ｊである。 Generalizing these (Equation 3-1) to (Equation 3-3), we get (Equation 3-4), where i = 1, ... I, j = 1, ... J.

（式３－４）を活性化関数に適用すると出力が得られる。出力は以下の（式４）で表される。 Applying (Equation 3-4) to the activation function gives the output, which is expressed as (Equation 4) below.

・活性化関数
実施形態に係る画像解析方法では、活性化関数として、正規化線形関数（rectified linear unit function）を用いる。正規化線形関数は以下の（式５）で表される。 Activation Function In the image analysis method according to the embodiment, a rectified linear unit function is used as the activation function. The rectified linear unit function is expressed by the following (Equation 5).

（式５）は、ｚ＝ｕの線形関数のうち、ｕ＜０の部分をｕ＝０とする関数である。図１２（ｃ）に示す例では、ｊ＝１のノードの出力は、（式５）により、以下の式で表される。 (Equation 5) is a linear function of z = u, where the part of u<0 is set to u=0. In the example shown in Figure 12(c), the output of node j=1 is expressed by (Equation 5) as follows:

・ニューラルネットワークの学習
ニューラルネットワークを用いて表現される関数をｙ（ｘ：ｗ）とおくと、関数ｙ（ｘ：ｗ）は、ニューラルネットワークのパラメータｗを変化させると変化する。入力ｘに対してニューラルネットワークがより好適なパラメータｗを選択するように、関数ｙ（ｘ：ｗ）を調整することを、ニューラルネットワークの学習と呼ぶ。ニューラルネットワークを用いて表現される関数の入力と出力との組が複数与えられているとする。ある入力ｘに対する望ましい出力をｄとすると、入出力の組は、｛（ｘ_１，ｄ_１），（ｘ_２，ｄ_２），・・・，（ｘ_ｎ，ｄ_ｎ）｝と与えられる。（ｘ，ｄ）で表される各組の集合を、訓練データと呼ぶ。具体的には、図３（ｂ）に示す、Ｒ、Ｇ、Ｂ各色の単一色画像における画素毎の色濃度値と真値像のラベルとの組、の集合が、図３（ａ）に示す訓練データである。 Neural Network Training If a function expressed using a neural network is defined as y(x:w), the function y(x:w) changes when the parameter w of the neural network is changed. Adjusting the function y(x:w) so that the neural network selects a parameter w that is more suitable for the input x is called neural network training. Suppose multiple pairs of input and output of a function expressed using a neural network are given. If the desired output for a certain input x is d, the input/output pairs are given as {( _x1 , _d1 ), ( _x2 , _d2 ), ..., ( _xn , _dn )}. The set of pairs represented by (x, d) is called training data. Specifically, the set of pairs of color density values and true value image labels for each pixel in the single-color images of R, G, and B shown in Figure 3(b) is the training data shown in Figure 3(a).

ニューラルネットワークの学習とは、どのような入出力の組（ｘ_ｎ，ｄ_ｎ）に対しても、入力ｘ_ｎを与えたときのニューラルネットワークの出力ｙ（ｘ_ｎ：ｗ）が、出力ｄ_ｎになるべく近づくように重みｗを調整することを意味する。誤差関数（error function）とは、ニューラルネットワークを用いて表現される関数と訓練データとの近さ Learning a neural network means adjusting the weights w so that the output y( _xn :w) of the neural network when given an input _xn is as close as possible to the output _dn for any input/output pair ( _xn , _dn ). The error function is the degree of closeness between the function expressed using a neural network and the training data.

を測る尺度である。誤差関数は損失関数（loss function）とも呼ばれる。実施形態に係る画像解析方法において用いる誤差関数Ｅ（ｗ）は、以下の（式６）で表される。（式６）は交差エントロピー（cross entropy）と呼ばれる。 It is a measure of the error function. The error function is also called a loss function. The error function E(w) used in the image analysis method according to the embodiment is expressed by the following (Equation 6). (Equation 6) is called cross entropy.

（式６）の交差エントロピーの算出方法を説明する。実施形態に係る画像解析方法において用いるニューラルネットワーク５０の出力層５０ｂでは、すなわちニューラルネットワークの最終層では、入力ｘを内容に応じて有限個のクラスに分類するための活性化関数を用いる。活性化関数はソフトマックス関数（softmax function）と呼ばれ、以下の（式７）で表される。なお、出力層５０ｂには、クラス数ｋと同数のノードが並べられているとする。出力層Ｌの各ノードｋ（ｋ＝１，・・・Ｋ）の総入力ｕは、前層Ｌ－１の出力から、ｕ_ｋ ^（Ｌ）で与えられるとする。これにより、出力層のｋ番目のノードの出力は以下の（式７）で表される。 A method for calculating the cross entropy of (Equation 6) will now be described. In the output layer 50b of the neural network 50 used in the image analysis method according to the embodiment, i.e., in the final layer of the neural network, an activation function is used to classify the input x into a finite number of classes according to its content. The activation function is called a softmax function, and is expressed by the following (Equation 7). It is assumed that the output layer 50b has the same number of nodes as the number of classes k. The total input u of each node k (k = 1, ..., K) in the output layer L is given by u _k ^(L) from the output of the previous layer L-1. As a result, the output of the kth node in the output layer is expressed by the following (Equation 7).

（式７）がソフトマックス関数である。（式７）で決まる出力ｙ_１，・・・，ｙ_Ｋの総和は常に１となる。 (Equation 7) is the softmax function. The sum of the outputs y ₁ , . . . , y _K determined by (Equation 7) is always 1.

各クラスをＣ_１，・・・，Ｃ_Ｋと表すと、出力層Ｌのノードｋの出力ｙ_k（すなわちｕ_ｋ ^（Ｌ））は、与えられた入力ｘがクラスＣ_kに属する確率を表す。以下の（式８）を参照されたい。入力ｘは、（式８）で表される確率が最大になるクラスに分類される。 If each class is represented as C ₁ , ..., C _K , the output y _k (i.e., u _k ^(L) ) of node k in the output layer L represents the probability that a given input x belongs to class C _k . See Equation 8 below. The input x is classified into the class that maximizes the probability represented by Equation 8.

ニューラルネットワークの学習では、ニューラルネットワークで表される関数を、各クラスの事後確率（posterior probability）のモデルとみなし、そのような確率モデルの下で、訓練データに対する重みｗの尤度（likelihood）を評価し、尤度を最大化するような重みｗを選択する。 When training a neural network, the function represented by the neural network is considered to be a model of the posterior probability of each class, and under such a probability model, the likelihood of the weight w for the training data is evaluated, and the weight w that maximizes the likelihood is selected.

（式７）のソフトマックス関数による目標出力ｄ_ｎを、出力が正解のクラスである場合のみ１とし、出力がそれ以外の場合は０になるとする。目標出力をｄ_ｎ＝［ｄ_ｎ１，・・・，ｄ_ｎＫ］というベクトル形式で表すと、例えば入力ｘ_ｎの正解クラスがＣ_３である場合、目標出力ｄ_ｎ３のみが１となり、それ以外の目標出力は０となる。このように符号化すると、事後分布（posterior）は以下の（式９）で表される。 The target output _dn by the softmax function in (Equation 7) is set to 1 only if the output is the correct class, and to 0 otherwise. If the target output is expressed in a vector format as _dn = [ _dn1 , ..., _dnK ], for example, if the correct class of the input _xn is _C3 , only the target output _dn3 will be 1, and the other target outputs will be 0. When encoded in this way, the posterior distribution is expressed by the following (Equation 9).

訓練データ｛（ｘ_ｎ，ｄ_ｎ）｝（ｎ＝１，・・・，Ｎ）に対する重みｗの尤度Ｌ（ｗ）は、以下の（式１０）で表される。尤度Ｌ（ｗ）の対数をとり符号を反転すると、（式６）の誤差関数が導出される。 The likelihood L(w) of weight w for training data {(x _n , d _n )} (n=1, ..., N) is expressed by the following (Equation 10): Taking the logarithm of the likelihood L(w) and inverting the sign derives the error function of (Equation 6).

学習は、訓練データを基に計算される誤差関数Ｅ（ｗ）を、ニューラルネットワークのパラメータｗについて最小化することを意味する。実施形態に係る画像解析方法では、誤差関数Ｅ（ｗ）は（式６）で表される。 Learning means minimizing the error function E(w) calculated based on training data with respect to the neural network parameter w. In the image analysis method according to the embodiment, the error function E(w) is expressed by (Equation 6).

誤差関数Ｅ（ｗ）をパラメータｗについて最小化することは、関数Ｅ（ｗ）の局所的な極小点を求めることと同じ意味である。パラメータｗはノード間の結合の重みである。重みｗの極小点は、任意の初期値を出発点として、パラメータｗを繰り返し更新する反復計算によって求められる。このような計算の一例には、勾配降下法（gradient descent method）がある。 Minimizing the error function E(w) with respect to the parameter w is equivalent to finding a local minimum of the function E(w). The parameter w is the weight of the connection between nodes. The minimum of the weight w is found by iterative calculations that start with an arbitrary initial value and repeatedly update the parameter w. An example of such calculations is the gradient descent method.

勾配降下法では、次の（式１１）で表されるベクトルを用いる。 The gradient descent method uses the vector expressed in the following (Equation 11).

勾配降下法では、現在のパラメータｗの値を負の勾配方向（すなわち－∇Ｅ）に移動させる処理を何度も繰り返す。現在の重みをｗ^（ｔ）とし、移動後の重みをｗ^{（ｔ＋１）}とすると、勾配降下法による演算は、以下の（式１２）で表される。値ｔは、パラメータｗを移動させた回数を意味する。 In gradient descent, the process of moving the current value of parameter w in the negative gradient direction (i.e., -∇E) is repeated many times. If the current weight is w ^(t) and the weight after the movement is w ^(t+1) , the calculation using gradient descent is expressed by the following (Equation 12). The value t represents the number of times the parameter w has been moved.

記号
は、パラメータｗの更新量の大きさを決める定数であり、学習係数と呼ばれる。（式１２）で表される演算を繰り返すことにより、値ｔの増加に伴って誤差関数Ｅ（ｗ^（ｔ））が減少し、パラメータｗは極小点に到達する。 symbol
is a constant that determines the magnitude of the update amount of the parameter w, and is called the learning coefficient. By repeating the calculation expressed by (Equation 12), the error function E(w ^(t) ) decreases as the value t increases, and the parameter w reaches a minimum point.

なお、（式１２）による演算は、全ての訓練データ（ｎ＝１，・・・，Ｎ）に対して実施してもよく、一部の訓練データのみに対して実施してもよい。一部の訓練データのみに対して行う勾配降下法は、確率的勾配降下法（stochastic gradient descent）と呼ばれる。実施形態に係る画像解析方法では、確率的勾配降下法を用いる。 Note that the calculation using (Equation 12) may be performed on all training data (n = 1, ..., N), or on only a portion of the training data. Gradient descent performed on only a portion of the training data is called stochastic gradient descent. The image analysis method according to the embodiment uses stochastic gradient descent.

・画像解析処理
図１３を参照すると、第１の実施形態に係る画像解析装置２００Ａの処理部２０Ａは、解析用データ生成部２０１と、解析用データ入力部２０２と、解析部２０３と、領域検出部２０４とを備える。これらの機能ブロックは、本発明に係るコンピュータに画像解析処理を実行させるプログラムを、処理部２０Ａの記憶部２３またはメモリ２２にインストールし、このプログラムをＣＰＵ２１が実行することにより実現される。ウィンドウサイズデータベース１０４と、アルゴリズムデータベース１０５とは、記憶媒体９８またはネットワーク９９を通じて深層学習装置１００Ａから提供され、処理部２０Ａの記憶部２３またはメモリ２２に記憶される。 13 , the processing unit 20A of the image analysis device 200A according to the first embodiment includes an analysis data generation unit 201, an analysis data input unit 202, an analysis unit 203, and an area detection unit 204. These functional blocks are realized by installing a program for causing a computer according to the present invention to execute image analysis processing in the storage unit 23 or memory 22 of the processing unit 20A and executing this program by the CPU 21. The window size database 104 and the algorithm database 105 are provided from the deep learning device 100A via the storage medium 98 or the network 99 and stored in the storage unit 23 or memory 22 of the processing unit 20A.

解析対象の組織の解析対象画像７８は、撮像装置４００によって予め撮像され、処理部２０Ａの記憶部２３またはメモリ２２に予め記憶されていることとする。学習済みの結合重みｗを含む深層学習アルゴリズム６０は、解析対象の組織の標本が由来する組織の種類と対応付けられてアルゴリズムデータベース１０５に格納されており、コンピュータに画像解析処理を実行させるプログラムの一部であるプログラムモジュールとして機能する。すなわち、深層学習アルゴリズム６０は、ＣＰＵおよびメモリを備えるコンピュータにて用いられ、解析対象の組織において各領域を示すデータを出力するという、使用目的に応じた特有の情報の演算または加工を実行するよう、コンピュータを機能させる。具体的には、処理部２０ＡのＣＰＵ２１は、記憶部２３またはメモリ２２に記憶された深層学習アルゴリズム６０に規定されているアルゴリズムに従って、学習済みの結合重みｗに基づくニューラルネットワーク６０の演算を行う。処理部２０ＡのＣＰＵ２１は、入力層６０ａに入力された、解析対象の組織を撮像した解析対象画像７８に対して演算を行い、出力層６０ｂから、解析対象の組織において各領域を示すデータである３値画像８３を出力する。 An analysis target image 78 of the tissue to be analyzed is assumed to have been captured in advance by the imaging device 400 and stored in advance in the storage unit 23 or memory 22 of the processing unit 20A. The deep learning algorithm 60, including the learned connection weights w, is stored in the algorithm database 105 in association with the tissue type from which the tissue sample to be analyzed is derived, and functions as a program module that is part of a program that causes a computer to execute image analysis processing. That is, the deep learning algorithm 60 is used in a computer equipped with a CPU and memory, and causes the computer to perform calculations or processing of specific information according to its intended use, such as outputting data indicating each region in the tissue to be analyzed. Specifically, the CPU 21 of the processing unit 20A performs calculations of the neural network 60 based on the learned connection weights w in accordance with the algorithm specified in the deep learning algorithm 60 stored in the storage unit 23 or memory 22. The CPU 21 of the processing unit 20A performs calculations on the analysis target image 78, which is an image of the tissue to be analyzed and is input to the input layer 60a, and outputs a ternary image 83, which is data indicating each region in the tissue to be analyzed, from the output layer 60b.

図１４を参照すると、画像解析装置２００Ａの処理部２０Ａは、図１３に示す処理を行う。図１３に示す各機能ブロックを用いて説明すると、ステップＳ２１およびＳ２２の処理は、解析用データ生成部２０１が行う。ステップＳ２３，Ｓ２４，Ｓ２６およびＳ２７の処理は、解析用データ入力部２０２が行う。ステップＳ２５およびＳ２８の処理は、解析部２０３が行う。ステップＳ２９の処理は、領域検出部２０４が行う。 Referring to Figure 14, the processing unit 20A of the image analysis device 200A performs the processing shown in Figure 13. Explaining this using the functional blocks shown in Figure 13, the processing of steps S21 and S22 is performed by the analysis data generation unit 201. The processing of steps S23, S24, S26, and S27 is performed by the analysis data input unit 202. The processing of steps S25 and S28 is performed by the analysis unit 203. The processing of step S29 is performed by the area detection unit 204.

ステップＳ２１において、処理部２０Ａは、例えば、ユーザ等が入力部２６から入力した処理開始の指令により、入力された解析対象画像７８からＲ、Ｇ、Ｂ各色の色濃度符号化図７９ｒ，７９ｇ，７９ｂを生成する。色濃度符号化図７９ｒ，７９ｇ，７９ｂから画素毎にＲ、Ｇ、Ｂ各色の色濃度値を組み合わせた解析対象データが生成される。色濃度符号化図７９ｒ，７９ｇ，７９ｂの生成方法は、図１１に示す深層学習処理時におけるステップＳ１０での生成方法と同様である。 In step S21, the processing unit 20A generates color density coded maps 79r, 79g, and 79b for each of the R, G, and B colors from the input analysis target image 78 in response to a command to start processing input, for example, by a user or the like, via the input unit 26. Analysis target data that combines the color density values of each of the R, G, and B colors for each pixel is generated from the color density coded maps 79r, 79g, and 79b. The method for generating the color density coded maps 79r, 79g, and 79b is the same as the method for generating them in step S10 during the deep learning process shown in FIG. 11.

図１４に示すステップＳ２２において、処理部２０Ａは、入力部２６を通じて、解析条件として、画像解析装置２００Ａ側のユーザからの、組織の種別の入力を受け付ける。処理部２０Ａは、入力された組織の種別に基づき、ウィンドウサイズデータベース１０４およびアルゴリズムデータベース１０５を参照して、解析に用いるウィンドウサイズを設定し、解析に用いる深層学習アルゴリズム６０を取得する。ウィンドウサイズは、１回の入力時にニューラルネットワーク６０に入力する解析用データの単位であり、ウィンドウサイズの解析用データ８０の画素数と画像に含まれる色の原色の数との積が、入力層６０ａのノード数に対応している。ウィンドウサイズは組織の種別と対応付けられて、ウィンドウサイズデータベース１０４内に予め記憶されている。ウィンドウサイズは、図６に示すウィンドウＷ２のように、例えば３×３画素である。深層学習アルゴリズム６０も、組織の種別と対応付けられて、図１３に示すアルゴリズムデータベース１０５内に予め記憶されている。 In step S22 shown in FIG. 14, processing unit 20A receives input of tissue type from the user of image analysis device 200A via input unit 26 as an analysis condition. Based on the input tissue type, processing unit 20A references window size database 104 and algorithm database 105 to set a window size to be used for analysis and acquires a deep learning algorithm 60 to be used for analysis. The window size is the unit of analysis data input to neural network 60 at one time, and the product of the number of pixels of analysis data 80 in the window size and the number of primary colors contained in the image corresponds to the number of nodes in input layer 60a. Window sizes are associated with tissue types and pre-stored in window size database 104. The window size is, for example, 3 x 3 pixels, as in window W2 shown in FIG. 6. The deep learning algorithm 60 is also associated with tissue types and pre-stored in algorithm database 105 shown in FIG. 13.

図１４に示すステップＳ２３において、処理部２０Ａは、色濃度符号化図７９ｒ，７９ｇ，７９ｂから画素毎にＲ、Ｇ、Ｂ各色の色濃度値を組み合わせた解析対象データを生成し、さらにウィンドウサイズの解析用データ８０を生成する。 In step S23 shown in FIG. 14, the processing unit 20A generates analysis target data by combining the color density values of R, G, and B for each pixel from the color density encoding diagrams 79r, 79g, and 79b, and further generates analysis data 80 for the window size.

ステップＳ２４において、処理部２０Ａは、図５に示す解析用データ８０を、深層学習アルゴリズム６０に入力する。ウィンドウの初期位置は、深層学習処理時におけるステップＳ１４と同様に、例えばウィンドウ内の３×３画素に位置する画素が、解析対象データの左上角に対応する位置である。処理部２０Ａは、ウィンドウサイズの解析用データ８０に含まれる、３×３画素×３原色の合計２７個の色濃度値のデータ８１を入力層６０ａに入力すると、深層学習アルゴリズム６０は、出力層６０ｂに判別結果８２を出力する。 In step S24, the processing unit 20A inputs the analysis data 80 shown in Figure 5 to the deep learning algorithm 60. As in step S14 during deep learning processing, the initial position of the window is, for example, a position where the pixel located within a 3x3 pixel area within the window corresponds to the upper left corner of the data to be analyzed. When the processing unit 20A inputs data 81 of a total of 27 color density values (3x3 pixels x 3 primary colors) contained in the window-sized analysis data 80 to the input layer 60a, the deep learning algorithm 60 outputs a discrimination result 82 to the output layer 60b.

図１４に示すステップＳ２５において、処理部２０Ａは、図５に示す出力層６０ｂに出力される判別結果８２を記憶する。判別結果８２は、解析対象データの各画素領域についての推定値（０、１、２の３値のいずれか）である。 In step S25 shown in FIG. 14, the processing unit 20A stores the discrimination result 82 output to the output layer 60b shown in FIG. 5. The discrimination result 82 is an estimated value (one of three values: 0, 1, or 2) for each pixel region of the data to be analyzed.

図１４に示すステップＳ２６において、処理部２０Ａは、入力画像内の全ての画素を処理したか否かを判断する。入力画像は、図６に示す色濃度符号化図７９ｒ，７９ｇ，７９ｂであり、解析対象データ内の全ての画素について、図１４に示すステップＳ２３からステップＳ２５の一連の処理がなされている場合は、ステップＳ２８の処理を行う。 In step S26 shown in FIG. 14, the processing unit 20A determines whether all pixels in the input image have been processed. The input image is the color density encoding charts 79r, 79g, and 79b shown in FIG. 6, and if the series of processes from step S23 to step S25 shown in FIG. 14 have been performed for all pixels in the data to be analyzed, the processing unit 20A performs the process of step S28.

解析対象データ内の全ての画素が処理されていない場合は、処理部２０Ａは、ステップＳ２７において、深層学習処理時におけるステップＳ２０と同様に、図５に示す色濃度符号化図７９ｒ，７９ｇ，７９ｂ内において、ウィンドウＷ２を所定の画素数の単位で移動させる。その後、処理部２０Ａは、移動後の新たなウィンドウＷ２の位置において、ステップＳ２３からステップＳ２５の一連の処理を行う。処理部２０Ａは、ステップＳ２５において、移動後の新たなウィンドウ位置に対応する、判別結果８２を記憶する。このようなウィンドウサイズ毎の判別結果８２の記憶を、解析対象画像内の全ての画素に対して行うことにより、解析結果の３値画像８３が得られる。解析結果の３値画像８３の画像サイズは、解析対象画像の画像サイズと同じである。ここで、３値画像８３には、推定値の値２、値１および値０が各画素に付された数値データであってもよく、推定値の値２、値１および値０に代えて、例えば値２、値１および値０のそれぞれに対応付けた表示色で示した画像であっても良い。 If all pixels in the analysis target data have not been processed, in step S27, the processing unit 20A moves the window W2 within the color density encoding diagrams 79r, 79g, and 79b shown in FIG. 5 by a predetermined number of pixels, similar to step S20 during deep learning processing. The processing unit 20A then performs a series of processes from step S23 to step S25 at the new position of the moved window W2. In step S25, the processing unit 20A stores the discrimination result 82 corresponding to the new window position after the movement. By storing the discrimination result 82 for each window size in this manner for all pixels in the analysis target image, a ternary image 83 of the analysis result is obtained. The image size of the ternary image 83 of the analysis result is the same as the image size of the analysis target image. Here, the ternary image 83 may be numerical data in which estimated values 2, 1, and 0 are assigned to each pixel, or may be an image in which, instead of estimated values 2, 1, and 0, display colors corresponding to values 2, 1, and 0, respectively, are used.

図１４に示すステップＳ２８では、処理部２０Ａは、解析結果の３値画像８３を出力部２７に出力する。 In step S28 shown in FIG. 14, the processing unit 20A outputs a ternary image 83 of the analysis results to the output unit 27.

ステップＳ２９では、ステップＳ２８に引き続き、処理部２０Ａは、解析結果の３値画像８３についてさらに、領域検出処理を行う。３値画像８３において、腫瘍領域と、非腫瘍領域と、非組織領域とは、３値で区別して表されている。 In step S29, following step S28, the processing unit 20A further performs region detection processing on the ternary image 83 of the analysis results. In the ternary image 83, the tumor region, non-tumor region, and non-tissue region are distinguished and represented by three values.

任意ではあるが、処理部２０Ａは、得られた各領域を、解析対象の解析対象画像７８に重ねることにより、領域強調画像８４を作成する。処理部２０Ａは、作成した領域強調画像８４を出力部２７に出力し、画像解析処理を終了する。 Optionally, the processing unit 20A creates a region-enhanced image 84 by overlaying each of the obtained regions on the analysis target image 78 of the analysis target. The processing unit 20A outputs the created region-enhanced image 84 to the output unit 27, and ends the image analysis process.

任意ではあるが、処理部２０Ａは、解析対象画像７８に含まれる組織において、腫瘍領域がどの程度含まれているか算出してもよい。例えば、解析対象画像７８に含まれる組織に該当する画素数、すなわち腫瘍領域および非腫瘍領域と判定された画素数の合計（組織領域総画素数）を１００％とした場合に、腫瘍領域と判定された画素数が何％を占めるかを算出することにより、腫瘍領域の含有率を算出することができる。また、総画素数から腫瘍領域と判定された画素数を減じることにより、または組織領域総画素数）を１００％とした場合に、非腫瘍領域と判定された画素数が何％を占めるかを算出することにより、非腫瘍領域の含有率を算出することができる。そして、腫瘍領域の含有率と非腫瘍領域の含有率との比率を算出してもよい。前記算出された値は、出力部２７に出力されてもよい。また、算出された値は、領域強調画像８４と共に出力部２７に出力されてもよい。 Optionally, processing unit 20A may calculate the extent to which the tissue contained in analysis target image 78 contains tumorous regions. For example, the percentage of pixels determined to be tumorous regions can be calculated by taking the number of pixels corresponding to the tissue contained in analysis target image 78, i.e., the sum of the number of pixels determined to be tumorous regions and non-tumorous regions (total number of pixels in the tissue region), as 100%. The percentage of non-tumorous regions can also be calculated by subtracting the number of pixels determined to be tumorous regions from the total number of pixels, or by calculating the percentage of pixels determined to be non-tumorous regions when the total number of pixels in the tissue region is taken as 100%. The ratio between the percentage of tumorous regions and the percentage of non-tumorous regions can then be calculated. The calculated value may be output to output unit 27. The calculated value may also be output to output unit 27 along with region-enhanced image 84.

このように腫瘍領域の含有率を算出することは、がん組織の遺伝子解析等に用いられる組織が検査に適した腫瘍組織を含んでいるかスクリーニングする上で有用である。 Calculating the tumor area content in this way is useful for screening whether tissue used for genetic analysis of cancer tissue contains tumor tissue suitable for testing.

以上、画像解析装置２００Ａのユーザは、解析対象の組織の解析対象画像７８を画像解析装置２００Ａに入力することにより、解析結果として、３値画像８３を取得することができる。３値画像８３は、解析対象の標本における、腫瘍領域と、非腫瘍領域と、非組織領域とを表しており、ユーザは、解析対象の標本において、各領域を判別することが可能となる。 As described above, a user of image analysis device 200A can input analysis target image 78 of the tissue to be analyzed into image analysis device 200A to obtain ternary image 83 as the analysis result. ternary image 83 represents the tumor region, non-tumor region, and non-tissue region in the specimen to be analyzed, allowing the user to distinguish between each region in the specimen to be analyzed.

さらに、画像解析装置２００Ａのユーザは、解析結果として、領域強調画像８４を取得することができる。領域強調画像８４は、例えば、解析対象の解析対象画像７８に、各領域を色で塗りつぶすことにより生成される。また、別の態様では、各領域の境界線を重ねることにより生成されている。これにより、ユーザは、解析対象の組織において、各領域を一目で把握することが可能となる。 Furthermore, the user of the image analysis device 200A can obtain a region-enhanced image 84 as the analysis result. The region-enhanced image 84 is generated, for example, by filling in each region with color in the analysis target image 78 of the analysis target. In another aspect, it is generated by overlapping the boundaries of each region. This allows the user to grasp each region in the tissue being analyzed at a glance.

解析対象の標本において腫瘍領域、および／または非腫瘍領域を示すことは、組織診断に先立って解析対象組織における腫瘍領域をスクリーニングし、標本観察効率を上げる一助となる。 Indicating tumor and/or non-tumor regions in the specimen to be analyzed helps to screen for tumor regions in the tissue to be analyzed prior to tissue diagnosis and improves specimen observation efficiency.

＜第２の実施形態＞
以下、第１の実施形態に係る画像解析システムと相違する点について、第２の実施形態に係る画像解析システムを説明する。 Second Embodiment
The image analysis system according to the second embodiment will be described below, focusing on the differences from the image analysis system according to the first embodiment.

［構成の概要］
図１５を参照すると、第２の実施形態に係る画像解析システムは、ユーザ側装置２００を備え、ユーザ側装置２００が、統合型の画像解析装置２００Ｂとして動作する。画像解析装置２００Ｂは、例えば汎用コンピュータで構成されており、第１の実施形態において説明した深層学習処理および画像解析処理の両方の処理を行う。つまり、第２の実施形態に係る画像解析システムは、ユーザ側で深層学習および画像解析を行う、スタンドアロン型のシステムである。第２の実施形態に係る画像解析システムは、ユーザ側に設置された統合型の画像解析装置２００Ｂが、第１の実施形態に係る深層学習装置１００Ａおよび画像解析装置２００Ａの両方の機能を担う点において、第１の実施形態に係る画像解析システムと異なる。 [Configuration Overview]
Referring to FIG. 15 , the image analysis system according to the second embodiment includes a user-side device 200, which operates as an integrated image analysis device 200B. The image analysis device 200B is configured, for example, as a general-purpose computer and performs both the deep learning process and the image analysis process described in the first embodiment. In other words, the image analysis system according to the second embodiment is a standalone system in which deep learning and image analysis are performed on the user side. The image analysis system according to the second embodiment differs from the image analysis system according to the first embodiment in that the integrated image analysis device 200B installed on the user side performs the functions of both the deep learning device 100A and the image analysis device 200A according to the first embodiment.

画像解析装置２００Ｂは撮像装置４００に接続されている。撮像装置４００は、深層学習処理時には、学習用の組織の訓練用画像を取得し、画像解析処理時には、解析対象の組織の解析対象画像７８を取得する。 Image analysis device 200B is connected to imaging device 400. During deep learning processing, imaging device 400 acquires training images of tissue for learning, and during image analysis processing, acquires analysis target images 78 of the tissue to be analyzed.

［ハードウェア構成］
画像解析装置２００Ｂのハードウェア構成は、図９に示すユーザ側装置２００のハードウェア構成と同様である。 [Hardware configuration]
The hardware configuration of the image analyzing device 200B is the same as the hardware configuration of the user device 200 shown in FIG.

［機能ブロックおよび処理手順］
図１６を参照すると、第２の実施形態に係る画像解析装置２００Ｂの処理部２０Ｂは、訓練データ生成部１０１と、訓練データ入力部１０２と、アルゴリズム更新部１０３と、解析用データ生成部２０１と、解析用データ入力部２０２と、解析部２０３と、領域検出部２０４とを備える。これらの機能ブロックは、コンピュータに深層学習処理および画像解析処理を実行させるプログラムを、処理部２０Ｂの記憶部２３またはメモリ２２にインストールし、このプログラムをＣＰＵ２１が実行することにより実現される。ウィンドウサイズデータベース１０４と、アルゴリズムデータベース１０５とは、処理部２０Ｂの記憶部２３またはメモリ２２に記憶され、どちらも深層学習時および画像解析処理時に共通して使用される。学習済みのニューラルネットワーク６０は、組織の種別または細胞を含む試料の種別と対応付けられて、アルゴリズムデータベース１０５に予め格納されており、深層学習処理により結合重みｗが更新されて、深層学習アルゴリズム６０として、アルゴリズムデータベース１０５に格納される。なお、訓練用画像である第１の訓練用画像７０Ｒ１、第２の訓練用画像７０Ｒ２、第３の訓練用画像７０Ｒ３は、撮像装置４００によって予め撮像され、処理部２０Ｂの記憶部２３またはメモリ２２に予め記憶されていることとする。解析対象の標本の解析対象画像７８も、撮像装置４００によって予め撮像され、処理部２０Ｂの記憶部２３またはメモリ２２に予め記憶されていることとする。 [Function blocks and processing procedures]
16 , the processing unit 20B of the image analysis device 200B according to the second embodiment includes a training data generation unit 101, a training data input unit 102, an algorithm update unit 103, an analysis data generation unit 201, an analysis data input unit 202, an analysis unit 203, and a region detection unit 204. These functional blocks are implemented by installing a program for causing a computer to execute deep learning processing and image analysis processing in the storage unit 23 or memory 22 of the processing unit 20B and executing the program by the CPU 21. The window size database 104 and the algorithm database 105 are stored in the storage unit 23 or memory 22 of the processing unit 20B and are both used during deep learning and image analysis processing. A trained neural network 60 is associated with a tissue type or a cell-containing sample type and pre-stored in the algorithm database 105. The connection weights w are updated by the deep learning processing and stored in the algorithm database 105 as a deep learning algorithm 60. It is assumed that the first training image 70R1, the second training image 70R2, and the third training image 70R3, which are training images, are captured in advance by the imaging device 400 and stored in advance in the storage unit 23 or the memory 22 of the processing unit 20B. It is also assumed that the analysis target image 78 of the specimen to be analyzed is captured in advance by the imaging device 400 and stored in advance in the storage unit 23 or the memory 22 of the processing unit 20B.

画像解析装置２００Ｂの処理部２０Ｂは、深層学習処理時には、図１１に示す処理を行い、画像解析処理時には、図１４に示す処理を行う。図１６に示す各機能ブロックを用いて説明すると、深層学習処理時には、ステップＳ１０からＳ１２、Ｓ１４、Ｓ１８およびＳ１９の処理は、訓練データ生成部１０１が行う。ステップＳ１３の処理は、訓練データ入力部１０２が行う。ステップＳ１６およびＳ１７の処理は、アルゴリズム更新部１０３が行う。画像解析処理時には、ステップＳ２１およびＳ２２の処理は、解析用データ生成部２０１が行う。ステップＳ２３，Ｓ２４，Ｓ２６およびＳ２７の処理は、解析用データ入力部２０２が行う。ステップＳ２５およびＳ２８の処理は、解析部２０３が行う。ステップＳ２９の処理は、領域検出部２０４が行う。 The processing unit 20B of the image analysis device 200B performs the processing shown in FIG. 11 during deep learning processing, and performs the processing shown in FIG. 14 during image analysis processing. Using the functional blocks shown in FIG. 16, during deep learning processing, the processing of steps S10 to S12, S14, S18, and S19 is performed by the training data generation unit 101. The processing of step S13 is performed by the training data input unit 102. The processing of steps S16 and S17 is performed by the algorithm update unit 103. During image analysis processing, the processing of steps S21 and S22 is performed by the analysis data generation unit 201. The processing of steps S23, S24, S26, and S27 is performed by the analysis data input unit 202. The processing of steps S25 and S28 is performed by the analysis unit 203. The processing of step S29 is performed by the area detection unit 204.

第２の実施形態に係る画像解析装置２００Ｂが行う深層学習処理の手順および画像解析処理の手順は、第１の実施形態に係る深層学習装置１００Ａおよび画像解析装置２００Ａがそれぞれ行う手順と同様である。なお、第２の実施形態に係る画像解析装置２００Ｂは、次の点において第１の実施形態に係る深層学習装置１００Ａおよび画像解析装置２００Ａと異なる。 The deep learning processing procedures and image analysis processing procedures performed by the image analysis device 200B according to the second embodiment are similar to the procedures performed by the deep learning device 100A and image analysis device 200A according to the first embodiment, respectively. Note that the image analysis device 200B according to the second embodiment differs from the deep learning device 100A and image analysis device 200A according to the first embodiment in the following respects:

深層学習処理時のステップＳ１３において、処理部２０Ｂは、入力部２６を通じて、画像解析装置２００Ｂのユーザからの、学習用の組織の種別の入力を受け付ける。処理部２０Ｂは、入力された組織の種別に基づき、ウィンドウサイズデータベース１０４を参照して、ウィンドウサイズを設定し、アルゴリズムデータベース１０５を参照して、学習に用いるニューラルネットワーク５０を設定する。 In step S13 during deep learning processing, processing unit 20B accepts input of the type of tissue to be used for learning from the user of image analysis device 200B via input unit 26. Based on the input type of tissue, processing unit 20B references window size database 104 to set the window size, and references algorithm database 105 to set the neural network 50 to be used for learning.

以上、画像解析装置２００Ｂのユーザは、解析対象画像７８を画像解析装置２００Ｂに入力することにより、解析結果として、３値画像８３を取得することができる。さらに、画像解析装置２００Ｂのユーザは、解析結果として、領域強調画像８４を取得することができる。 As described above, by inputting the analysis target image 78 into the image analysis device 200B, the user of the image analysis device 200B can obtain a ternary image 83 as the analysis result. Furthermore, the user of the image analysis device 200B can obtain a region-enhanced image 84 as the analysis result.

第２の実施形態に係る画像解析装置２００Ｂによると、ユーザは、自身が選択した種類の組織を、学習用の組織として用いることができる。これは、ニューラルネットワーク５０の学習がベンダ側任せではなく、ユーザ自身がニューラルネットワーク５０の学習の程度を向上できることを意味する。 The image analysis device 200B according to the second embodiment allows the user to use the type of tissue they select as the tissue for learning. This means that the learning of the neural network 50 is not left to the vendor, but the user can improve the level of learning of the neural network 50 themselves.

＜第３の実施形態＞
以下、第２の実施形態に係る画像解析システムと相違する点について、第３の実施形態に係る画像解析システムを説明する。 Third Embodiment
The image analysis system according to the third embodiment will be described below, focusing on the differences from the image analysis system according to the second embodiment.

［構成の概要］
図１７を参照すると、第３の実施形態に係る画像解析システムは、ベンダ側装置１００と、ユーザ側装置２００とを備える。ベンダ側装置１００は統合型の画像解析装置１００Ｂとして動作し、ユーザ側装置２００は端末装置２００Ｃとして動作する。画像解析装置１００Ｂは、例えば汎用コンピュータで構成されており、第１の実施形態において説明した深層学習処理および画像解析処理の両方の処理を行う、クラウドサーバ側の装置である。端末装置２００Ｃは、例えば汎用コンピュータで構成されており、ネットワーク９９を通じて、解析対象の画像を画像解析装置１００Ｂに送信し、ネットワーク９９を通じて、解析結果の画像を画像解析装置１００Ｂから受信する、ユーザ側の端末装置である。 [Configuration Overview]
17 , the image analysis system according to the third embodiment includes a vendor-side device 100 and a user-side device 200. The vendor-side device 100 operates as an integrated image analysis device 100B, and the user-side device 200 operates as a terminal device 200C. The image analysis device 100B is configured, for example, as a general-purpose computer, and is a cloud server-side device that performs both the deep learning processing and the image analysis processing described in the first embodiment. The terminal device 200C is configured, for example, as a general-purpose computer, and is a user-side terminal device that transmits images to be analyzed to the image analysis device 100B via a network 99 and receives images of the analysis results from the image analysis device 100B via the network 99.

第３の実施形態に係る画像解析システムは、ベンダ側に設置された統合型の画像解析装置１００Ｂが、第１の実施形態に係る深層学習装置１００Ａおよび画像解析装置２００Ａの両方の機能を担う点において、第２の実施形態に係る画像解析システムと同様である。一方、第３の実施形態に係る画像解析システムは、端末装置２００Ｃを備え、解析対象の画像の入力インタフェースと、解析結果の画像の出力インタフェースとをユーザ側の端末装置２００Ｃに提供する点において、第２の実施形態に係る画像解析システムと異なる。つまり、第３の実施形態に係る画像解析システムは、深層学習処理および画像解析処理を行うベンダ側が、解析対象の画像および解析結果の画像の入出力インタフェースをユーザ側に提供する、クラウドサービス型のシステムである。 The image analysis system according to the third embodiment is similar to the image analysis system according to the second embodiment in that an integrated image analysis device 100B installed on the vendor side performs the functions of both the deep learning device 100A and the image analysis device 200A according to the first embodiment. On the other hand, the image analysis system according to the third embodiment differs from the image analysis system according to the second embodiment in that it includes a terminal device 200C, which provides an input interface for images to be analyzed and an output interface for images of the analysis results to the user-side terminal device 200C. In other words, the image analysis system according to the third embodiment is a cloud service-type system in which the vendor side, which performs deep learning processing and image analysis processing, provides the user side with input/output interfaces for images to be analyzed and images of the analysis results.

画像解析装置１００Ｂは撮像装置３００に接続されており、撮像装置３００によって撮像される、学習用の組織の訓練用画像を取得する。 The image analysis device 100B is connected to the imaging device 300 and acquires training images of the tissue to be studied, which are captured by the imaging device 300.

端末装置２００Ｃは撮像装置４００に接続されており、撮像装置４００によって撮像される、解析対象の組織の解析対象画像７８を取得する。 The terminal device 200C is connected to the imaging device 400 and acquires an analysis target image 78 of the tissue to be analyzed, which is captured by the imaging device 400.

［ハードウェア構成］
画像解析装置１００Ｂのハードウェア構成は、図７に示すベンダ側装置１００のハードウェア構成と同様である。端末装置２００Ｃのハードウェア構成は、図９に示すユーザ側装置２００のハードウェア構成と同様である。 [Hardware configuration]
The hardware configuration of the image analyzing device 100B is similar to that of the vendor-side device 100 shown in Fig. 7. The hardware configuration of the terminal device 200C is similar to that of the user-side device 200 shown in Fig. 9.

［機能ブロックおよび処理手順］
図１８を参照すると、第３の実施形態に係る画像解析装置１００Ｂの処理部１０Ｂは、訓練データ生成部１０１と、訓練データ入力部１０２と、アルゴリズム更新部１０３と、解析用データ生成部２０１と、解析用データ入力部２０２と、解析部２０３と、領域検出部２０４とを備える。これらの機能ブロックは、コンピュータに深層学習処理および画像解析処理を実行させるプログラムを、処理部１０Ｂの記憶部１３またはメモリ１２にインストールし、このプログラムをＣＰＵ１１が実行することにより実現される。ウィンドウサイズデータベース１０４と、アルゴリズムデータベース１０５とは、処理部１０Ｂの記憶部１３またはメモリ１２に記憶され、どちらも深層学習時および画像解析処理時に共通して使用される。ニューラルネットワーク５０は、組織の種別と対応付けられて、アルゴリズムデータベース１０５に予め格納されており、深層学習処理により結合重みｗが更新されて、深層学習アルゴリズム６０として、アルゴリズムデータベース１０５に格納される。 [Function blocks and processing procedures]
18 , the processing unit 10B of the image analysis device 100B according to the third embodiment includes a training data generation unit 101, a training data input unit 102, an algorithm update unit 103, an analysis data generation unit 201, an analysis data input unit 202, an analysis unit 203, and a region detection unit 204. These functional blocks are implemented by installing a program for causing a computer to execute deep learning processing and image analysis processing in the storage unit 13 or memory 12 of the processing unit 10B and executing the program by the CPU 11. The window size database 104 and the algorithm database 105 are stored in the storage unit 13 or memory 12 of the processing unit 10B, and both are used during deep learning and image analysis processing. The neural network 50 is associated with the tissue type and pre-stored in the algorithm database 105. The connection weight w is updated through deep learning processing and stored in the algorithm database 105 as a deep learning algorithm 60.

なお、訓練用画像である第１の訓練用画像７０Ｒ１、第２の訓練用画像７０Ｒ２、第３の訓練用画像７０Ｒ３は、撮像装置３００によって予め撮像され、処理部１０Ｂの記憶部１３またはメモリ１２に予め記憶されていることとする。解析対象の組織の解析対象画像７８も、撮像装置４００によって予め撮像され、端末装置２００Ｃの処理部２０Ｃの記憶部２３またはメモリ２２に予め記憶されていることとする。 The training images, first training image 70R1, second training image 70R2, and third training image 70R3, are assumed to have been captured in advance by imaging device 300 and stored in advance in storage unit 13 or memory 12 of processing unit 10B. The analysis target image 78 of the tissue to be analyzed is also assumed to have been captured in advance by imaging device 400 and stored in advance in storage unit 23 or memory 22 of processing unit 20C of terminal device 200C.

画像解析装置１００Ｂの処理部１０Ｂは、深層学習処理時には、図１１に示す処理を行い、画像解析処理時には、図１４に示す処理を行う。図１８に示す各機能ブロックを用いて説明すると、深層学習処理時には、ステップＳ１０からＳ１２、Ｓ１４、Ｓ１８およびＳ１９の処理は、訓練データ生成部１０１が行う。ステップＳ１３の処理は、訓練データ入力部１０２が行う。ステップＳ１６およびＳ１７の処理は、アルゴリズム更新部１０３が行う。画像解析処理時には、ステップＳ２１およびＳ２２の処理は、解析用データ生成部２０１が行う。ステップＳ２３，Ｓ２４，Ｓ２６およびＳ２７の処理は、解析用データ入力部２０２が行う。ステップＳ２５およびＳ２８の処理は、解析部２０３が行う。ステップＳ２９の処理は、領域検出部２０４が行う。 The processing unit 10B of the image analysis device 100B performs the processing shown in FIG. 11 during deep learning processing, and performs the processing shown in FIG. 14 during image analysis processing. Using the functional blocks shown in FIG. 18, during deep learning processing, the processing of steps S10 to S12, S14, S18, and S19 is performed by the training data generation unit 101. The processing of step S13 is performed by the training data input unit 102. The processing of steps S16 and S17 is performed by the algorithm update unit 103. During image analysis processing, the processing of steps S21 and S22 is performed by the analysis data generation unit 201. The processing of steps S23, S24, S26, and S27 is performed by the analysis data input unit 202. The processing of steps S25 and S28 is performed by the analysis unit 203. The processing of step S29 is performed by the area detection unit 204.

第３の実施形態に係る画像解析装置１００Ｂが行う深層学習処理の手順および画像解析処理の手順は、第１の実施形態に係る深層学習装置１００Ａおよび画像解析装置２００Ａがそれぞれ行う手順と同様である。なお、第３の実施形態に係る画像解析装置１００Ｂは、次の４つの点において第１の実施形態に係る深層学習装置１００Ａおよび画像解析装置２００Ａと異なる。 The deep learning processing procedure and image analysis processing procedure performed by the image analysis device 100B according to the third embodiment are similar to the procedures performed by the deep learning device 100A and the image analysis device 200A according to the first embodiment, respectively. Note that the image analysis device 100B according to the third embodiment differs from the deep learning device 100A and the image analysis device 200A according to the first embodiment in the following four points:

図１４に示す画像解析処理時のステップＳ２１において、処理部１０Ｂは、解析対象の組織の解析対象画像７８を、ユーザ側の端末装置２００Ｃから受信し、受信した解析対象画像７８からＲ、Ｇ、Ｂ各色の色濃度符号化図７９ｒ，７９ｇ，７９ｂを生成する。色濃度符号化図７９ｒ，７９ｇ，７９ｂおよび解析対象データの生成方法は、図１１に示す深層学習処理時におけるステップＳ１０での生成方法と同様である。 In step S21 of the image analysis process shown in FIG. 14, the processing unit 10B receives an analysis target image 78 of the tissue to be analyzed from the user's terminal device 200C, and generates color density coded maps 79r, 79g, and 79b for each of the colors R, G, and B from the received analysis target image 78. The method for generating the color density coded maps 79r, 79g, and 79b and the analysis target data is the same as the method for generating them in step S10 of the deep learning process shown in FIG. 11.

図１４に示す画像解析処理時のステップＳ２２において、処理部１０Ｂは、端末装置２００Ｃの入力部２６を通じて、解析条件として、端末装置２００Ｃのユーザからの、組織の種別の入力を受け付ける。処理部１０Ｂは、入力された組織の種別に基づき、ウィンドウサイズデータベース１０４およびアルゴリズムデータベース１０５を参照して、解析に用いるウィンドウサイズを設定し、解析に用いる深層学習アルゴリズム６０を取得する。 In step S22 of the image analysis process shown in FIG. 14, the processing unit 10B receives input of the tissue type from the user of the terminal device 200C as an analysis condition via the input unit 26 of the terminal device 200C. Based on the input tissue type, the processing unit 10B references the window size database 104 and the algorithm database 105 to set the window size to be used in the analysis and obtain the deep learning algorithm 60 to be used in the analysis.

画像解析処理時のステップＳ２８において、処理部１０Ｂは、解析結果の３値画像８３を、ユーザ側の端末装置２００Ｃに送信する。ユーザ側の端末装置２００Ｃでは、処理部２０Ｃが、受信した解析結果の３画像８３を出力部２７に出力する。 In step S28 during image analysis processing, the processing unit 10B transmits the ternary image 83 of the analysis results to the user's terminal device 200C. In the user's terminal device 200C, the processing unit 20C outputs the received ternary image 83 of the analysis results to the output unit 27.

画像解析処理時のステップＳ２９において、処理部１０Ｂは、ステップＳ２８に引き続き、解析結果の３値画像８３についてさらに、核領域の検出処理を行う。任意のステップとして、処理部１０Ｂは、得られた各領域を、解析対象の解析対象画像７８に重ねることにより、領域強調画像８４を作成する。処理部１０Ｂは、作成した領域強調画像８４を、ユーザ側の端末装置２００Ｃに送信する。ユーザ側の端末装置２００Ｃでは、処理部２０Ｃが、受信した領域強調画像８４を出力部２７に出力し、画像解析処理を終了する。 In step S29 of the image analysis process, following step S28, the processing unit 10B further performs a nuclear region detection process on the ternary image 83 of the analysis results. As an optional step, the processing unit 10B creates a region-enhanced image 84 by overlaying each of the obtained regions on the analysis target image 78 of the analysis target. The processing unit 10B transmits the created region-enhanced image 84 to the user-side terminal device 200C. In the user-side terminal device 200C, the processing unit 20C outputs the received region-enhanced image 84 to the output unit 27, thereby completing the image analysis process.

以上、端末装置２００Ｃのユーザは、解析対象の組織の解析対象画像７８を画像解析装置１００Ｂに送信することにより、解析結果として、３値画像８３を取得することができる。さらに、端末装置２００Ｃのユーザは、解析結果として、領域強調画像８４を取得することができる。 As described above, the user of terminal device 200C can obtain a ternary image 83 as the analysis result by sending an analysis target image 78 of the tissue to be analyzed to image analysis device 100B. Furthermore, the user of terminal device 200C can obtain a region-enhanced image 84 as the analysis result.

第３の実施形態に係る画像解析装置１００Ｂによると、ユーザは、ウィンドウサイズデータベース１０４およびアルゴリズムデータベース１０５を深層学習装置１００Ａから取得することなく、画像解析処理の結果を享受することができる。これにより、解析対象の組織を解析するサービスとして、腫瘍領域および／または非腫瘍領域を判別し各組織における腫瘍領域をスクリーニングするサービスを、クラウドサービスとして提供することができる。 The image analysis device 100B according to the third embodiment allows a user to enjoy the results of image analysis processing without acquiring the window size database 104 and algorithm database 105 from the deep learning device 100A. This makes it possible to provide a cloud service that analyzes the tissue to be analyzed, distinguishing between tumorous and/or non-tumorous regions and screening for tumorous regions in each tissue.

［コンピュータプログラム］
本発明の実施形態には、前記ステップＳ１０からＳ２０を処理部１０Ａ，２０Ｂ，１０Ｂに実行させ、学習済み深層学習アルゴリズムの生成するコンピュータプログラムおよびその製品が含まれる。また、本発明の実施形態には、前記ステップＳ２１からＳ２９を処理部１０Ａ，２０Ｂ，１０Ｂに実行させ、コンピュータを個体から採取された組織の画像を解析するために機能させるコンピュータプログラムおよびその製品が含まれる。 [Computer Program]
Embodiments of the present invention include a computer program and a product thereof that causes processing units 10A, 20B, and 10B to execute steps S10 to S20 and generate a trained deep learning algorithm.Embodiments of the present invention also include a computer program and a product thereof that causes processing units 10A, 20B, and 10B to execute steps S21 to S29 and cause a computer to function to analyze images of tissue collected from an individual.

［その他の形態］
以上、本発明を概要および特定の実施形態によって説明したが、本発明は上記した概要および各実施形態に限定されるものではない。 [Other forms]
Although the present invention has been described above with reference to the outline and specific embodiments, the present invention is not limited to the outline and each embodiment described above.

上記第１から第３の実施形態では、ステップＳ１３において、処理部１０Ａ，２０Ｂ，１０Ｂは、ウィンドウサイズデータベース１０４を参照して、ウィンドウサイズの画素数を設定しているが、オペレータまたはユーザがウィンドウサイズを直接設定してもよい。この場合、ウィンドウサイズデータベース１０４は不要となる。 In the first to third embodiments described above, in step S13, processing units 10A, 20B, and 10B refer to the window size database 104 to set the number of pixels for the window size, but the operator or user may set the window size directly. In this case, the window size database 104 is not necessary.

上記第１から第３の実施形態では、ステップＳ１３において、処理部１０Ａ，２０Ｂ，１０Ｂは、入力された組織の種別に基づいて、ウィンドウサイズの画素数を設定しているが、組織の種別の入力に代えて、組織のサイズを入力してもよい。処理部１０Ａ，２０Ｂ，１０Ｂは、入力された組織のサイズに基づいて、ウィンドウサイズデータベース１０４を参照して、ウィンドウサイズの画素数を設定すればよい。ステップＳ２２においてもステップＳ１３と同様に、組織の種別の入力に代えて、組織のサイズを入力してもよい。処理部２０Ａ，２０Ｂ，１０Ｂは、入力された組織のサイズに基づいて、ウィンドウサイズデータベース１０４およびアルゴリズムデータベース１０５を参照して、ウィンドウサイズの画素数を設定し、ニューラルネットワーク６０を取得すればよい。 In the first to third embodiments described above, in step S13, processing units 10A, 20B, and 10B set the number of pixels in the window size based on the input tissue type. However, instead of inputting the tissue type, the tissue size may be input. Processing units 10A, 20B, and 10B may set the number of pixels in the window size by referencing window size database 104 based on the input tissue size. In step S22, as in step S13, instead of inputting the tissue type, the tissue size may be input. Processing units 20A, 20B, and 10B may set the number of pixels in the window size by referencing window size database 104 and algorithm database 105 based on the input tissue size, and acquire the neural network 60.

組織のサイズを入力する態様については、サイズを数値として直接入力してもよいし、例えば入力のユーザインタフェースをプルダウンメニューとして、ユーザが入力しようとするサイズに対応する、所定の数値範囲をユーザに選択させて入力してもよい。 When entering the size of the tissue, the size may be entered directly as a numerical value, or the input user interface may be a pull-down menu, allowing the user to select a predetermined numerical range that corresponds to the size they wish to enter.

また、ステップＳ１３およびステップＳ２２において、組織の種別または組織のサイズに加えて、訓練用画像、および解析対象画像７８を撮像した際の撮像倍率を入力してもよい。撮像倍率を入力する態様については、倍率を数値として直接入力してもよいし、例えば入力のユーザインタフェースをプルダウンメニューとして、ユーザが入力しようとする倍率に対応する、所定の数値範囲をユーザに選択させて入力してもよい。 Furthermore, in steps S13 and S22, in addition to the tissue type or tissue size, the imaging magnification used when capturing the training image and the analysis target image 78 may be input. The imaging magnification may be input directly as a numerical value, or, for example, the input user interface may be a pull-down menu, allowing the user to select a predetermined numerical range corresponding to the magnification they wish to input.

上記第１から第３の実施形態では、深層学習処理時および画像解析処理時において、説明の便宜のためにウィンドウサイズを３×３画素と設定しているが、ウィンドウサイズの画素数はこれに限定されない。ウィンドウサイズは、例えば組織の種別に応じて設定してもよい。この場合、ウィンドウサイズの画素数と画像に含まれる色の原色の数との積が、ニューラルネットワーク５０，６０の入力層５０ａ，６０ａのノード数に対応していればよい。 In the first to third embodiments described above, the window size is set to 3x3 pixels during deep learning processing and image analysis processing for ease of explanation, but the number of pixels in the window size is not limited to this. The window size may be set according to the type of tissue, for example. In this case, the product of the number of pixels in the window size and the number of primary colors contained in the image should correspond to the number of nodes in the input layers 50a, 60a of the neural networks 50, 60.

上記第１から第３の実施形態では、ステップＳ１７において、処理部１０Ａ，２０Ｂ，１０Ｂは、深層学習アルゴリズム６０を、組織の種別と一対一に対応付けて、アルゴリズムデータベース１０５に記憶している。これに代えて、ステップＳ１７において、処理部１０Ａ，２０Ｂ，１０Ｂは、１つの深層学習アルゴリズム６０に、複数の組織の種別を対応付けて、アルゴリズムデータベース１０５に記憶してもよい。 In the first to third embodiments described above, in step S17, the processing units 10A, 20B, and 10B store the deep learning algorithms 60 in the algorithm database 105 in one-to-one correspondence with the organization type. Alternatively, in step S17, the processing units 10A, 20B, and 10B may store one deep learning algorithm 60 in the algorithm database 105 in correspondence with multiple organization types.

上記第１から第３の実施形態では、色相は、光の３原色の組み合わせ、または色の３原色の組み合わせで規定されているが、色相の数は３つに限定されない。色相の数は、赤（Ｒ），緑（Ｇ），青（Ｂ）に黄（Ｙ）を加えた４原色としても良いし、赤（Ｒ），緑（Ｇ），青（Ｂ）の３原色からいずれか１つの色相を減らした２原色としてもよい。あるいは、赤（Ｒ），緑（Ｇ），青（Ｂ）の３原色のいずれか１つ（例えば緑（Ｇ））のみの１原色としてもよい。例えば公知の顕微鏡またはバーチャルスライドスキャナ等を用いて取得される訓練用画像７０Ｒ１，７０Ｒ２，７０Ｒ３および解析対象画像７８も、赤（Ｒ），緑（Ｇ），青（Ｂ）の３原色のカラー画像に限定されず、２原色のカラー画像であってもよく、１以上の原色を含む画像であればよい。 In the first to third embodiments, the hue is defined as a combination of the three primary colors of light or the three primary colors of color, but the number of hues is not limited to three. The number of hues may be four, i.e., red (R), green (G), blue (B), and yellow (Y), or two, i.e., two hues, obtained by removing one hue from the three primary colors of red (R), green (G), and blue (B). Alternatively, the hue may be a single primary color, i.e., only one of the three primary colors of red (R), green (G), and blue (B) (e.g., green (G)). For example, the training images 70R1, 70R2, and 70R3 and the analysis target image 78, acquired using a known microscope or virtual slide scanner, are not limited to color images of the three primary colors of red (R), green (G), and blue (B). They may be color images of two primary colors, or may be images containing one or more primary colors.

上記第１から第３の実施形態では、ステップＳ１０において、処理部１０Ａ，２０Ｂ，１０Ｂは、訓練用画像の各色濃度符号化図（色濃度符号化図７０Ｒ１ｒ，７０Ｒ１ｇ，７０Ｒ１ｂ、色濃度符号化図７０Ｒ２ｒ，７０Ｒ２ｇ，７０Ｒ２ｂ、色濃度符号化図７０Ｒ３ｒ，７０Ｒ３ｇ，７０Ｒ３ｂ）を各原色の８ビット（２５６段階）の単一色画像として生成しているが、各色濃度符号化図を作成する際の原色の階調は、８ビットに制限されない。各色濃度符号化図の階調は、スライドイメージスキャナの撮像条件に依存してもよい。例えば、１ビット以上であり２ビット、４ビット、１６ビット、２４ビットの画像であってもよい。同様に、ステップＳ２１において、処理部２０Ａ，２０Ｂ，１０Ｂは、色濃度符号化図７９ｒ，７９ｇ，７９ｂ各原色の単一色画像として生成しているが、色濃度符号化図を作成する際の原色の階調は３階調に制限されない。色濃度符号化図を作成する際の原色は、８ビットに制限されない。色濃度符号化図７９ｒ，７９ｇ，７９ｂの階調は、スライドイメージスキャナの撮像条件に依存してもよい。例えば、１ビット以上であり２ビット、４ビット、１６ビット、２４ビットの画像であってもよい。訓練用画像の各色濃度符号化図および色濃度符号化図７９ｒ，７９ｇ，７９ｂの階調は、全てが同じ階調であることが好ましい。 In the first to third embodiments described above, in step S10, processing units 10A, 20B, and 10B generate each density coding map of the training image (density coding maps 70R1r, 70R1g, and 70R1b; density coding maps 70R2r, 70R2g, and 70R2b; and density coding maps 70R3r, 70R3g, and 70R3b) as an 8-bit (256-level) single-color image of each primary color. However, the gradation of the primary colors used to create each density coding map is not limited to 8 bits. The gradation of each density coding map may depend on the imaging conditions of the slide image scanner. For example, the gradation may be 1 bit or more, such as 2 bits, 4 bits, 16 bits, or 24 bits. Similarly, in step S21, processing units 20A, 20B, and 10B generate color density coding maps 79r, 79g, and 79b as single-color images of each primary color. However, the number of gradations of the primary colors used to create the color density coding maps is not limited to three gradations. The number of gradations of the primary colors used to create the color density coding maps is not limited to 8 bits. The gradations of the color density coding maps 79r, 79g, and 79b may depend on the imaging conditions of the slide image scanner. For example, they may be 1 bit or more, such as 2 bits, 4 bits, 16 bits, or 24 bits. It is preferable that the gradations of each color density coding map and color density coding maps 79r, 79g, and 79b of the training images are all the same.

上記第１から第３の実施形態では、ステップＳ１０において、処理部１０Ａ，２０Ｂ，１０Ｂは、訓練用画像から、各色濃度符号化図を生成しているが、予め各画素が階調に応じて符号化された行列表を訓練用画像として用いてもよい。処理部１０Ａ，２０Ｂ，１０Ｂは、訓練用画像を各色濃度符号化図として、例えばバーチャルスライドスキャナ等から直接取得してもよい。同様に、ステップＳ２１において、処理部２０Ａ，２０Ｂ，１０Ｂは、解析対象画像７８からＲ、Ｇ、Ｂ各色の色濃度符号化図７９ｒ，７９ｇ，７９ｂを生成しているが、予め各画素が階調に応じて符号化された行列表を解析用画像として用いてもよい。すなわち、処理部２０Ａ，２０Ｂ，１０Ｂは、色濃度符号化図７９ｒ，７９ｇ，７９ｂを、例えばバーチャルスライドスキャナ等から直接取得してもよい。 In the first to third embodiments described above, in step S10, the processing units 10A, 20B, and 10B generate each color density coding map from the training image. However, a matrix table in which each pixel is pre-coded according to gradation may be used as the training image. The processing units 10A, 20B, and 10B may acquire the training image as each color density coding map directly, for example, from a virtual slide scanner. Similarly, in step S21, the processing units 20A, 20B, and 10B generate color density coding maps 79r, 79g, and 79b for each of the R, G, and B colors from the analysis target image 78. However, a matrix table in which each pixel is pre-coded according to gradation may be used as the analysis image. That is, the processing units 20A, 20B, and 10B may acquire the color density coding maps 79r, 79g, and 79b directly, for example, from a virtual slide scanner.

上記第１から第３の実施形態では、訓練用画像から各色濃度符号化図を生成する際のカラースペースにはＲＧＢを用いているが、カラースペースはＲＧＢに制限されない。ＲＧＢ以外にも、ＹＵＶ、ＣＭＹ、およびＣＩＥＬ^＊ａ^＊ｂ^＊等の種々のカラースペースを用いることができる。 In the first to third embodiments, RGB is used as the color space when generating each color density coding map from the training images, but the color space is not limited to RGB. Various color spaces such as YUV, CMY, and CIE L ^* a ^* b ^* can be used instead of RGB.

上記第１から第３の実施形態では、各領域訓練データおよび解析用データ８０において、各画素について濃度値が赤（Ｒ），緑（Ｇ），青（Ｂ）の順番で格納されているが、濃度値を格納および取り扱う順番はこれに限定されない。例えば濃度値は、青（Ｂ），緑（Ｇ），赤（Ｒ）の順番で格納されていてもよく、各領域訓練データにおける濃度値の並び順と、解析用データ８０における濃度値の並び順とが同じであればよい。 In the first to third embodiments described above, the density values for each pixel in the area training data and analysis data 80 are stored in the order of red (R), green (G), and blue (B), but the order in which the density values are stored and handled is not limited to this. For example, the density values may be stored in the order of blue (B), green (G), and red (R), as long as the order of the density values in the area training data and the analysis data 80 are the same.

上記第１から第３の実施形態では、処理部１０Ａ，１０Ｂは一体の装置として実現されているが、処理部１０Ａ，１０Ｂは一体の装置である必要はなく、ＣＰＵ１１、メモリ１２、記憶部１３等が別所に配置され、これらがネットワークで接続されていてもよい。処理部１０Ａ，１０Ｂと、入力部１６と、出力部１７とについても、一ヶ所に配置される必要は必ずしもなく、それぞれ別所に配置されて互いにネットワークで通信可能に接続されていてもよい。処理部２０Ａ，２０Ｂ，２０Ｃについても処理部１０Ａ，１０Ｂと同様である。 In the first to third embodiments described above, processing units 10A and 10B are realized as an integrated device, but processing units 10A and 10B do not need to be an integrated device; the CPU 11, memory 12, storage unit 13, etc. may be located in separate locations and connected via a network. Processing units 10A and 10B, input unit 16, and output unit 17 also do not necessarily need to be located in the same place; they may be located in separate locations and connected to each other via a network so that they can communicate with each other. The same applies to processing units 20A, 20B, and 20C.

上記第１から第３の実施形態では、訓練データ生成部１０１、訓練データ入力部１０２、アルゴリズム更新部１０３、解析用データ生成部２０１、解析用データ入力部２０２、解析部２０３および領域検出部２０４の各機能ブロックは、単一のＣＰＵ１１または単一のＣＰＵ２１において実行されているが、これら各機能ブロックは単一のＣＰＵにおいて実行される必要は必ずしもなく、複数のＣＰＵで分散して実行されてもよい。また、これら各機能ブロックは、複数のＧＰＵで分散して実行されてもよいし、複数のＣＰＵと複数のＧＰＵとで分散して実行されてもよい。 In the first to third embodiments described above, the functional blocks of the training data generation unit 101, training data input unit 102, algorithm update unit 103, analysis data generation unit 201, analysis data input unit 202, analysis unit 203, and area detection unit 204 are executed by a single CPU 11 or a single CPU 21, but these functional blocks do not necessarily have to be executed by a single CPU and may be executed in a distributed manner across multiple CPUs. Furthermore, these functional blocks may be executed in a distributed manner across multiple GPUs, or in a distributed manner across multiple CPUs and multiple GPUs.

上記第２および第３の実施形態では、図１１および図１４で説明する各ステップの処理を行うためのプログラムを記憶部１３，２３に予め記憶している。これに代えて、プログラムは、例えばＤＶＤ－ＲＯＭやＵＳＢメモリ等の、コンピュータ読み取り可能であって非一時的な有形の記憶媒体９８から処理部１０Ｂ，２０Ｂにインストールしてもよい。または、処理部１０Ｂ，２０Ｂをネットワーク９９と接続し、ネットワーク９９を介して例えば外部のサーバ（図示せず）からプログラムをダウンロードしてインストールしてもよい。 In the second and third embodiments described above, the programs for performing the processing of each step described in Figures 11 and 14 are stored in advance in the storage units 13 and 23. Alternatively, the programs may be installed in the processing units 10B and 20B from a computer-readable, non-transitory, tangible storage medium 98, such as a DVD-ROM or USB memory. Alternatively, the processing units 10B and 20B may be connected to a network 99, and the programs may be downloaded and installed via the network 99 from, for example, an external server (not shown).

上記第１から第３の実施形態では、入力部１６，２６はキーボードまたはマウス等の入力装置であり、出力部１７，２７は液晶ディスプレイ等の表示装置として実現されている。これに代えて、入力部１６、２６と出力部１７、２７とを一体化してタッチパネル式の表示装置として実現してもよい。または、出力部１７，２７をプリンター等で構成し、解析結果の３値画像８３または領域強調画像８４を印刷して出力してもよい。 In the first to third embodiments described above, the input units 16, 26 are input devices such as a keyboard or a mouse, and the output units 17, 27 are realized as display devices such as a liquid crystal display. Alternatively, the input units 16, 26 and the output units 17, 27 may be integrated into a touch panel display device. Alternatively, the output units 17, 27 may be configured as a printer or the like, and the ternary image 83 or region-enhanced image 84 of the analysis results may be printed and output.

上記第１から第３の実施形態では、撮像装置３００は、深層学習装置１００Ａまたは画像解析装置１００Ｂと直接接続されているが、撮像装置３００は、ネットワーク９９を介して深層学習装置１００Ａまたは画像解析装置１００Ｂと接続されていてもよい。撮像装置４００についても同様に、撮像装置４００は、画像解析装置２００Ａまたは画像解析装置２００Ｂと直接接続されているが、撮像装置４００は、ネットワーク９９を介して画像解析装置２００Ａまたは画像解析装置２００Ｂと接続されていてもよい。 In the first to third embodiments described above, imaging device 300 is directly connected to deep learning device 100A or image analysis device 100B, but imaging device 300 may be connected to deep learning device 100A or image analysis device 100B via network 99. Similarly, imaging device 400 is directly connected to image analysis device 200A or image analysis device 200B, but imaging device 400 may be connected to image analysis device 200A or image analysis device 200B via network 99.

＜学習済み深層学習アルゴリズムの検証＞
上記第２の実施形態に示すスタンドアロン型のシステムにて、深層学習処理および画像解析処理を行った。学習の対象とする組織として、腫瘍領域と正常領域を含むヒト胃がん組織を用いた。学習済みの深層学習アルゴリズムを作成するにあたり、組織標本の拡大倍率を１倍、３倍、５倍、１０倍、２０倍、および４０倍と変化させて訓練用画像を撮像し、異なる拡大倍率の訓練用画像を用いた訓練データをニューラルネットワークに学習させ、解析結果の正確性を確認した。
学習データおよび解析データの詳細は、次の通りであった。
学習データ：１０６枚のホールスライドイメージ
検証用解析データ：４５枚のホールスライドイメージ <Verification of trained deep learning algorithms>
Deep learning processing and image analysis processing were performed using the standalone system described in the second embodiment. Human stomach cancer tissue, including tumor and normal regions, was used as the target tissue for learning. To create the trained deep learning algorithm, training images were captured at magnifications of 1x, 3x, 5x, 10x, 20x, and 40x for the tissue specimen. The neural network was trained using the training data using the training images at different magnifications, and the accuracy of the analysis results was confirmed.
Details of the training data and analysis data are as follows:
Training data: 106 hole slide images Verification analysis data: 45 hole slide images

［訓練データの作成および学習］
ＨＥ染色した胃がん組織の明視野画像のホールスライドイメージを、バーチャルスライドスキャナ（NanoZoomer-XR、(Hamamatsu Photonics；スキャン分解能：20倍モードスキャン時0.46 μm/pixel、40倍モードスキャン時0.23 μm/pixel)）を用いてカラー撮像した。撮像倍率は１倍、３倍、５倍、１０倍、２０倍、および４０倍とした。学習対象の組織を含むホールスライドイメージ上で、病理医が腫瘍領域、非腫瘍領域、および非組織領域を指定した。各領域が指定されたホールスライドイメージを上記の条件で分割し、訓練用画像とした。個々の訓練用画像について腫瘍領域であるか、非腫瘍領域であるか、非組織領域であるかを決定した。その際、腫瘍領域と非腫瘍領域が混在する組織の領域については、訓練用画像内の組織領域に該当する画素数のうち、腫瘍領域と指定された画素数が５０％以上を占める場合には腫瘍領域とした。また、訓練用画像内の組織領域に該当する画素数のうち、非腫瘍領域と指定された画素数が占める割合が１００％の場合には非腫瘍領域とした。非組織領域と組織領域が混在する訓練用画像については、組織領域が訓練用画像の画素数の０％である場合に非組織領域とし、それ以外は組織領域とした。腫瘍領域、非腫瘍領域、非組織領域と判定されたそれぞれの訓練用画像の各画素について、各領域を区別するラベル値（腫瘍領域は「１」、非腫瘍領域は「２」、非組織領域は「０」）を付与し、第１の予備訓練データ７０Ｒ１Ｌ、第２の予備訓練データ７０Ｒ２Ｌおよび第３の予備訓練データ７０Ｒ２Ｌを生成した。腫瘍領域と判定された訓練用画像を第１の訓練用画像とし、非腫瘍領域と判定された訓練用画像を第２の訓練用画像７０Ｒ２とし、非組織領域と判定された訓練用画像を第３の訓練用画像７０Ｒ３とした。各訓練用画像についてＲ、Ｇ、Ｂ各色の色濃度値を８ビットで階調化して、Ｒ、Ｇ、Ｂ各色の色濃度符号化図を生成し、作成したＲ、Ｇ、Ｂ各色の色濃度符号化図を組み合わせて第１の訓練データ７４Ｒ１、第２の訓練データ７４Ｒ２、第３の訓練データ７４Ｒ３のそれぞれを生成した。 [Creating training data and learning]
Bright-field whole-slide images of HE-stained gastric cancer tissue were scanned in color using a virtual slide scanner (NanoZoomer-XR, Hamamatsu Photonics; scanning resolution: 0.46 μm/pixel in 20x mode, 0.23 μm/pixel in 40x mode). The imaging magnifications were 1x, 3x, 5x, 10x, 20x, and 40x. A pathologist designated tumor, non-tumor, and non-tissue regions on the whole-slide images containing the target tissue. The whole-slide images with designated regions were divided according to the above criteria and used as training images. Each training image was determined to be tumor, non-tumor, or non-tissue. For tissue regions containing both tumor and non-tumor regions, a region was considered tumor if 50% or more of the pixels designated as tumor were included in the tissue region in the training image. Furthermore, if the proportion of pixels designated as non-tumorous regions among the number of pixels corresponding to tissue regions in the training image was 100%, the region was designated as non-tumorous. For training images containing a mixture of non-tissue and tissue regions, if the tissue region accounted for 0% of the number of pixels in the training image, the region was designated as non-tissue, and otherwise the region was designated as tissue. Each pixel in each training image determined to be a tumor region, non-tumor region, or non-tissue region was assigned a label value (tumor region: "1", non-tumor region: "2", non-tissue region: "0") to distinguish between the regions, and first preliminary training data 70R1L, second preliminary training data 70R2L, and third preliminary training data 70R2L were generated. The training image determined to be a tumor region was designated as the first training image, the training image determined to be a non-tumor region was designated as the second training image 70R2, and the training image determined to be a non-tissue region was designated as the third training image 70R3. For each training image, the color density values of each of the R, G, and B colors were gradated using 8 bits to generate color density encoding maps for each of the R, G, and B colors, and the created color density encoding maps for each of the R, G, and B colors were combined to generate the first training data 74R1, the second training data 74R2, and the third training data 74R3, respectively.

その後、第１の訓練データ７４Ｒ１と第１の予備訓練データ７０Ｒ１Ｌとを組み合わせて腫瘍領域訓練データを作成した。第２の訓練データ７４Ｒ２と第２の予備訓練データ７０Ｒ２Ｌとを組み合わせて非腫瘍領域訓練データを作成した。第３の訓練データ７４Ｒ３と第３の予備訓練データ７０Ｒ２Ｌとを組み合わせて非組織領域訓練データを作成した。作成した各領域訓練データを２００×２００画素のウィンドウサイズに分割し、分割したウィンドウサイズの訓練データを入力層として、ニューラルネットワークを学習させた。 Tumor region training data was then created by combining the first training data 74R1 and the first preliminary training data 70R1L. Non-tumor region training data was created by combining the second training data 74R2 and the second preliminary training data 70R2L. Non-tissue region training data was created by combining the third training data 74R3 and the third preliminary training data 70R2L. Each region training data created was divided into windows of 200 x 200 pixels, and the neural network was trained using the training data of the divided window sizes as the input layer.

各領域訓練データは組織標本の拡大倍率毎に生成し、拡大倍率毎にニューラルネットワークを学習させた。 Training data for each region was generated for each magnification of the tissue specimen, and the neural network was trained for each magnification.

［解析対象画像の作成］
訓練データと同様に、ＨＥ染色したヒト胃がん組織の明視野画像のホールスライドイメージを、バーチャルスライドスキャナを用いてカラー撮像し解析用画像を得た。撮像倍率は１倍、３倍、５倍、１０倍、２０倍、および４０倍とした。その後、撮像した解析用画像をもとにＲ、Ｇ、Ｂ各色の色濃度符号化図を作成し、作成したＲ、Ｇ、Ｂ各色の色濃度符号化図を組み合わせて解析対象データを生成した。解析対象データは、拡大倍率毎に生成した。 [Creating an image to be analyzed]
Similar to the training data, whole-slide images of HE-stained human gastric cancer tissues were captured in bright field using a virtual slide scanner to obtain the analysis images. The magnifications were 1x, 3x, 5x, 10x, 20x, and 40x. R, G, and B color density coded maps were then created based on the captured analysis images, and the resulting R, G, and B color density coded maps were combined to generate the analysis data. The analysis data was generated for each magnification.

［解析結果］
解析対象データから２００×２００画素のウィンドウサイズの解析用データを作成し、解析用データを、拡大倍率に対応した学習済みのニューラルネットワークに入力した。ニューラルネットワークから出力される解析結果をもとに、腫瘍領域と、非腫瘍領域と、非組織領域（背景）とに分類し、解析処理により得られた腫瘍領域を白で表示し、非腫瘍領域をグレーで表示し、非組織領域を黒で表示するように処理した。解析結果を図１９に示す。 [Analysis results]
Analysis data with a window size of 200 x 200 pixels was created from the data to be analyzed, and the analysis data was input into a trained neural network corresponding to the magnification. Based on the analysis results output from the neural network, the data was classified into tumor regions, non-tumor regions, and non-tissue regions (background). The tumor regions obtained by the analysis process were displayed in white, non-tumor regions in gray, and non-tissue regions in black. The analysis results are shown in Figure 19.

図１９中、（ａ）は、胃がん組織をＨＥ染色して撮像した明視野画像のホールスライドイメージであり、解析対象の画像である。（ｂ）は、ホールスライドイメージの腫瘍領域を病理医が指定し実線で囲った像である。（ｃ）は、拡大倍率１倍の訓練用画像と解析用画像を用いた解析結果を示す。病理医が指定した各領域を正確に示すことはできなかった。（ｄ）は、拡大倍率５倍の訓練用画像と解析用画像を用いた解析結果を示す。病理医が指定した腫瘍領域を腫瘍領域として概ね判別することができた。また、病理医が指定した非腫瘍領域を非腫瘍領域として概ね判別することができた。病理医が指定した非組織領域を非組織領域として概ね判別することができた。（ｅ）は、拡大倍率４０倍の訓練用画像と解析用画像を用いた解析結果を示す。病理医が指定した腫瘍領域を腫瘍領域としてある程度判別することができた。しかし、病理医が非腫瘍領域と指定した領域において、腫瘍領域と判定される領域が増えており、非腫瘍領域を腫瘍領域と誤判定する割合が高くなっていた。 In Figure 19, (a) is a whole slide image of a bright-field image of HE-stained gastric cancer tissue, and is the image to be analyzed. (b) is an image of the whole slide image in which the pathologist designated the tumor area and enclosed by a solid line. (c) shows the analysis results using training images and analysis images magnified by 1x. It was not possible to accurately identify each of the areas designated by the pathologist. (d) shows the analysis results using training images and analysis images magnified by 5x. The tumor area designated by the pathologist was generally able to be identified as tumor area. In addition, the non-tumor area designated by the pathologist was generally able to be identified as non-tumor area. The non-tissue area designated by the pathologist was generally able to be identified as non-tissue area. (e) shows the analysis results using training images and analysis images magnified by 40x. The tumor area designated by the pathologist was generally able to be identified as tumor area. However, among areas designated by pathologists as non-tumorous, the number of areas judged to be tumorous was increasing, resulting in a high rate of non-tumorous areas being mistakenly identified as tumorous.

以上の結果から、訓練用画像および解析用画像を取得する際の拡大倍率は低すぎても高すぎても誤判定の原因となることが示された。 These results show that using magnifications that are too low or too high when acquiring training images and analysis images can lead to incorrect classification.

図２０中、（ａ）は、図１９とは異なる胃がん組織をＨＥ染色して撮像した明視野画像のホールスライドイメージであり、解析対象の画像である。（ｂ）は、ホールスライドイメージの腫瘍領域を病理医が指定し実線で囲った像である。（ｃ）は、拡大倍率５倍の訓練用画像と解析用画像を用いた解析結果を示す。病理医が指定した腫瘍領域を腫瘍領域として概ね判別することができた。また、病理医が指定した非腫瘍領域を非腫瘍領域として概ね判別することができた。病理医が指定した非組織領域を非組織領域として概ね判別することができた。（ｄ）は、拡大倍率２０倍の訓練用画像と解析用画像を用いた解析結果を示す。拡大倍率５倍の場合と同様に、病理医が指定した各領域を概ね判別することができた。 In Figure 20, (a) is a whole slide image of a bright-field image of HE-stained gastric cancer tissue, different from that shown in Figure 19, and is the image to be analyzed. (b) is an image of the whole slide image in which the pathologist designated the tumor region and circled it with a solid line. (c) shows the analysis results using training images and analysis images at 5x magnification. The tumor region designated by the pathologist was generally able to be distinguished as tumor region. Additionally, the non-tumor region designated by the pathologist was generally able to be distinguished as non-tumor region. The non-tissue region designated by the pathologist was generally able to be distinguished as non-tissue region. (d) shows the analysis results using training images and analysis images at 20x magnification. As with the 5x magnification, each region designated by the pathologist was generally able to be distinguished.

拡大倍率５倍の解析用画像について、下式にしたがって、ホールスライドイメージ１枚に存在する組織領域における腫瘍領域の含有率を算出した。 For the analysis images at 5x magnification, the tumor area content in the tissue area present in one whole slide image was calculated according to the following formula:

腫瘍領域の含有率（％）＝｛［腫瘍領域と判定された画素数］／［（腫瘍領域と判定された画素数）＋（非腫瘍領域と判定された画素数）］｝×１００ Tumor area content (%) = {[Number of pixels identified as tumor area] / [(Number of pixels identified as tumor area) + (Number of pixels identified as non-tumor area)]} x 100

腫瘍領域の含有率は５４．５％であった。 The tumor area content was 54.5%.

また、図２１に各拡大倍率の感度（ａ）および陽性的中率（ｂ）を示す。この結果から、訓練用画像および解析用画像を取得する際の組織標本の拡大倍率は、５から２０倍が適切であると考えられた。また、図２１にウィンドウサイズの画素数の違いによる感度（ｃ）および陽性的中率（ｄ）の変化を示す。解析用画像は拡大倍率５倍で撮像した。ウィンドウサイズの画素数は、１２５×１２５画素から２００×２００画素が好ましいと考えられた。 Figure 21 also shows the sensitivity (a) and positive predictive value (b) for each magnification. From these results, it was determined that a magnification of 5 to 20 times is appropriate for the tissue specimen when acquiring training images and analysis images. Figure 21 also shows the change in sensitivity (c) and positive predictive value (d) due to differences in the number of pixels in the window size. The analysis images were captured at a magnification of 5 times. A window size of 125 x 125 to 200 x 200 pixels was considered preferable.

今回１枚のホールスライドイメージの解析時間は、数分～３０分程度であった。 In this study, the analysis time for one whole slide image was approximately several minutes to 30 minutes.

＜付記事項＞
ある実施形態は、ニューラルネットワーク構造の深層学習アルゴリズム（６０）を用いて、組織の画像を解析する画像解析方法に関する。前記画像解析方法では、解析対象の組織を含む解析対象画像（７８）から解析用データ（８０）を生成し、解析用データ（８０）を、深層学習アルゴリズム（６０）に入力し、深層学習アルゴリズム（６０）によって、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成してもよい。本発明によれば、解析対象の組織における解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成することができる。 <Additional Notes>
One embodiment relates to an image analysis method for analyzing a tissue image using a deep learning algorithm (60) with a neural network structure. The image analysis method may generate analysis data (80) from an analysis target image (78) including the tissue to be analyzed, input the analysis data (80) to a deep learning algorithm (60), and generate data indicating whether a region included in the analysis data is a tumor region using the deep learning algorithm (60). According to the present invention, data indicating whether a region included in the analysis data in the tissue to be analyzed is a tumor region can be generated.

前記実施形態において、好ましくは、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータは、腫瘍領域の領域を示すデータである。この実施形態により、ユーザが一目で解析対象の組織における腫瘍領域を把握することができる。 In the above embodiment, preferably, the data indicating whether an area included in the analysis data is a tumor area is data indicating the area of the tumor area. This embodiment allows the user to grasp the tumor area in the tissue to be analyzed at a glance.

前記実施形態において、好ましくは、前記解析対象画像は、組織診断用標本の画像であり、２以上の原色を組み合わせた色相を含む。この実施形態により、病理診断を一助することができる。 In the above embodiment, the image to be analyzed is preferably an image of a tissue diagnostic specimen and includes hues that combine two or more primary colors. This embodiment can aid in pathological diagnosis.

前記実施形態において、好ましくは、前記解析対象画像は、前記解析対象の組織を３倍から２０倍に拡大して撮像された画像である。前記実施形態において、解析対象画像に含まれる領域の大きさが、２００μｍ×２００μｍ以上４００μｍ×４００μｍ以下である。これらの実施形態により、より精度の高い判別を行うことができる。 In the above-described embodiments, the image to be analyzed is preferably an image of the tissue to be analyzed captured at a magnification of 3 to 20 times. In the above-described embodiments, the size of the area included in the image to be analyzed is 200 μm x 200 μm or more and 400 μm x 400 μm or less. These embodiments enable more accurate determination.

前記実施形態において、好ましくは、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータに基づいて、腫瘍細胞の領域とそれ以外の領域とを区別して提示するためのデータを生成する。この実施形態により、ユーザが一目で解析対象の組織における腫瘍領域を把握することができる。 In the above embodiment, data is preferably generated to distinguish between tumor cell regions and other regions based on data indicating whether a region included in the analysis data is a tumor region. This embodiment allows the user to grasp the tumor region in the tissue being analyzed at a glance.

前記実施形態において、好ましくは、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータに基づいて、腫瘍細胞の領域とそれ以外の領域との境界を示すデータを生成する。この実施形態により、ユーザが一目で解析対象の組織における腫瘍領域と非腫瘍領域の境界を把握することができる。 In the above embodiment, data indicating the boundary between tumor cell regions and other regions is preferably generated based on data indicating whether a region included in the analysis data is a tumor region. This embodiment allows the user to grasp at a glance the boundary between tumor regions and non-tumor regions in the tissue being analyzed.

前記実施形態において、好ましくは、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータに基づいて、前記解析対象の組織における腫瘍領域の含有率を示すデータを生成する。この実施形態により、ユーザが解析対象の組織における腫瘍領域の含有率を把握することができる。 In the above embodiment, data indicating the percentage of tumor regions in the tissue to be analyzed is preferably generated based on data indicating whether or not a region included in the analysis data is a tumor region. This embodiment allows the user to grasp the percentage of tumor regions in the tissue to be analyzed.

前記実施形態において、好ましくは、解析用データに含まれる領域が腫瘍領域であるか否かを示すデータに基づいて、前記解析対象の組織における腫瘍領域と非腫瘍領域の比率を示すデータを生成する。この実施形態により、ユーザが解析対象の組織における腫瘍領域の含有率を把握することができる。また、遺伝子検査等に使用されるサンプルが適切に採取されているか知ることができる。 In the above embodiment, data indicating the ratio of tumorous to non-tumorous areas in the tissue to be analyzed is preferably generated based on data indicating whether an area included in the analysis data is a tumorous area. This embodiment allows the user to understand the proportion of tumorous areas in the tissue to be analyzed. It also allows the user to know whether samples to be used for genetic testing, etc., have been collected appropriately.

前記実施態様により、好ましくは、１つの解析対象画像について、腫瘍領域と、非腫瘍領域と、組織を含まない領域とを区別して提示するためのデータを生成する。この実施形態により、ユーザが一目で解析対象の組織における腫瘍領域の有無を把握することができる。 In this embodiment, data is preferably generated for distinguishing and presenting tumorous regions, non-tumorous regions, and regions that do not contain tissue for a single image to be analyzed. This embodiment allows the user to grasp at a glance whether or not there is a tumorous region in the tissue to be analyzed.

前記実施形態において、好ましくは、１つの解析対象画像について所定画素数の領域毎に応じた複数の解析用データを生成する。この実施形態により、解析対象の組織の広い範囲について、精度よく解析を行うことができる。 In the above embodiment, preferably, multiple pieces of analysis data are generated for each region of a predetermined number of pixels for one image to be analyzed. This embodiment enables accurate analysis of a wide range of tissue to be analyzed.

前記実施形態において、好ましくは、深層学習アルゴリズム（６０）は、入力された前記解析用データに対し前記所定画素数の領域が腫瘍領域および非腫瘍領域から選択される少なくとも一方を含むことを示すラベルを生成する。 In the above embodiment, preferably, the deep learning algorithm (60) generates a label indicating that the region of the specified number of pixels contains at least one selected from a tumor region and a non-tumor region for the input analysis data.

前記実施形態において、好ましくは、ニューラルネットワーク（５０）の入力層（５０ａ）のノード数が、解析用データ（８０）の前記所定画素数と前記組み合わせた原色の数との積に対応している。この実施形態により、精度の高い解析を行うことができる。 In the above embodiment, the number of nodes in the input layer (50a) of the neural network (50) preferably corresponds to the product of the predetermined number of pixels in the analysis data (80) and the number of combined primary colors. This embodiment enables highly accurate analysis.

前記実施形態において、好ましくは、前記標本が染色された標本であり、前記解析対象画像は、前記染色された標本を顕微鏡の明視野下で撮像した画像である。この実施形態により、病理診断を一助することができる。 In the above embodiment, preferably, the specimen is a stained specimen, and the image to be analyzed is an image of the stained specimen captured under a bright field microscope. This embodiment can aid in pathological diagnosis.

前記実施形態において、好ましくは、前記深層学習アルゴリズムの学習に用いられる訓練データが、個体から採取された腫瘍領域を含む組織の標本に対して明視野観察用染色を施して作製された標本の染色像を顕微鏡の明視野下で撮像した明視野画像に基づいて生成されている。この実施形態により、組織診断を一助することができる。 In the above embodiment, preferably, the training data used to train the deep learning algorithm is generated based on bright-field images of tissue specimens, including tumor regions, extracted from an individual and stained for bright-field observation, captured under a bright-field microscope. This embodiment can aid in tissue diagnosis.

前記実施形態において、好ましくは、前記明視野観察用染色は、核染色にヘマトキシリンを用いる。より好ましくは、前記明視野観察用染色が、ヘマトキシリン・エオジン染色（ＨＥ染色）である。この実施形態により、組織診断として汎用性の高い検査を網羅することができる。 In the above embodiment, preferably, the staining for brightfield observation uses hematoxylin for nuclear staining. More preferably, the staining for brightfield observation is hematoxylin-eosin staining (HE staining). This embodiment enables a wide range of highly versatile tests to be performed in tissue diagnosis.

前記実施形態において、好ましくは、前記訓練データが、前記明視野画像から判定された、腫瘍領域であることを示すラベル値を含む。より好ましくは、前記訓練データが、前記ラベル値を前記明視野画像の所定画素数の領域毎に含む。この実施形態により、腫瘍領域の訓練データを生成することができる。 In the above embodiment, the training data preferably includes a label value indicating a tumor region determined from the bright-field image. More preferably, the training data includes the label value for each region of a predetermined number of pixels in the bright-field image. This embodiment makes it possible to generate training data for tumor regions.

前記実施形態において、好ましくは、前記訓練データが、前記明視野画像における所定画素数の領域毎に生成されている。この態様により、深層学習アルゴリズム（５０）の学習効率を上げることができる。 In the above embodiment, the training data is preferably generated for each region of a predetermined number of pixels in the bright-field image. This aspect improves the learning efficiency of the deep learning algorithm (50).

前記実施形態において、好ましくは、前記深層学習アルゴリズムが、前記解析用データを、前記解析対象画像に含まれる組織が腫瘍領域であることを示すクラスに分類する。 In the above embodiment, preferably, the deep learning algorithm classifies the analysis data into a class indicating that the tissue included in the image to be analyzed is a tumor region.

前記実施形態において、好ましくは、前記ニューラルネットワーク（５０）の出力層（５０ｂ）がソフトマックス関数を活性化関数とするノードである。この実施形態により、学習効率および解析精度を上げることができる。 In the above embodiment, preferably, the output layer (50b) of the neural network (50) is a node that uses a softmax function as an activation function. This embodiment can improve learning efficiency and analysis accuracy.

前記実施形態において、好ましくは、前記深層学習アルゴリズムは、解析用データ（７８）が入力される度に、前記解析対象画像に含まれる組織が腫瘍領域であることを示すデータを所定画素数の領域毎に生成する。この実施形態により、解析効率を上げることができる。 In the above embodiment, preferably, the deep learning algorithm generates data indicating that the tissue included in the image to be analyzed is a tumor region for each region of a predetermined number of pixels each time analysis data (78) is input. This embodiment can improve analysis efficiency.

前記実施形態において、好ましくは、深層学習アルゴリズム（６０）が、前記組織の種類に応じて生成されている。さらに、好ましくは、前記組織の種類に応じて複数の前記深層学習アルゴリズムの中から選択された、前記解析対象の組織の種類に対応する前記深層学習アルゴリズムを用いて、前記解析用データを処理する。この実施形態により、解析精度を上げることができる。 In the above embodiment, preferably, a deep learning algorithm (60) is generated according to the type of tissue. Furthermore, preferably, the analysis data is processed using a deep learning algorithm corresponding to the type of tissue to be analyzed, selected from among a plurality of deep learning algorithms according to the type of tissue. This embodiment can improve the accuracy of analysis.

ある実施形態は、ニューラルネットワーク構造の深層学習アルゴリズム（６０）を用いて、組織の画像を解析する画像解析装置（１００）に関する。前記解析装置は、解析対象の組織を含む解析対象画像（７８）から解析用データ（８０）を生成し、解析用データ（８０）を、深層学習アルゴリズム（５０）に入力し、深層学習アルゴリズム（５０）によって、前記解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成する処理部（１０）を備えてもよい。 One embodiment relates to an image analysis device (100) that analyzes tissue images using a deep learning algorithm (60) with a neural network structure. The analysis device may include a processing unit (10) that generates analysis data (80) from an analysis target image (78) that includes the tissue to be analyzed, inputs the analysis data (80) to a deep learning algorithm (50), and uses the deep learning algorithm (50) to generate data indicating whether a region included in the analysis data is a tumor region.

ある実施形態は、ニューラルネットワーク構造の深層学習アルゴリズム（６０）を用いて、組織の画像を解析するコンピュータプログラムに関する。前記コンピュータに、解析対象の組織を含む解析対象画像（７８）から解析用データ（８０）を生成する処理と、解析用データ（８０）を、深層学習アルゴリズム（６０）に入力する処理と、深層学習アルゴリズム（６０）によって、前記解析用データに含まれる領域が腫瘍領域であるか否かを示すデータを生成する処理と、を実行させてもよい。 One embodiment relates to a computer program that analyzes tissue images using a deep learning algorithm (60) with a neural network structure. The computer may be configured to perform the following steps: generate analysis data (80) from an analysis target image (78) containing the tissue to be analyzed; input the analysis data (80) into the deep learning algorithm (60); and generate, using the deep learning algorithm (60), data indicating whether a region included in the analysis data is a tumor region.

ある実施態様は、深層学習アルゴリズム（５０）の学習に用いられる訓練データを訓練用の組織標本から取得した訓練用画像に基づいて生成するステップと、前記訓練データをニューラルネットワーク（５０）に学習させる学習させるステップと、を含み、前記生成ステップは、腫瘍領域を撮像した第１の訓練用画像（７０Ｒ１）に対応する第１の訓練データ（７４Ｒ１）を取得する第１の取得ステップと、非腫瘍領域を撮像した第２の訓練用画像（７０Ｒ２）に対応する第２の訓練データ（７４Ｒ２）を取得する第２の取得ステップと、組織を含まない領域を撮像した第３の訓練用画像（７０Ｒ３）に対応する第３の訓練データ（７４Ｒ３）を取得する第３の取得ステップと、を含み、前記学習ステップは、前記第１の訓練データ（７４Ｒ１）が、腫瘍領域であることをニューラルネットワーク（５０）に学習させる第１の学習ステップと、前記第２の訓練データ（７４Ｒ２）が、非腫瘍領域であることをニューラルネットワーク（５０）に学習させる第２の学習ステップと、前記第３の訓練データ（７４Ｒ３）が、組織を含まない領域であることをニューラルネットワーク（５０）に学習させる第３の学習ステップと、を含む、学習済み深層学習アルゴリズムの生成方法に関する。好ましくは、前記第１の訓練データ（７４Ｒ１）と、第２の訓練データ（７４Ｒ２）と、第３の訓練データ（７４Ｒ３）とをニューラルネットワーク（５０）の入力層（５０ａ）とし、腫瘍領域であることと、非腫瘍領域であることと、組織を含まない領域であることとをそれぞれ前記第１の訓練データ（７４Ｒ１）と、第２の訓練データ（７４Ｒ２）と、第３の訓練データ（７４Ｒ３）とに対応するニューラルネットワーク（５０）の出力層（５０ｂ）とする。さらに好ましくは、前記第１の取得ステップの前に、前記第１の訓練用画像（７０Ｒ１）から、前記第１の訓練データ（７４Ｒ１）を生成するステップと、前記第２の取得ステップの前に、前記第２の訓練用画像（７０Ｒ２）から、前記第２の訓練データ（７４Ｒ２）を生成するステップと、前記第３の取得ステップの前に、前記第３の訓練用画像（７０Ｒ３）から、前記第３の訓練データ（７４Ｒ３）を生成するステップと、をさらに含んでもよい。本実施形態によれば、解析対象の組織が、腫瘍領域および非腫瘍領域から選択される少なくとも一方を含むことを判別する、学習済み深層学習アルゴリズムの生成することができる。 One embodiment includes a step of generating training data used for training a deep learning algorithm (50) based on training images acquired from a training tissue specimen, and a step of training the neural network (50) to learn the training data, wherein the generating step includes a first acquisition step of acquiring first training data (74R1) corresponding to a first training image (70R1) obtained by imaging a tumor region, a second acquisition step of acquiring second training data (74R2) corresponding to a second training image (70R2) obtained by imaging a non-tumor region, and a third acquisition step of acquiring a third training image (74R3) obtained by imaging a region not containing tissue. and a third acquisition step of acquiring third training data (74R3) corresponding to the first training data (74R1) that corresponds to the first training data (74R2) that corresponds to the second training data (74R3), wherein the learning step includes a first learning step of making a neural network (50) learn that the first training data (74R1) is a tumor region, a second learning step of making the neural network (50) learn that the second training data (74R2) is a non-tumor region, and a third learning step of making the neural network (50) learn that the third training data (74R3) is a region that does not contain tissue. Preferably, the first training data (74R1), the second training data (74R2), and the third training data (74R3) are used as an input layer (50a) of the neural network (50), and a tumor region, a non-tumor region, and a region not containing tissue are used as an output layer (50b) of the neural network (50) corresponding to the first training data (74R1), the second training data (74R2), and the third training data (74R3), respectively. More preferably, the method may further include the steps of generating the first training data (74R1) from the first training image (70R1) before the first acquisition step, generating the second training data (74R2) from the second training image (70R2) before the second acquisition step, and generating the third training data (74R3) from the third training image (70R3) before the third acquisition step. According to this embodiment, a trained deep learning algorithm can be generated that determines whether the tissue to be analyzed includes at least one selected from a tumorous region and a non-tumorous region.

前記生成方法において、好ましくは、前記訓練用画像が、個体から採取された組織に対して明視野観察用染色を施して作製された標本の染色像を顕微鏡の明視野下で撮像した明視野画像である。 In the above generation method, the training images are preferably bright-field images obtained by capturing stained images of specimens prepared by staining tissue collected from an individual with staining agents for bright-field observation under a bright-field microscope.

ある実施形態は、第１の訓練データ（７４Ｒ１）と、第２の訓練データ（７４Ｒ２）と、第３の訓練データ（７４Ｒ３）とをニューラルネットワーク（５０）の入力層（５０ａ）とし、腫瘍領域であることと、非腫瘍領域であることと、組織を含まない領域であることとをそれぞれ前記第１の訓練データ（７４Ｒ１）と、第２の訓練データ（７４Ｒ２）と、第３の訓練データ（７４Ｒ３）とに対応するニューラルネットワーク（５０）の出力層（５０ｂ）として学習させた深層学習アルゴリズムであって、前記第１の訓練データ（７４Ｒ１）は、訓練用の組織の腫瘍領域を撮像した第１の訓練用画像（７０Ｒ１）から生成され、前記第２の訓練データ（７４Ｒ２）は、訓練用の組織の非腫瘍領域を撮像した第２の訓練用画像（７０Ｒ２）から生成され、前記第３の訓練データ（７４Ｒ３）は、組織を含まない領域を撮像した第３の訓練用画像（７０Ｒ３）から生成される、学習済み深層学習アルゴリズム（６０）であってもよい。 In one embodiment, first training data (74R1), second training data (74R2), and third training data (74R3) are used as the input layer (50a) of a neural network (50), and tumor regions, non-tumor regions, and regions not containing tissue are learned as the output layer (50b) of the neural network (50) corresponding to the first training data (74R1), second training data (74R2), and third training data (74R3), respectively. The trained deep learning algorithm (60) may be a trained deep learning algorithm, wherein the first training data (74R1) is generated from a first training image (70R1) capturing an image of a tumorous region of training tissue, the second training data (74R2) is generated from a second training image (70R2) capturing an image of a non-tumorous region of training tissue, and the third training data (74R3) is generated from a third training image (70R3) capturing an image of a region not containing tissue.

１０（１０Ａ，１０Ｂ）処理部
２０（２０Ａ，２０Ｂ，２０Ｃ）処理部
１１，２１ＣＰＵ
１２，２２メモリ
１３，２３記録部
１４，２４バス
１５，２５インタフェース部
１６，２６入力部
１７，２７出力部
１９，２９ＧＰＵ
５０ニューラルネットワーク（深層学習アルゴリズム）
５０ａ入力層
５０ｂ出力層
５０ｃ中間層
６０学習済みのニューラルネットワーク（学習済みの深層学習アルゴリズム）
６０ａ入力層
６０ｂ出力層
６０ｃ中間層
７０Ｗ１ホールスライドイメージ
７０Ｗ２ホールスライドイメージ
７０Ｗ３ホールスライドイメージ
７０Ｒ１Ｌ第１の予備訓練データ
７０Ｒ２Ｌ第２の予備訓練データ
７０Ｒ３Ｌ第３の予備訓練データ
７２Ｒ１ｒ，７２Ｒ１ｇ，７２Ｒ１ｂ色濃度符号化図
７２Ｒ２ｒ，７２Ｒ２ｇ，７２Ｒ２ｂ色濃度符号化図
７２Ｒ３ｒ，７２Ｒ３ｇ，７２Ｒ３ｂ色濃度符号化図
７４Ｒ１，７４Ｒ２，７４Ｒ３訓練データ
７５Ｒ１，７５Ｒ２，７５Ｒ３ウィンドウサイズの訓練データ
７６色濃度値
７７Ｒ１，７７Ｒ２，７７Ｒ３真値像のラベル値
７８解析対象の明視野画像
７９ｒ，７９ｇ，７９ｂ解析対象の色濃度符号化図
８０解析用データ
８１色濃度値
８２判別結果（画素の推定値）
８３解析結果の３値画像
８４領域強調画像
８９（８９ａ，８９ｂ）ノード
９８記録媒体
９９ネットワーク
１００ベンダ側装置
１００Ａ深層学習装置
１００Ｂ統合型の画像解析装置
１０１訓練データ生成部
１０２訓練データ入力部
１０３アルゴリズム更新部
１０４ウィンドウサイズデータベース
１０５アルゴリズムデータベース
２００ユーザ側装置
２００Ａ画像解析装置
２００Ｂ統合型の画像解析装置
２００Ｃ端末装置
２０１解析用データ生成部
２０２解析用データ入力部
２０３解析部
２０４領域検出部
３００，４００撮像装置
３０１，４０１撮像素子
３０８，４０８試料組織
３０９，４０９ステージ
Ｗ１ウィンドウ
Ｗ２ウィンドウ 10 (10A, 10B) Processing unit 20 (20A, 20B, 20C) Processing units 11, 21 CPU
12, 22 Memory 13, 23 Recording unit 14, 24 Bus 15, 25 Interface unit 16, 26 Input unit 17, 27 Output unit 19, 29 GPU
50 Neural Networks (Deep Learning Algorithms)
50a Input layer 50b Output layer 50c Hidden layer 60 Trained neural network (trained deep learning algorithm)
60a Input layer 60b Output layer 60c Hidden layer 70W1 Hole slide image 70W2 Hole slide image 70W3 Hole slide image 70R1L First preliminary training data 70R2L Second preliminary training data 70R3L Third preliminary training data 72R1r, 72R1g, 72R1b Color density coded map 72R2r, 72R2g, 72R2b Color density coded map 72R3r, 72R3g, 72R3b Color density coded map 74R1, 74R2, 74R3 Training data 75R1, 75R2, 75R3 Window size training data 76 Color density values 77R1, 77R2, 77R3 Label value of true value image 78 Bright-field images to be analyzed 79r, 79g, 79b Color density coded map to be analyzed 80 Analysis data 81 Color density values 82 Discrimination result (pixel estimate)
83 Ternary image of analysis result 84 Region-enhanced image 89 (89a, 89b) Node 98 Recording medium 99 Network 100 Vendor-side device 100A Deep learning device 100B Integrated image analysis device 101 Training data generation unit 102 Training data input unit 103 Algorithm update unit 104 Window size database 105 Algorithm database 200 User-side device 200A Image analysis device 200B Integrated image analysis device 200C Terminal device 201 Analysis data generation unit 202 Analysis data input unit 203 Analysis unit 204 Region detection unit 300, 400 Imaging device 301, 401 Imaging element 308, 408 Sample structure 309, 409 Stage W1 Window W2 Window

Claims

An image analysis method for analyzing tissue images using a deep learning algorithm with a neural network structure, comprising:
Accepts input of the type of tissue to be analyzed,
By referring to an algorithm database that stores a plurality of deep learning algorithms generated according to the type of tissue, and a window size database that stores the number of pixels of the window size of an image that is to be input to the deep learning algorithm at one time, the deep learning algorithm and the number of pixels corresponding to the received type of tissue to be analyzed are set;
generating analysis data from the analysis target image including the tissue to be analyzed according to the set number of pixels;
inputting the analysis data into the configured deep learning algorithm;
and displaying an image of the tissue to be analyzed in a manner that allows a tumor region to be distinguished from other regions, based on data that indicates whether a pixel region consisting of a plurality of pixels included in the analysis data is a tumor region, the data being output from the deep learning algorithm to which the analysis data has been input.
Image analysis methods.

The image analysis method of claim 1, wherein each of the plurality of deep learning algorithms is associated with a tissue type and stored in the algorithm database.

The image analysis method described in claim 2, wherein at least one of the plurality of deep learning algorithms is stored in the algorithm database in association with one tissue type.

The image analysis method described in any one of claims 1 to 3, wherein the deep learning algorithm corresponding to the type of tissue to be analyzed is selected based on a user's operation via an input unit.

The image analysis method of any one of claims 1 to 4, further comprising: applying a rendering process to the image to be analyzed based on the data to display tumor areas and other areas in a distinguishable manner.

The image analysis method of claim 5, wherein the drawing process includes drawing the tumor region and other regions in different colors.

The image analysis method according to any one of claims 1 to 6, wherein the image to be analyzed is an image of a tissue diagnostic specimen, and the image to be analyzed includes a hue that is a combination of two or more primary colors.

The image analysis method according to any one of claims 1 to 7, wherein the image to be analyzed is an image of the tissue to be analyzed captured at a magnification of 3x to 20x.

An image analysis method according to any one of claims 1 to 8, wherein the analysis data corresponds to an image cut out from the image to be analyzed, and a plurality of pieces of analysis data are generated from the image to be analyzed.

The image analysis method according to any one of claims 1 to 9, wherein the image to be analyzed is an image of a tissue diagnostic specimen, the tissue diagnostic specimen being a stained specimen, and the image to be analyzed is an image of the stained specimen captured under bright field conditions with a microscope.

The image analysis method of claim 10, wherein the training data used to train the deep learning algorithm is generated based on bright-field images of a tissue specimen containing a tumor region taken from an individual, stained for bright-field observation, and then captured under bright-field microscope conditions.

The image analysis method described in any one of claims 1 to 11, wherein the output layer of the neural network is a node that uses a softmax function as an activation function.

An image analysis device that analyzes tissue images using a deep learning algorithm with a neural network structure,
A display unit;
a control unit that receives an input of a type of tissue to be analyzed, and references an algorithm database that stores a plurality of deep learning algorithms generated according to the type of tissue, and a window size database that stores the number of pixels of a window size of an image that is to be input to the deep learning algorithm at one time, to set the deep learning algorithm and the number of pixels corresponding to the received type of tissue to be analyzed, generates analysis data from the analysis target image including the tissue to be analyzed according to the set number of pixels, inputs the analysis data to the set deep learning algorithm, and displays the image of the tissue to be analyzed on the display unit so that a tumor region and other regions can be distinguished based on data that indicates whether a pixel region consisting of a plurality of pixels included in the analysis data is a tumor region, the data being output from the deep learning algorithm to which the analysis data has been input.
Image analysis device.

The image analysis device of claim 13, wherein the algorithm database stores each of the plurality of deep learning algorithms in association with a tissue type.

The image analysis device of claim 14, wherein the algorithm database stores at least one of the plurality of deep learning algorithms in association with one tissue type.

further comprising an input unit operated by a user,
The image analysis device according to claim 13 , wherein the control unit selects the deep learning algorithm corresponding to the type of tissue to be analyzed based on an operation via the input unit.

The image analysis device of any one of claims 13 to 16, wherein the control unit performs a drawing process on the image to be analyzed based on the data to display a tumorous region and other regions in a distinguishable manner.

The image analysis device of claim 17, wherein the drawing process includes drawing the tumor region and other regions in different colors.

An image analysis device according to any one of claims 13 to 18, wherein the analysis data corresponds to an image cut out from the image to be analyzed, and a plurality of pieces of analysis data are generated from the image to be analyzed.

A computer program for analyzing tissue images using a deep learning algorithm with a neural network structure, comprising:
On the computer,
receiving input of the type of tissue to be analyzed;
a process of setting the deep learning algorithm and the number of pixels corresponding to the received type of tissue to be analyzed by referring to an algorithm database that stores multiple deep learning algorithms generated according to the type of tissue and a window size database that stores the number of pixels of the window size of an image that is to be input to the deep learning algorithm at one time;
generating analysis data from the analysis target image including the tissue to be analyzed according to the set number of pixels;
inputting the analysis data into the set deep learning algorithm;
and displaying an image of the tissue to be analyzed in a manner that allows a tumor region to be distinguished from other regions, based on data indicating whether a pixel region consisting of a plurality of pixels included in the analysis data is a tumor region, the data being output from the deep learning algorithm to which the analysis data has been input.
Computer program.