JP7650509B2

JP7650509B2 - HEALTH MANAGEMENT SYSTEM, HEALTH MANAGEMENT DEVICE, HEALTH MANAGEMENT PROGRAM, AND HEALTH MANAGEMENT METHOD

Info

Publication number: JP7650509B2
Application number: JP2021567503A
Authority: JP
Inventors: 鉄之助原勢; 頼夫熊本; 康宏大宮
Original assignee: PST CORPORATION,INC.
Current assignee: PST CORPORATION,INC.
Priority date: 2019-12-24
Filing date: 2020-12-22
Publication date: 2025-03-25
Anticipated expiration: 2040-12-22
Also published as: TW202133150A; JPWO2021132284A1; WO2021132284A1

Description

本発明は、健康管理システム、健康管理装置、健康管理プログラムおよび健康管理方法に関する。The present invention relates to a health management system, a health management device, a health management program, and a health management method.

２０００年に世界保健機関（ＷＨＯ）が健康寿命を提唱して以来、単に寿命を延ばすのではなく、心身ともに自立し、健康的に生活できる期間を如何に延ばすかに関心が高まっている。Since the World Health Organization (WHO) advocated healthy lifespan in 2000, there has been growing interest not just in extending lifespan, but in how to extend the period during which people can live independently and healthily both physically and mentally.

日本において、平均寿命と健康寿命（健康上の問題で日常生活が制限されることがない期間）の差は、２０１６年には男性８．８４年、女性１２．３４年であった（非特許文献１参照）。In Japan, the difference between average life expectancy and healthy life expectancy (the period during which daily life is not restricted by health problems) was 8.84 years for men and 12.34 years for women in 2016 (see non-patent document 1).

国民が健康を維持することは、個人の幸せのベースであると同時に、疾病重症化による生産性の低下を防ぎ、また、医療費や介護給付費用が発生する期間の増大を防ぐことができ、社会保障負担の増大も抑制される。Maintaining the health of citizens is the basis of individual happiness, and at the same time, it prevents a decline in productivity due to the worsening of illness, prevents an increase in the period during which medical expenses and nursing care costs are incurred, and curbs the increase in social security burdens.

一方で、アスリート等の特殊な場合を除いては、個人としては病気になって初めて健康の大切さに気付くことが多く、即ち普段は健康に対する意識が高くないことから、早期発見が困難な場合がある。On the other hand, except for special cases such as athletes, individuals often only realize the importance of health when they become ill. In other words, since people are not normally very health conscious, early detection can be difficult.

例えば、健診として（健康診断、健康診査、人間ドックを含む）、半ば強制的に行われる会社の健康診断では受診率が７７％であるのに対し、自営業主、家族従業員、家事専業者、契約社員などは、５０％程度である。受診しなかった理由は、時間が取れなかった、費用がかかる、健診があることを知らなかったなどさまざまであるが、結局は健康に対する意識（ヘルスリテラシー）が低く、病気を予防又は早期発見することに対する意義を過小評価しているものと言える。For example, in the case of health checkups (including medical examinations, health checkups, and medical check-ups), which are semi-mandatory company health checkups, the attendance rate is 77%, whereas the attendance rate for self-employed persons, family employees, housekeepers, contract employees, etc. is around 50%. There are various reasons for not getting a checkup, such as not being able to make time, the cost, or not knowing that a checkup existed, but ultimately it can be said that this is due to low health literacy and an underestimation of the importance of preventing or detecting illness early.

一方、精神・神経疾患については、たとえ健診を受けたとしても、検査項目にほとんど入っていないという問題点がある。理由は、精度の高いバイオマーカーが無いこと、また検査を行うにも高価な装置が必要であることなどによる。更に、うつ病に対する質問票はＢＤＩ、ＰＨＱ－９など世界的にも共通で用いられているものがあるが、被験者の意識的な評価懸念や無意識の防衛、検査者の主観が入りうるため客観性に限界がある。On the other hand, there is a problem that mental and neurological disorders are rarely included in the test items even if people undergo health checkups. This is because there are no highly accurate biomarkers, and expensive equipment is required to perform the tests. Furthermore, although there are questionnaires for depression that are commonly used around the world, such as the BDI and PHQ-9, there are limitations to their objectivity due to the possibility of the subject's conscious concerns about the evaluation, unconscious defenses, and the examiner's subjectivity being included.

また精神・神経系疾患は、その病気の可能性があることを他人に知られたくない、精神科へ通院することに抵抗感があるなどの理由から、自分自身で健康管理ができる状況が必要と考えられる。In addition, people with mental or neurological illnesses may not want others to know that they may have the illness, or may feel reluctant to visit a psychiatrist, so it is thought that they need to be able to manage their own health.

近年、発話による音声を解析することにより、感情や心的状態を推定する技術（特許文献１、２参照）が開示されており、音声を解析して人の状態を測り数値化することが可能になってきている。In recent years, technologies have been disclosed that estimate emotions and mental states by analyzing spoken voice (see Patent Documents 1 and 2), making it possible to measure and quantify a person's state by analyzing voice.

また、声紋による個人認証を行いデバイスにアクセスする権利を付与する技術（特許文献３参照）や、スマートホーム対応家電等の声によって機械を操作するための音声認識技術（特許文献４参照）が開示されている等、声を対人コミュニケーション以外に積極的に利用する場面も生じてきている。Furthermore, there are also situations where voice is actively used for purposes other than interpersonal communication, such as the disclosure of a technology that performs personal authentication using voiceprints and grants the right to access devices (see Patent Document 3) and a voice recognition technology that enables the operation of machines, such as smart home compatible home appliances, by voice (see Patent Document 4).

加えてスマートフォンの普及により、一人ひとりが通話デバイスを携帯していることから、必要であればいつでも発話が可能な状況となっている。In addition, with the spread of smartphones, everyone carries a calling device, making it possible to speak whenever necessary.

更に、声は録音され電子データとして保存しておけば、血液や尿のように劣化することがないので、何時でも必要に応じ、過去にさかのぼって解析することができるという利点を有する。Furthermore, if voice is recorded and stored as electronic data, it does not deteriorate like blood or urine, and therefore has the advantage that it can be analyzed retroactively at any time if necessary.

特開２００７－２９６１６９号公報JP 2007-296169 A 国際公開第２００６／１３２１５９号International Publication No. 2006/132159 米国公開第２０１６／０１１９３３８号U.S. Publication No. 2016/0119338 特開２０１４－２０６６４２号公報JP 2014-206642 A

厚生労働科学研究費補助金平成３０年度分担研究報告書２６－３９頁 “健康寿命の全国推移の算定・評価に関する研究”Ministry of Health, Labour and Welfare Science Research Grant FY2018 Research Report, pp. 26-39 "Research on the calculation and evaluation of nationwide trends in healthy life expectancy"

病気の予防や早期発見に繋がる検診率を高めるには、簡単に、自分で行え、費用も安く、日常生活の中でわざわざそのための機会を作る必要のない検査が求められている。そこで、本発明の目的は、誰でも、何処にいても、短時間で、非侵襲的に、他人に知られることなく、極めて簡便に測定し自身の健康状態を把握することができる健康管理システム、健康管理装置、健康管理プログラム及び健康管理方法を提供することにある。In order to increase the rate of medical examinations that lead to the prevention and early detection of diseases, there is a need for tests that are easy to perform by oneself, inexpensive, and do not require the user to make time for them in their daily lives. Therefore, the object of the present invention is to provide a health management system, a health management device, a health management program, and a health management method that allow anyone, anywhere, to measure and understand their own health condition in a short time, non-invasively, and without the knowledge of others.

本発明者らは、声を心身の状態を示すバイオマーカーとして利用することにより、健康な状態から、病気に罹患し、回復し、再び健康な状態に至るまで継続的に健康管理を行えることを見出し、本発明に到達した。The inventors discovered that by using the voice as a biomarker indicating the state of the mind and body, it is possible to continuously manage one's health from a healthy state to contracting an illness, recovering, and then returning to a healthy state, and thus arrived at the present invention.

本願に係る健康管理システムは、利用者から取得された音声データを継続的に格納する第１のサーバと、利用者の識別データを格納する第２のサーバと、利用者向けの出力データを格納する第３のサーバと、および音声データを基に抽出された音響特徴量を入力として所定の疾患の推定を行う推定装置、を備える健康管理システムであって、推定装置は、音声データを基に音響特徴量を抽出する抽出部と、音響特徴量を入力として所定の疾患の推定を行う推定部と、および推定部の推定結果および利用者の識別データを基に、第３のサーバの出力データを選択する選択部、を備え、選択部は、利用者がいずれの疾患も罹患していないと推定部に推定された場合は、第３のサーバの罹患前の利用者向けの第１の出力データを選択するように構成され、利用者がいずれかの疾患を罹患している疑いがあると推定部に推定された場合は、第３のサーバの罹患の疑いがある利用者向けの第２の出力データを選択するように構成され、かつ利用者が医師の診断によりいずれかの疾患を診断された後は、第２のサーバの識別データが更新され、第３のサーバの罹患後の利用者向けの第３の出力データを選択し、かつ寛解後も継続的に第３の出力データを選択するように構成されることを特徴とする。The health management system according to the present application is a health management system comprising a first server that continuously stores voice data acquired from a user, a second server that stores identification data of the user, a third server that stores output data for the user, and an estimation device that infers a predetermined disease using an acoustic feature extracted from the voice data as an input, the estimation device comprising an extraction unit that extracts an acoustic feature based on the voice data, an estimation unit that infers the predetermined disease using the acoustic feature as an input, and a selection unit that selects output data of the third server based on an estimation result of the estimation unit and the identification data of the user, and the selection unit: The system is characterized in that, when the estimation unit estimates that the user is not suffering from any disease, it is configured to select first output data from the third server for the user before the disease, and when the estimation unit estimates that the user is suspected of suffering from any disease, it is configured to select second output data from the third server for the user who is suspected of suffering from the disease, and after the user is diagnosed with any disease by a doctor's diagnosis, the identification data of the second server is updated, third output data from the third server for the user after the disease has been developed, and the third output data is continuously selected even after remission.

本願発明によれば、誰でも、何処にいても、短時間で、非侵襲的に、他人に知られることなく、極めて簡便に測定し自身の健康状態を把握することができる健康管理システム、健康管理装置、健康管理プログラム及び健康管理方法を提供することができる。According to the present invention, it is possible to provide a health management system, a health management device, a health management program, and a health management method that allow anyone, anywhere, to measure and understand their own health condition in an extremely simple manner, in a short time, non-invasively, and without others knowing.

図１は、本願の健康管理システム２００の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 1 is a hardware configuration diagram showing an example of a computer that realizes the functions of a health management system 200 of the present invention. 図２は、本願の実施形態に係る健康管理システム２００の構成図である。FIG. 2 is a configuration diagram of a health management system 200 according to an embodiment of the present application. 図３Ａは、１つ以上の音響特徴量を算出するための特徴量の式Ｆ（ａ）を示す図である。FIG. 3A illustrates a feature equation F(a) for calculating one or more acoustic features. 図３Ｂは、ある一つの音響特徴量の強度が、疾患毎に異なっていることを示すイメージ図である。FIG. 3B is an image diagram showing that the intensity of a certain acoustic feature varies for each disease. 図３Ｃは、ある３つの音響パラメータの組み合わせによって得られた疾患の予測値の分布画像である。FIG. 3C is a distribution image of disease predictive values obtained by combinations of three acoustic parameters. 図４Ａは、健常者または特定疾患と、それ以外の分離性能を示すＲＯＣ曲線のグラフを示す図である。FIG. 4A is a graph showing an ROC curve indicating the performance of separating healthy subjects or subjects with a specific disease from others. 図４Ｂは、健常者または特定疾患と、それ以外の分離性能を示すＲＯＣ曲線のグラフを示す図である。FIG. 4B is a graph showing an ROC curve indicating the performance of separating healthy subjects or subjects with a specific disease from others. 図５は、学習用データに用いるために各患者及び健常者が読み上げる例文を示す図である。FIG. 5 shows example sentences read by patients and healthy subjects for use as learning data. 図６は、利用者が健常、認知症群、または気分障害群であるかを推定した結果を示す図である。FIG. 6 is a diagram showing the results of estimating whether a user is in a healthy state, a dementia group, or a mood disorder group. 図７は、大うつ病と健常の分類において、フレーズ単位での推定プログラムの推定結果を示す図である。FIG. 7 shows the estimation results of the estimation program on a phrase-by-phrase basis in the classification of major depression and healthy subjects. 図８Ａは、推定プログラムに基づき、各フレーズに対して大うつ病または健常のいずれかと判定したものを、各人ごとに纏めた結果を示す図である。FIG. 8A shows the results of collating for each person the judgment that each phrase is either major depression or healthy based on the estimation program. 図８Ｂは、推定プログラムに基づき、各フレーズに対して大うつ病または健常のいずれかと判定したものを、各人ごとに纏めた結果を示す図である。FIG. 8B shows the results of collating for each person the judgment that each phrase was judged to be either major depression or healthy based on the estimation program. 図９は、各人ごとの健常判定率を求め、健常判定率が６０％以上の者を健常者と推定し、健常判定率が６０％未満の者を大うつ病患者と推定した場合の結果を示す図である。FIG. 9 shows the results when the healthy judgment rate was calculated for each individual, and individuals with a healthy judgment rate of 60% or more were estimated to be healthy individuals, and individuals with a healthy judgment rate of less than 60% were estimated to be major depression patients. 図１０は、レビー小体型認知症と健常の分類において、フレーズ単位での推定プログラムの推定結果を示す図である。FIG. 10 shows the estimation results of a phrase-by-phrase estimation program in classifying dementia with Lewy bodies from healthy subjects. 図１１Ａは、レビー小体型認知症の推定結果を示す図である。FIG. 11A is a diagram showing the estimation results of Lewy body dementia. 図１１Ｂは、レビー小体型認知症の推定結果を示す図である。FIG. 11B is a diagram showing the estimation results of Lewy body dementia. 図１２は、各人ごとの健常判定率を求め、健常判定率が６０％以上の者を健常者と推定し、健常判定率が６０％未満の者をレビー小体型認知症と判定した場合の結果を示す図である。Figure 12 shows the results when the healthy judgment rate was calculated for each individual, and those with a healthy judgment rate of 60% or more were presumed to be healthy, and those with a healthy judgment rate of less than 60% were judged to have Lewy body dementia. 図１３は、健康管理システム２００の動作を示すシーケンス図である。FIG. 13 is a sequence diagram showing the operation of the health management system 200. 図１４は、健康管理システム２００の動作を示すシーケンス図である。FIG. 14 is a sequence diagram showing the operation of the health management system 200. 図１５は、本健康管理システム２００の全体像を示す概略図である。FIG. 15 is a schematic diagram showing an overall image of the health management system 200. 図１６は、音声プラットフォーム装置または他システム（送受信、独自判定結果出力）または組込済みシステムを用いて利用者の登録を行う一例を示す図である。FIG. 16 is a diagram showing an example of registering a user using a voice platform device or another system (transmission/reception, output of unique determination results) or an embedded system. 図１７は、利用者の登録に関する流れを説明する図である。FIG. 17 is a diagram for explaining the flow of user registration. 図１８は、音声データの格納に関する流れを説明する図である。FIG. 18 is a diagram for explaining the flow of storing audio data. 図１９は、識別情報を登録済の利用者の音声解析の例を説明する図である。FIG. 19 is a diagram illustrating an example of a voice analysis of a user whose identification information has been registered. 図２０は、判別に関する例を説明する図である。FIG. 20 is a diagram for explaining an example regarding the determination. 図２１は、行動支援に関する例を説明する図である。FIG. 21 is a diagram for explaining an example related to action support. 図２２Ａは、システムに関する流れを説明する図である。FIG. 22A is a diagram for explaining the flow of the system. 図２２Ｂは、システムに関する流れを説明する図である。FIG. 22B is a diagram illustrating the flow of the system. 図２３は、データの送受信、音声プラットフォーム、音声解析装置、組込済みシステムの説明を行う図である。FIG. 23 is a diagram illustrating data transmission and reception, a voice platform, a voice analysis device, and an embedded system. 図２４は、音声プラットフォーム装置の説明を行う図である。FIG. 24 is a diagram illustrating the audio platform device. 図２５は、音声解析装置の説明を行う図である。FIG. 25 is a diagram illustrating a voice analysis device. 図２６は、他システムの説明を行う図である。FIG. 26 is a diagram for explaining another system. 図２７Ａは、他システムの説明を行う図である。FIG. 27A is a diagram illustrating another system. 図２７Ｂは、他システムの説明を行う図である。FIG. 27B is a diagram illustrating another system.

以下に、本願に係る疾患推定システムおよび疾患推定方法を実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る疾患推定システムおよび疾患推定方法が限定されるものではない。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。
〔１．プログラム〕 Hereinafter, a detailed description will be given of a form for carrying out the disease inference system and disease inference method according to the present application (hereinafter, referred to as an "embodiment") with reference to the drawings. Note that the disease inference system and disease inference method according to the present application are not limited to the embodiment. In addition, the same parts in each of the following embodiments are given the same reference numerals, and duplicated descriptions are omitted.
1. Program

実施形態に係る健康管理システム２００は、例えば図１に示すような構成のコンピュータ１００によって実現される。以下、例に挙げて説明する。図１は、健康管理システム２００の機能を実現するコンピュータの一例を示すハードウェア構成図である。コンピュータ１００は、ＣＰＵ１０１、ＲＡＭ１０２、ＲＯＭ１０３、ＨＤＤ１０４、通信インターフェイス（Ｉ／Ｆ）１０５、入出力インターフェイス（Ｉ／Ｆ）１０６、およびメディアインターフェイス（Ｉ／Ｆ）１０７を有する。A health management system 200 according to an embodiment is realized by a computer 100 having a configuration as shown in Fig. 1, for example. An example will be described below. Fig. 1 is a hardware configuration diagram showing an example of a computer that realizes the functions of the health management system 200. The computer 100 has a CPU 101, a RAM 102, a ROM 103, a HDD 104, a communication interface (I/F) 105, an input/output interface (I/F) 106, and a media interface (I/F) 107.

ＣＰＵ１０１は、ＲＯＭ１０３またはＨＤＤ１０４に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１０３は、コンピュータ１００の起動時にＣＰＵ１０１によって実行されるブートプログラムや、コンピュータ１００のハードウェアに依存するプログラム等を格納する。The CPU 101 operates based on a program stored in the ROM 103 or the HDD 104, and controls each unit. The ROM 103 stores a boot program executed by the CPU 101 when the computer 100 is started, programs that depend on the hardware of the computer 100, and the like.

ＨＤＤ１０４は、ＣＰＵ１０１によって実行されるプログラム、および、係るプログラムによって使用されるデータ等を格納する。通信インターフェイス１０５は、ネットワークＮを介して他の機器からデータを受信してＣＰＵ１０１へ送り、ＣＰＵ１０１が生成したデータを他の機器へ送信する。The HDD 104 stores programs executed by the CPU 101, data used by the programs, etc. The communication interface 105 receives data from other devices via the network N and sends it to the CPU 101, and transmits data generated by the CPU 101 to other devices.

ＣＰＵ１０１は、入出力インターフェイス１０６を介して、ディスプレイやプリンタ等の出力装置、および、マイク等の音声の入力装置、キーボードやマウス等の入力装置を制御する。ＣＰＵ１０１は、入出力インターフェイス１０６を介して、入力装置からデータを取得する。また、ＣＰＵ１０１は、生成したデータを入出力インターフェイス１０６を介して出力装置へ出力する。The CPU 101 controls output devices such as a display and a printer, and audio input devices such as a microphone, and input devices such as a keyboard and a mouse, via the input/output interface 106. The CPU 101 acquires data from the input devices via the input/output interface 106. The CPU 101 also outputs generated data to the output devices via the input/output interface 106.

メディアインターフェイス１０７は、記録媒体１０８に格納されたプログラムまたはデータを読み取り、ＲＡＭ１０２を介してＣＰＵ１０１に提供する。ＣＰＵ１０１は、係るプログラムを、メディアインターフェイス１０７を介して記録媒体１０８からＲＡＭ１０２上にロードし、ロードしたプログラムを実行する。記録媒体１０８は、例えばＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）、ＰＤ（ＰｈａｓｅｃｈａｎｇｅｒｅｗｒｉｔａｂｌｅＤｉｓｋ）等の光学記録媒体、ＭＯ（Ｍａｇｎｅｔｏ－Ｏｐｔｉｃａｌｄｉｓｋ）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。The media interface 107 reads a program or data stored in the recording medium 108 and provides it to the CPU 101 via the RAM 102. The CPU 101 loads the program from the recording medium 108 onto the RAM 102 via the media interface 107, and executes the loaded program. The recording medium 108 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or a PD (Phase change rewritable Disc), a magneto-optical recording medium such as an MO (Magneto-Optical Disk), a tape medium, a magnetic recording medium, or a semiconductor memory.

例えば、コンピュータ１００が実施形態に係る健康管理システム２００として機能する場合、コンピュータ１００のＣＰＵ１０１は、ＲＡＭ１０２上にロードされたプログラムを実行することにより、制御部の機能を実現する。また、ＨＤＤ１０４には、記憶部内のデータが格納される。コンピュータ１００のＣＰＵ１０１は、これらのプログラムを記録媒体１０８から読み取って実行するが、他の例として、他の装置からこれらのプログラムを取得してもよい。
〔２．健康管理システムの構成〕 For example, when the computer 100 functions as the health management system 200 according to the embodiment, the CPU 101 of the computer 100 executes a program loaded onto the RAM 102 to realize the function of the control unit. In addition, data in the storage unit is stored in the HDD 104. The CPU 101 of the computer 100 reads and executes these programs from the recording medium 108, but as another example, these programs may be obtained from another device.
2. Structure of the health management system

次に、実施形態に係る健康管理システム２００の構成について図２を用いて説明する。図２に示すように、健康管理システム２００は、ネットワークＮを介して、利用者端末２０１およびウェブサーバ２０２と有線または無線により通信可能に接続される。なお、健康管理システム２００は、他にも複数台の利用者端末２０１や複数台のウェブサーバ２０２と接続されてもよい。ウェブサーバ２０２には、健康管理システム２００の機能の一部、健診施設および医療施設のサーバが含まれる。Next, the configuration of a health management system 200 according to an embodiment will be described with reference to Fig. 2. As shown in Fig. 2, the health management system 200 is connected to a user terminal 201 and a web server 202 via a network N so as to be able to communicate with each other by wire or wirelessly. The health management system 200 may also be connected to a plurality of other user terminals 201 and a plurality of web servers 202. The web server 202 includes some of the functions of the health management system 200, and servers of health checkup facilities and medical facilities.

健康管理システム２００は、図２に示すように、通信部２０３、第１のサーバ２０４、第２のサーバ２０５、第３のサーバ２０６、および第４のサーバ２１０を備える。第１ないし第４のサーバはそれぞれ記憶部としての機能を有する。また、第４のサーバの一部は制御部として機能する。通信部２０３は、例えば、ＮＩＣ（ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）等によって実現される。通信部２０３は、ネットワークＮと有線または無線で接続され、利用者端末２０１との間で情報の送受信を行う。As shown in Fig. 2, the health management system 200 includes a communication unit 203, a first server 204, a second server 205, a third server 206, and a fourth server 210. Each of the first to fourth servers functions as a storage unit. Also, a part of the fourth server functions as a control unit. The communication unit 203 is realized by, for example, a NIC (Network Interface Card) or the like. The communication unit 203 is connected to the network N by wire or wirelessly, and transmits and receives information between the communication unit 203 and the user terminal 201.

記憶部は、例えば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、フラッシュメモリ（ＦｌａｓｈＭｅｍｏｒｙ）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。The storage unit is realized by, for example, a semiconductor memory element such as a random access memory (RAM) or a flash memory, or a storage device such as a hard disk or an optical disk.

また、制御部は、例えば、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）やＭＰＵ（ＭｉｃｒｏＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等によって、健康管理システム２００の記憶装置に記憶されている各種プログラムがＲＡＭを作業領域として実行されることにより実現される。また、制御部は、例えば、ＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）やＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）等の集積回路により実現される。The control unit is realized by, for example, a central processing unit (CPU) or a micro processing unit (MPU) executing various programs stored in the storage device of the health management system 200 using a RAM as a working area. The control unit is realized by, for example, an integrated circuit such as an application specific integrated circuit (ASIC) or a field programmable gate array (FPGA).

上述したように、記憶部および制御部の機能は、図２の第１のサーバないし第４のサーバが協働して実現される。ここで、第１ないし第４のサーバついて説明する。As described above, the functions of the storage unit and the control unit are realized by the cooperation of the first to fourth servers in Fig. 2. Here, the first to fourth servers will be described.

第１のサーバ２０４は、利用者から取得された音声データを継続的に格納する機能を有する。第２のサーバ２０５は、利用者の識別データを格納する機能を有し、利用者端末２０１およびウェブサーバ２０２との通信により継続的に利用者のログが更新される。The first server 204 has a function of continuously storing voice data acquired from the user. The second server 205 has a function of storing user identification data, and the user log is continuously updated through communication with the user terminal 201 and the web server 202.

第３のサーバ２０６は、第１ないし第３の配信データ（２０７ないし２０９）を備える。第３のサーバ２０６は、利用者向けの配信データを格納する機能を有する。第３のサーバの機能は、通信部２０３を介して外部のウェブサーバ２０２がその機能を代替してもよい。The third server 206 includes the first to third distribution data (207 to 209). The third server 206 has a function of storing distribution data for users. The function of the third server may be substituted by an external web server 202 via a communication unit 203.

第４のサーバ２１０は、算出部２１１、推定部２１２、選択部２１３および記憶部２１４を備える。第４のサーバは、第１のサーバの音声データを基に算出された音響特徴量を入力として所定の疾患を推定する推定処理を行い、かつ第３のサーバに格納された配信データの選択・配信処理を行うことで、推定装置として機能する。The fourth server 210 includes a calculation unit 211, an estimation unit 212, a selection unit 213, and a storage unit 214. The fourth server performs estimation processing to estimate a predetermined disease using an acoustic feature calculated based on the voice data of the first server as an input, and also performs selection and distribution processing of distribution data stored in the third server, thereby functioning as an estimation device.

なお、当該第１ないし第４のサーバは健康管理システム２００の機能を説明するために便宜的に振り分けたものであり、同一のサーバの一部または複数のサーバがこれらの機能を担っていてもよい。The first to fourth servers are assigned for the sake of convenience in order to explain the functions of the health management system 200, and these functions may be performed by part of the same server or by multiple servers.

第４のサーバ２１０に記憶されている推定プログラムおよび配信プログラムは、図１のＲＡＭ１０２を作業領域として実行されることにより、算出部２１１、推定部２１２、配信部２１３として動作する。算出部２１１、推定部２１２、および配信部２１３は、第４のサーバ内で制御部として機能する。一方、記憶部２１４には、推定プログラム、配信プログラム、そして後述する第２の音響パラメータのデータが格納される。The estimation program and distribution program stored in the fourth server 210 are executed using the RAM 102 in Fig. 1 as a working area, and thereby operate as a calculation unit 211, an estimation unit 212, and a distribution unit 213. The calculation unit 211, the estimation unit 212, and the distribution unit 213 function as control units within the fourth server. Meanwhile, the storage unit 214 stores the estimation program, the distribution program, and data on a second acoustic parameter, which will be described later.

ここで、算出部２１１は、音声データを基に音響特徴量を算出する。推定部２１２は、算出部２１１で算出した音響特徴量を入力として所定の疾患の推定を行う。選択部２１３は、推定部２１２の推定結果を基に、第２のサーバの識別データを参考にして、第３のサーバに格納される配信データを選択する。Here, the calculation unit 211 calculates acoustic features based on the speech data. The estimation unit 212 estimates a predetermined disease using the acoustic features calculated by the calculation unit 211 as an input. The selection unit 213 selects distribution data to be stored in the third server based on the estimation result of the estimation unit 212 and with reference to the identification data of the second server.

ここで、健康管理システム２００は、第４のサーバ２１０に記憶されている推定プログラムおよび配信プログラムに基づき、利用者の疾患情報等を推定する推定処理と、推定された情報に基づいて、第３のサーバ２０６に格納される音声データ、動画像データまたはテキストデータ（以下、音声データ、動画像データまたはテキストデータを、単に「配信データ」と称する）を選択して配信する配信処理を実行する。配信データは、利用者に興味を抱かせ、かかる配信データに含まれる情報を広く知らせるものであれば、文字、図形、記号、音声、動画、ゲーム、その他任意のコンテンツを適用可能である。Here, the health management system 200 executes an estimation process for estimating disease information and the like of the user based on an estimation program and a distribution program stored in the fourth server 210, and a distribution process for selecting and distributing voice data, video data, or text data (hereinafter, voice data, video data, or text data will be simply referred to as "distribution data") stored in the third server 206 based on the estimated information. The distribution data can be any content, such as characters, figures, symbols, voice, video, games, or any other content, as long as it is capable of generating interest in the user and widely informing the information contained in the distribution data.

第３のサーバ２０６に格納される配信データは、利用者の音声入力を促す配信コンテンツを含むデータや、利用者に医療機関への受診または健康診断を促すメッセージを含む利用者の健康管理に関する配信データが含まれる。利用者の健康管理に関する配信データとは、より良い健康への改善や、維持する方法に関しての情報にアクセスし、理解し、利用するための個人の意欲や能力を決定する認知的・社会的スキルを高めるための情報に関する配信データであり、例えば、ヘルスリテラシーを高める情報が含まれる。また、配信データは、第４のサーバの推定処理の数値結果であってもよい。The distribution data stored in the third server 206 includes data including distribution content that prompts the user to input voice, and distribution data related to the user's health management, including a message that prompts the user to visit a medical institution or undergo a health check. The distribution data related to the user's health management is distribution data related to information for improving cognitive and social skills that determine an individual's motivation and ability to access, understand, and use information related to how to improve and maintain better health, and includes, for example, information for improving health literacy. The distribution data may also be a numerical result of the estimation process of the fourth server.

図２の第３のサーバ２０６では、第１の配信データ２０７、第２の配信データ２０８、および第３の配信データ２０９が便宜的に分類して格納されている。第１の配信データ２０７は、罹患前の利用者向けの健康情報を含む。第２の配信データ２０８は、第４のサーバ２１０の推定部２１２により、いずれかの疾患を罹患している疑いがあると推定された利用者に向けて、医療機関への受診または健康診断を促すメッセージを含む健康情報が含まれる。第３の配信データ２０９は、罹患後の利用者向けの健康情報を含む。In the third server 206 in Fig. 2, the first distribution data 207, the second distribution data 208, and the third distribution data 209 are conveniently classified and stored. The first distribution data 207 includes health information for users before they become ill. The second distribution data 208 includes health information including a message encouraging a user who is suspected of having any disease by the estimation unit 212 of the fourth server 210 to visit a medical institution or undergo a health check. The third distribution data 209 includes health information for users after they become ill.

第１ないし第３の配信データ（２０７ないし２０９）についてより具体的に説明する。第１の配信データ２０７が配信される利用者は、日常生活におけるヘルスケア、疾病予防、ヘルスプロモーションが比較的良好になされている人物が想定される。そのため、第１の配信データ２０７には、定期的な健康診断の受診を促す情報、利用者端末２０１が利用される状況・場所に合わせたレクリエーション情報等の健康増進に関する情報が含まれる。第１の配信データ２０７が配信される頻度は、第２、第３の配信データが配信される頻度よりも少ない。例えば、利用者端末２０１を介して利用者が音声を入力するタイミングで、第１の配信データ２０７が配信されてもよい。The first to third distribution data (207 to 209) will be described in more detail. The user to whom the first distribution data 207 is distributed is assumed to be a person who is relatively good at health care, disease prevention, and health promotion in daily life. Therefore, the first distribution data 207 includes information on health promotion such as information encouraging regular medical checkups and recreational information tailored to the situation and place where the user terminal 201 is used. The frequency at which the first distribution data 207 is distributed is less than the frequency at which the second and third distribution data are distributed. For example, the first distribution data 207 may be distributed at the timing when the user inputs voice via the user terminal 201.

第２の配信データ２０８が配信される利用者は、何等かの精神・神経系疾患を罹患していると疑われる人物が想定される。そのため、第２の配信データ２０８には、特定の医療機関への受診を促す情報が含まれる。ただし、利用者は精神・神経系疾患を患っている疑いがあることを自ら認めない恐れがある。また、不調を自覚しても、精神疾患の偏見や、精神科・心療内科を受診すること対する抵抗感から、患者が医療ケアを受けない恐れがあるため、医療機関等のソーシャルサービスにかかることを根気よく説得するための配信データを、第１の配信データよりも高頻度で配信するように構成されてもよい。The user to whom the second distribution data 208 is distributed is assumed to be a person suspected of suffering from some mental or neurological disorder. Therefore, the second distribution data 208 includes information encouraging the user to visit a specific medical institution. However, there is a risk that the user will not admit that he or she is suspected of suffering from a mental or neurological disorder. In addition, even if the patient is aware of his or her condition, there is a risk that the patient will not receive medical care due to prejudice against mental illness and reluctance to visit a psychiatrist or psychosomatic clinic. Therefore, the distribution data for persistently persuading the patient to seek social services such as those of a medical institution may be distributed more frequently than the first distribution data.

また、精神・神経系疾患は症状が共通しているため、個人では疾患を特定し難い場合がある。例えば、アルツハイマー型認知症と前頭側頭型認知症、アルツハイマー型認知症とレビー小体型認知症、レビー小体型認知症とパーキンソン病、双極性障害と大うつ病は、それぞれの疾患の区別が困難である。また、高齢者のうつ病では、抑うつ気分や思考の制止などのうつ病症状により、注意、集中力、判断力、記憶力が低下し、一見認知症のように見える仮性認知症があり、適切な治療を受けるためには、うつ病の類なのか認知症の類なのかを区別する必要がある。本願発明のように推定部２１２を介して推定した疾患を基に、特定した疾患に応じて推奨する診療機関を区別して配信データを提供できる点で本発明は有利である。In addition, since mental and neurological diseases have common symptoms, it may be difficult for an individual to identify the disease. For example, it is difficult to distinguish between Alzheimer's dementia and frontotemporal dementia, Alzheimer's dementia and Lewy body dementia, Lewy body dementia and Parkinson's disease, and bipolar disorder and major depression. In addition, in the case of depression in elderly people, attention, concentration, judgment, and memory are reduced due to depressive symptoms such as depressed mood and inhibition of thinking, and there is pseudodementia that appears to be dementia at first glance, and it is necessary to distinguish whether it is a type of depression or a type of dementia in order to receive appropriate treatment. The present invention is advantageous in that it can provide distribution data by distinguishing medical institutions recommended according to the identified disease based on the disease estimated via the estimation unit 212 as in the present invention.

第３の配信データ２０９が配信される利用者は、何等かの精神・神経系疾患を罹患した病歴のある人物、または何等かの精神・神経系疾患を治療中の人物が想定される。一般に、精神・神経系疾患を完治させることは難しい。そのため、完治とまでは言えないが、病状が治まって穏やかな寛解の状態となった後は、症状が消失後も再発予防のため薬の内服や医師による経過観察を継続するのが一般的である。そのため、第３の配信データ２０９には、継続的な音声入力により疾患の症状の確認を促すメッセージ等が含まれる。The user to whom the third distribution data 209 is distributed is assumed to be a person who has a history of suffering from some mental/nervous disease, or a person who is undergoing treatment for some mental/nervous disease. In general, it is difficult to completely cure a mental/nervous disease. Therefore, although it cannot be said that the disease is completely cured, it is common to continue taking medicine or being monitored by a doctor to prevent recurrence even after the symptoms have disappeared after the condition has improved and entered a state of mild remission. Therefore, the third distribution data 209 includes a message or the like that encourages the user to check the symptoms of the disease by continuous voice input.

なお、利用者に第３の配信データ２０９の配信がなされた後であっても、その後、推定部２１２が疾患の再発の疑いがあると推定した場合や、他の精神・神経系疾患を罹患していると疑われた場合には、第２の配信データ２０８が配信される。Even after the third distribution data 209 has been distributed to the user, if the estimation unit 212 subsequently estimates that there is a suspicion of recurrence of the disease or that the user is suspected of suffering from another mental/neurological disease, the second distribution data 208 is distributed.

以上のようにして、第４のサーバ２１０の選択部２１３は、推定部２１２の推定結果や、利用者の健康状態に応じて第１ないし第３の配信データを選択して配信する。In this manner, the selection unit 213 of the fourth server 210 selects and distributes the first to third distribution data in accordance with the estimation result of the estimation unit 212 and the health condition of the user.

また、利用者がいずれかの疾患を罹患していると医師に診断された後は、医師の所属する医療機関のサーバからウェブサーバ２０２を介して、または利用者自らが利用者端末２０１を介して、第２のサーバ２０５の識別データを更新するように構成される。識別データの更新が完了した後は、選択部２１３は、その利用者に関し、第２または第３の配信データ（２０８、２０９）を選択するように構成される。In addition, after a doctor diagnoses that the user has any disease, the identification data of the second server 205 is updated by a server of the medical institution to which the doctor belongs via the web server 202, or by the user himself/herself via the user terminal 201. After updating of the identification data is completed, the selection unit 213 is configured to select the second or third distribution data (208, 209) for the user.

健康管理システム２００の説明に戻る。健康管理システム２００は、利用者が使用する利用者端末２０１を音声入力手段として利用者の音声データを収集しつつ、利用者端末２０１を介してデータ通信を行うウェブサーバ等の情報処理装置である。例えば健康管理システム２００は、利用者端末２０１からデータの配信を要求する配信要求を受信すると、利用者端末２０１を使用する利用者の音声データによる疾患の推定結果に基づいて、対象となる配信データを選択し、選択した配信データを利用者端末２０１へ送信する。Returning to the description of the health management system 200, the health management system 200 is an information processing device such as a web server that collects voice data of a user using a user terminal 201 used by the user as a voice input means and performs data communication via the user terminal 201. For example, when the health management system 200 receives a distribution request for requesting data distribution from the user terminal 201, the health management system 200 selects target distribution data based on a disease estimation result based on the voice data of the user using the user terminal 201, and transmits the selected distribution data to the user terminal 201.

利用者の識別データは、第２のサーバ２０５に格納され、利用者端末２０１を介してログが継続的に更新され、かつ健診実施機関または医療機関らウェブサーバ２０２を介して健診結果や診断結果が継続的に更新される。利用者の識別データには、利用者が所定の疾患に罹患した経験があるか否かを含む個人情報を有する健康情報、利用者の性別、年齢、住所、趣味、興味の対象、行動履歴、ウェブページの閲覧履歴等、利用者のプロファイルや素性を示す種々の情報がある。個人情報を有する健康情報には、利用者の健診結果または医師による疾患の診断結果を含む個人情報が含まれる。The user's identification data is stored in the second server 205, the log is continuously updated via the user terminal 201, and the medical examination institution or medical institution continuously updates the medical examination results and diagnosis results via the web server 202. The user's identification data includes health information having personal information including whether the user has ever suffered from a specific disease, and various information indicating the user's profile and background, such as the user's gender, age, address, hobbies, interests, behavioral history, and web page browsing history. The health information having personal information includes personal information including the user's medical examination results or the results of a doctor's diagnosis of a disease.

利用者端末２０１は、音声を入力する手段および配信データを出力する手段を有するスマートフォンやタブレット等の通信端末であり、３Ｇ（Ｇｅｎｅｒａｔｉｏｎ）やＬＴＥ（ＬｏｎｇＴｅｒｍＥｖｏｌｕｔｉｏｎ）等の無線通信網を介して任意のサーバ装置と通信を行うことができる。利用者端末２０１は、音声を入力する手段および配信データを出力する手段を有してさえいれば、デスクトップ型のＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）やノート型のＰＣ、ネットブック、サーバ装置などの情報処理装置であってもよい。The user terminal 201 is a communication terminal such as a smartphone or tablet having a means for inputting voice and a means for outputting distribution data, and can communicate with any server device via a wireless communication network such as 3G (Generation) or LTE (Long Term Evolution). The user terminal 201 may be an information processing device such as a desktop PC (Personal Computer), a notebook PC, a netbook, or a server device, as long as it has a means for inputting voice and a means for outputting distribution data.

利用者端末２０１は、音声を入力する手段または配信データを出力する手段を有する電子機器に、その機能の一部を組込むこともできる。例えば、自動車等の移動装置、家庭用ゲーム機等のコンピュータゲームを動作させるためのハードウェア機器、またはカラオケボックス、ランニングマシン等の遊戯施設が、音声を入力する機能または配信データを出力する機能を有することができる。The user terminal 201 may have some of its functions built into an electronic device having a means for inputting voice or a means for outputting distribution data. For example, a mobile device such as an automobile, a hardware device for operating a computer game such as a home game machine, or an amusement facility such as a karaoke booth or a running machine may have the function for inputting voice or the function for outputting distribution data.

健康管理システム２００は、利用者端末２０１を介して利用者に音声入力を促す旨のウェブコンテンツを示すことができる。利用者に音声入力を促す旨のウェブコンテンツは、第３のサーバ２０６に格納される配信コンテンツであり、テキスト、画像、ビデオ、アニメーション、または音声による再生が含まれる。かかる場合、利用者端末２０１はアプリケーションを介して自動的に、または利用者の操作により、ウェブコンテンツの配信を行うウェブサーバに対してウェブコンテンツの配信要求を送信する。配信要求の送信は、利用者が利用者端末２０１のアプリケーションを起動した際に自動的にウェブサーバに送信されてもよい。The health management system 200 can display web content that prompts the user to input voice via the user terminal 201. The web content that prompts the user to input voice is distribution content stored in the third server 206, and includes text, images, videos, animations, or audio playback. In such a case, the user terminal 201 transmits a web content distribution request to a web server that distributes web content automatically via an application or by operation of the user. The distribution request may be automatically transmitted to the web server when the user starts an application on the user terminal 201.

健康管理システム２００は、利用者に音声入力を促す旨のウェブコンテンツを利用者端末２０１に配信する。そして、利用者端末２０１は、健康管理システム２００から受け付けたウェブコンテンツを表示する。The health management system 200 delivers web content that prompts the user to input voice to the user terminal 201. The user terminal 201 then displays the web content received from the health management system 200.

例えば、利用者が利用者端末２０１を介してゲームを実行している場合には、アプリケーションを介して、ゲーム中のキャラクタが利用者に音声入力を促す旨のメッセージを配信するように構成されてもよい。For example, when a user is playing a game via the user terminal 201, a character in the game may be configured to deliver a message via the application encouraging the user to input voice input.

健康管理システム２００は、利用者端末２０１を介して利用者からの音声入力がされると、音声データを配信サーバ１０の第１のサーバ２０４に格納する。そして、第３のサーバ２０６に格納されるデータの配信を要求する配信要求を受信すると、利用者端末２０１を使用する利用者の音声データを基に、疾患の推定処理をし、さらに配信データの選択処理を行い、選択された配信データを端末装置１００へ送信する。
〔３．健康管理システム２００が実行する処理〕 When a user inputs voice via the user terminal 201, the health management system 200 stores the voice data in the first server 204 of the distribution server 10. Then, when a distribution request for requesting distribution of data stored in the third server 206 is received, a disease estimation process is performed based on the voice data of the user using the user terminal 201, and further a process of selecting distribution data is performed, and the selected distribution data is transmitted to the terminal device 100.
3. Processing Executed by Health Management System 200

健康管理システム２００は、第３のサーバ２０６に格納されるデータを配信する場合、第４のサーバ２１０の推定部２１２の推定結果と、第２のサーバ２０５に格納される利用者の識別データに基づいて、配信対象となる配信データを第３のサーバに格納される上記第１ないし第３の配信データ（２０７ないし２０９）の中から選択する。When distributing data stored in the third server 206, the health management system 200 selects the distribution data to be distributed from among the first to third distribution data (207 to 209) stored in the third server based on the estimation result of the estimation unit 212 of the fourth server 210 and the user identification data stored in the second server 205.

他の従来技術では、配信対象となる健康管理に関する配信データは、第２のサーバ２０５に格納される利用者の識別データにのみ基づいていた。より具体的な例を挙げると、従来技術では、配信対象となる利用者が男性である場合は、多くの男性が提供されてきた健康情報に類似する健康情報が配信されてきた。In other conventional technologies, the distribution data related to health management to be distributed is based only on the identification data of the user stored in the second server 205. To give a more specific example, in the conventional technologies, if the user to be distributed is a man, health information similar to the health information provided to many men is distributed.

しかしながら、新規に登録された利用者の属性情報は、ログが存在しない場合がある。また、クッキー等が消去され、利用者を識別するためのＩＤ（Ｉｄｅｎｔｉｆｉｅｒ）が不明になった場合等、利用者の属性情報が不明となる場合がある。かかる場合、従来技術では、配信対象となる適切な配信データを選択することができない。However, there are cases where there is no log of attribute information of a newly registered user. In addition, there are cases where the attribute information of a user becomes unknown, such as when cookies or the like are deleted and an ID (Identifier) for identifying a user becomes unknown. In such cases, the conventional technology is unable to select appropriate distribution data to be distributed.

また、定期的に健診を受信しない利用者や、何等かの精神・神経系疾患を発症しても医療機関を受診しない利用者には、健康管理システム２００が選択すべき配信データが好ましくなく、ヘルスリテラシーを高めるべく提供される情報が、かえって利用者に悪影響を及ぼす危険性がある。In addition, for users who do not receive regular health checkups or who do not visit a medical institution even when they develop a mental or neurological disorder, the distribution data that the health management system 200 should select is not desirable, and there is a risk that the information provided to improve health literacy may actually have a negative impact on the users.

このように、従来技術では、第２のサーバ２０５に格納される利用者のログが存在しない場合や、利用者の属性情報が不明となる場合、適切な配信データを選択することができない。また、何等かの精神・神経系疾患を発症し、利用者端末２０１を介して得た利用者の行動履歴等からその異常を検知できたとしても、予測される疾患がある程度特定できていなければ、誤った健康情報を選択・提供し、利用者に悪影響を及ぼす危険性がある。As described above, in the conventional technology, when there is no user log stored in the second server 205 or when the user's attribute information is unknown, it is not possible to select appropriate distribution data. Even if a user develops some kind of mental or neurological disease and the abnormality can be detected from the user's behavior history obtained via the user terminal 201, if the predicted disease cannot be identified to some extent, there is a risk that incorrect health information will be selected and provided, which may have a negative impact on the user.

そこで、健康管理システム２００は、第１のサーバ２０４に格納された音声データから音響特徴量を算出する第４のサーバ２１０の算出部２１１を用いて、利用者の音響特徴量を算出する。そして、配信サーバ１０は、算出した特徴量に基づいて、第４のサーバの推定部２１２において利用者の疾患を推定する推定処理を実行する。Therefore, the health management system 200 calculates the acoustic feature of the user using a calculation unit 211 of a fourth server 210 that calculates the acoustic feature from the voice data stored in the first server 204. Then, the distribution server 10 executes an estimation process in which an estimation unit 212 of the fourth server estimates a disease of the user based on the calculated feature.

また、健康管理システム２００は、利用者の属性情報が不明である利用者が音声データを入力した場合は、する第４のサーバ２１０の算出部２１１を用いて係る音声データの音響特徴量を算出する。また、健康管理システム２００は、算出した特徴量に基づいて、第４のサーバの推定部２１２において利用者の疾患を推定する推定処理を実行する。そして、健康管理システム２００は、推定した疾患情報を用いて、選択部２１３が配信対象となる配信データを選択し、配信を行う配信処理を実行する。
〔４．算出部および推定部〕 Furthermore, when a user whose attribute information is unknown inputs voice data, the health management system 200 calculates acoustic features of the voice data using a calculation unit 211 of a fourth server 210. The health management system 200 also executes an estimation process in which an estimation unit 212 of the fourth server estimates a disease of the user based on the calculated features. The health management system 200 then executes a distribution process in which a selection unit 213 uses the estimated disease information to select distribution data to be distributed and distributes the data.
4. Calculation Unit and Estimation Unit

ここで、健康管理システム２００が疾患を推定する際に用いる算出部２１１と推定部２１２について説明する。
〔４－１．推定モデルの構築〕 Here, the calculation unit 211 and the estimation unit 212 used by the health management system 200 when estimating a disease will be described.
4-1. Construction of estimation model

健康管理システムの第４のサーバの推定モデルがどのような構築をしているか説明する。まず、被験者の音声データから、複数の音響パラメータを抽出する段階を経る。音響パラメータは、被験者の音声データから抽出される。音響パラメータは、音が伝わる際の特徴をパラメータ化したものである。例えば、音響パラメータにはゼロ点交差率やハースト指数等がある。ゼロ点交差率は、音声の音圧の波形が基準圧力を横切る単位時間あたりの回数を、音声における波形の変化の激しさの度合いとして算出したものである。ハースト指数は、音声の波形における変化の相関性を示す。The following describes how the estimation model of the fourth server of the health management system is constructed. First, a stage is taken in which multiple acoustic parameters are extracted from the subject's voice data. The acoustic parameters are parameterized features of sound transmission. For example, acoustic parameters include the zero-crossing rate and the Hurst exponent. The zero-crossing rate is calculated as the number of times per unit time that the waveform of the sound pressure of the voice crosses the reference pressure, as a measure of the intensity of changes in the waveform of the voice. The Hurst exponent indicates the correlation of changes in the waveform of the voice.

音響パラメータは、第１の音響パラメータと第２の音響パラメータに分類される。ここで、第１の音響パラメータは、特定の疾患を推定すべき利用者の音声から抽出した音響パラメータである。第１の音響パラメータは、第１のサーバ２０４に格納される音声データを基に抽出される。The acoustic parameters are classified into first acoustic parameters and second acoustic parameters. Here, the first acoustic parameters are acoustic parameters extracted from the voice of a user whose specific disease is to be estimated. The first acoustic parameters are extracted based on the voice data stored in the first server 204.

第２の音響パラメータは、予めデータベースに記録されている音響パラメータである。第２の音響パラメータのデータは、第４のサーバ２１０の記憶部２１４または当該データベースの機能を代替えする他のサーバに格納される。第２の音響パラメータは、アルツハイマー型認知症、レビー小体型認知症、パーキンソン病、大うつ病、非定型うつ病、または双極性障害の疾患を有する患者の音声データから抽出を行い、各音響パラメータと各疾患が予め紐付けされている。The second acoustic parameters are acoustic parameters recorded in advance in a database. Data on the second acoustic parameters is stored in the storage unit 214 of the fourth server 210 or another server that substitutes for the function of the database. The second acoustic parameters are extracted from speech data of a patient with Alzheimer's disease, Lewy body dementia, Parkinson's disease, major depression, atypical depression, or bipolar disorder, and each acoustic parameter is linked in advance to each disease.

本願発明で使用する音響パラメータの具体例を示すと、以下のような項目がある。
１）音量のエンベロープ（アタックタイム、ディケイタイム、サステインレベル、リリースタイム）
２）波形の変動情報（Ｓｈｉｍｍｅｒ，Ｊｉｔｔｅｒ）
３）ゼロ点交差率
４）ハースト指数
５）ＶＯＴ（ＶｏｉｃｅＯｎｓｅｔＴｉｍｅ）
６）メル周波数ケプストラムのある係数に関する発話内分布の統計値（第１四分位点、中央値、第３四分位点、９５パーセント点、算術平均、幾何平均、第３四分位点と中央値の差など）
７）周波数スペクトラムの変化の速さにおける発話内分布の統計値（第１四分位点、中央値、第３四分位点、９５パーセント点、算術平均、幾何平均、第３四分位点と中央値の差など）
８）メル周波数ケプストラムのある係数の時間変化に関する発話内分布の統計値（第１四分位点、中央値、第３四分位点、９５パーセント点、算術平均、幾何平均、第３四分位点と中央値の差など）
９）メル周波数ケプストラムのある係数の時間変化の時間変化に関する発話内分布の統計値（第１四分位点、中央値、第３四分位点、９５パーセント点、算術平均、幾何平均、第３四分位点と中央値の差など）
１０）周波数スペクトラム９０％ロールオフの発話内時間変化における２次回帰近似に対する二乗誤差
１１）周波数スペクトラム重心の発話内時間変化における２次回帰近似に対する算術誤差
その他、ピッチレート、有声音である確率、任意範囲の周波数のパワー、音階、話速（一定時間におけるモーラ数）、ポーズ・間、音量などが挙げられる。 Specific examples of the acoustic parameters used in the present invention include the following items.
1) Volume envelope (attack time, decay time, sustain level, release time)
2) Waveform fluctuation information (Shimmer, Jitter)
3) Zero point crossing rate 4) Hurst index 5) VOT (Voice Onset Time)
6) Statistics of the intra-utterance distribution of certain coefficients of the Mel-frequency cepstrum (1st quartile, median, 3rd quartile, 95th percentile, arithmetic mean, geometric mean, difference between the 3rd quartile and the median, etc.)
7) Statistics of the intra-utterance distribution of the rate of change of the frequency spectrum (1st quartile, median, 3rd quartile, 95th percentile, arithmetic mean, geometric mean, difference between the 3rd quartile and the median, etc.)
8) Statistics on the intra-utterance distribution of the time-varying coefficients of the Mel-frequency cepstrum (1st quartile, median, 3rd quartile, 95th percentile, arithmetic mean, geometric mean, difference between the 3rd quartile and the median, etc.)
9) Statistics on the intra-utterance distribution of the time change of a certain coefficient of the Mel-frequency cepstrum (1st quartile, median, 3rd quartile, 95th percentile, arithmetic mean, geometric mean, difference between the 3rd quartile and the median, etc.)
10) Squared error for a quadratic regression approximation of the intra-utterance time change of the frequency spectrum 90% roll-off. 11) Arithmetic error for a quadratic regression approximation of the intra-utterance time change of the frequency spectrum center of gravity. Other examples include pitch rate, probability of being a voiced sound, power of a given range of frequencies, scale, speaking rate (number of moras in a given time), pauses/intervals, and volume.

なお、このような音響パラメータは、発話フレーズ毎に音響パラメータの抽出ソフト（例えば、ｏｐｅｎＳＭＩＬＥ等が挙げられる）により抽出される。本発明においては、７４４０個の音響パラメータを抽出し、まず録音環境に依存する音響パラメータを除き、残ったものから健康状態を表すのに適切な音響パラメータを１つまたは複数選択した。Such acoustic parameters are extracted for each speech phrase by acoustic parameter extraction software (e.g., openSMILE, etc.). In the present invention, 7,440 acoustic parameters were extracted, acoustic parameters dependent on the recording environment were first removed, and one or more acoustic parameters suitable for expressing the health condition were selected from the remaining parameters.

第４のサーバ２１０に格納されている推定プログラムは、人工知能による学習機能を有しその学習機能によって推定処理を行う。ニューラルネットワーク型のディープラーニングが利用されてもよいし、部分的に学習分野を強化する強化学習等が利用されてもよいし、その他、遺伝的アルゴリズム、クラスタ分析、自己組織化マップ、アンサンブル学習、等が利用されてもよい。もちろん、これら以外の人工知能に関する技術が利用されてもよい。アンサンブル学習においては、ブースティングと決定木を併用した手法により分類アルゴリズムを作成してもよい。The estimation program stored in the fourth server 210 has a learning function based on artificial intelligence and performs estimation processing using the learning function. Neural network-type deep learning may be used, or reinforcement learning that partially strengthens the learning field may be used, or other techniques such as genetic algorithms, cluster analysis, self-organizing maps, and ensemble learning may be used. Of course, other artificial intelligence techniques may be used. In ensemble learning, a classification algorithm may be created using a method that combines boosting and decision trees.

推定プログラムの作成段階において、アルゴリズム作成者は、上記第２の音響パラメータの項目の中から、変数ｆ（ｎ）として使用する任意の音響パラメータをより良い組合せとなるようにステップワイズ法により検討を行い、１つまたは複数選択する。次に、選択された任意の音響パラメータに対し係数を付し、１つまたは複数の音響特徴量を作成する。さらに、これらの音響特徴量を組合せて、音響特徴量Ｆ（ａ）を作成する。In the stage of creating the estimation program, the algorithm creator performs stepwise analysis to find the best combination of any acoustic parameters to be used as the variable f(n) from among the items of the second acoustic parameters, and selects one or more of them. Next, a coefficient is assigned to the selected acoustic parameter to create one or more acoustic features. Furthermore, these acoustic features are combined to create an acoustic feature F(a).

ステップワイズ法には変数増加法、変数減少法、変数増減法の３種類があるがいずれを用いてもよい。ステップワイズ法で用いられる回帰分析には、線形判別式やロジスティック回帰分析等の線形分類の処理が含まれる。変数ｆ（ｎ）およびそれらの係数、すなわち図３Ａの式で示される式Ｆ（ａ）の係数ｘｎは回帰係数と呼ばれ、関数ｆ（ｎ）に付与される重みである。There are three types of stepwise methods: forward, backward, and increment/decrement methods, and any of them may be used. Regression analysis used in the stepwise method includes linear classification such as linear discriminant and logistic regression analysis. The variables f(n) and their coefficients, i.e., the coefficients xn of the formula F(a) shown in FIG. 3A, are called regression coefficients, and are weights given to the function f(n).

回帰係数は、学習アルゴリズムの作成者が選択した後に、データベースに蓄積される疾患情報などから、推定精度を高めるための機械学習により品質の改善を行ってもよい。After the creator of the learning algorithm selects the regression coefficients, the quality may be improved by machine learning to increase the estimation accuracy based on disease information accumulated in a database.

図３Ａは、１つ以上の音響特徴量を算出するための特徴量の式Ｆ（ａ）である。式Ｆ（ａ）を基にして被験者の疾患の予測値を算出することができる。ここで、ｆ（ｎ）は、上記の音響パラメータの項目（１）ないし（１１）の中からいずれか１つ以上の第２の音響パラメータを任意に選択したものである。ｘｎは疾患に固有の回帰係数である。ｆ（ｎ）、ｘｎは、推定プログラムの一部として、予め第４のサーバ２１０の記憶部２１４内に格納されていてもよい。推定プログラムの機械学習の過程で特徴量Ｆ（ａ）の回帰係数を改善してもよい。FIG. 3A is a feature equation F(a) for calculating one or more acoustic features. A predicted value of the subject's disease can be calculated based on the equation F(a). Here, f(n) is any one or more second acoustic parameters arbitrarily selected from the above acoustic parameter items (1) to (11). xn is a regression coefficient specific to the disease. f(n) and xn may be stored in advance in the storage unit 214 of the fourth server 210 as part of the estimation program. The regression coefficient of the feature F(a) may be improved during the machine learning process of the estimation program.

図２の算出部２１１は、第２の音響パラメータの組合せに基づいて、健常者と疾患を有する被験者を区別する、あるいは疾患どうしを区別するための特徴量を算出する。この特徴量から、基準範囲と、基準範囲から被験者の値がどの程度離れているのかを算出するためのスコアリングを行うことにより、被験者の疾患の予測値を算出する。2 calculates a feature quantity for distinguishing between healthy subjects and subjects with a disease, or between diseases, based on the combination of the second acoustic parameters. From the feature quantity, a reference range and a scoring for calculating how far the subject's value is from the reference range are calculated, thereby calculating a predicted value of the subject's disease.

図３Ｂは、ある一つの音響特徴量の強度が、疾患毎に異なっていることを示すイメージ図である。被験者は疾患Ａのスコアが最も高いことを示している。従って、被験者の疾患Ａに対する予測値は、他の疾患群と比較して高く算出される。また、例えば、強度５０を閾値と設定することにより、疾患Ａ、疾患Ｄ、疾患Ｅの群と、疾患Ｂ、疾患Ｃの群とに分類できる。FIG. 3B is an image showing that the intensity of a certain acoustic feature varies for each disease. It shows that the subject has the highest score for disease A. Therefore, the predicted value for disease A of the subject is calculated to be higher than that of other disease groups. In addition, for example, by setting the intensity of 50 as the threshold, it is possible to classify into a group of diseases A, D, and E and a group of diseases B and C.

図３Ｂは、一つの音響特徴量の強度を基に疾患の予測値を算出しているが、実際は一つの音響特徴量のみで疾患を分類できることは困難である。そのため、いくつかの音響特徴量の組み合わせにより求められる特徴量Ｆ（ａ）を算出して、疾患を分類しても良い。3B, a disease prediction value is calculated based on the intensity of one acoustic feature, but in reality, it is difficult to classify diseases using only one acoustic feature. Therefore, diseases may be classified by calculating a feature F(a) obtained by combining several acoustic features.

この音響特徴量の組合せである特徴量Ｆ（ａ）を基に、ラベル付けされた被験者の音声について疾患の予測値を計算し、疾患ごとの予測値の分布を求める。これにより、各疾患を分類することができる。Based on the feature F(a), which is a combination of these acoustic features, a predicted value of the disease is calculated for the voice of the labeled subject, and a distribution of the predicted values for each disease is obtained. This makes it possible to classify each disease.

図３Ｃは、ある３つの音響パラメータの組み合わせによって得られた疾患の予測値（図３Ｃ中では、「メンタル値」と記載している）の分布画像である。FIG. 3C is a distribution image of disease predictive values (denoted as "mental values" in FIG. 3C) obtained by combining three acoustic parameters.

図３Ｃから、レビー小体型認知症の患者群の予測値の分布が、他の疾患の患者群及び健常者群の予測値の分布から分離できることがわかる。本願発明は、疾患ごとに、他の疾患と区別できるように音響パラメータの組み合わせを設定し、特徴量Ｆ（ａ）を計算し、音響特徴量の組合せである特徴量Ｆ（ａ）を入力として、各対象者の音声がどの疾患に当てはまるかを判定することができる。3C shows that the distribution of predicted values for the Lewy body dementia patient group can be separated from the distribution of predicted values for the other disease patient groups and the healthy subject group. The present invention sets a combination of acoustic parameters for each disease so that it can be distinguished from other diseases, calculates a feature F(a), and uses the feature F(a), which is a combination of acoustic features, as an input to determine which disease the voice of each subject corresponds to.

別の手法としては、各患者の音声から、疾患毎の音響特徴量の組合せである特徴量Ｆ（ａ）を抽出し、どの疾患の特徴量をより多く有しているかを求め、疾患の予測値を互いに比較することにより、患者が罹患している疾患と推定することができる。As another method, a feature F(a), which is a combination of acoustic features for each disease, is extracted from the voice of each patient, and it is determined which disease features the patient has more of. The predicted values of the diseases are then compared with each other to infer the disease that the patient is suffering from.

この場合、疾患の予測値は、その疾患に罹患している程度ととらえることができる。各疾患の予測値を比較、再計算することにより、どの疾患に罹患しているかを確率で表現することができる。In this case, the predicted value of a disease can be regarded as the degree of the disease. By comparing and recalculating the predicted values of each disease, it is possible to express the probability of having each disease.

このようにして、アルツハイマー型認知症、レビー小体型認知症、パーキンソン病、大うつ病、非定型うつ病、双極性障害の６疾患を含む患者の音声、および健常者の音声から、それぞれの疾患に関連する音響特徴量の組合せである特徴量Ｆ（ａ）を抽出し、それぞれの疾患の予測値を算出する。In this way, feature F(a), which is a combination of acoustic features associated with each disease, is extracted from the voices of patients with six diseases, including Alzheimer's disease, dementia with Lewy bodies, Parkinson's disease, major depression, atypical depression, and bipolar disorder, as well as the voices of healthy individuals, and a predicted value for each disease is calculated.

また、対象とする疾患に関して、更に血管性認知症、前頭側頭型認知症、気分循環症、気分変調症の４疾患を加えた１０疾患を含む患者の音声から推定プログラムを作成してもよい。Furthermore, a prediction program may be created from the voices of patients with ten diseases, including four more diseases, namely, vascular dementia, frontotemporal dementia, cyclothymia, and dysthymia, as the target diseases.

最終的には、判別対象者が発話した音声を解析することにより、上述の６疾患乃至１０疾患のいずれかであるか、または健常であるかを推定する。Finally, by analyzing the voice spoken by the subject, it is estimated whether the subject has one of the above-mentioned six to ten diseases, or is healthy.

推定プログラムの推定フローに関して、上述したように、個別の疾患に対し、それぞれの疾患の特徴量（ａ）を抽出して疾患の予測値を算出してもよいが、まず、疾患群に関する音響特徴量の組合せを作成し、疾患群に関する特徴量Ｆ（ａ）を推定部への入力として用い、複数の段階に分けて入力・推定を行うことで、最終的に各疾患または健常の推定をしてもよい。
〔４－２．推定精度〕 Regarding the estimation flow of the estimation program, as described above, for each individual disease, the feature (a) of each disease may be extracted and a predicted value of the disease may be calculated. Alternatively, first, a combination of acoustic features related to a group of diseases may be created, and the feature F(a) related to the group of diseases may be used as an input to the estimation unit, and input/estimation may be performed in multiple stages to finally estimate each disease or healthy state.
[4-2. Estimation accuracy]

図４Ａ及び図４Ｂは、健常者または特定疾患と、それ以外の分離性能を示すＲＯＣ曲線のグラフである。横軸が１－特異度を示し、縦軸が感度を示す。言い換えると、横軸が偽陽性率を示し、縦軸が真陽性率を示す。図４Ａ及び図４ＢのＲＯＣ曲線は、いずれも偽陽性立が低い時点で真陽性率が高い値を示した。4A and 4B are graphs of ROC curves showing the separation performance between healthy subjects or specific diseases and others. The horizontal axis shows 1-specificity, and the vertical axis shows sensitivity. In other words, the horizontal axis shows the false positive rate, and the vertical axis shows the true positive rate. Both of the ROC curves in FIG. 4A and FIG. 4B showed high true positive rates at points where the false positive rate was low.

また、ＡＵＣ（ＡｒｅａｕｎｄｅｒａｎＲＯＣｃｕｒｖｅ）はいずれも０．５より高く、ランダムに識別した場合と有意な差が確認できた。分離性能の検証が行われた疾患は、レビー小体型認知症、アルツハイマー型認知症、パーキンソン病、大うつ病、双極性障害、非特定うつ病である。それぞれのＲＯＣ曲線におけるＡＵＣは、レビー小体型認知症が０．７９４、アルツハイマー型認知症が０．７９９、パーキンソン病が０．７７１、大うつ病が０．８６９、双極性障害が０．８６、非特定うつ病が０．８６であった。なお、本願発明を用いて推定可能な疾患は上記のものに限定されない。ＡＵＣの結果から、疾患の推定精度が高いことが確認できる。In addition, the AUC (Area under an ROC curve) was higher than 0.5 in all cases, and a significant difference from the case of random classification was confirmed. The diseases for which the separation performance was verified were Lewy body dementia, Alzheimer's dementia, Parkinson's disease, major depression, bipolar disorder, and non-specific depression. The AUC in each ROC curve was 0.794 for Lewy body dementia, 0.799 for Alzheimer's dementia, 0.771 for Parkinson's disease, 0.869 for major depression, 0.86 for bipolar disorder, and 0.86 for non-specific depression. The diseases that can be estimated using the present invention are not limited to the above. From the AUC results, it can be confirmed that the accuracy of disease estimation is high.

本願発明の推定プログラムは、レビー小体型認知症を、健常または他の疾患と分類することができる。The prediction program of the present invention can classify dementia with Lewy bodies as healthy or as having another disease.

本願発明の推定プログラムは、アルツハイマー型認知症を、健常または他の疾患と分類することができる。The prediction program of the present invention can classify Alzheimer's disease as healthy or as having another disease.

本願発明の推定プログラムは、パーキンソン病を、健常または他の疾患と分類することができる。The prediction program of the present invention can classify Parkinson's disease from healthy subjects or from other diseases.

本願発明の推定プログラムは、大うつ病を、健常または他の疾患と分類することができる。The prediction program of the present invention can classify major depression as healthy or as having another disease.

本願発明の推定プログラムは、双極性障害を、健常または他の疾患と分類することができる。The prediction program of the present invention can classify bipolar disorder from health or from other diseases.

本願発明の推定プログラムは、非特定うつ病を、健常または他の疾患と分類することができる。The prediction program of the present invention can classify non-specific depression as healthy or as having another disease.

本願発明の推定プログラムは、健常な状態を、上記の疾患と分類することができる。The prediction program of the present invention can classify a healthy state into the above-mentioned diseases.

前述のように作成された推定プログラムは、精神・神経疾患に罹患していると疑われる者であっても、健常と推定される者であっても特に制限なく利用できる。使用場面も、医師による診察のツールとして、あるいは健康診断や人間ドックの検査項目として、対象者の音声を取得すれば簡便に利用可能である。The estimation program created as described above can be used without any restrictions by those suspected of having a mental or neurological disorder, or those presumed to be healthy. It can also be used as a tool for doctors to examine the subjects, or as a test item for health checkups and medical examinations, simply by acquiring the subjects' voices.

本発明の音声データ解析装置には、いくつかの異なった音声解析プログラムを搭載することができる。The voice data analysis device of the present invention can be equipped with several different voice analysis programs.

例えば、健康な人が日ごろから健康状態を把握する目的であれば、ＭＩＭＯＳＹＳ（ＰＳＴ株式会社の登録商標）により、“元気圧”と“活量値”から元気の度合いを知ることができる。ＭＩＭＯＳＹＳは音声解析により利用者が発する感情の種類と大きさ及び元気の度合いを算出する音声解析プログラムを搭載した製品である。For example, if a healthy person wants to know their daily health condition, MIMOSSYS (registered trademark of PST Corporation) can tell them the level of vitality from their "vitality pressure" and "activity value." MIMOSSYS is a product equipped with a voice analysis program that uses voice analysis to calculate the type and magnitude of emotions expressed by the user and the level of vitality.

また、利用者自身が元気が落ちてきたと自覚症状がある場合や、ＭＩＭＯＳＹＳの測定によって元気の度合いが低くなる傾向にあった場合には、精神・神経系疾患の可能性を測定する解析プログラムを用いることが好ましい。In addition, if the user himself/herself is aware of a decline in vitality, or if the MIMOSYS measurement indicates a tendency for the level of vitality to decline, it is preferable to use an analysis program that determines the possibility of a mental or neurological disorder.

このような音声解析プログラムとしては、例えば、健常であるか、認知症群の疾患である可能性があるか、気分障害群の疾患である可能性があるかを推定するプログラムが挙げられる。Examples of such voice analysis programs include programs that estimate whether a person is healthy, has a possibility of having a dementia group disease, or has a possibility of having a mood disorder group disease.

更に、利用者が何らかの精神・神経系疾患の可能性が高いと思われる場合は、精神・神経系疾患のいずれかを推定するプログラムを用いることが好ましい。Furthermore, if the user is considered to have a high possibility of having some kind of mental or neurological disorder, it is preferable to use a program that predicts either the mental or neurological disorder.

ここで、精神・神経系疾患としては、具体的には例えば、認知症群として、レビー小体型認知症、アルツハイマー型認知症、パーキンソン病が挙げられる。この他にも血管性認知症を含めることができる。また気分障害群として、大うつ病、双極性障害、非定型うつ病が挙げられる。この他にも気分循環症、気分変調症を含めることができる。Here, specific examples of psychiatric and neurological disorders include dementia with Lewy bodies, Alzheimer's disease, and Parkinson's disease. Vascular dementia can also be included. Mood disorders include major depression, bipolar disorder, and atypical depression. Cyclothymia and dysthymia can also be included.

また、推定する対象の疾患としては、精神・神経系疾患が好ましいが、発話に影響する疾患であれば推定できる可能性があるので、例えば、心疾患、呼吸器系疾患等が挙げられる。
〔５．推定プログラムの作成例〕 In addition, the disease to be estimated is preferably a mental or nervous system disease, but any disease that affects speech may be possible to estimate, such as heart disease, respiratory disease, etc.
5. Example of Estimation Program Creation

上記項目４では、第４のサーバ２１０の推定部２１２で実行される推定プログラムの分類器としての性能を十分に証明した。本項目５では、推定プログラムの作成例を紹介する。
〔５－１．推定プログラムの作成例１〕 In the above item 4, the performance as a classifier of the estimation program executed in the estimation unit 212 of the fourth server 210 has been sufficiently proven. In this item 5, an example of creating an estimation program will be introduced.
[5-1. Example 1 of Estimation Program Creation]

本項目５－１では、疾患を、レビー小体型認知症、アルツハイマー型認知症、パーキンソン病を含む認知症群と、大うつ病、双極性障害、非定型うつ病を含む気分障害群に分類した場合に、利用者が健常、認知症群、または気分障害群であるかを推定するための推定プログラムについて説明する。まず当該推定プログラムの作成手順について記す。In this section 5-1, we will explain a prediction program for predicting whether a user is healthy, in the dementia group, or in the mood disorder group when diseases are classified into a dementia group including dementia with Lewy bodies, Alzheimer's disease, and Parkinson's disease, and a mood disorder group including major depression, bipolar disorder, and atypical depression. First, we will describe the procedure for creating the prediction program.

アルツハイマー型認知症の患者１５名、双極性障害の患者１２名、非定型うつ病の患者１４名、レビー小体型認知症の患者１５名、大うつ病の患者１５名、パーキンソン病の患者１５名、健常者１５名の各音声を学習用データとして用いた。The speech samples used as training data were 15 patients with Alzheimer's disease, 12 patients with bipolar disorder, 14 patients with atypical depression, 15 patients with Lewy body dementia, 15 patients with major depression, 15 patients with Parkinson's disease, and 15 healthy individuals.

アルツハイマー型認知症の患者５名、双極性障害の患者４名、非定型うつ病の患者５名、レビー小体型認知症の患者５名、大うつ病の患者５名、パーキンソン病の患者５名、健常者５名の各音声をテストデータとして用いた。なお、学習用データとテスト用データとへの振り分けはランダムに行った。The test data consisted of speech samples from five patients with Alzheimer's disease, four patients with bipolar disorder, five patients with atypical depression, five patients with dementia with Lewy bodies, five patients with major depression, five patients with Parkinson's disease, and five healthy subjects. The data was randomly divided into training data and test data.

学習用データに用いる音声は各患者及び健常者が、図５に紹介される１７の文章（１番～１３番までは各２回、１４番～１７番までは各１回）を発話し各人ごとに３０個の発話を得た。The speech used for the learning data was obtained by each patient and healthy subject speaking the 17 sentences introduced in Figure 5 (numbers 1 to 13 spoken twice each, and numbers 14 to 17 spoken once each), resulting in 30 utterances for each person.

これら各発話について特徴的な音響パラメータを抽出し、その頻度、強度等、音響パラメータの特徴と当該被験者の疾患とを関連付けることにより、上記の６疾患と健常者の７種とそれに関連づけられた各種の音響パラメータ群が得られた。Characteristic acoustic parameters were extracted from each of these utterances, and the characteristics of the acoustic parameters, such as their frequency and intensity, were correlated with the subject's disease, thereby obtaining various groups of acoustic parameters associated with the above six diseases and seven types of healthy subjects.

各疾患に対して特徴的な音響パラメータを選択し、さらに選択された音響パラメータの組合せを行うことにより、各人の発話文章毎に、６疾患のいずれか、または健常者であるかを推定し、続いて認知症群、気分障害群のいずれの疾患であるか、または健常であるかを推定するための特徴量に基づく推定プログラム１０を作成した。Characteristic acoustic parameters were selected for each disease, and the selected acoustic parameters were then combined to infer for each person's spoken sentence whether they had one of the six diseases or were a healthy person. Then, an estimation program 10 was created based on the features to infer whether the person had a dementia group, a mood disorder group, or was healthy.

テストデータとしては、図５の１７個の文章のうち、２「あいうえおかきくけこ」、３「本日は晴天なり」、６「とても元気です」、８「食欲があります」、９「心が穏やかです」、１０「おこりっぽいです」、１２「上を向いて歩こう」、１３「がんばるぞー」の８つのフレーズ（各２回で計１６個）を用いた。各発話について、いずれの疾患であるかまたは健常であるかを推定した。推定された疾患のうち、アルツハイマー型認知症、レビー小体型認知症、パーキンソン病の３つの疾患に推定されたものは、認知症群の疾患であると判定した。また、大うつ病、双極性障害、非定型うつ病の３つの疾患に推定されたものは、気分障害群の疾患であると判定した。As test data, eight phrases (16 in total, two for each) were used from the 17 sentences in Figure 5: 2 "aiueokakikukeko", 3 "It's a fine day today", 6 "I'm very healthy", 8 "I have an appetite", 9 "I'm calm", 10 "I'm angry", 12 "Let's look up and walk", and 13 "Let's do our best". For each utterance, it was estimated whether it was a disease or healthy. Among the estimated diseases, those estimated to be Alzheimer's disease, dementia with Lewy bodies, and Parkinson's disease were determined to be diseases in the dementia group. In addition, those estimated to be major depression, bipolar disorder, and atypical depression were determined to be diseases in the mood disorder group.

ここで、各人ごとの１６個の発話が、認知症群の疾患、気分障害群の疾患、健常のうちいずれに判定されたかを集計し、もっとも多くの数の判定を得たものを、最終的な判定結果とした。
（結果１）アルツハイマー型認知症の患者（患者コードＡＤ０３）： Here, the 16 utterances from each person were tallied to see whether they were judged to be a dementia group disease, a mood disorder group disease, or healthy, and the one that received the most judgments was regarded as the final judgment result.
(Result 1) Patient with Alzheimer's disease (patient code AD03):

認知症群の疾患と判定されたもの１３個、気分障害群の疾患と判定されたもの１個、健常と判定されたもの２個であった。従って、最終的に認知症群の疾患と推定された。これは、実際の疾患であるアルツハイマー型認知症が属する認知症群と一致していた。Thirteen were judged to be dementia group, one was judged to be mood disorder group, and two were judged to be healthy. Therefore, they were finally estimated to be dementia group. This was consistent with the dementia group to which the actual disease Alzheimer's disease belongs.

同様に行った３４名の判定結果を図６に示す。当該推定プログラムによる正答率は８５．３％であった。これにより、レビー小体型認知症、アルツハイマー型認知症、パーキンソン病を含む認知症群と、大うつ病、双極性障害、非定型うつ病を含む気分障害群に分類した場合に、当該プログラムは、利用者が健常、認知症群、または気分障害群であるかを推定可能であることが十分に証明された。
〔５－２．推定プログラムの作成例２〕 The results of the similar judgments for 34 people are shown in Figure 6. The accuracy rate of the prediction program was 85.3%. This fully proves that the program can predict whether a user is healthy, in the dementia group, or in the mood disorder group, when classified into a dementia group including Lewy body dementia, Alzheimer's dementia, and Parkinson's disease, and a mood disorder group including major depression, bipolar disorder, and atypical depression.
[5-2. Example 2 of Estimation Program Creation]

本項目５－２では、推定プログラムの作成例として、大うつ病を、健常または他の疾患と分類する推定プログラムと、レビー小体型認知症を、健常または他の疾患と分類する推定プログラムの作成手順について説明する。なお、同様の手順により、アルツハイマー型認知症、パーキンソン病、双極性障害、または非定型うつ病を、健常または選択された疾患以外の疾患と分類するプログラムの作成を行うことができることは、上記項目４の結果から説明するまでもないため省略する。
〔５－２－１．大うつ病の推定プログラムの作成例〕 In this section 5-2, as examples of creating a prediction program, a prediction program that classifies major depression as healthy or another disease, and a prediction program that classifies Lewy body dementia as healthy or another disease will be described. Note that a similar procedure can be used to create a program that classifies Alzheimer's disease, Parkinson's disease, bipolar disorder, or atypical depression as healthy or a disease other than the selected disease, but this will be omitted because it is obvious from the results of section 4 above.
[5-2-1. Example of creating a prediction program for major depression]

まず、大うつ病と、健常を分類する推定プログラム（５－２－１Ａ）の作成手順について説明する。First, the procedure for creating the estimation program (5-2-1A) for classifying major depression and healthy subjects will be described.

健常者２０名、大うつ病患者２０名による発話フレーズ（各人それぞれ約３０発話）毎に、音響特徴量を抽出するソフトウエアであるｏｐｅｎＳＭＩＬＥを用いて、７４４０個の音響特徴量を抽出した。音声を取得した施設により有意差のある特徴量を除外し、１５４７個の特徴量を選定した。そして、ブースティングと決定木を併用した手法により、特徴量から、健常者と大うつ病とを判別するための疾患の予測値を生成し、分類アルゴリズムを作成した。フレーズ毎の推定結果を図７Ａに示す。7440 acoustic features were extracted from each phrase (approximately 30 utterances from each person) by 20 healthy subjects and 20 patients with major depression using openSMILE, a software that extracts acoustic features. Features with significant differences depending on the facility where the voice was obtained were excluded, and 1547 features were selected. Then, using a method that combines boosting and decision trees, a disease prediction value for discriminating between healthy subjects and patients with major depression was generated from the features, and a classification algorithm was created. The estimation results for each phrase are shown in Figure 7A.

図７Ａに示すように、大うつ病と、健常を分類する推定プログラムにおける大うつ病の判定は、感度８１．４％、特異度８５．５％、正答率８３．５％であった。As shown in FIG. 7A, the prediction program for classifying major depression from healthy subjects had a sensitivity of 81.4%, a specificity of 85.5%, and an accuracy rate of 83.5% in determining whether major depression was present.

当該推定プログラムに基づき、各フレーズに対して大うつ病または健常のいずれかと判定したものを、各人ごとに纏めた結果を図８Ａ及び図８Ｂに示す。The results of determining whether each phrase was major depression or healthy for each person based on the prediction program are shown in FIGS. 8A and 8B.

更に、各人ごとの健常判定率を求め、健常判定率が６０％以上の者を健常者と推定し、健常判定率が６０％未満の者を大うつ病患者と推定した。その結果を図９に示す。２０人の健常者はすべて健常者と推定され、２０人の大うつ病患者はすべて大うつ病患者と推定された。即ち、特異度、感度、正答率とも１００％であった。Furthermore, the healthy judgment rate for each person was calculated, and those with a healthy judgment rate of 60% or more were estimated to be healthy, and those with a healthy judgment rate of less than 60% were estimated to be major depression patients. The results are shown in Figure 9. All 20 healthy people were estimated to be healthy, and all 20 major depression patients were estimated to be major depression patients. In other words, the specificity, sensitivity, and accuracy rate were all 100%.

次に、大うつ病と、他の精神・神経系疾患を分類する推定プログラム５－２－１Ｂの作成手順について説明する。Next, a procedure for creating the estimation program 5-2-1B for classifying major depression from other psychiatric and neurological disorders will be described.

当該推定プログラム５－２－１Ｂは、互いに症状が類似しているアルツハイマー型認知症、レビー小体型認知症、パーキンソン病、大うつ病、非定型うつ病、双極性障害を少なくとも含む精神・神経系疾患のうち、大うつ病を抽出して推定するものである。The estimation program 5-2-1B extracts and estimates major depression from among psychiatric and neurological disorders including at least Alzheimer's disease, Lewy body dementia, Parkinson's disease, major depression, atypical depression, and bipolar disorder, which have similar symptoms.

解析に使用した発話の種類は、図５に示した１７種類の発話（フレーズ）である。The types of utterances used in the analysis were the 17 types of utterances (phrases) shown in FIG.

まず、発話フレーズ毎に、音響特徴量を抽出するソフトウエアであるｏｐｅｎＳＭＩＬＥを用いて、それぞれ７４４０個の音響特徴量を抽出した。音声を取得した施設により有意差のあるデータを除外し、１５４７個のデータの特徴量を用いてブースティングと決定木を併用した手法により分類アルゴリズムを作成し、大うつ病とその他の疾患を判別するための疾患の予測値を生成した。First, 7440 acoustic features were extracted for each spoken phrase using openSMILE, a software for extracting acoustic features. Data with significant differences depending on the facility where the voice was acquired were excluded, and a classification algorithm was created using the features of 1547 data by a method that combined boosting and decision trees, and a disease prediction value for distinguishing major depression from other diseases was generated.

なお、大うつ病以外の５つの患者による発話フレーズは２２３６個を用い、一方で大うつ病患者の発話は９４０個であったので、データ数の不均衡を避けるため、ＳｙｎｔｈｅｔｉｃＭｉｎｏｒｉｔｙＯｖｅｒ－ｓａｍｐｌｉｎｇＴｅｃｈｎｉｑｕｅ（ＳＭＯＴＥ）の手法を用いて大うつ病患者の発話の疑似データを作成し、大うつ病患者の発話フレーズを３１３９個とした。In addition, 2236 speech phrases from the five patients other than those with major depression were used, while the number of speech phrases from the patients with major depression was 940. Therefore, in order to avoid an imbalance in the number of data, pseudo-data of the speech of the patients with major depression was created using the Synthetic Minority Over-sampling Technique (SMOTE), and the number of speech phrases from the patients with major depression was set to 3139.

データを１０分割し、推定プログラム５－２－１Ｂの分類アルゴリズムの１０回交差検証を行った。The data was divided into 10 parts, and 10-fold cross-validation of the classification algorithm of the estimation program 5-2-1B was performed.

推定プログラム５－２－１Ｂに基づく推定結果を図７Ｂに示す。推定プログラム５－２－１Ｂを用いて、大うつ病と他の精神・神経系疾患を区別する場合において、大うつ病から見た再現率は９１．１％、他の精神・神経系疾患から見た再現率は９３．８％、正答率は９２．７％であった。また、ＡＵＣは０．９７７であった。
〔５－２－２．レビー小体型認知症の推定プログラムの作成例〕 The results of the estimation based on the estimation program 5-2-1B are shown in Figure 7B. When using the estimation program 5-2-1B to distinguish between major depression and other psychiatric and neurological disorders, the recall rate for major depression was 91.1%, the recall rate for other psychiatric and neurological disorders was 93.8%, and the accuracy rate was 92.7%. The AUC was 0.977.
[5-2-2. Example of creating a prediction program for Lewy body dementia]

まず、レビー小体型認知症と、健常を分類する推定プログラムの作成手順について説明する。First, we will explain the procedure for creating a prediction program for classifying Lewy body dementia and healthy subjects.

当該推定プログラム５－２－２Ａ作成手順は、音声として、健常者２０名、レビー小体型認知症患者２０名の発話フレーズを用いた以外は、上記項目５－２－１の推定プログラムと同様の手順を用いた。フレーズ毎の推定結果を図１０Ａに示す。The procedure for creating the estimation program 5-2-2A was the same as that for the estimation program in item 5-2-1 above, except that speech phrases from 20 healthy subjects and 20 patients with dementia with Lewy bodies were used as the voices. The estimation results for each phrase are shown in FIG. 10A.

当該推定プログラム５－２－２Ａにおけるレビー小体型認知症の判定は、感度８１．５％、特異度８３．１％、正答率８２．２％であった。The prediction program 5-2-2A had a sensitivity of 81.5%, a specificity of 83.1%, and an accuracy rate of 82.2% in determining whether or not there is Lewy body dementia.

当該推定プログラム５－２－２Ａに基づき、各フレーズに対してレビー小体型認知症または健常のいずれかを推定したものを、各人ごとに纏めた結果を図１１Ａ及び図１１Ｂに示す。Based on the estimation program 5-2-2A, the results of estimating whether each phrase was dementia with Lewy bodies or healthy for each person are shown in Figures 11A and 11B.

更に、各人の健常判定率を求め、健常判定率が６０％以上の者を健常者と推定し、健常判定率が６０％未満のものをレビー小体型認知症の患者と推定した。その結果を図１２に示す。２０人の健常者はすべて健常と推定され、２０人のレビー小体型認知症患者は１９人がレビー小体型認知症と推定され、１人が健常者と推定された。即ち、特異度は１００％、感度は９５％、正答率は９７．５％であった。Furthermore, the healthy judgment rate for each person was calculated, and those with a healthy judgment rate of 60% or more were estimated to be healthy, and those with a healthy judgment rate of less than 60% were estimated to be patients with dementia with Lewy bodies. The results are shown in Figure 12. All 20 healthy people were estimated to be healthy, and of the 20 patients with dementia with Lewy bodies, 19 were estimated to have dementia with Lewy bodies, and 1 was estimated to be a healthy person. That is, the specificity was 100%, the sensitivity was 95%, and the accuracy rate was 97.5%.

次に、レビー小体型認知症と、他の精神・神経系疾患を分類する推定プログラム（５－２－２Ｂ）の作成手順について説明する。Next, we will explain the procedure for creating a prediction program (5-2-2B) that classifies Lewy body dementia from other psychiatric and neurological disorders.

推定プログラム５－２－２Ｂは、互いに症状が類似しているアルツハイマー型認知症、レビー小体型認知症、パーキンソン病、大うつ病、非定型うつ病、双極性障害を少なくとも含む精神・神経系疾患のうち、レビー小体型認知症を抽出して推定するものである。解析に使用した発話の種類は、前記推定プログラム１と同様に１７種類の発話（フレーズ）である。The estimation program 5-2-2B extracts and estimates dementia with Lewy bodies from among psychiatric and neurological disorders including at least Alzheimer's disease, dementia with Lewy bodies, Parkinson's disease, major depression, atypical depression, and bipolar disorder, which have similar symptoms. The types of utterances used in the analysis are 17 types of utterances (phrases), the same as in the estimation program 1.

まず、発話フレーズ毎に、音響特徴量を抽出するソフトウエアであるｏｐｅｎＳＭＩＬＥを用いて、それぞれ７４４０個の音響特徴量を抽出した。音声を取得した施設により有意差のある特徴量を除外し、１５４７個の特徴量を用いてブースティングと決定木を併用した手法により分類アルゴリズムを作成し、レビー小体型認知症とその他の疾患を判別するための疾患の予測値を生成した。First, 7440 acoustic features were extracted for each spoken phrase using openSMILE, a software for extracting acoustic features. Features with significant differences depending on the facility where the voice was acquired were excluded, and a classification algorithm was created using a method that combined boosting and decision trees using 1547 features, generating disease prediction values for distinguishing between Lewy body dementia and other diseases.

なお、レビー小体型認知症以外の５つの患者による発話フレーズは２９８７個を用い、一方でレビー小体型認知症患者の発話は５８３個であったので、データ数の不均衡を避けるため、ＳｙｎｔｈｅｔｉｃＭｉｎｏｒｉｔｙＯｖｅｒ－ｓａｍｐｌｉｎｇＴｅｃｈｎｉｑｕｅ（ＳＭＯＴＥ）の手法を用いてレビー小体型認知症患者の発話の疑似データを作成し、レビー小体型認知症患者の発話フレーズを２６９６個とした。In addition, 2,987 speech phrases from the five patients other than Lewy body dementia were used, while 583 speech phrases from patients with Lewy body dementia were used. In order to avoid an imbalance in the amount of data, pseudo-data of the speech of patients with Lewy body dementia was created using the Synthetic Minority Over-sampling Technique (SMOTE), resulting in 2,696 speech phrases from patients with Lewy body dementia.

データを１０分割し、推定プログラム５－２－２Ｂの分類アルゴリズムの１０回交差検証を行った。The data was divided into 10 parts, and 10-fold cross-validation of the classification algorithm of the estimation program 5-2-2B was performed.

推定プログラム５－２－２Ｂに基づく推定結果を図１０Ｂに示す。推定プログラム５－２－２Ｂを用いて、レビー小体型認知症と他の精神・神経系疾患を区別する場合において、レビー小体型認知症から見た再現率は８８．６％、他の精神・神経系疾患から見た再現率は８９．８％、正答率は８９．２％であった。また、ＡＵＣは０．９５９であった。The estimation results based on the estimation program 5-2-2B are shown in Figure 10B. When using the estimation program 5-2-2B to distinguish between dementia with Lewy bodies and other psychiatric and neurological disorders, the recall rate for dementia with Lewy bodies was 88.6%, the recall rate for other psychiatric and neurological disorders was 89.8%, and the accuracy rate was 89.2%. The AUC was 0.959.

上記のような手順と同様にして、特定の疾患を、その他の５疾患と分類する推定プログラムと、その特定の疾患と健常者とを分類する推定プログラムを組み合わせることにより、アルツハイマー型認知症、レビー小体型認知症、パーキンソン病、大うつ病、非定型うつ病、双極性障害の各患者及び健常者とを高い判定確率で推定することができる。
〔６．健康管理システム２００の動作〕 By combining a prediction program that classifies a specific disease from the other five diseases with a prediction program that classifies the specific disease from healthy individuals in a similar manner to the above procedure, it is possible to predict with a high probability whether a patient has Alzheimer's disease, dementia with Lewy bodies, Parkinson's disease, major depression, atypical depression, or bipolar disorder, or whether the patient is a healthy individual.
6. Operation of the Health Management System 200

図１３、図１４は、健康管理システム２００の動作を示すシーケンス図である。以下に順番に説明する。13 and 14 are sequence diagrams showing the operation of the health management system 200. Each will be described in order below.

図１３は、利用者が医療機関にかかる前であるか、または利用者が一度医療機関にかかった後に寛解した場合の健康管理システム２００の動作を示すシーケンス図である。FIG. 13 is a sequence diagram showing the operation of the health management system 200 when the user has not yet visited a medical institution or has gone into remission after visiting a medical institution once.

まず、ステップＳ１００１において、利用者端末２０１から入力した利用者の識別情報または配信要求が、ネットワークＮ、通信部２０３を介して第２サーバ２０５に送信される。精神・神経系疾患を寛解した利用者は、その旨を識別情報に含めてもよい。First, in step S1001, the user's identification information or a delivery request input from the user terminal 201 is transmitted to the second server 205 via the network N and the communication unit 203. A user who has achieved remission of a psychiatric or neurological disorder may include this fact in the identification information.

次に、ステップＳ１００２において、第２サーバは利用者端末２０１から配信要求がされた旨を伝えるメッセージを第３サーバ２０６に送信する。Next, in step S 1002 , the second server transmits a message to the third server 206 notifying that a distribution request has been made from the user terminal 201 .

次に、ステップＳ１００３において、第３サーバ２０６は配信要求を受信すると、配信データを利用者端末２０１へ送信する。この配信データには、音声入力を促す旨のメッセージが含まれている。配信データは、通信部２０３、ネットワークＮを介して利用者端末２０１へ送信され、配信データを出力する手段によって出力される。Next, in step S1003, when the third server 206 receives the distribution request, it transmits the distribution data to the user terminal 201. This distribution data includes a message prompting the user to input voice. The distribution data is transmitted to the user terminal 201 via the communication unit 203 and the network N, and is output by the means for outputting the distribution data.

次に、ステップＳ１００４において、利用者が利用者端末２０１の音声の入力手段を介して入力した音声データが第１サーバ２０４へ送信される。Next, in step S 1004 , voice data input by the user via the voice input means of the user terminal 201 is transmitted to the first server 204 .

次に、ステップＳ１００５において、第１サーバは受信した音声データのコピーを格納するとともに、当該音声データを第４サーバへ送信する。Next, in step S1005, the first server stores a copy of the received voice data and transmits the voice data to the fourth server.

次に、ステップＳ１００６において、第４サーバは受信した音声データを前処理して第１の音響特徴量を算出し、かつ、予め記憶部２１４に格納されている第２の音響特徴量と、推定プログラムを基にして、算出部２１１で特徴量Ｆ（ａ）を算出する。推定部２１２は、算出部２１１で算出した特徴量Ｆ（ａ）を基にして、精神・神経系疾患の推定処理を行う。Next, in step S1006, the fourth server preprocesses the received speech data to calculate a first acoustic feature, and calculates a feature F(a) in the calculation unit 211 based on the second acoustic feature previously stored in the storage unit 214 and the estimation program. The estimation unit 212 performs a process of estimating a psychiatric/neurological disorder based on the feature F(a) calculated by the calculation unit 211.

ここで、推定部２１２が、利用者がいずれの精神・神経系疾患にも罹患していないと推定した場合は、選択部２１３を介して第３のサーバ２０６の第１の配信データ２０７が配信される。また、推定部２１２が、利用者がいずれかの精神・神経系疾患に罹患している疑いがあると推定した場合は、選択部２１３を介して第３のサーバ２０６の第２の配信データ２０８が配信される。Here, when the estimation unit 212 estimates that the user does not suffer from any psychiatric/neurological disorder, the first distribution data 207 of the third server 206 is distributed via the selection unit 213. When the estimation unit 212 estimates that the user is suspected of suffering from any psychiatric/neurological disorder, the second distribution data 208 of the third server 206 is distributed via the selection unit 213.

また、第２サーバ２０５に登録された利用者の識別情報の中に、精神・神経系疾患に罹患した病歴があるものの、既に寛解している旨の記録がある場合であって、かつ推定部２１２がいずれの精神・神経系疾患にも罹患していないと推定した場合には、選択部２１３を介して第３のサーバ２０６の第３の配信データ２０９が配信される。In addition, if the identification information of a user registered in the second server 205 includes a history of a psychiatric/neurological disorder but a record indicating that the disorder is already in remission, and the estimation unit 212 estimates that the user is not suffering from any psychiatric/neurological disorder, the third distribution data 209 of the third server 206 is distributed via the selection unit 213.

次に、ステップＳ１００７において、第３のサーバ２０６は、所定の配信データを利用者端末２０１に送信する。利用者端末２０１は受信した配信データを出力手段から出力して、処理を終了する。Next, in step S1007, the third server 206 transmits the predetermined distribution data to the user terminal 201. The user terminal 201 outputs the received distribution data from an output means, and ends the process.

このように、図１３では、第２のサーバ２０５に登録される利用者の識別情報と、利用者の音声データが第４のサーバ２１０の推定部２１２でどのように推定されるかによって、第３のサーバ２０６から配信される配信データが変更するように構成される。そのため、利用者の個々の健康状態に適合した健康情報を配信データとして配信できる点で有用である。また、図１３に示すステップは、その一部または全体を繰り返し実行することにより、時系列で状態の変化を確認することができる。13, the data distributed from the third server 206 is changed depending on the identification information of the user registered in the second server 205 and how the user's voice data is estimated by the estimation unit 212 of the fourth server 210. This is useful in that health information suited to the individual health condition of the user can be distributed as distribution data. In addition, the steps shown in FIG. 13 can be repeatedly executed in part or in whole to check changes in condition in chronological order.

次に、図１４の説明に移る。図１４は、利用者が医師の診断を受けた後のシーケンス図であって、図１３の健康管理システム２００の機能の他に、健康管理システム２００と医療機関のサーバとの相互通信を含む。なお、図１４では、利用者が何等かの精神・神経系疾患に罹患した旨の診断がされたものと仮定して、ステップの説明をする。Next, we move on to the explanation of Fig. 14. Fig. 14 is a sequence diagram after a user is diagnosed by a doctor, and includes mutual communication between the health management system 200 and a server of a medical institution in addition to the functions of the health management system 200 in Fig. 13. Note that in Fig. 14, the steps will be explained on the assumption that the user has been diagnosed with some kind of mental/neurological disorder.

まず、ステップＳ２００１において、ウェブサーバ２０２の一種である医療機関サーバ２０２Ｍは、ネットワークＮ、通信部２０３を介して第２のサーバ２０５へアクセスして、利用者の識別情報を更新する。First, in step S2001, the medical institution server 202M, which is a type of the web server 202, accesses the second server 205 via the network N and the communication unit 203, and updates the user's identification information.

次に、ステップＳ２００２において、第２のサーバ２０５は利用者の識別情報が更新された旨のメッセージを第３のサーバ２０６へ送信する。Next, in step S2002, the second server 205 transmits a message to the third server 206 indicating that the user's identification information has been updated.

次に、ステップＳ２００３において、医療機関サーバ２０２Ｍから利用者への音声入力の要求がされた場合、または利用者端末から健康管理システム２００にアクセスがあった場合に、第３のサーバ２０６が、利用者に音声入力を促す旨の配信メッセージを利用者端末２０１に送信する。Next, in step S2003, when the medical institution server 202M requests the user to input voice, or when the user terminal accesses the health management system 200, the third server 206 sends a distribution message to the user terminal 201 prompting the user to input voice.

次に、ステップＳ２００４において、音声の入力手段を介して入力された利用者の音声データが第１サーバ２０４へ送信される。Next, in step S 2004 , the user's voice data input via the voice input means is transmitted to the first server 204 .

次に、ステップＳ２００５において、第１のサーバ２０４は音声データをコピーして保存するとともに、当該音声データを第４のサーバ２１０へ送信する。Next, in step S 2005 , the first server 204 copies and stores the voice data, and also transmits the voice data to the fourth server 210 .

次に、ステップＳ２００６において、第４サーバ２１０は受信した音声データを前処理して第１の音響特徴量を算出し、かつ、予め記憶部２１４に格納されている第２の音響特徴量と、推定プログラムを基にして、算出部２１１で特徴量Ｆ（ａ）を算出する。推定部２１２は、算出部２１１で算出した特徴量Ｆ（ａ）を基にして、精神・神経系疾患の推定処理を行う。Next, in step S2006, the fourth server 210 preprocesses the received voice data to calculate a first acoustic feature, and calculates a feature F(a) in the calculation unit 211 based on the second acoustic feature previously stored in the storage unit 214 and an estimation program. The estimation unit 212 performs a process of estimating a psychiatric/neurological disorder based on the feature F(a) calculated by the calculation unit 211.

ここで、ステップＳ２００１ないしステップＳ２００６の処理は繰り返し行われてもよい。例えば、図１４では、利用者は既に何等かの精神・神経系疾患に罹患した旨の診断がされているが、診断後も予後経過を定期的に音声データとして入力し、第４サーバで推定処理をし続けることができる。また、第１サーバの音声データや第４サーバの推定処理の結果はその都度保存される。保存されたデータは、次の来院時の診断において予後経過の判断材料として用いることができる。Here, the processes of steps S2001 to S2006 may be repeated. For example, in FIG. 14, the user has already been diagnosed with some kind of mental or neurological disease, but even after the diagnosis, the user can continue to input the prognosis as voice data periodically and the estimation process can be continued in the fourth server. The voice data in the first server and the results of the estimation process in the fourth server are stored each time. The stored data can be used as a basis for determining the prognosis in the diagnosis at the next visit to the hospital.

また、診断後に、図１３のステップにて取得した診断前の第１サーバの音声データや第４サーバの推定処理の結果を確認することもできる。これにより、診断の正確性を増すことができる。After the diagnosis, it is also possible to check the voice data of the first server before the diagnosis and the result of the estimation process of the fourth server acquired in the step of Fig. 13. This can increase the accuracy of the diagnosis.

次に、ステップＳ２００７において、第３サーバ２０６は第４サーバ２１０の推定処理の結果を基に、選択部を介して配信データを選択し、予後の利用者に配信データを送信する。配信データは、第２または第３の配信データが選択される。Next, in step S2007, the third server 206 selects distribution data via the selection unit based on the result of the estimation process of the fourth server 210, and transmits the distribution data to the prognostic user. The second or third distribution data is selected as the distribution data.

なお、第２または第３の配信データの中に、医師から提供される問診票を含む利用者への質問データが含まれる場合がある。その場合は、ステップＳ２００７へ進む。ステップＳ２００７以降の操作は、ステップＳ２００１ないしステップＳ２００６の操作とは独立して繰り返し実行されてもよい。In addition, there is a case where the second or third distribution data includes question data for the user, including a medical questionnaire provided by a doctor. In that case, proceed to step S2007. The operations from step S2007 onwards may be repeatedly executed independently of the operations from step S2001 to step S2006.

次に、ステップＳ２００７において、問診票が利用者端末２０１へ送信される。次に、利用者が利用者端末２０１から問診票への入力を完了すると、ステップＳ２００８において、入力データが第２のサーバ２０５へ送信され、保存される。Next, in step S2007, the medical questionnaire is transmitted to the user terminal 201. Next, when the user completes inputting the information into the medical questionnaire from the user terminal 201, in step S2008, the input data is transmitted to the second server 205 and stored therein.

次に、ステップＳ２００９において、入力データが医療機関サーバ２０２Ｍへ送信される。ステップＳ２００９の操作は、第２サーバ２０５で行うステップＳ２００８の操作と組合されていてもよい。医療機関サーバ２０２Ｍは入力データを出力手段から出力して、処理を終了する。Next, in step S2009, the input data is transmitted to the medical institution server 202M. The operation of step S2009 may be combined with the operation of step S2008 performed by the second server 205. The medical institution server 202M outputs the input data from an output means and ends the process.

以上のようにして、図１３または図１４の健康管理システム２００を利用することにより、利用者は未病時には個人の健康管理ツールとして健康管理システム２００を利用することができる。また、精神・神経系疾患の発症後は、医師の診断支援ツールとして健康管理システム２００を利用することができる。また、完治・寛解後の利用者に対しては、個人の健康管理ツールとして、必要に応じて医師によるモニタリング手段として健康管理システム２００を利用することができる。In this manner, by using the health management system 200 in Fig. 13 or 14, the user can use the health management system 200 as a personal health management tool when the user is not yet ill. After the onset of a mental or neurological disease, the health management system 200 can be used as a diagnostic support tool for doctors. For users who have fully recovered or are in remission, the health management system 200 can be used as a personal health management tool and as a monitoring means by doctors as necessary.

これまでに、健康管理システム２００の動作の一例を紹介したが、本願発明はこれに限定されるものではない。なお、図１４では、ウェブサーバ２０２と健康管理システム２００の組合せた利用例として医療機関サーバ２０２Ｍを取り上げたが、医療機関が本システムを用いる代わりに、マーケット機関が消費者動向調査を行う目的で医療機関サーバ２０２Ｍを含む健康管理システム２００を利用してもよい。
〔７．本発明の特徴〕 So far, an example of the operation of the health management system 200 has been introduced, but the present invention is not limited to this. In addition, in Fig. 14, the medical institution server 202M is taken up as an example of the combined use of the web server 202 and the health management system 200, but instead of a medical institution using this system, a market institution may use the health management system 200 including the medical institution server 202M for the purpose of conducting a consumer trend survey.
7. Features of the present invention

健康管理システム２００は、発話を取得した毎に出力を表示させ、その都度健康状態を把握することも可能であるが、図１３、１４に示すようにして、継続的に音声を取得し、それを纏めて解析し、時系列で変化を見ることができる。例えば、１か月毎、３か月毎など、定期的に、あるいは何か体調不良があった時にそれまでの経緯を見ることができる。The health management system 200 can display the output every time an utterance is acquired, and can grasp the health condition each time, but it can also acquire voice continuously, compile and analyze it, and view changes over time, as shown in Figures 13 and 14. For example, it is possible to view the history of a person's health periodically, such as once a month or once every three months, or whenever a person experiences some kind of health problem.

それにより、健康管理システムとして従来のものに比べて以下のような優れた効果を有する。As a result, the health management system has the following advantages over conventional systems:

（１）時系列で変化を見ることにより、健康状態のトレンドが分かり、病気に向かっていくトレンドとなっていれば、予防的な対処をすることができる。(1) By looking at changes over time, you can see trends in your health condition, and if there is a trend toward illness, you can take preventive measures.

（２）継続的に音声を取得すれば、例えば１週間に一度でも音声を取得すれば、毎週の健康状況を捉えることができる。これは年に１回または２回の健康診断に比べ頻度が極めて多く。予防と早期発見のタイミングを逃さない。(2) By continuously recording audio, for example, even once a week, it is possible to grasp the health status every week. This is much more frequent than a health checkup once or twice a year, so that the timing for prevention and early detection is not missed.

（３）うつ病などの精神系疾患の場合、遺伝的要素、個人の資質（ストレス耐性、レジリエンス等）、居住地域の物理的環境、家族や会社の同僚など近い人間関係等、様々な要因が重なって発症する。そして、その原因を取り除かない場合は、一旦治っても再発するリスクが大きい。そこで、時系列として解析することにより、どのタイミングで健康状態が悪くなっているかが分かり、その時のライフイベントを考慮すると、主たる原因を捉えることが可能となる。従って、原因を取り除くことが容易になり、治癒・寛解後の再発を起きにくくすることができる。(3) In the case of mental illnesses such as depression, various factors such as genetic factors, personal qualities (stress tolerance, resilience, etc.), the physical environment of the residential area, and close human relationships with family and colleagues at work overlap to cause the illness. If the causes are not removed, there is a high risk of recurrence even after recovery. Therefore, by analyzing the illness over time, it is possible to determine the timing at which the health condition deteriorates, and by considering the life events at that time, it is possible to identify the main cause. This makes it easier to remove the causes, making it less likely that the illness will recur after recovery or remission.

（４）アルツハイマー型認知症などの認知症疾患は、病気が進行すると本人の病識が低下し、その病気であることを認めなくなる傾向にある。そうすると、治療を受けにくくなり重症化が早くなるほか、家族など周囲の精神的負担が大きくなる。認知症は年齢を重ねると誰もが発症する可能性が高くなることから、軽症のうちに認知症が進行する危険性を本人が自覚することにより、認知症の進行を防ぐトレーニングを積極的に受けたり、将来の備えをすることができる。また家族にとっても、将来的な心構えを持ち、対処を容易にすることができる。
〔８．音声取得の場面〕 (4) As dementia diseases such as Alzheimer's progress, the individual's awareness of the disease decreases and they tend to stop acknowledging that they have the disease. This makes it more difficult to receive treatment and the disease progresses more quickly, and increases the mental burden on family and others around the individual. Since anyone is more likely to develop dementia as they age, if the individual is aware of the risk of dementia progressing while the disease is still mild, they can actively receive training to prevent the progression of dementia and prepare for the future. It also allows the family to prepare for the future and make it easier to deal with the disease.
[8. Voice Acquisition Scene]

図２において、利用者端末２０１が、音声を入力する手段および配信データを出力する手段を有するスマートフォンやタブレット等の通信端末を含むことを説明した。ここでは、音声を入力する手段に関し、その他の例を紹介する。2, it has been described that the user terminal 201 includes a communication terminal such as a smartphone or a tablet having a means for inputting voice and a means for outputting distribution data. Here, other examples of the means for inputting voice will be introduced.

（１）利用者に半強制的に発話を指示し、音声を取得する場合
・出勤時のタイムカードの代わりに音声で出勤したことを確認する。
・テレワーク時に、自宅等から業務開始の合図として、一定の発話を行う。また、業務終了時も同様。
・運送業のドライバー、パイロット等、業務前検査（健康状態の把握）として発話する。
・治療の経過観察時、医師から指示された患者が音声を取得する。例えば、毎食後服薬する際に音声も取得する。(1) When semi-forcefully instructing a user to speak and acquiring audio recordings: Confirming that a user has clocked in by audio recording instead of by using a time card when clocking in.
・When working remotely, a certain utterance is made from home etc. as a signal to start work. The same applies when work ends.
- Used as a pre-work examination (to understand health status) for transport drivers, pilots, etc.
・During follow-up monitoring of treatment, patients will be asked by their doctor to record their voice. For example, voice recordings will be made when patients take their medicine after each meal.

（２）利用者が日常的に発話する状況にあり、その音声を取得する場合
・コールセンターのオペレータの会話の音声を取得する。
・コンビニ、ファストフードその他小売業の接客の会話の音声を取得する。
・アナウンサー、声優、司会者、駅員等、業務上発話する音声を取得する。
・スマートホーム家電を音声により操作する場合に、その音声を取得する。
・オンラインゲーム等でチーム内でチャットを行う際に、発話する音声を取得する。
・スマホアプリのゲーム中に、呪文を唱える際に実際の音声で行うよう設定する。
・警官、警備会社、タクシー、林業等が用いる無線による会話の音声を取得する。
・医療従事者、介護従事者が、患者等に声をかける際の音声を取得する。
・保険業や金融業の店舗の窓口において、商品を紹介する場合等に発話する音声を取得する。(2) When a user is in a situation where he or she regularly speaks and that voice is to be acquired, the voice of a call center operator's conversation is to be acquired.
・Capture audio of customer service conversations at convenience stores, fast food restaurants, and other retail businesses.
-Capture audio spoken in the course of business by announcers, voice actors, presenters, station staff, etc.
- When smart home appliances are operated by voice, the voice is acquired.
- Capturing audio when chatting within a team in online games, etc.
- When playing a smartphone app game, set it so that spells are cast using actual audio.
- Capture audio of radio conversations used by police officers, security companies, taxis, forestry workers, etc.
-Capture audio when medical and care workers speak to patients, etc.
- Acquire voice spoken at the counter of an insurance or financial store when introducing a product.

（３）利用者にインセンティブを与えることによって利用者がメリットを得るため自主的に発話する音声を取得する場合
・スマホゲームにおいて、音声を発することにより、ゲーム内で使用できるアイテムを取得することができるように設定する。
・クーポン配布サイトへのアクセス時に、発話により通常よりも優待されたクーポンが得られるように設定する。(3) When obtaining voice that the user voluntarily speaks in order to obtain a benefit by giving the user an incentive. In a smartphone game, the system is set up so that the user can obtain an item that can be used in the game by making a sound.
- When accessing a coupon distribution site, set it up so that you can receive a more preferential coupon than usual by speaking.

（４）他人から、利用者にアクセスすることによって、利用者が発話する状況を作る場合
・一人暮らしの高齢者に対し、デイケア担当者、警備会社等から定期的に健康確認の電話をかけ、それに応答した音声を取得する。
・ペットとして会話ロボットを保有する利用者に対して会話ロボットが定期的に話しかけ、それに応答した音声を取得する。
・利用者が入院している場合、体温、脈拍、血圧とうの測定と共に、利用者が発話する音声を取得する。(4) Creating a situation in which the user can speak by accessing the user from another person. Day care staff, security companies, etc., can periodically call elderly people living alone to check on their health and record the voice responses they make.
- The conversational robot periodically speaks to a user who owns the conversational robot as a pet, and the audio response is captured.
- If the user is hospitalized, their body temperature, pulse, and blood pressure are measured, and their speech is recorded.

（５）利用者が定期的に通う場所において、発話することを提案し、音声を取得する。
・ジム、ヨガサロン、スイミングスクール、エステティックサロン、ダンススタジオ等、健康・美容志向の高い利用者に対して、血圧その他の測定と同様に健康管理の一環として音声を取得する。(5) Suggest speaking to the user in places the user regularly visits, and capture the voice.
- For health and beauty conscious customers at gyms, yoga salons, swimming schools, beauty salons, dance studios, etc., audio recording will be taken as part of health management, similar to blood pressure and other measurements.

（６）利用者が自身の健康管理のために、自主的、定期的に発話する音声を取得する場合。
多様な音声取得場面を設定することにより、継続的に音声取得が可能な環境を作ることが可能である。各人にとっては、音声取得場面が変化したとしても、上記のいずれかの状況があれば、継続的に音声を取得することができる。
〔９．配信データの一例〕(6) When a user voluntarily and regularly records speech for the purpose of managing his/her own health.
By setting up a variety of voice capture situations, it is possible to create an environment in which voice capture can be continued. For each person, even if the voice capture situation changes, voice can be continuously captured as long as any of the above conditions are met.
[9. Example of distribution data]

第３のサーバ２０６から配信される配信データの一例について紹介する。
〔９－１．健康時の出力〕 An example of distribution data distributed from the third server 206 will be introduced.
[9-1. Output when healthy]

健康管理システム２００は、健康時は、音声解析エンジンとしては、非医療分野での使用となるので、病気を判定するエンジンよりも、病気のリスクや心身状態の判定エンジンを用いることが主体となると考えられる。具体的には例えば、「健康」、「うつ症状を呈する」、「認知機能の低下を呈する」との３つに分類する音声判定エンジンを用いる。In the health management system 200, when the patient is healthy, the voice analysis engine is used in a non-medical field, and therefore it is considered that the engine for determining the risk of illness and mental and physical state is mainly used, rather than an engine for determining illness. Specifically, for example, a voice determination engine that classifies the patient into three categories: "healthy", "presenting depressive symptoms", and "presenting cognitive decline".

また、「健康」か「病気の可能性あり」か、を判定するエンジンと、病気の可能性としてそれは「うつ症状を呈するもの」であるか、「認知機能の低下を呈するもの」であるかを判定するエンジンを組み合わせて用いることが考えられる。
〔９－２．算出部２１１または推定部２１２の数値化例、および言語への変換例〕 It is also possible to combine an engine that determines whether a person is "healthy" or "possibly ill" with an engine that determines whether the possible illness is "presenting depressive symptoms" or "presenting cognitive decline."
9-2. Examples of quantification by the calculation unit 211 or the estimation unit 212 and examples of conversion into language

取得した音声を解析し、「健康」、「抑うつ症状を呈する」、「認知機能の低下を呈する」の３つに分類する場合、どれに属するかの確信度として合計が１となるように数値化する。The captured voice is analyzed and classified into three categories: "healthy," "exhibiting depressive symptoms," and "exhibiting cognitive decline." The confidence level of which category the person belongs to is calculated as a number that sums up to 1.

この時、「健康」を表す値を”Ｈ値”、抑うつ状態を呈する確信度の値を”Ｍ値”、認知症を呈する確信度の値を”Ｃ値”とすると、あるユーザーの音声を解析した場合に、解析装置（エンジン）による解析結果は例えば
Ｈ値０、Ｍ値０．７５、Ｃ値０．２５や、
Ｈ値０．５Ｍ値０．２５、Ｃ値０．２５などとなる。 In this case, if the value representing "health" is the "H value," the certainty value of a depressed state is the "M value," and the certainty value of dementia is the "C value," when analyzing the voice of a certain user, the analysis results by the analysis device (engine) may be, for example, H value 0, M value 0.75, and C value 0.25.
H value 0.5, M value 0.25, C value 0.25, etc.

解析装置により得られた数値は、そのまま出力してもよいが、ユーザーに分かりやすくするため、なんらかの言葉に変換して表示することができる。The numerical values obtained by the analysis device may be output as they are, but in order to make them easier for the user to understand, they may be converted into some kind of words and then displayed.

例えば、Ｍ値とＣ値がいずいれも０．３以下であれば、ユーザーは健康であると判断し、「心身は健康な状態です」と出力する。また、Ｍ値またはＣ値が０．３を超え～０．５以下の場合は、なにか心身の健康を維持、強化、改善するためのプログラムを試みることを提案する。For example, if both the M and C values are 0.3 or less, the system judges the user to be healthy and outputs the message, "Your body and mind are in a healthy state." If the M or C value is between 0.3 and 0.5, the system suggests that the user try a program to maintain, strengthen, or improve their physical and mental health.

例えば、言語への変換例として、「運動はストレス解消になりますよ。ジムやダンスは如何ですか？」、「くよくよすることがあったらマインドフルネスで解消しましょう」、「マッサージ、エステ、あかすりなど肌に触れるものは、気持ちが安らぎますよ」、「気分転換に旅行に行きましょう！」、「子ども心を発露すると持ち味が生かせるようになりますよ。パーティーやディスコは如何ですか？」等のフレーズが含まれる。For example, examples of phrases that can be converted into language include, "Exercise can help relieve stress. How about going to the gym or dancing?", "If you have something worrying, try relieving it with mindfulness", "Massages, beauty treatments, exfoliation, and other things that involve contact with the skin are soothing", "Let's go on a trip to change your mood!", and "If you let your childlike side out, you'll be able to make the most of your natural talents. How about a party or a disco?"

Ｍ値またはＣ値が０．５を超えた場合には、０．５を超えた分類について、ユーザーに対してフィードバックを行う。例えばＭ値が０．５値を超えた場合は、「精神力が弱っていませんか？」、「ストレスが高くなっていませんか？」、「頑張りすぎのようですね」、「対人関係でトラブルはありませんか？」「無理をしないことも必要です」などど、なにか気づくきっかけになるフィードバックを行う。
〔９－３．病気である可能性が出てきた場合〕 When the M value or C value exceeds 0.5, feedback is given to the user for the category exceeding 0.5. For example, when the M value exceeds 0.5, feedback is given that can trigger awareness, such as "Are you feeling weak mentally?", "Are you under a lot of stress?", "It seems like you're trying too hard", "Are you having trouble with interpersonal relationships?", "It's important not to push yourself too hard", etc.
[9-3. When the possibility of illness emerges]

更に、Ｍ値またはＣ値が０．５を超えた場合は、更に疾患の有無を判定する解析装置にかけ、その数値により別のフィードバックを加えることができる。Furthermore, if the M or C value exceeds 0.5, it can be subjected to further analysis to determine the presence or absence of disease, and further feedback can be provided based on the numerical value.

例えば、病気を判定する解析装置により、特定の疾患であることを示す確信度が０．７以上の場合に、その疾患が大うつ病、双極性障害、非定型うつ病など、抑うつ症状が主たる症状である場合は、「メンタルクリニックや心療内科へ行くことを勧めます。」、また確信度が０．７以上の場合に、その疾患がアルツハイマー型認知症、レビー小体型認知症、パーキンソン病など、脳の機能障害による認知機能の低下や運動機能の低下が主たる症状である場合は、「神経内科へ行くことをお勧めします」などのフィードバックを加えることができる。
〔９－４．医師が使用する場合〕 For example, if the certainty factor indicating that a particular disease is present is 0.7 or higher using an analysis device that determines the disease, and the disease is one in which depressive symptoms are the main symptom, such as major depression, bipolar disorder, or atypical depression, then feedback such as "We recommend that you visit a mental clinic or a psychosomatic medicine specialist" can be added.Also, if the certainty factor is 0.7 or higher and the disease is one in which cognitive or motor decline due to brain dysfunction is the main symptom, such as Alzheimer's disease, Lewy body dementia, or Parkinson's disease, then feedback such as "We recommend that you visit a neurologist" can be added.
[9-4. When used by doctors]

医師が使用する場合、通常の診断機器として、病気を判定するエンジンを用いる。なお、心身状態の深刻度または疾病の重症度を判定するエンジンを併せて用いることにより、治療の経過観察を行うことができる。また、患者の音声入力により経時的なデータ取得を可能にすることで、医師や看護師が直接患者宅へ訪問診療する労力が軽減でき、医療コストの削減が可能となる。
〔１０．全体的なシステムの解説〕 When used by doctors, the engine is used as a regular diagnostic device to determine illness. By also using an engine that determines the severity of the patient's physical and mental condition or the severity of the illness, treatment progress can be monitored. In addition, by enabling data to be acquired over time through voice input from the patient, the effort required for doctors and nurses to visit patients' homes in person can be reduced, which can lead to reduced medical costs.
[10. Overall System Description]

図１５は、本健康管理システム２００の全体像を示す概略図である。主要なシステムの流れを各ステップＳ３００１ないしＳ３０１２を用いて説明する。15 is a schematic diagram showing an overall image of the health management system 200. The main system flow will be explained using steps S3001 to S3012.

まず、ステップＳ３００１において、利用者からの音声入力が様々な利用シーンにおいてなされる。この様々な利用シーンには、利用者端末２０１からの音声入力の他にも、「８．音声取得の場面」の項目で挙げたシーンが含まれる。入力された音声データは、第１ないし第４のサーバを備える健康管理システム２００において、主に第１のサーバ２０４に保存される。同時に、利用者が利用者端末２０１を介して入力した識別情報が健康管理システム２００の第２のサーバ２０５に保存される。First, in step S3001, voice input from the user is performed in various usage scenarios. In addition to voice input from the user terminal 201, the various usage scenarios include the scenarios listed in the section "8. Situations in which voice is acquired." In the health management system 200 having first to fourth servers, the input voice data is stored mainly in the first server 204. At the same time, the identification information input by the user via the user terminal 201 is stored in the second server 205 of the health management system 200.

次に、入力された音声データを基に第４のサーバ２１０において音声の解析、測定、および推定処理が行われ、ステップＳ３００２において、推定結果が利用者端末２０１に送信される。また、ウェブサーバ２０２を経由して推定結果が非医療サービス機関２０２Ａへ送信される場合もある（Ｓ３００３）。この場合には、ステップＳ３００４において、非医療サービス機関２０２Ａから、第３のサーバ２０６に推定結果が送信される。なお図１５では、第３のサーバ２０６の機能の一部が健康管理システム２００とは別に記載されている。そして、ステップＳ３００５において、医療機関への受診を推奨する旨のメッセージが第３のサーバ２０６から利用者端末２０１へ送信される。Next, based on the input voice data, the fourth server 210 performs voice analysis, measurement, and estimation processing, and in step S3002, the estimation result is transmitted to the user terminal 201. In addition, the estimation result may be transmitted to the non-medical service institution 202A via the web server 202 (S3003). In this case, in step S3004, the estimation result is transmitted from the non-medical service institution 202A to the third server 206. Note that in FIG. 15, part of the function of the third server 206 is described separately from the health management system 200. Then, in step S3005, a message recommending a visit to a medical institution is transmitted from the third server 206 to the user terminal 201.

また、ステップＳ３００６、ステップＳ３００７では、利用者端末２０１と健康管理システム２００との間で、利用者の識別情報に関する管理情報が送受信される。In addition, in steps S3006 and S3007, management information relating to the user's identification information is transmitted and received between the user terminal 201 and the health management system 200.

その後、利用者は、施設を訪問する等して問診を受ける。The user then undergoes a medical interview by visiting the facility, etc.

次に、ステップＳ３００８において、健康管理システム２００は、第１のサーバ２０４に保存された音声データ、第４のサーバ２１０に保存された推定結果データを施設に送信することができる。施設は、問診・検査・診断時にそれらのデータを利用することができる。Next, in step S3008, the health management system 200 can transmit the voice data stored in the first server 204 and the estimation result data stored in the fourth server 210 to a facility. The facility can use the data when taking a medical interview, performing an examination, and making a diagnosis.

施設は、検査、診断をした後、診断結果を健康管理システム２００に情報提供する（ステップＳ３００９）。健康管理システム２００は、治療時や予後期間中も、第１のサーバ２０４に保存された音声データ、第４のサーバ２１０に保存された推定結果データを送受信し（ステップＳ３０１０、ステップＳ３０１１）、治療に役立てることができる。患者が寛解した後も、定期的に利用者の音声データを取得するなどして、経過観察を行うことができる。After the examination and diagnosis, the facility provides the diagnosis results to the health management system 200 (step S3009). The health management system 200 transmits and receives the voice data stored in the first server 204 and the estimated result data stored in the fourth server 210 (steps S3010 and S3011) during treatment and during the prognosis period, and can use them for treatment. Even after the patient goes into remission, the progress can be monitored by periodically acquiring the user's voice data.

また、ステップＳ３０１２では、音声データや推定結果の時系列データを様々な利用シーンにフィードバックして利用することができる。
〔１１．その他の解説〕 In step S3012, the voice data and the time-series data of the estimation result can be fed back and used in various usage scenarios.
[11. Other Commentaries]

その他本願発明に関する解説を行う。なお、本願発明は以降の解説内容に限定されるものではない。Other aspects of the present invention will be explained below. Note that the present invention is not limited to the following explanation.

図１６は、音声プラットフォーム装置または他システム（送受信、独自判定結果出力）または組込済みシステムを用いて利用者の登録を行う一例である。FIG. 16 shows an example of registering a user using a voice platform device or another system (transmission/reception, output of unique determination results) or an embedded system.

本装置を用いる流れは、
（１）登録した利用者の音声及びその他データを入力し利用者データ保存装置へ保存し、（２）保存した音声及び付随するその他データを選択し、
（３）利用サービスを選択し、
（４）サービスで用いる音声データ解析装置で解析し結果を利用者データ保存装置へ保存し、
（５）解析結果をもとに、その他のデータを踏まえてあらかじめ用意した判別マスタから判別結果を検索し、
（６）判別結果とあらかじめサービスで要求されている解析結果等データを送信し、
（７）サービス提供者またはシステムが受信し、行動支援に有益な出力またはサービスを利用者へ提供する、である。 The process for using this device is as follows:
(1) input the voice and other data of the registered user and store it in the user data storage device; (2) select the stored voice and other associated data;
(3) Select the service you wish to use,
(4) The voice data is analyzed using a voice data analysis device used in the service, and the results are stored in a user data storage device.
(5) Based on the analysis results, search for the discrimination results from the discrimination master prepared in advance based on other data,
(6) Transmit the results of the discrimination and any analysis results or other data required by the service.
(7) The service provider or system receives the data and provides the user with an output or service that is useful for behavioral support.

例えば、現在、スポーツジムやダイエットプログラムは肉体の健康増進を目的にサービスを提供している。これら健康増進施設では、運動等アクティビティトレーナー等従事者からのサービスを受け目標を設定し実行計画を立案している。他方、アスリートを中心にフィジカル面だけでなくメンタル面の強化改善を行うことにより効果的な目標の達成を目指すこともある。音声プラットフォーム装置または音声プラットフォーム装置の機能を組込み済システム用いることにより、メンタル面の強化に有効な情報活用を促すことが考えられる。For example, currently, sports gyms and diet programs provide services aimed at improving physical health. At these health promotion facilities, goals are set and action plans are drawn up through services from activity trainers and other professionals. On the other hand, some organizations, particularly athletes, aim to achieve effective goals by improving not only the physical aspects but also the mental aspects. By using an audio platform device or a system with the functions of an audio platform device built in, it is possible to encourage the use of information that is effective in improving the mental aspects.

図１７は、利用者の登録に関する流れを説明する。利用者は、使用開始の同意画面からデータの利用に関する説明を確認し、内容に同意する場合は利用開始ボタンを押す。利用開始ボタンを押下した後に、初回登録画面が表示される。初回登録画面に表示された必要事項（年齢、性別、既往歴等）を入力し入力完了ボタンを押す。入力完了ボタンを押した際に、入力で用いた機器を特定する符号が取得可能な場合は主に利用する端末識別番号として入力される。入力完了ボタンを押した後、利用者本人が画面に表示されたまたは自由発話の場合は自由発話を選択し、一方または両方の入力ガイドに沿って音声を入力する。声紋抽出に必要な音声が入力されたことが確認されたら音声入力完了を示す画面を表示し、確認ボタンを押下することで入力データが音声プラットフォーム装置へ確定情報として送信される。FIG. 17 explains the flow of user registration. The user checks the explanation of data usage from the consent screen for starting use, and if the user agrees to the contents, presses the start use button. After pressing the start use button, the initial registration screen is displayed. Input the necessary information (age, sex, medical history, etc.) displayed on the initial registration screen, and press the input completion button. When the input completion button is pressed, if a code that identifies the device used for input can be obtained, it is entered as the terminal identification number mainly used. After pressing the input completion button, the user himself selects "free speech" or "free speech" displayed on the screen, and inputs voice according to one or both input guides. When it is confirmed that the voice required for voiceprint extraction has been input, a screen indicating completion of voice input is displayed, and the input data is transmitted to the voice platform device as confirmed information by pressing the confirmation button.

例えば、利用者がメンタル面も含めたトレーニング計画の策定に際し、音声プラットフォーム装置または音声プラットフォーム装置の機能を組込み済システムの利用を希望する場合は、利用者登録を行う。この際に利用者情報として音声を取得する。For example, when a user wishes to use a voice platform device or a system incorporating the functions of a voice platform device to formulate a training plan that also includes mental aspects, the user registers. At this time, voice is acquired as user information.

図１８は、音声データの格納に関する流れを説明する。利用者が初回登録または随時音声データの入力を音声プラットフォーム装置に直接または他システムで取得した音声を音声プラットフォーム装置を経由して入力を行う。入力時確認を完了した音声データは音声プラットフォーム装置で一時保存された状態からデータ保存装置へ送信される。送信された音声および付随するデータは音声プラットフォーム装置を構成するデータ保存装置で受信しデータ保存装置のデータベースへ音声取得日時、利用端末および利用に伴うデータとともに保存される。保存した音声およびその他データは、利用者本人の操作または利用者が承認したサービスまたはサービス提供者等以外は検索、閲覧、出力、削除できない。登録時の同意に記載のある個人を一意に特定することが困難なデータセットは音声プラットフォーム装置を用いたサービスを提供する事業者から音声プラットフォーム装置の利用許可を得た事業者等へ閲覧およびメッセージの送受信を許可することができる。利用者が利用の停止や終了を音声プラットフォーム事業者へあらかじめ定められた手順で申し入れを行い処理が終了した後同意に基づき、または利用者が利用規約に違反したと音声プラットフォーム事業者が判断した場合は個人を特定可能なデータを（論理または物理的に）消去することができる。個人を一意に特定できないデータは、音声プラットフォーム事業者の統計情報として当該事業者が継続して利用することができる。FIG. 18 explains the flow of voice data storage. A user registers for the first time or inputs voice data at any time directly to the voice platform device, or inputs voice acquired by another system via the voice platform device. After completing the input confirmation, the voice data is sent from the state temporarily stored in the voice platform device to the data storage device. The transmitted voice and associated data are received by the data storage device constituting the voice platform device and stored in the database of the data storage device together with the voice acquisition date and time, the terminal used, and data associated with the use. The stored voice and other data cannot be searched, viewed, output, or deleted by anyone other than the user himself or the service or service provider approved by the user. Data sets that are difficult to uniquely identify individuals as described in the consent at the time of registration can be viewed and messages can be sent and received by businesses that have obtained permission to use the voice platform device from businesses that provide services using the voice platform device. When a user requests the voice platform operator to stop or terminate use in a predetermined procedure and the process is completed, the data that can identify an individual can be erased (logically or physically) based on consent, or if the voice platform operator determines that the user has violated the terms of use. Data that cannot uniquely identify individuals may continue to be used by voice platform operators as statistical information for those operators.

本プラットフォームで保存するデータは機微な個人情報を含むものであり、データ保存環境は暗号化やデータの分散等個人情報を保護するための措置を施しているものとなる。The data stored on this platform includes sensitive personal information, and the data storage environment will include measures to protect personal information, such as encryption and data distribution.

例えば、スポーツジムで利用申し込み時、計画策定時、計画実行中、計画終了時に活動記録の一環として音声データを入力する。音声プラットフォーム装置をスマートフォン等携帯端末機器で利用している場合は、スポーツジムのマイク以外に自身の機器で取得した音声も保存し活用することが可能となる。For example, voice data is input as part of an activity record when applying for use at a sports gym, when making a plan, during the execution of the plan, and when the plan is completed. If the voice platform device is used with a mobile terminal device such as a smartphone, it is possible to save and utilize voices captured by the user's own device in addition to those captured by the microphone at the sports gym.

音声取得方法の例としては、以下の様なものがある。Examples of audio acquisition methods include the following:

（１）利用者が健康増進等サービスを受ける際にフィジカルだけでなくメンタルの改善強化を目的として活動計画を立案する際に音声を入力する。(1) When a user receives health promotion services, voice input is used to create an activity plan aimed at improving not only the physical condition but also the mental condition.

（２）利用者が音声等データの保存や活用を希望し音声プラットフォーム装置の機能を用いてまたは音声プラットフォーム装置の機能を利用しているその他アプリやシステム等を用いてまたはそれらが接続されているウェアラブル機器等を経由して入力する。(2) When a user wishes to store or utilize voice or other data, the user inputs the data using the functions of the voice platform device, or using other apps, systems, etc. that utilize the functions of the voice platform device, or via wearable devices, etc. to which they are connected.

（３）事業主等が利用者本人の同意を得てあらかじめ業務動線の中（コールセンターの応対記録や運送業の連絡用無線または歌手がレコーディングやライブを行うまたは役者、声優等が演技を行うまたは医療、介護、健康増進サービス等で問診や看護や声掛けやカウンセリングまたは保険商品のセールスまたは小売店等での接客時等）で取得し、音声プラットフォーム装置で取扱い可能な形へ変換し音声プラットフォーム装置の機能を用いて入力する。(3) With the user's consent, employers etc. obtain the user's voice in advance during business activities (such as call center response records, communication radio communications in the transportation industry, or when a singer is recording or performing live, or when an actor or voice actor is performing, or when taking medical interviews, providing nursing care, speaking to customers or counseling in medical, nursing care or health promotion services, or when selling insurance products, or when serving customers at a retail store, etc.), convert it into a form that can be handled by the voice platform device, and input it using the functions of the voice platform device.

（４）事業者等がマーケティングや顧客ロイヤリティの改善または情報の取得等を目的にインセンティブを提供する際またはインセンティブの実行を行う場合、またはアプリケーション等において抽選によってゲーム内で用いるカードや仮想的な物品を購入する仕組み（いわゆるガチャ）あるいはスマートスピーカ等主に音声で操作を行う装置の機能利用またはスマートフォン等音声入力装置を持つ機器の操作またはパソコンやスマートフォン等機器のアプリケーション操作等実施時の操作として音声を入力する。(4) When businesses provide or implement incentives for the purpose of improving marketing or customer loyalty or obtaining information, or when using a system in an application, etc. to purchase cards or virtual items to be used in a game by lottery (so-called gacha), or when using the functions of a device that is mainly operated by voice, such as a smart speaker, or when operating a device with a voice input device such as a smartphone, or when operating an application on a device such as a computer or smartphone,

（５）音声入力を伴うアミューズメント（カラオケ等）やサービスの提供に音声を用いるあるいはサービスの説明や提供に伴い取得した音声（健康診断の一環としてまたは保険商品購入時の健康状態把握または介護や健康増進サービスの計画策定および実績の評価を目的として取得する音声等）を入力する。(5) Using voice for amusements (such as karaoke) or provision of services that involve voice input, or inputting voice obtained in connection with explaining or providing a service (such as voice obtained as part of a health check or for the purpose of understanding health status when purchasing an insurance product, or for the purpose of planning nursing care or health promotion services and evaluating the performance of such services).

（６）衣類やアクセサリの試着や着用時または自動車等輸送機器の乗車時または不動産売買に伴う家屋の閲覧や生活時またはその他サービスや財の試用や利用の際に音声を取得する。(6) Audio is collected when trying on or wearing clothes or accessories, when riding in a vehicle such as a car, when viewing or living in a home in connection with a real estate transaction, or when trying out or using other services or goods.

（７）ネット経由のサービス（ネットショッピング、遠隔医療、カウンセリング、教育、金融等商品のあっせん、就職や転職の支援サービス、異性との出会いをあっせんするマッチングアプリ等）。(7) Services provided via the Internet (online shopping, remote medical care, counseling, education, financial product referrals, employment and career change support services, matching apps that arrange meetings with members of the opposite sex, etc.).

（８）情報発信者の音声コンテンツや状態を伝えるときにパソコンやスマートフォン等機器を用いて音声を入力する。(8) When conveying the voice content or status of the information sender, the voice is input using a device such as a computer or smartphone.

（９）精神疾患や認知症系疾患の寛解後再発時により適切なタイミングで自身の変化を知ることを目的に日々音声を入力する。(9) Daily voice input is performed in order to know changes in one's own condition at a more appropriate time when a mental illness or dementia-related disease relapses after remission.

図１９は、識別情報を登録済の利用者の音声解析の例を説明する。利用者は、音声解析をあらかじめ保存したまたはサービス利用時に登録または取込みした音声データ及び付随データを用いた音声プラットフォーム装置の機能またはその他サービス提供者のサービス利用を依頼しサービス提供者が受けた際に、利用者のサービス利用に必要なデータをサービスで利用する解析装置へ音声およびその他サービスで必要となるデータを解析用データセットとして送信する。解析装置はサービス利用許可を確認ののち、受信したデータセットを用いて前処理し、解析し、解析結果を出力する。出力した解析結果データセットを音声プラットフォーム装置へ送信する。音声プラットフォーム装置で送信した解析用データセットの解析結果を受信し、結果を確認し、解析が期待したサービスレベルで完了している場合は完了通知を、完了していない場合は再解析依頼を解析装置にまたは解析用データセットの不備に起因するものであれば音声プラットフォーム装置へ通知として、またはいずれかの装置の予期しないエラーであればプラットフォーム装置のサービスを提供する事業者と利用者及び解析装置の管理者へエラーログを基にしたあらかじめ定められたメッセージを送信する。FIG. 19 illustrates an example of voice analysis of a user whose identification information has been registered. When a user requests the use of the voice platform device's functions or other service provider's services using voice data and associated data that have been previously saved or registered or imported when using the service, and the service provider receives the request, the voice and other data required for the service are transmitted to the analysis device that uses the data required for the user's use of the service as an analysis data set. After confirming permission to use the service, the analysis device preprocesses and analyzes the received data set, and outputs the analysis result. The output analysis result data set is transmitted to the voice platform device. The voice platform device receives the analysis result of the analysis data set transmitted by the voice platform device, checks the result, and if the analysis is completed at the expected service level, a completion notice is transmitted to the analysis device, and if it is not completed, a reanalysis request is transmitted to the analysis device, or if it is due to a defect in the analysis data set, a notice is transmitted to the voice platform device, or if it is an unexpected error of any device, a predetermined message based on the error log is transmitted to the operator providing the service of the platform device, the user, and the administrator of the analysis device.

例えば、各シーンで入力した音声は、音声プラットフォーム装置のデータ保存装置に本人の音声として保存される。保存された音声は、音声取得時のデータ確認または音声保存時に実行可能な声紋確認装置により声紋の本人性を表す数値とともに保存する。保存したデータは、利用者が選択したサービスにより解析に必要なデータセットをデータ保存装置から抽出し解析データセットを作成し音声プラットフォーム装置から解析装置へ送信される。For example, the voice input in each scene is stored as the person's own voice in the data storage device of the voice platform device. The stored voice is stored together with a voiceprint value that indicates the person's identity by a voiceprint verification device that can be executed when the voice is acquired or when the voice is saved. The stored data is extracted from the data storage device and a data set required for analysis is created by a service selected by the user, and the analysis data set is transmitted from the voice platform device to an analysis device.

解析装置は受信した解析データセットを前処理し、解析し、解析結果を送信する。解析結果を閲覧することで、メンタルの状態（メンタル値）や認知機能の状態（コグニティブ値）を、時系列に保存された音声に対し時系列で確認することができる。The analysis device preprocesses and analyzes the received analysis data set, and transmits the analysis results. By viewing the analysis results, the mental state (mental value) and the state of cognitive function (cognitive value) can be confirmed in chronological order for the voice recorded in chronological order.

図２０は、判別に関する例を説明する。利用者は、音声解析結果の判定をあらかじめ保存したまたはサービス利用時に登録または取込みした音声データ及び付随データの解析結果を用いた音声プラットフォーム装置の機能またはその他サービス提供者のサービス利用を依頼しサービス提供者が受けた際に、利用者のサービス利用要望に基づき設定した判別条件に基づき判別に必要なデータをサービスで利用する判別装置へ音声解析結果およびその他サービスで必要となるデータを判別用データセットとして抽出し送信する。判別装置は、あらかじめ登録されている音声解析結果等のデータに対応する判別結果データ保存装置から対応する判別結果を検索しデータ判別結果として、サービスであらかじめ定められた出力先および音声プラットフォームデータ保存装置へ出力する。20 illustrates an example of discrimination. When a user requests the use of a voice platform device function or other service provider service using the analysis results of voice data and associated data that have been previously stored or registered or imported when using a service, and the service provider accepts the request, the voice analysis result and other data required for the service are extracted as a discrimination data set and transmitted to a discrimination device that uses data required for discrimination based on discrimination conditions set based on the user's service usage request. The discrimination device searches for a corresponding discrimination result from a discrimination result data storage device that corresponds to data such as a voice analysis result that has been registered in advance, and outputs the data discrimination result to an output destination and a voice platform data storage device that are predetermined for the service.

例えば、解析結果データをもとにメンタルの状態や認知機能の状態をよりよく理解するため指標や情報を判別する。利用者は判別結果から自身の状態を時系列で確認することにより、今後対策すべきソリューションまたはサービスを客観的な情報を基に検討可能となる。For example, based on the analysis results data, indicators and information are identified to better understand the state of mental health and cognitive function. By checking the chronological status of the user from the results of the identification, the user can consider future solutions or services based on objective information.

図２１は、行動支援に関する例を説明する。利用者は、音声プラットフォーム装置に保存したまたはサービス利用時に登録または取込みした音声データ及び付随データの判別結果を用いた音声プラットフォーム装置の機能またはその他サービス提供者のサービス利用を依頼しサービス提供者が受けた際に、利用者が設定あるいは選択したまたはあらかじめサービス提供者がサービスの一部を構成するデータとして設定した情報活用条件に基づきサービスで利用する情報活用方法選択装置へ音声解析結果およびその他サービスで必要となるデータおよび判別結果を情報活用データセットとして抽出し送信する。21 illustrates an example of behavioral support. When a user requests the use of the voice platform device's functions or other service provider's services using the discrimination results of the voice data and associated data stored in the voice platform device or registered or imported when using the service, and the service provider accepts the request, the voice analysis results and other data and discrimination results required for the service are extracted as an information utilization data set and transmitted to an information utilization method selection device used in the service based on the information utilization conditions set or selected by the user or set in advance by the service provider as data constituting a part of the service.

情報活用方法選択装置は、あらかじめ登録されている情報活用方法から受信したデータセットに対応する情報活用方法データ保存装置から対応する情報活用方法を検索し情報活用方法選択結果として、サービスであらかじめ定められた出力先および音声プラットフォームデータ保存装置へ出力する。The information utilization method selection device searches for a corresponding information utilization method from an information utilization method data storage device that corresponds to the received data set from pre-registered information utilization methods, and outputs the information utilization method selection result to an output destination predetermined by the service and to the voice platform data storage device.

出力結果をもとに情報活用を行った結果を受信する。受信した情報活用結果は音声プラットフォームデータ保存装置へ送信する。音声データプラットフォームデータ保存装置は、受信した情報活用結果を選択した情報活用方法の結果として保存する。The result of information utilization based on the output result is received. The received information utilization result is transmitted to the voice data platform data storage device. The voice data platform data storage device stores the received information utilization result as a result of the selected information utilization method.

例えば、判別結果をもとにお勧めのトレーニングメニューや疾病予防等にも有効なカウンセリングあるいは食事（に限らずプラットフォーム事業者やサービス提供者の持つ情報等）のフィードバック等情報の検索結果を表示することで計画の進捗や新たな支援の機会あるいは疾病予防の行動を行う機会等を得ることができる。For example, by displaying search results for information such as recommended training menus, counseling that is also effective for disease prevention, or feedback on diet (and other information held by platform operators and service providers), based on the results of the discrimination, it is possible to obtain progress on plans, opportunities for new support, or opportunities to take action to prevent disease.

図２２Ａ及び図２２Ｂは、システムに関する流れを説明する。医療施設等サービス提供者は、組込済みシステム導入時および登録情報の変更が発生する場合は、施設情報や従事者の情報等を登録する。あわせて組込み済システムで利用する検査等サービスの条件についてあらかじめ定められた設定からの組み合わせまたは管理者等による設定あるいはサービス提供者が既に保有するシステムからの接続等によりデータの共有や相互入出力を行うことも可能となる。22A and 22B explain the flow of the system. When the embedded system is introduced or when changes to the registered information occur, the service provider such as a medical facility registers facility information, employee information, etc. In addition, it is possible to share data and perform mutual input and output by combining the conditions of the examination and other services used in the embedded system from a combination of settings that are predetermined, settings by an administrator, or a connection from a system already owned by the service provider.

利用者は事前に本件発明を用いるサービスまたは施設の利用者として利用者登録（医療であれば患者として医療施設に情報登録され、診察券等が発行されている状態、あるいは社会保険に加入し社会保険番号が発行されている等）を行っている者を対象とする。The target users are those who have previously registered as users of the service or facility that uses the present invention (in the case of medical care, this means that their information is registered as a patient at a medical facility and they have been issued a patient card, or they have joined social insurance and been issued a social insurance number, etc.).

音声データプラットフォーム装置に診察券関連データや社会保険関連データを保存することで、スマートフォン等機器やＩＣカード等の技術や非接触通信等の技術を用いることで本人確認やデータの交換等を実現することも可能となる。By storing medical card-related data and social insurance-related data in the voice data platform device, it will be possible to perform identity verification and data exchange, etc., by using devices such as smartphones, IC card technology, and contactless communication technology.

組込み済みシステムの初回利用開始時に、利用者は利用条件を確認し承諾するか否か判断することができる。利用条件を承諾後、利用者の情報を利用者本人または親族等または利用者または親族等から依頼を受けた医療施設スタッフあるいはシステムを用いて利用者情報を登録する。あらかじめ利用者情報が病院システム等で保存している場合は事前に承諾を受けたのち事後に利用者情報を登録することも可能となる。利用者本人のスマートフォン等機器を用いて利用者情報や音声等の入力を行う場合には、入力端末の情報も取得することが可能となる。また、声紋による本人性の確認を行う機能を組み込むことによっても本人確認が可能となる。利用同意及び利用者情報の登録結果は組込み済システムのデータ保存装置に保存され、保存結果は利用者とサービス提供者向けに電子データまたは紙またはその他形式により出力する。When the user first starts using the embedded system, the user can check the terms of use and decide whether or not to accept them. After accepting the terms of use, the user's information is registered by the user himself/herself or his/her relatives, or by the medical facility staff requested by the user or his/her relatives, or by the system. If the user information is stored in a hospital system or the like in advance, it is also possible to register the user information after receiving prior consent. If the user inputs user information or voice, etc. using a device such as a smartphone, it is also possible to obtain information on the input device. In addition, identity verification is possible by incorporating a function to verify the identity of the user using voiceprints. The consent to use and the results of the registration of the user information are stored in the data storage device of the embedded system, and the results are output to the user and the service provider as electronic data, paper, or other format.

利用者の登録後、利用者または書面あるいは組込み済システム等を利用して利用者あるいは親族等の指示を受けたサービス提供者は、サービス（解析および判別および情報活用条件等）を選択し当該装置のマイク等音声入力装置またはあらかじめ音声プラットフォーム装置に保存している音声等データを取り込むことで音声等データを入力する。After the user is registered, the service provider, who has received instructions from the user or a relative, etc., either in writing or using an embedded system, etc., selects a service (analysis, discrimination, conditions for information utilization, etc.) and inputs voice data, etc., by capturing voice data, etc., stored in advance in a voice platform device or using a voice input device such as a microphone of the device.

解析対象の音声データ入力後、サービスの条件を満たすデータか否か確認を行い、サービスの条件を満たしていれば組込み済システムのデータ保存装置へ、満たしていなければ対応するメッセージ等および再度入力画面を出力する。After inputting the voice data to be analyzed, the data is checked to see if it meets the conditions of the service. If it does, it is sent to the data storage device of the embedded system; if it does not, a corresponding message or the like is output and the input screen is displayed again.

音声データが正常に入力されたことを確認後、サービスの条件に従いあらかじめ定められた解析装置へ音声データを出力する。解析装置は音声データの正常受信確認後解析に必要な前処理を行い音声データの解析を実行し解析結果をデータ保存装置へ出力する。After confirming that the voice data has been input correctly, the voice data is output to a predetermined analysis device in accordance with the terms of the service. After confirming that the voice data has been received correctly, the analysis device performs pre-processing necessary for analysis, analyzes the voice data, and outputs the analysis results to a data storage device.

判別装置は、保存した解析結果を含む判別に必要なデータをデータ保存装置から抽出し、あらかじめ設定した判別結果データ保存装置から対応するデータ判別結果を選択する。データ判別結果は、サービスで設定された情報活用方法選択装置及びデータ保存装置へ向けて出力される。情報活用方法選択装置は入力されたデータ判別結果をもとにあらかじめサービスで設定された内容に基づき情報活用方法を選択し、選択結果を情報活用方法で定義した出力先およびデータ保存装置へ出力する。情報活用の結果は、情報活用装置で受信し、データ保存先へ出力する。The discrimination device extracts data necessary for discrimination, including the stored analysis results, from the data storage device, and selects a corresponding data discrimination result from a previously set discrimination result data storage device. The data discrimination result is output to an information utilization method selection device and data storage device set in the service. The information utilization method selection device selects an information utilization method based on the contents previously set in the service based on the input data discrimination result, and outputs the selection result to the output destination and data storage device defined in the information utilization method. The information utilization result is received by the information utilization device and output to the data storage destination.

組込み済システムのデータ保存装置に保存したデータは、利用者本人の求めに応じあらかじめ登録したスマートフォン端末、磁気記憶装置を持つＩＣカード、非接触通信等を用いてデータを取得することが可能となる。また、組込み済システムから利用者の同意に基づき特定のサービスで利用した結果も、組込済みシステムの機能を経由して取得することが可能となる。The data stored in the data storage device of the embedded system can be retrieved at the request of the user using a pre-registered smartphone terminal, an IC card with a magnetic storage device, contactless communication, etc. In addition, the results of using a specific service from the embedded system with the user's consent can also be retrieved via the embedded system's functions.

本発明の健康管理システムは、例えば、以下の様な場面での利用が可能となる。
最近元気がないと周囲から指摘され、自身も何事においても意欲が低く単純なミスが増えてきた利用者が家族に連れられて近所の一般内科を受診した。一般内科は精神疾患や認知症の専門医ではなく治療方針を決めにくい、あるいは治療効果が上がる治療方針にたどり着くまでに長い期間がかかることもある。ところが、組込み済システムを導入しているこの一般内科を受診した利用者は、事前問診時に認知機能の簡易的な測定とともに音声の検査を指示された。本人は意欲低下気味なので、同伴の家族とともに利用規約を確認し音声による検査の実施に同意した。本人の音声入力以外は家族が対応したため、本人の負荷も少なく診察室に入る前には医師が音声検査の結果を確認した。 The health care system of the present invention can be used in the following situations, for example.
A patient who had been told by people around him that he had been feeling low recently, had low motivation in everything he did, and was making more and more simple mistakes, was taken by his family to a nearby general internal medicine clinic. General internal medicine clinics are not specialists in mental illnesses or dementia, so it can be difficult to decide on a treatment plan, or it can take a long time to arrive at a treatment plan that will be effective. However, when the patient visited this general internal medicine clinic, which has a built-in system, he was instructed to take a voice test as well as a simple measurement of cognitive function during the preliminary interview. Since the patient's motivation was declining, he and his family members confirmed the terms of use and agreed to the voice test. Since the family members handled everything except the patient's voice input, the burden on the patient was low, and the doctor checked the results of the voice test before entering the examination room.

診察時には、認知機能の低下も見られたが、音声検査の結果も踏まえてうつ系の疾患である大うつ病の可能性が高いと診断し治療計画を立て、本人と家族も同意した。施設訪問時以外は服薬の前後で本人のスマートフォンで音声プラットフォーム装置と連動する治療支援アプリを用いて音声を取得した。通院時に院外での音声データと状態の変化を確認し、時系列の音声データをスマートフォンから組込み済システムへ非接触型の通信機能を用いて取り込むことで組込み済システムで再度検査した。その際に、過去の通院時に取得した音声データ及び医療データを本人のデータとして音声プラットフォーム装置上のデータ保存装置へデータを保存した。院外の状態から治療効果が想定より低い可能性が示唆されたため、治療方針を修正し処方を切り替えた。症状の改善が進み職場でも徐々に改善が見られたとき、遠方への転勤が発生した。治療を継続するため、転居先から通院可能かつ音声プラットフォーム装置を採用している病院を音声プラットフォーム装置で検索し、紹介状を現在の通院先で作成の上、これまでの治療経緯をデータで自身の音声プラットフォーム装置へ保存した。データがあることにより新たな病院でも切れ目なく治療が進められた。治療効果も上がってきて症状の改善が見られたため、通常の診察だけでは発見しにくい減薬のタイミングを検知した。症状が安定しているときは服薬量を自身で調整してよいことになり、音声データを確認しつつ自身で通院と通院の間でも減薬することができた。日中変動を考慮したきめ細やかな減薬の効果もあり、寛解に至った。精神疾患は寛かい後の再発率も高いが、音声プラットフォーム装置を用いて自身の状態を時系列で把握することによりメンタルに良い影響を与えるアクティビティをデータで見つけることができたため安定していたが、業務量が一時的に増加した際にメンタル値の落ち込みを確認したため再度受診した。症状が悪くなる前に受信したため、認知行動療法を中心により短期の通院で寛解を迎えた。At the time of the examination, the patient's cognitive function was also impaired, but based on the results of the voice test, the patient was diagnosed with a high possibility of major depression, a depressive disorder, and a treatment plan was made, which the patient and her family agreed to. Except when visiting the facility, the patient's voice was recorded using a treatment support app linked to the voice platform device on her smartphone before and after taking medication. During hospital visits, the voice data and changes in condition outside the hospital were confirmed, and the time-series voice data was imported from the smartphone to the built-in system using a non-contact communication function, and then re-examined using the built-in system. At that time, the voice data and medical data obtained during past hospital visits were saved as the patient's data in the data storage device on the voice platform device. Since the condition outside the hospital suggested that the treatment effect may be lower than expected, the treatment plan was revised and the prescription was changed. When the symptoms improved and gradual improvement was seen at work, the patient was transferred to a distant location. In order to continue treatment, the voice platform device was used to search for hospitals that could be visited from the new address and that used the voice platform device, a referral letter was created at the current hospital, and the history of treatment to date was saved as data on the patient's own voice platform device. The data allowed the patient to continue treatment without interruption at the new hospital. The treatment was more effective and the patient's symptoms improved, so the timing for tapering was detected, which would be difficult to detect with a regular examination alone. When symptoms were stable, the patient was allowed to adjust the dosage themselves, and was able to tapering between visits while checking the audio data. The detailed tapering, which took into account daily fluctuations, was effective, and the patient went into remission. Although the relapse rate of mental illnesses is high after remission, the patient was able to use the audio platform device to grasp his condition chronologically, and was able to find activities that had a positive effect on his mental health through data, which helped him to remain stable. However, when his workload temporarily increased, he noticed a drop in his mental health values, so he went back to the hospital. Because the consultation was received before his symptoms worsened, he went into remission with shorter visits, mainly cognitive behavioral therapy.

図２３では、データの送受信、音声プラットフォーム、音声解析装置、組込済みシステムの説明を行う。FIG. 23 explains data transmission and reception, the voice platform, the voice analysis device, and the embedded system.

「３００．他システム（データ送信のみ）」では、利用者を特定した音声データを取得し音声プラットフォーム装置へ送信する。音声データは、スマホ、スマートスピーカ、カラオケ、業務無線等で取得可能である。結果は音声プラットフォーム装置で確認できる。"300. Other systems (data transmission only)" acquires voice data that identifies the user and transmits it to the voice platform device. Voice data can be acquired from smartphones, smart speakers, karaoke, business radios, etc. The results can be confirmed on the voice platform device.

「４００．他システム（送受信、独自判定結果出力）」では、音声取得と判定結果出力を本装置で行う。判別結果をもとに本装置独自の判別結果へ変換可能である。トレーニングやケアの予実策定支援システムや、勤怠等業務管理システムで個別の要件があるものが該当する。In "400. Other systems (transmission and reception, output of unique judgment results)," voice acquisition and judgment result output are performed by this device. The judgment results can be converted to the device's own judgment results based on the judgment results. This applies to systems that have individual requirements for training and care budget planning support systems and attendance and other business management systems.

「１００．音声プラットフォーム装置」では、本装置単体で録音、サービス選択、結果確認、履歴確認、情報閲覧、データ二次利用支援等を利用可能。音声取得や判別結果のフィードバックで固有の制約や要件がある場合は、本装置以外で取得した音声データの受信や解析結果をもとに本システムで判別したデータを特定の外部システムへ出力可能となる。なお、本装置以外での処理に際しても、利用者本人の同意は本装置または本装置が提供する手順、機能で取得処理を行う必要がある。With "100. Voice Platform Device," the device itself can record, select services, check results, check history, view information, and support secondary data use. If there are specific constraints or requirements for voice acquisition or feedback of discrimination results, data determined by this system based on the results of receiving and analyzing voice data acquired outside of this device can be output to a specific external system. Note that even when processing outside of this device, the consent of the user must be obtained using this device or the procedures and functions provided by this device.

「２００．音声解析装置」では、音声プラットフォーム装置から受信した音声データを前処理し、解析し、解析結果をあらかじめ定められた様式で音声プラットフォーム装置へ送信する機能を持つ。解析は、音声データとともに受信したサービス情報により解析処理実施の可否を判定する。解析実施後、特定の疾患可能性やメンタル値やコグニティブ値等の結果を送信し、音声プラットフォーム装置で受信が完了したことまでの過程における実行履歴を保存する。解析後、音声データは自動的に削除される。"200. Voice analysis device" has the function of preprocessing and analyzing the voice data received from the voice platform device, and transmitting the analysis results to the voice platform device in a predetermined format. The analysis determines whether or not to perform the analysis process based on the service information received along with the voice data. After the analysis is performed, the results such as the possibility of a specific disease, mental values, cognitive values, etc. are transmitted, and the execution history of the process up to the completion of reception by the voice platform device is stored. After analysis, the voice data is automatically deleted.

「５００．組込済みシステム」では、医療等データのポータビリティやサービス提供者の意図及び各種規制等の制約により音声プラットフォーム装置への直接接続を行いたくない場合には、個別のシステムへ本装置の機能を組み込むことにより音声の取得から判別結果の出力が可能である。医療施設等のシステムで管理した情報をＮＦＣやＢｌｕｅｔｏｏｔｈ（登録商標）技術等を用いた非常時接続などを利用して持ち出す場合には、音声プラットフォーム装置で入力可能なフォーマット仕様を満たす形で出力することにより音声プラットフォーム装置へ取り込むことが可能である。In the case of "500. Embedded system", when a direct connection to the voice platform device is not desired due to restrictions such as portability of medical data, the intention of the service provider, and various regulations, the functions of this device can be embedded in an individual system to obtain voice and output the determination results. When information managed in a system of a medical facility or the like is to be taken out using an emergency connection using NFC or Bluetooth (registered trademark) technology, the information can be imported into the voice platform device by outputting it in a form that meets the format specifications that can be input into the voice platform device.

音声プラットフォーム装置及び組込み済システムのデータ保存先へ各ステップで出力されたデータは、利用者ごと抽出・選択結果および結果を紐付けたレコードとして保存される。保存結果は、本人同意の範囲内で再利用を行うことが可能となる。The data output at each step to the data storage destination of the voice platform device and the embedded system is stored as a record linking the extraction and selection results for each user. The stored results can be reused within the scope of the individual's consent.

保存されたデータは、利用同意が得られた範囲で個人を特定する情報を除き第三者が音声プラットフォーム装置の事業者の許可を得た場合に限り検索または閲覧または統計情報として抽出等を行うことが可能となる。個人を特定しない状態で第三者が設定した一定の条件を満たす利用者に対し、利用者が希望する場合は、個人情報利用の目的とともに一部または全部のデータの開示または提供を行うこと、または事業者等からの開示請求に対して応答することや、例えば治験ボランティアのリクルーティング等を行うことが音声データプラットフォームを介して可能となる。The stored data, excluding information that can identify individuals to the extent that consent for use has been obtained, can be searched, viewed, or extracted as statistical information only if a third party has permission from the operator of the voice platform device. For users who meet certain conditions set by a third party without being identified as individuals, if the user so wishes, it is possible to disclose or provide some or all of the data along with the purpose of use of the personal information, or to respond to requests for disclosure from operators, etc., or, for example, to recruit volunteers for clinical trials, via the voice data platform.

図２４では、音声プラットフォーム装置の説明を行う。本装置は音声取得部および取得音声確認部、取得音声送信部、データ保存部、解析データセット抽出部、解析データセット送信部、解析結果データ受信部、解析結果判別装置（判別対象データ抽出および判別結果検索および判別結果出力等が含まれる）、情報活用方法選択装置（選択対象データ抽出および情報活用方法検索および検索結果出力等が含まれる）および各装置の利用者および管理者等利用画面等から構成される。なお、前記データ保存部に保存されるデータとしては例えば、利用者データ、サービス選択データ、音声データ、サービス利用履歴、解析データ、判別データ、情報活用方法選択結果および情報活用実施データ等からなる利用者データ。サービス提供者、情報活用方法およびサービスの価格表等からなる各種マスタデータ。解析エンジンの種類、接続方法、接続設定および接続履歴等からなる解析エンジン利用データ。公開用ミニマムデータセット、閲覧履歴、データ利用申し入れ履歴、データ利用承認履歴、データ貸与期間および貸与データ削除履歴等からなる二次利用データ等。トランザクションデータ、マスタデータ、各種設定および利用実行履歴等の各種データがあげられる。
そして、この音声プラットフォーム装置は、本人または本人が認めた者または事業者による音声を中心とした安全性が高くこれまでにないデータの利活用を実現する新たな装置である。 FIG. 24 describes the voice platform device. This device is composed of a voice acquisition unit, an acquired voice confirmation unit, an acquired voice transmission unit, a data storage unit, an analysis data set extraction unit, an analysis data set transmission unit, an analysis result data reception unit, an analysis result discrimination device (including discrimination target data extraction, discrimination result search, and discrimination result output, etc.), an information utilization method selection device (including selection target data extraction, information utilization method search, and search result output, etc.), and a user and administrator use screen of each device, etc. In addition, the data stored in the data storage unit includes, for example, user data consisting of user data, service selection data, voice data, service utilization history, analysis data, discrimination data, information utilization method selection result, and information utilization implementation data, etc. Various master data consisting of service providers, information utilization methods, and service price lists, etc. Analysis engine utilization data consisting of analysis engine types, connection methods, connection settings, and connection history, etc. Secondary utilization data consisting of public minimum datasets, browsing history, data utilization application history, data utilization approval history, data lending period, and lending data deletion history, etc. Various data such as transaction data, master data, various settings, and utilization execution history can be cited.
This voice platform device is a new device that enables highly secure and unprecedented utilization of data, mainly voice, by the individual himself/herself, or by a person or business authorized by the individual.

図２５では、音声解析装置の説明を行う。音声解析エンジン実行・管理環境および解析用音声入力装置および解析用音声前処理装置および音声解析装置および解析結果出力装置および出力先受信結果確認装置および解析データセットと解析結果削除装置および解析エンジン利用実績取得装置および利用履歴取得補完装置および利用者別エンジン別利用実績から費用を計算し出力する装置および接続先別接続方法設定装置等からなる。The voice analysis device is explained in Fig. 25. It is composed of a voice analysis engine execution and management environment, an analytical voice input device, an analytical voice preprocessing device, a voice analysis device, an analysis result output device, an output destination reception result confirmation device, an analysis data set and analysis result deletion device, an analysis engine usage record acquisition device, a usage history acquisition and complement device, a device for calculating and outputting costs from the usage record by user and engine, and a connection method setting device by connection destination.

複数の解析エンジンをリクエストに応じて選択実行可能かつ個人情報を本人の求めに応じて解析し解析後は依頼元システムやサービスで保有する利用者データの一部のみを持つことにより個人を特定しない形でデータ解析を多面的に行うことが可能な環境を提供する。To provide an environment in which multiple analysis engines can be selected and executed upon request, personal information can be analyzed according to the request of the individual, and after the analysis, data analysis can be performed in a multifaceted manner without identifying individuals by having only a portion of the user data held by the requesting system or service.

図２６、図２７Ａ及び図２７Ｂでは、他システムの説明を行う。26, 27A and 27B, other systems will be described.

音声プラットフォーム装置の一部機能（音声データ取得、音声データ確認、判別、情報活用方法検索等）を持つシステム。独自のセキュリティやユーザビリティ等に対する要件を満たす必要がある場合や個々のサービサーが既に保有するシステムを活用したサービスの提供を目的とした利用形態。取得、確認後の音声は音声プラットフォームへ送信され、解析および判別および情報活用方法検索の結果は音声プラットフォーム装置であらかじめ定められたデータフォーマットで受信する。A system that has some of the functions of a voice platform device (acquiring voice data, checking voice data, discriminating, searching for ways to utilize information, etc.). A form of use aimed at providing services that utilize systems already owned by individual service providers, or when it is necessary to meet unique requirements for security, usability, etc. After acquisition and confirmation, the voice is sent to the voice platform, and the results of analysis, discrimination, and searching for ways to utilize the information are received in a data format predetermined by the voice platform device.

独自の仕様で音声を取得している場合に、取得済み音声をあらかじめ定められた仕様で出力することにより音声プラットフォーム装置への音声データ入力を可能にする。音声データ入力後は音声プラットフォーム装置の機能を使う。When voice is acquired according to a unique specification, the acquired voice can be output according to a predetermined specification, enabling voice data input to the voice platform device. After the voice data is input, the functions of the voice platform device can be used.

主に医療施設等音声プラットフォーム装置へのネットワーク接続ができない場合の利用形態を示すもので、利用同意取得から結果の出力までを行う独立したシステム装置。音声プラットフォーム装置からのデータ入力や音声プラットフォーム装置へのデータ出力はインターネットを介さずスマートフォン等の無線技術（Ｂｌｕｅｔｏｏｔｈ（登録商標），
ＮＦＣ，ＡＩｒＤｒｏｐ等）あるいはメモリつきＩＣカード等を介して実現することが可能である。 This refers to a usage pattern when a network connection to a voice platform device is not possible, such as in a medical facility, and is an independent system device that handles everything from obtaining consent to use to outputting results. Data input from the voice platform device and data output to the voice platform device are not transmitted via the Internet, but through wireless technology such as a smartphone (Bluetooth (registered trademark),
This can be realized via a communication technology such as NFC, AirDrop, etc., or an IC card with memory.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。Although some of the embodiments of the present application have been described in detail above with reference to the drawings, these are merely examples, and the present invention can be embodied in other forms that incorporate various modifications and improvements based on the knowledge of those skilled in the art, including the aspects described in the Disclosure of the Invention section.

本出願は、２０１９年１２月２４日に出願された日本出願である特願２０１９－２３２５０４号に基づく優先権を主張し、当該日本出願に記載された全ての記載内容を援用するものである。This application claims priority based on Japanese Patent Application No. 2019-232504, filed on December 24, 2019, and incorporates by reference all of the contents of that Japanese application.

２００健康管理システム
２０４第１のサーバ
２０５第２のサーバ
２０６第３のサーバ
２１０第４のサーバ
２１１算出部
２１２推定部
２１３選択部 200 Health management system 204 First server 205 Second server 206 Third server 210 Fourth server 211 Calculation unit 212 Estimation unit 213 Selection unit

Claims

a first server for continuously storing voice data acquired from a user;
a second server for storing user identification data;
a third server for storing output data for users; and
Estimation device,
A health management system comprising:
The estimation device includes:
A calculation unit that calculates an acoustic feature based on the speech data;
an estimation unit that estimates a predetermined disease using the acoustic feature as an input; and a selection unit that selects output data of the third server based on an estimation result of the estimation unit and identification data of the user.
Equipped with
The selection unit is
when the estimation unit estimates that there is no suspicion of the subject suffering from the predetermined disease, the selection unit selects, from the third server, first output data including distribution data for healthy individuals;
When the estimation unit estimates that the patient is suspected of having the predetermined disease, the selection unit selects second output data including a message from the third server encouraging the patient to visit a medical institution,
After the user is diagnosed by a doctor as suffering from a predetermined disease, the identification data of the second server is updated, and the selection unit is configured to select third output data including distribution data for the user after the user is diagnosed with the disease from the third server,
The health management system wherein the estimation unit estimates at least one disease from a group of diseases including Alzheimer's disease, Lewy body dementia, Parkinson's disease, atypical depression, and bipolar disorder.

The health management system of claim 1, wherein the estimation unit is also capable of estimating major depression.

The estimation unit is capable of classifying the subject into a dementia group including Lewy body dementia, Alzheimer's disease, and Parkinson's disease, and a mood disorder group including major depression, bipolar disorder, and atypical depression.
The health management system according to claim 2 .

The health management system according to claim 1 or 2, wherein the estimation unit is capable of estimating Lewy body dementia.

The health management system according to claim 1 or 2, wherein the estimation unit is capable of estimating Alzheimer's dementia.

The health management system according to claim 1 or 2, wherein the estimation unit is capable of estimating Parkinson's disease.

The health management system according to claim 1 , wherein the estimation unit is capable of estimating bipolar disorder.