JP7619094B2

JP7619094B2 - Speech control program, speech control method, and speech control device

Info

Publication number: JP7619094B2
Application number: JP2021036275A
Authority: JP
Inventors: 憲彦坂本
Original assignee: エフサステクノロジーズ株式会社
Priority date: 2021-03-08
Filing date: 2021-03-08
Publication date: 2025-01-22
Anticipated expiration: 2041-03-08
Also published as: JP2022136589A

Description

本発明は、発言制御プログラム、発言制御方法、および発言制御装置に関する。 The present invention relates to a speech control program, a speech control method, and a speech control device.

リモートワークの推進により、遠隔からのネットワークを介した音声会議が多く行われている。音声会議は、会議の参加者の各端末と会議をホストするサーバとが離れているため、音声が他者に伝わるまで遅延が発生する場合がある。 With the advancement of remote work, many audio conferences are being held remotely over networks. In audio conferences, because the devices of the participants are far from the server hosting the conference, there can be a delay before the audio is transmitted to others.

このため、発言者の発言が終わった際に発言を行っても、別の参加者も発言を始めていたり、または、発言者がさらに発言を続けていたりして、参加者同士の音声が重なり、会議の進行を妨げる場合がある。 As a result, even if a participant speaks when the speaker has finished, another participant may also begin speaking, or the speaker may continue speaking, causing the participants' voices to overlap and disrupting the progress of the meeting.

特開平８－９０６３号公報Japanese Patent Application Publication No. 8-9063 特開２００６－６０３６４号公報JP 2006-60364 A 特表２００２－５２２９９８号公報Special Publication No. 2002-522998 特開平６－１８９００５２号公報Japanese Unexamined Patent Publication No. 6-1890052

しかしながら、音声会議は、対面の会議とは異なり、参加者間でお互いの姿勢や表情などを観察することができないため、発言の間合いを取ることが困難である。そのため、音声会議において、参加者間の音声の重なりを抑止する技術が求められている。 However, unlike face-to-face meetings, audio conferences make it difficult for participants to observe each other's postures or facial expressions, making it difficult to time their speech. For this reason, there is a demand for technology that can prevent voices from overlapping in audio conferences.

音声の重なりを抑止する一つの手法として、例えば、発言者の発言優先度を、会議議題との関連度に基づいて、発言の優先度を決定する手法がある。しかしながら、当該手法は、静的情報に基づいて会議開始前に予め優先度を決定するものであり、実際の会議で起きるような、議事の進行、変化に伴って発言の優先度が高い発言者が変化していくケースには対応できない。 One method for preventing voice overlap is to determine the priority of each speaker's speech based on their relevance to the meeting agenda. However, this method determines the priority before the meeting starts based on static information, and cannot handle cases that occur in real meetings where the speaker with the highest priority changes as the agenda progresses and changes.

一つの側面では、音声会議における参加者間の音声の重なりを抑止できる発言制御プログラム、発言制御方法、および発言制御装置を提供することを目的とする。 In one aspect, the objective is to provide a speech control program, a speech control method, and a speech control device that can prevent overlapping of voices between participants in an audio conference.

第１の案では、発言制御装置に、所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、発言優先度に基づいて、参加者のうちの特定者以外の発言をミュートする処理を実行させる。 In the first proposal, the speech control device calculates speech priority for multiple participants who speak within a specified time period based on characteristic words in meeting-related email sentences and keywords acquired sequentially from the meeting audio, and performs a process to mute speech from all participants except for specific ones based on the speech priority.

一つの側面では、音声会議における参加者間の音声の重なりを抑止できる。 On one hand, it can prevent voice overlap between participants in an audio conference.

図１は、実施例１にかかる情報処理システムの全体構成例を示す図である。FIG. 1 is a diagram illustrating an example of an overall configuration of an information processing system according to a first embodiment. 図２は、実施例１にかかる発言制御装置１００の機能構成を示す機能ブロック図である。FIG. 2 is a functional block diagram of the speech control device 100 according to the first embodiment. 図３は、実施例１にかかる発言制御処理の全体の流れを示す図である。FIG. 3 is a diagram illustrating an overall flow of the speech control process according to the first embodiment. 図４は、実施例１にかかる過去の会議音声のワード抽出の一例を示す図である。FIG. 4 is a diagram illustrating an example of word extraction from past conference voice according to the first embodiment. 図５は、実施例１にかかる議題関連度の決定の一例を示す図である。FIG. 5 is a diagram illustrating an example of how topic relevance is determined according to the first embodiment. 図６は、実施例１にかかるメール文の特徴語抽出の一例を示す図である。FIG. 6 is a diagram illustrating an example of extracted characteristic words from an email text according to the first embodiment. 図７は、実施例１にかかる役職ポイントの決定の一例を示す図である。FIG. 7 is a diagram illustrating an example of determining position points according to the first embodiment. 図８は、実施例１にかかる基準となる発言優先度の算出の一例を示す図である。FIG. 8 is a diagram illustrating an example of calculation of a reference statement priority level according to the first embodiment. 図９は、実施例１にかかる会議音声のキーワード抽出の一例を示す図である。FIG. 9 is a diagram illustrating an example of keyword extraction from the conference voice according to the first embodiment. 図１０は、実施例１にかかる発言優先度に対する加点の一例を示す図である。FIG. 10 is a diagram illustrating an example of points added to a message priority level according to the first embodiment. 図１１は、実施例１にかかる発言制御の一例を示す図である。FIG. 11 is a diagram illustrating an example of speech control according to the first embodiment. 図１２は、実施例１にかかる発言制御表示の出力の一例を示す図である。FIG. 12 is a diagram illustrating an example of an output of a speech control display according to the first embodiment. 図１３は、実施例１にかかる発言優先度算出処理の流れを示すフローチャートである。FIG. 13 is a flowchart of the statement priority calculation process according to the first embodiment. 図１４は、実施例１にかかる発言制御処理の流れを示すフローチャートである。FIG. 14 is a flowchart of the speech control process according to the first embodiment. 図１５は、ハードウェア構成例を説明する図である。FIG. 15 is a diagram illustrating an example of a hardware configuration.

以下に、本願の開示する発言制御プログラム、発言制御方法、および発言制御装置の実施例を図面に基づいて詳細に説明する。なお、この実施例によりこの発明が限定されるものではない。また、各実施例は、矛盾のない範囲内で適宜組み合わせることができる。 Below, examples of the speech control program, speech control method, and speech control device disclosed in the present application are described in detail with reference to the drawings. Note that the present invention is not limited to these examples. Furthermore, the respective examples can be appropriately combined within a range that does not cause inconsistencies.

［全体構成例］
図１は、実施例１にかかる情報処理システムの全体構成例を示す図である。図１に示すように、実施例１にかかる情報処理システムは、発言制御装置１００と、参加者端末２００－１～２００－ｎ（ｎは任意の整数。以下、まとめて「参加者端末２００」という）とがネットワークＮを介して相互に通信可能に接続されるシステムである。なお、ネットワークＮには、有線や無線を問わず、イントラネットなどの各種通信網を採用できる。また、ネットワークＮは、単一のネットワークではなく、例えば、イントラネットとインターネットとがゲートウェイなどネットワーク装置やその他の装置（図示せず）を介して構成されてよい。 [Overall configuration example]
FIG. 1 is a diagram showing an example of the overall configuration of an information processing system according to a first embodiment. As shown in FIG. 1, the information processing system according to the first embodiment is a system in which a speech control device 100 and participant terminals 200-1 to 200-n (n is any integer; hereinafter, collectively referred to as "participant terminals 200") are connected to each other via a network N so as to be able to communicate with each other. Note that the network N can be any of a variety of communication networks, such as an intranet, whether wired or wireless. Furthermore, the network N is not a single network, and may be configured, for example, by connecting an intranet and the Internet via a network device such as a gateway or other device (not shown).

発言制御装置１００は、実施例１などを含む本実施形態の実行主体であり、例えば、音声会議の参加者が所属する企業などによって管理される、サーバコンピュータなどの情報処理装置である。または、発言制御装置１００は、クラウドコンピューティングサービスを提供するクラウドサービス提供者によって管理されるクラウドサーバ装置であってもよい。また、発言制御装置１００は、複数台のコンピュータで構成される分散型コンピューティングシステムであってもよい。 The speech control device 100 is the executing entity of this embodiment including Example 1, and is, for example, an information processing device such as a server computer managed by a company to which the participants of the audio conference belong. Alternatively, the speech control device 100 may be a cloud server device managed by a cloud service provider that provides cloud computing services. Furthermore, the speech control device 100 may be a distributed computing system composed of multiple computers.

発言制御装置１００は、参加者端末２００に対して音声会議サービスを提供し、参加者端末２００の各々から各参加者の音声を受信し、他の参加者の利用する参加者端末２００に各参加者の音声を配信する。また、発言制御装置１００は、音声と併せて、参加者端末２００のカメラ機能または参加者端末２００に接続されたＷｅｂカメラ装置などによって撮影された映像を受信および配信してもよい。 The speech control device 100 provides an audio conference service to the participant terminals 200, receives the voice of each participant from each participant terminal 200, and distributes the voice of each participant to the participant terminals 200 used by the other participants. In addition to the voice, the speech control device 100 may also receive and distribute video captured by the camera function of the participant terminal 200 or a web camera device connected to the participant terminal 200.

また、発言制御装置１００は、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、参加者同士の音声が重なった場合、発言優先度が最も高い参加者以外の発言をミュートする。 In addition, the speech control device 100 calculates speech priority based on characteristic words in the meeting-related email text and keywords sequentially acquired from the meeting audio, and when the voices of participants overlap, mutes the speech of participants other than the one with the highest speech priority.

参加者端末２００は、音声会議の参加者が利用する情報処理端末である。参加者端末２００は、参加者の会社や自宅などに設置されるデスクトップＰＣやノートＰＣであってもよいし、参加者によって携帯されるスマートフォンやタブレットＰＣなどのモバイル端末などであってもよい。 The participant terminal 200 is an information processing terminal used by the participants of the audio conference. The participant terminal 200 may be a desktop PC or a notebook PC installed in the participant's company or home, or may be a mobile terminal such as a smartphone or a tablet PC carried by the participant.

参加者端末２００は、参加者が音声会議に参加するためのマイクやスピーカー機能などを備え、または、これらの機能を提供する外部機器と接続される。また、参加者端末２００には、音声会議に参加するためのアプリケーションが予めインストールされる。または、参加者端末２００が、このようなアプリケーションと同等の機能を有する、発言制御装置１００上のＷｅｂアプリケーションを使用する場合は、参加者端末２００に当該アプリケーションはインストールされなくてもよい。 The participant terminal 200 is equipped with a microphone and speaker function for participants to participate in an audio conference, or is connected to an external device that provides these functions. In addition, an application for participating in an audio conference is pre-installed in the participant terminal 200. Alternatively, if the participant terminal 200 uses a web application on the speech control device 100 that has the same function as such an application, the application does not need to be installed in the participant terminal 200.

参加者は、参加者端末２００を介して、参加者同士の音声を互いに送受信することで、音声会議を実施できる。 Participants can hold an audio conference by sending and receiving audio between each other via the participant terminal 200.

［発言制御装置１００の機能構成］
次に、図１に示される発言制御装置１００の機能構成について説明する。図２は、実施例１にかかる発言制御装置１００の機能構成を示す機能ブロック図である。図２に示すように、発言制御装置１００は、通信部１１０、記憶部１２０、および制御部１３０を有する。 [Functional configuration of the speech control device 100]
Next, a description will be given of the functional configuration of the remark control device 100 shown in Fig. 1. Fig. 2 is a functional block diagram showing the functional configuration of the remark control device 100 according to the first embodiment. As shown in Fig. 2, the remark control device 100 has a communication unit 110, a storage unit 120, and a control unit 130.

通信部１１０は、参加者端末２００など、他の装置との間の通信を制御する処理部であり、例えば、ネットワークインタフェースカードなどの通信インタフェースである。 The communication unit 110 is a processing unit that controls communication with other devices, such as the participant terminal 200, and is, for example, a communication interface such as a network interface card.

記憶部１２０は、各種データや、制御部１３０が実行するプログラムを記憶する記憶装置の一例であり、例えば、メモリやハードディスクなどである。記憶部１２０は、メール情報１２１、スケジュール情報１２２、組織情報１２３、および会議情報１２４を記憶する。 The storage unit 120 is an example of a storage device that stores various data and programs executed by the control unit 130, such as a memory or a hard disk. The storage unit 120 stores email information 121, schedule information 122, organizational information 123, and meeting information 124.

メール情報１２１は、音声会議のインビテーションメールに対する参加者ごとの返信メールや、当該返信メールの本文から抽出される参加者ごとの特徴語などの情報を記憶する。メール情報１２１は、発言制御装置１００とは別のメールサーバなどの情報処理装置から取得されてよい。 The email information 121 stores information such as reply emails from each participant to the invitation email for the audio conference and characteristic words for each participant extracted from the text of the reply email. The email information 121 may be obtained from an information processing device such as a mail server other than the speech control device 100.

スケジュール情報１２２は、音声会議の参加者の会議などのスケジュール情報を記憶する。スケジュール情報１２２は、発言制御装置１００とは別のスケジュール管理サーバなどの情報処理装置から取得されてよい。 The schedule information 122 stores schedule information such as conferences of participants in an audio conference. The schedule information 122 may be obtained from an information processing device such as a schedule management server other than the speech control device 100.

組織情報１２３は、参加者の役職や役職ごとに設定される役職ポイントなどの情報を記憶する。組織情報１２３は、発言制御装置１００とは別の情報処理装置から取得されてよい。 The organizational information 123 stores information such as the participants' job titles and the job title points set for each job title. The organizational information 123 may be obtained from an information processing device other than the speech control device 100.

会議情報１２４は、参加者端末２００から受信される会議音声や、会議音声から抽出されるワードやワードの出現回数などの情報を記憶する。 The conference information 124 stores information such as the conference audio received from the participant terminal 200, words extracted from the conference audio, and the number of times each word occurs.

なお、記憶部１２０には、上記情報以外にも様々な情報を記憶できる。また、各情報のデータ構成は、上記内容に限定されない。 Note that the storage unit 120 can store various information other than the above information. Also, the data structure of each piece of information is not limited to the above content.

制御部１３０は、発言制御装置１００全体を司る処理部であり、例えば、プロセッサなどである。制御部１３０は、抽出部１３１、決定部１３２、算出部１３３、ミュート部１３４、および出力部１３５を備える。なお、各処理部は、プロセッサが有する電子回路の一例やプロセッサが実行するプロセスの一例である。 The control unit 130 is a processing unit that controls the entire speech control device 100, and is, for example, a processor. The control unit 130 includes an extraction unit 131, a determination unit 132, a calculation unit 133, a mute unit 134, and an output unit 135. Each processing unit is an example of an electronic circuit that the processor has, or an example of a process that the processor executes.

抽出部１３１は、参加者ごとに、メール文に対して形態素解析を行い、特徴語を抽出する。なお、抽出部１３１は、特徴語の抽出に、ｔｆ－ｉｄｆ手法などの統計手法を用いることができる。 The extraction unit 131 performs morphological analysis on the email text for each participant and extracts characteristic words. Note that the extraction unit 131 can use a statistical method such as the tf-idf method to extract characteristic words.

また、抽出部１３１は、会議音声をテキスト化し、形態素解析を行い、キーワードを抽出する。より具体的には、抽出部１３１は、過去に行われた一定数、例えば、２０つの会議の会議音声をテキスト化し、形態素解析を行い、会議ごとにワードおよびワードの出現回数を予め抽出する。また、抽出部１３１は、参加者端末２００から受信した現在行われている会議の音声をリアルタイムでテキスト化し、形態素解析を行い、ワードおよびワードの出現回数を抽出する。そして、抽出部１３１は、過去の会議のワードおよび出現回数、ならびに現在行われている会議のワードおよび出現回数に基づいて、現在行われている会議における重要なワード、すなわち、キーワードを抽出する。なお、抽出部１３１は、キーワードの抽出においても、ｔｆ－ｉｄｆ手法などの統計手法を用いることができる。 The extraction unit 131 also converts the conference voice into text, performs morphological analysis, and extracts keywords. More specifically, the extraction unit 131 converts the conference voice of a certain number of past conferences, for example 20 conferences, into text, performs morphological analysis, and pre-extracts words and the number of occurrences of the words for each conference. The extraction unit 131 also converts the voice of the current conference received from the participant terminal 200 into text in real time, performs morphological analysis, and extracts words and the number of occurrences of the words. Then, the extraction unit 131 extracts important words, i.e., keywords, in the current conference based on the words and the number of occurrences of the past conferences and the words and the number of occurrences of the current conference. The extraction unit 131 can also use statistical methods such as the tf-idf method to extract keywords.

決定部１３２は、参加者ごとに、メール文の文字数に基づいて、会議に対する関連度である議題関連度を決定する。これは、例えば、メール文の文字数に比例して議題関連度を決定してよい。 The determination unit 132 determines the agenda relevance, which is the degree of relevance to the meeting, for each participant based on the number of characters in the email text. For example, the determination unit 132 may determine the agenda relevance in proportion to the number of characters in the email text.

また、決定部１３２は、参加者のうち少なくとも２名以上の話し始めの音声を所定時間内に受信した場合、算出部１３３によって算出される発言優先度が最も高い参加者を特定者に決定する。ここで決定された特定者以外の発言はミュートされることになるが、音声の受信に所定時間内と一定時間の余裕を持たせたのは、音声の受信に対する参加者端末２００間のタイムラグを考慮するためである。 In addition, when the determination unit 132 receives the voices of at least two or more of the participants who have started speaking within a specified time, it determines the participant with the highest speech priority calculated by the calculation unit 133 as the specific person. Any speech other than that of the specific person determined here will be muted, but the reason why a certain amount of time is allowed for the reception of the voice within the specified time is to take into account the time lag between the participant terminals 200 in receiving the voice.

算出部１３３は、所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出する。より具体的には、例えば、参加者ごとにメール文から抽出された特徴語と、会議音声から抽出されたキーワードとの一致数が多い参加者の方が、現在の議題により関連性が高いと判断し、発言優先度がより高くなるように算出される。なお、会議音声から抽出されるキーワードは、会議の議題などの変化に応じて逐次変化するため、参加者ごとの発言優先度も会議中に動的に変化することになる。 The calculation unit 133 calculates a comment priority for multiple participants who spoke within a specified time based on the characteristic words of the conference-related email text and the keywords successively acquired from the conference audio. More specifically, for example, a participant with a greater number of matches between the characteristic words extracted from the email text and the keywords extracted from the conference audio is determined to be more relevant to the current agenda, and a higher comment priority is calculated. Note that since the keywords extracted from the conference audio change successively in response to changes in the agenda of the conference, the comment priority for each participant also changes dynamically during the conference.

また、算出部１３３は、決定部１３２によってメール文の文字数から予め決定された参加者の議題関連度や役職に基づいて、発言優先度を算出できる。ここで決定される発言優先度は、会議前に決定でき、上述した会議中に動的に変化する発言優先度の基準となる発言優先度である。このように、発言制御装置１００は、会議前に、参加者ごとに基準となる発言優先度を予め算出しておき、会議中の議題などの変化に応じて発言優先度を動的に変化させる。これにより、発言制御装置１００は、実際の会議で起きるような、議事の進行などに伴って発言の優先度が高い参加者が変化していくケースに対応できる。 The calculation unit 133 can also calculate the comment priority based on the subject relevance and job title of the participant, which are determined in advance by the determination unit 132 from the number of characters in the email text. The comment priority determined here is a comment priority that can be determined before the conference and serves as a standard for the comment priority that changes dynamically during the conference as described above. In this way, the comment control device 100 calculates a standard comment priority for each participant before the conference, and dynamically changes the comment priority in response to changes in the subject and other factors during the conference. This allows the comment control device 100 to handle cases where participants with high comment priority change as the proceedings progress, as occurs in actual conferences.

ミュート部１３４は、算出部１３３によって算出された発言優先度に基づいて、決定部１３２によって決定された、参加者のうちの発言優先度が最も高い参加者である特定者以外の発言をミュートする。また、ミュート部１３４は、参加者のいずれかが発言中の場合、既に話をしている参加者である発言者の発言を優先するため、各参加者の発言優先度に関係なく、発言者以外の発言をミュートする。 The muting unit 134 mutes comments from participants other than the specific participant with the highest speech priority determined by the determination unit 132 based on the speech priority calculated by the calculation unit 133. In addition, when any participant is speaking, the muting unit 134 prioritizes the speech of the speaker who is already speaking, and therefore mutes comments from participants other than the speaker, regardless of the speech priority of each participant.

出力部１３５は、参加者の発言がミュートされている場合、発言がミュートされていることを示す表示や、発言が優先されていることを示す表示を、参加者端末２００に出力させる。 When a participant's speech is muted, the output unit 135 causes the participant terminal 200 to output a display indicating that the speech is muted and a display indicating that the speech has priority.

［機能詳細］
次に、実施例１にかかる発言制御処理について詳細に説明する。図３は、実施例１にかかる発言制御処理の全体の流れを示す図である。なお、図３の例では、発言制御処理の実行主体は発言制御装置１００である。また、図３の例では、発言制御処理のステップＳ１～４が会議前に実行される処理ステップであり、ステップＳ５～９が会議中に実行される処理ステップである。 [Function details]
Next, the utterance control process according to the first embodiment will be described in detail. Fig. 3 is a diagram showing an overall flow of the utterance control process according to the first embodiment. In the example of Fig. 3, the utterance control process is executed by the utterance control device 100. In the example of Fig. 3, steps S1 to S4 of the utterance control process are processing steps executed before the conference, and steps S5 to S9 are processing steps executed during the conference.

まず、発言制御装置１００は、過去の会議音声からワードを抽出する（ステップＳ１）。これは、参加者の発言を制御する会議の会議音声から特徴的なキーワードを抽出するために、過去に行われた一定数、例えば、２０つの会議の会議音声から基準となるワードを抽出する処理である。 First, the speech control device 100 extracts words from past conference audio (step S1). This is a process of extracting reference words from the conference audio of a certain number of past conferences, for example 20 conferences, in order to extract characteristic keywords from the conference audio of the conference in which the speech of participants is to be controlled.

次に、発言制御装置１００は、参加者ごとに会議関連のメールの文字数から議題関連度を決定する（ステップＳ２）。ここで、議題関連度は、会議関連のメールの文字数が多い参加者の方が会議に対する関連度がより高いであろうという考えの下、例えば、メールの文字数が多い参加者ほど高くなるように決定される。 Next, the speech control device 100 determines the topic relevance for each participant from the number of characters in the conference-related email (step S2). Here, the topic relevance is determined so that, for example, the topic relevance is higher for participants who write more characters in their emails, based on the idea that participants who write more characters in their conference-related emails will be more relevant to the conference.

次に、発言制御装置１００は、参加者ごとに会議関連のメール文から特徴語を抽出する（ステップＳ３）。ここで抽出された参加者ごとの特徴語は、参加者の発言を制御する会議の会議音声から抽出されるキーワードと比較され、例えば、特徴語とキーワードとが一致するほど、その参加者の発言優先度が高くなるように制御される。 Next, the speech control device 100 extracts characteristic words from the conference-related email text for each participant (step S3). The characteristic words extracted here for each participant are compared with keywords extracted from the conference audio of the conference for which the participant's speech is controlled, and the speech priority of the participant is controlled to be higher, for example, as the characteristic words and keywords match.

なお、ステップＳ２やＳ３は、例えば、１０分ごとにスケジュール情報１２２を検索し、１０分以内に開始する会議があった場合、メール情報１２１から当該会議のインビテーションメールに対する返信メールを検索し、実行されてよい。なお、スケジュール情報１２２に対する検索間隔は１０分に限定されない。 Note that steps S2 and S3 may be executed, for example, by searching schedule information 122 every 10 minutes, and if there is a meeting that is to start within 10 minutes, searching email information 121 for a reply email to the invitation email for that meeting. Note that the search interval for schedule information 122 is not limited to 10 minutes.

次に、発言制御装置１００は、参加者ごとに役職ポイントを決定する（ステップＳ４）。ここで、役職ポイントは、役職が上位の参加者ほど高くなるように決定される。これは、実施例１にかかる発言制御処理では役職ポイントが高いほど発言優先度が高くなるに制御されるため、特に日本では、役職が上位の参加者の発言を優先しようとする考えに基づく。そのため、特に、役職に基づいて発言優先度を制御する必要がない場合は、役職ポイントを決定しなくてよい。 Next, the speech control device 100 determines position points for each participant (step S4). Here, the position points are determined so that the higher the position of the participant, the higher the position. This is based on the idea that, in the speech control process of Example 1, the higher the position points, the higher the speech priority, and therefore, particularly in Japan, priority is given to the speech of participants with higher positions. Therefore, when there is no need to control speech priority based on position, it is not necessary to determine position points.

なお、図３の例では、ステップＳ４までが会議前に予め実行されるように示しているが、例えば、ステップＳ４は会議中に行われるようにするなど、各処理ステップの実行タイミングは変更可能である。 In the example of FIG. 3, steps up to S4 are shown to be executed before the meeting, but the execution timing of each processing step can be changed, for example, so that step S4 is executed during the meeting.

次に、発言制御装置１００は、参加者ごとに基準となる発言優先度を決定する（ステップＳ５）。これは、例えば、会議開始時に実行され、ステップＳ２で決定された議題関連度や、ステップＳ４で決定された役職ポイントに基づいて、参加者ごとの発言優先度を決定する処理である。 Next, the speech control device 100 determines a standard speech priority for each participant (step S5). This is a process that is executed, for example, at the start of the conference, to determine the speech priority for each participant based on the topic relevance determined in step S2 and the position points determined in step S4.

次に、発言制御装置１００は、進行中の会議の音声からキーワードを抽出する（ステップＳ６）。これは、進行中の会議の音声からリアルタイムでワードを逐次抽出し、ステップＳ１で抽出された過去の会議音声の基準ワード群と比較することで、特徴的なキーワードを抽出する処理である。 Next, the speech control device 100 extracts keywords from the audio of the ongoing conference (step S6). This is a process in which words are sequentially extracted in real time from the audio of the ongoing conference, and characteristic keywords are extracted by comparing the words with the reference words from past conference audio extracted in step S1.

次に、発言制御装置１００は、ステップＳ５で決定された発言優先度を加点する（ステップＳ７）。これは、ステップＳ３で抽出された参加者ごとの特徴語と、ステップＳ６で抽出されたキーワードとが一致するほど、その参加者の発言優先度が高くなるように、発言優先度を動的に加点する処理である。 Next, the speech control device 100 adds points to the speech priority determined in step S5 (step S7). This is a process of dynamically adding points to the speech priority so that the more the characteristic words for each participant extracted in step S3 match the keywords extracted in step S6, the higher the speech priority of that participant becomes.

次に、発言制御装置１００は、会議の参加者間の音声が重なった時、参加者の発言を制御する（ステップＳ８）。これは、例えば、複数の参加者の話し始めの音声が重なってしまった場合に、発言優先度が最も高い参加者以外の発言をミュートする処理である。または、特定の参加者が既に話をしている場合に後から話し始めた参加者の音声をミュートする処理である。 Next, the speech control device 100 controls the speech of participants when their voices overlap (step S8). For example, when multiple participants start speaking and their voices overlap, this is a process of muting the speech of all participants except the one with the highest speech priority. Or, when a specific participant is already speaking, this is a process of muting the voice of a participant who starts speaking later.

次に、発言制御装置１００は、特定の参加者以外の発言をミュートした場合、発言制御表示を出力する（ステップＳ９）。これは、ステップＳ８で発言がミュートされた場合、例えば、発言がミュートされていることを示す表示や、発言が優先されていることを示す表示を、参加者端末２００に出力させる処理である。 Next, when the speech control device 100 mutes speech other than that of a specific participant, it outputs a speech control display (step S9). This is a process in which, when speech is muted in step S8, the participant terminal 200 outputs, for example, a display indicating that the speech has been muted or a display indicating that the speech has been given priority.

以上、図３を用いて実施例１にかかる発言制御処理の全体の流れを説明した。次に、図４～図１２を用いて、図３を用いて説明した発言制御処理の各処理ステップＳ１～Ｓ９をより詳細に説明する。 The overall flow of the speech control process in Example 1 has been described above using Figure 3. Next, each of the processing steps S1 to S9 of the speech control process described using Figure 3 will be described in more detail using Figures 4 to 12.

図４は、実施例１にかかる過去の会議音声のワード抽出の一例を示す図である。図４を用いて説明する抽出処理は、図３のステップＳ１に対応する。 Figure 4 is a diagram showing an example of word extraction from past conference audio according to the first embodiment. The extraction process described using Figure 4 corresponds to step S1 in Figure 3.

図４に示すように、発言制御装置１００は、過去に行われた会議Ａ～Ｃの会議音声をテキスト化し、形態素解析を行い、会議ごとにワードおよびその出現回数を抽出する。なお、図４の例では、過去の会議としてＡ～Ｃの３つを示しているが、抽出対象の会議は３つに限定されない。また、会議音声のテキスト化には、ＡｚｕｒｅＳｐｅｅｃｈＡＰＩ（ＡｐｐｌｉｃａｔｉｏｎＰｒｏｇｒａｍｍｉｎｇＩｎｔｅｒｆａｃｅ）などの音声テキスト変換ツールを用いてよい。 As shown in FIG. 4, the speech control device 100 converts the conference audio of past conferences A to C into text, performs morphological analysis, and extracts words and their frequency of occurrence for each conference. Note that the example in FIG. 4 shows three past conferences, A to C, but the number of conferences to be extracted is not limited to three. A speech-to-text conversion tool such as Azure Speech API (Application Programming Interface) may be used to convert the conference audio into text.

次に、図５は、実施例１にかかる議題関連度の決定の一例を示す図である。図５を用いて説明する決定処理は、図３のステップＳ２に対応する。 Next, FIG. 5 is a diagram showing an example of determining the topic relevance in Example 1. The determination process described using FIG. 5 corresponds to step S2 in FIG. 3.

図５に示すように、発言制御装置１００は、参加者ごとに、参加者の発言を制御する会議関連のインビテーションメールに対する返信メールの文字数から議題関連度を決定する。議題関連度は、例えば、返信メールの文字数の範囲である０～９９、１００～１９９、２００～２９９、・・・ごとに、０、１、２、・・・などと決定されてよい。また、１人の参加者のメールが複数ある場合、返信メールの文字数は、複数のメールの文字数の合計であってよいが、返信メール中の引用部分は除いてカウントされる。 As shown in FIG. 5, the speech control device 100 determines the topic relevance for each participant from the number of characters in the reply email to the conference-related invitation email for which the participant's speech is being controlled. The topic relevance may be determined as 0, 1, 2, etc., for each range of reply email character counts: 0-99, 100-199, 200-299, etc. In addition, if one participant has multiple emails, the number of characters in the reply emails may be the total number of characters in the multiple emails, but quoted portions in the reply emails are not counted.

次に、図６は、実施例１にかかるメール文の特徴語抽出の一例を示す図である。図６を用いて説明する抽出処理は、図３のステップＳ３に対応する。 Next, FIG. 6 is a diagram showing an example of extracting characteristic words from an email text according to the first embodiment. The extraction process described with reference to FIG. 6 corresponds to step S3 in FIG. 3.

図６に示すように、まず、発言制御装置１００は、参加者ごとに、参加者の発言を制御する会議関連のメール文からワードとその出現回数を抽出する。なお、１人の参加者のメールが複数ある場合、ワードの出現回数も、複数のメールの出現回数の合計であってよいが、返信メール中の引用部分は除いてカウントされる。そして、発言制御装置１００は、例えば、上位１０％など、ｔｆ－ｉｄｆ値の高いワードを、その参加者の関心を特徴付ける参加者の特徴語として抽出する。 As shown in FIG. 6, the speech control device 100 first extracts, for each participant, words and their frequency of occurrence from the conference-related email text that controls the participant's speech. Note that if one participant has multiple emails, the frequency of occurrence of a word may be the total frequency of occurrence in the multiple emails, but quoted portions in reply emails are excluded from the count. The speech control device 100 then extracts words with high tf-idf values, for example, the top 10%, as characteristic words for the participant that characterize the participant's interests.

次に、図７は、実施例１にかかる役職ポイントの決定の一例を示す図である。図７を用いて説明する決定処理は、図３のステップＳ４に対応する。 Next, FIG. 7 is a diagram showing an example of determining position points in Example 1. The determination process described using FIG. 7 corresponds to step S4 in FIG. 3.

図７に示すように、発言制御装置１００は、例えば、上位の役職ほど高く予め設定されたポイントを各参加者の役職に合わせて付与し、参加者ごとに役職ポイントを決定する。なお、図７の役職ポイントの各数値はあくまでも例であり、これらに限定されない。また、役職は図７に示す例より細かく、または粗く設定されてもよい。 As shown in FIG. 7, the speech control device 100 determines the position points for each participant by, for example, awarding higher pre-set points to participants according to their higher position. Note that the numerical values of the position points in FIG. 7 are merely examples and are not limited to these. Furthermore, the positions may be set more finely or coarsely than in the example shown in FIG. 7.

次に、図８は、実施例１にかかる基準となる発言優先度の算出の一例を示す図である。図８を用いて説明する算出処理は、図３のステップＳ５に対応する。 Next, FIG. 8 is a diagram showing an example of calculation of a reference comment priority according to the first embodiment. The calculation process described using FIG. 8 corresponds to step S5 in FIG. 3.

図８に示すように、発言制御装置１００は、例えば、参加者ごとに、ステップＳ２で決定された議題関連度と、ステップＳ４で決定された役職ポイントとを足し合わせて基準となる発言優先度を決定する。なお、ここで決定される発言優先度はあくまでも基準であり、発言優先度は、会議音声から逐次抽出されるキーワードに基づいて動的に変化し得る。 As shown in FIG. 8, the speech control device 100 determines a standard speech priority for each participant by, for example, adding up the topic relevance determined in step S2 and the position points determined in step S4. Note that the speech priority determined here is merely a standard, and the speech priority can change dynamically based on keywords successively extracted from the conference audio.

次に、図９は、実施例１にかかる会議音声のキーワード抽出の一例を示す図である。図９を用いて説明する抽出処理は、図３のステップＳ６に対応する。 Next, FIG. 9 is a diagram showing an example of keyword extraction from the conference audio according to the first embodiment. The extraction process described with reference to FIG. 9 corresponds to step S6 in FIG. 3.

図９に示すように、まず、発言制御装置１００は、進行中の会議の音声を、例えば、１分単位など、所定の時間単位でテキスト化し、形態素解析を行い、ワードおよびその出現回数を抽出する。進行中の会議の音声のテキスト化にも、例えば、ＡｚｕｒｅＳｐｅｅｃｈＡＰＩなどの音声テキスト変換ツールを用いてよい。 As shown in FIG. 9, the speech control device 100 first converts the audio of the ongoing conference into text in predetermined time units, such as one minute, and performs morphological analysis to extract words and their frequency of occurrence. A speech-to-text conversion tool, such as Azure Speech API, may also be used to convert the audio of the ongoing conference into text.

次に、発言制御装置１００は、進行中の会議の音声から抽出されたワードおよびその出現回数と、図３のステップＳ１で抽出した過去の会議の音声から抽出されたワードおよびその出現回数とを比較する。そして、発言制御装置１００は、比較結果から、進行中の会議の音声に現れる特徴的なキーワードを抽出する。キーワードの抽出は、ｔｆ－ｉｄｆ手法などの統計手法を用いて、例えば、上位１０％など、ｔｆ－ｉｄｆ値の高いワードがキーワードとして抽出される。 Next, the speech control device 100 compares the words extracted from the audio of the ongoing conference and their frequency of occurrence with the words extracted from the audio of past conferences extracted in step S1 of FIG. 3 and their frequency of occurrence. The speech control device 100 then extracts characteristic keywords that appear in the audio of the ongoing conference from the comparison results. Keywords are extracted using a statistical method such as the tf-idf method, and words with high tf-idf values, for example, the top 10%, are extracted as keywords.

なお、進行中の会議の音声からのワードおよび出現回数の抽出が１分単位など、所定の時間単位である理由は、議事の進行などに伴って重要なワードが変化していくケースに対応するためである。 The reason why words and their occurrence counts are extracted from the audio of an ongoing meeting in specified time units, such as one minute, is to accommodate cases where important words change as the agenda progresses.

次に、図１０は、実施例１にかかる発言優先度に対する加点の一例を示す図である。図１０を用いて説明する加点処理は、図３のステップＳ７に対応する。 Next, FIG. 10 is a diagram showing an example of adding points to a comment priority level in Example 1. The adding point process described using FIG. 10 corresponds to step S7 in FIG. 3.

図１０に示すように、発言制御装置１００は、例えば、図３のステップＳ３で抽出された参加者ごとの特徴語と、ステップＳ６で抽出された進行中の会議のキーワードとの一致数を、ステップＳ５で決定された発言優先度に加点する。 As shown in FIG. 10, the speech control device 100 adds, for example, the number of matches between the characteristic words for each participant extracted in step S3 of FIG. 3 and the keywords for the ongoing conference extracted in step S6 to the speech priority determined in step S5.

なお、ステップＳ６で抽出される進行中の会議のキーワードは、上述したように、１分単位など、所定の時間単位で逐次抽出されるため、ステップＳ７における発言優先度の加点も、キーワードが抽出される度に実行される。その際、発言優先度に対する前回の加点分はリセットされ、新たに抽出されたキーワードと参加者ごとの特徴語との一致数が発言優先度に加点される。 As described above, the keywords of the ongoing conference extracted in step S6 are extracted sequentially in predetermined time units, such as one minute, so the addition of the comment priority in step S7 is also performed each time a keyword is extracted. At that time, the previous addition to the comment priority is reset, and the number of matches between the newly extracted keyword and the characteristic words of each participant is added to the comment priority.

次に、図１１は、実施例１にかかる発言制御の一例を示す図である。図１１を用いて説明する制御処理は、図３のステップＳ８に対応する。 Next, FIG. 11 is a diagram showing an example of speech control according to the first embodiment. The control process described using FIG. 11 corresponds to step S8 in FIG. 3.

図１１に示すように、発言制御装置１００は、会議の参加者が同タイミングで発声し、音声が重なった時、参加者の発言を制御する。図１１の例では、ＡさんとＢさんの話し始めの音声を発言制御装置１００が同タイミングで受信しているため、発言制御装置１００は、ＡさんおよびＢさんの発言優先度を比較し、発言優先度の高いＡさんの音声を優先するため、Ｂさんの音声をミュートする。なお、ここでいう同タイミングで受信される音声とは、音声の受信に対する参加者端末２００間のタイムラグを考慮し、例えば、０．５秒や１秒間に受信された音声をいう。 As shown in FIG. 11, the speech control device 100 controls the speech of conference participants when they speak at the same time and their voices overlap. In the example of FIG. 11, the speech control device 100 receives the voices of A and B as they begin to speak at the same time, so the speech control device 100 compares the speech priorities of A and B, prioritizes the voice of A, who has a higher speech priority, and mutes the voice of B. Note that the voices received at the same time here refer to voices received within, for example, 0.5 seconds or 1 second, taking into account the time lag between the participant terminals 200 relative to the reception of the voices.

なお、図１１に示すように、発言制御装置１００は、参加者が異なるタイミングで発声し、音声が重ならない場合は、それぞれの音声は特にミュートすることなく、参加者端末２００に配信される。また、図１１の例におけるＤさんの発言のように、既に別の参加者であるＣさんが発言中の場合、発言制御装置１００は、発言優先度に関係なく、後から話し始めたＤさんの発言をミュートする。 As shown in FIG. 11, when participants speak at different times and the voices do not overlap, the speech control device 100 distributes each voice to the participant terminal 200 without muting them. Also, when another participant, C, is already speaking, as in the case of D's speech in the example of FIG. 11, the speech control device 100 mutes the speech of D, who started speaking later, regardless of speech priority.

また、図１１に示すように、発言制御装置１００は、ミュートされずに参加者端末２００に配信される音声を、１分単位など、所定の時間単位でテキスト化し、キーワードを抽出する。 Also, as shown in FIG. 11, the speech control device 100 converts the audio that is not muted and is delivered to the participant terminal 200 into text in predetermined time units, such as one minute units, and extracts keywords.

次に、図１２は、実施例１にかかる発言制御表示の出力の一例を示す図である。図１２を用いて説明する出力処理は、図３のステップＳ９に対応する。 Next, FIG. 12 is a diagram showing an example of the speech control display output according to the first embodiment. The output process described using FIG. 12 corresponds to step S9 in FIG. 3.

図１２に示すように、発言制御装置１００は、特定の参加者以外の発言をミュートした場合、参加者端末２００の各々に、発言が制御されていることを示す発言制御表示を出力させる。発言制御表示の一例は、例えば、図１２に示すように、発言がミュートされている参加者の参加者端末２００－１に出力させる、発言がミュート中であることを示すミュート中表示２５０である。一方、ミュート中の参加者に対し優先発言中の参加者の参加者端末２００－２には、発言が優先的に配信されていることを示す優先発言中表示２６０が出力される。 As shown in FIG. 12, when the speech control device 100 mutes speech other than that of a specific participant, it causes each participant terminal 200 to output a speech control display indicating that speech is being controlled. One example of a speech control display is a muted display 250 indicating that speech is being muted, which is output to the participant terminal 200-1 of the participant whose speech has been muted, as shown in FIG. 12. On the other hand, a priority speech display 260 indicating that speech is being distributed with priority is output to the participant terminal 200-2 of the participant making a priority speech to the muted participant.

なお、図１２に示す発言制御表示はあくまでも一例であり、例えば、参加者端末２００に表示される参加者リストの各参加者のアイコンや名前などの近くに、ミュート中または優先発言中であることを示す表示を出力してもよい。 Note that the speech control display shown in FIG. 12 is merely an example. For example, a display indicating that a participant is muted or making a priority speech may be output near the icon or name of each participant in the participant list displayed on the participant terminal 200.

［処理の流れ］
以上、図３を用いて発言制御処理の全体の流れ、および個々の処理ステップを詳細に説明した。なお、発言制御処理は、参加者間の音声が重なった場合に参加者の発言を制御するための発言優先度を算出する発言優先度算出処理と、音声が重なった場合に発言優先度に基づいて発言を制御する発言制御処理とに分けることができる。それぞれの処理を図１３および図１４のフローチャートを用いて説明する。 [Process flow]
The overall flow of the speech control process and each processing step have been described in detail above using Fig. 3. The speech control process can be divided into a speech priority calculation process that calculates speech priorities for controlling speech of participants when their voices overlap, and a speech control process that controls speech based on speech priorities when their voices overlap. Each process will be described using the flowcharts in Figs. 13 and 14.

まず、発言制御装置１００によって実行される、実施例１にかかる発言優先度算出処理を流れに沿って説明する。図１３は、実施例１にかかる発言優先度算出処理の流れを示すフローチャートである。図１３に示す発言優先度算出処理は、参加者間の音声が重なった場合に参加者の発言を制御するための発言優先度を算出する処理である。 First, the flow of the speech priority calculation process according to the first embodiment executed by the speech control device 100 will be described. FIG. 13 is a flowchart showing the flow of the speech priority calculation process according to the first embodiment. The speech priority calculation process shown in FIG. 13 is a process for calculating a speech priority for controlling the speech of a participant when the voices of the participants overlap.

発言制御装置１００は、例えば、１０分ごとにスケジュール情報１２２を検索し、１０分以内に開始する会議があった場合、メール情報１２１から当該会議のインビテーションメールに対する返信メールを取得する（ステップＳ１０１）。 The speech control device 100 searches the schedule information 122, for example, every 10 minutes, and if there is a meeting that is starting within 10 minutes, it obtains a reply email to the invitation email for that meeting from the email information 121 (step S101).

次に、発言制御装置１００は、例えば、参加者ごとに、ステップＳ１０１で取得したメール文の文字数に比例して、会議に対する関連度である議題関連度を決定する（ステップＳ１０２）。 Next, the speech control device 100 determines, for each participant, the topic relevance, which is the degree of relevance to the meeting, in proportion to the number of characters in the email text obtained in step S101 (step S102).

次に、発言制御装置１００は、例えば、参加者ごとに、ステップＳ１０２で決定した議題関連度に基づいて、基準となる発言優先度を決定する（ステップＳ１０３）。なお、この際、発言制御装置１００は、例えば、役職が上位の参加者ほど高く設定される役職ポイントを組織情報１２３から取得し、当該役職ポイントにさらに基づいて、基準となる発言優先度を決定してもよい。 Next, the speech control device 100 determines a standard speech priority for each participant based on the topic relevance determined in step S102 (step S103). Note that at this time, the speech control device 100 may obtain, for example, position points from the organizational information 123, which are set higher for participants with higher positions, and determine the standard speech priority further based on the position points.

次に、発言制御装置１００は、例えば、参加者ごとに、ステップＳ１０１で取得したメール文に対して形態素解析を行い、ｔｆ－ｉｄｆ手法を用いて上位１０％以内の特徴語を抽出する（ステップＳ１０４）。 Next, the speech control device 100 performs morphological analysis on the email text obtained in step S101 for each participant, and extracts the top 10% of characteristic words using the tf-idf method (step S104).

次に、発言制御装置１００は、参加者の音声を参加者端末２００から受信していない場合（ステップＳ１０５：Ｎｏ）、音声の受信を待つ。一方、音声を受信した場合（ステップＳ１０５：Ｙｅｓ）、発言制御装置１００は、受信した音声をテキスト化し、形態素解析を行い、キーワードを抽出する（ステップＳ１０６）。 Next, if the speech control device 100 has not received the participant's voice from the participant terminal 200 (step S105: No), it waits for the voice to be received. On the other hand, if the speech has been received (step S105: Yes), the speech control device 100 converts the received speech into text, performs morphological analysis, and extracts keywords (step S106).

なお、ステップＳ１０６のキーワード抽出について、発言制御装置１００は、例えば、１分単位で１分間に受信された各会議音声からワードおよびその出現回数を抽出する。そして、発言制御装置１００は、各会議音声から抽出されたワードおよびその出現回数と、予め抽出された過去の会議の音声のワードおよびその出現回数とを比較し、ｔｆ－ｉｄｆ手法を用いて上位１０％以内のキーワードを抽出する。 For keyword extraction in step S106, the utterance control device 100 extracts words and their frequency of occurrence from each conference audio received in one-minute increments, for example. The utterance control device 100 then compares the words and their frequency of occurrence extracted from each conference audio with the words and their frequency of occurrence in audio from past conferences that have been extracted in advance, and extracts the top 10% of keywords using the tf-idf method.

次に、発言制御装置１００は、例えば、参加者ごとに、ステップＳ１０６で抽出されたキーワードと、ステップＳ１０４で抽出された特徴語とを比較し、一致数を算出する（ステップＳ１０７）。 Next, the speech control device 100, for example, for each participant, compares the keywords extracted in step S106 with the characteristic words extracted in step S104 and calculates the number of matches (step S107).

次に、発言制御装置１００は、例えば、参加者ごとに、ステップＳ１０７で算出した一致数を用いて、ステップＳ１０３で決定された基準となる発言優先度を加点する（ステップＳ１０８）。なお、一致数がゼロの参加者は、加点分もゼロであるため実質的に加点は行われない。 Next, the speech control device 100, for example, uses the number of matches calculated in step S107 for each participant to add points to the standard speech priority determined in step S103 (step S108). Note that for participants with zero matches, the added points are also zero, so no points are actually added.

次に、１分単位など予め設定された所定の間隔から１分間など所定時間が経過していない場合（ステップＳ１０９：Ｎｏ）、ステップＳ１０５に戻り、発言制御装置１００は、会議の終了まで、さらなる音声の受信を待つ。一方、１分間など所定時間が経過した場合（ステップＳ１０９：Ｙｅｓ）、発言制御装置１００は、ステップＳ１０８で加点した発言優先度の加点分をリセットし、基準となる発言優先度に戻す（ステップＳ１１０）。ステップＳ１１０の実行後、発言制御装置１００は、ステップＳ１０５に戻り、会議の終了まで、さらなる音声の受信を待つ。 Next, if a predetermined time, such as one minute, has not elapsed from a predetermined interval, such as one minute (step S109: No), the process returns to step S105, and the utterance control device 100 waits to receive further audio until the end of the conference. On the other hand, if a predetermined time, such as one minute, has elapsed (step S109: Yes), the utterance control device 100 resets the added points for the utterance priority in step S108, and returns to the standard utterance priority (step S110). After executing step S110, the utterance control device 100 returns to step S105, and waits to receive further audio until the end of the conference.

次に、発言制御装置１００によって実行される、実施例１にかかる発言制御処理を流れに沿って説明する。図１４は、実施例１にかかる発言制御処理の流れを示すフローチャートである。図１４に示す発言制御処理は、参加者間の音声が重なった場合に発言優先度に基づいて発言を制御する処理である。 Next, the speech control process according to the first embodiment, which is executed by the speech control device 100, will be described in sequence. FIG. 14 is a flowchart showing the flow of the speech control process according to the first embodiment. The speech control process shown in FIG. 14 is a process for controlling speech based on speech priority when the voices of participants overlap.

まず、発言制御装置１００は、参加者の音声を参加者端末２００から受信していない場合（ステップＳ２０１：Ｎｏ）、音声の受信を待つ。一方、音声を受信した場合（ステップＳ２０１：Ｙｅｓ）、発言制御装置１００は、既に他の参加者の音声を受信し、他の参加者が発言中か否か判定する（ステップＳ２０２）。 First, if the speech control device 100 has not received the participant's voice from the participant terminal 200 (step S201: No), it waits for the reception of the voice. On the other hand, if the speech control device 100 has received the voice of another participant (step S201: Yes), it determines whether the other participant is speaking or not (step S202).

既に他の参加者が発言中の場合（ステップＳ２０２：Ｙｅｓ）、発言制御装置１００は、ステップＳ２０１で受信した音声をミュートする（ステップＳ２０５）。 If another participant is already speaking (step S202: Yes), the speech control device 100 mutes the audio received in step S201 (step S205).

一方、発言中の参加者がいない場合（ステップＳ２０２：Ｎｏ）、発言制御装置１００は、ステップＳ２０１で受信した音声の他に同時受信した会議音声があるか否か判定する（ステップＳ２０３）。なお、ここでいう同時受信とは、音声の受信に対する参加者端末２００間のタイムラグを考慮し、例えば、０．５秒や１秒の間の受信であってよい。 On the other hand, if no participant is speaking (step S202: No), the speech control device 100 determines whether or not there is conference audio received simultaneously in addition to the audio received in step S201 (step S203). Note that simultaneous reception here may be reception within, for example, 0.5 seconds or 1 second, taking into account the time lag between the participant terminals 200 for receiving the audio.

他に同時受信した会議音声がない場合（ステップＳ２０３：Ｎｏ）、発言制御装置１００は、ステップＳ２０１で受信した音声をミュートすることなく参加者端末２００の各々に配信し、図１４に示す発言制御処理は終了する。 If there is no other conference audio received simultaneously (step S203: No), the speech control device 100 distributes the audio received in step S201 to each participant terminal 200 without muting it, and the speech control process shown in FIG. 14 ends.

一方、他に同時受信した会議音声がある場合（ステップＳ２０３：Ｙｅｓ）、発言制御装置１００は、図１３に示す発言優先度算出処理で算出された発言優先度に基づいて、発言者を決定する（ステップＳ２０４）。 On the other hand, if there is other conference audio received simultaneously (step S203: Yes), the speech control device 100 determines the speaker based on the speech priority calculated by the speech priority calculation process shown in FIG. 13 (step S204).

次に、発言制御装置１００は、ステップＳ２０４で決定された発言者の音声をミュートすることなく参加者端末２００の各々に配信し、当該発言者以外の音声をミュートする（ステップＳ２０５）。 Next, the speech control device 100 distributes the voice of the speaker determined in step S204 to each participant terminal 200 without muting it, and mutes the voices of all speakers other than that speaker (step S205).

次に、発言制御装置１００は、発言制御表示を参加者端末２００に出力させる（ステップＳ２０６）。当該発言制御表示は、例えば、発言がミュートされていることを示す表示や、発言が優先されていることを示す表示であり、発言がミュートされている、または優先されている参加者の参加者端末２００に出力させることができる。または、発言制御装置１００は、参加者端末２００に表示される参加者リストの各参加者のアイコンや名前などの近くに、ミュート中または優先発言中であることを示す発言制御表示を出力させてもよい。ステップＳ２０６の実行後、図１４に示す発言制御処理は終了する。 Next, the speech control device 100 outputs a speech control display to the participant terminal 200 (step S206). The speech control display is, for example, a display indicating that a speech has been muted or a display indicating that a speech has been prioritized, and can be output to the participant terminal 200 of the participant whose speech has been muted or prioritized. Alternatively, the speech control device 100 may output a speech control display indicating that a speech is being muted or prioritized near the icon or name of each participant in the participant list displayed on the participant terminal 200. After step S206 is executed, the speech control process shown in FIG. 14 ends.

［効果］
上述したように、発言制御プログラムは、発言制御装置１００に、所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、発言優先度に基づいて、参加者のうちの特定者以外の発言をミュートする処理を実行させる。 [effect]
As described above, the speech control program causes the speech control device 100 to calculate speech priority for multiple participants who speak within a specified period of time based on characteristic words in conference-related email sentences and keywords acquired successively from the conference audio, and to execute a process of muting speech from all participants other than specific ones based on the speech priority.

これにより、発言制御装置１００は、音声会議で発言が重なった参加者のうち優先度が最高の者以外の発言をミュートするので、音声会議における参加者間の音声の重なりを抑止できる。 As a result, the speech control device 100 mutes the speech of all participants in an audio conference except for the one with the highest priority, thereby preventing overlapping speech between participants in the audio conference.

また、発言制御プログラムは、発言制御装置１００に、参加者ごとに、メール文の文字数に基づいて、会議に対する関連度を決定する処理をさらに実行させ、発言制御装置１００は、関連度に基づいて、発言優先度を算出する。 The speech control program also causes the speech control device 100 to execute a process for determining the relevance of each participant to the conference based on the number of characters in the email text, and the speech control device 100 calculates the speech priority based on the relevance.

これにより、発言制御装置１００は、音声会議ごとにより適した発言優先度を用いて、音声会議における参加者間の音声の重なりを抑止できる。 This allows the speech control device 100 to prevent overlapping voices between participants in an audio conference by using speech priorities that are more appropriate for each audio conference.

また、発言制御装置１００は、参加者の役職に基づいて、発言優先度を算出する。 The speech control device 100 also calculates speech priority based on the participants' job titles.

これにより、発言制御装置１００は、音声会議の参加者により適した発言優先度を用いて、音声会議における参加者間の音声の重なりを抑止できる。 This allows the speech control device 100 to prevent voice overlap between participants in an audio conference by using speech priorities that are more appropriate for the participants in the audio conference.

また、発言制御プログラムは、発言制御装置１００に、参加者ごとに、メール文に対して形態素解析を行い、特徴語を抽出し、会議音声をテキスト化し、形態素解析を行い、キーワードを抽出する処理をさらに実行させ、発言制御装置１００は、関連度、および特徴語とキーワードとの一致数に基づいて、発言優先度を算出する。 The speech control program also causes the speech control device 100 to perform morphological analysis on the email text for each participant, extract characteristic words, convert the conference audio into text, perform morphological analysis, and further execute the process of extracting keywords, and the speech control device 100 calculates speech priority based on the relevance and the number of matches between the characteristic words and the keywords.

これにより、発言制御装置１００は、音声会議中の議題などの変化に応じて発言優先度を動的に変化させ、会議内容により適した発言優先度を用いて、音声会議における参加者間の音声の重なりを抑止できる。 As a result, the speech control device 100 can dynamically change the speech priority in response to changes in the agenda during the audio conference, and use the speech priority that is more appropriate for the conference content, thereby preventing overlapping of voices between participants in the audio conference.

また、発言制御プログラムが発言制御装置１００に実行させる、キーワードを抽出する処理は、会議音声をテキスト化し、形態素解析を行い、第１のワードおよび第１のワードの第１の出現回数を抽出し、会議音声以外の第２の会議音声をテキスト化し、形態素解析を行い、会議ごとに第２のワードおよび第２のワードの第２の出現回数を抽出し、第１のワード、第１の出現回数、第２のワード、および第２の出現回数に基づいて、キーワードを抽出する処理を含む。 The keyword extraction process that the speech control program causes the speech control device 100 to execute includes converting the conference audio into text, performing morphological analysis, extracting a first word and a first number of occurrences of the first word, converting a second conference audio other than the conference audio into text, performing morphological analysis, extracting a second word and a second number of occurrences of the second word for each conference, and extracting keywords based on the first word, the first number of occurrences, the second word, and the second number of occurrences.

また、発言制御装置１００は、特徴語およびキーワードの抽出に、ｔｆ－ｉｄｆ手法を用いる。 The speech control device 100 also uses the tf-idf method to extract feature words and keywords.

これにより、発言制御装置１００は、音声会議の参加者や会議の内容により適した参加者ごとの特徴語や会議音声のキーワードを抽出できる。 This allows the speech control device 100 to extract characteristic words for each participant and keywords from the conference audio that are more suited to the participants in the audio conference and the content of the conference.

また、発言制御プログラムは、発言制御装置１００に、参加者のうち少なくとも２名以上の話し始めの音声を第２の所定時間内に受信した場合、発言優先度が最も高い参加者を特定者に決定する処理をさらに実行させる。 The speech control program also causes the speech control device 100 to execute a process of determining the participant with the highest speech priority as the specified person when the speech of at least two or more of the participants who have started speaking is received within a second predetermined time.

これにより、発言制御装置１００は、音声会議における参加者間の音声の重なりを抑止できる。 This allows the speech control device 100 to prevent voice overlap between participants in an audio conference.

また、発言制御プログラムは、発言制御装置１００に、参加者のいずれかが発言中の場合、発言者以外の発言をミュートする処理をさらに実行させる。 In addition, the speech control program further causes the speech control device 100 to execute a process of muting speech from anyone other than the speaker when any of the participants is speaking.

［システム］
上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更できる。また、実施例で説明した具体例、分布、数値などは、あくまで一例であり、任意に変更できる。 [system]
The information including the processing procedures, control procedures, specific names, various data and parameters shown in the above documents and drawings can be changed as desired unless otherwise specified. In addition, the specific examples, distributions, values, etc. described in the embodiments are merely examples and can be changed as desired.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散や統合の具体的形態は図示のものに限られない。つまり、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成できる。例えば、発言制御装置１００の抽出部１３１と決定部１３２とを統合できる。 In addition, each component of each device shown in the figure is a functional concept, and does not necessarily have to be physically configured as shown in the figure. In other words, the specific form of distribution and integration of each device is not limited to that shown in the figure. In other words, all or part of them can be functionally or physically distributed or integrated in any unit depending on various loads, usage conditions, etc. For example, the extraction unit 131 and the determination unit 132 of the speech control device 100 can be integrated.

さらに、各装置にて行なわれる各処理機能は、その全部または任意の一部が、ＣＰＵおよび当該ＣＰＵにて解析実行されるプログラムにて実現され、あるいは、ワイヤードロジックによるハードウェアとして実現され得る。 Furthermore, each processing function performed by each device may be realized, in whole or in part, by a CPU and a program analyzed and executed by the CPU, or may be realized as hardware using wired logic.

［ハードウェア］
上述した発言制御装置１００のハードウェア構成について説明する。図１５は、ハードウェア構成例を示す図である。図１５に示すように、発言制御装置１００は、通信部１００ａ、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）１００ｂ、メモリ１００ｃ、およびプロセッサ１００ｄを有する。また、図１５に示した各部は、バスなどで相互に接続される。 [Hardware]
The hardware configuration of the above-mentioned speech control device 100 will be described. Fig. 15 is a diagram showing an example of the hardware configuration. As shown in Fig. 15, the speech control device 100 has a communication unit 100a, a HDD (Hard Disk Drive) 100b, a memory 100c, and a processor 100d. The units shown in Fig. 15 are connected to each other via a bus or the like.

通信部１００ａは、ネットワークインタフェースカードなどであり、他のサーバとの通信を行う。ＨＤＤ１００ｂは、図２に示した機能を動作させるプログラムやＤＢを記憶する。 The communication unit 100a is a network interface card or the like, and communicates with other servers. The HDD 100b stores the programs and DBs that operate the functions shown in FIG. 2.

プロセッサ１００ｄは、図２に示した各処理部と同様の処理を実行するプログラムをＨＤＤ１００ｂなどから読み出してメモリ１００ｃに展開することで、図２で説明した各機能を実行するプロセスを動作させる。例えば、このプロセスは、発言制御装置１００が有する各処理部と同様の機能を実行する。具体的には、例えば、プロセッサ１００ｄは、決定部１３２や算出部１３３などと同様の機能を有するプログラムをＨＤＤ１００ｂなどから読み出す。そして、プロセッサ１００ｄは、決定部１３２や算出部１３３などと同様の処理を実行するプロセスを実行する。 The processor 100d reads out a program that executes the same processes as the processing units shown in FIG. 2 from the HDD 100b or the like and loads it into the memory 100c, thereby operating a process that executes the functions described in FIG. 2. For example, this process executes the same functions as the processing units of the speech control device 100. Specifically, for example, the processor 100d reads out a program that has the same functions as the determination unit 132, the calculation unit 133, and the like from the HDD 100b or the like. Then, the processor 100d executes a process that executes the same processes as the determination unit 132, the calculation unit 133, and the like.

このように、発言制御装置１００は、プログラムを読み出して実行することで各処理を実行する情報処理装置として動作する。また、発言制御装置１００は、媒体読取装置によって記録媒体から上記プログラムを読み出し、読み出された上記プログラムを実行することで上記した実施例と同様の機能を実現することもできる。なお、この他の実施例でいうプログラムは、発言制御装置１００によって実行されることに限定されるものではない。例えば、他のコンピュータまたはサーバがプログラムを実行する場合や、これらが協働してプログラムを実行するような場合にも、本発明を同様に適用できる。 In this way, the utterance control device 100 operates as an information processing device that executes each process by reading and executing a program. The utterance control device 100 can also realize functions similar to those of the above-mentioned embodiment by reading the program from a recording medium using a media reading device and executing the read program. Note that the program in these other embodiments is not limited to being executed by the utterance control device 100. For example, the present invention can be similarly applied to cases where another computer or server executes a program, or where these cooperate to execute a program.

なお、このプログラムは、インターネットなどのネットワークを介して配布できる。また、このプログラムは、ハードディスク、フレキシブルディスク（ＦＤ）、ＣＤ－ＲＯＭ、ＭＯ（Ｍａｇｎｅｔｏ－Ｏｐｔｉｃａｌｄｉｓｋ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）などのコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行できる。 This program can be distributed via a network such as the Internet. This program can also be recorded on a computer-readable recording medium such as a hard disk, flexible disk (FD), CD-ROM, MO (Magneto-Optical disk), or DVD (Digital Versatile Disc), and can be executed by being read from the recording medium by a computer.

なお、以上の実施例１を含む実施形態に関し、さらに以下の付記を開示する。 The following notes are further provided with respect to the embodiments including Example 1 above.

（付記１）発言制御装置に、
所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、
前記発言優先度に基づいて、前記参加者のうちの特定者以外の発言をミュートする
処理を実行させることを特徴とする発言制御プログラム。 (Appendix 1) A speech control device,
Calculate comment priorities for multiple participants who comment within a specified time period based on characteristic words in emails related to the meeting and keywords sequentially acquired from the meeting audio;
A speech control program that executes a process of muting speeches of participants other than those of specific participants based on the speech priority.

（付記２）前記発言制御装置に、
前記参加者ごとに、前記メール文の文字数に基づいて、会議に対する関連度を決定する
処理をさらに実行させ、
前記発言制御装置は、前記関連度に基づいて、前記発言優先度を算出することを特徴とする付記１に記載の発言制御プログラム。 (Additional Note 2) The speech control device includes:
determining a degree of relevance to the conference for each of the participants based on the number of characters in the email message;
2. The speech control program according to claim 1, wherein the speech control device calculates the speech priority based on the relevance.

（付記３）前記発言制御装置は、前記参加者の役職に基づいて、前記発言優先度を算出することを特徴とする付記２に記載の発言制御プログラム。 (Appendix 3) The speech control program described in Appendix 2 is characterized in that the speech control device calculates the speech priority based on the role of the participant.

（付記４）前記発言制御装置に、
前記参加者ごとに、前記メール文に対して形態素解析を行い、前記特徴語を抽出し、
前記会議音声をテキスト化し、形態素解析を行い、前記キーワードを抽出する
処理をさらに実行させ、
前記発言制御装置は、前記関連度、および前記特徴語と前記キーワードとの一致数に基づいて、前記発言優先度を算出することを特徴とする付記２または３に記載の発言制御プログラム。 (Additional Note 4) The speech control device includes:
performing a morphological analysis on the email text for each of the participants to extract the characteristic words;
The conference voice is converted into text, a morphological analysis is performed, and the keywords are extracted.
The comment control program according to claim 2 or 3, wherein the comment control device calculates the comment priority based on the relevance and the number of matches between the feature word and the keyword.

（付記５）前記発言制御装置に実行させる、前記キーワードを抽出する処理は、
前記会議音声をテキスト化し、形態素解析を行い、第１のワードおよび前記第１のワードの第１の出現回数を抽出し、
前記会議音声以外の第２の会議音声をテキスト化し、形態素解析を行い、会議ごとに第２のワードおよび前記第２のワードの第２の出現回数を抽出し、
前記第１のワード、前記第１の出現回数、前記第２のワード、および前記第２の出現回数に基づいて、前記キーワードを抽出する
処理を含むことを特徴とする付記４に記載の発言制御プログラム。 (Note 5) The process of extracting the keywords executed by the speech control device includes:
converting the conference voice into text, performing morphological analysis, and extracting a first word and a first number of occurrences of the first word;
A second conference voice other than the conference voice is converted into text, a morphological analysis is performed, and a second word and a second number of occurrences of the second word are extracted for each conference;
5. The speech control program according to claim 4, further comprising a process of extracting the keyword based on the first word, the first number of occurrences, the second word, and the second number of occurrences.

（付記６）前記発言制御装置は、前記特徴語および前記キーワードの抽出に、ｔｆ－ｉｄｆ手法を用いることを特徴とする付記１乃至５のいずれか一項に記載の発言制御プログラム。 (Appendix 6) The speech control program described in any one of appendices 1 to 5, characterized in that the speech control device uses the tf-idf method to extract the characteristic words and the keywords.

（付記７）前記発言制御装置に、
前記参加者のうち少なくとも２名以上の話し始めの音声を第２の所定時間内に受信した場合、前記発言優先度が最も高い前記参加者を前記特定者に決定する
処理をさらに実行させることを特徴とする付記１乃至６のいずれか一項に記載の発言制御プログラム。 (Supplementary Note 7) The speech control device includes:
The speech control program according to any one of claims 1 to 6, further comprising a process of determining the participant with the highest speech priority as the specific person when speech start voices of at least two or more of the participants are received within a second predetermined time.

（付記８）前記発言制御装置に、
前記参加者のいずれかが発言中の場合、発言者以外の発言をミュートする
処理をさらに実行させることを特徴とする付記１乃至７のいずれか一項に記載の発言制御プログラム。 (Supplementary Note 8) The speech control device includes:
8. The speech control program according to claim 1, further comprising a process of muting speeches made by participants other than the speaker when any of the participants is speaking.

（付記９）発言制御装置が、
所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、
前記発言優先度に基づいて、前記参加者のうちの特定者以外の発言をミュートする
処理を実行することを特徴とする発言制御方法。 (Supplementary Note 9) A speech control device,
Calculate comment priorities for multiple participants who comment within a specified time period based on characteristic words in emails related to the meeting and keywords sequentially acquired from the meeting audio;
A speech control method comprising: executing a process of muting speech from other than that of specific participants based on the speech priority.

（付記１０）所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、
前記発言優先度に基づいて、前記参加者のうちの特定者以外の発言をミュートする
処理を実行する制御部を有することを特徴とする発言制御装置。 (Appendix 10) For multiple participants who make comments within a specified time, a comment priority is calculated based on characteristic words in emails related to the conference and keywords sequentially acquired from the conference audio,
A speech control device comprising: a control unit that executes a process of muting speech from any of the participants other than specific participants based on the speech priority.

（付記１１）制御部が、
前記参加者ごとに、前記メール文の文字数に基づいて、会議に対する関連度を決定する
処理をさらに実行し、
制御部は、前記関連度に基づいて、前記発言優先度を算出することを特徴とする付記１０に記載の発言制御装置。 (Supplementary Note 11) The control unit
determining a relevance of each of the participants to the conference based on the number of characters in the email message;
11. The speech control device according to claim 10, wherein the control unit calculates the speech priority based on the relevance.

（付記１２）制御部は、前記参加者の役職に基づいて、前記発言優先度を算出することを特徴とする付記１１に記載の発言制御装置。 (Appendix 12) The speech control device described in Appendix 11, characterized in that the control unit calculates the speech priority based on the role of the participant.

（付記１３）制御部が、
前記参加者ごとに、前記メール文に対して形態素解析を行い、前記特徴語を抽出し、
前記会議音声をテキスト化し、形態素解析を行い、前記キーワードを抽出する
処理をさらに実行し、
制御部は、前記関連度、および前記特徴語と前記キーワードとの一致数に基づいて、前記発言優先度を算出することを特徴とする付記１１または１２に記載の発言制御装置。 (Supplementary Note 13) The control unit
performing a morphological analysis on the email text for each of the participants to extract the characteristic words;
converting the conference voice into text, performing morphological analysis, and extracting the keywords;
13. The speech control device according to claim 11, wherein the control unit calculates the speech priority based on the relevance and the number of matches between the feature word and the keyword.

（付記１４）制御部が実行する、前記キーワードを抽出する処理は、
前記会議音声をテキスト化し、形態素解析を行い、第１のワードおよび前記第１のワードの第１の出現回数を抽出し、
前記会議音声以外の第２の会議音声をテキスト化し、形態素解析を行い、会議ごとに第２のワードおよび前記第２のワードの第２の出現回数を抽出し、
前記第１のワード、前記第１の出現回数、前記第２のワード、および前記第２の出現回数に基づいて、前記キーワードを抽出する
処理を含むことを特徴とする付記１３に記載の発言制御装置。 (Additional Note 14) The process of extracting the keywords executed by the control unit includes:
converting the conference voice into text, performing morphological analysis, and extracting a first word and a first number of occurrences of the first word;
A second conference voice other than the conference voice is converted into text, a morphological analysis is performed, and a second word and a second number of occurrences of the second word are extracted for each conference;
The speech control device according to claim 13, further comprising a process of extracting the keyword based on the first word, the first number of occurrences, the second word, and the second number of occurrences.

（付記１５）制御部は、前記特徴語および前記キーワードの抽出に、ｔｆ－ｉｄｆ手法を用いることを特徴とする付記１０乃至１４のいずれか一項に記載の発言制御装置。 (Appendix 15) The speech control device described in any one of appendices 10 to 14, characterized in that the control unit uses the tf-idf method to extract the characteristic words and the keywords.

（付記１６）制御部が、
前記参加者のうち少なくとも２名以上の話し始めの音声を第２の所定時間内に受信した場合、前記発言優先度が最も高い前記参加者を前記特定者に決定する
処理をさらに実行することを特徴とする付記１０乃至１５のいずれか一項に記載の発言制御装置。 (Supplementary Note 16) The control unit
The speech control device according to any one of claims 10 to 15, further comprising a process of determining the participant with the highest speech priority as the specific person when speech start voices of at least two or more of the participants are received within a second predetermined time.

（付記１７）制御部が、
前記参加者のいずれかが発言中の場合、発言者以外の発言をミュートする
処理をさらに実行することを特徴とする付記１０乃至１６のいずれか一項に記載の発言制御装置。 (Supplementary Note 17) The control unit,
The speech control device according to any one of claims 10 to 16, further comprising a process of muting speeches made by any participant other than the participant when the participant is speaking.

（付記１８）プロセッサと、
前記プロセッサに動作可能に接続されたメモリと
を備えた発言制御装置であって、前記プロセッサは、
所定時間内に発言した複数の参加者に対して、会議関連のメール文の特徴語および会議音声から逐次取得されるキーワードに基づいて発言優先度を算出し、
前記発言優先度に基づいて、前記参加者のうちの特定者以外の発言をミュートする
処理を実行することを特徴とする発言制御装置。 (Supplementary Note 18) A processor;
A speech control device comprising: a memory operably connected to the processor, the processor comprising:
Calculate comment priorities for multiple participants who comment within a specified time period based on characteristic words in emails related to the meeting and keywords sequentially acquired from the meeting audio;
A speech control device which executes a process of muting speech from any of the participants other than specific participants based on the speech priority.

これまで本発明の実施例について説明したが、本発明は上述した実施例以外にも、種々の異なる形態にて実施されてよいものである。 Though the present invention has been described above with reference to examples, the present invention may be implemented in a variety of different forms other than the examples described above.

１００発言制御装置
１００ａ通信部
１００ｂＨＤＤ
１００ｃメモリ
１００ｄプロセッサ
１１０通信部
１２０記憶部
１２１メール情報
１２２スケジュール情報
１２３組織情報
１２４会議情報
１３０制御部
１３１抽出部
１３２決定部
１３３算出部
１３４ミュート部
１３５出力部
２００参加者端末
２５０ミュート中表示
２６０優先発言中表示 100: speech control device 100a: communication unit 100b: HDD
100c Memory 100d Processor 110 Communication unit 120 Storage unit 121 Mail information 122 Schedule information 123 Organization information 124 Conference information 130 Control unit 131 Extraction unit 132 Determination unit 133 Calculation unit 134 Muting unit 135 Output unit 200 Participant terminal 250 Muted display 260 Priority speech display

Claims

A system having a speech control device and a plurality of participant terminals capable of communicating with each other via a network, in which a plurality of participants using the plurality of participant terminals hold a conference by voice,
The speech control device includes :
calculating , for each participant, a speech priority level that is higher for a participant having a greater number of matches between the characteristic words and the keywords based on the characteristic words of the email texts related to the conference and the keywords sequentially acquired from the conference voice;
A speech control program that executes a process of muting speeches of participants other than a specific person having the highest speech priority based on the speech priority .

The speech control device includes:
determining a degree of relevance to the conference for each of the participants based on the number of characters in the email message;
2. The program according to claim 1, wherein the speech control device calculates the speech priority based on the relevance.

The speech control program according to claim 2, characterized in that the speech control device calculates the speech priority based on the role of the participant.

The speech control device includes:
performing a morphological analysis on the email text for each of the participants to extract the characteristic words;
The conference voice is converted into text, a morphological analysis is performed, and the keywords are extracted.
4. The comment control program according to claim 2, wherein the comment control device calculates the comment priority based on the relevance and the number of matches between the feature words and the keywords.

The process of extracting the keyword that is executed by the speech control device includes:
Converting the conference voice of the current conference into text, performing morphological analysis, and extracting a first word and a first number of occurrences of the first word;
Converting a second conference speech of a past conference into text, performing a morphological analysis, and extracting a second word and a second number of occurrences of the second word for each conference;
5. The speech control program according to claim 4, further comprising a process for extracting the keyword based on the first word, the first number of occurrences, the second word, and the second number of occurrences.

The speech control program according to any one of claims 1 to 5, characterized in that the speech control device uses the tf-idf method to extract the characteristic words and the keywords.

The speech control device includes:
The speech control program according to any one of claims 1 to 6, further comprising a process of determining the participant having the highest speech priority as the specific person when speech start voices of at least two of the participants are received within a second predetermined time.

The speech control device includes:
8. The speech control program according to claim 1, further comprising a process for muting speeches from participants other than the speaker when any of the participants is speaking.

A system having a speech control device and a plurality of participant terminals capable of communicating with each other via a network, in which a plurality of participants using the plurality of participant terminals hold a conference by voice,
The speech control device,
calculating , for each participant, a speech priority level that is higher for a participant having a greater number of matches between the characteristic words and the keywords based on the characteristic words of the email texts related to the conference and the keywords sequentially acquired from the conference voice;
a process for muting speech from among the participants other than a specific person having the highest speech priority based on the speech priority, said process comprising: muting said speech from among the participants other than a specific person having the highest speech priority based on the speech priority.

A speech control device used in a system having a speech control device and a plurality of participant terminals capable of communicating with each other via a network, the system comprising:
calculating , for each participant, a speech priority level that is higher for a participant having a greater number of matches between the characteristic words and the keywords based on the characteristic words of the email texts related to the conference and the keywords sequentially acquired from the conference voice;
A speech control device comprising: a control unit that executes a process of muting speeches of the participants other than those of a specific person having the highest speech priority based on the speech priority .