JP7517366B2

JP7517366B2 - Voice recording management system, voice recording management device, voice recording management method and program

Info

Publication number: JP7517366B2
Application number: JP2022101335A
Authority: JP
Inventors: 豊柳浦; 拓郎真野; 章敬中島
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2021-08-16
Filing date: 2022-06-23
Publication date: 2024-07-17
Anticipated expiration: 2042-06-23
Also published as: JP2023027001A

Description

本発明は、音声記録管理システム、音声記録管理装置、音声記録管理方法及びプログラムに関する。 The present invention relates to an audio recording management system, an audio recording management device, an audio recording management method, and a program.

従来から、音声情報をテキスト情報に変換することにより、会議の議事録を自動生成する技術が知られている。 Techniques for automatically generating meeting minutes by converting audio information into text information are conventionally known.

例えば、会議で記録された音声データに対する音声認識の結果である会議テキスト情報を複数の区間に分割し、分割されたテキスト情報ごとに要約情報を生成する技術が知られている（例えば、特許文献１参照）。 For example, a technique is known in which conference text information, which is the result of speech recognition of audio data recorded at a conference, is divided into multiple sections, and summary information is generated for each section of the divided text information (see, for example, Patent Document 1).

しかしながら、従来の技術では、音声情報に基づいて生成された音声記録を編集する場合、音声記録に含まれるテキストデータと、そのテキストデータに対応する画像データ及び音声データをそれぞれ個別に編集する必要があり、利便性が低いという課題があった。 However, with conventional technology, when editing a voice recording generated based on voice information, it was necessary to separately edit the text data contained in the voice recording and the image data and voice data corresponding to that text data, resulting in a problem of low convenience.

上述した課題を解決するために、請求項１に係る発明は、音声情報に基づいて得られた音声記録情報を管理する音声記録管理装置と、前記音声記録管理装置と通信することで前記音声記録情報を表示可能な一以上の通信端末と、を含む音声記録管理システムであって、前記音声記録管理装置は、前記一以上の通信端末のうち、第１の通信端末が送信した前記音声情報を表す音声データ、及び前記第１の通信端末に表示された画面を表す画面データを取得する取得手段と、取得された前記音声データに基づいて得られた所定のテキストを表す所定のテキストデータと、取得された前記画面データに係る前記画面に含まれる画像であり、前記所定のテキストに対応付けられた所定の画像を表す所定の画像データと、前記所定のテキストで示される所定の音声データとを、前記第１の通信端末を含む前記一以上の通信端末に送信する送信手段と、を有し、前記送信手段は、前記第１の通信端末が送信した編集要求であり、前記所定のテキスト又は前記所定の画像に対する編集要求に応じて、前記所定のテキストデータを編集処理した編集後テキストデータと前記所定の画像データを編集処理した編集後画像データとを含む編集後画面データと、前記所定の音声データを編集処理した編集後音声データとを、前記第１の通信端末と異なる第２の通信端末に対して送信し、前記第２の通信端末は、前記音声記録管理装置が送信した前記編集後画面データに係る編集後画面を表示手段に表示する表示制御手段と、前記音声記録管理装置が送信した前記編集後音声データに係る編集後音声を再生する音声再生手段と、を有する、ことを特徴とする音声記録管理システムを提供する。 In order to solve the above-mentioned problem, the invention according to claim 1 provides a voice recording management system including a voice recording management device that manages voice recording information obtained based on voice information, and one or more communication terminals that can display the voice recording information by communicating with the voice recording management device. The voice recording management device has an acquisition means for acquiring voice data representing the voice information transmitted by a first communication terminal among the one or more communication terminals, and screen data representing a screen displayed on the first communication terminal. The voice recording management device also has a transmission means for transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included in the screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text. In response to an edit request that is transmitted by the first communication terminal and is directed to the predetermined text or the predetermined image, the transmission means transmits, to a second communication terminal different from the first communication terminal, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data. The second communication terminal has a display control means for displaying, on a display means, an edited screen related to the edited screen data transmitted by the voice recording management device, and a voice playback means for playing back edited voice related to the edited voice data transmitted by the voice recording management device.

以上説明したように本発明によれば、音声情報に基づいて生成された音声記録を編集する場合、音声記録に含まれるテキストデータ又はそのテキストデータに対応する画像データを編集すればよいので、音声記録の編集における利便性を向上させることができるという効果を奏する。 As described above, according to the present invention, when editing an audio recording generated based on audio information, it is sufficient to edit the text data contained in the audio recording or the image data corresponding to that text data, thereby achieving the effect of improving the convenience of editing the audio recording.
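The linkage described above, in which an edit to a text or its image also determines what audio a second terminal receives, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the names `Entry`, `apply_edit`, and `view_for_second_terminal`, the use of a "hide" operation as the only edit type, and the rule that hiding a text also drops its audio span are not taken from the specification.

```python
from dataclasses import dataclass, replace
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Entry:
    """Hypothetical unit of a voice recording: a text with its linked image and audio."""
    text: str
    image_id: str                    # captured screen image associated with the text
    audio_span: Tuple[float, float]  # (start_sec, end_sec) of the matching speech
    text_hidden: bool = False
    image_hidden: bool = False


def apply_edit(entries: List[Entry], index: int, target: str) -> List[Entry]:
    """Apply a 'hide' edit request from the first terminal to a text or an image."""
    entry = entries[index]
    if target == "text":
        updated = replace(entry, text_hidden=True)
    elif target == "image":
        updated = replace(entry, image_hidden=True)
    else:
        raise ValueError(f"unsupported edit target: {target}")
    # Return a new list so the creator's original record is left untouched.
    return entries[:index] + [updated] + entries[index + 1:]


def view_for_second_terminal(entries: List[Entry]):
    """Build the edited screen data and edited audio spans sent to another terminal."""
    screen = [
        {
            "text": None if e.text_hidden else e.text,
            "image_id": None if e.image_hidden else e.image_id,
        }
        for e in entries
    ]
    # Assumption: audio follows the text, so a hidden text also removes its speech.
    audio: List[Optional[Tuple[float, float]]] = [
        e.audio_span for e in entries if not e.text_hidden
    ]
    return screen, audio
```

In this sketch the creator issues a single request against a text, and the audio for the second terminal is derived from the same record, so no separate audio editing step is needed.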

通信システムの全体構成の一例を示す図である。FIG. 1 illustrates an example of an overall configuration of a communication system. 通信端末及び音声認識サーバのハードウエア構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of a hardware configuration of a communication terminal and a voice recognition server. 音声記録管理装置のハードウエア構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of a hardware configuration of a voice recording management device. 通信システムの機能構成の一例を示す図である。FIG. 1 illustrates an example of a functional configuration of a communication system. ログイン管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a login management table. 記録書誌情報管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram showing an example of a record bibliographic information management table. テキスト情報管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a text information management table. キャプチャ画像管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a capture image management table. キャプチャ画像取得間隔テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a capture image acquisition interval table. 非公開音声管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a private voice management table. アプリ起動、認証処理及びセッション確立処理の一例を示すシーケンス図である。11 is a sequence diagram showing an example of application startup, authentication processing, and session establishment processing. FIG. 通信端末におけるアプリ起動時の画面表示例である。13 is an example of a screen display when an application is started on a communication terminal. 記録開始処理の一例を示すシーケンス図である。FIG. 11 is a sequence diagram illustrating an example of a recording start process. 通信端末における記録開始指示の画面表示例である。13 is an example of a screen display of a recording start instruction on a communication terminal. 記録書誌情報の登録処理の一例を示すシーケンス図である。FIG. 11 is a sequence diagram showing an example of a process for registering record bibliographic information. 音声認識処理の一例を示すシーケンス図である。FIG. 
11 is a sequence diagram illustrating an example of a voice recognition process. 通信端末における記録中の画面表示例である。13 is an example of a screen display during recording on the communication terminal. 画面キャプチャ処理の一例を示すシーケンス図である。FIG. 11 is a sequence diagram illustrating an example of a screen capture process. 記録終了処理の一例を示すシーケンス図である。FIG. 11 is a sequence diagram showing an example of a recording end process. 通信端末における記録終了時の画面表示例である。13 is an example of a screen display when recording ends on the communication terminal. 通信端末における記録選択時の画面表示例である。13 is an example of a screen display when selecting recording on the communication terminal. 通信端末における共有情報入力ダイアログの画面表示例である。13 is an example of a shared information input dialogue screen displayed on the communication terminal. 記録閲覧編集画面の生成処理の一例を示すシーケンス図である。FIG. 13 is a sequence diagram showing an example of a process for generating a record viewing and editing screen. 作成者の通信端末又は最初の閲覧時に対する記録閲覧編集画面生成処理の一例を示すフローチャートである。13 is a flowchart showing an example of a record viewing and editing screen generation process for the creator's communication terminal or for the first viewing. 作成者以外の利用者の通信端末に対する記録閲覧画面生成処理の一例を示すフローチャートである。13 is a flowchart showing an example of a record viewing screen generation process for a communication terminal of a user other than the creator. 作成者以外の利用者の通信端末に対する記録閲覧画面生成処理の一例を示すフローチャートである。13 is a flowchart showing an example of a record viewing screen generation process for a communication terminal of a user other than the creator. 作成者の通信端末の記録閲覧編集画面の画面表示例である。13 is a screen display example of a record viewing and editing screen on the creator's communication terminal. 作成者以外の利用者の通信端末における記録閲覧の画面表示例である。13 is an example of a screen display for viewing a record on a communication terminal of a user other than the creator. 記録閲覧編集処理の一例を示すシーケンス図である。FIG. 11 is a sequence diagram showing an example of a record viewing and editing process. 
各種ボタン操作により分岐される処理の一例を示すフローチャートである。11 is a flowchart showing an example of a process branched by various button operations. 各種ボタン操作により分岐される処理の一例を示すフローチャートである。11 is a flowchart showing an example of a process branched by various button operations. テキストに対する非公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a private button is operated on text. テキストに対する非公開ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。13 is an example of a screen display displayed on a communication terminal of a user other than the creator when a private button for text is operated. テキストに対する公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a publish button is operated on text. 作成者の通信端末におけるテキストに対する公開ボタン操作時の画面表示例である。13 is a screen display example when a creator operates a publish button on the text on the communication terminal. テキストに対する削除ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a delete button is operated on text. テキストに対する削除ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。13 is an example of a screen display displayed on a communication terminal of a user other than the creator when a delete button for text is operated. キャプチャ画像に対する非公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a private button is operated on a captured image. キャプチャ画像に対する非公開ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。13 is a diagram illustrating an example of a screen display displayed on a communication terminal of a user other than the creator when a private button for a captured image is operated. キャプチャ画像に対する公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a publish button is operated on a captured image. キャプチャ画像に対する削除ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a delete button is operated on a captured image. 
キャプチャ画像に対する削除ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。13 is an example of a screen display displayed on a communication terminal of a user other than the creator when a delete button for a captured image is operated. 第２の実施形態に係るテキスト情報管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a text information management table according to the second embodiment. 第２の実施形態に係る要約情報管理テーブルの一例を示す概念図である。FIG. 11 is a conceptual diagram illustrating an example of a summary information management table according to the second embodiment. 第２の実施形態に係る各種ボタン操作により分岐される処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process branched by various button operations according to the second embodiment. 第２の実施形態に係る各種ボタン操作により分岐される処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process branched by various button operations according to the second embodiment. 第２の実施形態に係るテキストグループに対する非公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process performed when a private button is operated on a text group according to the second embodiment. 第２の実施形態に係る通信端末におけるテキストグループに対する要約力ボタン操作時の画面表示例である。13 is an example of a screen display when a summarization button is operated for a text group in the communication terminal according to the second embodiment. 第２の実施形態に係る通信端末における要約情報入力ダイアログの画面表示例である。13 is a diagram illustrating an example of a screen display of a summary information input dialogue in a communication terminal according to a second embodiment. 第２の実施形態に係る通信端末における要約表示欄を含む画面表示例である。13 is an example of a screen display including a summary display field in a communication terminal according to a second embodiment. 第２の実施形態に係るテキストグループに対する公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a publish button is operated on a text group according to the second embodiment. 
第２の実施形態に係るテキストグループに対する削除ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process performed when a delete button is operated on a text group according to the second embodiment. 第２の実施形態に係るキャプチャ画像に対する非公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process performed when a private button is operated on a captured image according to the second embodiment. 第２の実施形態に係るキャプチャ画像に対する公開ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process performed when a publish button is operated on a captured image according to the second embodiment. 第２の実施形態に係るキャプチャ画像に対する削除ボタン操作時の処理の一例を示すフローチャートである。13 is a flowchart showing an example of a process performed when a delete button is operated on a captured image according to the second embodiment. 第３の実施形態に係る通信システムの機能構成の一例を示す図である。FIG. 11 is a diagram illustrating an example of a functional configuration of a communication system according to a third embodiment. 第３の実施形態に係る記録書誌情報管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a record bibliographic information management table according to the third embodiment. 第３の実施形態に係るテキスト情報管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a text information management table according to the third embodiment. 第３の実施形態に係るキャプチャ画像管理テーブルの一例を示す概念図である。FIG. 13 is a conceptual diagram illustrating an example of a capture image management table according to the third embodiment. 第３の実施形態に係る通信端末における記録選択時の画面表示例である。13 is an example of a screen display when selecting recording in the communication terminal according to the third embodiment. 第３の実施形態に係る記録閲覧編集処理の一例を示すシーケンス図である。FIG. 13 is a sequence diagram showing an example of a record viewing and editing process according to the third embodiment. 第３の実施形態に係る作成者の通信端末の記録閲覧編集画面の画面表示例である。13 is a screen display example of a record viewing and editing screen on a creator's communication terminal according to the third embodiment. 
第３の実施形態に係る作成者の通信端末の記録閲覧編集画面の他の画面表示例である。13 is another example of a screen display of the record viewing and editing screen of the creator's communication terminal according to the third embodiment. 第３の実施形態に係る作成者の通信端末の記録閲覧編集画面の他の画面表示例である。13 is another example of a screen display of the record viewing and editing screen of the creator's communication terminal according to the third embodiment. 第３の実施形態に係る記録閲覧編集画面の分割処理の一例を示すフローチャートである。13 is a flowchart showing an example of a division process of a record viewing and editing screen according to the third embodiment. 第３の実施形態に係る記録閲覧編集画面の分割処理後のテキスト情報管理テーブルの一例を示す概念図で、（a）は分割された一つ目の音声記録画面を構成するテキスト情報管理テーブルの概念図、(b)は分割された二つ目の音声記録画面を構成するテキスト情報管理テーブルの概念図である。A conceptual diagram showing an example of a text information management table after splitting processing of the record viewing and editing screen in the third embodiment, where (a) is a conceptual diagram of the text information management table constituting the first split audio recording screen, and (b) is a conceptual diagram of the text information management table constituting the second split audio recording screen. 第３の実施形態に係る記録閲覧編集画面の分割処理後のキャプチャ画像管理テーブルの一例を示す概念図で、（a）は分割された一つ目の音声記録画面を構成するキャプチャ画像管理テーブルの概念図、(b)は分割された二つ目の音声記録画面を構成するキャプチャ画像管理テーブルの概念図である。13 is a conceptual diagram showing an example of a capture image management table after splitting processing of a record viewing and editing screen relating to the third embodiment, where (a) is a conceptual diagram of a capture image management table constituting the first split audio recording screen, and (b) is a conceptual diagram of a capture image management table constituting the second split audio recording screen. 
第３の実施形態に係る記録閲覧編集画面の分割処理後の記録書誌情報管理テーブルの一例を示す概念図で、分割された一つ目の音声記録画面を構成する記録書誌情報管理テーブルの概念図である。This is a conceptual diagram showing an example of a recorded bibliographic information management table after the division processing of the record viewing and editing screen related to the third embodiment, and is a conceptual diagram of the recorded bibliographic information management table that constitutes the first divided voice recording screen. 第３の実施形態に係る記録閲覧編集画面の分割処理後の記録書誌情報管理テーブルの一例を示す概念図で、分割された二つ目の音声記録画面を構成する記録書誌情報管理テーブルの概念図である。This is a conceptual diagram showing an example of a recorded bibliographic information management table after splitting processing of the record viewing and editing screen relating to the third embodiment, and is a conceptual diagram of the recorded bibliographic information management table constituting the second split voice recording screen. 第３の実施形態に係る通信端末における記録閲覧編集画面の分割処理後の記録選択時の画面表示例である。13 is an example of a screen display when selecting a record after division processing of the record viewing and editing screen on the communication terminal according to the third embodiment. 第３の実施形態に係る作成者以外の利用者の通信端末に表示される分割された一つ目の音声記録画面の画面表示例である。13 is a screen display example of a first divided voice recording screen displayed on a communication terminal of a user other than the creator according to the third embodiment. 第３の実施形態に係る通信端末における記録閲覧編集処理画面の分割処理後の記録選択時の他の画面表示例である。13 is a diagram illustrating another example of a screen display when selecting a record after division processing of the record viewing and editing processing screen in the communication terminal according to the third embodiment. 第３の実施形態に係る作成者以外の利用者の通信端末に表示される分割された二つ目の音声記録画面の画面表示例である。13 is a screen display example of a second divided voice recording screen displayed on a communication terminal of a user other than the creator according to the third embodiment. 
第３の実施形態の変形例に係る記録閲覧編集画面の分割処理の一例を示すフローチャートである。13 is a flowchart showing an example of a division process of a record viewing and editing screen according to a modified example of the third embodiment. 第３の実施形態の変形例に係る記録閲覧編集画面の分割処理後のテキスト情報管理テーブルの一例を示す概念図で、（a）は分割された一つ目の音声記録画面を構成するテキスト情報管理テーブルの概念図、(b)は分割された二つ目の音声記録画面を構成するテキスト情報管理テーブルの概念図である。13 is a conceptual diagram showing an example of a text information management table after splitting processing of a record viewing and editing screen relating to a modified example of the third embodiment, where (a) is a conceptual diagram of a text information management table constituting a first split audio recording screen, and (b) is a conceptual diagram of a text information management table constituting a second split audio recording screen. 第３の実施形態の変形例に係る記録閲覧編集画面の分割処理後の記録書誌情報管理テーブルの一例を示す概念図で、分割された一つ目の音声記録画面を構成する記録書誌情報管理テーブルの概念図である。This is a conceptual diagram showing an example of a recorded bibliographic information management table after splitting processing of a record viewing and editing screen related to a modified example of the third embodiment, and is a conceptual diagram of the recorded bibliographic information management table that constitutes the first split voice recording screen. 第３の実施形態の変形例に係る記録閲覧編集画面の分割処理後の記録書誌情報管理テーブルの一例を示す概念図で分割された二つ目の音声記録画面を構成する記録書誌情報管理テーブルの概念図である。A conceptual diagram showing an example of a recorded bibliographic information management table after splitting processing of a record viewing and editing screen relating to a modified example of the third embodiment, which is a conceptual diagram of a recorded bibliographic information management table constituting a second split voice recording screen.

以下、図面を用いて、発明を実施するための形態について説明する。なお、図面の説明において同一要素には同一符号を付し、重複する部分があればその説明を省略する。 Embodiments for carrying out the invention are described below with reference to the drawings. In the description of the drawings, identical elements are given identical reference numerals, and redundant descriptions are omitted.

〔第１の実施形態〕
図１乃至図３９を用いて、第１の実施形態について説明する。 First Embodiment
The first embodiment will be described with reference to FIGS. 1 to 39.

〔通信システムの全体構成〕
＜システム構成例＞
図１は、通信システムの全体構成の一例を示す図である。図１に示されているように、通信システム１は、一以上の通信端末３、音声記録管理装置５及び音声認識サーバ７を含む各装置を有している。通信端末３、音声記録管理装置５及び音声認識サーバ７は、通信ネットワーク１００を介してそれぞれ互いに接続されている。ここで、通信ネットワーク１００は、不特定多数の通信が行われる通信ネットワークであり、インターネット、イントラネット、ＬＡＮ(Local Area Network)等によって構築されている。なお、通信ネットワーク１００には、有線通信だけでなく、３Ｇ(3rd Generation)、４Ｇ(4th Generation)、５Ｇ(5th Generation)、ＷｉＭＡＸ(Worldwide Interoperability for Microwave Access)、ＬＴＥ(Long Term Evolution)等の無線通信による通信ネットワークが含まれてもよい。更に、通信システム１は、通信端末３及び音声記録管理装置５によって構築された音声記録管理システム２を含んでいる。また、通信端末３と音声記録管理装置５は、専用の社内ネットワーク等で互いに接続されていてもよいし、通信ネットワーク１００の内側に、ファイアウォール(Fire Wall)を介して互いに接続されていてもよい。 [Overall configuration of communication system]
<System configuration example>
FIG. 1 is a diagram showing an example of the overall configuration of a communication system. As shown in FIG. 1, the communication system 1 has devices including one or more communication terminals 3, a voice recording management device 5, and a voice recognition server 7. The communication terminals 3, the voice recording management device 5, and the voice recognition server 7 are connected to each other via a communication network 100. Here, the communication network 100 is a communication network in which an unspecified number of communications are performed, and is constructed by the Internet, an intranet, a LAN (Local Area Network), or the like. Note that the communication network 100 may include not only wired communication, but also communication networks using wireless communication such as 3G (3rd Generation), 4G (4th Generation), 5G (5th Generation), WiMAX (Worldwide Interoperability for Microwave Access), and LTE (Long Term Evolution). Furthermore, the communication system 1 includes a voice recording management system 2 constructed by the communication terminals 3 and the voice recording management device 5. In addition, the communication terminal 3 and the voice recording management device 5 may be connected to each other via a dedicated in-house network or the like, or may be connected to each other inside the communication network 100 via a firewall.

＜通信端末＞
通信端末３は、一般的なＯＳなどが搭載された通信を行うための一以上の情報処理装置（コンピュータシステム）によって実現される。通信端末３は、通信ネットワーク１００を介して、音声記録管理装置５と通信が可能である。図１に示されているように、通信端末３は、通信端末３（Ａ）、通信端末３（Ｂ）、通信端末３（Ｃ）を含む一以上の通信端末で構成されている。 <Communication terminal>
The communication terminal 3 is realized by one or more information processing devices (computer systems) for communication equipped with a general OS, etc. The communication terminal 3 is capable of communicating with the voice recording management device 5 via a communication network 100. As shown in Fig. 1, the communication terminal 3 is composed of one or more communication terminals including a communication terminal 3(A), a communication terminal 3(B), and a communication terminal 3(C).

通信端末３（Ａ）は、例えば、音声記録管理装置５と通信するためのブラウザアプリ及び音声記録管理装置５が送信したテキスト情報に基づいて会議等の議事録などを作成するための記録管理アプリがそれぞれインストールされている。更に、通信端末３（Ａ）は、通信端末３（Ｂ）との間で、リモートワーク、テレビ会議、インスタントメッセージング、グループチャットなどを行うための汎用ツール（ここでは「会話ツール」と呼ぶ）を利用して会話等の所定のイベントに参加し、議事録を作成可能な通信端末である。このように、通信端末３（Ａ）は、例えば、議事録作成端末として機能する。 The communication terminal 3(A) has installed thereon, for example, a browser application for communicating with the voice recording management device 5 and a record management application for creating minutes of meetings and the like based on text information sent by the voice recording management device 5. Furthermore, the communication terminal 3(A) is a communication terminal that can participate in a specific event such as a conversation with the communication terminal 3(B) by using a general-purpose tool (herein referred to as a "conversation tool") for remote work, video conferencing, instant messaging, group chat, and the like, and that can create minutes of that event. In this way, the communication terminal 3(A) functions, for example, as a minutes-creation terminal.

通信端末３（Ｂ）は、通信端末３（Ａ）と上述した汎用ツールを利用し、所定のイベントに参加している通信端末３（Ａ）を使用する利用者(例えば、利用者Ａ)とともに所定のイベントに参加する。このように、通信端末３（Ｂ）は、例えば、イベント参加端末として機能する。 The communication terminal 3(B) uses the general-purpose tool described above with the communication terminal 3(A), and participates in the specific event together with the user of the communication terminal 3(A) (e.g., user A), who is also participating in that event. In this way, the communication terminal 3(B) functions, for example, as an event participation terminal.

通信端末３（Ａ）及び通信端末３（Ｂ）は、上述したような汎用ツールを利用して所定のイベントにおいて発話することにより、互いの発話音声を聞くことができる。そのため、議事録作成端末としての通信端末３（Ａ）は、通信端末３（Ａ）を利用する利用者Ａの発話音声のみならず、通信端末３（Ａ）と通信を行っているイベント参加端末としての通信端末３（Ｂ）を利用する利用者（例えば、利用者Ｂ）の発話音声も取得することができる。 The communication terminal 3(A) and the communication terminal 3(B) can hear each other's voice by speaking at a specific event using the general-purpose tool as described above. Therefore, the communication terminal 3(A) as a minutes-creation terminal can acquire not only the voice of user A who uses the communication terminal 3(A), but also the voice of a user (e.g., user B) who uses the communication terminal 3(B) as an event participation terminal communicating with the communication terminal 3(A).

通信端末３（Ｃ）は、通信端末３（Ａ）及び通信端末３（Ｂ）が参加した所定のイベントに基づいて作成された議事録を閲覧する端末である。この場合、通信端末３（Ｃ）は、記録管理アプリがインストールされていなくても、ブラウザ経由で所定のイベントの議事録を閲覧することが可能である。このように、通信端末３（Ｃ）は、例えば、議事録閲覧端末として機能する。 The communication terminal 3(C) is a terminal for viewing minutes created based on a specific event in which the communication terminals 3(A) and 3(B) participated. In this case, the communication terminal 3(C) is capable of viewing the minutes of the specific event via a browser even if a record management app is not installed. In this way, the communication terminal 3(C) functions as, for example, a minutes viewing terminal.

本実施形態において、特に指定がなければ単に「通信端末３」と記す。なお、通信端末３は、一般的に使用されるＰＣ(Personal Computer)、携帯型ノートＰＣ、携帯電話、スマートフォン、タブレット端末、ウェアラブル端末（サングラス型、腕時計型等）の通信機能を有する通信端末であってもよい。通信端末３は、更に、ブラウザソフトウエア等のソフトウエアを動作させることが可能な通信装置又は通信端末が用いられてもよい。 In this embodiment, unless otherwise specified, it will simply be referred to as "communication terminal 3". Note that the communication terminal 3 may be a communication terminal having a communication function, such as a commonly used PC (Personal Computer), a portable notebook PC, a mobile phone, a smartphone, a tablet terminal, or a wearable terminal (sunglasses type, wristwatch type, etc.). The communication terminal 3 may further be a communication device or communication terminal capable of operating software such as browser software.

＜音声記録管理装置＞
音声記録管理装置５は、一般的なサーバＯＳなどが搭載された一以上の情報処理装置（コンピュータシステム）によって実現される。音声記録管理装置５は、専用のアプリケーションプログラムを実行し、通信ネットワーク１００を介して通信端末３が送信した音声情報に基づいて得られた音声記録情報を管理するクラウドサーバの機能を有する。なお、音声記録管理装置５は、音声記録情報として、イベントの一例としての会議の議事録に限らず、個人、グループの任意の活動に対する音声メモ、オペレータの電話応対時の音声記録、工場等の特定の場所における作業記録等に基づいた各種音声記録情報を管理してもよい。 <Audio Recording Management Device>
The voice recording management device 5 is realized by one or more information processing devices (computer systems) equipped with a general server OS or the like. The voice recording management device 5 has a function of a cloud server that executes a dedicated application program and manages voice recording information obtained based on voice information transmitted by the communication terminal 3 via the communication network 100. Note that the voice recording management device 5 may manage various voice recording information based on not only minutes of a meeting as an example of an event, but also voice memos for any individual or group activity, voice records of operators answering the phone, work records at a specific location such as a factory, etc., as the voice recording information.

音声記録管理装置５は、更に、通信端末３を利用する利用者を作成者識別情報で管理する。本実施形態に係る通信システムでは、利用者は作成者識別情報を用いて音声記録管理装置５にログインすることが可能である。このため、作成者識別情報は、音声記録管理装置５が利用者を一意に特定する機能を有している。なお、作成者識別情報には、電子メール、ＩＤ、電話番号など、利用者を一意に識別することが可能な情報が含まれる。 The voice recording management device 5 further manages users who use the communication terminal 3 with creator identification information. In the communication system according to this embodiment, users can log in to the voice recording management device 5 using the creator identification information. Therefore, the creator identification information has a function that allows the voice recording management device 5 to uniquely identify the user. The creator identification information includes information that can uniquely identify the user, such as an email address, ID, or telephone number.
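As a rough illustration of how creator identification information can serve both as a login key and as a unique identifier, the following sketch keeps users in an in-memory registry. The class `UserRegistry`, its methods, and the example addresses are hypothetical and are not part of the described device; a real device would also authenticate the user, which is omitted here.

```python
from typing import Dict, Optional


class UserRegistry:
    """Hypothetical stand-in for the user management of the voice recording management device."""

    def __init__(self) -> None:
        self._users: Dict[str, str] = {}

    def register(self, creator_id: str, display_name: str) -> None:
        # Creator identification information must identify exactly one user,
        # so a second registration under the same value is rejected.
        if creator_id in self._users:
            raise ValueError(f"creator ID already in use: {creator_id}")
        self._users[creator_id] = display_name

    def login(self, creator_id: str) -> Optional[str]:
        # Returns the user's display name, or None when the ID is unknown.
        return self._users.get(creator_id)
```

The identifier may be an email address, an ID, or a telephone number; the sketch treats all of them as opaque strings.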

音声記録管理装置５は、単一のコンピュータによって構築されてもよいし、ストレージ等の各部（機能又は手段）を分割して任意に割り当てられた複数のコンピュータによって構築されてもよい。また、音声記録管理装置５の機能の全てまたは一部は、クラウド環境に存在するサーバコンピュータであってもよいし、オンプレミス環境に存在するサーバコンピュータであってもよい。 The voice recording management device 5 may be constructed by a single computer, or by multiple computers among which its sections (functions or means), such as storage, are divided and arbitrarily assigned. In addition, all or part of the functions of the voice recording management device 5 may be implemented by a server computer in a cloud environment or by a server computer in an on-premise environment.

＜音声認識サーバ＞
音声認識サーバ７は、一般的なサーバＯＳなどが搭載された一以上の情報処理装置（コンピュータシステム）によって実現される。音声認識サーバ７は、音声記録管理装置５が送信した音声情報(データ)を受信すると、音声認識エンジンを起動して音声情報(データ)をテキストデータに変換し、変換したテキストデータを音声記録管理装置５に返信(送信)する機能を有する。つまり本実施形態に係る通信システムでは、音声認識サーバ７が、音声情報からテキスト情報に変換するクラウドサービス機能を有している。 <Speech recognition server>
The voice recognition server 7 is realized by one or more information processing devices (computer systems) equipped with a general server OS, etc. When the voice recognition server 7 receives voice information (data) transmitted by the voice recording management device 5, the voice recognition server 7 has a function of activating a voice recognition engine to convert the voice information (data) into text data, and returning (transmitting) the converted text data to the voice recording management device 5. That is, in the communication system according to this embodiment, the voice recognition server 7 has a cloud service function of converting voice information into text information.

具体的には、音声認識サーバは、音声認識を可能とする他社サービスを利用するようにしてもよく、例えば、汎用の音声認識エンジンサービスで提供されてよい。 Specifically, the voice recognition server may use a third-party service that enables voice recognition, and may be provided, for example, by a general-purpose voice recognition engine service.
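The round trip described in this section (the management device forwards audio to the recognition server and receives text back) can be sketched as follows. The recognizer is passed in as a plain function so that any third-party engine could be substituted; `fake_recognizer`, the class name, and the method `on_audio_from_terminal` are assumptions for illustration and do not correspond to a real service API.

```python
from typing import Callable, List

# A recognizer takes raw audio bytes and returns the recognized text.
Recognizer = Callable[[bytes], str]


def fake_recognizer(audio: bytes) -> str:
    """Hypothetical stand-in for the engine running on the voice recognition server 7."""
    return f"<{len(audio)} bytes transcribed>"


class VoiceRecordingManagementDevice:
    """Minimal sketch of the device's role in the recognition round trip."""

    def __init__(self, recognizer: Recognizer) -> None:
        self._recognizer = recognizer
        self.texts: List[str] = []  # text data kept as part of the voice recording

    def on_audio_from_terminal(self, audio: bytes) -> str:
        # Forward the audio to the recognition service and wait for the text.
        text = self._recognizer(audio)
        # Store the returned text, then hand it back for delivery to the terminal.
        self.texts.append(text)
        return text
```

Injecting the recognizer keeps the device independent of a particular engine, which matches the statement that a general-purpose recognition service may be used.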

●用語について●
本実施形態において利用者とは、以下に該当する者をいう。例えば、利用者には、所定のイベントで発話する参加者、人間が話す言語を生成可能なＡＩを搭載した機械、人型ロボット等が含まれる。本実施形態では、説明の便宜上、利用者という用語を使用する。 ●About terminology●
In this embodiment, a user refers to a person who falls under any of the following categories. For example, users include participants who speak at a specific event, machines equipped with AI capable of generating language spoken by humans, humanoid robots, etc. In this embodiment, the term "user" is used for the sake of convenience of explanation.

更に、本実施形態においてイベントとは、各種行事、催し物などをいう。例えば、イベントには、会議、打合せ、講義、講演、レクチャー、競技大会などが含まれる。 Furthermore, in this embodiment, an event refers to various occasions and activities. For example, events include meetings, conferences, lectures, speeches, competitions, etc.

更に、本実施形態においてテキストとは、利用者が発話した音声に係る音声情報を、所定の辞書等によって認識された結果に基づいて、視認可能な文字で表される単語、熟語、数字、記号、文などに変換された各種情報をいう。 Furthermore, in this embodiment, text refers to various information that is converted from audio information related to the voice spoken by the user into words, phrases, numbers, symbols, sentences, etc., expressed in visible characters based on the results of recognition using a specified dictionary, etc.

更に、本実施形態において音声記録とは、会議等の所定のイベントにおいて一以上のイベントへの参加者(利用者)が発話した発話音声に係る音声情報(データ)に基づいて得られたテキスト情報(データ)、画像、発話音声を含む議事録などで構成される記録をいう。そして、音声記録情報とは、上述した音声記録に係る情報をいう。 Furthermore, in this embodiment, a voice recording refers to a record composed of minutes or the like that include text information (data) obtained based on voice information (data) on speech uttered by one or more participants (users) in a specific event such as a conference, together with images and the uttered speech. Voice recording information refers to information on the voice recording described above.

〔ハードウエア構成〕 [Hardware configuration]
続いて、図２及び図３を用いて、実施形態に係る通信システムを構成する装置又は端末のハードウエア構成について説明する。なお、図２及び図３に示されている装置又は端末のハードウエア構成は、必要に応じて構成要素が追加又は削除されてもよい。 Next, the hardware configuration of the devices and terminals constituting the communication system according to the embodiment will be described with reference to Figs. 2 and 3. Components may be added to or removed from the hardware configurations shown in Figs. 2 and 3 as necessary.

＜通信端末、音声認識サーバのハードウエア構成＞ <Hardware configuration of communication terminal and voice recognition server>
図２は、通信端末及び音声認識サーバのハードウエア構成の一例を示す図である。図２に示されているように、通信端末３は、例えばコンピュータによって構築されている。通信端末３は、ＣＰＵ３０１、ＲＯＭ３０２、ＲＡＭ３０３、ＥＥＰＲＯＭ３０４、ＣＭＯＳ(Complementary Metal Oxide Semiconductor)センサ３０５、撮像素子Ｉ／Ｆ(Interface)３０６、メディアＩ／Ｆ３０９、バスライン３１０、ネットワークＩ／Ｆ３１２、ネットワークＩ／Ｆ３１２のアンテナ３１２ａ、マイク３１５、スピーカ３１６、音入出力Ｉ／Ｆ３１７、ディスプレイ３１８、外部機器接続Ｉ／Ｆ３１９、近距離通信回路３２０、近距離通信回路３２０のアンテナ３２０ａ及びタッチパネル３２１を備えている。 Fig. 2 is a diagram showing an example of the hardware configuration of the communication terminal and the voice recognition server. As shown in Fig. 2, the communication terminal 3 is constructed by, for example, a computer. The communication terminal 3 includes a CPU 301, a ROM 302, a RAM 303, an EEPROM 304, a CMOS (Complementary Metal Oxide Semiconductor) sensor 305, an image sensor I/F (Interface) 306, a media I/F 309, a bus line 310, a network I/F 312, an antenna 312a of the network I/F 312, a microphone 315, a speaker 316, a sound input/output I/F 317, a display 318, an external device connection I/F 319, a short-range communication circuit 320, an antenna 320a of the short-range communication circuit 320, and a touch panel 321.

これらのうち、ＣＰＵ３０１は、通信端末３の全体の動作を制御する。ＲＯＭ３０２は、ＣＰＵ３０１の処理に用いられるプログラムを記憶する。ＲＡＭ３０３は、ＣＰＵ３０１のワークエリアとして使用される。ＥＥＰＲＯＭ３０４は、ＣＰＵ３０１の制御にしたがって、アプリ等の各種データの読出し又は書込みを行う。ＣＭＯＳセンサ３０５は、ＣＰＵ３０１の制御にしたがって被写体を撮像して画像データ又は動画データを得る内蔵型の撮像手段の一種である。なお、撮像手段は、ＣＭＯＳセンサではなく、ＣＣＤ(Charge Coupled Device)センサ等で構成される撮像手段であってもよい。撮像素子Ｉ／Ｆ３０６は、ＣＭＯＳセンサ３０５の駆動を制御する回路である。メディアＩ／Ｆ３０９は、フラッシュメモリ等の記録メディア３０８に対するデータの読出し又は書込み(記憶)を制御する。バスライン３１０は、ＣＰＵ３０１等の各構成要素を電気的に接続するためのアドレスバスやデータバス等である。 Of these, the CPU 301 controls the overall operation of the communication terminal 3. The ROM 302 stores programs used in the processing of the CPU 301. The RAM 303 is used as a work area for the CPU 301. The EEPROM 304 reads or writes various data such as applications under the control of the CPU 301. The CMOS sensor 305 is a type of built-in imaging means that captures an image of a subject under the control of the CPU 301 to obtain image data or video data. The imaging means may be an imaging means configured with a CCD (Charge Coupled Device) sensor or the like instead of a CMOS sensor. The imaging element I/F 306 is a circuit that controls the driving of the CMOS sensor 305. The media I/F 309 controls the reading or writing (storing) of data from or to a recording medium 308 such as a flash memory. The bus line 310 is an address bus, a data bus, or the like for electrically connecting each component such as the CPU 301.

ネットワークＩ／Ｆ３１２は、通信ネットワーク１００を介して他の機器と各種データ(情報)通信するための通信インターフェイスである。このとき、ネットワークＩ／Ｆ３１２は、ネットワークＩ／Ｆ３１２のアンテナ３１２ａを使って通信を行ってもよい。マイク３１５は、音を電気信号に変える内蔵型の回路であり、外部のスピーカ等から発する音声や音波を取得し電気信号を用いた情報を取得する。スピーカ３１６は、電気信号を物理振動に変えて音楽や音声などの音を生み出す内蔵型の回路である。音入出力Ｉ／Ｆ３１７は、ＣＰＵ３０１の制御にしたがってマイク３１５及びスピーカ３１６との間で音信号の入出力を処理する回路である。ディスプレイ３１８は、被写体の画像や文字、各種アイコン等を表示する液晶や有機ＥＬ(Electro Luminescence)などの表示手段の一種である。外部機器接続Ｉ／Ｆ３１９は、各種の外部機器を接続するためのインターフェイスである。この場合の外部機器は、例えば、ＵＳＢ(Universal Serial Bus)メモリ等である。近距離通信回路３２０は、ＮＦＣ(Near Field Communication)、Ｂｌｕｅｔｏｏｔｈ（登録商標。以下省略）、ミリ波無線通信、Ｗｉ－Ｆｉ(登録商標。以下省略)、ＱＲコード（登録商標。以下省略）、可視光、環境音又は超音波等の無線通信インターフェイスを備える通信装置又は通信端末等と近距離無線通信を行うための通信回路である。また、近距離通信回路３２０には近距離通信回路３２０のアンテナ３２０ａが備わっている。タッチパネル３２１は、利用者がディスプレイ３１８上に配置された所定のボタン、アイコン等に対して押下、クリック又はタップ等の操作をすることで、通信端末３を操作する入力手段の一種である。 The network I/F 312 is a communication interface for communicating various data (information) with other devices via the communication network 100. At this time, the network I/F 312 may communicate using the antenna 312a of the network I/F 312. The microphone 315 is a built-in circuit that converts sound into an electrical signal, and acquires voice or sound waves emitted from an external speaker or the like to acquire information using the electrical signal. The speaker 316 is a built-in circuit that converts electrical signals into physical vibrations to generate sounds such as music and voice. The sound input/output I/F 317 is a circuit that processes input and output of sound signals between the microphone 315 and the speaker 316 under the control of the CPU 301. The display 318 is a type of display means such as liquid crystal or organic EL (Electro Luminescence) that displays images of subjects, characters, various icons, etc. The external device connection I/F 319 is an interface for connecting various external devices. In this case, the external device is, for example, a USB (Universal Serial Bus) memory or the like. 
The short-range communication circuit 320 is a communication circuit for performing short-range wireless communication with a communication device or communication terminal equipped with a wireless communication interface such as NFC (Near Field Communication), Bluetooth (registered trademark, omitted below), millimeter wave wireless communication, Wi-Fi (registered trademark, omitted below), QR code (registered trademark, omitted below), visible light, environmental sound, or ultrasonic wave. The short-range communication circuit 320 also includes an antenna 320a for the short-range communication circuit 320. The touch panel 321 is a type of input means for operating the communication terminal 3 by the user pressing, clicking, tapping, or other operations on a specific button, icon, or the like arranged on the display 318.

なお、通信端末３は、ブラウザソフトウエア等のプログラムを動作させることが可能な通信装置又は通信端末が用いられてもよい。 The communication terminal 3 may be a communication device or a communication terminal capable of running a program such as browser software.

音声認識サーバ７は、ＣＰＵ７０１、ＲＯＭ７０２、ＲＡＭ７０３、ＥＥＰＲＯＭ７０４、ＣＭＯＳ(Complementary Metal Oxide Semiconductor)センサ７０５、撮像素子Ｉ／Ｆ７０６、メディアＩ／Ｆ７０９、バスライン７１０、ネットワークＩ／Ｆ７１２、ネットワークＩ／Ｆ７１２のアンテナ７１２ａ、マイク７１５、スピーカ７１６、音入出力Ｉ／Ｆ７１７、ディスプレイ７１８、外部機器接続Ｉ／Ｆ７１９、近距離通信回路７２０、近距離通信回路７２０のアンテナ７２０ａ及びタッチパネル７２１を備えている。これらのハードウエア資源は、通信端末３のＣＰＵ３０１、ＲＯＭ３０２、ＲＡＭ３０３、ＥＥＰＲＯＭ３０４、ＣＭＯＳ(Complementary Metal Oxide Semiconductor)センサ３０５、撮像素子Ｉ／Ｆ３０６、メディアＩ／Ｆ３０９、バスライン３１０、ネットワークＩ／Ｆ３１２、ネットワークＩ／Ｆ３１２のアンテナ３１２ａ、マイク３１５、スピーカ３１６、音入出力Ｉ／Ｆ３１７、ディスプレイ３１８、外部機器接続Ｉ／Ｆ３１９、近距離通信回路３２０、近距離通信回路３２０のアンテナ３２０ａ及びタッチパネル３２１の各ハードウエア資源と同様であるため、説明を省略する。 The voice recognition server 7 includes a CPU 701, a ROM 702, a RAM 703, an EEPROM 704, a CMOS (Complementary Metal Oxide Semiconductor) sensor 705, an image sensor I/F 706, a media I/F 709, a bus line 710, a network I/F 712, an antenna 712a of the network I/F 712, a microphone 715, a speaker 716, an audio input/output I/F 717, a display 718, an external device connection I/F 719, a short-range communication circuit 720, an antenna 720a of the short-range communication circuit 720, and a touch panel 721. These hardware resources are similar to the hardware resources of the communication terminal 3, including the CPU 301, ROM 302, RAM 303, EEPROM 304, CMOS (Complementary Metal Oxide Semiconductor) sensor 305, image sensor I/F 306, media I/F 309, bus line 310, network I/F 312, antenna 312a of network I/F 312, microphone 315, speaker 316, sound input/output I/F 317, display 318, external device connection I/F 319, short-range communication circuit 320, antenna 320a of short-range communication circuit 320, and touch panel 321, and therefore will not be described.

＜音声記録管理装置のハードウエア構成＞ <Hardware configuration of the voice recording management device>
図３は、音声記録管理装置のハードウエア構成の一例を示す図である。図３に示されているように、音声記録管理装置５は、例えばコンピュータによって構築されており、ＣＰＵ５０１、ＲＯＭ５０２、ＲＡＭ５０３、ＥＥＰＲＯＭ５０４、ＨＤ５０５、ＨＤＤ(Hard Disk Drive)コントローラ５０６、ディスプレイ５０７、近距離通信Ｉ／Ｆ５０８、ＣＭＯＳセンサ５０９、撮像素子Ｉ／Ｆ５１０、ネットワークＩ／Ｆ５１１、キーボード５１２、ポインティングデバイス５１３、メディアＩ／Ｆ５１５、外部機器接続Ｉ／Ｆ５１６、音入出力Ｉ／Ｆ５１７、マイク５１８、スピーカ５１９及びバスライン５２０を備えている。 Fig. 3 is a diagram showing an example of the hardware configuration of the voice recording management device. As shown in Fig. 3, the voice recording management device 5 is constructed by, for example, a computer, and includes a CPU 501, a ROM 502, a RAM 503, an EEPROM 504, an HD 505, an HDD (Hard Disk Drive) controller 506, a display 507, a short-range communication I/F 508, a CMOS sensor 509, an image sensor I/F 510, a network I/F 511, a keyboard 512, a pointing device 513, a media I/F 515, an external device connection I/F 516, a sound input/output I/F 517, a microphone 518, a speaker 519, and a bus line 520.

これらのうち、ＣＰＵ５０１は、音声記録管理装置５全体の動作を制御する。ＲＯＭ５０２は、ＣＰＵ５０１の駆動に用いられるプログラムを記憶する。ＲＡＭ５０３は、ＣＰＵ５０１のワークエリアとして使用される。ＥＥＰＲＯＭ５０４は、ＣＰＵ５０１の制御にしたがって、アプリ等の各種データの読出し又は書込みを行う。ＨＤ５０５は、プログラム等の各種データを記憶する。ＨＤＤコントローラ５０６は、ＣＰＵ５０１の制御にしたがってＨＤ５０５に対する各種データの読出し又は書込みを制御する。ディスプレイ５０７は、カーソル、メニュー、ウィンドウ、文字又は画像などの各種情報を表示する。近距離通信Ｉ／Ｆ５０８は、ＮＦＣ(Near Field Communication)、Ｂｌｕｅｔｏｏｔｈ（登録商標。以下省略）、Ｗｉ－Ｆｉ(登録商標。以下省略)等の無線通信インターフェイスを備える通信装置、又は通信端末等とデータ通信を行うための通信回路である。ＣＭＯＳセンサ５０９は、ＣＰＵ５０１の制御にしたがって被写体を撮像して画像データ又は動画データを得る内蔵型の撮像手段の一種である。なお、撮像手段は、ＣＭＯＳセンサではなく、ＣＣＤ(Charge Coupled Device)センサ等で構成される撮像手段であってもよい。撮像素子Ｉ／Ｆ５１０は、ＣＭＯＳセンサ５０９の駆動を制御する回路である。 Of these, the CPU 501 controls the operation of the entire voice recording management device 5. The ROM 502 stores programs used to drive the CPU 501. The RAM 503 is used as a work area for the CPU 501. The EEPROM 504 reads or writes various data such as apps under the control of the CPU 501. The HD 505 stores various data such as programs. The HDD controller 506 controls the reading or writing of various data from the HD 505 under the control of the CPU 501. The display 507 displays various information such as a cursor, menu, window, text, or image. The short-range communication I/F 508 is a communication circuit for performing data communication with a communication device or a communication terminal equipped with a wireless communication interface such as NFC (Near Field Communication), Bluetooth (registered trademark, omitted below), or Wi-Fi (registered trademark, omitted below), etc. The CMOS sensor 509 is a type of built-in imaging means that captures an image of a subject under the control of the CPU 501 to obtain image data or video data. Note that the imaging means may be an imaging means configured with a CCD (Charge Coupled Device) sensor or the like instead of a CMOS sensor. The imaging element I/F 510 is a circuit that controls the driving of the CMOS sensor 509.

ネットワークＩ／Ｆ５１１は、通信ネットワーク１００を利用してデータ通信をするためのインターフェイスである。キーボード５１２は、文字、数値、各種指示などの入力のための複数のキーを備えた入力手段の一種である。ポインティングデバイス５１３は、各種指示の選択や実行、処理対象の選択、カーソルの移動などを行う入力手段の一種である。メディアＩ／Ｆ５１５は、フラッシュメモリ等の記録メディア５１４に対するデータの読出し又は書込み(記憶)を制御する。外部機器接続Ｉ／Ｆ５１６は、各種の外部機器を接続するためのインターフェイスである。この場合の外部機器は、例えば、ＵＳＢ(Universal Serial Bus)メモリ等である。音入出力Ｉ／Ｆ５１７は、ＣＰＵ５０１の制御にしたがってマイク５１８及びスピーカ５１９との間で音信号の入出力を処理する回路である。マイク５１８は、音を電気信号に変える内蔵型の回路であり、外部のスピーカ等から発する音声や音波を取得し電気信号を用いた情報を取得する。スピーカ５１９は、電気信号を物理振動に変えて音楽や音声などの音を生み出す内蔵型の回路である。バスライン５２０は、ＣＰＵ５０１等の各構成要素を電気的に接続するためのアドレスバスやデータバス等である。 The network I/F 511 is an interface for data communication using the communication network 100. The keyboard 512 is a type of input means having multiple keys for inputting characters, numbers, various instructions, etc. The pointing device 513 is a type of input means for selecting and executing various instructions, selecting a processing target, moving a cursor, etc. The media I/F 515 controls the reading or writing (storing) of data to a recording medium 514 such as a flash memory. The external device connection I/F 516 is an interface for connecting various external devices. In this case, the external device is, for example, a USB (Universal Serial Bus) memory, etc. The sound input/output I/F 517 is a circuit that processes the input and output of sound signals between the microphone 518 and the speaker 519 according to the control of the CPU 501. The microphone 518 is a built-in circuit that converts sound into an electrical signal, and obtains voice and sound waves emitted from an external speaker, etc., and obtains information using the electrical signal. The speaker 519 is a built-in circuit that converts electrical signals into physical vibrations to produce sounds such as music and voice. The bus line 520 is an address bus, data bus, etc., for electrically connecting each component such as the CPU 501.

また、音声記録管理装置５は、通信端末３に対してプッシュ通知(送信)によりデータ(情報)を通知(送信)してもよい。その場合、音声記録管理装置５は、例えば、プッシュ通知サーバの一例であるＦＣＭ(Firebase Cloud Messaging)を利用してプッシュ通知することで実現することが可能である。なお、音声記録管理装置５は、一般的に使用されるＰＣ(Personal Computer)であってもよい。音声記録管理装置５は、更に、ブラウザソフトウエア等のソフトウエアを動作させることが可能な通信装置又は通信端末が用いられてもよい。 The voice recording management device 5 may also notify (send) data (information) to the communication terminal 3 by push notification (transmission). In this case, the voice recording management device 5 can realize this by using, for example, FCM (Firebase Cloud Messaging), which is an example of a push notification server, to send a push notification. The voice recording management device 5 may be a commonly used PC (Personal Computer). The voice recording management device 5 may further be a communication device or communication terminal capable of running software such as browser software.

更に、上記プログラムは、インストール可能な形式又は実行可能な形式のファイルで、コンピュータで読取り可能な記録媒体に記録、又はネットワークを介してダウンロードを行い流通させるようにしてもよい。記録媒体の例として、ＣＤ－Ｒ(Compact Disc Recordable)、ＤＶＤ(Digital Versatile Disk)、Ｂｌｕ-ｒａｙＤｉｓｃ、ＳＤカード、ＵＳＢメモリ等が挙げられる。また、記録媒体は、プログラム製品(Program Product)として、国内又は国外へ提供されることができる。例えば、音声記録管理装置５は、本発明に係るプログラムが実行されることで、本発明に係る音声記録管理方法を実現する。 Furthermore, the above program may be recorded in a computer-readable recording medium as an installable or executable file, or may be distributed by downloading via a network. Examples of recording media include CD-Rs (Compact Disc Recordable), DVDs (Digital Versatile Disks), Blu-ray Discs, SD cards, and USB memories. The recording media may also be provided domestically or internationally as a program product. For example, the voice recording management device 5 realizes the voice recording management method according to the present invention by executing the program according to the present invention.

〔通信システムの機能構成〕 [Functional configuration of communication system]
次に、図４乃至図９を用いて、本実施形態の機能構成について説明する。図４は、通信システムの機能構成の一例を示す図である。 Next, the functional configuration of this embodiment will be described with reference to Figs. 4 to 9. Fig. 4 is a diagram showing an example of the functional configuration of the communication system.

＜通信端末の機能構成＞ <Functional configuration of communication terminal>
図４に示されているように、通信端末３は、送受信部３１、操作受付部３２、音・画像取得部３３、表示制御部３４、音声再生部３６、アプリ起動部３８及び記憶読出部３９を有する。これら各機能部は、図２に示された各ハードウエア資源のいずれかが、ＲＯＭ３０２及びＥＥＰＲＯＭ３０４のうち少なくとも一つからＲＡＭ３０３に展開された通信端末３用のプログラムに従ったＣＰＵ３０１からの命令により動作することで実現される機能又は手段である。また、通信端末３は、図２に示されているＲＯＭ３０２及びＥＥＰＲＯＭ３０４のうち少なくとも一方により構築される記憶部３０００を有している。更に、記憶部３０００には、音声記録管理装置５と通信ネットワーク１００を介して通信を行うための通信プログラム(通信アプリ)と、音声情報に基づいて議事録等を生成するためのブラウザアプリ、記録管理アプリ等が記憶されている。 As shown in Fig. 4, the communication terminal 3 has a transmission/reception unit 31, an operation reception unit 32, a sound/image acquisition unit 33, a display control unit 34, an audio playback unit 36, an application launch unit 38, and a storage/reading unit 39. Each of these functional units is a function or means realized when any of the hardware resources shown in Fig. 2 operates on instructions from the CPU 301 in accordance with a program for the communication terminal 3 loaded into the RAM 303 from at least one of the ROM 302 and the EEPROM 304. The communication terminal 3 also has a storage unit 3000 constructed by at least one of the ROM 302 and the EEPROM 304 shown in Fig. 2. The storage unit 3000 stores a communication program (communication application) for communicating with the voice recording management device 5 via the communication network 100, as well as a browser application, a record management application, and the like for generating minutes and similar records based on voice information.

<<通信端末の各機能構成>> <<Functional configuration of communication terminal>>
次に、通信端末３の各機能構成について詳細に説明する。図４に示されている通信端末３の送受信部３１は、主に、ネットワークＩ／Ｆ３１２及び近距離通信回路３２０に対するＣＰＵ３０１の処理によって実現され、通信ネットワーク１００を介して音声記録管理装置５との間で各種データ(又は情報)の送受信を行う。本実施形態において、送受信部３１は、送信手段及び受信手段のうち少なくとも一方の手段の一例として機能する。 Next, each functional unit of the communication terminal 3 will be described in detail. The transmission/reception unit 31 of the communication terminal 3 shown in Fig. 4 is realized mainly by the processing of the CPU 301 on the network I/F 312 and the short-range communication circuit 320, and transmits and receives various data (or information) to and from the voice recording management device 5 via the communication network 100. In this embodiment, the transmission/reception unit 31 functions as an example of at least one of a transmitting means and a receiving means.

操作受付部３２は、主に、タッチパネル３２１が受け付けた各種操作により生成された信号をＣＰＵ３０１が処理することによって実現される。なお、操作受付部３２は、タッチパネル３２１に代えて、キーボード、ポインティングデバイス等の入力手段が用いられてもよい。本実施形態において、操作受付部３２は、受付手段の一例として機能する。 The operation reception unit 32 is realized mainly by the CPU 301 processing signals generated by various operations received by the touch panel 321. Note that the operation reception unit 32 may use input means such as a keyboard or a pointing device instead of the touch panel 321. In this embodiment, the operation reception unit 32 functions as an example of a reception means.

音・画像取得部３３は、主に、マイク３１５、音入出力Ｉ／Ｆ３１７、ＣＭＯＳセンサ３１３及び撮像素子Ｉ／Ｆ３１４に対するＣＰＵ３０１の処理によって実現され、通信端末３を利用する利用者が発話した発話音声等に係る音声(音)を集音して音声情報(音声データ)又は音情報(音データ)を取得する。音・画像取得部３３は、更に、利用者の顔などの画像に係る画像を撮影して画像情報(画像データ)を取得する。また、音・画像取得部３３は、ディスプレイ３１８に表示されている画面データを所定の時間間隔で取得する。なお、音声情報には、人間が発話した発話音声を示す発話音声情報、ＡＩを搭載した機械、人型ロボット等が生成した人工的な音声である人工音声情報が含まれる。本実施形態において、音・画像取得部３３は、取得手段の一例として機能する。 The sound/image acquisition unit 33 is realized mainly by the processing of the CPU 301 on the microphone 315, the sound input/output I/F 317, the CMOS sensor 313, and the image sensor I/F 314. It collects the voice (sound) of utterances by the user of the communication terminal 3 and acquires voice information (voice data) or sound information (sound data). The unit also captures images, such as of the user's face, to acquire image information (image data), and acquires the screen data displayed on the display 318 at predetermined time intervals. The voice information includes speech information representing speech uttered by a human and artificial voice information, that is, artificial speech generated by an AI-equipped machine, a humanoid robot, or the like. In this embodiment, the sound/image acquisition unit 33 functions as an example of an acquisition means.

表示制御部３４は、主に、ディスプレイ３１８に対するＣＰＵ３０１の処理によって実現され、通信端末３における各種画面及び情報(データ)の表示制御を行う。また、表示制御部３４は、例えば、ブラウザを用いて、ＨＴＭＬ等により作成された表示画面を、ディスプレイ３１８に表示させる。また、表示制御部３４は、音声記録管理装置５が送信した編集後画面データに係る編集後画面をディスプレイ３１８に表示する。また、表示制御部３４は、テキスト非表示要求を生成するために操作されるテキスト非表示操作部を、ディスプレイ３１８に表示された所定のテキストの近傍に表示する。また、表示制御部３４は、画像非表示要求を生成するために操作される画像非表示操作部を、ディスプレイ３１８に表示された所定の画像の近傍に表示する。ここで、テキスト非表示要求は、所定のテキストを非表示とするために音声記録管理装置５に対して送信される要求である。テキスト非表示操作部は、後述する「非公開」ボタン(アイコン)３５４２、「削除」ボタン(アイコン)３５４４を含む。また、画像非表示要求は、所定の画像を非表示とするために音声記録管理装置５に対して送信される要求である。画像非表示操作部は、後述する「非公開」ボタン(アイコン)３５４５、「削除」ボタン(アイコン)３５４６を含む。さらに、テキスト非表示要求と画像非表示要求によって非表示処理される第１の非表示テキストデータ、第２の非表示テキストデータ、第１の非表示画像データ、及び第２の非表示画像データはそれぞれ、所定の事業の業績情報、売上情報、利益情報及び個人情報を含む秘匿データである。 The display control unit 34 is mainly realized by the processing of the CPU 301 on the display 318, and controls the display of various screens and information (data) in the communication terminal 3. The display control unit 34 also uses, for example, a browser to display a display screen created by HTML or the like on the display 318. The display control unit 34 also displays an edited screen related to the edited screen data transmitted by the voice recording management device 5 on the display 318. The display control unit 34 also displays a text hide operation unit operated to generate a text hide request near a specified text displayed on the display 318. The display control unit 34 also displays an image hide operation unit operated to generate an image hide request near a specified image displayed on the display 318. Here, the text hide request is a request sent to the voice recording management device 5 to hide the specified text. The text hide operation unit includes a "private" button (icon) 3542 and a "delete" button (icon) 3544, which will be described later. Furthermore, the image non-display request is a request sent to the audio recording management device 5 to hide a specified image. 
The image non-display operation unit includes a "private" button (icon) 3545 and a "delete" button (icon) 3546, which will be described later. Furthermore, the first non-displayed text data, second non-displayed text data, first non-displayed image data, and second non-displayed image data that are hidden by the text non-display request and image non-display request are each confidential data that includes performance information, sales information, profit information, and personal information of the specified business.

また、表示制御部３４は、編集対象となる画面の画面データが取得部５２によって取得された取得時刻に跨って音声データに基づいて得られた特定のテキストデータが存在する状態のとき、以下の処理を行う。具体的には、表示制御部３４は、取得時刻に取得された所定の画像に対する画像非表示要求が受信された場合に、第２の非表示テキストデータに加えて、特定のテキストデータを非表示処理した第３の非表示画面データに係る第３の非表示画面をディスプレイ３１８に表示する。表示制御部３４は、更に、所定の画像に対する画像非表示要求が受信された後、所定の画像を再度表示する画像表示要求が受信された場合に、特定のテキストデータに係る特定のテキストが非表示状態に維持された画面をディスプレイ３１８に表示する。 The display control unit 34 also performs the following processing when there is specific text data, obtained from the audio data, that spans the acquisition time at which the screen data of the screen to be edited was acquired by the acquisition unit 52. Specifically, when an image non-display request for the predetermined image acquired at that acquisition time is received, the display control unit 34 displays on the display 318 a third non-display screen based on third non-display screen data in which the specific text data is hidden in addition to the second non-display text data. Furthermore, when an image display request to redisplay the predetermined image is received after the image non-display request, the display control unit 34 displays on the display 318 a screen in which the specific text of the specific text data remains hidden.
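The hide-and-redisplay behavior described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `TextSegment`, `ScreenImage`, `hide_image`, and `show_image`, the use of seconds as the time unit, and the rule that a text "spans" the acquisition time when its start and end times enclose it are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TextSegment:
    text: str
    start: float                    # utterance start, seconds into the recording
    end: float                      # utterance end
    image_id: Optional[str] = None  # image this text is associated with, if any
    hidden: bool = False


@dataclass
class ScreenImage:
    image_id: str
    acquired_at: float              # acquisition time of the screen data
    hidden: bool = False


def hide_image(image: ScreenImage, segments: List[TextSegment]) -> None:
    """Hide the image, its associated text, and any text spanning its acquisition time."""
    image.hidden = True
    for seg in segments:
        associated = seg.image_id == image.image_id
        spanning = seg.start <= image.acquired_at <= seg.end
        if associated or spanning:
            seg.hidden = True


def show_image(image: ScreenImage) -> None:
    """Redisplay the image only; text hidden by hide_image stays hidden."""
    image.hidden = False
```

Keeping `show_image` from touching the text segments mirrors the description: once text has been hidden together with an image, redisplaying the image does not by itself reveal that text again.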

表示制御部３４は、更に、ディスプレイ３１８に表示された複数のテキストを含む一のテキストグループとして選択された場合に、テキストグループを表す要約を入力させるための要約入力操作部を一のテキストグループの近傍に表示する。ここで、要約入力操作部は、後述する要約入力ダイアログ３１８１を含む。表示制御部３４は、更に、要約入力操作部とあわせて、テキスト非表示操作部を一のテキストグループの近傍に表示する。ここで、テキスト非表示操作部は、後述する「非公開」ボタン(アイコン)３５４２又は「削除」ボタン(アイコン)３５４４を含む。本実施形態において、表示制御部３４は、表示制御手段の一例として機能する。 When a text group including a plurality of texts displayed on the display 318 is selected, the display control unit 34 further displays a summary input operation unit adjacent to the text group for inputting a summary representing the text group. Here, the summary input operation unit includes a summary input dialog 3181 described below. The display control unit 34 further displays a text hide operation unit adjacent to the text group together with the summary input operation unit. Here, the text hide operation unit includes a "Private" button (icon) 3542 or a "Delete" button (icon) 3544 described below. In this embodiment, the display control unit 34 functions as an example of a display control means.

音声再生部３６は、主に、スピーカ３１６及び音入出力Ｉ／Ｆ３１７に対するＣＰＵ３０１の処理によって実現され、通信端末３を利用する利用者に対して音声情報(音声データ)又は音情報(音データ)を再生する。また、音声再生部３６は、音声記録管理装置５が送信した、所定のテキストに対する非公開、公開、及び削除等の編集を行った編集後の音声データに係る編集後音声を再生する。また、音声再生部３６は、第２の非表示テキストデータと特定のテキストデータに係る各テキストで示される音声データが無音化処理された音声を再生する。音声再生部３６は、更に、特定のテキストで示される特定の音声データに係る特定の音声が無音化状態に維持された音声を再生する。本実施形態において、音声再生部３６は、音声再生手段の一例として機能する。 The audio playback unit 36 is mainly realized by the processing of the CPU 301 on the speaker 316 and the audio input/output I/F 317, and plays audio information (audio data) or sound information (sound data) for the user using the communication terminal 3. The audio playback unit 36 also plays edited audio related to edited audio data sent by the audio recording management device 5, in which editing such as making a specific text private, making it public, or deleting it has been performed. The audio playback unit 36 also plays audio in which audio data indicated by each text related to the second non-display text data and the specific text data has been muted. The audio playback unit 36 further plays audio in which a specific audio related to specific audio data indicated by specific text has been maintained in a muted state. In this embodiment, the audio playback unit 36 functions as an example of an audio playback means.
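The muting of audio that corresponds to hidden text can be sketched as follows. This is a hedged illustration only: the function name `mute_ranges`, the representation of audio as a flat list of samples, and the use of (start, end) ranges in seconds are assumptions, and the source does not specify how the muting is actually performed.

```python
def mute_ranges(samples, sample_rate, hidden_ranges):
    """Return a copy of samples with every hidden (start, end) range, in seconds, zeroed."""
    muted = list(samples)
    for start, end in hidden_ranges:
        first = max(0, int(start * sample_rate))
        last = min(len(muted), int(end * sample_rate))
        for i in range(first, last):
            muted[i] = 0  # silence replaces the original sample
    return muted
```

Working on a copy leaves the stored recording untouched, so a later request to make the text public again could in principle restore the original audio.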

アプリ起動部３８は、主に、ＣＰＵ３０１の処理によって実現され、音声記録管理装置５との間で通信ネットワーク１００を介して各種アプリの起動を行う。また、アプリ起動部３８は、音声記録管理装置５で管理されている各種テキスト情報を編集、管理するための記録管理アプリ及びブラウザアプリを、ＲＡＭ３０３の所定の作業領域上で動作させる。本実施形態において、アプリ起動部３８は、起動手段の一例として機能する。 The application launch unit 38 is realized mainly by the processing of the CPU 301 and launches various applications in conjunction with the voice recording management device 5 via the communication network 100. The application launch unit 38 also runs, in a predetermined work area of the RAM 303, a record management application and a browser application for editing and managing the various text information managed by the voice recording management device 5. In this embodiment, the application launch unit 38 functions as an example of a launching means.

記憶読出部３９は、主に、図２に示されているＲＯＭ３０２及びＥＥＰＲＯＭ３０４のうち少なくとも一つに対するＣＰＵ３０１の処理によって実現され、記憶部３０００に各種データ(又は情報)を記憶したり、記憶部３０００から各種データ(又は情報)を読み出したりする。本実施形態において、記憶読出部３９は、記憶読出手段の一例として機能する。 The memory/read unit 39 is realized mainly by the processing of the CPU 301 on at least one of the ROM 302 and the EEPROM 304 shown in FIG. 2, and stores various data (or information) in the memory unit 3000 and reads various data (or information) from the memory unit 3000. In this embodiment, the memory/read unit 39 functions as an example of a memory/read means.

＜音声記録管理装置の機能構成＞ <Functional configuration of the voice recording management device>
図４に示されているように、音声記録管理装置５は、送受信部５１、取得部５２、算出特定部５３、表示制御部５４、判断部５５、認証部５６、生成・処理部５７、設定登録部５８及び記憶読出部５９を有する。これら各機能部は、図３に示された各ハードウエア資源のいずれかが、ＲＯＭ５０２及びＨＤ５０５のうち少なくとも一つからＲＡＭ５０３に展開された音声記録管理装置５用のプログラムに従ったＣＰＵ５０１からの命令により動作することで実現される機能又は手段である。また、音声記録管理装置５は、図３に示されているＲＯＭ５０２及びＨＤ５０５のうち少なくとも一方により構築される記憶部５０００を有している。更に、記憶部５０００には、通信端末３及び音声認識サーバ７と通信ネットワーク１００を介してそれぞれ通信を行うための通信プログラム(通信アプリ)と、通信端末３との間で実行されるブラウザアプリ、記録管理アプリ等が記憶されている。 As shown in Fig. 4, the voice recording management device 5 has a transmission/reception unit 51, an acquisition unit 52, a calculation/identification unit 53, a display control unit 54, a judgment unit 55, an authentication unit 56, a generation/processing unit 57, a setting registration unit 58, and a storage/reading unit 59. Each of these functional units is a function or means realized when any of the hardware resources shown in Fig. 3 operates on instructions from the CPU 501 in accordance with a program for the voice recording management device 5 loaded into the RAM 503 from at least one of the ROM 502 and the HD 505. The voice recording management device 5 also has a storage unit 5000 constructed by at least one of the ROM 502 and the HD 505 shown in Fig. 3. The storage unit 5000 stores a communication program (communication application) for communicating with each of the communication terminal 3 and the voice recognition server 7 via the communication network 100, as well as a browser application, a record management application, and the like that are executed with the communication terminal 3.

●ログイン管理テーブル● ●Login management table●
図５は、ログイン管理テーブルの一例を示す概念図である。記憶部５０００には、図５に示されているようなログイン管理テーブルによって構成されたログイン管理ＤＢ５００１が構築されている。ログイン管理テーブルでは、セッションＩＤごとに、端末識別情報、参加した通信端末のＩＰアドレス、作成者識別情報(ユーザＩＤ)、パスワード及び利用者名が関連付けられて記憶、管理されている。これらのうち、セッションＩＤは、音声記録管理装置５と一以上の通信端末３との間で行われる通信で確立されるセッションを識別するための情報で、例えば、SE0001, SE0002等で与えられる。 Fig. 5 is a conceptual diagram showing an example of a login management table. A login management DB 5001 configured by a login management table such as that shown in Fig. 5 is constructed in the storage unit 5000. In the login management table, terminal identification information, the IP address of each participating communication terminal, creator identification information (user ID), a password, and a user name are stored and managed in association with each session ID. Of these, the session ID is information for identifying a session established in communication between the voice recording management device 5 and one or more communication terminals 3, and is given as, for example, SE0001 or SE0002.

端末識別情報は、通信端末３を識別するための情報であり、例えば、TM0001, T0002等で与えられる。参加した通信端末のＩＰアドレスは、所定のイベントに参加した通信端末のＩＰアドレスを示し、IPv4, IPv6等のバージョンに対応させて与えられる固有の情報である。本実施形態では、例えば、1.2.1.3, 1.2.2.4などの情報で与えられる。作成者識別情報は、利用者を識別するための情報であり、本実施形態では、利用者のユーザＩＤとして、例えば、「taroh.r@ricoh.ex.com」等の電子メールアドレスが与えられる。パスワードは、通信システム１を利用する際のログイン時の作成者識別情報と関連付けられた識別情報であり、初回の起動時(ログイン時)等に利用者が設定する。なお、パスワードは、任意の文字列、数字、記号をランダムに含む複数文字(桁)の情報である。利用者名は、作成者識別情報で示される利用者の氏名を表し、例えば、「理光太郎」、「馬込花子」、「海老名二郎」等で与えられる。 The terminal identification information is information for identifying the communication terminal 3, and is given as, for example, TM0001 or T0002. The IP address of a participating communication terminal indicates the IP address of a communication terminal that participated in a given event, and is unique information assigned according to the IP version, such as IPv4 or IPv6. In this embodiment, it is given as, for example, 1.2.1.3 or 1.2.2.4. The creator identification information is information for identifying the user; in this embodiment, an email address such as "taroh.r@ricoh.ex.com" is given as the user ID. The password is identification information associated with the creator identification information at login when the communication system 1 is used, and is set by the user at, for example, the first startup (login). The password is a multi-character (multi-digit) value containing an arbitrary mix of letters, numbers, and symbols. The user name represents the name of the user indicated by the creator identification information, and is given as, for example, "Taro Riko", "Hanako Magome", or "Jiro Ebina".
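As a rough illustration of the login management table, the sketch below models one row per session ID in Python. The class name `LoginEntry`, the in-memory dictionary, and the helper functions are hypothetical; the source only lists the columns and does not say how the table is stored. The sample values are taken from the examples given above, except the password, which is invented for the sketch. A real system would normally store a password hash rather than the password itself.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class LoginEntry:
    session_id: str   # e.g. "SE0001"
    terminal_id: str  # e.g. "TM0001"
    ip_address: str   # IP address of the participating terminal
    user_id: str      # creator identification information (email address)
    password: str     # illustration only; store a hash in practice
    user_name: str


# Hypothetical in-memory stand-in for the login management DB, keyed by session ID.
login_table: Dict[str, LoginEntry] = {}


def register(entry: LoginEntry) -> None:
    """Store or overwrite the row for the entry's session ID."""
    login_table[entry.session_id] = entry


def terminals_for_user(user_id: str) -> List[str]:
    """Return the terminal IDs of all sessions belonging to the given user."""
    return sorted(e.terminal_id for e in login_table.values() if e.user_id == user_id)
```

Keying the table by session ID follows the description that every other item is managed "for each session ID"; lookups by user or terminal then become simple scans or secondary indexes.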

●記録書誌情報管理テーブル● ●Record bibliographic information management table●
図６は、記録書誌情報管理テーブルの一例を示す概念図である。記憶部５０００には、図６に示されているような記録書誌情報管理テーブルによって構成された記録書誌情報管理ＤＢ５００２が構築されている。記録書誌情報管理テーブルでは、記録識別情報をタブとして、それぞれのタブで分けられた記録名称、開始日時、終了日時、音声データパス、作成者識別情報(ユーザＩＤ)、イベントＵＲＬ、パスコード及び対応付け処理の各項目が関連付けられて記憶、管理されている。なお、タブとしての記録識別情報は、例えば、「R5006」,「R5007」等で与えられる。 Fig. 6 is a conceptual diagram showing an example of a record bibliographic information management table. A record bibliographic information management DB 5002 configured by a record bibliographic information management table such as that shown in Fig. 6 is constructed in the storage unit 5000. In the record bibliographic information management table, the record identification information serves as a tab, and for each tab the following items are stored and managed in association with one another: record name, start date and time, end date and time, audio data path, creator identification information (user ID), event URL, passcode, and association process. The record identification information used as a tab is given as, for example, "R5006" or "R5007".

これらのうち、記録名称は、所定のイベントで記録された記録内容の名称であり、例えば、ヘルスケア事業業績報告会などの名称が与えられる。開始日時及び終了日時は、所定のイベントが開始された日時及びイベントが終了された日時が与えられ、例えば、「2021/03/31 11:00:00」、「2021/03/31 12:00:00」などの情報である。音声データパスは、所定のイベントで記録された音声記録データ(全体データ)が保管、管理されている場所を示すもので、例えば、「…/00005006/record.mp3」のようにファイル名を含めたパス情報として与えられる。なお、音声データパスは、通信システム１に配置された専用の音声サーバ装置を表すＵＲＬ情報であってもよい。 Of these, the recording name is the name of the recorded content recorded at a specific event, and may be given a name such as "Healthcare Business Performance Reporting Meeting." The start date and time and end date and time are given as the date and time when the specific event started and ended, and are, for example, information such as "2021/03/31 11:00:00" and "2021/03/31 12:00:00." The audio data path indicates the location where the audio recording data (all data) recorded at a specific event is stored and managed, and is given as path information including a file name, such as ".../00005006/record.mp3." The audio data path may be URL information representing a dedicated audio server device located in the communication system 1.

作成者識別情報(ユーザＩＤ)は、会議等のイベントにおける議事録等を一例とする記録閲覧を作成する作成者の識別情報であり、例えば、「taro.r@ricoh.ex.com」等で与えられる。イベントＵＲＬは、会議等のイベントにおける記録閲覧を編集するための場所を示すもので、利用者は、このイベントＵＲＬにアクセスすることで、後述する「記録閲覧編集画面」にアクセスすることができる。なお、イベントＵＲＬは、通信システム１に配置された専用の画像サーバ装置を表すＵＲＬ情報であってもよい。パスコードは、議事録等の記録閲覧を作成(編集)する作成者以外の他の利用者がその議事録等の記録閲覧を閲覧するためのコード情報である。具体的には、作成者がイベントＵＲＬとパスコードをコピーし、電子メール、チャット等を用いて他の利用者にそのイベントＵＲＬとパスコードを送信する。他の利用者は、そのイベントＵＲＬとパスコードを受信して、イベントＵＲＬとあわせてパスコードを入力する。これにより、他の利用者は、作成者が作成した記録閲覧を閲覧することが可能になる。 The creator identification information (user ID) is the identification information of the creator who creates the record view, such as the minutes of an event such as a conference, and is given, for example, as "taro.r@ricoh.ex.com". The event URL indicates the location for editing the record view of an event such as a conference, and the user can access the "record view editing screen" described later by accessing this event URL. The event URL may be URL information representing a dedicated image server device arranged in the communication system 1. The passcode is code information for other users other than the creator who creates (edits) the record view such as the minutes to view the record view of the minutes. Specifically, the creator copies the event URL and passcode, and sends the event URL and passcode to other users using e-mail, chat, etc. The other users receive the event URL and passcode, and enter the passcode along with the event URL. This allows other users to view the record view created by the creator.

対応付け処理は、会議等のイベントにおいて記録された音声情報(音声データ)に基づいて記録閲覧を生成する際に、音声データに基づいて得られた所定のテキストに対応する画像を表す画像識別情報の対応付け処理が行われたかを示す項目である。対応付け処理が「未処理」の場合は、上述した対応付け処理がまだ行われていない状態であり、対応付け処理が「処理済」の場合は、上述した対応付け処理が行われた状態であることを示す。なお、画像識別情報については、次のテキスト情報管理テーブルにて詳細に説明する。 The association process item indicates whether, when a record view is generated from the audio information (audio data) recorded at an event such as a conference, image identification information representing the image that corresponds to a given text obtained from the audio data has been associated with that text. A value of "unprocessed" means that this association process has not yet been performed, and a value of "processed" means that it has been performed. Image identification information is explained in detail with the text information management table below.

本実施形態に係る記録書誌情報管理テーブルにおいて管理される項目のうち、終了日時と音声データパスの項目は、イベント終了時に利用者又は編集者によって編集、追加される項目である。 Of the items managed in the record bibliographic information management table in this embodiment, the end date and time and audio data path items are items that are edited or added by the user or editor when the event ends.

●テキスト情報管理テーブル● ●Text information management table●
図７は、テキスト情報管理テーブルの一例を示す概念図である。記憶部５０００には、図７に示されているようなテキスト情報管理テーブルによって構成されたテキスト情報管理ＤＢ５００３が構築されている。テキスト情報管理テーブルでは、記録識別情報をタブとして、それぞれのタブで分けられたテキスト識別情報、開始時刻、終了時刻、テキスト、公開フラグ、削除フラグ及び画像識別情報が関連付けられて記憶、管理されている。なお、タブとしての記録識別情報は、例えば、「R5006」,「R5007」等で与えられる。 Fig. 7 is a conceptual diagram showing an example of a text information management table. A text information management DB 5003 configured by a text information management table as shown in Fig. 7 is constructed in the storage unit 5000. In the text information management table, the record identification information is treated as a tab, and the text identification information, start time, end time, text, publication flag, deletion flag, and image identification information separated by each tab are stored and managed in association with each other. The record identification information as a tab is given, for example, as "R5006", "R5007", etc.

これらのうち、テキスト識別情報は、所定の発話内容や発話文を一つのテキスト(又はテキスト情報)としたときの一単位として識別するための情報で、例えば、「TX0005」、「TX0006」、・・・「TX0009」等で与えられる。開始時刻及び終了時刻は、所定の発話が開始された時刻と終了された時刻を分と秒で管理する情報である。例えば、所定のイベントが開始された日時が、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている開始日時「2021/03/31 11:00:00」であった場合、開始時刻は、開始日時から経過した時間として管理される。つまり、開始時刻が「1分53秒」であれば、テキスト識別情報が「TX0005」で管理される発話が開始された日時は、「2021/03/31 11:01:53」であることを示している。終了時刻は、開始時刻と同様に、開始日時から経過した時間として管理される。つまり、終了時刻が「1分58秒」であれば、テキスト識別情報が「TX0005」で管理される発話が終了された日時は、「2021/03/31 11:01:58」であることを示している。したがって、この場合、テキスト識別情報が「TX0005」で管理される発話に要した時間は、58秒－53秒＝5秒間となる。 Of these, the text identification information identifies a given utterance or spoken sentence as one unit of text (or text information), and is given, for example, as "TX0005", "TX0006", ... "TX0009". The start time and end time record, in minutes and seconds, when a given utterance started and ended. Both are managed as the time elapsed from the start date and time of the event. For example, suppose the event started at the start date and time "2021/03/31 11:00:00" managed in the record bibliographic information management DB 5002 (see FIG. 6). If the start time is "1 minute 53 seconds", the utterance managed under the text identification information "TX0005" started at "2021/03/31 11:01:53". Likewise, if the end time is "1 minute 58 seconds", that utterance ended at "2021/03/31 11:01:58". In this case, the utterance managed under the text identification information "TX0005" therefore took 58 seconds - 53 seconds = 5 seconds.
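
The conversion from an elapsed start or end time to an absolute date and time can be sketched in Python as follows. This is only an illustration using the example values above; the function name `to_absolute` and the variable names are hypothetical and do not appear in the embodiment.

```python
from datetime import datetime, timedelta

def to_absolute(event_start: str, minutes: int, seconds: int) -> datetime:
    """Convert an elapsed offset (minutes, seconds) into an absolute date and time."""
    start = datetime.strptime(event_start, "%Y/%m/%d %H:%M:%S")
    return start + timedelta(minutes=minutes, seconds=seconds)

# Start date and time of the event, as managed in the record bibliographic table
event_start = "2021/03/31 11:00:00"

# Start and end times of the utterance "TX0005", stored as elapsed time
utterance_start = to_absolute(event_start, 1, 53)
utterance_end = to_absolute(event_start, 1, 58)

# Time required for the utterance, in seconds
duration = (utterance_end - utterance_start).total_seconds()
```

Storing only the elapsed time keeps each text row independent of the calendar date, which is held once per record in the record bibliographic information management table.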

テキストは、上述したように、開始時刻と終了時刻との間で利用者等が発話した発話内容をテキスト情報に置き換えたものである。テキストは、例えば、「本日の会議は録音させていただきます。」、「本日の議題はヘルスケア事業の業績です。」といった内容として管理される。 As described above, the text is the content of the speech spoken by the user between the start time and the end time replaced with text information. For example, the text is managed as content such as "Today's meeting will be recorded," or "Today's agenda is the performance of the healthcare business."

公開フラグは、テキスト情報管理テーブルで管理される所定のテキストが記録閲覧編集画面に表示(以下、公開ともいう)されているか否かを表す状態フラグである。所定のテキストが記録閲覧編集画面に公開されている場合に、公開フラグは「True」として管理される。他方、所定のテキストが記録閲覧編集画面に非表示又は削除(以下、非公開ともいう)されている場合に、公開フラグは「False」として管理される。なお、公開フラグの初期設定は、例えば、公開(表示)することを意味する「True」であってもよい。 The public flag is a status flag that indicates whether or not a specific text managed in the text information management table is displayed (hereinafter also referred to as public) on the record viewing and editing screen. When a specific text is publicly displayed on the record viewing and editing screen, the public flag is managed as "True." On the other hand, when a specific text is hidden or deleted (hereinafter also referred to as private) on the record viewing and editing screen, the public flag is managed as "False." Note that the initial setting of the public flag may be, for example, "True," which means that the text is public (displayed).

削除フラグは、テキスト情報管理テーブルで管理される所定のテキストが記録閲覧編集画面から削除されたか否かを表す状態フラグである。所定のテキストが記録閲覧編集画面から削除された場合に、削除フラグは「True」として管理される。他方、所定のテキストが記録閲覧編集画面から削除されていない場合は、「削除フラグは」「False」として管理される。なお、削除フラグの初期設定は、例えば、削除しないことを意味する「False」であってもよい。 The deletion flag is a status flag that indicates whether or not a specific text managed in the text information management table has been deleted from the record viewing/editing screen. If the specific text has been deleted from the record viewing/editing screen, the deletion flag is managed as "True." On the other hand, if the specific text has not been deleted from the record viewing/editing screen, the deletion flag is managed as "False." The initial setting of the deletion flag may be, for example, "False," which means that the text is not deleted.

画像識別情報は、記録閲覧編集画面に表示される所定のキャプチャ画像を識別する識別情報であり、例えば、「IM0004」、「IM0005」等で与えられる。 The image identification information is identification information that identifies a specific capture image displayed on the record viewing and editing screen, and is given, for example, "IM0004", "IM0005", etc.
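
The relationship between the public flag and the deletion flag described above can be sketched as follows. This is a minimal illustration, not the embodiment's implementation: the class `TextEntry`, the helper functions, and the pairing of sample texts with image identifiers are assumptions made for the example.

```python
from dataclasses import dataclass

@dataclass
class TextEntry:
    text_id: str           # text identification information, e.g. "TX0005"
    text: str
    image_id: str          # image identification information, e.g. "IM0004"
    public: bool = True    # public flag; the initial setting may be True
    deleted: bool = False  # deletion flag; the initial setting may be False

def hide(entry: TextEntry) -> None:
    """Make the text private without deleting it."""
    entry.public = False

def delete(entry: TextEntry) -> None:
    """Delete the text; a deleted text is also no longer public."""
    entry.public = False
    entry.deleted = True

def displayed_ids(entries):
    """Return the identifiers of texts still shown on the screen."""
    return [e.text_id for e in entries if e.public and not e.deleted]

# Hypothetical sample rows
entries = [
    TextEntry("TX0005", "Today's meeting will be recorded.", "IM0004"),
    TextEntry("TX0006", "Today's agenda is the performance of the healthcare business.", "IM0004"),
    TextEntry("TX0007", "(example text)", "IM0005"),
]
hide(entries[1])
delete(entries[2])
```

After these calls only "TX0005" remains displayed, while "TX0006" is kept in the table with its public flag set to False.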

●キャプチャ画像管理テーブル● ●Capture image management table●
図８Ａは、キャプチャ画像管理テーブルの一例を示す概念図である。記憶部５０００には、図８Ａに示されているようなキャプチャ画像管理テーブルによって構成されたキャプチャ画像管理ＤＢ５００４が構築されている。本実施形態において、通信端末３の表示制御部３４は、所定のイベントの実行中に表示、編集される閲覧編集画面を、所定の時間間隔（例えば30秒間隔）でキャプチャする機能を有する。キャプチャ画像管理テーブルは、その画面キャプチャ機能に係る情報を記憶、管理するためのテーブルで、キャプチャ画像テーブルでは、記録識別情報をタブとして、それぞれのタブで分けられた画像識別情報、取得時刻、公開フラグ、削除フラグ及び画像データパスが関連付けられて記憶、管理されている。 Fig. 8A is a conceptual diagram showing an example of a capture image management table. A capture image management DB 5004 configured by the capture image management table shown in Fig. 8A is constructed in the storage unit 5000. In this embodiment, the display control unit 34 of the communication terminal 3 has a function of capturing a viewing and editing screen displayed and edited during execution of a predetermined event at a predetermined time interval (e.g., every 30 seconds). The capture image management table is a table for storing and managing information related to the screen capture function, and in the capture image table, record identification information is treated as a tab, and image identification information, acquisition time, public flag, deletion flag, and image data path, which are separated by each tab, are stored and managed in association with each other.

これらのうち、画像識別情報は、キャプチャされる画像を識別するための情報で、例えば、「IM0001」,・・・「IM0003」等で与えられる。取得時刻は、キャプチャ画像を取得した際の所定のイベントにおける経過時間を示すもので、例えば、「1分30秒」、「2分0秒」等で与えられる。 Of these, image identification information is information for identifying the captured image, and is given, for example, as "IM0001", ... "IM0003", etc. The acquisition time indicates the elapsed time of a specific event when the captured image was acquired, and is given, for example, as "1 minute 30 seconds", "2 minutes 0 seconds", etc.

公開フラグは、キャプチャ画像管理テーブルで管理される所定のキャプチャ画像が記録閲覧編集画面に表示(以下、公開ともいう)されているか否かを表す状態フラグである。所定のキャプチャ画像が記録閲覧編集画面に公開されている場合に、公開フラグは「True」として管理される。他方、所定のキャプチャ画像が記録閲覧編集画面に非表示又は削除(以下、非公開ともいう)されている場合に、公開フラグは「False」として管理される。なお、公開フラグの初期設定は、例えば、公開(表示)を意味する「True」であってもよい。 The public flag is a status flag that indicates whether or not a specific captured image managed in the captured image management table is displayed (hereinafter also referred to as public) on the record viewing and editing screen. When a specific captured image is publicly displayed on the record viewing and editing screen, the public flag is managed as "True." On the other hand, when a specific captured image is hidden or deleted (hereinafter also referred to as private) on the record viewing and editing screen, the public flag is managed as "False." Note that the initial setting of the public flag may be, for example, "True," which means public (displayed).

削除フラグは、キャプチャ画像管理テーブルで管理される所定のキャプチャ画像が記録閲覧編集画面から削除されたか否かを表す状態フラグである。所定のキャプチャ画像が記録閲覧編集画面から削除された場合に、削除フラグは「True」として管理される。他方、所定のキャプチャ画像が記録閲覧編集画面から削除されていない場合は、削除フラグは「False」として管理される。なお、削除フラグの初期設定は、例えば、非削除を意味する「False」であってもよい。 The deletion flag is a status flag that indicates whether or not a specific captured image managed in the captured image management table has been deleted from the record viewing/editing screen. If a specific captured image has been deleted from the record viewing/editing screen, the deletion flag is managed as "True." On the other hand, if a specific captured image has not been deleted from the record viewing/editing screen, the deletion flag is managed as "False." Note that the initial setting of the deletion flag may be, for example, "False," which means that the image has not been deleted.

画像データパスは、所定のイベントでキャプチャされたキャプチャ画像データが保管、管理されている場所を示すもので、例えば、「…/00005006/0003.jpg」のようなパス情報として与えられる。なお、画像データパスは、通信システム１に配置された専用の画像サーバ装置を表すＵＲＬ情報であってもよい。 The image data path indicates the location where the captured image data captured at a specific event is stored and managed, and is given as path information such as ".../00005006/0003.jpg". The image data path may be URL information indicating a dedicated image server device located in the communication system 1.

●キャプチャ画像取得間隔テーブル● ●Capture image acquisition interval table●
図８Ｂは、キャプチャ画像取得間隔テーブルの一例を示す概念図である。記憶部５０００には、図８Ｂに示されているようなキャプチャ画像取得間隔テーブルによって構成されたキャプチャ画像取得間隔ＤＢ５００５が構築されている。キャプチャ画像取得間隔テーブルでは、キャプチャ画像を取得するキャプチャ画像取得間隔が例えば、「30秒」、「60秒(1分)」等で設定され管理される。なお、キャプチャ画像を取得する所定の時間間隔は３０秒に限らず、１０秒毎、１分毎など、任意に設定されてよい。更に、キャプチャ画像取得間隔テーブルは、音声記録管理装置５での管理に代えて、実際に自らの画面のキャプチャ画像を取得する通信端末３側で管理されるようにしてもよい。 Fig. 8B is a conceptual diagram showing an example of a capture image acquisition interval table. A capture image acquisition interval DB 5005 configured by the capture image acquisition interval table shown in Fig. 8B is constructed in the storage unit 5000. In the capture image acquisition interval table, the interval at which capture images are acquired is set and managed as, for example, "30 seconds" or "60 seconds (1 minute)". The predetermined time interval for acquiring a capture image is not limited to 30 seconds and may be set arbitrarily, such as every 10 seconds or every minute. Furthermore, instead of being managed by the voice recording management device 5, the capture image acquisition interval table may be managed by the communication terminal 3, which actually acquires the capture images of its own screen.
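
As an illustration of how a configured acquisition interval relates to the acquisition times stored in the capture image management table, the following sketch lists the capture times for an event and looks up the most recent capture for a given utterance. Both function names are hypothetical. Starting the captures one interval after the event start, and pairing an utterance with the latest capture at or before its start time, are assumptions made for the example; this passage does not specify the rule used by the association process.

```python
from bisect import bisect_right

def capture_times(interval_s: int, duration_s: int) -> list[int]:
    """Elapsed times (in seconds) at which the screen would be captured."""
    return list(range(interval_s, duration_s + 1, interval_s))

def latest_capture_at_or_before(times: list[int], t_s: int):
    """Return the latest capture time not later than t_s, or None if there is none."""
    i = bisect_right(times, t_s)
    return times[i - 1] if i else None

# 30-second interval over the first two minutes of an event
times = capture_times(30, 120)

# Utterance starting at 1 minute 53 seconds (113 s)
nearest = latest_capture_at_or_before(times, 113)
```

With a 30-second interval, the utterance that starts at 1 minute 53 seconds falls after the capture taken at 1 minute 30 seconds.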

●非公開音声管理テーブル● ●Private audio management table●
図９は、非公開音声管理テーブルの一例を示す概念図である。記憶部５０００には、図９に示されているような非公開音声管理テーブルによって構成された非公開音声管理ＤＢ５００６が構築されている。非公開音声管理テーブルでは、記録識別情報をタブとして、それぞれのタブで分けられた開始時刻及び終了時刻が関連付けられて記憶、管理されている。この開始時刻及び終了時刻は、取得された音声記録ごとに管理される。 Fig. 9 is a conceptual diagram showing an example of a private audio management table. A private audio management DB 5006 configured by the private audio management table shown in Fig. 9 is constructed in the storage unit 5000. In the private audio management table, the record identification information is treated as a tab, and the start times and end times separated by each tab are stored and managed in association with each other. The start times and end times are managed for each acquired audio recording.
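
One way to picture how the start and end times in the private audio management table could be applied to a recording is shown below. This is a sketch under stated assumptions: the audio is modeled as a plain list of integer samples, and the function `mute_intervals` and its parameters are hypothetical rather than taken from the embodiment.

```python
def mute_intervals(samples, sample_rate, intervals):
    """Return a copy of samples with every (start_s, end_s) interval set to silence."""
    out = list(samples)
    for start_s, end_s in intervals:
        # Convert elapsed seconds into sample indices, clamped to the recording length
        lo = max(0, int(start_s * sample_rate))
        hi = min(len(out), int(end_s * sample_rate))
        for i in range(lo, hi):
            out[i] = 0
    return out

# Toy recording: 5 seconds at 2 samples per second
samples = [1] * 10

# One private interval from 1 s to 2 s
muted = mute_intervals(samples, 2, [(1, 2)])
```

Because the function works on a copy, the original recording stays intact, which matches the idea of making audio private without deleting it.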

<<音声記録管理装置の各機能構成>>
次に、音声記録管理装置５の各機能構成について詳細に説明する。図４に示されている音声記録管理装置５の送受信部５１は、主に、近距離通信Ｉ／Ｆ５０８及びネットワークＩ／Ｆ５１１に対するＣＰＵ５０１の処理によって実現され、通信ネットワーク１００を介して通信端末３との間でそれぞれ各種データ(又は情報)の送受信を行う。送受信部５１は、一以上の通信端末のそれぞれを利用する一以上の利用者が発話した発話音声に係る音声情報を受信する。また、送受信部５１は、取得された音声データに基づいて得られた所定のテキストを表す所定のテキストデータと、取得された画面データに係る画面に含まれる画像であり、所定のテキストに対応付けられた所定の画像を表す所定の画像データと、所定のテキストで示される所定の音声データとを、通信端末３（Ａ）（第１の通信端末の一例）を含む一以上の通信端末に送信する。 <<Functional configuration of the voice recording management device>>
Next, each functional configuration of the voice recording management device 5 will be described in detail. The transmission/reception unit 51 of the voice recording management device 5 shown in FIG. 4 is mainly realized by the processing of the CPU 501 for the short-range communication I/F 508 and the network I/F 511, and transmits and receives various data (or information) to and from the communication terminal 3 via the communication network 100. The transmission/reception unit 51 receives voice information related to speech voices uttered by one or more users who use one or more communication terminals. In addition, the transmission/reception unit 51 transmits predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included in a screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text to one or more communication terminals including the communication terminal 3 (A) (an example of a first communication terminal).

また、送受信部５１は、通信端末３（Ａ）（第１の通信端末）が送信した編集要求であり、所定のテキスト又は所定の画像に対する編集要求に応じて、所定のテキストデータを編集処理した編集後テキストデータと所定の画像データを編集処理した編集後画像データとを含む編集後画面データと、所定の音声データを編集処理した編集後音声データとを、通信端末３（Ｂ）（第２の通信端末の一例）に対して送信する。 The transmitter/receiver 51 also transmits edited screen data including edited text data obtained by editing the specified text data and edited image data obtained by editing the specified image data, and edited audio data obtained by editing the specified audio data, to the communication terminal 3(B) (an example of a second communication terminal) in response to an editing request for a specified text or a specified image, which is an editing request transmitted by the communication terminal 3(A) (a first communication terminal).

また、送受信部５１は、通信端末３（Ａ）が送信した、所定のテキストを非表示にするためのテキスト非表示要求及び所定のテキストに対応付けられた所定の画像を非表示にするための画像非表示要求のうちのいずれか一方の要求を編集要求として受信する。 The transmitter/receiver unit 51 also receives, as an editing request, either a text hide request for hiding specified text or an image hide request for hiding a specified image associated with the specified text, sent by the communication terminal 3 (A).

また、送受信部５１は、通信端末３が送信した編集要求がテキストを非表示とするためのテキスト非表示要求である場合、編集処理として、所定のテキストデータを非表示処理した第１の非表示テキストデータを通信端末３（Ｂ）に対して送信する。 In addition, when the editing request sent by the communication terminal 3 is a text non-display request for hiding text, the transmission/reception unit 51 transmits, as an editing process, to the communication terminal 3 (B) first non-display text data that has been subjected to non-display processing of the specified text data.

また、送受信部５１は、所定の画像データを非表示処理した第１の非表示画像データを含む第１の非表示画面データと、編集処理として所定の音声データを無音化処理した第１の無音化音声データとを、通信端末３（Ｂ）に対して送信する。 In addition, the transmission/reception unit 51 transmits to the communication terminal 3 (B) first non-display screen data including first non-display image data obtained by non-display processing of specific image data, and first muted audio data obtained by muting specific audio data as an editing process.

また、送受信部５１は、通信端末３が送信した編集要求が画像非表示要求である場合、編集処理として、所定の画像データを非表示処理した第２の非表示画像データ及び所定の画像に対応付けられた一以上のテキストを非表示処理した第２の非表示テキストデータを含む第２の非表示画面データを通信端末３（Ｂ）に対して送信する。また、送受信部５１は、編集処理として一以上のテキストに対応付けられた一以上の所定の音声データを無音化処理した第２の無音化音声データとを、通信端末３（Ｂ）に対して送信する。本実施形態において、送受信部５１は、送信手段及び受信手段のうち少なくとも一方の手段の一例として機能する。 When the editing request transmitted by the communication terminal 3 is a request to not display an image, the transmission/reception unit 51 transmits, as an editing process, to the communication terminal 3 (B) second hidden screen data including second hidden image data obtained by non-display processing of specific image data and second hidden text data obtained by non-display processing of one or more pieces of text associated with the specific image. The transmission/reception unit 51 also transmits, as an editing process, second muted audio data obtained by muting one or more pieces of specific audio data associated with the one or more pieces of text to the communication terminal 3 (B). In this embodiment, the transmission/reception unit 51 functions as an example of at least one of a transmitting means and a receiving means.
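
The two kinds of editing request described above can be sketched as follows. This is an illustrative reading of the description rather than the actual implementation: the class `Text`, the handler names, and the sample identifiers and times are hypothetical. Each handler returns the audio spans that would be muted.

```python
from dataclasses import dataclass

@dataclass
class Text:
    text_id: str
    image_id: str   # image associated with this text
    start_s: int    # utterance start, seconds from event start
    end_s: int      # utterance end, seconds from event start
    public: bool = True

def handle_text_hide(texts, images, text_id):
    """Text non-display request: hide the text and its associated image."""
    target = next(t for t in texts if t.text_id == text_id)
    target.public = False
    images[target.image_id] = False
    return [(target.start_s, target.end_s)]

def handle_image_hide(texts, images, image_id):
    """Image non-display request: hide the image and every text associated with it."""
    images[image_id] = False
    spans = []
    for t in texts:
        if t.image_id == image_id:
            t.public = False
            spans.append((t.start_s, t.end_s))
    return spans

texts = [
    Text("TX0005", "IM0003", 113, 118),
    Text("TX0006", "IM0003", 119, 125),
    Text("TX0007", "IM0004", 126, 131),
]
images = {"IM0003": True, "IM0004": True}  # image_id -> public flag

spans = handle_image_hide(texts, images, "IM0003")
```

Hiding one image yields one mute span per associated text, which corresponds to the "one or more" pieces of audio data muted for an image non-display request.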

取得部５２は、主に、ＣＰＵ５０１の処理によって実現され、音声認識サーバ７が送信したテキスト情報に基づくテキストデータの取得、閲覧編集画面におけるキャプチャ情報の取得等を行う。また、取得部５２は、一以上の通信端末３のうち、音声記録を編集する作成者(編集者)が使用する通信端末３（Ａ）(第１の通信端末)が送信した音声情報を表す音声データ、及び通信端末３（Ａ）に表示された画面を表す画面データを取得する。本実施形態において、取得部５２は、取得手段の一例として機能する。 The acquisition unit 52 is mainly realized by the processing of the CPU 501, and performs the acquisition of text data based on text information transmitted by the voice recognition server 7, the acquisition of capture information on the viewing and editing screen, etc. The acquisition unit 52 also acquires voice data representing voice information transmitted by a communication terminal 3(A) (first communication terminal) used by a creator (editor) who edits the voice recording, among one or more communication terminals 3, and screen data representing a screen displayed on the communication terminal 3(A). In this embodiment, the acquisition unit 52 functions as an example of an acquisition means.

算出特定部５３は、主に、ＣＰＵ５０１の処理によって実現され、所定のイベントにおいて利用者が発話を開始した開始日時等を算出する。この算出にあたり、ＣＰＵ５０１のクロック信号を用いて生成された時計情報を用いてもよい。本実施形態において、算出特定部５３は、算出手段の一例として機能する。 The calculation/identification unit 53 is mainly realized by the processing of the CPU 501, and calculates the start date and time when the user starts speaking in a specified event. This calculation may use clock information generated using a clock signal from the CPU 501. In this embodiment, the calculation/identification unit 53 functions as an example of a calculation means.

表示制御部５４は、主に、ディスプレイ５０７に対するＣＰＵ５０１の処理によって実現され、音声記録管理装置５における各種画面及び情報(データ)の表示制御を行う。また表示制御部５４は、例えば、ブラウザを用いて、ＨＴＭＬ等により作成された表示画面を、通信ネットワーク１００を介して、通信端末３のディスプレイ３１８に表示させるとも可能である。本実施形態において、表示制御部５４は、表示制御手段の一例として機能する。 The display control unit 54 is mainly realized by the processing of the CPU 501 on the display 507, and controls the display of various screens and information (data) in the voice recording management device 5. The display control unit 54 can also use a browser to display a display screen created using HTML or the like on the display 318 of the communication terminal 3 via the communication network 100. In this embodiment, the display control unit 54 functions as an example of a display control means.

判断部５５は、主に、ＣＰＵ５０１の処理によって実現され、音声記録管理装置５における各種判断を行う。また、判断部５５は、音声認識サーバが送信した後述する「確信度」が所定の閾値を超えたかを判断する。本実施形態において、判断部５５は、与えられた所定の条件を満たすか否かを判断する判断手段の一例として機能する。 The judgment unit 55 is mainly realized by the processing of the CPU 501, and performs various judgments in the voice recording management device 5. The judgment unit 55 also judges whether the "certainty level" (described later) transmitted by the voice recognition server exceeds a predetermined threshold. In this embodiment, the judgment unit 55 functions as an example of a judgment means for judging whether a given predetermined condition is satisfied.
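
The threshold judgment performed by the judgment unit 55 can be pictured with the short sketch below. The threshold value 0.8, the function name, and the sample confidence values are assumptions for illustration only; the description does not give concrete values at this point.

```python
def exceeds_threshold(confidence: float, threshold: float = 0.8) -> bool:
    """Return True only when the confidence is strictly greater than the threshold."""
    return confidence > threshold

# Hypothetical recognition results: (text, confidence)
results = [
    ("Today's meeting will be recorded.", 0.93),
    ("Today's agenda is the performance of the healthcare business.", 0.80),
    ("(unclear utterance)", 0.41),
]

# Texts whose confidence exceeds the threshold
passed = [text for text, confidence in results if exceeds_threshold(confidence)]
```

The comparison is strict, so a confidence exactly equal to the threshold does not count as exceeding it.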

認証部５６は、主に、ＣＰＵ５０１の処理によって実現され、例えば、通信端末３から要求された認証要求に基づいて、通信端末３の認証処理を行う。本実施形態において、認証部５６は、認証手段の一例として機能する。 The authentication unit 56 is mainly realized by the processing of the CPU 501, and performs authentication processing of the communication terminal 3, for example, based on an authentication request made from the communication terminal 3. In this embodiment, the authentication unit 56 functions as an example of an authentication means.

生成・処理部５７は、主に、ＣＰＵ５０１の処理によって実現され、音声認識サーバが送信したテキスト情報に基づいて音声記録データを生成する。また、生成・処理部５７は、通信端末３で表示される記録閲覧編集画面の画面データを生成する。また、生成・処理部５７は、通信端末３（Ａ）が送信したテキスト非表示要求に応じて第１の非表示画面データ及び第１の無音化音声データを生成し、画像非表示要求に応じて第２の非表示画面データ及び第２の無音化音声データを生成する。生成・処理部５７は、更に、テキスト非表示要求と画像非表示要求がそれぞれ、所定のテキストデータ及び所定の画像データを削除又は非公開とする要求のうち削除する要求の場合に、所定のテキストデータ及び所定の画像データを削除する処理を行う。生成・処理部５７は、更に、所定のテキストデータ及び所定の画像データを非公開とする要求の場合に、所定のテキストデータ及び所定の画像データを削除せずに、第１の非表示画面データで表される第１の非表示画面又は第２の非表示画面データで表される第２の非表示画面において非表示とする処理を行う。本実施形態において、生成・処理部５７は、生成手段の一例として機能する。また、生成・処理部５７は、処理手段の一例として機能する。 The generation/processing unit 57 is mainly realized by the processing of the CPU 501, and generates voice recording data based on the text information transmitted by the voice recognition server. The generation/processing unit 57 also generates screen data for the record viewing/editing screen displayed on the communication terminal 3. The generation/processing unit 57 also generates first hidden screen data and first muted voice data in response to a text non-display request transmitted by the communication terminal 3 (A), and generates second hidden screen data and second muted voice data in response to an image non-display request. A text non-display request or an image non-display request asks either to delete the specified text data and the specified image data or to make them private. When the request is to delete, the generation/processing unit 57 deletes the specified text data and the specified image data. When the request is to make them private, the generation/processing unit 57 does not delete the specified text data and the specified image data, and instead hides them on the first hidden screen represented by the first hidden screen data or on the second hidden screen represented by the second hidden screen data. In this embodiment, the generation/processing unit 57 functions as an example of a generating means. Additionally, the generation/processing unit 57 functions as an example of a processing means.

設定登録部５８は、主に、ＣＰＵ５０１の処理によって実現され、例えば、音声記録管理装置５に対して行った通信端末３を利用する利用者の認証情報を記憶部５０００に登録する。設定登録部５８は、更に、補正テキスト、ブックマーク等の登録、並びにブックマークの削除を行う。本実施形態において、設定登録部５８は、設定手段一例として機能する。また、本実施形態において、設定登録部５８は、登録手段の一例として機能する。 The setting registration unit 58 is mainly realized by the processing of the CPU 501, and for example, registers the authentication information of the user who uses the communication terminal 3, which is made to the voice recording management device 5, in the storage unit 5000. The setting registration unit 58 further registers correction text, bookmarks, etc., and deletes bookmarks. In this embodiment, the setting registration unit 58 functions as an example of a setting means. Also, in this embodiment, the setting registration unit 58 functions as an example of a registration means.

記憶読出部５９は、主に、ＲＯＭ５０２、ＥＥＰＲＯＭ５０４及びＨＤ５０５のうち少なくとも一つに対するＣＰＵ５０１の処理によって実現され、記憶部５０００に各種データ(又は情報)を記憶したり、記憶部５０００から各種データ(又は情報)を読み出したりする。本実施形態において、記憶読出部５９は、記憶読出手段の一例として機能する。 The memory/read unit 59 is mainly realized by the processing of the CPU 501 on at least one of the ROM 502, the EEPROM 504, and the HD 505, and stores various data (or information) in the memory unit 5000 and reads various data (or information) from the memory unit 5000. In this embodiment, the memory/read unit 59 functions as an example of a memory/read means.

＜音声認識サーバの機能構成＞
図４に示されているように、音声認識サーバ７は、送受信部７１、音声認識部７６及び記憶読出部７９を有する。これら各機能部は、図２に示された各ハードウエア資源のいずれかが、ＲＯＭ７０２及びＥＥＰＲＯＭ７０４のうち少なくとも一つからＲＡＭ７０３に展開された音声認識サーバ７用のプログラムに従ったＣＰＵ７０１からの命令により動作することで実現される機能又は手段である。また、音声認識サーバ７は、図２に示されているＲＯＭ７０２及びＥＥＰＲＯＭ７０４のうち少なくとも一つにより構築される記憶部７０００を有している。更に、記憶部７０００には、音声記録管理装置５と通信ネットワーク１００を介して通信を行うための通信プログラム(通信アプリ)等が記憶されている。 <Functional configuration of the voice recognition server>
As shown in Fig. 4, the voice recognition server 7 has a transmission/reception unit 71, a voice recognition unit 76, and a storage/readout unit 79. Each of these functional units is a function or means realized by any of the hardware resources shown in Fig. 2 operating in response to an instruction from the CPU 701 in accordance with a program for the voice recognition server 7 expanded from at least one of the ROM 702 and the EEPROM 704 to the RAM 703. The voice recognition server 7 also has a storage unit 7000 constructed from at least one of the ROM 702 and the EEPROM 704 shown in Fig. 2. Furthermore, the storage unit 7000 stores a communication program (communication application) for communicating with the voice recording management device 5 via the communication network 100, etc.

<<音声認識サーバの各機能構成>>
次に、音声認識サーバ７の各機能構成について詳細に説明する。図４に示されている音声認識サーバ７の送受信部７１は、主に、ネットワークＩ／Ｆ７１２及び近距離通信回路７２０に対するＣＰＵ７０１の処理によって実現され、通信ネットワーク１００を介して音声記録管理装置５との間で各種データ(又は情報)の送受信を行う。本実施形態において、送受信部７１は、送信手段及び受信手段のうち少なくとも一方の手段の一例として機能する。 <<Functional configuration of the speech recognition server>>
Next, a detailed description will be given of each functional configuration of the voice recognition server 7. The transmission/reception unit 71 of the voice recognition server 7 shown in Fig. 4 is mainly realized by the processing of the CPU 701 on the network I/F 712 and the short-range communication circuit 720, and transmits and receives various data (or information) to and from the voice recording management device 5 via the communication network 100. In this embodiment, the transmission/reception unit 71 functions as an example of at least one of the transmitting means and the receiving means.

音声認識部７６は、主に、マイク７１５及び音入出力Ｉ／Ｆ７１７に対するＣＰＵ７０１の処理によって実現され、音声記録管理装置５が送信した音声データ又は音データ(音声情報)を認識してテキストデータ(テキスト情報)に変換する。本実施形態において、音声認識部７６は、音声認識手段の一例として機能する。 The voice recognition unit 76 is mainly realized by the processing of the CPU 701 on the microphone 715 and the sound input/output I/F 717, and recognizes the voice data or sound data (voice information) transmitted by the voice recording management device 5 and converts it into text data (text information). In this embodiment, the voice recognition unit 76 functions as an example of a voice recognition means.

記憶読出部７９は、主に、ＲＯＭ７０２及びＥＥＰＲＯＭ７０４のうち少なくとも一つに対するＣＰＵ７０１の処理によって実現され、記憶部７０００に各種データ(又は情報)を記憶したり、記憶部７０００から各種データ(又は情報)を読み出したりする。本実施形態において、記憶読出部７９は、記憶読出手段の一例として機能する。 The memory readout unit 79 is mainly realized by the processing of the CPU 701 on at least one of the ROM 702 and the EEPROM 704, and stores various data (or information) in the memory unit 7000 and reads various data (or information) from the memory unit 7000. In this embodiment, the memory readout unit 79 functions as an example of a memory readout means.

本実施形態に係る通信システムでは、上述した音声認識サーバ７に加えて音声認識サーバ９も含まれるが、音声認識サーバ９の各機能構成は、音声認識サーバ７の各機能構成と同様であるため、説明を省略する。 The communication system according to this embodiment includes a voice recognition server 9 in addition to the voice recognition server 7 described above, but the functional configurations of the voice recognition server 9 are similar to those of the voice recognition server 7, so a description of them will be omitted.

〔実施形態の処理又は動作〕 [Processing or Operation of the Embodiment]
次に、図１０乃至図３９を用いて、第１の実施形態に係る音声記録管理システムにおける各処理又は動作を説明する。図１０は、アプリ起動、認証処理及びセッション確立処理の一例を示すシーケンス図である。 Next, each process or operation in the voice recording management system according to the first embodiment will be described with reference to Fig. 10 to Fig. 39. Fig. 10 is a sequence diagram showing an example of application startup, authentication processing, and session establishment processing.

<<アプリ起動及び認証処理>>
まず、通信端末３の利用者は、通信端末３で動作する記録管理アプリ及びブラウザアプリの起動操作を行う。これにより、通信端末３の操作受付部３２は、利用者により通信アプリ及びブラウザアプリの起動操作を受け付ける（ステップＳ１１）。なお、本実施形態では、上述した議事録作成端末の一例である通信端末３（Ａ）が記録管理アプリを起動すればよく、通信端末３（Ａ）とともに所定のイベントに参加する通信端末３（Ｂ）は記録管理アプリを起動する必要はない。通信端末３（Ｂ）は、記録管理アプリの起動に代えて、上述した会話ツールを起動しておけば、通信端末３（Ａ）との音声通信が可能となり、その結果、互いの音声を認識することができる。更に、本実施形態では、通信端末３において他の装置との間で利用される通信アプリは、所定のプロトコル等によって他の装置との間で通信可能な状態になっていることを前提とする。 <<App launch and authentication process>>
First, the user of the communication terminal 3 performs an operation to start the record management application and the browser application running on the communication terminal 3. As a result, the operation acceptance unit 32 of the communication terminal 3 accepts the operation to start the communication application and the browser application by the user (step S11). In this embodiment, it is sufficient that the communication terminal 3 (A), which is an example of the minutes creation terminal described above, starts the record management application, and the communication terminal 3 (B), which participates in a predetermined event together with the communication terminal 3 (A), does not need to start the record management application. If the communication terminal 3 (B) starts the above-mentioned conversation tool instead of starting the record management application, voice communication with the communication terminal 3 (A) becomes possible, and as a result, each other's voice can be recognized. Furthermore, in this embodiment, it is assumed that the communication application used between the communication terminal 3 and other devices is in a state in which communication with other devices is possible by a predetermined protocol or the like.

次に、アプリ起動部３８は、予め記憶部３０００にインストールされている音声記録管理装置５との間で通信を行うための通信アプリ及びブラウザアプリを起動する（ステップＳ１２）。その後、表示制御部３４は、ディスプレイ３１８に認証用の認証画面(サインイン画面等)を表示して利用者による認証操作を待つ（ステップＳ１３）。 Next, the application launch unit 38 launches a communication application and a browser application for communicating with the voice recording management device 5 that have been pre-installed in the storage unit 3000 (step S12). After that, the display control unit 34 displays an authentication screen (such as a sign-in screen) on the display 318 and waits for the user to perform an authentication operation (step S13).

次に、利用者は、音声記録管理装置５に対する認証処理(サインイン)を行う。これにより、操作受付部３２は、利用者によって入力された認証情報を受け付ける（ステップＳ２１）。 Next, the user performs authentication processing (signs in) to the voice recording management device 5. As a result, the operation reception unit 32 accepts the authentication information input by the user (step S21).

続いて、送受信部３１は、音声記録管理装置５に対して受け付けた認証情報に基づいて認証処理の要求を送信する（ステップＳ２２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した認証処理の要求を受信する。このとき、認証処理の要求には、音声記録管理装置５とのセッションを確立するためのセッションＩＤ、通信端末３の端末識別情報、利用者を識別する作成者識別情報及びパスワードが含まれる。 Then, based on the accepted authentication information, the transmission/reception unit 31 transmits a request for authentication processing to the voice recording management device 5 (step S22). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the request for authentication processing transmitted by the communication terminal 3. At this time, the request for authentication processing includes a session ID for establishing a session with the voice recording management device 5, terminal identification information of the communication terminal 3, creator identification information for identifying the user, and a password.

次に、音声記録管理装置５の認証部４６は、受信されたセッションＩＤ、端末識別情報、作成者識別情報及びパスワードと記憶読出部５９によってログイン管理ＤＢ５００１（図５参照）から読み出されたセッションＩＤ、端末識別情報及び作成者識別情報に対応するパスワードとを比較してログイン認証処理を行う（ステップＳ２３）。ここでは、利用者による音声記録管理装置５に対するログイン認証処理が成功しているものとする。ステップＳ２３の処理において、設定登録部５８は、ログイン認証処理をした通信端末３のＩＰアドレスをログイン管理ＤＢ５００１（図５参照）に登録してもよい。 Next, the authentication unit 46 of the voice recording management device 5 performs login authentication processing by comparing the received session ID, terminal identification information, creator identification information, and password with the password that corresponds to the session ID, terminal identification information, and creator identification information and that is read from the login management DB 5001 (see FIG. 5) by the memory readout unit 59 (step S23). Here, it is assumed that the user's login authentication to the voice recording management device 5 has succeeded. In the processing of step S23, the setting registration unit 58 may register the IP address of the communication terminal 3 that has performed the login authentication processing in the login management DB 5001 (see FIG. 5).
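
The comparison in step S23 can be sketched as follows. This is a minimal illustration, not the device's actual implementation: the dictionary standing in for the login management DB 5001, the function name `authenticate`, and all sample values are assumptions made for the example. The constant-time comparison is likewise a design choice of this sketch; the description only states that the passwords are compared.

```python
import hmac
from typing import Dict, Tuple

# Hypothetical stand-in for the login management DB 5001:
# (session ID, terminal ID, creator ID) -> registered password.
LoginKey = Tuple[str, str, str]


def authenticate(login_db: Dict[LoginKey, str], session_id: str,
                 terminal_id: str, creator_id: str, password: str) -> bool:
    """Return True when the received password matches the registered one."""
    registered = login_db.get((session_id, terminal_id, creator_id))
    if registered is None:
        return False  # no entry for this session/terminal/creator combination
    # Compare in constant time so timing does not reveal partial matches.
    return hmac.compare_digest(registered.encode(), password.encode())


# Sample values are invented for illustration only.
login_db = {("sess-001", "term-A", "user-01"): "s3cret"}
ok = authenticate(login_db, "sess-001", "term-A", "user-01", "s3cret")
ng = authenticate(login_db, "sess-001", "term-A", "user-01", "wrong")
print(ok, ng)
```

A production system would normally store salted password hashes rather than the passwords themselves; that detail is outside what the description covers.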

次に、ステップＳ２３においてログイン認証処理が成功し、通信端末３との通信セッションが確立すると、送受信部５１は、通信端末３に対して認証処理の応答及び参加処理の応答を送信する（ステップＳ２４）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した認証処理の応答及び参加処理の応答を受信する。このとき、認証処理の応答及び参加処理の応答には、セッションＩＤと音声記録管理装置５との通信セッションへの参加処理を許可する参加処理結果が含まれる。 Next, when the login authentication process is successful in step S23 and a communication session with the communication terminal 3 is established, the transmission/reception unit 51 transmits a response to the authentication process and a response to the participation process to the communication terminal 3 (step S24). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the response to the authentication process and the response to the participation process transmitted by the voice recording management device 5. At this time, the response to the authentication process and the response to the participation process include the session ID and a participation process result that permits the participation process to the communication session with the voice recording management device 5.

続いて、通信端末３のアプリ起動部３８は、次回以降のログイン処理及び通信セッション確立の簡略化のために、記憶読出部３９と協働して、記憶部３０００の所定領域に、作成者識別情報、パスワード及び利用者名を組にして記憶させて登録する（ステップＳ２５）。なお、ステップＳ２５の処理は省略されてもよい。 Then, in order to simplify the login process and communication session establishment from the next time onwards, the application launch unit 38 of the communication terminal 3 cooperates with the memory readout unit 39 to store and register the creator identification information, password, and user name as a set in a predetermined area of the memory unit 3000 (step S25). Note that the process of step S25 may be omitted.

上述した処理シーケンスの例では、通信端末３にブラウザアプリがインストールされていることを前提に説明したが、議事録閲覧端末の一例として利用される通信端末３（Ｃ）のように、ブラウザを利用したＷｅｂサービスが提供されてもよい。Ｗｅｂサービスの場合、音声記録管理装置５は、Ｗｅｂサーバの機能を有し、閲覧編集画面等の画面データ（画面全体又は画面の一部を表示させるためのデータ）を通信端末３に送信して表示させるようにしてもよい。 The above-mentioned example of the processing sequence has been explained on the assumption that a browser application is installed in the communication terminal 3, but a web service using a browser may be provided, such as in the communication terminal 3 (C) used as an example of a minutes viewing terminal. In the case of a web service, the voice recording management device 5 has a web server function, and may transmit screen data such as a viewing and editing screen (data for displaying the entire screen or part of the screen) to the communication terminal 3 for display.

本実施形態に係る音声記録管理システムでは、例えば、上述したステップＳ２２及びＳ２４の処理が実行される場合、通信端末３と音声記録管理装置５との間に他の装置等が存在してもよい。つまり、通信端末３と音声記録管理装置５との間で送受信される各情報(データ)は、一度他の装置を介して送受信されるような構成であってもよい。上述した構成は、通信端末３と音声記録管理装置５との間に他の処理ステップが存在しても適用可能である。 In the voice recording management system according to this embodiment, for example, when the processing of steps S22 and S24 described above is executed, other devices may be present between the communication terminal 3 and the voice recording management device 5. In other words, each piece of information (data) transmitted and received between the communication terminal 3 and the voice recording management device 5 may be configured to be transmitted and received once via another device. The above-described configuration is applicable even if other processing steps exist between the communication terminal 3 and the voice recording management device 5.

●画面表示例● ●Screen display example●
図１１は、通信端末におけるアプリ起動時の画面表示例である。通信端末３のディスプレイ３１８には、表示制御部３４によってディスプレイ３１８にアプリ起動画面３１０１が表示される。アプリ起動画面３１０１には、例えば、アプリをイメージするマーク(マイクの絵)と、アプリのバージョン情報が表示されている。 FIG. 11 is an example of a screen display when an application is started on a communication terminal. The display control unit 34 displays an application start-up screen 3101 on the display 318 of the communication terminal 3. The application start-up screen 3101 displays, for example, a mark representing the application (a picture of a microphone) and version information of the application.

<<記録開始処理>> <<Recording start process>>
続いて、記録開始処理について説明する。図１２は、記録開始処理の一例を示すシーケンス図である。図１２に示されているように、通信端末３の表示制御部３４は、図１１に示したようなアプリ起動画面３１０１を表示した後、記録開始指示画面をディスプレイ３１８に表示する（ステップＳ３１）。 Next, the recording start process will be described. FIG. 12 is a sequence diagram showing an example of the recording start process. As shown in FIG. 12, the display control unit 34 of the communication terminal 3 displays the application start-up screen 3101 as shown in FIG. 11, and then displays a recording start instruction screen on the display 318 (step S31).

●画面表示例● ●Screen display example●
図１３は、通信端末における記録開始指示の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ３１の処理が実行されることにより、表示制御部３４によって記録開始指示画面３１１１が表示される。記録開始指示画面３１１１には、例えば、「概要(議題)」、「参加者」、「会議メモ」、「録音するマイク」、及び「録画する画面」の各入力欄が含まれる。 FIG. 13 is an example of a screen display of a recording start instruction on a communication terminal. When the process of step S31 described above is executed, the display control unit 34 displays a recording start instruction screen 3111 on the display 318 of the communication terminal 3. The recording start instruction screen 3111 includes input fields for, for example, "Summary (agenda)", "Participants", "Meeting notes", "Recording microphone", and "Recording screen".

「概要(議題)」欄には、例えば、実行される会議等のイベントの議題が入力される。「参加者」欄には、例えば、イベントの参加者が入力される。「会議メモ」欄には、例えば、イベントにおける主な議事内容が入力される。主な議事内容は、例えば、概要、決定事項、アクションアイテム等である。これらの「概要(議題)」、「参加者」及び「会議メモ」の各入力欄に入力される項目は、会議等のイベントに参加する参加者等によって予め入力されてもよいし、イベントの終了後に追加編集されることも可能である。「録音するマイク」及び「録画する画面」の入力欄には、マイク配列や録画される画面の番号などが表示される。これらのマイク配列や録画される画面の番号は、音声記録管理装置５に対して利用者が予め設定しておいてもよいし、音声記録管理装置５が任意の条件、タイミング等に基づいて設定してもよい。なお、記録開始指示画面３１１１には、会議等のイベントにおいて特に重要な内容等が利用者によって入力することが可能な「ブックマーク」欄が設けられてもよい。 In the "Summary (agenda)" field, for example, the agenda of the event such as a meeting to be held is input. In the "Participants" field, for example, the participants of the event are input. In the "Meeting notes" field, for example, the main agenda of the event is input. The main agenda is, for example, an overview, decisions, action items, etc. The items input into each of the input fields "Summary (agenda)", "Participants", and "Meeting notes" may be input in advance by participants of the event such as a meeting, or may be added and edited after the event ends. In the input fields "Recording microphone" and "Recording screen", the microphone arrangement and the number of the screen to be recorded are displayed. These microphone arrangements and the number of the screen to be recorded may be set in advance by the user in the voice recording management device 5, or may be set by the voice recording management device 5 based on arbitrary conditions, timing, etc. In addition, the recording start instruction screen 3111 may be provided with a "bookmark" field in which the user can input particularly important contents of the event such as a meeting.

更に、記録開始指示画面３１１１には、表示制御部３４によって記録開始ボタン３５１１が表示される。通信端末３の利用者は、記録開始ボタン３５１１を操作(押下又はタップ等)することにより、会議等のイベントで発話される発話内容の記録を開始させることができる。 Furthermore, a start recording button 3511 is displayed on the start recording instruction screen 3111 by the display control unit 34. A user of the communication terminal 3 can start recording the content of speech spoken at an event such as a conference by operating (pressing or tapping, etc.) the start recording button 3511.

図１２に戻り、操作受付部３２は、利用者の操作による記録開始指示を受け付ける（ステップＳ３２）。記録開始指示の受付は、上述した記録開始ボタン３５１１に対する利用者からの操作を受け付けることにより行われる。 Returning to FIG. 12, the operation reception unit 32 receives a recording start instruction from a user (step S32). The recording start instruction is received by receiving an operation from the user on the above-mentioned recording start button 3511.

次に、通信端末３の送受信部３１は、音声記録管理装置５に対して記録開始要求を送信する（ステップＳ３３）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した記録開始要求を受信する。 Next, the transmission/reception unit 31 of the communication terminal 3 transmits a recording start request to the voice recording management device 5 (step S33). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the recording start request transmitted by the communication terminal 3.

続いて、送受信部５１は、音声認識サーバ７に対して音声データ送信開始通知を送信する（ステップＳ３４）。これにより、音声認識サーバ７の送受信部７１は、音声記録管理装置５が送信した音声データ送信開始通知を受信する。 Next, the transmission/reception unit 51 transmits a voice data transmission start notification to the voice recognition server 7 (step S34). As a result, the transmission/reception unit 71 of the voice recognition server 7 receives the voice data transmission start notification transmitted by the voice recording management device 5.

次に、音声認識サーバ７の送受信部７１は、音声記録管理装置５に対して送信開始の許可応答を送信する（ステップＳ３５）。これにより、音声記録管理装置５の送受信部５１は、音声認識サーバ７が送信した送信開始の許可応答を受信する。 Next, the transmission/reception unit 71 of the voice recognition server 7 transmits a permission response to start transmission to the voice recording management device 5 (step S35). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the permission response to start transmission transmitted by the voice recognition server 7.

次に、音声記録管理装置５の送受信部５１は、通信端末３に対して記録中画面の表示要求を送信する（ステップＳ３６）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した記録中画面の表示要求を受信する。 Next, the transmission/reception unit 51 of the voice recording management device 5 transmits a request to display the recording screen to the communication terminal 3 (step S36). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the request to display the recording screen transmitted by the voice recording management device 5.

続いて、通信端末３の表示制御部３４は、ディスプレイ３１８に記録中画面を表示し、操作受付部３２は、利用者によって操作される「記録終了ボタン」の操作を受け付ける（ステップＳ３７）。 Next, the display control unit 34 of the communication terminal 3 displays the recording screen on the display 318, and the operation reception unit 32 accepts the user's operation of the "end recording" button (step S37).

<<記録書誌情報の登録処理>> <<Registration process for record bibliographic information>>
図１４は、記録書誌情報の登録処理の一例を示すシーケンス図である。音声記録管理装置５の送受信部５１は更に、通信端末３に対して、音声データ送信開始要求を送信する（ステップＳ４１）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した音声データ送信開始要求を受信する。 FIG. 14 is a sequence diagram showing an example of a process for registering record bibliographic information. The transmission/reception unit 51 of the voice recording management device 5 further transmits a voice data transmission start request to the communication terminal 3 (step S41). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the voice data transmission start request transmitted by the voice recording management device 5.

次に、通信端末３の送受信部３１は、音声記録管理装置５に対して音声データ及び記録書誌情報を送信する（ステップＳ４２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した音声データ及び記録書誌情報を受信する。このときに送受信される記録書誌情報には、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている記録名称、開始日時、作成者識別情報(ユーザＩＤ)、イベントＵＲＬ、パスコード及び対応付け処理の各項目に対応する情報が含まれる。 Next, the transmission/reception unit 31 of the communication terminal 3 transmits the voice data and the record bibliographic information to the voice recording management device 5 (step S42). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the voice data and the record bibliographic information transmitted by the communication terminal 3. The record bibliographic information transmitted and received at this time includes information corresponding to each item of the record name, start date and time, creator identification information (user ID), event URL, passcode, and association process managed in the record bibliographic information management DB 5002 (see FIG. 6).

次に、音声記録管理装置５の送受信部５１は、音声認識サーバ７のＡＰＩ(Application Programming Interface)に対して音声認識要求を送信する（ステップＳ４３）。これにより、音声認識サーバ７の送受信部７１は、音声記録管理装置５が送信した音声認識要求を受信する。このとき、音声認識要求には、音声認識サーバによってテキスト変換の対象となる音声データが含まれる。ステップＳ４２及びＳ４３の処理が行われることによって、音声記録管理装置５の送受信部５１は、通信端末３が送信した音声データ（音声ストリーミングによる音声データ）を継続的に音声認識サーバ７に対して送信する。この場合、利用者の発話が検出されていなくても、送受信部５１は、音声認識サーバ７に対して継続的に音声ストリーミングを送信し続けてよい。但し、音声記録管理装置５は、音声認識サーバ７のＡＰＩの仕様に基づいて音声データを個別に送信し、音声認識を要求するようにしてもよい。 Next, the transmission/reception unit 51 of the voice recording management device 5 transmits a voice recognition request to the API (Application Programming Interface) of the voice recognition server 7 (step S43). As a result, the transmission/reception unit 71 of the voice recognition server 7 receives the voice recognition request transmitted by the voice recording management device 5. At this time, the voice recognition request includes voice data to be converted into text by the voice recognition server. By performing the processes of steps S42 and S43, the transmission/reception unit 51 of the voice recording management device 5 continuously transmits the voice data transmitted by the communication terminal 3 (voice data by voice streaming) to the voice recognition server 7. In this case, even if the user's speech is not detected, the transmission/reception unit 51 may continue to transmit the voice streaming to the voice recognition server 7 continuously. However, the voice recording management device 5 may transmit the voice data individually based on the specifications of the API of the voice recognition server 7 and request voice recognition.

次に、音声記録管理装置５の設定登録部５８は、記録書誌情報管理ＤＢ５００２（図６参照）に対して、ステップＳ４２で受信した記録書誌情報を登録する（ステップＳ４４）。このとき登録される記録書誌情報には、記録名称としての「ヘルスケア事業業績報告会」、「開始日時」、「作成者識別情報(ユーザＩＤ)」、「イベントＵＲＬ」、パスコード及び対応付け処理に加えて、新たに「終了日時」と「音声データパス」を示す内容が含まれる。 Next, the setting registration unit 58 of the voice recording management device 5 registers the record bibliographic information received in step S42 in the record bibliographic information management DB 5002 (see FIG. 6) (step S44). The record bibliographic information registered at this time includes the record name "Healthcare Business Performance Report Meeting", "Start Date and Time", "Creator Identification Information (User ID)", "Event URL", passcode, and association process, as well as new information indicating the "End Date and Time" and "Voice Data Path".
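
The record registered in step S44 can be pictured as a small data structure. The sketch below is illustrative only: the class name, field names, field types, the record key, and the sample URL and passcode are assumptions for the example, and the dictionary merely stands in for the record bibliographic information management DB 5002. The end date and time and the voice data path are modeled as optional on the assumption that they may be filled in once they are known.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Dict, Optional


@dataclass
class RecordBibliographicInfo:
    record_name: str
    start_datetime: datetime
    creator_id: str                  # creator identification information (user ID)
    event_url: str
    passcode: str
    association_processing: bool     # "association process" item; boolean type is assumed
    end_datetime: Optional[datetime] = None
    voice_data_path: Optional[str] = None


# Hypothetical in-memory stand-in for the management DB 5002.
bibliographic_db: Dict[str, RecordBibliographicInfo] = {}


def register_bibliographic_info(record_id: str, info: RecordBibliographicInfo) -> None:
    """Store one record's bibliographic information under its record ID."""
    bibliographic_db[record_id] = info


# Sample values other than the record name are invented for illustration.
register_bibliographic_info(
    "rec-0001",
    RecordBibliographicInfo(
        record_name="Healthcare Business Performance Report Meeting",
        start_datetime=datetime(2021, 3, 31, 11, 0, 0),
        creator_id="user-01",
        event_url="https://example.com/event/1",
        passcode="0000",
        association_processing=True,
    ),
)
print(bibliographic_db["rec-0001"].record_name)
```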

<<音声認識処理>> <<Voice recognition process>>
図１５は、音声認識処理の一例を示すシーケンス図である。まず、一以上の通信端末３のうち、議事録作成端末の一例である通信端末３（Ａ）の音・画像取得部３３は、マイク３１５を介して通信端末３（Ａ）を利用する利用者が発話した発話音声又は音を集音して音声情報(音声データ又は音データを含む。以下、単に「音声情報」と記す)を取得する（ステップＳ５１）。 FIG. 15 is a sequence diagram showing an example of the voice recognition process. First, the sound/image acquisition unit 33 of the communication terminal 3(A), which is an example of a minutes creation terminal among the one or more communication terminals 3, collects speech or sounds uttered by a user using the communication terminal 3(A) via the microphone 315 and acquires voice information (including voice data or sound data; hereinafter, simply referred to as "voice information") (step S51).

続いて、送受信部３１は、取得した音声情報を音声記録管理装置５に対して送信する（ステップＳ５２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３（Ａ）の送受信部３１が送信した、通信端末３（Ａ）を利用する利用者が発話した発話音声に係る音声情報を受信する。なお、以降の説明においては、単に通信端末３と記載する。 Then, the transmission/reception unit 31 transmits the acquired voice information to the voice recording management device 5 (step S52). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the voice information related to the voice spoken by the user of the communication terminal 3(A) transmitted by the transmission/reception unit 31 of the communication terminal 3(A). In the following explanation, this will simply be referred to as the communication terminal 3.

次に、音声記録管理装置５の送受信部５１は、音声認識サーバ７に対して音声認識要求を送信する（ステップＳ５３）。これにより、音声認識サーバ７の送受信部７１は、音声記録管理装置５が送信した音声認識要求を受信する。このとき、音声認識要求には、通信端末３が送信した音声情報(音声データ、音データ)が含まれる。つまり、音声記録管理装置５は、通信端末３と音声認識サーバ７との間の仲介装置の役割も果たしている。 Next, the transmission/reception unit 51 of the voice recording management device 5 transmits a voice recognition request to the voice recognition server 7 (step S53). As a result, the transmission/reception unit 71 of the voice recognition server 7 receives the voice recognition request transmitted by the voice recording management device 5. At this time, the voice recognition request includes the voice information (voice data, sound data) transmitted by the communication terminal 3. In other words, the voice recording management device 5 also serves as an intermediary device between the communication terminal 3 and the voice recognition server 7.

次に、音声認識サーバ７の音声認識部７６は、受信した音声情報に対して音声認識処理を実行し、音声情報をテキスト情報(テキストデータ)に変換する（ステップＳ５４）。例えば、音声認識部７６は、受信された音声データを音データとして認識した後、認識した音データに対応する所定のテキストデータに変換する。その後、音声認識部７６は、変換した所定のテキストデータを、音声認識サーバ７が備える一以上の音声認識エンジン(辞書)を用いて最適なテキストデータに変換する。なお、上述した音声データからテキストデータに変換する手法はこの限りではない。そのため、通信システム１は、一般的に知られている音声認識エンジンを用いて、所望のテキストデータを得るようなシステム構成であってよい。 Next, the voice recognition unit 76 of the voice recognition server 7 performs voice recognition processing on the received voice information and converts the voice information into text information (text data) (step S54). For example, the voice recognition unit 76 recognizes the received voice data as sound data, and then converts it into predetermined text data corresponding to the recognized sound data. The voice recognition unit 76 then converts the converted predetermined text data into optimal text data using one or more voice recognition engines (dictionaries) provided in the voice recognition server 7. Note that the above-mentioned method of converting voice data into text data is not limited to this. Therefore, the communication system 1 may be configured as a system that obtains the desired text data using a generally known voice recognition engine.

続いて、送受信部７１は、音声記録管理装置５に対して音声認識結果を送信する（ステップＳ５５）。これにより、音声記録管理装置５の送受信部５１は、音声認識サーバ７が送信した音声認識結果を受信する。このとき、音声認識結果には、変換されたテキストデータ、開始時刻、終了時刻が含まれる。具体的には、音声認識サーバ７は、例えば、テキスト情報管理ＤＢ５００５Ａ（図９Ａ参照）で管理されている「今回の開発はジャイロ方式を採用します。」という内容と、その内容が発話された開始時刻（1分59秒）を、音声記録管理装置５に対して送信する。なお、開始時刻に関しては、テキスト情報管理テーブルで説明したように、開始日時から経過した時間として管理される。終了時刻についても同様の考え方が適用される。 Then, the transmission/reception unit 71 transmits the voice recognition result to the voice recording management device 5 (step S55). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the voice recognition result transmitted by the voice recognition server 7. At this time, the voice recognition result includes the converted text data, the start time, and the end time. Specifically, the voice recognition server 7 transmits, for example, the content "This development will use the gyro method" managed in the text information management DB 5005A (see FIG. 9A) and the start time (1 minute 59 seconds) when this content was spoken to the voice recording management device 5. Note that the start time is managed as the time elapsed from the start date and time, as explained in the text information management table. The same concept is applied to the end time.

次に、設定登録部５８は、受信した１レコード分のテキスト情報をテキスト情報管理ＤＢ５００３（図７参照）に登録する（ステップＳ５６）。この場合の１レコード分のテキスト情報とは、「テキスト識別情報」、「開始時刻」、「終了時刻」、「テキスト」、公開フラグ、「終了フラグ」、「画像識別情報」に対する情報である。 Next, the setting registration unit 58 registers the received text information for one record in the text information management DB 5003 (see FIG. 7) (step S56). In this case, the text information for one record is information for "text identification information," "start time," "end time," "text," public flag, "end flag," and "image identification information."

続いて、取得部５２は、例えば、受信した開始時刻を検索キーとしてテキスト情報管理ＤＢ５００５Ａ（図９Ａ参照）を検索することにより対応するテキストデータを取得する（ステップＳ５７）。 Then, the acquisition unit 52 acquires the corresponding text data by searching the text information management DB 5005A (see FIG. 9A), for example, using the received start time as a search key (step S57).
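
Steps S56 and S57 can be sketched as storing one text record and then looking it up by its start time. This is a minimal sketch under stated assumptions: the class name, field names, field types, identifiers, and the end time are invented for illustration, and a plain list stands in for the text information management DB.

```python
from dataclasses import dataclass
from datetime import timedelta
from typing import List, Optional


@dataclass
class TextRecord:
    text_id: str                 # text identification information
    start_time: timedelta        # elapsed time from the recording's start date and time
    end_time: timedelta
    text: str
    public_flag: bool
    end_flag: bool
    image_id: Optional[str]      # image identification information of the linked capture


def find_by_start_time(records: List[TextRecord],
                       start_time: timedelta) -> Optional[TextRecord]:
    """Return the first record whose start time equals the search key, if any."""
    for record in records:
        if record.start_time == start_time:
            return record
    return None


# Step S56: register one record (identifiers and end time are hypothetical).
text_db: List[TextRecord] = [
    TextRecord(
        text_id="t-0001",
        start_time=timedelta(minutes=1, seconds=59),
        end_time=timedelta(minutes=2, seconds=5),
        text="This development will use the gyro method.",
        public_flag=True,
        end_flag=False,
        image_id="img-0003",
    )
]

# Step S57: use the received start time as the search key.
hit = find_by_start_time(text_db, timedelta(minutes=1, seconds=59))
miss = find_by_start_time(text_db, timedelta(minutes=9))
print(hit.text if hit else None, miss)
```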

続いて、算出特定部５３は、開始日時を算出する（ステップＳ５８）。開始日時の算出については、以下の式に基づいて行われる。つまり、開始日時は、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている開始日時とテキスト情報管理ＤＢ５００５Ａ（図９Ａ参照）で管理されている開始時刻を足し合わせた時間となる。具体的には、2021/03/31 11:00:00 ＋ 00:01:53 ＝ 2021/03/31 11:01:53が、ステップＳ５８で算出される開始日時となる。 Then, the calculation specification unit 53 calculates the start date and time (step S58). The calculation of the start date and time is performed based on the following formula. In other words, the start date and time is the sum of the start date and time managed in the record bibliographic information management DB 5002 (see FIG. 6) and the start time (the elapsed time from that start date and time) managed in the text information management DB 5005A (see FIG. 9A). Specifically, 2021/03/31 11:00:00 + 00:01:53 = 2021/03/31 11:01:53 is the start date and time calculated in step S58.
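
The calculation in step S58 is a date-time plus an elapsed-time offset. A minimal sketch using Python's standard `datetime` module follows; the function name `utterance_start` is an assumption for the example, and the values are the ones given in the description.

```python
from datetime import datetime, timedelta


def utterance_start(recording_start: datetime, elapsed: timedelta) -> datetime:
    """Add the elapsed start time of an utterance to the recording's start date and time."""
    return recording_start + elapsed


recording_start = datetime(2021, 3, 31, 11, 0, 0)   # start date and time of the record
elapsed = timedelta(minutes=1, seconds=53)          # start time of the utterance (00:01:53)

result = utterance_start(recording_start, elapsed)
print(result.strftime("%Y/%m/%d %H:%M:%S"))  # 2021/03/31 11:01:53
```

Keeping the utterance start as an offset and converting it only for display means the stored text records stay valid even if the record's start date and time is later corrected.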

続いて、送受信部５１は、通信端末３に対して記録画面更新要求を送信する（ステップＳ５９）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した記録画面更新要求を受信する。このとき、記録画面更新要求には、音声認識サーバ７が認識したテキスト情報(テキストデータ)、及びステップＳ５８で算出された開始日時を示す開始日時情報が含まれる。 Then, the transmission/reception unit 51 transmits a recording screen update request to the communication terminal 3 (step S59). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the recording screen update request transmitted by the voice recording management device 5. At this time, the recording screen update request includes the text information (text data) recognized by the voice recognition server 7 and the start date and time information indicating the start date and time calculated in step S58.

次に、通信端末３の表示制御部３４は、音声記録管理装置５が送信した補正テキスト情報で示される音声記録をディスプレイ３１８に表示する（ステップＳ６０）。具体的には、表示制御部３４は、図１６に示したような記録中画面に含まれる第１のテキスト情報で示される音声記録を表示させる。 Next, the display control unit 34 of the communication terminal 3 displays the voice recording indicated by the corrected text information transmitted by the voice recording management device 5 on the display 318 (step S60). Specifically, the display control unit 34 displays the voice recording indicated by the first text information included in the recording screen as shown in FIG. 16.

なお、本実施形態に係る通信システムにおいて生成・処理部５７は、音声記録管理装置５に備えられる構成以外に、通信ネットワーク１００を介して音声記録管理装置５と通信端末３とを互いに通信可能な他の装置が有するような構成であってもよい。 In addition, in the communication system according to this embodiment, the generation/processing unit 57 may be provided in the voice recording management device 5, or may be provided in another device that can communicate between the voice recording management device 5 and the communication terminal 3 via the communication network 100.

本実施形態に係る音声記録管理システムでは、更に、例えば、上述したステップＳ５３及びＳ５５の処理が実行される場合、音声記録管理装置５と音声認識サーバ７との間に他の装置等が存在してもよい。つまり、音声記録管理装置５と音声認識サーバ７との間で送受信される各情報(データ)は、一度他の装置を介して送受信されるような構成であってもよい。上述した構成は、音声記録管理装置５と音声認識サーバ７との間に他の処理ステップが存在しても適用可能である。 In the voice recording management system according to this embodiment, for example, when the processes of steps S53 and S55 described above are executed, other devices may be present between the voice recording management device 5 and the voice recognition server 7. In other words, each piece of information (data) transmitted and received between the voice recording management device 5 and the voice recognition server 7 may be configured to be transmitted and received once via another device. The above-mentioned configuration is applicable even if other processing steps are present between the voice recording management device 5 and the voice recognition server 7.

●画面表示例● ●Screen display example●
図１６は、通信端末における記録中の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ６０の処理が実行されることにより、表示制御部３４によって記録中画面３１２１が表示される。記録中画面３１２１には、例えば、図１３に示した記録開始指示画面３１１１の内容に加えて、少なくとも一以上のテキスト表示欄が表示される。このテキスト表示欄では、発話した利用者ごとに、利用者の顔写真又はイメージ画像、発話日時、発話内容を一単位として時系列に表示される。ここで、テキスト表示欄には、例えば、音声認識サーバ７が送信した「本日の会議は録音させていただきます。」、「本日の議題はヘルスケア事業の業績です。」、「まず共有画面をご覧ください。」といった内容のテキスト情報が表示される。なお、テキスト表示欄に表示される内容は、後述する音声認識エンジン変更画面以降で変更されるが、記録中画面３１２１において音声認識を変更して再度認識するようにしてもよい。 FIG. 16 is an example of a screen display during recording on the communication terminal. When the process of step S60 described above is executed, the display control unit 34 displays a recording screen 3121 on the display 318 of the communication terminal 3. In addition to the contents of the recording start instruction screen 3111 shown in FIG. 13, the recording screen 3121 displays at least one text display field. In this text display field, for each user who has spoken, a face photo or image of the user, the speech date and time, and the speech content are displayed in chronological order as one unit. Here, the text display field shows, for example, text information transmitted by the voice recognition server 7 such as "Today's meeting will be recorded.", "Today's agenda is the performance of the healthcare business.", and "Please look at the shared screen first." Note that the contents displayed in the text display field are changed on and after the voice recognition engine change screen described later, but the voice recognition may also be changed on the recording screen 3121 so that recognition is performed again.

記録中画面３１２１には、更に、表示制御部３４によって通信端末３で画面キャプチャ処理されたキャプチャ画像としての「画面３」が表示される。この「画面３」の内容は、「画面３」を表すサムネイル画像、画像アイコン等であってもよい。 The recording screen 3121 further displays "screen 3" as a captured image captured by the display control unit 34 on the communication terminal 3. The content of this "screen 3" may be a thumbnail image, an image icon, or the like representing "screen 3."

記録中画面３１２１には、更に、表示制御部３４によって記録の一時停止を指示するための「一時停止」ボタン(アイコン)３５２１、及び記録終了を指示するための「記録終了」ボタン(アイコン)３５２２が表示される。利用者は、「一時停止」ボタン(アイコン)３５２１又は「記録終了」ボタン(アイコン)３５２２を操作(押下又はタップ等)することにより、会議等のイベントで発話される発話内容の記録を一時停止又は終了させることができる。そして、「記録終了」ボタン３５２２が操作されると、通信端末３は、音声記録管理装置５に対して音声データの送信を開始する。 On the recording screen 3121, the display control unit 34 further displays a "pause" button (icon) 3521 for instructing a pause in recording and an "end recording" button (icon) 3522 for instructing the end of recording. A user can pause or end the recording of speech content spoken at an event such as a conference by operating (pressing or tapping, etc.) the "pause" button (icon) 3521 or the "end recording" button (icon) 3522. Then, when the "end recording" button 3522 is operated, the communication terminal 3 starts transmitting voice data to the voice recording management device 5.

<<画面キャプチャ処理>> <<Screen capture process>>
次に、画面キャプチャ処理について説明する。図１７は、画面キャプチャ処理の一例を示すシーケンス図である。図１７に示されているように、音声記録管理装置５の記憶読出部５９は、キャプチャ画像取得間隔ＤＢ５００５（図８Ｂ参照）で管理されているキャプチャ画像取得間隔の情報(例えば、３０秒)を読み出す（ステップＳ７１）。 Next, the screen capture process will be described. FIG. 17 is a sequence diagram showing an example of the screen capture process. As shown in FIG. 17, the memory readout unit 59 of the voice recording management device 5 reads out information on the capture image acquisition interval (e.g., 30 seconds) managed in the capture image acquisition interval DB 5005 (see FIG. 8B) (step S71).

続いて、送受信部５１は、通信端末３に対して、読み出されたキャプチャ画像取得間隔の情報を送信する（ステップＳ７２）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信したキャプチャ画像取得間隔の情報を受信する。このとき、キャプチャ画像取得間隔の情報は、３０秒という時間情報である。 Then, the transmission/reception unit 51 transmits the read information on the capture image acquisition interval to the communication terminal 3 (step S72). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the information on the capture image acquisition interval transmitted by the voice recording management device 5. At this time, the information on the capture image acquisition interval is time information of 30 seconds.

次に、通信端末３の音・画像取得部３３は、通信端末３自身のディスプレイ３１８に表示された画面を３０秒ごとにキャプチャ処理してキャプチャ画像を取得する（ステップＳ７３）。画面キャプチャ処理については、例えば、一般的に知られているプリントスクリーンキーを用いた画面キャプチャ処理に相当する手法を用いてもよい。その場合、音声記録管理装置５は、利用者によるプリントスクリーンキーの操作を介さずに、上述した所定の時間間隔で画面キャプチャ処理を自動実行することで実現される。 Next, the sound/image acquisition unit 33 of the communication terminal 3 captures the screen displayed on the display 318 of the communication terminal 3 itself every 30 seconds to acquire a captured image (step S73). The screen capture process may be, for example, a method equivalent to the commonly known screen capture process using a print screen key. In this case, the voice recording management device 5 realizes this by automatically executing the screen capture process at the above-mentioned predetermined time intervals, without the user operating the print screen key.
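
The periodic capture in step S73 amounts to a loop that grabs the screen, waits for the configured interval, and repeats until recording ends. The sketch below shows only that control flow. The function name `capture_periodically` and its parameters are assumptions for the example, and the actual screen grab is left to a caller-supplied function because the description does not name a capture API. The `sleep` parameter is injectable so the loop can be exercised without real waiting.

```python
import time
from typing import Callable, List


def capture_periodically(capture: Callable[[], bytes],
                         interval_seconds: float,
                         should_stop: Callable[[], bool],
                         sleep: Callable[[float], None] = time.sleep) -> List[bytes]:
    """Call capture() once per interval until should_stop() returns True."""
    images: List[bytes] = []
    while not should_stop():
        images.append(capture())   # one captured image per cycle
        sleep(interval_seconds)    # wait for the capture image acquisition interval
    return images


# Demonstration with a fake capture function and no real waiting.
calls = {"n": 0}


def fake_capture() -> bytes:
    calls["n"] += 1
    return f"frame-{calls['n']}".encode()


frames = capture_periodically(
    fake_capture,
    interval_seconds=30,                   # interval received in step S72
    should_stop=lambda: calls["n"] >= 3,   # stand-in for "recording has ended"
    sleep=lambda seconds: None,            # skip the delay in this demonstration
)
print(len(frames))  # 3
```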

続いて、送受信部３１は、取得したキャプチャ画像を示すキャプチャ画像データを音声記録管理装置５に対して送信する（ステップＳ７４）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信したキャプチャ画像データを受信する。このとき、通信端末３が送信する情報には、画像識別情報と画像識別情報に対応するキャプチャ画像データが含まれる。 Then, the transmission/reception unit 31 transmits capture image data indicating the acquired capture image to the voice recording management device 5 (step S74). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the capture image data transmitted by the communication terminal 3. At this time, the information transmitted by the communication terminal 3 includes the image identification information and the capture image data corresponding to the image identification information.

次に、音声記録管理装置５の設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）に、画像識別情報に対応するキャプチャ画像情報としてのキャプチャ画像データ、取得時刻及びキャプチャ画像の保存先を示す画像データパスを含む情報を登録する（ステップＳ７５）。 Next, the setting registration unit 58 of the audio recording management device 5 registers information including the captured image data as the captured image information corresponding to the image identification information, the acquisition time, and the image data path indicating the storage destination of the captured image in the captured image management DB 5004 (see FIG. 8A) (step S75).

続いて、送受信部５１は、通信端末３に対して、記録画面更新要求を送信する（ステップＳ７６）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した記録画面更新要求を受信する。 Next, the transmission/reception unit 51 transmits a recording screen update request to the communication terminal 3 (step S76). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the recording screen update request transmitted by the voice recording management device 5.

次に、通信端末３の表示制御部３４は、記録中画面を更新し（ステップＳ７７）、ディスプレイ３１８に更新後の画面を表示する（ステップＳ７８）。このときに更新される記録中画面は、後述する図２４にて詳細に説明する。 Next, the display control unit 34 of the communication terminal 3 updates the recording screen (step S77) and displays the updated screen on the display 318 (step S78). The recording screen that is updated at this time will be described in detail later with reference to FIG. 24.

本実施形態では、音声記録管理システム２は、上述したステップＳ７１－Ｓ７８までの処理を、所定のイベントが終了するまで、つまり、音声記録管理システム２は、例えば、会議における音声記録が終了するまで、上述したステップＳ７１－Ｓ７８までの処理を繰り返して実行する。 In this embodiment, the voice recording management system 2 repeatedly executes the processes of steps S71 to S78 described above until the predetermined event ends, for example, until the voice recording of a conference ends.

<<記録終了処理>> <<Recording end process>>
次に、記録終了処理について説明する。図１８は、記録終了処理の一例を示すシーケンス図である。まず、通信端末３の操作受付部３２は、利用者による「記録終了」ボタン(アイコン)３５２２に対する操作によって、記録終了指示を受け付ける（ステップＳ８１）。この記録終了指示は、会議等のイベントで発話される発話内容の記録を終了させるための指示であり、例えば、図１６に示した記録中画面３１２１に表示された「記録終了」ボタン(アイコン)３５２２を利用者が操作(押下又はタップ等)することで行われる。 Next, the recording end process will be described. FIG. 18 is a sequence diagram showing an example of the recording end process. First, the operation reception unit 32 of the communication terminal 3 receives a recording end instruction through a user's operation on the "end recording" button (icon) 3522 (step S81). This recording end instruction is an instruction to end the recording of the contents of speech spoken at an event such as a conference, and is issued, for example, by the user operating (pressing or tapping, etc.) the "end recording" button (icon) 3522 displayed on the recording screen 3121 shown in FIG. 16.

次に、送受信部３１は、音声記録管理装置５に対して、記録終了要求を送信する（ステップＳ８２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した記録終了要求を受信する。 Next, the transmission/reception unit 31 transmits a request to end recording to the voice recording management device 5 (step S82). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the request to end recording transmitted by the communication terminal 3.

次に、音声記録管理装置５の送受信部５１は、音声認識サーバ７に対して音声データ送信終了通知を送信する（ステップＳ８３）。これにより、音声認識サーバ７の送受信部７１は、音声記録管理装置５が送信した音声データ送信終了通知を受信する。 Next, the transmission/reception unit 51 of the voice recording management device 5 transmits a voice data transmission end notification to the voice recognition server 7 (step S83). As a result, the transmission/reception unit 71 of the voice recognition server 7 receives the voice data transmission end notification transmitted by the voice recording management device 5.

続いて、音声認識サーバ７の送受信部７１は、音声記録管理装置５に対して音声データ送信終了通知の受領を送信する（ステップＳ８４）。これにより、音声記録管理装置５の送受信部５１は、音声認識サーバ７が送信した音声データ送信終了通知の受領を受信する。 Then, the transmission/reception unit 71 of the voice recognition server 7 transmits a receipt of the voice data transmission end notification to the voice recording management device 5 (step S84). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the receipt of the voice data transmission end notification transmitted by the voice recognition server 7.

次に、音声記録管理装置５の送受信部５１は、通信端末３に対して、記録終了画面表示要求を送信する（ステップＳ８５）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した記録終了画面表示要求を受信する。 Next, the transmission/reception unit 51 of the voice recording management device 5 transmits a recording end screen display request to the communication terminal 3 (step S85). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the recording end screen display request transmitted by the voice recording management device 5.

次に、通信端末３の表示制御部３４は、ディスプレイ３１８に、図１９に示すような記録終了画面を表示する（ステップＳ８６）。なお、ステップＳ８６の処理は省略されてもよい。 Next, the display control unit 34 of the communication terminal 3 displays a recording end screen as shown in FIG. 19 on the display 318 (step S86). Note that the process of step S86 may be omitted.

一方、ステップＳ８５において記録終了画面表示要求を送信した音声記録管理装置５の生成・処理部５７は、記録終了操作が行われた所定のイベントの音声記録データを生成する（ステップＳ８７）。具体的には、生成・処理部５７は、記録終了操作が行われた所定のイベントの音声記録データが記憶、管理される音声データパス（「…/00005006/record.mp3」等）を生成する。 Meanwhile, the generation/processing unit 57 of the voice recording management device 5, which sent the recording end screen display request in step S85, generates voice recording data for the specified event for which the recording end operation was performed (step S87). Specifically, the generation/processing unit 57 generates an audio data path (such as ".../00005006/record.mp3") under which the voice recording data for that event is stored and managed.

続いて、設定登録部５８は、ステップＳ８７の処理で生成された音声データパスを含む記録書誌情報を、音声データパスの記憶先である記録書誌情報管理ＤＢ５００２（図６参照）に登録する（ステップＳ８８）。このとき、設定登録部５８によって登録される記録書誌情報には、終了日時及び音声データパス(「…/00005006/record.mp3」)が含まれる。 Then, the setting registration unit 58 registers the record bibliographic information including the audio data path generated in the processing of step S87 in the record bibliographic information management DB 5002 (see FIG. 6), which is the storage destination of the audio data path (step S88). At this time, the record bibliographic information registered by the setting registration unit 58 includes the end date and time and the audio data path (".../00005006/record.mp3").
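Steps S87 and S88 can be pictured with the following minimal Python sketch. The `RecordBibliographicInfo` class, its field names, the `storage_root` parameter, and the in-memory dict standing in for the record bibliographic information management DB 5002 are assumptions made for illustration. The description only states that an audio data path such as ".../00005006/record.mp3" is generated and registered together with the end date and time; treating "00005006" as a per-record directory name is likewise an assumption.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Dict, Optional


@dataclass
class RecordBibliographicInfo:
    """Hypothetical stand-in for one row of the record bibliographic information management DB 5002."""
    record_id: str
    end_datetime: Optional[datetime] = None
    audio_data_path: Optional[str] = None


def build_audio_data_path(storage_root: str, record_id: str) -> str:
    # Assumed layout <storage_root>/<record_id>/record.mp3, modeled on the
    # ".../00005006/record.mp3" example; the real prefix is not given in the text.
    return f"{storage_root.rstrip('/')}/{record_id}/record.mp3"


def register_recording_end(
    db: Dict[str, RecordBibliographicInfo],
    record_id: str,
    storage_root: str,
    ended_at: datetime,
) -> RecordBibliographicInfo:
    """Generate the audio data path and register it with the end date and time."""
    info = db.setdefault(record_id, RecordBibliographicInfo(record_id))
    info.audio_data_path = build_audio_data_path(storage_root, record_id)  # step S87
    info.end_datetime = ended_at                                           # step S88
    return info
```

Keeping path generation in its own function mirrors the split between the generation/processing unit 57 (step S87) and the setting registration unit 58 (step S88).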

一方、通信端末３の操作受付部３２は、後述する記録選択画面が表示されている状態において、利用者によって操作される共有議事録の選択を受け付ける（ステップＳ８９）。具体的には、操作受付部３２は、利用者が操作した所定のイベントタイトルに対応付けられた後述する「議事録共有」ボタンの選択を受け付ける。 Meanwhile, while the record selection screen described below is displayed, the operation reception unit 32 of the communication terminal 3 accepts the user's selection of the minutes to be shared (step S89). Specifically, the operation reception unit 32 accepts the selection of the "Share minutes" button, described below, that is associated with the event title operated by the user.

続いて、通信端末３の送受信部３１は、音声記録管理装置５に対してＵＲＬの要求を送信する（ステップＳ９０）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信したＵＲＬの要求を受信する。このとき、ＵＲＬの要求には、ステップＳ８９で選択された所定のイベントタイトルとセットで配置された「議事録共有」ボタンに対応付けられた記録識別情報が含まれる。 Then, the transmitting/receiving unit 31 of the communication terminal 3 transmits a URL request to the audio recording management device 5 (step S90). As a result, the transmitting/receiving unit 51 of the audio recording management device 5 receives the URL request transmitted by the communication terminal 3. At this time, the URL request includes the recording identification information associated with the "Share minutes" button that is arranged in combination with the specified event title selected in step S89.

次に、音声記録管理装置５の記憶読出部５９は、ステップＳ９０の処理で受信した記録識別情報を検索キーとして記録書誌情報管理ＤＢ５００２（図６参照）を検索することにより、対応するイベントのイベントＵＲＬを検索する(読み出す)（ステップＳ９１）。 Next, the memory readout unit 59 of the voice recording management device 5 searches the record bibliographic information management DB 5002 (see FIG. 6) using the record identification information received in the processing of step S90 as a search key to search (read) the event URL of the corresponding event (step S91).

次に、送受信部５１は、通信端末３に対して、ダイアログ画面情報を送信する（ステップＳ９２）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信したダイアログ画面情報を受信する。このとき、ダイアログ画面情報には、イベントＵＲＬ、パスコードが含まれる。 Next, the transmitting/receiving unit 51 transmits the dialog screen information to the communication terminal 3 (step S92). As a result, the transmitting/receiving unit 31 of the communication terminal 3 receives the dialog screen information transmitted by the voice recording management device 5. At this time, the dialog screen information includes the event URL and the passcode.

次に、通信端末３の表示制御部３４は、ディスプレイ３１８に後述する共有情報入力ダイアログを表示する（ステップＳ９３）。これにより、例えば、記録閲覧編集画面を作成する作成者は、共有情報入力ダイアログに表示されたイベントＵＲＬ、パスコードをコピーして、他の利用者にこれらのイベントＵＲＬとパスコードの各情報を提供することが可能になる。 Next, the display control unit 34 of the communication terminal 3 displays a shared information input dialog, which will be described later, on the display 318 (step S93). This allows, for example, a creator who creates a record viewing and editing screen to copy the event URL and passcode displayed in the shared information input dialog and provide the event URL and passcode information to other users.
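The lookup in steps S91 and S92 amounts to a keyed read followed by a reduced response. The sketch below illustrates this under stated assumptions: the dict-of-dicts standing in for the record bibliographic information management DB 5002, the key names `event_url` and `passcode`, and the function name are all hypothetical.

```python
from typing import Dict


def build_dialog_screen_info(
    bibliographic_db: Dict[str, Dict[str, str]], record_id: str
) -> Dict[str, str]:
    """Look up the event URL and passcode for a record (steps S91 and S92)."""
    # Step S91: search by the record identification information received in step S90.
    row = bibliographic_db[record_id]  # raises KeyError for an unknown record
    # Step S92: only the values the dialog needs are returned to the terminal.
    return {"event_url": row["event_url"], "passcode": row["passcode"]}
```

Returning only the two fields, rather than the whole row, keeps other bibliographic data out of the dialog screen information.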

なお、上述したステップＳ８７，Ｓ８８の各処理とステップＳ８６，Ｓ８９の各処理は非同期で行われるため、どちらが先に実行されてもよい。 Note that the processes of steps S87 and S88 and steps S86 and S89 are performed asynchronously, so either one can be performed first.

●画面表示例●
図１９は、通信端末における記録終了時の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ８６の処理が実行されることにより、表示制御部３４によって記録終了画面３１３１が表示される。記録終了画面３１３１には、例えば、保存終了を示す保存終了マーク、新しい会議を記録するための「会議記録」ボタン３５３１、及びログを見るための「ログ確認」ボタン３５３２が表示される。利用者は「会議記録」ボタン３５３１を操作することによって新しい会議を記録する画面に遷移することができる。また、利用者は、「ログ確認」ボタン３５３２を操作することによって、所定のイベントで発話された内容の履歴、利用者によって操作された各種ボタン、処理のログを含む各種情報(データ)を確認することができる。 ●Screen display example●
FIG. 19 is an example of the screen displayed on the communication terminal at the end of recording. When the process of step S86 described above is executed, the display control unit 34 displays a recording end screen 3131 on the display 318 of the communication terminal 3. The recording end screen 3131 displays, for example, a save-complete mark indicating that saving has finished, a "conference record" button 3531 for recording a new conference, and a "log check" button 3532 for viewing the log. By operating the "conference record" button 3531, the user can move to a screen for recording a new conference. By operating the "log check" button 3532, the user can check various information (data), including the history of what was spoken at a specific event, the buttons operated by the user, and the processing log.

●画面表示例●
図２０は、通信端末における記録選択時の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ８９の処理が実行されることにより、表示制御部３４によって記録選択画面３１４１が表示される。記録選択画面３１４１には、例えば、会議等のイベントの記録内容(議事録等)を示す日付、イベントタイトル、議事録共有ボタン(アイコン)３５３５が一揃えとして選択可能な表示形態で表示される。これにより、利用者は、任意の日付及びイベントタイトルで表された所定のイベントに対応付けられた議事録共有ボタン(アイコン)３５３５を操作して選択することができる。本実施形態では、「2021/3/31 11:01:28-12:00:00」を日付情報として与えられた「ヘルスケア事業業績報告会」のイベントに対応する議事録共有ボタン(アイコン)３５３５が、利用者によって選択された場合が示されている。具体的には、通信端末３(Ａ)の利用者が、記録選択画面３１４１中の所定のイベントタイトルをマウスオーバー操作によってマウスポインタ(カーソル)３７０１を翳すと、マウスポインタ(カーソル)３７０１によって翳されたイベントタイトルに対応付けられた議事録共有ボタン(アイコン)３５３５が表示される。そこで、通信端末３(Ａ)の利用者は、議事録共有ボタン(アイコン)３５３５を操作することによって、所定のＵＲＬとパスコードを含むダイアログにアクセスすることが可能となる。通信端末３(Ａ)の利用者は、このダイアログに所定のＵＲＬとパスコードを入力することにより、後述する音声認識エンジン変更画面へのアクセスが可能になる。 ●Screen display example●
FIG. 20 is an example of the screen displayed on the communication terminal when a record is selected. When the process of step S89 described above is executed, the display control unit 34 displays a record selection screen 3141 on the display 318 of the communication terminal 3. The record selection screen 3141 displays, for example, the date of the recorded contents (minutes, etc.) of an event such as a meeting, the event title, and a "Share minutes" button (icon) 3535 as a selectable set. This allows a user to operate and select the "Share minutes" button (icon) 3535 associated with a specific event identified by its date and event title. This embodiment shows the case in which the user has selected the "Share minutes" button (icon) 3535 corresponding to the event "Healthcare Business Performance Report Meeting", whose date information is "2021/3/31 11:01:28-12:00:00". Specifically, when the user of the communication terminal 3(A) hovers the mouse pointer (cursor) 3701 over a specific event title on the record selection screen 3141, the "Share minutes" button (icon) 3535 associated with that event title is displayed. By operating the "Share minutes" button (icon) 3535, the user of the communication terminal 3(A) can access a dialog including a specific URL and a passcode. By inputting a specific URL and passcode into this dialog, the user of the communication terminal 3(A) can access a voice recognition engine change screen, which will be described later.

●画面表示例●
図２１は、通信端末における共有情報入力ダイアログの画面表示例である。通信端末３のディスプレイ３１８には、図２０で示した任意のイベントタイトルに対応付けて配置された「議事録共有」ボタン３５３５を操作することで、表示制御部３４により、図２１に示したような共有情報入力ダイアログ３１４２が表示される。共有情報入力ダイアログ３１４２には、イベントＵＲＬの表示、及びパスコードの入力欄と「ＵＲＬパスコードコピー」ボタン３５５１が含まれる。音声記録を作成、編集する作成者は、表示されたイベントＵＲＬを確認して、パスコード入力欄に所定のパスコードを入力して「ＵＲＬパスコードコピー」ボタン３５５１を操作することができる。その後、作成者は、電子メール、チャット等を利用して他の利用者に対してコピーしたＵＲＬ(「https://・・・/00005006」等)とパスコードを提供することができる。 ●Screen display example●
FIG. 21 is an example of the shared information input dialog displayed on the communication terminal. When the "Share minutes" button 3535 arranged in association with an event title shown in FIG. 20 is operated, the display control unit 34 displays a shared information input dialog 3142 as shown in FIG. 21 on the display 318 of the communication terminal 3. The shared information input dialog 3142 includes the displayed event URL, a passcode input field, and a "URL passcode copy" button 3551. A creator who creates and edits a voice recording can check the displayed event URL, input a predetermined passcode in the passcode input field, and operate the "URL passcode copy" button 3551. The creator can then provide the copied URL (e.g., "https://.../00005006") and passcode to other users by email, chat, or the like.

<<記録閲覧編集画面の生成処理>>
次に、音声記録管理装置５による記録閲覧編集画面の生成処理について説明する。図２２は、記録閲覧編集画面の生成処理の一例を示すシーケンス図である。まず、通信端末３の操作受付部３２は、利用者により入力されたイベントＵＲＬとパスコードを受け付ける（ステップＳ１０１）。具体的な処理の例として、利用者はイベントＵＲＬをブラウザのアドレス入力欄に入力し、パスワードの入力の必要性があればパスワードの入力欄が表示され、そこにパスワード(パスコード)を入力することで実現できる。 <<Record viewing and editing screen generation process>>
Next, the process of generating the record viewing and editing screen by the voice recording management device 5 will be described. Fig. 22 is a sequence diagram showing an example of the process of generating the record viewing and editing screen. First, the operation acceptance unit 32 of the communication terminal 3 accepts the event URL and passcode entered by the user (step S101). As a specific example of the process, the user enters the event URL into the address input field of the browser, and if it is necessary to enter a password, a password input field is displayed and the password (passcode) is entered there.

次に、送受信部３１は、記録閲覧編集画面要求を音声記録管理装置５に対して送信する（ステップＳ１０２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した記録閲覧編集画面要求を受信する。このとき、記録閲覧編集画面要求には、イベントＵＲＬで示される所定のイベントの音声記録が記録された記録識別情報が含まれる。 Next, the transmission/reception unit 31 transmits a record viewing/editing screen request to the voice recording management device 5 (step S102). As a result, the transmission/reception unit 51 of the voice recording management device 5 receives the record viewing/editing screen request transmitted by the communication terminal 3. At this time, the record viewing/editing screen request includes the record identification information that identifies the voice recording of the specified event indicated by the event URL.

続いて、音声記録管理装置５の取得部５２は、記録書誌情報、テキスト情報、キャプチャ情報を取得する（ステップＳ１０３）。具体的には、取得部５２は、ステップＳ１０２で受信した記録識別情報を検索キーとして記録書誌情報管理ＤＢ５００２（図６参照）を検索することにより、対応する記録書誌情報を取得する。ここで記録書誌情報には、開始日時、ステップＳ８８で登録された終了日時と及び音声データパス(「…/00005006/record.mp3」)、「イベントＵＲＬ」、「パスコード」が含まれる。更に、取得部５２は、ステップＳ１０２で受信した記録識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応するテキスト情報を取得する。ここでテキスト情報には、「テキスト識別情報」、「開始時刻」、「終了時刻」、「テキスト」、公開フラグ、削除フラグ及び「画像識別情報」が含まれる。更に、取得部５２は、ステップＳ１０２で受信した記録識別情報を検索キーとしてキャプチャ画像管理ＤＢ５００４（図８Ａ参照）を検索することにより、対応するキャプチャ情報を取得する。ここでキャプチャ情報には、「画像識別情報」、「取得時刻」、公開フラグ、削除フラグ及び「画像データパス」が含まれる。 Next, the acquisition unit 52 of the voice recording management device 5 acquires the record bibliographic information, text information, and capture information (step S103). Specifically, the acquisition unit 52 acquires the corresponding record bibliographic information by searching the record bibliographic information management DB 5002 (see FIG. 6) using the record identification information received in step S102 as a search key. Here, the record bibliographic information includes the start date and time, the end date and time registered in step S88, and the voice data path (".../00005006/record.mp3"), "event URL", and "passcode". Furthermore, the acquisition unit 52 acquires the corresponding text information by searching the text information management DB 5003 (see FIG. 7) using the record identification information received in step S102 as a search key. Here, the text information includes "text identification information", "start time", "end time", "text", a public flag, a deletion flag, and "image identification information". Furthermore, the acquisition unit 52 searches the capture image management DB 5004 (see FIG. 8A) using the record identification information received in step S102 as a search key to acquire corresponding capture information. Here, the capture information includes "image identification information," "acquisition time," a public flag, a deletion flag, and an "image data path."
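The text information and capture information fields listed above can be modeled as two small record types. The following Python sketch is illustrative only: the class names, attribute names, the use of seconds for times, and the list-based stand-ins for the text information management DB 5003 and the capture image management DB 5004 are assumptions, not structures given in the description.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class TextInfo:
    record_id: str                  # record identification information (search key)
    text_id: str                    # "text identification information"
    start_time: int                 # "start time", assumed to be seconds from recording start
    end_time: int                   # "end time"
    text: str
    public: bool                    # public flag
    deleted: bool                   # deletion flag
    image_id: Optional[str] = None  # "image identification information", set by the matching process


@dataclass
class CaptureInfo:
    record_id: str
    image_id: str                   # "image identification information"
    acquired_at: int                # "acquisition time"
    public: bool
    deleted: bool
    image_data_path: str            # "image data path"


def fetch_for_record(
    record_id: str, texts: List[TextInfo], captures: List[CaptureInfo]
) -> Tuple[List[TextInfo], List[CaptureInfo]]:
    """Text and capture part of step S103: select the rows that belong to one record."""
    return (
        [t for t in texts if t.record_id == record_id],
        [c for c in captures if c.record_id == record_id],
    )
```

Leaving `image_id` optional reflects that a text has no associated captured image until the matching process has run.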

次に、生成・処理部５７は、記録書誌情報管理ＤＢ５００２（図６参照）、テキスト情報管理ＤＢ５００３（図７参照）、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）及びキャプチャ画像取得間隔ＤＢ５００５（図８Ｂ参照）を用いて記録閲覧編集画面を生成し、記憶部５０００の所定領域に記憶、管理する（ステップＳ１０４）。より詳細には、生成・処理部５７は記憶読出部５９と協働して、記録閲覧編集画面を構成する画面構成データ(画面用のテンプレートデータ)を、例えば、記憶部５０００の所定領域から読み出す。その後、生成・処理部５７は、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている「記録名称」、「開始日時」、「作成者識別情報(ユーザＩＤ)」、「イベントＵＲＬ」を含む記録書誌情報、テキスト情報、キャプチャ画像等を画面構成データに組み込み、記録閲覧編集画面データを生成する。 Next, the generation/processing unit 57 generates a record viewing/editing screen using the record bibliographic information management DB 5002 (see FIG. 6), the text information management DB 5003 (see FIG. 7), the capture image management DB 5004 (see FIG. 8A), and the capture image acquisition interval DB 5005 (see FIG. 8B), and stores and manages it in a predetermined area of the storage unit 5000 (step S104). More specifically, the generation/processing unit 57 cooperates with the storage/reading unit 59 to read screen configuration data (template data for the screen) that constitutes the record viewing/editing screen, for example, from a predetermined area of the storage unit 5000. After that, the generation/processing unit 57 incorporates the record bibliographic information, including the "record name", "start date and time", "creator identification information (user ID)", and "event URL" managed in the record bibliographic information management DB 5002 (see FIG. 6), text information, and capture images into the screen configuration data, and generates record viewing/editing screen data.

<<記録閲覧編集画面の生成処理の詳細：１回目>>
続いて、記録閲覧編集画面の生成処理について詳細に説明する。図２３は、作成者の通信端末又は最初の閲覧時に対する記録閲覧編集画面生成処理の一例を示すフローチャートである。このフローチャートは、会議等の所定のイベントにおける音声記録が記録された後、音声データに基づいて得られた複数のテキストと各テキストに対応付けられた画像に係る画像データ、及び複数のテキストの各々に対応する音声データとの対応付け処理が最初に行われる処理を示している。更に、このフローチャートは、音声記録を編集可能な作成者(編集者)が記録閲覧編集画面を開く場合に実行される処理を示している。 <<Details of the process of generating the record viewing and editing screen: 1st time>>
Next, the process of generating the record viewing and editing screen will be described in detail. Fig. 23 is a flowchart showing an example of the process of generating the record viewing and editing screen for the creator's communication terminal or the first viewing. This flowchart shows the process of first associating a plurality of texts obtained based on the voice data with image data related to images associated with each text, and voice data corresponding to each of the plurality of texts, after a voice recording of a specific event such as a conference is recorded. Furthermore, this flowchart shows the process executed when a creator (editor) who can edit the voice recording opens the record viewing and editing screen.

まず、音声記録管理装置５の取得部５２は、ステップＳ１０２で受信した記録識別情報を検索キーとして記録書誌情報管理ＤＢ５００２（図６参照）を検索することにより、対応する「対応付け処理」項目の項目値を取得する（ステップＳ１０４－１－１）。 First, the acquisition unit 52 of the voice recording management device 5 searches the record bibliographic information management DB 5002 (see Figure 6) using the recording identification information received in step S102 as a search key to acquire the item value of the corresponding "matching processing" item (step S104-1-1).

続いて、判断部５５は、取得部５２によって取得された「対応付け処理」の項目値が「処理済」であるかを判断する（ステップＳ１０４－１－２）。取得された「対応付け処理」の項目値が「処理済」である場合（ステップＳ１０４－１－２：ＹＥＳ）、生成・処理部５７は、キャプチャ画像単位でテキストをグルーピングする（ステップＳ１０４－１－３）。 Then, the determination unit 55 determines whether the item value of the "Matching Process" acquired by the acquisition unit 52 is "Processed" (step S104-1-2). If the item value of the acquired "Matching Process" is "Processed" (step S104-1-2: YES), the generation/processing unit 57 groups the text by captured image (step S104-1-3).

続いて、生成・処理部５７は、グルーピング結果に基づいて画面を生成し、記憶読出部５９は、生成された画面に係る画面データを記憶部５０００の所定領域に記憶してこのフローを抜ける（ステップＳ１０４－１－４）。 Next, the generation/processing unit 57 generates a screen based on the grouping results, and the storage/reading unit 59 stores screen data relating to the generated screen in a specified area of the storage unit 5000, and then exits this flow (step S104-1-4).

他方、取得された「対応付け処理」の項目値が「処理済」でない場合、すなわち「未処理」である場合（ステップＳ１０４－１－２：ＮＯ）、音声記録管理装置５は、以下のステップＳ１０４－１－５からステップＳ１０４－１－１０までの処理を繰り返し実行する。具体的には、音声記録管理装置５は、テキスト情報管理ＤＢ５００３（図７参照）に登録されたテキストごとに以下の処理を実行する（ステップＳ１０４－１－５）。 On the other hand, if the acquired item value of "Matching Process" is not "Processed", i.e., if it is "Unprocessed" (step S104-1-2: NO), the voice recording management device 5 repeatedly executes the following processes from step S104-1-5 to step S104-1-10. Specifically, the voice recording management device 5 executes the following processes for each text registered in the text information management DB 5003 (see FIG. 7) (step S104-1-5).

まず、取得部５２は、テキストに対応する開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１０４－１－６）。具体的には、取得部５２は、それぞれのテキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する開始時刻及び終了時刻の各時刻情報を取得する。 First, the acquisition unit 52 acquires each piece of time information of the start time and end time corresponding to the text (step S104-1-6). Specifically, the acquisition unit 52 acquires each piece of time information of the corresponding start time and end time by searching the text information management DB 5003 (see FIG. 7) using each piece of text identification information as a search key.

次に、算出特定部５３は、取得した開始時刻及び終了時刻の間の第１の区間を特定する（ステップＳ１０４－１－７）。具体的には、算出特定部５３は、第１の区間として以下の区間をそれぞれ特定する。
１．1分53秒から1分58秒までの5秒間（ステップＳ１０４－１－５のＮ回目）
２．1分59秒から2分2秒までの3秒間（ステップＳ１０４－１－５のＮ＋１回目）
３．2分5秒から2分12秒までの8秒間（ステップＳ１０４－１－５のＮ＋２回目）
４．2分18秒から2分23秒までの5秒間（ステップＳ１０４－１－５のＮ＋３回目）
５．2分25秒から2分28秒までの3秒間（ステップＳ１０４－１－５のＮ＋４回目）
６．・・・
次に、算出特定部５３は、取得時刻と「キャプチャ画像取得間隔」後の時刻との間の第２の区間が、第１の区間と重なったキャプチャ画像を特定する（ステップＳ１０４－１－８）。具体的には、算出特定部５３は、キャプチャ画像取得間隔ＤＢ５００５（図８Ｂ参照）を検索して、設定されているキャプチャ画像取得間隔：３０秒を読み出す。続いて、算出特定部５３は、テキスト情報管理ＤＢ５００３（図７参照）とキャプチャ画像管理ＤＢ５００４（図８Ａ参照）を用いて、以下の特定を行う。この場合、取得時刻と「キャプチャ画像取得間隔」後の時刻との間の第２の区間は、
１１．1分00秒から1分30秒までの30秒間
１２．1分30秒から2分00秒までの30秒間
１３．2分00秒から2分30秒までの30秒間
１４．・・・
これらの区間より、算出特定部５３は、第２の区間が第１の区間と重なるのは、
２．1分59秒から2分2秒までの3秒間と、１３．2分00秒から2分30秒までの30秒間であると特定する。そして、算出特定部５３は、第２の区間が第１の区間と重なったキャプチャ画像として、画像識別情報：「IM0005」に対応付けられたキャプチャ画像(画像データパス：「・・・/00005006/0005.jpg」)を特定する。 Next, the calculation identification unit 53 identifies a first interval between the acquired start time and end time (step S104-1-7). Specifically, the calculation identification unit 53 identifies each of the following intervals as a first interval:
1. The 5 seconds from 1 minute 53 seconds to 1 minute 58 seconds (Nth iteration of step S104-1-5)
2. The 3 seconds from 1 minute 59 seconds to 2 minutes 2 seconds ((N+1)th iteration of step S104-1-5)
3. The 8 seconds from 2 minutes 5 seconds to 2 minutes 12 seconds ((N+2)th iteration of step S104-1-5)
4. The 5 seconds from 2 minutes 18 seconds to 2 minutes 23 seconds ((N+3)th iteration of step S104-1-5)
5. The 3 seconds from 2 minutes 25 seconds to 2 minutes 28 seconds ((N+4)th iteration of step S104-1-5)
6. ...
Next, the calculation identification unit 53 identifies the captured image whose second interval, which runs from the acquisition time to the time one "captured image acquisition interval" later, overlaps the first interval (step S104-1-8). Specifically, the calculation identification unit 53 searches the captured image acquisition interval DB 5005 (see FIG. 8B) and reads out the set captured image acquisition interval of 30 seconds. Next, the calculation identification unit 53 performs the following identification using the text information management DB 5003 (see FIG. 7) and the captured image management DB 5004 (see FIG. 8A). In this case, the second intervals, each running from an acquisition time to the time one "captured image acquisition interval" later, are:
11. The 30 seconds from 1 minute 00 seconds to 1 minute 30 seconds
12. The 30 seconds from 1 minute 30 seconds to 2 minutes 00 seconds
13. The 30 seconds from 2 minutes 00 seconds to 2 minutes 30 seconds
14. ...
From these intervals, the calculation identification unit 53 determines that a second interval overlaps a first interval for 2. the 3 seconds from 1 minute 59 seconds to 2 minutes 2 seconds, and 13. the 30 seconds from 2 minutes 00 seconds to 2 minutes 30 seconds. The calculation identification unit 53 then identifies the captured image (image data path: ".../00005006/0005.jpg") associated with the image identification information "IM0005" as the captured image whose second interval overlaps the first interval.

次に、設定登録部５８は、特定したキャプチャ画像の画像識別情報を、対象テキストの「画像識別情報」の項目に登録する（ステップＳ１０４－１－９）。具体的には、設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）のテキスト識別情報：「TX0006」に対応する画像識別情報の項目に「IM0005」を登録する。 Next, the setting registration unit 58 registers the image identification information of the identified capture image in the "image identification information" field of the target text (step S104-1-9). Specifically, the setting registration unit 58 registers "IM0005" in the image identification information field corresponding to the text identification information "TX0006" in the text information management DB 5003 (see FIG. 7).
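The interval matching described above, in which a text's interval is compared with each captured image's 30-second window and the matching image ID is registered, can be sketched in Python as follows. Several points are assumptions of this sketch rather than statements from the description: times are expressed in seconds from the start of recording, the IDs "IM0003" and "IM0004" are hypothetical placeholders, and the rule of choosing the window with the longest overlap is introduced here only because the text does not say how a text that touches two windows is resolved. That rule does reproduce the "TX0006" to "IM0005" association given in the example.

```python
from typing import Dict, Optional, Tuple

Span = Tuple[int, int]  # (start, end) in seconds from the start of the recording


def overlap_seconds(first: Span, second: Span) -> int:
    """Length in seconds of the time shared by two intervals (0 if they do not overlap)."""
    return max(0, min(first[1], second[1]) - max(first[0], second[0]))


def pick_capture(
    text_span: Span, capture_times: Dict[str, int], interval: int = 30
) -> Optional[str]:
    """Return the ID of the captured image whose window overlaps text_span the longest."""
    best_id, best_shared = None, 0
    for image_id, acquired_at in capture_times.items():
        window = (acquired_at, acquired_at + interval)  # second interval
        shared = overlap_seconds(text_span, window)     # compared with the first interval
        if shared > best_shared:
            best_id, best_shared = image_id, shared
    return best_id


# Acquisition times 1:00, 1:30 and 2:00; only "IM0005" is named in the description.
captures = {"IM0003": 60, "IM0004": 90, "IM0005": 120}

# The text spoken from 1:59 to 2:02 shares 1 s with the 1:30 window and 2 s with the 2:00 window.
text_image_ids = {"TX0006": pick_capture((119, 122), captures)}
```

A text that overlaps no window yields `None`, which leaves its "image identification information" item empty.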

ステップＳ１０４－１－９の処理が実行された後のステップＳ１０４－１－１０では、音声記録管理装置５は、ループ処理を行うために、ステップＳ１０４－１－５に戻り、上述したステップＳ１０４－１－５からステップＳ１０４－１－１０までの処理を繰り返す。 In step S104-1-10, which follows the processing of step S104-1-9, the voice recording management device 5 returns to step S104-1-5 to perform loop processing, and repeats the processing from step S104-1-5 to step S104-1-10 described above.

ステップＳ１０４－１－５からステップＳ１０４－１－１０までの処理を繰り返した後、設定登録部５８は、記録書誌情報の「対応付け処理」項目の項目値を「未処理」から「処理済」に設定する（ステップＳ１０４－１－１１）。 After the processes from step S104-1-5 to step S104-1-10 have been repeated, the setting registration unit 58 changes the item value of the "Matching Process" item in the record bibliographic information from "Unprocessed" to "Processed" (step S104-1-11).

続いて、音声記録管理装置５は、上述したステップＳ１０４－１－３及びステップＳ１０４－１－４の処理をそれぞれ実行して、このフローを抜ける。 Then, the voice recording management device 5 executes the processes of steps S104-1-3 and S104-1-4 described above, and then exits this flow.
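Taken as a whole, the flow of FIG. 23 runs the matching process at most once per record and then groups texts by captured image. A minimal Python sketch of that control flow follows. The dict-based record, the key names (`matching`, `texts`, `text_id`, `image_id`), the status strings, and the `associate` callback standing in for steps S104-1-6 to S104-1-9 are all hypothetical.

```python
from typing import Callable, Dict, List, Optional


def group_texts_by_image(texts: List[dict]) -> Dict[Optional[str], List[str]]:
    """Step S104-1-3: collect text IDs under the captured image each one is linked to."""
    groups: Dict[Optional[str], List[str]] = {}
    for text in texts:
        groups.setdefault(text["image_id"], []).append(text["text_id"])
    return groups


def prepare_groups(
    record: dict, associate: Callable[[dict], Optional[str]]
) -> Dict[Optional[str], List[str]]:
    """Run the matching process only while the record is unprocessed, then group."""
    if record["matching"] != "processed":         # step S104-1-2: NO
        for text in record["texts"]:               # loop over steps S104-1-5 to S104-1-10
            text["image_id"] = associate(text)    # steps S104-1-6 to S104-1-9
        record["matching"] = "processed"          # step S104-1-11
    return group_texts_by_image(record["texts"])  # step S104-1-3
```

Because the status is stored with the record, a later call skips the per-text loop and goes straight to grouping, which is the shortcut taken in the YES branch of step S104-1-2.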

<<記録閲覧編集画面の生成処理の詳細：２回目以降>>
次に、記録閲覧編集画面の生成処理の他の場合について詳細に説明する。図２４は、作成者以外の利用者の通信端末に対する記録閲覧画面生成処理の一例を示すフローチャートである。このフローチャートは、会議等の所定のイベントにおける音声記録が記録された後、音声データに基づいて得られた複数のテキストと各テキストに対応付けられた画像に係る画像データ、及び複数のテキストの各々に対応する音声データとの対応付け処理が２回目以降に行われる処理を示している。更に、このフローチャートは、音声記録を編集可能な作成者(編集者)以外の利用者が記録閲覧画面を開く場合に実行される処理を示している。 <<Details of the process for generating the record viewing and editing screen: second time onwards>>
Next, another case of the generation process of the record viewing and editing screen will be described in detail. Fig. 24 is a flowchart showing an example of the generation process of the record viewing screen for a communication terminal of a user other than the creator. This flowchart shows the process of associating a plurality of texts obtained based on the voice data with image data related to the images associated with each text, and the voice data corresponding to each of the plurality of texts, which is performed for the second or subsequent times after an audio recording of a specific event such as a conference is recorded. Furthermore, this flowchart shows the process executed when a user other than the creator (editor) who can edit the audio recording opens the record viewing screen.

記録閲覧編集画面の生成処理において、利用者、削除フラグ、公開フラグに対応する、「テキスト又はキャプチャ画像の表示」及び「各操作ボタンの表示」の内容をまとめると、表１のような組合せとなる。本実施形態では、以下の表１に纏めた組合せに基づく図２４のフローチャートを説明する。 When the contents of "display of text or captured image" and "display of each operation button" corresponding to the user, deletion flag, and public flag are compiled in the process of generating the record viewing and editing screen, the combinations shown in Table 1 are obtained. In this embodiment, the flowchart in Figure 24 will be explained based on the combinations summarized in Table 1 below.

表１は、利用者、各種フラグ、テキスト又は画像の表示、並びに各種操作ボタン表示の組合せの一例を示す。 Table 1 shows an example of a combination of users, various flags, text or image displays, and various operation button displays.

まず、取得部５２は、ログイン管理ＤＢ５００１（図５参照）を検索することにより、ログイン処理を行ったログインユーザのログインユーザＩＤ(利用者識別情報)と、ステップＳ１０２で受信した記録識別情報を検索キーとして記録書誌情報管理ＤＢ５００２（図６参照）を検索することにより、対応する作成者識別情報を取得する（ステップＳ１０４－２－１）。 First, the acquisition unit 52 acquires the login user ID (user identification information) of the user who performed the login process by searching the login management DB 5001 (see FIG. 5), and acquires the corresponding creator identification information by searching the record bibliographic information management DB 5002 (see FIG. 6) using the record identification information received in step S102 as a search key (step S104-2-1).

続いて、音声記録管理装置５は、以下のステップＳ１０４－２－２からステップＳ１０４－２－９までの処理を繰り返し実行する。まず、音声記録管理装置５は、テキスト情報管理ＤＢ５００３（図７参照）に登録された「テキスト」及びキャプチャ画像管理ＤＢ５００４（図８Ａ参照）に登録されたキャプチャ画像(「画像識別情報」)ごとに、以下の処理を実行する（ステップＳ１０４－２－２）。 Then, the voice recording management device 5 repeatedly executes the following processes from step S104-2-2 to step S104-2-9. First, the voice recording management device 5 executes the following processes (step S104-2-2) for each "text" registered in the text information management DB 5003 (see FIG. 7) and each captured image ("image identification information") registered in the captured image management DB 5004 (see FIG. 8A).

まず、取得部５２は、記録識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、各テキスト識別情報に対応する公開フラグ及び削除フラグを取得する。取得部５２は更に、記録識別情報を検索キーとしてキャプチャ画像管理ＤＢ５００４（図８Ａ参照）を検索することにより、各画像識別情報に対応する公開フラグ及び削除フラグを取得する（ステップＳ１０４－２－３）。 First, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the record identification information as a search key to acquire the public flag and deletion flag corresponding to each piece of text identification information. The acquisition unit 52 further searches the capture image management DB 5004 (see FIG. 8A) using the record identification information as a search key to acquire the public flag and deletion flag corresponding to each piece of image identification information (step S104-2-3).

続いて判断部５５は、ログインユーザＩＤ(利用者識別情報)と作成者識別情報とが一致するかを判断する（ステップＳ１０４－２－４）。ログインユーザＩＤ(利用者識別情報)と作成者識別情報とが一致すると判断された場合（ステップＳ１０４－２－４：ＹＥＳ）、判断部５５は更に、ステップＳ１０４－２－３で取得した削除フラグが「True」であるかを判断する（ステップＳ１０４－２－５）。 Then, the determination unit 55 determines whether the login user ID (user identification information) and the creator identification information match (step S104-2-4). If they are determined to match (step S104-2-4: YES), the determination unit 55 further determines whether the deletion flag acquired in step S104-2-3 is "True" (step S104-2-5).

削除フラグが「True」であると判断された場合（ステップＳ１０４－２－５：ＹＥＳ）、判断部５５は更に、公開フラグが「True」であるかを判断する（ステップＳ１０４－２－６）。 If the deletion flag is determined to be "True" (step S104-2-5: YES), the determination unit 55 further determines whether the public flag is "True" (step S104-2-6).

公開フラグが「True」であると判断された場合（ステップＳ１０４－２－６：ＹＥＳ）、判断部５５は、「テキスト」又はキャプチャ画像(「画像識別情報」)と、「削除」ボタンと、「非公開」ボタンとを通信端末３に「表示する」と判断して（ステップＳ１０４－２－７）、ステップＳ１０４－２－９の処理に遷移する。 If it is determined that the public flag is "True" (step S104-2-6: YES), the determination unit 55 determines that the "text" or the capture image ("image identification information"), the "Delete" button, and the "Private" button should be "displayed" on the communication terminal 3 (step S104-2-7), and the process proceeds to step S104-2-9.

他方、公開フラグが「True」でないと判断された場合（ステップＳ１０４－２－６：ＮＯ）、判断部５５は、「テキスト」又はキャプチャ画像(「画像識別情報」)と、「削除」ボタンと、「公開」ボタンとを通信端末３に「表示する」と判断して（ステップＳ１０４－２－８）、ステップＳ１０４－２－９の処理に遷移し、以降の丸１の処理を実行する。 On the other hand, if the public flag is determined not to be "True" (step S104-2-6: NO), the determination unit 55 determines that the "text" or the capture image ("image identification information"), the "Delete" button, and the "Public" button are to be "displayed" on the communication terminal 3 (step S104-2-8), and the process proceeds to step S104-2-9, after which the processing that follows connector ① in the flowchart is executed.

ステップＳ１０４－２－４の処理において、「ログインユーザＩＤ(利用者識別情報)」と「作成者識別情報(ユーザＩＤ)」とが一致しないと判断された場合（ステップＳ１０４－２－４：ＮＯ）、判断部５５は更に、公開フラグが「True」、且つ削除フラグが「False」であるかを判断する（ステップＳ１０４－２－１０）。 If it is determined in the processing of step S104-2-4 that the "login user ID (user identification information)" and the "creator identification information (user ID)" do not match (step S104-2-4: NO), the determination unit 55 further determines whether the public flag is "True" and the deletion flag is "False" (step S104-2-10).

公開フラグが「True」、且つ削除フラグが「False」であると判断された場合（ステップＳ１０４－２－１０：ＹＥＳ）、判断部５５は、各編集ボタンを通信端末３に「表示しない」、テキスト又はキャプチャ画像を通信端末３に「表示する」と判断して（ステップＳ１０４－２－１１）、ステップＳ１０４－２－９の処理に遷移する。 If it is determined that the public flag is "True" and the deletion flag is "False" (step S104-2-10: YES), the determination unit 55 determines that each edit button is "not displayed" on the communication terminal 3 and that the text or capture image is "displayed" on the communication terminal 3 (step S104-2-11), and the process proceeds to step S104-2-9.

他方、公開フラグが「True」、且つ削除フラグが「False」でないと判断された場合（ステップＳ１０４－２－１０：ＮＯ）、判断部５５は、各編集ボタンと、テキスト又はキャプチャ画像とを通信端末３に「表示しない」と判断して（ステップＳ１０４－２－１２）、ステップＳ１０４－２－９の処理に遷移する。 On the other hand, if the condition that the public flag is "True" and the deletion flag is "False" is not satisfied (step S104-2-10: NO), the determination unit 55 determines that neither the edit buttons nor the text or capture image is to be "displayed" on the communication terminal 3 (step S104-2-12), and the process proceeds to step S104-2-9.
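For a user other than the creator, that is, a user whose login user ID differs from the creator identification information, the decision in steps S104-2-10 to S104-2-12 reduces to a single condition on the two flags. The Python sketch below illustrates only that branch; the `Display` type and the function name are hypothetical, and the creator's branch is left out because it depends on the combinations in Table 1.

```python
from typing import NamedTuple


class Display(NamedTuple):
    show_content: bool       # whether the text or captured image is shown
    show_edit_buttons: bool  # whether the edit buttons are shown


def decide_for_viewer(public: bool, deleted: bool) -> Display:
    """Steps S104-2-10 to S104-2-12 for a user who is not the creator."""
    if public and not deleted:  # step S104-2-10: YES
        # Step S104-2-11: content is shown, edit buttons are hidden.
        return Display(show_content=True, show_edit_buttons=False)
    # Step S104-2-12: neither the content nor the edit buttons are shown.
    return Display(show_content=False, show_edit_buttons=False)
```

In every flag combination a viewer never receives the edit buttons; only items that are public and not deleted are visible at all.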

上述した各処理が実行され、ステップＳ１０４－２－９の処理が実行されると、音声記録管理装置５は、ステップＳ１０４－２－２に戻り、対象となるテキストすべてに対して同様の処理を行う。 After each of the above processes has been performed and step S104-2-9 has been performed, the voice recording management device 5 returns to step S104-2-2 and performs the same processing on all of the target texts.

ステップＳ１０４－２－２からステップＳ１０４－２－９までの繰返し処理を実行後、生成・処理部５７は、以降の丸１の処理としてキャプチャ画像単位でテキストをグルーピングする（ステップＳ１０４－２－１３）。 After the repeated processing from step S104-2-2 to step S104-2-9 has been executed, the generation/processing unit 57 groups the text in units of captured images as the processing that follows connector ① in the flowchart (step S104-2-13).

続いて、生成・処理部５７は、グルーピング結果に基づいて画面を生成し、記憶読出部５９は、生成された画面に係る画面データを記憶部５０００の所定領域に記憶してこのフローを抜ける（ステップＳ１０４－２－１４）。 Next, the generation/processing unit 57 generates a screen based on the grouping results, and the storage/reading unit 59 stores screen data relating to the generated screen in a specified area of the storage unit 5000, and then exits this flow (step S104-2-14).
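The grouping of texts by captured image in step S104-2-13 can be illustrated as follows. This is a minimal sketch under assumptions: the record layout (`TextEntry` with `text_id`, `image_id`, `body`) and the function name are hypothetical, and the specification does not state how the grouping is implemented.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TextEntry:
    """Hypothetical row: one recognized text and the captured image it belongs to."""
    text_id: str
    image_id: str
    body: str


def group_texts_by_image(entries: List[TextEntry]) -> Dict[str, List[TextEntry]]:
    """Group texts per captured image, keeping the original text order."""
    groups: Dict[str, List[TextEntry]] = {}
    for entry in entries:
        # Texts sharing an image_id end up in the same group.
        groups.setdefault(entry.image_id, []).append(entry)
    return groups
```

A screen generator could then iterate over the groups and render each captured image once, followed by the texts associated with it.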

図２２に戻り、送受信部５１は、通信端末３に対して、記録閲覧編集画面要求の応答として記録閲覧編集画面応答を送信する（ステップＳ１０５）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した記録閲覧編集画面応答を受信する。このとき、記録閲覧編集画面応答には、記録閲覧編集画面を構成する画面データが含まれる。 Returning to FIG. 22, the transmission/reception unit 51 transmits a record viewing/editing screen response to the communication terminal 3 as a response to the record viewing/editing screen request (step S105). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the record viewing/editing screen response transmitted by the voice recording management device 5. At this time, the record viewing/editing screen response includes screen data that constitutes the record viewing/editing screen.

続いて、通信端末３の表示制御部３４は、図２５に示すような記録閲覧編集画面をディスプレイ３１８に表示する（ステップＳ１０６）。 Next, the display control unit 34 of the communication terminal 3 displays a record viewing and editing screen as shown in FIG. 25 on the display 318 (step S106).

●画面表示例● ●Screen display example●
図２５は、作成者の通信端末の記録閲覧編集画面の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ１０６の処理が実行されることにより、表示制御部３４によって記録閲覧編集画面３１５１が表示される。記録閲覧編集画面３１５１には、図１６に示した記録中画面３１２１と同様に、例えば、「概要(議題)」、「参加者」、「会議メモ」の各入力欄が表示される。記録閲覧編集画面３１５１には、更に、少なくとも一以上のテキスト表示欄が表示される。記録閲覧編集画面３１５１には、更に、画面キャプチャ処理によってキャプチャ処理された画面３、画面４及び画面５が、それぞれキャプチャ処理された時刻ごとに、各テキスト表示欄に表示された各テキストと対応付けて表示される。なお、記録閲覧編集画面３１５１が表示される通信端末３の利用者は、例えば、「理光太郎」である。 FIG. 25 is an example of the record viewing and editing screen displayed on the creator's communication terminal. When the process of step S106 described above is executed, the display control unit 34 displays the record viewing and editing screen 3151 on the display 318 of the communication terminal 3. Like the recording screen 3121 shown in FIG. 16, the record viewing and editing screen 3151 displays, for example, input fields for "Summary (agenda)", "Participants", and "Meeting notes". The record viewing and editing screen 3151 further displays at least one text display field. It also displays screens 3, 4, and 5, captured by the screen capture process, in association with the texts shown in the text display fields, arranged by the time at which each screen was captured. The user of the communication terminal 3 on which the record viewing and editing screen 3151 is displayed is, for example, "Riko Taro".

記録閲覧編集画面３１５１では、例えば、各テキスト表示欄の近傍にマウス３７０１が置かれる操作(マウスオーバー操作)が行われた場合、表示制御部３４によって、「議事録共有」ボタン(アイコン)３５４１、「非公開」ボタン(アイコン)３５４２、「音声再生」ボタン(アイコン)３５４３及び「削除」ボタン(アイコン)３５４４が、マウスオーバー操作された近傍に表示される。 On the record viewing and editing screen 3151, for example, when the mouse 3701 is placed near a text display field (a mouse-over operation), the display control unit 34 displays the "Share minutes" button (icon) 3541, the "Private" button (icon) 3542, the "Play audio" button (icon) 3543, and the "Delete" button (icon) 3544 near the position of the mouse-over operation.

これらの各ボタン(アイコン)のうち、「議事録共有」ボタン(アイコン)３５４１は、所定のイベントの音声記録(例えば、議事録)を作成する作成者が編集した議事録が表示された記録閲覧画面を作成者以外の他の利用者が利用する通信端末３に共有させるためのボタン(アイコン)である。この「議事録共有」ボタン(アイコン)３５４１は、図２０で説明した「議事録共有」ボタン(アイコン)３５３５と同様の機能を備える。そのため、図２０で示した記録選択画面３１４１が表示されない場合は、この記録閲覧編集画面３１５１中に表示される「議事録共有」ボタン(アイコン)３５４１を操作して議事録等を共有する処理を行ってもよい。これにより作成者は、この「議事録共有」ボタン(アイコン)３５４１を操作することにより、他の利用者に対して、議事録等を共有させることができる。 Of these buttons (icons), the "Share minutes" button (icon) 3541 is a button (icon) for allowing a record viewing screen displaying the minutes edited by the creator who creates an audio record (e.g., minutes) of a specified event to be shared with a communication terminal 3 used by a user other than the creator. This "Share minutes" button (icon) 3541 has the same function as the "Share minutes" button (icon) 3535 described in FIG. 20. Therefore, if the record selection screen 3141 shown in FIG. 20 is not displayed, the "Share minutes" button (icon) 3541 displayed in the record viewing and editing screen 3151 may be operated to share the minutes, etc. This allows the creator to share the minutes, etc. with other users by operating this "Share minutes" button (icon) 3541.

「非公開」ボタン(アイコン)３５４２は、後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍のテキストを非公開(非表示)にさせるためのボタン(アイコン)である。利用者によって「非公開」ボタン(アイコン)３５４２が操作されることにより、音声記録管理装置５は、後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍のテキストを非公開(非表示)にさせることができる。なお、「非公開」ボタン(アイコン)が表示され、非公開処理が実行された後、再度同じテキストに対してマウスオーバー操作が行われると、記録閲覧編集画面３１５１では、対象のテキストの近傍に「公開」ボタン(アイコン)が表示されるようにしてもよい。利用者によって「公開」ボタン(アイコン)が操作されることにより、音声記録管理装置５は、後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍のテキストを公開(再表示)させることができる。このように、非公開処理は、対象となるテキストが削除されるのではなく、一時的に非公開(非表示)とさせる機能の一例である。 The "Private" button (icon) 3542 is a button (icon) for making the text near the mouse-over position private (hidden) on the record viewing screen, described later, of the communication terminal 3 of a user other than the creator. When the user operates the "Private" button (icon) 3542, the voice recording management device 5 can make that text private (hidden) on the record viewing screen of the communication terminal 3 of a user other than the creator. Note that after the "Private" button (icon) has been displayed and the private process has been executed, if a mouse-over operation is performed on the same text again, the record viewing and editing screen 3151 may display a "Public" button (icon) near the target text. When the user operates the "Public" button (icon), the voice recording management device 5 can make that text public (redisplayed) on the record viewing screen of the communication terminal 3 of a user other than the creator. In this way, the private process is an example of a function that does not delete the target text but temporarily makes it private (hidden).

「音声再生」ボタン(アイコン)３５４３は、利用者が特定の音声を再生させるためのボタンである。利用者は、「音声再生」ボタン(アイコン)３５４３を操作することにより、利用者が選択した特定の音声記録に係る音声を再生させることができる。 The "Play Audio" button (icon) 3543 is a button that allows the user to play back a specific audio. By operating the "Play Audio" button (icon) 3543, the user can play back the audio associated with a specific audio recording that the user has selected.

「削除」ボタン(アイコン)３５４４は、特定のテキストを削除するためのボタンである。利用者によって「削除」ボタン(アイコン)３５４４が操作されることにより、音声記録管理装置５は、記録閲覧編集画面３１５１及び後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍のテキストを削除させることができる。 The "Delete" button (icon) 3544 is a button for deleting specific text. When the user operates the "Delete" button (icon) 3544, the voice recording management device 5 can delete the text near the area where the mouse is hovered over on the record viewing and editing screen 3151 and the record viewing screen on the communication terminal 3 of a user other than the creator, which will be described later.

記録閲覧編集画面３１５１では更に、各画面の近傍でマウスオーバー操作がされた場合に、表示制御部３４によって、「非公開」ボタン(アイコン)３５４５、及び「削除」ボタン(アイコン)３５４６が表示される。 Furthermore, on the record viewing and editing screen 3151, when the mouse is hovered over the vicinity of each screen, the display control unit 34 displays a "Private" button (icon) 3545 and a "Delete" button (icon) 3546.

これらの各ボタン(アイコン)のうち、「非公開」ボタン(アイコン)３５４５は、後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍の画面内の画像を非公開(非表示)にさせるためのボタン(アイコン)である。利用者によって「非公開」ボタン(アイコン)３５４５が操作されることにより、音声記録管理装置５は、後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍の画面内の画像を非公開(非表示)にさせることができる。 Of these buttons (icons), the "Private" button (icon) 3545 is a button (icon) for making private (hidden) an image in a nearby screen that is moused over on the record viewing screen of a communication terminal 3 of a user other than the creator, which will be described later. When the user operates the "Private" button (icon) 3545, the audio recording management device 5 can make private (hidden) an image in a nearby screen that is moused over on the record viewing screen of a communication terminal 3 of a user other than the creator, which will be described later.

「削除」ボタン(アイコン)３５４６は、特定の画像を削除するためのボタンである。利用者によって「削除」ボタン(アイコン)３５４６が操作されることにより、音声記録管理装置５は、記録閲覧編集画面３１５１及び後述する作成者以外の利用者の通信端末３における記録閲覧画面において、マウスオーバーされた近傍の画面内の画像を削除させることができる。 The "Delete" button (icon) 3546 is a button for deleting a specific image. When the user operates the "Delete" button (icon) 3546, the voice recording management device 5 can delete images in the screen near the area where the mouse is hovered on the record viewing and editing screen 3151 and the record viewing screen on the communication terminal 3 of a user other than the creator, which will be described later.

記録閲覧編集画面３１５１では更に、上述した「音声再生」ボタン(アイコン)３５４３に対する操作に連動した音声再生表示部３６０１が表示制御部３４によって表示される。音声再生表示部３６０１における●印は、各テキストが表示されている日時情報(時刻情報)に対応して移動する。例えば、利用者が「音声再生」ボタン(アイコン)３５４３を操作して「2021/03/31 11:02:18」の位置に●印をシーク操作させると、音声記録管理装置５の音声再生部３６によって、「2020年度の売上は〇〇です」という音声が再生される。 Furthermore, on the record viewing and editing screen 3151, an audio playback display section 3601 linked to the operation of the above-mentioned "audio playback" button (icon) 3543 is displayed by the display control section 34. The ● mark in the audio playback display section 3601 moves corresponding to the date and time information (time information) at which each text is displayed. For example, when the user operates the "audio playback" button (icon) 3543 to seek the ● mark to the position of "2021/03/31 11:02:18", the audio playback section 36 of the audio recording management device 5 plays the audio "Sales for fiscal year 2020 are XX."

なお、上述した各「非公開」ボタン(アイコン)３５４２、「音声再生」ボタン(アイコン)３５４３、「削除」ボタン(アイコン)３５４４、「非公開」ボタン(アイコン)３５４５、「削除」ボタン(アイコン)３５４６は、マウスオーバー操作等によってそれぞれのテキスト及び画面の近傍に表示される。図２５に示した例では、便宜上、いずれか一つのボタン(アイコン)に対してのみ符号(番号)が付与され、他のボタン(アイコン)は点線表示されている。但し、同じ種類のボタンに対しては、同じ符号(番号)が付与される。 The above-mentioned "Private" button (icon) 3542, "Play audio" button (icon) 3543, "Delete" button (icon) 3544, "Private" button (icon) 3545, and "Delete" button (icon) 3546 are displayed near the respective text and screen by hovering the mouse over them, etc. In the example shown in FIG. 25, for convenience, a symbol (number) is given to only one of the buttons (icons), and the other buttons (icons) are displayed with dotted lines. However, the same symbol (number) is given to buttons of the same type.

●画面表示例● ●Screen display example●
図２６は、作成者以外の利用者の通信端末における記録閲覧の画面表示例である。作成者以外の利用者の通信端末３のディスプレイ３１８には、図２５で示した記録閲覧編集画面３１５１に対応して、表示制御部３４によって記録閲覧画面３１６１が表示される。記録閲覧画面３１６１には、記録閲覧編集画面３１５１と同様の内容が表示されるが、「非公開」ボタン(アイコン)３５４２、「削除」ボタン(アイコン)３５４４、「非公開」ボタン(アイコン)３５４５、及び「削除」ボタン(アイコン)３５４６は表示されない。これは、記録閲覧編集画面の作成者が「理光太郎」であって作成者以外の利用者には、記録閲覧編集画面に対する編集権限が与えられていないためである。 FIG. 26 is an example of the record viewing screen displayed on the communication terminal of a user other than the creator. On the display 318 of the communication terminal 3 of the user other than the creator, the display control unit 34 displays a record viewing screen 3161 corresponding to the record viewing and editing screen 3151 shown in FIG. 25. The record viewing screen 3161 displays the same content as the record viewing and editing screen 3151, except that the "Private" button (icon) 3542, the "Delete" button (icon) 3544, the "Private" button (icon) 3545, and the "Delete" button (icon) 3546 are not displayed. This is because the creator of the record viewing and editing screen is "Riko Taro", and users other than the creator are not given editing authority for the record viewing and editing screen.

なお、記録閲覧画面３１６１が表示される作成者以外の利用者は、例えば、「海老名二郎」である。 The user other than the creator for whom the record viewing screen 3161 is displayed is, for example, "Ebina Jiro".

<<記録閲覧編集処理>> <<Record viewing and editing process>>
次に、記録閲覧編集処理について説明する。図２７は、記録閲覧編集処理の一例を示すシーケンス図である。図２７に示されているように、通信端末３の表示制御部３４は、ディスプレイ３１８にステップＳ１０６で生成された記録閲覧編集画面を表示し、操作受付部３２は、利用者(作成者)により編集操作指示を受け付ける（ステップＳ１１１）。 Next, the record viewing and editing process will be described. Fig. 27 is a sequence diagram showing an example of the record viewing and editing process. As shown in Fig. 27, the display control unit 34 of the communication terminal 3 displays the record viewing and editing screen generated in step S106 on the display 318, and the operation reception unit 32 receives an editing operation instruction from the user (creator) (step S111).

続いて、送受信部３１は、音声記録管理装置５に対して、編集操作指示要求を送信する（ステップＳ１１２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した編集操作指示要求を受信する。このとき、編集操作指示要求には、以下の三つの情報のうち、少なくとも一つの情報が含まれる。一つは、対象のテキストを識別するテキスト識別情報と対象のテキストに対応付けられた各種操作ボタン情報である。もう一つは、対象の画像を識別する画像識別情報と対象の画像に対応付けられた各種操作ボタン情報である。更にもう一つは、後述するテキストグループを識別するテキストグループＩＤと対象のテキストグループに対応付けられた各種操作ボタン情報である。 Then, the transmission/reception unit 31 transmits an editing operation instruction request to the audio recording management device 5 (step S112). As a result, the transmission/reception unit 51 of the audio recording management device 5 receives the editing operation instruction request transmitted by the communication terminal 3. The editing operation instruction request includes at least one of the following three pieces of information. The first is text identification information that identifies the target text, together with the operation button information associated with that text. The second is image identification information that identifies the target image, together with the operation button information associated with that image. The third is a text group ID that identifies a text group, described later, together with the operation button information associated with that text group.
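The content of the editing operation instruction request can be pictured as a small data structure. The following is a sketch only: the class names, field names, and the string values used for the target kinds are hypothetical, and the specification does not define a wire format.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical labels for the three kinds of target named in the text.
TARGET_KINDS = ("text", "image", "text_group")


@dataclass(frozen=True)
class EditTarget:
    """One identifier paired with the operated button (names are assumptions)."""
    kind: str        # "text", "image" or "text_group"
    identifier: str  # text identification info, image identification info, or text group ID
    button: str      # e.g. "private", "public", "delete"


@dataclass(frozen=True)
class EditRequest:
    """Editing operation instruction request carrying at least one target."""
    targets: Tuple[EditTarget, ...]

    def __post_init__(self) -> None:
        # The request must include at least one of the three kinds of information.
        if not self.targets:
            raise ValueError("an edit request needs at least one target")
        for target in self.targets:
            if target.kind not in TARGET_KINDS:
                raise ValueError(f"unknown target kind: {target.kind}")
```

For example, `EditRequest((EditTarget("text", "t1", "private"),))` would represent a "Private" operation on one text.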

次に、音声記録管理装置５は、記録閲覧編集処理を行う（ステップＳ１１３）。この記録閲覧編集処理においては、記録書誌情報管理ＤＢ５００２（図６参照）、テキスト情報管理ＤＢ５００３（図７参照）、キャプチャ画像管理ＤＢ５００４（図７Ａ参照）、キャプチャ画像取得間隔ＤＢ５００５（図７Ｂ参照）、及び非公開音声管理ＤＢ５００６（図８参照）がそれぞれ用いられる。 Next, the voice recording management device 5 performs a record viewing and editing process (step S113). In this record viewing and editing process, the record bibliographic information management DB 5002 (see FIG. 6), the text information management DB 5003 (see FIG. 7), the capture image management DB 5004 (see FIG. 7A), the capture image acquisition interval DB 5005 (see FIG. 7B), and the private voice management DB 5006 (see FIG. 8) are used.

<<記録閲覧編集処理の振分け>> <<Dispatching of record viewing and editing processes>>
続いて、記録閲覧編集処理の振分けについて説明する。図２８Ａは、各種ボタン操作により分岐される処理の一例を示すフローチャートである。まず、取得部５２は、ステップＳ１１２で受信した各種ボタン情報を取得する（ステップＳ１１３－１）。 Next, the dispatching of the record viewing and editing process will be described. Fig. 28A is a flowchart showing an example of the processing that branches according to the operated button. First, the acquisition unit 52 acquires the various button information received in step S112 (step S113-1).

続いて、判断部５５は、取得したボタン情報がテキストの「非公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－１－１）。取得したボタン情報がテキストの「非公開」ボタン(アイコン)である場合（ステップＳ１１３－１－１：ＹＥＳ）、音声記録管理装置５は、後述する丸Ａの処理に遷移する。 Then, the judgment unit 55 judges whether the acquired button information is a "Private" button (icon) in the text (step S113-1-1). If the acquired button information is a "Private" button (icon) in the text (step S113-1-1: YES), the voice recording management device 5 transitions to the process of circle A described later.

取得したボタン情報がテキストの「非公開」ボタン(アイコン)でない場合（ステップＳ１１３－１－１：ＮＯ）、判断部５５は更に、取得したボタン情報がテキストの「公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－１－２）。取得したボタン情報がテキストの「公開」ボタン(アイコン)である場合（ステップＳ１１３－１－２：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｂの処理に遷移する。 If the acquired button information is not a "Private" button (icon) of the text (step S113-1-1: NO), the judgment unit 55 further judges whether the acquired button information is a "Public" button (icon) of the text (step S113-1-2). If the acquired button information is a "Public" button (icon) of the text (step S113-1-2: YES), the voice recording management device 5 transitions to the process of circle B described below.

取得したボタン情報がテキストの「公開」ボタン(アイコン)でない場合（ステップＳ１１３－１－２：ＮＯ）、判断部５５は更に、取得したボタン情報がテキストの「削除」ボタン(アイコン)であるかを判断する（ステップＳ１１３－１－３）。取得したボタン情報がテキストの「削除」ボタン(アイコン)である場合（ステップＳ１１３－１－３：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｃの処理に遷移する。 If the acquired button information is not a "Public" button (icon) of the text (step S113-1-2: NO), the judgment unit 55 further judges whether the acquired button information is a "Delete" button (icon) of the text (step S113-1-3). If the acquired button information is a "Delete" button (icon) of the text (step S113-1-3: YES), the voice recording management device 5 transitions to the process of circle C described below.

取得したボタン情報がテキストの「削除」ボタン(アイコン)でない場合（ステップＳ１１３－１－３：ＮＯ）、音声記録管理装置５は、各種ボタン情報を取得２の処理に遷移する（ステップＳ１１３－１－４）。 If the acquired button information is not the "Delete" button (icon) of the text (step S113-1-3: NO), the voice recording management device 5 transitions to the "acquire various button information 2" process (step S113-1-4).

図２８Ｂは、各種ボタン操作により分岐される処理の一例を示すフローチャートである。図２８Ｂにおいて、取得部５２は、ステップＳ１１２で受信した各種ボタン情報を取得する（ステップＳ１１３－２）。 Figure 28B is a flowchart showing an example of a process branched by various button operations. In Figure 28B, the acquisition unit 52 acquires the various button information received in step S112 (step S113-2).

続いて、判断部５５は、取得したボタン情報がキャプチャ画像の「非公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－２－１）。取得したボタン情報がキャプチャ画像の「非公開」ボタン(アイコン)である場合（ステップＳ１１３－２－１：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｄの処理に遷移する。 Then, the judgment unit 55 judges whether the acquired button information is a "Private" button (icon) of the captured image (step S113-2-1). If the acquired button information is a "Private" button (icon) of the captured image (step S113-2-1: YES), the voice recording management device 5 transitions to the process of circle D described below.

取得したボタン情報がキャプチャ画像の「非公開」ボタン(アイコン)でない場合（ステップＳ１１３－２－１：ＮＯ）、判断部５５は更に、取得したボタン情報がキャプチャ画像の「公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－２－２）。取得したボタン情報がキャプチャ画像の「公開」ボタン(アイコン)である場合（ステップＳ１１３－２－２：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｅの処理に遷移する。 If the acquired button information is not the "Private" button (icon) of the captured image (step S113-2-1: NO), the judgment unit 55 further judges whether the acquired button information is the "Public" button (icon) of the captured image (step S113-2-2). If the acquired button information is the "Public" button (icon) of the captured image (step S113-2-2: YES), the voice recording management device 5 transitions to the process of circle E described below.

取得したボタン情報がキャプチャ画像の「公開」ボタン(アイコン)でない場合（ステップＳ１１３－２－２：ＮＯ）、判断部５５は更に、取得したボタン情報がキャプチャ画像の「削除」ボタン(アイコン)であるかを判断する（ステップＳ１１３－２－３）。取得したボタン情報がキャプチャ画像の「削除」ボタン(アイコン)である場合（ステップＳ１１３－２－３：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｆの処理に遷移する。 If the acquired button information is not the "Public" button (icon) of the captured image (step S113-2-2: NO), the judgment unit 55 further judges whether the acquired button information is the "Delete" button (icon) of the captured image (step S113-2-3). If the acquired button information is the "Delete" button (icon) of the captured image (step S113-2-3: YES), the voice recording management device 5 transitions to the process of circle F described below.

取得したボタン情報がキャプチャ画像の「削除」ボタン(アイコン)でない場合（ステップＳ１１３－２－３：ＮＯ）、音声記録管理装置５は、各種ボタン情報を取得３の処理に遷移する（ステップＳ１１３－２－４）。 If the acquired button information is not the "Delete" button (icon) of the captured image (step S113-2-3: NO), the voice recording management device 5 transitions to the "acquire various button information 3" process (step S113-2-4).
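The branching of FIGS. 28A and 28B checks the button information one case at a time. The same routing can be sketched with a lookup table, which is a substitute for the chained checks in the flowcharts rather than the method described there. The string keys and the function name `route_button` are hypothetical; only the circle labels A to F come from the text.

```python
from typing import Dict, Optional, Tuple

# (target kind, operated button) -> circle label of the process to run.
ROUTES: Dict[Tuple[str, str], str] = {
    ("text", "private"): "A",   # step S113-1-1: YES
    ("text", "public"): "B",    # step S113-1-2: YES
    ("text", "delete"): "C",    # step S113-1-3: YES
    ("image", "private"): "D",  # step S113-2-1: YES
    ("image", "public"): "E",   # step S113-2-2: YES
    ("image", "delete"): "F",   # step S113-2-3: YES
}


def route_button(target_kind: str, button: str) -> Optional[str]:
    """Return the circle label for the operated button, or None.

    None models the case where every check answers NO and the flow
    moves on to the next "acquire various button information" stage.
    """
    return ROUTES.get((target_kind, button))
```

A table keeps the six cases in one place; the chained form in the flowcharts yields the same result for these six inputs.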

<<テキストに対する非公開ボタン操作時の処理>> <<Processing when the Private button is operated on text>>
次に、テキストに対する非公開ボタン操作時の処理について説明する。図２９は、テキストに対する非公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図２８Ａで判断された遷移先としての丸Ａの処理(ステップＳ１１３－１－１０１からステップＳ１１３－１－１０５)が実行される。 Next, the process performed when the Private button is operated on text will be described. Fig. 29 is a flowchart showing an example of the process performed when the Private button is operated on text. In this flowchart, the process of circle A, the transition destination determined in Fig. 28A (steps S113-1-101 to S113-1-105), is executed.

まず、音声記録管理装置５の設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている、ステップＳ１１２で受信したテキスト識別情報に対応する公開フラグを「False」に設定する（ステップＳ１１３－１－１０１）。 First, the setting registration unit 58 of the voice recording management device 5 sets the public flag corresponding to the text identification information received in step S112, which is managed in the text information management DB 5003 (see Figure 7), to "False" (step S113-1-101).

次に、取得部５２は、テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対象テキストに対応する画像識別情報を取得する（ステップＳ１１３－１－１０２）。このときの画像識別情報は、対象テキストを含む画面をキャプチャ処理したキャプチャ画像を識別するための情報である。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information as a search key to acquire image identification information corresponding to the target text (step S113-1-102). The image identification information at this time is information for identifying a capture image obtained by capturing a screen including the target text.

次に、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、各キャプチャ画像を識別する画像識別情報に対応する公開フラグを「False」に設定する（ステップＳ１１３－１－１０３）。 Next, the setting registration unit 58 sets to "False" the public flag that is managed in the capture image management DB 5004 (see FIG. 8A) and corresponds to the image identification information identifying each capture image (step S113-1-103).

次に、取得部５２は、テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－１－１０４）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information as a search key to acquire the time information of the corresponding start time and end time (step S113-1-104).

続いて、設定登録部５８は、取得した開始時刻及び終了時刻の各時刻情報を記録書誌情報管理ＤＢ５００２（図６参照）に登録し、非公開音声管理ＤＢ５００６（図９参照）で管理されている各時刻情報の間の音声を無音化処理してこのフローを抜ける（ステップＳ１１３－１－１０５）。 Next, the setting registration unit 58 registers the acquired start time and end time information in the recorded bibliographic information management DB 5002 (see Figure 6), mutes the audio between each piece of time information managed in the private audio management DB 5006 (see Figure 9), and exits this flow (step S113-1-105).
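Steps S113-1-101 to S113-1-105 can be sketched with in-memory dictionaries standing in for the databases. This is an illustration under assumptions: the dictionary layouts, the key names, and the function name are hypothetical, and a single list of muted intervals stands in for the time information that the text says is registered and then used for muting.

```python
from typing import Dict, List, Tuple

Interval = Tuple[float, float]  # (start time, end time), assumed to be in seconds


def make_text_private(
    text_id: str,
    text_db: Dict[str, dict],       # stand-in for the text information management DB
    image_db: Dict[str, dict],      # stand-in for the capture image management DB
    muted_intervals: List[Interval],
) -> Interval:
    """Hide one text, its captured image, and the matching audio span."""
    row = text_db[text_id]
    row["public"] = False                       # step S113-1-101
    image_id = row["image_id"]                  # step S113-1-102
    image_db[image_id]["public"] = False        # step S113-1-103
    interval = (row["start"], row["end"])       # step S113-1-104
    if interval not in muted_intervals:         # step S113-1-105: register span to mute
        muted_intervals.append(interval)
    return interval
```

Note that nothing is deleted here: the text, the image, and the audio remain stored, and only flags and the list of muted spans change.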

●画面表示例● ●Screen display example●
図３０は、テキストに対する非公開ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。作成者以外の利用者の通信端末３のディスプレイ３１８には、図２４で示した処理及び上述した丸Ａの処理に基づいて、表示制御部３４によって、図３０に示した記録閲覧画面３１６１が表示される。この場合、記録閲覧画面３１６１では、図２５で指定した特定のテキストに対する非表示操作に基づいて、特定のテキスト(「2020年度下期の売上は〇〇です。」)がテキスト表示欄から非表示処理された状態が表示されている。このとき、テキスト表示欄はグレーのハッチング処理、網目模様処理等の処理によってテキストが解読できない状態となっていてもよい。さらに、テキスト表示欄は、非表示処理されたテキストを表示させない処理を施すものでもよい。記録閲覧画面３１６１では、更に、特定のテキストに対応する画面５の画像も併せて非表示処理される。この場合、画面５の画像は空白画像となっていてもよい。また、画面５の画像から別の画像に置き換えられていてもよい。記録閲覧画面３１６１では、更に、特定のテキストに対応する音声データも無音化処理される。この場合、特定のテキストに対応する発話音声が再生されるべき時間に、その発話音声が無音化処理される。また、無音化処理に代えて、他の信号音等に変換されていてもよい。 FIG. 30 is an example of a screen display displayed on a communication terminal of a user other than the creator when the private button for the text is operated. On the display 318 of the communication terminal 3 of the user other than the creator, the display control unit 34 displays the record viewing screen 3161 shown in FIG. 30 based on the process shown in FIG. 24 and the process of circle A described above. In this case, the record viewing screen 3161 displays a state in which the specific text ("Sales for the second half of fiscal year 2020 are XX.") has been hidden from the text display field based on the hide operation for the specific text specified in FIG. 25. At this time, the text display field may be in a state in which the text cannot be deciphered by a process such as gray hatching or mesh pattern processing. Furthermore, the text display field may be processed so that the hidden text is not displayed. In the record viewing screen 3161, the image of the screen 5 corresponding to the specific text is also hidden. In this case, the image of the screen 5 may be a blank image. Also, the image of the screen 5 may be replaced with another image. Furthermore, the audio data corresponding to the specific text is also muted on the record viewing screen 3161. In this case, the uttered voice corresponding to the specific text is muted at the time when the uttered voice is to be played. Instead of being muted, the uttered voice may be converted into another signal sound or the like.
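The muting of the audio span that corresponds to a hidden text can be sketched on a plain list of samples. This is only an assumed illustration: the specification does not say how the audio is stored or muted, and the function name, the sample-list representation, and the use of seconds for the start and end times are all hypothetical.

```python
from typing import List


def mute_interval(
    samples: List[int], sample_rate: int, start_s: float, end_s: float
) -> List[int]:
    """Return a copy of the samples with the span [start_s, end_s) set to zero."""
    first = max(0, int(start_s * sample_rate))
    last = min(len(samples), int(end_s * sample_rate))
    if last <= first:
        # Empty or out-of-range span: nothing to mute.
        return list(samples)
    # Keep the audio before and after the span, and silence the span itself.
    return samples[:first] + [0] * (last - first) + samples[last:]
```

Because a copy is returned, the original samples stay available, which matches the idea that a later "Public" operation can make the audio playable again.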

<<テキストに対する公開ボタン操作時の処理>> <<Processing when the Public button is operated on text>>
次に、テキストに対する公開ボタン操作時の処理について説明する。図３１は、テキストに対する公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図２８Ａで判断された遷移先としての丸Ｂの処理(ステップＳ１１３－１－２０１からステップＳ１１３－１－２０７)が実行される。 Next, the process performed when the Public button is operated on text will be described. Fig. 31 is a flowchart showing an example of the process performed when the Public button is operated on text. In this flowchart, the process of circle B, the transition destination determined in Fig. 28A (steps S113-1-201 to S113-1-207), is executed.

まず、音声記録管理装置５の設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている、ステップＳ１１２で受信したテキスト識別情報に対応する公開フラグを「True」に設定する（ステップＳ１１３－１－２０１）。 First, the setting registration unit 58 of the voice recording management device 5 sets the public flag corresponding to the text identification information received in step S112, which is managed in the text information management DB 5003 (see Figure 7), to "True" (step S113-1-201).

次に、取得部５２は、テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－１－２０２）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information as a search key to acquire the time information of the corresponding start time and end time (step S113-1-202).

次に、取得部５２は、テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対象テキストに対応する画像識別情報を取得する（ステップＳ１１３－１－２０３）。このときの画像識別情報は、対象テキストを含む画面をキャプチャ処理したキャプチャ画像を識別するための情報である。 Then, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information as a search key to acquire image identification information corresponding to the target text (step S113-1-203). The image identification information at this time is information for identifying a capture image obtained by capturing a screen including the target text.

次に、取得部５２は、取得した画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、取得した画像識別情報で示されるキャプチャ画像に対応する各テキストの公開フラグを取得する（ステップＳ１１３－１－２０４）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the acquired image identification information as a search key to acquire the public flag of each text corresponding to the capture image indicated by the acquired image identification information (step S113-1-204).

次に、判断部５５は、取得した公開フラグが全て「True」であるかを判断する（ステップＳ１１３－１－２０５）。取得した公開フラグが全て「True」である場合（ステップＳ１１３－１－２０５：ＹＥＳ）、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、取得した画像識別情報で示されるキャプチャ画像の公開フラグを「True」に設定する（ステップＳ１１３－１－２０６）。 Next, the judgment unit 55 judges whether all of the acquired public flags are "True" (step S113-1-205). If all of the acquired public flags are "True" (step S113-1-205: YES), the setting registration unit 58 sets the public flag of the capture image, which is managed in the capture image management DB 5004 (see FIG. 8A) and indicated by the acquired image identification information, to "True" (step S113-1-206).

続いて、設定登録部５８は、非公開音声管理ＤＢ５００６（図９参照）で管理されている開始時刻及び終了時刻の各時刻情報を削除して、このフローを抜ける（ステップＳ１１３－１－２０７）。 Then, the setting registration unit 58 deletes the time information for the start time and end time managed in the private audio management DB 5006 (see Figure 9) and exits this flow (step S113-1-207).

他方、取得した公開フラグが全て「True」でない場合、すなわち、少なくとも一つの公開フラグが「False」である場合（ステップＳ１１３－１－２０５：ＮＯ）、上述したステップＳ１１３－１－２０７の処理を実行してこのフローを抜ける。 On the other hand, if not all of the acquired public flags are "True", that is, if at least one public flag is "False" (step S113-1-205: NO), the process of step S113-1-207 described above is executed and the flow is exited.
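Steps S113-1-201 to S113-1-207 can be sketched in the same style, again with dictionaries standing in for the databases. The layouts, key names, and function name are hypothetical. The point of the sketch is the rule that the captured image becomes public again only when every text associated with it is public, while the muted audio span for the operated text is removed in either case.

```python
from typing import Dict, List, Tuple

Interval = Tuple[float, float]  # (start time, end time), assumed to be in seconds


def make_text_public(
    text_id: str,
    text_db: Dict[str, dict],       # stand-in for the text information management DB
    image_db: Dict[str, dict],      # stand-in for the capture image management DB
    muted_intervals: List[Interval],
) -> bool:
    """Republish one text; return whether its captured image is now public."""
    row = text_db[text_id]
    row["public"] = True                                  # step S113-1-201
    interval = (row["start"], row["end"])                 # step S113-1-202
    image_id = row["image_id"]                            # step S113-1-203
    flags = [r["public"] for r in text_db.values()
             if r["image_id"] == image_id]                # step S113-1-204
    if all(flags):                                        # step S113-1-205
        image_db[image_id]["public"] = True               # step S113-1-206
    if interval in muted_intervals:                       # step S113-1-207
        muted_intervals.remove(interval)
    return image_db[image_id]["public"]
```

With two hidden texts sharing one image, republishing the first leaves the image hidden, and republishing the second makes it visible again.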

●画面表示例● ●Screen display example●
図３２は、作成者の通信端末におけるテキストに対する公開ボタン操作時の画面表示例である。作成者の通信端末３のディスプレイ３１８には、図２４で示した処理及び上述した丸Ｂの処理に基づいて、表示制御部３４によって、記録閲覧編集画面３１５１が表示される。記録閲覧編集画面３１５１では、図２５で示した記録閲覧編集画面３１５１における「非公開」ボタン(アイコン)３５４２の表示後に、「非公開」ボタン(アイコン)３５４２に代えて「公開」ボタン(アイコン)３５４７が表示された状態が示されている。つまり、「非公開」ボタン(アイコン)３５４２と「公開」ボタン(アイコン)３５４７は、同一のテキストに対して対となって記録閲覧編集画面３１５１上に表示される。「公開」ボタン(アイコン)３５４７は、作成者以外の利用者の通信端末３における記録閲覧画面３１６１において、特定のテキストが非公開(非表示)にされた後、再度特定のテキストを記録閲覧画面３１６１に表示させるときに操作されるボタン又はアイコンである。記録閲覧編集画面３１５１では、作成者が特定のテキストに対して「非公開」処理を行うと、例えば、特定のテキストの横に、「非公開」に設定されたことを示す特有のマークが表示制御部３４によって表示されてもよい。この特有のマークは、例えば、四角の中に「非」という文字を入れた「非公開」マーク３２０１である。これにより、作成者は、どのテキストを非公開処理したかを簡単に見分けることができる。なお、「非公開」マーク３２０１の内容は、上述した四角の中に「非」という文字を入れた内容に限らず、「非公開」という記号と文字の組合せであってもよい。更に、「非公開」マーク３２０１の表示位置は、上述した特定のテキストの横に限らず、非公開処理が行われたテキストの近傍であればどの位置でもよい。 FIG. 32 is an example of a screen displayed on the creator's communication terminal when the Public button is operated for text. On the display 318 of the creator's communication terminal 3, the display control unit 34 displays the record viewing and editing screen 3151 based on the process shown in FIG. 24 and the process of circle B described above. The record viewing and editing screen 3151 here shows a state in which, after the "Private" button (icon) 3542 was displayed on the record viewing and editing screen 3151 shown in FIG. 25, the "Public" button (icon) 3547 is displayed in place of the "Private" button (icon) 3542. In other words, the "Private" button (icon) 3542 and the "Public" button (icon) 3547 are displayed on the record viewing and editing screen 3151 as a pair for the same text. The "Public" button (icon) 3547 is a button or icon operated to display a specific text again on the record viewing screen 3161 of the communication terminal 3 of a user other than the creator after that text has been made private (hidden). On the record viewing and editing screen 3151, when the creator performs the "Private" process on a specific text, the display control unit 34 may display, for example next to that text, a distinctive mark indicating that the text has been set to "Private". This mark is, for example, a "Private" mark 3201 consisting of the character "非" inside a square. This allows the creator to easily tell which texts have been made private. The content of the "Private" mark 3201 is not limited to the character "非" inside a square and may instead be a combination of a symbol and the characters "非公開". Furthermore, the display position of the "Private" mark 3201 is not limited to the side of the specific text and may be any position in the vicinity of the text that has been made private.

作成者によって非公開処理されたテキストが再度マウスオーバー操作されると、表示制御部３４によって、非公開処理されたテキストの近傍に「公開」ボタン(アイコン)３５４７が表示される。これに伴い、「非公開」マーク３２０１は非表示となる。つまり、記録閲覧編集画面３１５１において、あるテキストに対する「非公開」ボタン(アイコン)３５４２と「公開」ボタン(アイコン)３５４７は、トグル表示されることになる。この「公開」ボタン(アイコン)３５４７が作成者の通信端末３において操作されると、作成者以外の利用者の通信端末３で非公開となっていた特定のテキストが再表示されるとともに、画面５の画像も再表示される。さらに、特定のテキストに対応する音声データも、元の再生可能状態となり、特定のテキストに対応する音声を聞くことが可能となる。 When the mouse is again placed over text that has been made private by the creator, the display control unit 34 displays a "Public" button (icon) 3547 near the text that has been made private. Accordingly, the "Private" mark 3201 is hidden. In other words, on the record viewing and editing screen 3151, the "Private" button (icon) 3542 and the "Public" button (icon) 3547 for a certain text are displayed in a toggle manner. When this "Public" button (icon) 3547 is operated on the communication terminal 3 of the creator, the specific text that was made private on the communication terminal 3 of a user other than the creator is redisplayed, and the image on screen 5 is also redisplayed. Furthermore, the audio data corresponding to the specific text is restored to its original playable state, making it possible to listen to the audio corresponding to the specific text.

<<テキストに対する削除ボタン操作時の処理>> <<Processing when the delete button is used on text>>
次に、テキストに対する削除ボタン操作時の処理について説明する。図３３は、テキストに対する削除ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図２８Ａで判断された遷移先としての丸Ｃの処理(ステップＳ１１３－１－３０１からステップＳ１１３－１－３０７)が実行される。 Next, the process when the delete button is operated on the text will be described. Fig. 33 is a flowchart showing an example of the process when the delete button is operated on the text. In this flowchart, the process of circle C as the transition destination determined in Fig. 28A (steps S113-1-301 to S113-1-307) is executed.

まず、音声記録管理装置５の設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている、ステップＳ１１２で受信したテキスト識別情報に対応するテキストを空白に設定し、削除フラグを「True」に設定する（ステップＳ１１３－１－３０１）。つまり、この時点で対象テキストの内容が削除される。 First, the setting registration unit 58 of the voice recording management device 5 sets the text managed in the text information management DB 5003 (see FIG. 7) that corresponds to the text identification information received in step S112 to blank, and sets the deletion flag to "True" (step S113-1-301). In other words, the contents of the target text are deleted at this point.

次に、取得部５２は、テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対象テキストに対応する画像識別情報を取得する（ステップＳ１１３－１－３０２）。このときの画像識別情報は、対象テキストを含む画面をキャプチャ処理したキャプチャ画像を識別するための情報である。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information as a search key to acquire image identification information corresponding to the target text (step S113-1-302). The image identification information at this time is information for identifying a capture image obtained by capturing a screen including the target text.

次に、取得部５２は、取得した画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、取得した画像識別情報で示されるキャプチャ画像に対応する各テキストの公開フラグを取得する（ステップＳ１１３－１－３０３）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the acquired image identification information as a search key to acquire the public flag of each text corresponding to the capture image indicated by the acquired image identification information (step S113-1-303).

次に、判断部５５は、取得した削除フラグが全て「True」であるかを判断する（ステップＳ１１３－１－３０４）。取得した削除フラグが全て「True」である場合（ステップＳ１１３－１－３０４：ＹＥＳ）、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、取得した画像識別情報に対応するキャプチャ画像の画像データを所定の画像に置き換え、公開フラグを「True」に設定する（ステップＳ１１３－１－３０５）。 Next, the judgment unit 55 judges whether all of the acquired deletion flags are "True" (step S113-1-304). If all of the acquired deletion flags are "True" (step S113-1-304: YES), the setting registration unit 58 replaces the image data of the capture image managed in the capture image management DB 5004 (see FIG. 8A) and corresponding to the acquired image identification information with a predetermined image, and sets the public flag to "True" (step S113-1-305).

続いて、取得部５２は、テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－１－３０６）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information as a search key to acquire the time information of the corresponding start time and end time (step S113-1-306).

続いて、設定登録部５８は、取得した開始時刻及び終了時刻の間の音声データであり、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている音声データを削除（無音化）処理してこのフローを抜ける（ステップＳ１１３－１－３０７）。 Next, the setting registration unit 58 deletes (silences) the voice data between the acquired start time and end time, which is managed in the record bibliographic information management DB 5002 (see Figure 6), and exits this flow (step S113-1-307).

他方、取得した削除フラグが全て「True」でない場合、すなわち、少なくとも一つの削除フラグが「False」である場合（ステップＳ１１３－１－３０４：ＮＯ）、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、取得した画像識別情報に対応する各キャプチャ画像の公開フラグを「False」に設定し（ステップＳ１１３－１－３０８）、以降、ステップＳ１１３－１－３０６，Ｓ１１３－１－３０７の処理を実行してこのフローを抜ける。 On the other hand, if not all of the acquired deletion flags are "True", that is, if at least one deletion flag is "False" (step S113-1-304: NO), the setting registration unit 58 sets the public flag of each capture image that is managed in the capture image management DB 5004 (see FIG. 8A) and corresponds to the acquired image identification information to "False" (step S113-1-308), and thereafter executes the processing of steps S113-1-306 and S113-1-307 to exit this flow.
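
The circle C flow described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiment: the in-memory dictionaries, the field names (`deleted`, `public`, `image_id`), the `PLACEHOLDER` image, and the one-sample-per-second audio list are hypothetical stand-ins for the text information management DB 5003, the capture image management DB 5004, and the record bibliographic information management DB 5002. The sketch also assumes that step S113-1-303 gathers the deletion flags that step S113-1-304 then tests.

```python
from dataclasses import dataclass

PLACEHOLDER = b"<blank>"  # hypothetical stand-in for the predetermined image


@dataclass
class TextRow:
    text: str
    image_id: str
    start: int  # start time, in seconds from the start of the recording
    end: int    # end time, in seconds from the start of the recording
    public: bool = True
    deleted: bool = False


@dataclass
class ImageRow:
    data: bytes
    public: bool = True


def delete_text(text_id, texts, images, audio):
    """Circle C sketch: delete one text, then update its image and audio."""
    row = texts[text_id]
    row.text, row.deleted = "", True                     # S113-1-301
    image_id = row.image_id                              # S113-1-302
    flags = [t.deleted for t in texts.values()
             if t.image_id == image_id]                  # S113-1-303 (assumed)
    image = images[image_id]
    if all(flags):                                       # S113-1-304: YES
        image.data, image.public = PLACEHOLDER, True     # S113-1-305
    else:                                                # S113-1-304: NO
        image.public = False                             # S113-1-308
    start, end = row.start, row.end                      # S113-1-306
    for i in range(start, min(end, len(audio))):         # S113-1-307
        audio[i] = 0                                     # silence the span
```

Calling `delete_text` for the last remaining text of an image replaces the image with the placeholder; until then the image is only hidden.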

●画面表示例●
図３４は、テキストに対する削除ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。作成者以外の利用者の通信端末３のディスプレイ３１８には、図２４で示した処理及び上述した丸Ｃの処理に基づいて、表示制御部３４によって、記録閲覧画面３１６１が表示される。この場合、記録閲覧画面３１６１では、図２５で指定した特定のテキストに対する削除操作に基づいて、他の特定のテキスト(「本日の議題はヘルスケア事業の業績です。」)がテキスト表示欄から削除処理された状態が表示されている。さらに、他の特定のテキストに対応付けられた画面４の画像も削除対象となるため、画面４の画像も削除処理される。その結果、例えば、他の特定のテキストが表示されていたテキスト表示欄は黒塗り処理され、画面４は空白画像又は他の画像に置き換えられた画像が表示される。更に、削除処理が実行された場合は、削除対象となったテキスト及びテキストに対応付けられた画像は、以降、公開(復元)できない状態となる。記録閲覧画面３１６１では、更に、他の特定のテキストに対応する音声データも無音化処理される。この場合、他の特定のテキストに対応する発話音声が再生されるべき時間に、その発話音声が無音化処理される。なお、無音化処理に代えて、他の信号音等への変換処理が行われてもよい。 ●Screen display example●
FIG. 34 is an example of a screen display displayed on a communication terminal of a user other than the creator when the delete button for the text is operated. On the display 318 of the communication terminal 3 of the user other than the creator, a record viewing screen 3161 is displayed by the display control unit 34 based on the process shown in FIG. 24 and the process of circle C described above. In this case, the record viewing screen 3161 displays a state in which the other specific text ("Today's agenda is the performance of the healthcare business.") has been deleted from the text display field based on the delete operation for the specific text specified in FIG. 25. Furthermore, since the image of the screen 4 corresponding to the other specific text is also subject to deletion, the image of the screen 4 is also deleted. As a result, for example, the text display field in which the other specific text was displayed is blacked out, and the screen 4 displays a blank image or an image replaced with another image. Furthermore, when the delete process is executed, the text to be deleted and the image corresponding to the text cannot be made public (restored) thereafter. Furthermore, on the record viewing screen 3161, the audio data corresponding to the other specific text is also muted. In this case, the speech corresponding to the other specific text is silenced at the time when the speech is to be reproduced. Note that instead of the silence process, a conversion process to another signal sound or the like may be performed.

<<キャプチャ画像に対する非公開ボタン操作時の処理>>
次に、キャプチャ画像に対する非公開ボタン操作時の処理について説明する。図３５は、キャプチャ画像に対する非公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図２８Ｂで判断された遷移先としての丸Ｄの処理(ステップＳ１１３－２－１０１からステップＳ１１３－２－１０５)が実行される。 <<Processing when pressing the private button on a captured image>>
Next, the process when the private button is operated on a captured image will be described. Fig. 35 is a flowchart showing an example of the process when the private button is operated on a captured image. In this flowchart, the process of circle D as the transition destination determined in Fig. 28B (steps S113-2-101 to S113-2-105) is executed.

まず、音声記録管理装置５の設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、ステップＳ１１２で受信した対象キャプチャ画像の画像識別情報に対応付けられた公開フラグを「False」に設定する（ステップＳ１１３－２－１０１）。 First, the setting registration unit 58 of the audio recording management device 5 sets the public flag associated with the image identification information of the target capture image managed in the capture image management DB 5004 (see FIG. 8A) and received in step S112 to "False" (step S113-2-101).

次に、取得部５２は、画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対象キャプチャ画像に対応する各テキストのテキスト識別情報を取得する（ステップＳ１１３－２－１０２）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the image identification information as a search key to acquire text identification information for each text corresponding to the target captured image (step S113-2-102).

次に、設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理され、取得した各テキスト識別情報に対応する各テキストの公開フラグを「False」に設定する（ステップＳ１１３－２－１０３）。 Next, the setting registration unit 58 sets the public flag of each text that is managed in the text information management DB 5003 (see FIG. 7) and corresponds to each acquired text identification information to "False" (step S113-2-103).

次に、取得部５２は、取得した各テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－２－１０４）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using each acquired text identification information as a search key to acquire time information for the start time and end time of each corresponding text (step S113-2-104).

次に、設定登録部５８は、取得した各時刻情報のうち、最も早い開始時刻及び最も遅い終了時刻の各時刻情報を、非公開音声管理ＤＢ５００６（図９参照）に登録する（ステップＳ１１３－２－１０５）。なお、上述したステップＳ１１３－２－１０３からステップＳ１１３－２－１０５までの処理は丸Ｘの処理として定義され、以降の処理において再度実行される。 Next, the setting registration unit 58 registers the earliest start time and the latest end time of the acquired time information in the private audio management DB 5006 (see FIG. 9) (step S113-2-105). Note that the above-mentioned processes from step S113-2-103 to step S113-2-105 are defined as the circle X process, and are executed again in the subsequent processes.
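
The circle D flow described above can be sketched as follows. This is an illustrative sketch only: the dictionary layout, the key names (`image_ids`, `public`, `start`, `end`), and the `private_audio` list standing in for the private audio management DB 5006 are assumptions. Times are zero-padded "HH:MM:SS" strings, which compare chronologically as plain strings.

```python
def hide_image(image_id, image_public, texts, private_audio):
    """Circle D sketch: hide a captured image, its texts, and their audio span."""
    image_public[image_id] = False                           # S113-2-101
    ids = [tid for tid, t in texts.items()
           if image_id in t["image_ids"]]                    # S113-2-102
    for tid in ids:
        texts[tid]["public"] = False                         # S113-2-103
    spans = [(texts[tid]["start"], texts[tid]["end"])
             for tid in ids]                                 # S113-2-104
    if spans:
        earliest = min(start for start, _ in spans)
        latest = max(end for _, end in spans)
        private_audio.append((earliest, latest))             # S113-2-105
    return ids
```

Registering a single (earliest start, latest end) pair means the whole stretch of audio covered by the hidden texts is treated as private, including any pauses between utterances.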

●画面表示例●
図３６は、キャプチャ画像に対する非公開ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。作成者以外の利用者の通信端末３のディスプレイ３１８には、図２４で示した処理及び上述した丸Ｄの処理に基づいて、表示制御部３４によって、記録閲覧画面３１６１が表示される。この場合、記録閲覧画面３１６１では、図２５で指定した特定の画像に対する非公開操作に基づいて、特定の画像(画面５に表示されたキャプチャ画像)と特定の画像に対応付けられた一以上のテキストが非表示処理された状態が表示されている。一以上のテキストには、画面５のキャプチャ画像が取得された時刻(「11:02:00」)以降に発話された三つのテキスト(「11:02:05」、「11:02:18」、「11:02:25」)に加えて、画面４のキャプチャ画像が取得された時刻(「11:01:30」)以降に発話され、「11:02:00」に跨る時刻(「11:01:59」)に発話されたテキストが含まれる。すなわち、画面５を表す画面データを取得した取得時刻(「11:02:00」)に跨って特定のテキストデータが存在する場合、取得時刻に取得された画面５の画面データと、画面５の画面データに対応付けられた三つのテキストのテキストデータと、特定のテキストデータとが、非公開処理の対象となる。これは、「11:02:00」に跨る時刻(「11:01:59」)に発話されたテキストの内容は、画面５の画像の内容に関与する可能性が高いと判断されるため、画面５の画像に対応付けられた三つのテキストと同様に非表示処理を行う必要があるという前提による。なお、非表示処理は、図３０に示した方法と同様でよい。 ●Screen display example●
Fig. 36 is an example of a screen display displayed on a communication terminal of a user other than the creator when the private button for a captured image is operated. On the display 318 of the communication terminal 3 of the user other than the creator, a record viewing screen 3161 is displayed by the display control unit 34 based on the process shown in Fig. 24 and the process of circle D described above. In this case, the record viewing screen 3161 displays a state in which the specific image (the captured image displayed on the screen 5) and one or more texts associated with the specific image have been hidden based on the private operation for the specific image specified in Fig. 25. The one or more pieces of text include three pieces of text ("11:02:05", "11:02:18", and "11:02:25") uttered after the time ("11:02:00") when the capture image of screen 5 was acquired, as well as a piece of text uttered after the time ("11:01:30") when the capture image of screen 4 was acquired and at a time ("11:01:59") that straddles "11:02:00". In other words, when specific text data exists that straddles the acquisition time ("11:02:00") when the screen data representing screen 5 was acquired, the screen data of screen 5 acquired at the acquisition time, the text data of the three pieces of text associated with the screen data of screen 5, and the specific text data are subject to non-disclosure processing. This is based on the premise that the content of the text uttered at a time ("11:01:59") that straddles "11:02:00" is judged likely to be related to the content of the image on screen 5, and therefore needs to be hidden in the same way as the three pieces of text associated with the image on screen 5. The hiding process may be the same as the method shown in FIG. 30.

これまでに説明した通り、上述した非表示対象となるテキストに対応付けられた発話音声(音声データ)も同様に無音化処理される。なお、無音化処理に代えて、他の信号音等への変換処理が行われてもよい。 As explained above, the spoken voice (audio data) associated with the text to be hidden is also muted in the same way. Note that instead of muting, conversion to other signal sounds, etc. may be performed.
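
The rule described for FIG. 36, under which a text straddling the capture time is grouped with the captured image, can be sketched as follows. The function name, the `(start, end)` tuple layout, and all end times in the usage below are hypothetical; only the capture times and utterance start times come from the description. Times are zero-padded "HH:MM:SS" strings.

```python
def texts_for_capture(capture_time, next_capture_time, texts):
    """Return the IDs of texts tied to the capture taken at capture_time.

    texts maps a text ID to a (start, end) pair of "HH:MM:SS" strings.
    next_capture_time is None when no later capture exists.
    """
    selected = []
    for text_id, (start, end) in texts.items():
        # Uttered while this capture was the current screen.
        spoken_during = capture_time <= start and (
            next_capture_time is None or start < next_capture_time)
        # Began before the capture and was still being spoken at capture time.
        straddles = start < capture_time < end
        if spoken_during or straddles:
            selected.append(text_id)
    return selected
```

With the times from the example, the text that starts at "11:01:59" is returned both for the screen 4 capture and for the screen 5 capture, which is why hiding screen 5 also hides that text.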

<<キャプチャ画像に対する公開ボタン操作時の処理>>
続いて、キャプチャ画像に対する公開ボタン操作時の処理について説明する。図３７は、キャプチャ画像に対する公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図２８Ｂで判断された遷移先としての丸Ｅの処理(ステップＳ１１３－２－２０１からステップＳ１１３－２－２１１)が実行される。 <<Processing when the publish button is operated on a captured image>>
Next, the process when the publish button is operated for a captured image will be described. Fig. 37 is a flowchart showing an example of the process when the publish button is operated for a captured image. In this flowchart, the process of circle E as the transition destination determined in Fig. 28B (steps S113-2-201 to S113-2-211) is executed.

まず、音声記録管理装置５の設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、ステップＳ１１２で受信した対象キャプチャ画像の画像識別情報に対応付けられた公開フラグを「True」に設定する（ステップＳ１１３－２－２０１）。 First, the setting registration unit 58 of the audio recording management device 5 sets the public flag associated with the image identification information of the target capture image managed in the capture image management DB 5004 (see FIG. 8A) and received in step S112 to "True" (step S113-2-201).

次に、取得部５２は、画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対象キャプチャ画像に対応する各テキストのテキスト識別情報を取得する（ステップＳ１１３－２－２０２）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the image identification information as a search key to acquire text identification information for each text corresponding to the target captured image (step S113-2-202).

続いて取得部５２は、取得した各テキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、各テキスト識別情報に対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－２－２０３）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using each acquired text identification information as a search key to acquire time information for the start time and end time of each text corresponding to each text identification information (step S113-2-203).

次に、設定登録部５８は、取得した各時刻情報に基づいて、非公開音声管理ＤＢ５００６（図９参照）で管理されている時刻情報のうち、最も早い開始時刻及び最も遅い終了時刻の各時刻情報を削除する（ステップＳ１１３－２－２０４）。 Next, based on the acquired time information, the setting registration unit 58 deletes the time information of the earliest start time and the latest end time from the time information managed in the private audio management DB 5006 (see Figure 9) (step S113-2-204).

次に、音声記録管理装置５は、以下のステップＳ１１３－２－２０５からステップＳ１１３－２－２０９までの処理を繰り返し実行する。具体的には、音声記録管理装置５は、特定されたテキストごとに以下の処理を実行する（ステップＳ１１３－２－２０５）。 Then, the voice recording management device 5 repeatedly executes the processes from step S113-2-205 to step S113-2-209 below. Specifically, the voice recording management device 5 executes the following processes for each identified text (step S113-2-205).

まず、取得部５２は、対象テキストに対応する各キャプチャ画像の画像識別情報を検索キーとしてキャプチャ画像管理ＤＢ５００４（図８Ａ参照）を検索することにより、対応する公開フラグを取得する（ステップＳ１１３－２－２０６）。 First, the acquisition unit 52 searches the capture image management DB 5004 (see FIG. 8A) using the image identification information of each capture image corresponding to the target text as a search key to acquire the corresponding public flag (step S113-2-206).

次に、判断部５５は、取得した各キャプチャ画像に係る公開フラグが全て「True」であるかを判断する（ステップＳ１１３－２－２０７）。 Next, the judgment unit 55 judges whether the public flags for each acquired capture image are all "True" (step S113-2-207).

取得した各キャプチャ画像に係る公開フラグが全て「True」である場合（ステップＳ１１３－２－２０７：ＹＥＳ）、設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている対象テキストの公開フラグを「True」に設定し（ステップＳ１１３－２－２０８）、次のステップＳ１１３－２－２０９を経てステップＳ１１３－２－２０５に戻る。このステップＳ１１３－２－２０５からステップＳ１１３－２－２０９までの処理は、テキストごとに繰り返される。 If the public flags for all of the acquired captured images are "True" (step S113-2-207: YES), the setting registration unit 58 sets the public flag of the target text managed in the text information management DB 5003 (see FIG. 7) to "True" (step S113-2-208), and returns to step S113-2-205 via the next step S113-2-209. The processes from step S113-2-205 to step S113-2-209 are repeated for each text.

他方、取得した各キャプチャ画像に係る公開フラグが全て「True」でない場合、すなわち、少なくとも一つの公開フラグが「False」である場合（ステップＳ１１３－２－２０７：ＮＯ）、取得部５２は、公開フラグに対応するテキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する開始時刻及び終了時刻を取得する（ステップＳ１１３－２－２１０）。 On the other hand, if not all of the public flags for the acquired capture images are "True", i.e., if at least one public flag is "False" (step S113-2-207: NO), the acquisition unit 52 acquires the corresponding start time and end time by searching the text information management DB 5003 (see FIG. 7) using the text identification information corresponding to the public flag as a search key (step S113-2-210).

続いて、設定登録部５８は、取得した開始時刻及び終了時刻の各時刻情報を非公開音声管理ＤＢ５００６（図９参照）に登録し（ステップＳ１１３－２－２１１）、ステップＳ１１３－２－２０９に遷移してステップＳ１１３－２－２０５の処理に戻る。そして、ステップＳ１１３－２－２０５からステップＳ１１３－２－２０９までの繰返し処理が完了したとき、音声記録管理装置５は、このフローを抜ける。なお、上述したステップＳ１１３－２－２０３からステップＳ１１３－２－２１１までの処理は丸Ｙの処理として定義され、以降の処理において再度実行される。 Then, the setting registration unit 58 registers the acquired start time and end time information in the private audio management DB 5006 (see FIG. 9) (step S113-2-211), transitions to step S113-2-209, and returns to the processing of step S113-2-205. Then, when the repeated processing from step S113-2-205 to step S113-2-209 is completed, the audio recording management device 5 exits this flow. Note that the processing from step S113-2-203 to step S113-2-211 described above is defined as the circle Y process, and is executed again in the subsequent processes.
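
The circle E flow described above can be sketched as follows. As before, this is a sketch under assumptions rather than the embodiment's implementation: the dictionary layout, the key names, and the `private_audio` list standing in for the private audio management DB 5006 are hypothetical, and times are zero-padded "HH:MM:SS" strings.

```python
def publish_image(image_id, image_public, texts, private_audio):
    """Circle E sketch: republish an image and each text it no longer blocks."""
    image_public[image_id] = True                            # S113-2-201
    ids = [tid for tid, t in texts.items()
           if image_id in t["image_ids"]]                    # S113-2-202
    spans = [(texts[tid]["start"], texts[tid]["end"])
             for tid in ids]                                 # S113-2-203
    if spans:
        whole = (min(s for s, _ in spans), max(e for _, e in spans))
        if whole in private_audio:
            private_audio.remove(whole)                      # S113-2-204
    for tid in ids:                                          # S113-2-205 to 209
        row = texts[tid]
        flags = [image_public[i] for i in row["image_ids"]]  # S113-2-206
        if all(flags):                                       # S113-2-207: YES
            row["public"] = True                             # S113-2-208
        else:                                                # S113-2-207: NO
            span = (row["start"], row["end"])                # S113-2-210
            private_audio.append(span)                       # S113-2-211
```

A text tied to two captured images becomes public again only when both images are public; otherwise its own time span is registered again as private audio.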

<<キャプチャ画像に対する削除ボタン操作時の処理>>
次に、キャプチャ画像に対する削除ボタン操作時の処理について説明する。図３８は、キャプチャ画像に対する削除ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図２８Ｂで判断された遷移先としての丸Ｆの処理(ステップＳ１１３－２－３０１からステップＳ１１３－２－３１２)が実行される。 <<Processing when the delete button is used on a captured image>>
Next, the process when the delete button is operated on a captured image will be described. Fig. 38 is a flowchart showing an example of the process when the delete button is operated on a captured image. In this flowchart, the process of circle F as the transition destination determined in Fig. 28B (steps S113-2-301 to S113-2-312) is executed.

まず、音声記録管理装置５の設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、ステップＳ１１２で受信した画像識別情報に対応する対象キャプチャ画像の画像データを所定の画像データに置き換え(書き換え)、画像識別情報に対応する削除フラグを「True」に設定する（ステップＳ１１３－２－３０１）。 First, the setting registration unit 58 of the audio recording management device 5 replaces (rewrites) the image data of the target capture image managed by the capture image management DB 5004 (see FIG. 8A) and corresponding to the image identification information received in step S112 with specified image data, and sets the deletion flag corresponding to the image identification information to "True" (step S113-2-301).

次に、取得部５２は、画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対象キャプチャ画像に対応する各テキストのテキスト識別情報を取得する（ステップＳ１１３－２－３０２）。 Next, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the image identification information as a search key to acquire text identification information for each text corresponding to the target captured image (step S113-2-302).

次に、音声記録管理装置５は、以下のステップＳ１１３－２－３０３からステップＳ１１３－２－３１２までの処理を繰り返し実行する。具体的には、音声記録管理装置５は、特定されたテキストごとに以下の処理を実行する（ステップＳ１１３－２－３０３）。 Then, the voice recording management device 5 repeatedly executes the processes from step S113-2-303 to step S113-2-312 below. Specifically, the voice recording management device 5 executes the following processes for each identified text (step S113-2-303).

まず、取得部５２は、対象テキストに対応する各キャプチャ画像の画像識別情報を検索キーとしてキャプチャ画像管理ＤＢ５００４（図８Ａ参照）を検索することにより、対応する削除フラグを取得する（ステップＳ１１３－２－３０４）。 First, the acquisition unit 52 searches the capture image management DB 5004 (see FIG. 8A) using the image identification information of each capture image corresponding to the target text as a search key to acquire the corresponding deletion flag (step S113-2-304).

次に、判断部５５は、取得した各キャプチャ画像に係る削除フラグが全て「True」であるかを判断する（ステップＳ１１３－２－３０５）。 Next, the judgment unit 55 judges whether the deletion flags for each acquired capture image are all "True" (step S113-2-305).

取得した各キャプチャ画像に係る削除フラグが全て「True」である場合（ステップＳ１１３－２－３０５：ＹＥＳ）、設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理され、画像識別情報に対応付けられた対象テキストを空白に設定(削除)し、対象テキストに対応する削除フラグを「True」に設定する（ステップＳ１１３－２－３０６）。 If all of the deletion flags for each acquired capture image are "True" (step S113-2-305: YES), the setting registration unit 58 sets the target text managed in the text information management DB 5003 (see FIG. 7) and associated with the image identification information to blank (deletes), and sets the deletion flag corresponding to the target text to "True" (step S113-2-306).

次に、取得部５２は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている、対象テキストに対応する開始時刻及び終了時刻の各時刻情報を取得し（ステップＳ１１３－２－３０７）、次のステップＳ１１３－２－３０８を経てステップＳ１１３－２－３０３に戻る。このステップＳ１１３－２－３０３からステップＳ１１３－２－３０８までの処理は、テキストごとに繰り返される。 Next, the acquisition unit 52 acquires the time information of the start time and end time corresponding to the target text, which are managed in the text information management DB 5003 (see FIG. 7) (step S113-2-307), and returns to step S113-2-303 via the next step S113-2-308. The process from step S113-2-303 to step S113-2-308 is repeated for each text.

他方、取得した各キャプチャ画像に係る削除フラグが全て「True」でない場合、すなわち、少なくとも一つの削除フラグが「False」である場合（ステップＳ１１３－２－３０５：ＮＯ）、設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている対象テキストの公開フラグを「False」に設定する（ステップＳ１１３－２－３０９）。 On the other hand, if not all of the deletion flags for the acquired capture images are "True", i.e., if at least one deletion flag is "False" (step S113-2-305: NO), the setting registration unit 58 sets the public flag of the target text managed in the text information management DB 5003 (see FIG. 7) to "False" (step S113-2-309).

続いて、取得部５２は、対象テキストのテキスト識別情報を検索キーとしてテキスト情報管理ＤＢ５００３（図７参照）を検索することにより、対応する開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－２－３１０）。 Then, the acquisition unit 52 searches the text information management DB 5003 (see FIG. 7) using the text identification information of the target text as a search key to acquire the time information of the corresponding start time and end time (step S113-2-310).

次に、設定登録部５８は、取得した開始時刻及び終了時刻の各時刻情報を、非公開音声管理ＤＢ５００６（図９参照）に登録して（ステップＳ１１３－２－３１１）、ステップＳ１１３－２－３０８を経てステップＳ１１３－２－３０３に戻る。 Next, the setting registration unit 58 registers the acquired start time and end time information in the private audio management DB 5006 (see FIG. 9) (step S113-2-311), and returns to step S113-2-303 via step S113-2-308.

ステップＳ１１３－２－３０８までの繰返し処理が終了すると、設定登録部５８は、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている音声データの一部を削除(無音化)して（ステップＳ１１３－２－３１２）、このフローを抜ける。具体的には、設定登録部５８は、テキスト情報管理ＤＢ５００３（図７参照）で管理されている削除フラグが「True」に設定されたテキストに対応する音声データであり、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている音声データのうち、最も早い開始時刻及び最も遅い終了時刻の間の音声データを削除(無音化)する。なお、上述したステップＳ１１３－２－３０３からステップＳ１１３－２－３１２までの処理は丸Ｚの処理として定義され、以降の処理において再度実行される。 When the repeated processing up to step S113-2-308 is completed, the setting registration unit 58 deletes (silences) a part of the voice data managed in the record bibliographic information management DB 5002 (see FIG. 6) (step S113-2-312) and exits this flow. Specifically, the setting registration unit 58 deletes (silences) the voice data between the earliest start time and the latest end time among the voice data managed in the record bibliographic information management DB 5002 (see FIG. 6), which is voice data corresponding to text managed in the text information management DB 5003 (see FIG. 7) and whose deletion flag is set to "True." Note that the processing from step S113-2-303 to step S113-2-312 described above is defined as the processing of circle Z, and is executed again in the subsequent processing.
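
The circle F flow described above can be sketched as follows. The data layout, the key names, the `PLACEHOLDER` image, and the one-sample-per-second audio list are hypothetical. As a simplifying assumption, the sketch silences only the span covered by texts deleted during this call, whereas the description refers to texts whose deletion flag is "True" in the text information management DB 5003. Times are integer seconds from the start of the recording.

```python
PLACEHOLDER = b"<blank>"  # hypothetical stand-in for the predetermined image data


def delete_image(image_id, images, texts, private_audio, audio):
    """Circle F sketch: delete an image, then delete or hide each of its texts."""
    image = images[image_id]
    image["data"], image["deleted"] = PLACEHOLDER, True       # S113-2-301
    ids = [tid for tid, t in texts.items()
           if image_id in t["image_ids"]]                     # S113-2-302
    deleted_spans = []
    for tid in ids:                                           # S113-2-303 to 308
        row = texts[tid]
        flags = [images[i]["deleted"]
                 for i in row["image_ids"]]                   # S113-2-304
        if all(flags):                                        # S113-2-305: YES
            row["text"], row["deleted"] = "", True            # S113-2-306
            deleted_spans.append((row["start"], row["end"]))  # S113-2-307
        else:                                                 # S113-2-305: NO
            row["public"] = False                             # S113-2-309
            span = (row["start"], row["end"])                 # S113-2-310
            private_audio.append(span)                        # S113-2-311
    if deleted_spans:                                         # S113-2-312
        first = min(start for start, _ in deleted_spans)
        last = max(end for _, end in deleted_spans)
        for i in range(first, min(last, len(audio))):
            audio[i] = 0                                      # silence the span
```

A text that is also tied to an image that has not been deleted is hidden rather than deleted, which matches the treatment of the straddling text in the screen display example that follows.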

●画面表示例●
図３９は、キャプチャ画像に対する削除ボタンが操作された時の作成者以外の利用者の通信端末に表示される画面表示例である。作成者以外の利用者の通信端末３のディスプレイ３１８には、図２４で示した処理及び上述した丸Ｆの処理に基づいて、表示制御部３４によって、記録閲覧画面３１６１が表示される。この場合、記録閲覧画面３１６１では、図２５で指定した特定の画像に対する削除操作に基づいて、特定の画像(画面５に表示されたキャプチャ画像)と特定の画像に対応付けられた一以上のテキストが非表示処理された状態が表示されている。一以上のテキストには、画面５のキャプチャ画像が取得された時刻(「11:02:00」)以降に発話された三つのテキスト(「11:02:05」、「11:02:18」、「11:02:25」)に加えて、画面４のキャプチャ画像が取得された時刻(「11:01:30」)以降に発話され、「11:02:00」に跨る時刻(「11:01:59」)に発話されたテキストが含まれる。すなわち、画面５を表す画面データを取得した取得時刻(「11:02:00」)に跨って特定のテキストデータが存在する場合、取得時刻に取得された画面５の画面データと、画面５の画面データに対応付けられた三つのテキストのテキストデータが削除対象となる。更に、特定のテキストデータが、非公開処理の対象となる。つまり、図３６との相違点は、画面５のキャプチャ画像が取得された時刻(「11:02:00」)以降に発話された三つのテキスト(「11:02:05」、「11:02:18」、「11:02:25」)が非公開ではなく削除される点である。削除されるテキストについては、例えば、黒塗りされた状態となっている。なお、特定のテキストデータに対応付けられた画面４に係る画面データは、非表示処理の対象とならない。これは、画面４の画面データに対応付けられた特定のテキストデータのほかに別のテキストデータが存在するためである。別のテキストデータは、例えば、「2021/03/31 11:01:53」テキストデータである。別のテキストデータは、特定のテキストデータに対応付けられた画面５の画面データには対応付けられていないため、画面４の画面データは、非表示処理の対象にはならない。 ●Screen display example●
FIG. 39 is an example of a screen display displayed on a communication terminal of a user other than the creator when the delete button for a captured image is operated. On the display 318 of the communication terminal 3 of the user other than the creator, a record viewing screen 3161 is displayed by the display control unit 34 based on the process shown in FIG. 24 and the process of circle F described above. In this case, the record viewing screen 3161 displays a state in which a specific image (the captured image displayed on the screen 5) and one or more texts associated with the specific image have been hidden based on the delete operation for the specific image specified in FIG. 25. The one or more texts include three texts ("11:02:05", "11:02:18", "11:02:25") uttered after the time ("11:02:00") when the captured image of the screen 5 was acquired, as well as text uttered after the time ("11:01:30") when the captured image of the screen 4 was acquired and at a time ("11:01:59") that straddles "11:02:00". That is, when specific text data exists across the acquisition time ("11:02:00") when the screen data representing the screen 5 was acquired, the screen data of the screen 5 acquired at the acquisition time and the text data of the three texts associated with the screen data of the screen 5 are deleted. Furthermore, the specific text data is subject to non-disclosure processing. That is, the difference from FIG. 36 is that the three texts ("11:02:05", "11:02:18", "11:02:25") uttered after the time ("11:02:00") when the capture image of the screen 5 was acquired are deleted instead of being made private. The deleted text is, for example, blacked out. Note that the screen data of the screen 4 associated with the specific text data is not subject to the non-display process. This is because other text data exists in addition to the specific text data associated with the screen data of the screen 4. The other text data is, for example, the "2021/03/31 11:01:53" text data. Since the other text data is not associated with the screen data of screen 5 that is associated with the specific text data, the screen data of screen 4 is not subject to the non-display process.

<<更新画面の生成処理>>
続いて、更新画面の生成処理について説明する。図２７に戻り、生成・処理部５７は、通信端末３における更新画面を生成して、生成した更新画面の画面データ(テンプレートデータ)を記憶部５０００の所定領域に記憶する（ステップＳ１１４）。なお、更新画面の生成処理の具体的な内容は、上述したステップＳ１０４の処理がそれぞれの場合に応じて実行される。 <<Update screen generation process>>
Next, the process of generating the updated screen will be described. Returning to Fig. 27, the generation/processing unit 57 generates an updated screen for the communication terminal 3, and stores screen data (template data) of the generated updated screen in a predetermined area of the storage unit 5000 (step S114). Note that the specific content of the process of generating the updated screen is the process of step S104 described above that is executed depending on each case.

次に、音声記録管理装置５の送受信部５１は、通信端末３に対して編集操作指示応答を送信する（ステップＳ１１５）。これにより、通信端末３の送受信部３１は、編集操作指示応答を受信する。このとき、編集操作指示応答には、更新後の記録閲覧編集画面の画面データ(テキストデータ、画面データ)、及び音声データが含まれる。 Next, the transmission/reception unit 51 of the audio recording management device 5 transmits an editing operation instruction response to the communication terminal 3 (step S115). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the editing operation instruction response. At this time, the editing operation instruction response includes the screen data (text data, screen data) of the updated record viewing/editing screen, and the audio data.

続いて、通信端末３の表示制御部３４は、各種編集操作指示に伴う更新後の記録閲覧編集画面をディスプレイ３１８に表示する（ステップＳ１１６）。 Next, the display control unit 34 of the communication terminal 3 displays the updated record viewing and editing screen in response to the various editing operation instructions on the display 318 (step S116).

〔第１の実施形態の主な効果〕
以上説明したように本実施形態によれば、音声記録管理装置５は、通信端末３が送信した、所定のテキスト又は所定の画像に対する編集要求に応じて、所定のテキストデータを編集処理した編集後テキストデータと所定の画像データを編集処理した編集後画像データとを含む編集後画面データと、所定の音声データを編集処理した編集後音声データとを、通信端末３に対して送信する。通信端末３は、音声記録管理装置５が送信した編集後画面データに係る編集後画面をディスプレイ３１８に表示手段に表示し、音声記録管理装置５が送信した編集後音声データに係る編集後音声を再生する。これにより、音声情報に基づいて生成された音声記録を編集する場合、音声記録に含まれるテキストデータ又はそのテキストデータに対応する画像データを編集すればよいので、音声記録の編集における利便性を向上させることができるという効果を奏する。 [Major Effects of the First Embodiment]
As described above, according to this embodiment, the voice recording management device 5 transmits edited screen data including edited text data obtained by editing the specified text data and edited image data obtained by editing the specified image data, and edited audio data obtained by editing the specified audio data, to the communication terminal 3 in response to an editing request for a specified text or a specified image transmitted from the communication terminal 3. The communication terminal 3 displays an edited screen related to the edited screen data transmitted by the voice recording management device 5 on the display means of the display 318, and plays back edited audio related to the edited audio data transmitted by the voice recording management device 5. This provides the effect of improving convenience in editing voice recordings, since when editing a voice recording generated based on audio information, it is only necessary to edit the text data included in the voice recording or the image data corresponding to the text data.

更に、本実施形態によれば、音声記録において編集したいテキスト又は画像を選択的に操作可能なＵＩ(User Interface)が提供されるので、利用者は、非公開、公開、削除及び再生を含む各種処理を所望のテキスト又は画像に対して行うことで、公開範囲を限定的に操作することが可能になる。これにより、秘匿情報を含む音声記録に対する秘匿性を向上させることが可能になる。 Furthermore, according to this embodiment, a UI (User Interface) is provided that allows the user to selectively edit text or images in an audio recording, enabling the user to perform various processes, including non-disclosure, disclosure, deletion, and playback, on the desired text or image, thereby allowing the user to control the scope of disclosure in a limited manner. This makes it possible to improve confidentiality for audio recordings that contain confidential information.

〔第２の実施形態〕
次に、図４０乃至図５１を用いて、第２の実施形態について説明する。第２の実施形態に係る第１の実施形態との相違点は、所定のテキストに対する編集要求(例えば、非公開、公開、及び削除の各要求)があった場合に、編集の対象となるテキストをテキストグループとし、テキストグループに対して所定の要約を与えた上で編集処理を行うようにした点である。つまり、通信システム１を構成する各ハードウエア資源、各ハードウエア資源における機能構成は、第１の実施形態と同様であり、データテーブルの構造が一部変更されている。そのため、図２７に示した、記録閲覧編集処理(ステップＳ１１１－Ｓ１１６)に係るシーケンス図は、第２の実施形態においても同様に使用される。 [Second Embodiment]
Next, the second embodiment will be described with reference to Fig. 40 to Fig. 51. The difference between the first embodiment and the second embodiment is that when there is an editing request for a specific text (for example, a request for non-disclosure, disclosure, or deletion), the text to be edited is treated as a text group, and a specific summary is given to the text group before editing. In other words, the hardware resources constituting the communication system 1 and the functional configuration of each hardware resource are the same as those in the first embodiment, with the structure of the data table being partially changed. Therefore, the sequence diagram for the record viewing and editing process (steps S111-S116) shown in Fig. 27 is also used in the second embodiment.

●テキスト情報管理テーブル●
図４０は、第２の実施形態に係るテキスト情報管理テーブルの一例を示す概念図である。記憶部５０００には、図４０に示されているようなテキスト情報管理テーブルによって構成されたテキスト情報管理ＤＢ５００７が構築されている。テキスト情報管理ＤＢ５００７では、図７に示されたテキスト情報管理ＤＢ５００３を構成するテキスト情報管理テーブルに、テキストグループＩＤの項目が追加されている。このテキストグループＩＤは、記録閲覧編集画面において作成者がマウス操作等により編集対象の一以上のテキストを一つのテキストグループとして生成したときに付与される識別情報である。テキストグループＩＤは、例えば、「TG0003」、「TG0004」等で与えられる。テキスト情報管理ＤＢ５００７では、このテキストグループＩＤが、記録識別情報ごとに管理される。 ●Text information management table●
FIG. 40 is a conceptual diagram showing an example of a text information management table according to the second embodiment. In the storage unit 5000, a text information management DB 5007 configured by a text information management table as shown in FIG. 40 is constructed. In the text information management DB 5007, an item of a text group ID is added to the text information management table constituting the text information management DB 5003 shown in FIG. 7. This text group ID is identification information that is given when the creator generates one or more texts to be edited as one text group by mouse operation or the like on the record viewing and editing screen. The text group ID is given, for example, as "TG0003", "TG0004", etc. In the text information management DB 5007, this text group ID is managed for each record identification information.

●要約情報管理テーブル●
図４１は、第２の実施形態に係る要約情報管理テーブルの一例を示す概念図である。記憶部５０００には、図４１に示されているような要約情報管理テーブルによって構成された要約情報管理ＤＢ５００８が構築されている。要約情報管理ＤＢ５００８では、記録識別情報をタブとして、それぞれのタブで分けられたテキストグループＩＤ及び要約内容が関連付けられた記憶、管理されている。このテキストグループＩＤ及び要約内容は、取得された音声記録ごとに管理される。 ●Summary information management table●
Fig. 41 is a conceptual diagram showing an example of a summary information management table according to the second embodiment. A summary information management DB 5008 configured by a summary information management table as shown in Fig. 41 is constructed in the storage unit 5000. In the summary information management DB 5008, each piece of record identification information serves as a tab, and under each tab the text group IDs and summary contents are stored and managed in association with each other. The text group IDs and summary contents are managed for each acquired voice recording.
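
The two tables described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the field names, the in-memory dictionary, and the helper functions `register_summary` and `find_group_id` are assumptions introduced for the example. Only the text group ID column, the association of text group ID with summary content per record, and the lookup by summary content come from the description.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TextRow:
    """Simplified row of the text information management table (Fig. 40)."""
    record_id: str
    text_id: str
    text: str
    start: float                          # start time, seconds (assumed unit)
    end: float                            # end time, seconds (assumed unit)
    image_id: str
    public: bool = True
    deleted: bool = False
    text_group_id: Optional[str] = None   # column added in the second embodiment


# Summary information management table (Fig. 41), keyed by record ID ("tab"):
# record_id -> {text_group_id: summary content}
summary_db = {}


def register_summary(record_id, text_group_id, summary, rows):
    """Store the summary and stamp the group ID on every selected text."""
    summary_db.setdefault(record_id, {})[text_group_id] = summary
    for row in rows:
        row.text_group_id = text_group_id


def find_group_id(record_id, summary):
    """Look up a text group ID using the summary content as the search key."""
    for group_id, content in summary_db.get(record_id, {}).items():
        if content == summary:
            return group_id
    return None
```

Using the summary content as a search key follows the later flowcharts; the sketch returns the first match and does not address what happens when two groups share the same summary.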

<<記録閲覧編集処理の振分け>>
続いて、記録閲覧編集処理の振分けについて説明する。図４２Ａは、第２の実施形態に係る各種ボタン操作により分岐される処理の一例を示すフローチャートである。まず、取得部５２は、図２８ＢのステップＳ１１３－２－４の処理を続けて、ステップＳ１１２で受信した各種ボタン情報を取得する（ステップＳ１１３－３）。 <<Distribution of record viewing and editing processes>>
Next, the distribution of the record viewing and editing process will be described. Fig. 42A is a flowchart showing an example of a process branched by various button operations according to the second embodiment. First, following the process of step S113-2-4 in Fig. 28B, the acquisition unit 52 acquires the various button information received in step S112 (step S113-3).

続いて、判断部５５は、取得したボタン情報がテキストグループの「非公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－３－１）。取得したボタン情報がテキストグループの「非公開」ボタン(アイコン)である場合（ステップＳ１１３－３－１：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｇの処理に遷移する。 Then, the judgment unit 55 judges whether the acquired button information is a "Private" button (icon) of a text group (step S113-3-1). If the acquired button information is a "Private" button (icon) of a text group (step S113-3-1: YES), the voice recording management device 5 transitions to the process of circle G described below.

取得したボタン情報がテキストグループの「非公開」ボタン(アイコン)でない場合（ステップＳ１１３－３－１：ＮＯ）、判断部５５は更に、取得したボタン情報がテキストグループの「公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－３－２）。取得したボタン情報がテキストグループの「公開」ボタン(アイコン)である場合（ステップＳ１１３－３－２：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｈの処理に遷移する。 If the acquired button information is not a "Private" button (icon) of a text group (step S113-3-1: NO), the judgment unit 55 further judges whether the acquired button information is a "Public" button (icon) of a text group (step S113-3-2). If the acquired button information is a "Public" button (icon) of a text group (step S113-3-2: YES), the voice recording management device 5 transitions to the process of circle H described below.

取得したボタン情報がテキストグループの「公開」ボタン(アイコン)でない場合（ステップＳ１１３－３－２：ＮＯ）、判断部５５は更に、取得したボタン情報がテキストグループの「削除」ボタン(アイコン)であるかを判断する（ステップＳ１１３－３－３）。取得したボタン情報がテキストグループの「削除」ボタン(アイコン)である場合（ステップＳ１１３－３－３：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｉの処理に遷移する。 If the acquired button information is not the "Public" button (icon) of the text group (step S113-3-2: NO), the judgment unit 55 further judges whether the acquired button information is the "Delete" button (icon) of the text group (step S113-3-3). If the acquired button information is the "Delete" button (icon) of the text group (step S113-3-3: YES), the voice recording management device 5 transitions to the process of circle I described below.

取得したボタン情報がテキストグループの「削除」ボタン(アイコン)でない場合（ステップＳ１１３－３－３：ＮＯ）、音声記録管理装置５は、各種ボタン情報を取得４の処理に遷移する（ステップＳ１１３－３－４）。 If the acquired button information is not the "Delete" button (icon) of the text group (step S113-3-3: NO), the voice recording management device 5 transitions to the "acquire various button information 4" process (step S113-3-4).

図４２Ｂは、第２の実施形態に係る各種ボタン操作により分岐される処理の一例を示すフローチャートである。図４２Ｂにおいて、取得部５２は、ステップＳ１１２で受信した各種ボタン情報を取得する（ステップＳ１１３－４）。 Fig. 42B is a flowchart showing an example of a process branched by various button operations according to the second embodiment. In Fig. 42B, the acquisition unit 52 acquires the various button information received in step S112 (step S113-4).

続いて、判断部５５は、取得したボタン情報がキャプチャ画像の「非公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－４－１）。取得したボタン情報がキャプチャ画像の「非公開」ボタン(アイコン)である場合（ステップＳ１１３－４－１：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｊの処理に遷移する。 Then, the judgment unit 55 judges whether the acquired button information is a "Private" button (icon) of the captured image (step S113-4-1). If the acquired button information is a "Private" button (icon) of the captured image (step S113-4-1: YES), the voice recording management device 5 transitions to the process of circle J, which will be described later.

取得したボタン情報がキャプチャ画像の「非公開」ボタン(アイコン)でない場合（ステップＳ１１３－４－１：ＮＯ）、判断部５５は更に、取得したボタン情報がキャプチャ画像の「公開」ボタン(アイコン)であるかを判断する（ステップＳ１１３－４－２）。取得したボタン情報がキャプチャ画像の「公開」ボタン(アイコン)である場合（ステップＳ１１３－４－２：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｋの処理に遷移する。 If the acquired button information is not the "Private" button (icon) of the captured image (step S113-4-1: NO), the judgment unit 55 further judges whether the acquired button information is the "Public" button (icon) of the captured image (step S113-4-2). If the acquired button information is the "Public" button (icon) of the captured image (step S113-4-2: YES), the voice recording management device 5 transitions to the process of circle K described below.

取得したボタン情報がキャプチャ画像の「公開」ボタン(アイコン)でない場合（ステップＳ１１３－４－２：ＮＯ）、判断部５５は更に、取得したボタン情報がキャプチャ画像の「削除」ボタン(アイコン)であるかを判断する（ステップＳ１１３－４－３）。取得したボタン情報がキャプチャ画像の「削除」ボタン(アイコン)である場合（ステップＳ１１３－４－３：ＹＥＳ）、音声記録管理装置５は、後述する丸Ｌの処理に遷移する。 If the acquired button information is not the "Public" button (icon) of the captured image (step S113-4-2: NO), the judgment unit 55 further judges whether the acquired button information is the "Delete" button (icon) of the captured image (step S113-4-3). If the acquired button information is the "Delete" button (icon) of the captured image (step S113-4-3: YES), the voice recording management device 5 transitions to the process of circle L described below.

取得したボタン情報がキャプチャ画像の「削除」ボタン(アイコン)でない場合（ステップＳ１１３－４－３：ＮＯ）、音声記録管理装置５は、各種ボタン情報を取得１の処理に遷移する（ステップＳ１１３－４－４）。つまり、音声記録管理装置５は、図２８Ａの処理に戻る。 If the acquired button information is not the "Delete" button (icon) of the captured image (step S113-4-3: NO), the voice recording management device 5 transitions to the "acquire various button information 1" process (step S113-4-4). In other words, the voice recording management device 5 returns to the process of FIG. 28A.
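
The branching of Figs. 42A and 42B can be sketched as a lookup table. The flowcharts test the buttons one after another; the table below is an equivalent, more compact formulation chosen for illustration. The string keys `"text_group"`, `"captured_image"`, `"private"`, `"public"`, and `"delete"` are hypothetical encodings of the button information, which the description does not specify.

```python
# Hypothetical (target, action) pairs mapped to the flowchart connectors
# (circles G to L) named in Figs. 42A and 42B.
CONNECTORS = {
    ("text_group", "private"): "G",       # step S113-3-1: YES
    ("text_group", "public"): "H",        # step S113-3-2: YES
    ("text_group", "delete"): "I",        # step S113-3-3: YES
    ("captured_image", "private"): "J",   # step S113-4-1: YES
    ("captured_image", "public"): "K",    # step S113-4-2: YES
    ("captured_image", "delete"): "L",    # step S113-4-3: YES
}


def dispatch(target, action):
    """Return the connector to branch to, or None when no button matches.

    None corresponds to falling through to the next
    "acquire various button information" step.
    """
    return CONNECTORS.get((target, action))
```

For example, `dispatch("text_group", "private")` selects the process of circle G, and an unrecognized pair falls through.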

<<テキストグループに対する非公開ボタン操作時の処理>>
次に、テキストグループに対する非公開ボタン操作時の処理について説明する。図４３は、第２の実施形態に係るテキストグループに対する非公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図４２Ａで判断された遷移先としての丸Ｇの処理(ステップＳ１１３－３－１０１からステップＳ１１３－３－１０５)が実行される。 <<Processing when the private button is operated on a text group>>
Next, a process when the private button is operated on a text group will be described. Fig. 43 is a flowchart showing an example of a process when the private button is operated on a text group according to the second embodiment. In this flowchart, the process of circle G as the transition destination determined in Fig. 42A (steps S113-3-101 to S113-3-105) is executed.

まず、音声記録管理装置５の記憶読出部５９は、ステップＳ１１２で受信した要約内容を検索キーとして要約情報管理ＤＢ５００８（図４１参照）を検索することにより、対応するテキストグループＩＤを読み出す。続いて、設定登録部５８は、処理の対象となるテキストグループのテキストグループＩＤに対応する各テキストの公開フラグを「False」に設定する。具体的には、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理されている、読み出されたテキストグループＩＤに対応する各テキストの公開フラグを「False」に設定する（ステップＳ１１３－３－１０１）。 First, the storage/reading unit 59 of the voice recording management device 5 searches the summary information management DB 5008 (see FIG. 41) using the summary content received in step S112 as a search key to read out the corresponding text group ID. Next, the setting/registration unit 58 sets the public flag of each text corresponding to the text group ID of the text group to be processed to "False." Specifically, the setting/registration unit 58 sets the public flag of each text corresponding to the read out text group ID, which is managed in the text information management DB 5007 (see FIG. 40), to "False" (step S113-3-101).

次に、取得部５２は、テキストグループＩＤに対応し、各テキストに対応する画像識別情報を取得する。具体的には、取得部５２は、テキストグループのテキストグループＩＤを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストに関連付けられた画像識別情報を取得する（ステップＳ１１３－３－１０２）。 Next, the acquisition unit 52 acquires image identification information corresponding to each text, which corresponds to the text group ID. Specifically, the acquisition unit 52 acquires image identification information associated with each corresponding text by searching the text information management DB 5007 (see FIG. 40) using the text group ID of the text group as a search key (step S113-3-102).

次に、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、取得した画像識別情報で示されるキャプチャ画像の公開フラグを「False」に設定する（ステップＳ１１３－３－１０３）。 Next, the setting registration unit 58 sets the public flag of the capture image managed in the capture image management DB 5004 (see FIG. 8A) and indicated by the acquired image identification information to "False" (step S113-3-103).

次に、取得部５２は、テキストグループＩＤを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストの開始時刻及び終了時刻を取得する（ステップＳ１１３－３－１０４）。 Next, the acquisition unit 52 searches the text information management DB 5007 (see FIG. 40) using the text group ID as a search key to acquire the start time and end time of each corresponding text (step S113-3-104).

続いて、設定登録部５８は、取得した各時刻情報のうち、最も早い開始時刻及び最も遅い終了時刻の各時刻情報を、非公開音声管理ＤＢ５００６（図９参照）に登録して（ステップＳ１１３－３－１０５）、このフローを抜ける。 Next, the setting registration unit 58 registers the earliest start time and the latest end time among the acquired time information in the private audio management DB 5006 (see FIG. 9) (step S113-3-105), and then exits this flow.
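
Steps S113-3-101 to S113-3-105 can be sketched as below, assuming the tables are held as plain Python dictionaries and lists. The key names (`group_id`, `image_id`, `public`, `start`, `end`) and the function name are illustrative assumptions, and the text group ID is passed in directly rather than looked up from the summary content.

```python
def make_group_private(texts, images, private_audio, group_id):
    """Hide a text group, its captured images, and the matching audio span."""
    members = [t for t in texts if t["group_id"] == group_id]
    for t in members:
        t["public"] = False                           # step S113-3-101
    image_ids = {t["image_id"] for t in members}      # step S113-3-102
    for image_id in image_ids:
        images[image_id]["public"] = False            # step S113-3-103
    if not members:
        return None
    # Step S113-3-104: collect times; keep only the outermost pair.
    span = (min(t["start"] for t in members),
            max(t["end"] for t in members))
    private_audio.append(span)                        # step S113-3-105
    return span
```

Registering only the earliest start time and the latest end time means the whole interval covered by the group is treated as one private audio range.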

●画面表示例●
図４４は、第２の実施形態に係る通信端末におけるテキストグループに対する要約入力ボタン操作時の画面表示例である。作成者の通信端末３のディスプレイ３１８には、表制御部３４によって、記録閲覧編集画面３１７１が表示される。図４４では、図２５に示した作成者の記録閲覧編集画面３１５１に対して、グループ指定領域３１７２と、グループ指定領域３１７２の近傍に表示される「要約入力」ボタン(アイコン)３５６１が、表示制御部３４によって表示される。グループ指定領域３１７２は、例えば、作成者によるマウス３７０１のドラッグ＆ドロップ操作によって指定、表示されるようにしてもよい。例えば、利用者は、記録閲覧編集画面３１７１の任意の始点Ａからマウス３７０１をドラッグ操作し、任意の終点Ｂでマウス３７０１をドロップ操作して任意の領域を指定する。これにより、記録閲覧編集画面３１７１上に、図４４に示したようなグループ指定領域３１７２が表示される。これにあわせて、算出特定部５３と判断部５５は、グループ指定領域３１７２が有する各頂点の座標値と各テキスト表示欄の各頂点の座標値とを比較して、どのテキスト表示欄がグループ指定領域３１７２内に存在するかを算出して特定する。このようにして、算出特定部５３と判断部５５は、どのテキスト表示欄がグループ指定領域３１７２に含まれるかを特定することができる。 ●Screen display example●
Fig. 44 is an example of a screen display when a summary input button is operated for a text group in a communication terminal according to the second embodiment. A record viewing and editing screen 3171 is displayed by the display control unit 34 on the display 318 of the creator's communication terminal 3. In Fig. 44, compared with the creator's record viewing and editing screen 3151 shown in Fig. 25, a group designation area 3172 and a "summary input" button (icon) 3561 displayed near the group designation area 3172 are additionally displayed by the display control unit 34. The group designation area 3172 may be designated and displayed by, for example, a drag-and-drop operation of the mouse 3701 by the creator. For example, the user drags the mouse 3701 from an arbitrary start point A on the record viewing and editing screen 3171 and drops the mouse 3701 at an arbitrary end point B to designate an arbitrary area. As a result, the group designation area 3172 as shown in Fig. 44 is displayed on the record viewing and editing screen 3171. At the same time, the calculation/identification unit 53 and the judgment unit 55 compare the coordinate values of each vertex of the group designation area 3172 with the coordinate values of each vertex of each text display field, and calculate and identify which text display fields are present within the group designation area 3172. In this way, the calculation/identification unit 53 and the judgment unit 55 can identify which text display fields are included in the group designation area 3172.
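
The vertex comparison described above can be sketched as a rectangle containment test. The sketch assumes axis-aligned rectangles in screen coordinates and counts a text display field as included only when it lies entirely inside the area; the description does not say how partially overlapping fields are handled. The names `Rect`, `rect_from_drag`, `contains`, and `fields_in_area` are hypothetical.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    left: float
    top: float
    right: float
    bottom: float


def rect_from_drag(start, end):
    """Build the designated area from drag start point A and drop point B."""
    (ax, ay), (bx, by) = start, end
    return Rect(min(ax, bx), min(ay, by), max(ax, bx), max(ay, by))


def contains(outer, inner):
    """True when every vertex of `inner` lies within `outer`."""
    return (outer.left <= inner.left and outer.top <= inner.top
            and inner.right <= outer.right and inner.bottom <= outer.bottom)


def fields_in_area(area, fields):
    """Return the IDs of the text display fields inside the designated area."""
    return [field_id for field_id, rect in fields.items()
            if contains(area, rect)]
```

Normalizing the two drag points in `rect_from_drag` lets the creator drag in any direction, for example from the lower right to the upper left.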

また、「要約入力」ボタン(アイコン)３５６１は、領域指定されたグループ指定領域３１７２の近傍に対して行われたマウスオーバー操作にあわせて、表示制御部３４によって表示される。 The "summary input" button (icon) 3561 is also displayed by the display control unit 34 in response to a mouseover operation performed near the designated group designation area 3172.

●画面表示例●
図４５は、第２の実施形態に係る通信端末における要約情報入力ダイアログの画面表示例である。作成者の通信端末３のディスプレイ３１８には、図４４に示した「要約入力」ボタン(アイコン)３５６１に対して操作が行われると、表示制御部３４によって、要約入力ダイアログ３１８１が表示される。「要約入力」ダイアログ３１８１は、作成者が任意に指定したグループ指定領域３１７２を表す任意の要約(文)を入力するためのダイアログである。このダイアログに対して作成者が任意の単語、文、リンク先等の各種情報を入力し、「登録」ボタン(アイコン)３５７１を操作すると、入力された各種情報が確定され、音声記録管理装置５に送信される。 ●Screen display example●
Fig. 45 is an example of a screen display of a summary information input dialogue in a communication terminal according to the second embodiment. When an operation is performed on the "summary input" button (icon) 3561 shown in Fig. 44 on the display 318 of the creator's communication terminal 3, a summary input dialogue 3181 is displayed by the display control unit 34. The "summary input" dialogue 3181 is a dialogue for inputting an arbitrary summary (sentence) representing a group designation area 3172 arbitrarily designated by the creator. When the creator inputs various information such as an arbitrary word, sentence, link destination, etc. into this dialogue and operates the "register" button (icon) 3571, the various input information is confirmed and transmitted to the voice recording management device 5.

●画面表示例●
図４６は、第２の実施形態に係る通信端末における要約表示欄を含む画面表示例である。作成者の通信端末３のディスプレイ３１８には、表示制御部３４によって、記録閲覧編集画面３１７１が表示される。図４６では、例えば、グループ指定領域３１７２の近傍においてマウスオーバー操作が行われると、表示制御部３４によって吹き出し状の要約表示欄３１９１が表示される。この要約表示欄３１９１は、図４５に示した「要約入力」ダイアログ３１８１に入力した要約内容(要約情報)が表示される。そして「編集」ボタン(アイコン)３５８１が操作されると、入力済みの要約情報を編集するための編集画面(ダイアログ画面)が表示される。このときの編集画面(ダイアログ画面)は、図４５に示された要約入力ダイアログ３１８１と同じものでもよい。そして、要約入力ダイアログ３１８１で任意の要約内容に再編集され、「登録」ボタン(アイコン)３５７１が操作されると、再編集された内容が音声記録管理装置５に送信される。 ●Screen display example●
Fig. 46 is an example of a screen display including a summary display field in a communication terminal according to the second embodiment. A record viewing and editing screen 3171 is displayed by the display control unit 34 on the display 318 of the creator's communication terminal 3. In Fig. 46, for example, when a mouse-over operation is performed near the group designation area 3172, a balloon-shaped summary display field 3191 is displayed by the display control unit 34. The summary display field 3191 shows the summary content (summary information) input in the "summary input" dialogue 3181 shown in Fig. 45. When the "edit" button (icon) 3581 is operated, an editing screen (dialog screen) for editing the input summary information is displayed. This editing screen (dialog screen) may be the same as the summary input dialogue 3181 shown in Fig. 45. When the summary content is re-edited in the summary input dialogue 3181 and the "register" button (icon) 3571 is operated, the re-edited content is transmitted to the voice recording management device 5.

また、図４６に示したグループ指定領域３１７２の近傍に対して行われたマウスオーバー操作にあわせて、表示制御部３４によって「非公開」ボタン(アイコン)３５９１が表示される。通信端末３では、「非公開」ボタン(アイコン)３５９１と同様に、「公開」ボタン(アイコン)及び「削除」ボタン(アイコン)が、表示制御部３４によってそれぞれの状態に応じて表示される。 In addition, in response to a mouse-over operation performed near the group designation area 3172 shown in FIG. 46, the display control unit 34 displays a "private" button (icon) 3591. In the communication terminal 3, similar to the "private" button (icon) 3591, a "public" button (icon) and a "delete" button (icon) are displayed by the display control unit 34 according to their respective states.

<<テキストグループに対する公開ボタン操作時の処理>>
次に、テキストグループに対する公開ボタン操作時の処理について説明する。図４７は、第２の実施形態に係るテキストグループに対する公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図４２Ａで判断された遷移先としての丸Ｈの処理(ステップＳ１１３－３－２０１からステップＳ１１３－３－２０７)が実行される。 <<Processing when the public button is operated on a text group>>
Next, the process when the public button is operated on a text group will be described. Fig. 47 is a flowchart showing an example of the process when the public button is operated on a text group according to the second embodiment. In this flowchart, the process of circle H as the transition destination determined in Fig. 42A (steps S113-3-201 to S113-3-207) is executed.

まず、音声記録管理装置５の記憶読出部５９は、ステップＳ１１２で受信した要約内容を検索キーとして要約情報管理ＤＢ５００８（図４１参照）を検索することにより、対応するテキストグループＩＤを読み出す。続いて、設定登録部５８は、処理の対象となるテキストグループのテキストグループＩＤに対応する各テキストの公開フラグを「True」に設定する。具体的には、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理されている、読み出されたテキストグループＩＤに対応する各テキストの公開フラグを「True」に設定する（ステップＳ１１３－３－２０１）。 First, the storage/reading unit 59 of the voice recording management device 5 searches the summary information management DB 5008 (see FIG. 41) using the summary content received in step S112 as a search key to read out the corresponding text group ID. Next, the setting/registration unit 58 sets the public flag of each text corresponding to the text group ID of the text group to be processed to "True." Specifically, the setting/registration unit 58 sets the public flag of each text corresponding to the read out text group ID, which is managed in the text information management DB 5007 (see FIG. 40), to "True" (step S113-3-201).

次に、取得部５２は、テキストグループＩＤに対応し、各テキストに対応する画像識別情報を取得する。具体的には、取得部５２は、テキストグループＩＤを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する画像識別情報を取得する（ステップＳ１１３－３－２０２）。 Next, the acquisition unit 52 acquires image identification information corresponding to each text, which corresponds to the text group ID. Specifically, the acquisition unit 52 acquires the corresponding image identification information by searching the text information management DB 5007 (see FIG. 40) using the text group ID as a search key (step S113-3-202).

次に、取得部５２は、テキスト情報管理ＤＢ５００７（図４０参照）で管理され、取得した画像識別情報で示されるキャプチャ画像に対応する各テキストの公開フラグを取得する（ステップＳ１１３－３－２０３）。 Next, the acquisition unit 52 acquires the public flag of each text that is managed in the text information management DB 5007 (see FIG. 40) and corresponds to the captured image indicated by the acquired image identification information (step S113-3-203).

続いて、判断部５５は、取得した公開フラグが全て「True」であるかを判断する（ステップＳ１１３－３－２０４）。取得した公開フラグが全て「True」である場合（ステップＳ１１３－３－２０４：ＹＥＳ）、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理されている、取得した画像識別情報で示されるキャプチャ画像の画像識別情報に対応する公開フラグを「True」に設定する（ステップＳ１１３－３－２０５）。 Then, the judgment unit 55 judges whether all of the acquired public flags are "True" (step S113-3-204). If all of the acquired public flags are "True" (step S113-3-204: YES), the setting registration unit 58 sets the public flag corresponding to the image identification information of the capture image indicated by the acquired image identification information, which is managed in the capture image management DB 5004 (see FIG. 8A), to "True" (step S113-3-205).

続いて、取得部５２は、テキストグループＩＤに対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する。具体的には、取得部５２は、テキストグループＩＤを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストの開始時刻及び終了時刻を取得する（ステップＳ１１３－３－２０６）。 Then, the acquisition unit 52 acquires time information on the start time and end time of each text corresponding to the text group ID. Specifically, the acquisition unit 52 acquires the start time and end time of each corresponding text by searching the text information management DB 5007 (see FIG. 40) using the text group ID as a search key (step S113-3-206).

続いて、設定登録部５８は、取得した各時刻情報のうち、最も早い開始時刻及び最も遅い終了時刻の各時刻情報を、非公開音声管理ＤＢ５００６（図９参照）から削除して（ステップＳ１１３－３－２０７）、このフローを抜ける。 Then, the setting registration unit 58 deletes, from the private audio management DB 5006 (see FIG. 9), the earliest start time and the latest end time among the acquired time information (step S113-3-207), and exits this flow.

他方、取得した公開フラグが全て「True」でない場合、すなわち、取得した公開フラグのうち、少なくとも一つが「False」である場合（ステップＳ１１３－３－２０４：ＮＯ）、音声記録管理装置５は、上述したステップＳ１１３－３－２０６及びステップＳ１１３－３－２０７の処理を実行してこのフローを抜ける。 On the other hand, if not all of the acquired public flags are "True", that is, if at least one of the acquired public flags is "False" (step S113-3-204: NO), the voice recording management device 5 executes the processing of the above-mentioned steps S113-3-206 and S113-3-207 and exits this flow.
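
The public button flow (steps S113-3-201 to S113-3-207) can be sketched as follows, again with plain dictionaries and illustrative key names. The sketch assumes the "all flags True" check is made per captured image, over every text associated with that image, so an image stays private while any text that refers to it is still private. The description does not spell out this granularity.

```python
def make_group_public(texts, images, private_audio, group_id):
    """Re-publish a text group; publish an image only if all its texts are public."""
    members = [t for t in texts if t["group_id"] == group_id]
    for t in members:
        t["public"] = True                                  # step S113-3-201
    for image_id in {t["image_id"] for t in members}:       # step S113-3-202
        flags = [t["public"] for t in texts
                 if t["image_id"] == image_id]              # step S113-3-203
        if all(flags):                                      # step S113-3-204
            images[image_id]["public"] = True               # step S113-3-205
    if not members:
        return None
    span = (min(t["start"] for t in members),
            max(t["end"] for t in members))                 # step S113-3-206
    if span in private_audio:
        private_audio.remove(span)                          # step S113-3-207
    return span
```

Both branches of step S113-3-204 end with steps S113-3-206 and S113-3-207, which is why the audio range is released regardless of the image decision.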

<<テキストグループに対する削除ボタン操作時の処理>>
次に、テキストグループに対する削除ボタン操作時の処理について説明する。図４８は、第２の実施形態に係るテキストグループに対する削除ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図４２Ａで判断された遷移先としての丸Iの処理(ステップＳ１１３－３－３０１からステップＳ１１３－３－３０７)が実行される。 <<Processing when the Delete button is used on a text group>>
Next, the process when the delete button is operated on a text group will be described. Fig. 48 is a flowchart showing an example of the process when the delete button is operated on a text group according to the second embodiment. In this flowchart, the process of circle I as the transition destination determined in Fig. 42A (steps S113-3-301 to S113-3-307) is executed.

まず、音声記録管理装置５の記憶読出部５９は、ステップＳ１１２で受信した要約内容を検索キーとして要約情報管理ＤＢ５００８（図４１参照）を検索することにより、対応するテキストグループＩＤを読み出す。続いて、設定登録部５８は、対象のテキストグループＩＤに対応する全てのテキストを空白に設定し、削除フラグを「True」に設定する。具体的には、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理されている、読み出されたテキストグループＩＤに対応する全てのテキストを空白に設定し、対応する削除フラグを「True」に設定する（ステップＳ１１３－３－３０１）。 First, the storage/reading unit 59 of the voice recording management device 5 searches the summary information management DB 5008 (see FIG. 41) using the summary content received in step S112 as a search key to read out the corresponding text group ID. Next, the setting/registration unit 58 sets all text corresponding to the target text group ID to blank, and sets the deletion flag to "True." Specifically, the setting/registration unit 58 sets all text corresponding to the read-out text group ID, which is managed in the text information management DB 5007 (see FIG. 40), to blank, and sets the corresponding deletion flag to "True" (step S113-3-301).

次に、取得部５２は、対象のテキストグループＩＤに対応し、各テキストに対応する画像識別情報を取得する。具体的には、取得部５２は、テキストグループＩＤを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する画像識別情報を取得する（ステップＳ１１３－３－３０２）。 Next, the acquisition unit 52 acquires image identification information corresponding to each text, which corresponds to the target text group ID. Specifically, the acquisition unit 52 acquires the corresponding image identification information by searching the text information management DB 5007 (see FIG. 40) using the text group ID as a search key (step S113-3-302).

次に、取得部５２は、テキスト情報管理ＤＢ５００７（図４０参照）で管理され、取得した画像識別情報で示されるキャプチャ画像に対応する各テキストの削除フラグを取得する（ステップＳ１１３－３－３０３）。 Next, the acquisition unit 52 acquires the deletion flag of each text that is managed in the text information management DB 5007 (see FIG. 40) and corresponds to the captured image indicated by the acquired image identification information (step S113-3-303).

続いて、判断部５５は、取得した削除フラグが全て「True」であるかを判断する（ステップＳ１１３－３－３０４）。取得した削除フラグが全て「True」である場合（ステップＳ１１３－３－３０４：ＹＥＳ）、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理されている、取得した画像識別情報で示されるキャプチャ画像の画像データを所定の画像データに置き換え(書き換え)、対応する削除フラグを「True」に設定する（ステップＳ１１３－３－３０５）。 Then, the judgment unit 55 judges whether all of the acquired deletion flags are "True" (step S113-3-304). If all of the acquired deletion flags are "True" (step S113-3-304: YES), the setting registration unit 58 replaces (rewrites) the image data of the capture image indicated by the acquired image identification information, which is managed in the capture image management DB 5004 (see FIG. 8A), with specified image data, and sets the corresponding deletion flag to "True" (step S113-3-305).

続いて、取得部５２は、対象のテキストグループＩＤに対応し、各テキストに対応する開始時刻及び終了時刻の各時刻情報を取得する。具体的には、取得部５２は、テキストグループＩＤを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、処理の対象となるテキストグループに対応する各テキストの開始時刻及び終了時刻を取得する（ステップＳ１１３－３－３０６）。 Then, the acquisition unit 52 acquires time information of the start time and end time corresponding to each text, which corresponds to the target text group ID. Specifically, the acquisition unit 52 acquires the start time and end time of each text corresponding to the text group to be processed by searching the text information management DB 5007 (see FIG. 40) using the text group ID as a search key (step S113-3-306).

続いて、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理されている削除フラグが「True」に設定されたテキストに対応する音声データであり、記録書誌情報管理ＤＢ５００２（図６参照）で管理されている音声データのうち、最も早い開始時刻及び最も遅い終了時刻の間の音声データを削除(無音化)して（ステップＳ１１３－３－３０７）、このフローを抜ける。 Then, for the audio data that is managed in the record bibliographic information management DB 5002 (see FIG. 6) and corresponds to the texts whose deletion flag is set to "True" in the text information management DB 5007 (see FIG. 40), the setting registration unit 58 deletes (silences) the portion between the earliest start time and the latest end time (step S113-3-307), and exits this flow.

他方、取得した削除フラグが全て「True」でない場合、すなわち、取得した削除フラグのうち、少なくとも一つが「False」である場合（ステップＳ１１３－３－３０４：ＮＯ）、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理されている、取得した画像識別情報で示されるキャプチャ画像に対応する削除フラグを「False」に設定し（ステップＳ１１３－３－３０８）、上述したステップＳ１１３－３－３０６及びステップＳ１１３－３－３０７の各処理を実行してこのフローを抜ける。 On the other hand, if not all of the acquired deletion flags are "True", that is, if at least one of the acquired deletion flags is "False" (step S113-3-304: NO), the setting registration unit 58 sets the deletion flag that is managed in the capture image management DB 5004 (see FIG. 8A) and corresponds to the captured image indicated by the acquired image identification information to "False" (step S113-3-308), executes the processes of steps S113-3-306 and S113-3-307 described above, and exits this flow.
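
The delete flow (steps S113-3-301 to S113-3-308) can be sketched in the same style. The placeholder bytes, the key names, and the representation of audio as a list of samples are assumptions made for the example; the description says only that the image data is replaced with predetermined image data and that the audio in the range is deleted (silenced).

```python
PLACEHOLDER = b"<placeholder image>"   # stands in for the predetermined image data


def delete_group(texts, images, group_id):
    """Blank a text group, replace fully deleted images, return the span to silence."""
    members = [t for t in texts if t["group_id"] == group_id]
    for t in members:
        t["text"] = ""
        t["deleted"] = True                                  # step S113-3-301
    for image_id in {t["image_id"] for t in members}:        # step S113-3-302
        flags = [t["deleted"] for t in texts
                 if t["image_id"] == image_id]               # step S113-3-303
        if all(flags):                                       # step S113-3-304: YES
            images[image_id]["data"] = PLACEHOLDER
            images[image_id]["deleted"] = True               # step S113-3-305
        else:
            images[image_id]["deleted"] = False              # step S113-3-308
    if not members:
        return None
    return (min(t["start"] for t in members),
            max(t["end"] for t in members))                  # step S113-3-306


def silence(samples, rate, span):
    """Step S113-3-307: zero the samples between span[0] and span[1] seconds."""
    lo = max(0, int(span[0] * rate))
    hi = min(len(samples), int(span[1] * rate))
    return samples[:lo] + [0] * max(0, hi - lo) + samples[hi:]
```

Silencing rather than cutting keeps the remaining start and end times valid, since the length of the recording does not change.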

<<キャプチャ画像に対する非公開ボタン操作時の処理>>
次に、キャプチャ画像に対する非公開ボタン操作時の処理について説明する。図４９は、第２の実施形態に係るキャプチャ画像に対する非公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図４２Ｂで判断された遷移先としての丸Ｊの処理(ステップＳ１１３－４－１０１からステップＳ１１３－４－１０７)が実行される。 <<Processing when pressing the private button on a captured image>>
Next, a process when the private button is operated on a captured image will be described. Fig. 49 is a flowchart showing an example of a process when the private button is operated on a captured image according to the second embodiment. In this flowchart, the process of the circle J as the transition destination determined in Fig. 42B (steps S113-4-101 to S113-4-107) is executed.

まず、設定登録部５８は、対象キャプチャ画像を示す画像識別情報に対応する公開フラグを「False」に設定する。具体的には、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、非公開ボタン(アイコン)が操作された対象キャプチャ画像を示す画像識別情報に対応する公開フラグを「False」に設定する（ステップＳ１１３－４－１０１）。 First, the setting registration unit 58 sets the public flag corresponding to the image identification information indicating the target capture image to "False." Specifically, the setting registration unit 58 sets the public flag corresponding to the image identification information indicating the target capture image, which is managed in the capture image management DB 5004 (see FIG. 8A) and for which the private button (icon) has been operated, to "False" (step S113-4-101).

次に、取得部５２は、対象キャプチャ画像を示す画像識別情報に対応する各テキストを取得する。具体的には、取得部５２は、処理の対象となるキャプチャ画像を示す画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストを取得する（ステップＳ１１３－４－１０２）。 Next, the acquisition unit 52 acquires each piece of text corresponding to the image identification information indicating the target capture image. Specifically, the acquisition unit 52 acquires each piece of corresponding text by searching the text information management DB 5007 (see FIG. 40) using the image identification information indicating the capture image to be processed as a search key (step S113-4-102).

次に、算出特定部５３は、取得した各テキストを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応するテキストグループ(テキストグループＩＤ)を特定する（ステップＳ１１３－４－１０３）。 Next, the calculation/identification unit 53 searches the text information management DB 5007 (see FIG. 40) using each acquired text as a search key to identify the corresponding text group (text group ID) (step S113-4-103).

次に、判断部５５は、テキストグループ(テキストグループＩＤ)を特定できたかを判断する（ステップＳ１１３－４－１０４）。テキストグループ(テキストグループＩＤ)を判断できた場合（ステップＳ１１３－４－１０４：ＹＥＳ）、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理され、特定したテキストグループ(テキストグループＩＤ)に対応する全てのテキストの公開フラグを「False」に設定する（ステップＳ１１３－４－１０５）。 Next, the judgment unit 55 judges whether the text group (text group ID) has been identified (step S113-4-104). If the text group (text group ID) has been identified (step S113-4-104: YES), the setting registration unit 58 sets the public flag of all texts that are managed in the text information management DB 5007 (see FIG. 40) and that correspond to the identified text group (text group ID) to "False" (step S113-4-105).

続いて、取得部５２は、特定したテキストグループ(テキストグループＩＤ)を検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する全てのテキストの開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－４－１０６）。 Next, the acquisition unit 52 searches the text information management DB 5007 (see FIG. 40) using the identified text group (text group ID) as a search key to acquire time information for the start and end times of all corresponding texts (step S113-4-106).

続いて、音声記録管理装置５は、図３５で定義した処理Ｘ(ステップＳ１１３－２－１０３からステップＳ１１３－２－１０５)を実行して（ステップＳ１１３－４－１０７）、このフローを抜ける。 Then, the voice recording management device 5 executes process X (steps S113-2-103 to S113-2-105) defined in FIG. 35 (step S113-4-107) and exits this flow.

他方、テキストグループ(テキストグループＩＤ)を判断できなかった場合（ステップＳ１１３－４－１０４：ＮＯ）、音声記録管理装置５は、図３５で定義した処理Ｘ(ステップＳ１１３－２－１０３からステップＳ１１３－２－１０５)を実行して（ステップＳ１１３－４－１０７）、このフローを抜ける。 On the other hand, if the text group (text group ID) cannot be determined (step S113-4-104: NO), the voice recording management device 5 executes process X (steps S113-2-103 to S113-2-105) defined in FIG. 35 (step S113-4-107) and exits this flow.
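
The private button flow for a captured image (steps S113-4-101 to S113-4-107) can be sketched as follows. Process X is defined in Fig. 35, outside this section, so the sketch treats it as an opaque callback `process_x`; that parameter, its argument, and the dictionary keys are assumptions for illustration.

```python
def make_image_private(texts, images, image_id, process_x):
    """Hide a captured image and every text group that refers to it."""
    images[image_id]["public"] = False                        # step S113-4-101
    related = [t for t in texts if t["image_id"] == image_id]  # step S113-4-102
    group_ids = {t["group_id"] for t in related
                 if t["group_id"] is not None}                # step S113-4-103
    times = []
    for t in texts:
        if t["group_id"] in group_ids:                        # step S113-4-104: YES
            t["public"] = False                               # step S113-4-105
            times.append((t["start"], t["end"]))              # step S113-4-106
    process_x(image_id)                                       # step S113-4-107
    return times
```

When no text group is found, the loop changes nothing and only process X runs, which matches the NO branch of step S113-4-104.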

<<キャプチャ画像に対する公開ボタン操作時の処理>>
次に、キャプチャ画像に対する公開ボタン操作時の処理について説明する。図５０は、第２の実施形態に係るキャプチャ画像に対する公開ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図４２Ｂで判断された遷移先としての丸Ｋの処理(ステップＳ１１３－４－２０１からステップＳ１１３－４－２０８)が実行される。 <<Processing when the publish button is operated on a captured image>>
Next, a process when the public button is operated on a captured image will be described. Fig. 50 is a flowchart showing an example of a process when the public button is operated on a captured image according to the second embodiment. In this flowchart, the process of the circle K as the transition destination determined in Fig. 42B (steps S113-4-201 to S113-4-208) is executed.

まず、設定登録部５８は、対象キャプチャ画像を示す画像識別情報に対応する公開フラグを「True」に設定する。具体的には、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、公開ボタン(アイコン)が操作された対象キャプチャ画像を示す画像識別情報に対応する公開フラグを「True」に設定する（ステップＳ１１３－４－２０１）。 First, the setting registration unit 58 sets the public flag corresponding to the image identification information indicating the target capture image to "True." Specifically, the setting registration unit 58 sets the public flag corresponding to the image identification information indicating the target capture image, which is managed in the capture image management DB 5004 (see FIG. 8A) and for which the public button (icon) has been operated, to "True" (step S113-4-201).

次に、取得部５２は、対象キャプチャ画像を示す画像識別情報に対応する各テキストを取得する。具体的には、取得部５２は、対象キャプチャ画像を示す画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストを取得する（ステップＳ１１３－４－２０２）。 Next, the acquisition unit 52 acquires each piece of text corresponding to the image identification information indicating the target capture image. Specifically, the acquisition unit 52 acquires each piece of corresponding text by searching the text information management DB 5007 (see FIG. 40) using the image identification information indicating the target capture image as a search key (step S113-4-202).

次に、算出特定部５３は、取得した各テキストを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応するテキストグループ(テキストグループＩＤ)を特定する（ステップＳ１１３－４－２０３）。 Next, the calculation/identification unit 53 searches the text information management DB 5007 (see FIG. 40) using each acquired text as a search key to identify the corresponding text group (text group ID) (step S113-4-203).

次に、判断部５５は、テキストグループ(テキストグループＩＤ)を特定できたかを判断する（ステップＳ１１３－４－２０４）。テキストグループ(テキストグループＩＤ)を判断できた場合（ステップＳ１１３－４－２０４：ＹＥＳ）、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理され、特定したテキストグループ(テキストグループＩＤ)に対応する全てのテキストの公開フラグを「True」に設定する（ステップＳ１１３－４－２０５）。 Next, the judgment unit 55 judges whether the text group (text group ID) has been identified (step S113-4-204). If the text group (text group ID) has been identified (step S113-4-204: YES), the setting registration unit 58 sets the public flag of all texts that are managed in the text information management DB 5007 (see FIG. 40) and that correspond to the identified text group (text group ID) to "True" (step S113-4-205).

続いて、取得部５２は、特定したテキストグループ(テキストグループＩＤ)に対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する。具体的には、取得部５２、特定したテキストグループ(テキストグループＩＤ)を検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－４－２０６）。 Then, the acquisition unit 52 acquires time information on the start time and end time of each text corresponding to the identified text group (text group ID). Specifically, the acquisition unit 52 acquires time information on the start time and end time of each corresponding text by searching the text information management DB 5007 (see FIG. 40) using the identified text group (text group ID) as a search key (step S113-4-206).

続いて、設定登録部５８は、取得した各時刻情報のうち、最も早い開始時刻及び最も遅い終了時刻の各時刻情報を、非公開音声管理ＤＢ５００６（図９参照）から削除する（ステップＳ１１３－４－２０７）。 Next, the setting registration unit 58 deletes, from the private audio management DB 5006 (see FIG. 9), the earliest start time and the latest end time among the acquired time information (step S113-4-207).

続いて、音声記録管理装置５は、図３７で定義した処理Ｙ(ステップＳ１１３－２－２０３からステップＳ１１３－２－２１１)を実行して（ステップＳ１１３－４－２０８）、このフローを抜ける。 Then, the voice recording management device 5 executes process Y defined in FIG. 37 (steps S113-2-203 to S113-2-211) (step S113-4-208) and exits this flow.

他方、テキストグループ(テキストグループＩＤ)を判断できなかった場合（ステップＳ１１３－４－２０４：ＮＯ）、音声記録管理装置５は、図３７で定義した処理Ｙ(ステップＳ１１３－２－２０３からステップＳ１１３－２－２１１)を実行して（ステップＳ１１３－４－２０８）、このフローを抜ける。 On the other hand, if the text group (text group ID) cannot be determined (step S113-4-204: NO), the voice recording management device 5 executes process Y (steps S113-2-203 to S113-2-211) defined in FIG. 37 (step S113-4-208) and exits this flow.
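The flow above, which sets the public flag to "True" (steps S113-4-203 to S113-4-207), can be illustrated with a minimal Python sketch. The `TextRecord` class, the `publish_text_group` function, the use of seconds as the time unit, and the representation of the private audio management DB as a list of `(start, end)` tuples are all assumptions made for illustration; none of them is defined in this specification.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class TextRecord:
    """Hypothetical row of the text information management DB 5007."""
    text_id: str
    group_id: Optional[str]  # text group ID, None if the text has no group
    start: float             # start time in seconds (assumed unit)
    end: float               # end time in seconds (assumed unit)
    public: bool = False     # public flag


def publish_text_group(
    acquired: List[TextRecord],
    all_texts: List[TextRecord],
    private_ranges: List[Tuple[float, float]],
) -> List[Tuple[float, float]]:
    """Return the private audio ranges that remain after publishing."""
    # S113-4-203: identify the text groups of the acquired texts.
    group_ids = {t.group_id for t in acquired if t.group_id is not None}
    members = [t for t in all_texts if t.group_id in group_ids]
    # S113-4-204 (NO): no group identified, so nothing is changed here.
    if not members:
        return list(private_ranges)
    # S113-4-205: set the public flag of every text in those groups.
    for t in members:
        t.public = True
    # S113-4-206: collect the earliest start time and the latest end time.
    earliest = min(t.start for t in members)
    latest = max(t.end for t in members)
    # S113-4-207: drop the matching range from the private audio list.
    return [r for r in private_ranges if r != (earliest, latest)]
```

In both branches the caller would then continue with process Y, which is outside the scope of this sketch.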

<<キャプチャ画像に対する削除ボタン操作時の処理>> <<Processing when the delete button is operated on a captured image>>
次に、キャプチャ画像に対する削除ボタン操作時の処理について説明する。図５１は、第２の実施形態に係るキャプチャ画像に対する削除ボタン操作時の処理の一例を示すフローチャートである。このフローチャートでは、図４２Ｂで判断された遷移先としての丸Ｌの処理(ステップＳ１１３－４－３０１からステップＳ１１３－４－３０８)が実行される。 Next, a process when the delete button is operated on a captured image will be described. Fig. 51 is a flowchart showing an example of a process when the delete button is operated on a captured image according to the second embodiment. In this flowchart, the process of the circle L as the transition destination determined in Fig. 42B (steps S113-4-301 to S113-4-308) is executed.

まず、設定登録部５８は、対象キャプチャ画像の画像データを所定の画像に置き換え、対象キャプチャ画像を示す画像識別情報に対応する削除フラグを「True」に設定する。具体的には、設定登録部５８は、キャプチャ画像管理ＤＢ５００４（図８Ａ参照）で管理され、削除ボタン(アイコン)が操作された処理の対象となるキャプチャ画像を示す画像識別情報に対応する削除フラグを「True」に設定する（ステップＳ１１３－４－３０１）。 First, the setting registration unit 58 replaces the image data of the target capture image with a predetermined image, and sets the deletion flag corresponding to the image identification information indicating the target capture image to "True." Specifically, the setting registration unit 58 sets to "True" the deletion flag corresponding to the image identification information indicating the capture image that is managed in the capture image management DB 5004 (see FIG. 8A) and is the target of the process for which the delete button (icon) was operated (step S113-4-301).

次に、取得部５２は、処理の対象となるキャプチャ画像を示す画像識別情報に対応する各テキストを取得する。具体的には、取得部５２は、対象キャプチャ画像を示す画像識別情報を検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストを取得する（ステップＳ１１３－４－３０２）。 Next, the acquisition unit 52 acquires each piece of text corresponding to the image identification information indicating the capture image to be processed. Specifically, the acquisition unit 52 acquires each piece of corresponding text by searching the text information management DB 5007 (see FIG. 40) using the image identification information indicating the target capture image as a search key (step S113-4-302).

次に、算出特定部５３は、取得した各テキストを検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応するテキストグループ(テキストグループＩＤ)を特定する（ステップＳ１１３－４－３０３）。 Next, the calculation/identification unit 53 searches the text information management DB 5007 (see FIG. 40) using each acquired text as a search key to identify the corresponding text group (text group ID) (step S113-4-303).

次に、判断部５５は、テキストグループ(テキストグループＩＤ)を特定できたかを判断する（ステップＳ１１３－４－３０４）。テキストグループ(テキストグループＩＤ)を判断できた場合（ステップＳ１１３－４－３０４：ＹＥＳ）、設定登録部５８は、テキスト情報管理ＤＢ５００７（図４０参照）で管理され、特定したテキストグループ(テキストグループＩＤ)に対応する全てのテキストを空白に設定し、削除フラグを「True」に設定する（ステップＳ１１３－４－３０５）。 Next, the judgment unit 55 judges whether the text group (text group ID) has been identified (step S113-4-304). If the text group (text group ID) has been identified (step S113-4-304: YES), the setting registration unit 58 sets all text managed in the text information management DB 5007 (see FIG. 40) and corresponding to the identified text group (text group ID) to blank, and sets the deletion flag to "True" (step S113-4-305).

続いて、取得部５２は、特定したテキストグループ(テキストグループＩＤ)に対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する。具体的には、取得部５２は、特定したテキストグループ(テキストグループＩＤ)を検索キーとしてテキスト情報管理ＤＢ５００７（図４０参照）を検索することにより、対応する各テキストの開始時刻及び終了時刻の各時刻情報を取得する（ステップＳ１１３－４－３０６）。 Then, the acquisition unit 52 acquires time information on the start time and end time of each text corresponding to the identified text group (text group ID). Specifically, the acquisition unit 52 acquires time information on the start time and end time of each corresponding text by searching the text information management DB 5007 (see FIG. 40) using the identified text group (text group ID) as a search key (step S113-4-306).

続いて、設定登録部５８は、取得した各時刻情報のうち、最も早い開始時刻及び最も遅い終了時刻の間の音声データを、記録書誌情報管理ＤＢ５００２（図６参照）から削除(無音化)する（ステップＳ１１３－４－３０７）。 Next, the setting registration unit 58 deletes (silences) the audio data between the earliest start time and the latest end time of each acquired piece of time information from the recorded bibliographic information management DB 5002 (see Figure 6) (step S113-4-307).

続いて、音声記録管理装置５は、図３８で定義した処理Ｚ(ステップＳ１１３－２－３０３からステップＳ１１３－２－３１２)を実行して（ステップＳ１１３－４－３０８）、このフローを抜ける。 Then, the voice recording management device 5 executes process Z (steps S113-2-303 to S113-2-312) defined in FIG. 38 (step S113-4-308) and exits this flow.

他方、テキストグループ(テキストグループＩＤ)を判断できなかった場合（ステップＳ１１３－４－３０４：ＮＯ）、音声記録管理装置５は、図３８で定義した処理Ｚ(ステップＳ１１３－２－３０３からステップＳ１１３－２－３１２)を実行して（ステップＳ１１３－４－３０８）、このフローを抜ける。 On the other hand, if the text group (text group ID) cannot be determined (step S113-4-304: NO), the voice recording management device 5 executes process Z (steps S113-2-303 to S113-2-312) defined in FIG. 38 (step S113-4-308) and exits this flow.
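The deletion steps S113-4-305 to S113-4-307 can be sketched in the same way. In the following Python sketch, the dictionary layout of a text record, the list-of-samples audio representation, and the function names `silence_range` and `delete_text_group` are hypothetical; the specification only states that the texts are blanked, the deletion flag is set, and the audio between the earliest start time and the latest end time is deleted (silenced).

```python
from typing import Dict, List


def silence_range(samples: List[int], sample_rate: int,
                  start_s: float, end_s: float) -> List[int]:
    """Return a copy of samples with the interval [start_s, end_s) zeroed."""
    lo = min(len(samples), max(0, int(start_s * sample_rate)))
    hi = min(len(samples), max(lo, int(end_s * sample_rate)))
    return samples[:lo] + [0] * (hi - lo) + samples[hi:]


def delete_text_group(group_texts: List[Dict], samples: List[int],
                      sample_rate: int) -> List[int]:
    """Blank every text in the group and silence the matching audio."""
    # S113-4-305: set each text to blank and set its deletion flag.
    for t in group_texts:
        t["text"] = ""
        t["deleted"] = True
    # S113-4-306: earliest start time and latest end time in the group.
    earliest = min(t["start"] for t in group_texts)
    latest = max(t["end"] for t in group_texts)
    # S113-4-307: silence the audio between those two times.
    return silence_range(samples, sample_rate, earliest, latest)
```

Overwriting the samples with zeros keeps the total length of the recording unchanged, so the start and end times of the remaining texts would stay valid under this assumption.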

〔第２の実施形態の主な効果〕 [Major Effects of the Second Embodiment]
以上説明したように本実施形態によれば、音声記録管理システムは、非公開、公開、削除を含む編集処理の対象となるテキストデータを纏めて設定する。これにより、第１の実施形態の効果に加えて、編集したい一以上のテキストデータを纏めて選択し、編集処理を行うことが可能になるため、音声記録の編集における操作性をさらに向上させることが可能になるという効果を奏する。 As described above, according to this embodiment, the voice recording management system collectively sets text data to be edited, including non-disclosure, disclosure, and deletion. In addition to the effects of the first embodiment, this makes it possible to collectively select one or more pieces of text data to be edited and perform editing on them, thereby further improving the operability of editing voice recordings.

更に、本実施形態によれば、編集処理を行いたい一以上のテキストが含まれるテキストグループに対して要約内容を設定するため、利用者は、その要約内容に応じた編集の内容を選択しやすくなるという効果も期待できる。つまり、利用者は、一以上のテキストを纏めた要約内容を参考にすることで、編集処理の対象となる音声記録の一部を公開とするか、非公開とするか、更には、要約された部分の音声記録自体を削除すべきかといった判断をしやすくなるという効果も期待できる。 Furthermore, according to this embodiment, since summary content is set for a text group that contains one or more pieces of text to be edited, it is expected that the user will be able to easily select the editing content according to the summary content. In other words, by referring to the summary content that combines one or more pieces of text, the user will be able to easily decide whether to make public or private the part of the audio recording that is the subject of the editing process, and even whether to delete the audio recording of the summarized part itself.

〔第３の実施形態〕 [Third Embodiment]
次に、図５２乃至図６８を用いて、第３の実施形態について説明する。第３の実施形態では、所定のテキストデータを編集処理した編集後テキストデータと所定の画像データを編集処理した編集後画像データとを含む編集後画面データで示される編集後画面に対して、所定の日時で分割する記録閲覧編集処理（記録閲覧編集画面の分割処理）を行うようにした。なお、第３の実施形態に係る通信システム１を構成する各ハードウエア資源は、第１の実施形態と同様である。ここで、図５２は、第３の実施形態に係る通信システムの機能構成の一例を示す図である。第３の実施形態に係る通信システムにおいても、各ハードウエア資源と協働で動作する各機能の機能構成は、第１の実施形態と同様である。そのため、図２７に示した記録閲覧編集画面の生成処理(ステップＳ１０１－Ｓ１０６)に係るシーケンス図は、第３の実施形態においても同様に適用される。 Next, the third embodiment will be described with reference to Figs. 52 to 68. In the third embodiment, a record viewing and editing process (a process of dividing the record viewing and editing screen) is performed in which the post-edit screen, represented by post-edit screen data including post-edit text data obtained by editing predetermined text data and post-edit image data obtained by editing predetermined image data, is divided at a predetermined date and time. Each hardware resource constituting the communication system 1 according to the third embodiment is the same as that of the first embodiment. Here, Fig. 52 is a diagram showing an example of the functional configuration of the communication system according to the third embodiment. In the communication system according to the third embodiment as well, the functional configuration of each function that operates in cooperation with each hardware resource is the same as that of the first embodiment. Therefore, the sequence diagram related to the generation process (steps S101-S106) of the record viewing and editing screen shown in Fig. 27 also applies to the third embodiment.

但し、第３の実施形態では、以下に説明する各データテーブルの内容が第１の実施形態と異なるため、それぞれの相違点について説明する。 However, in the third embodiment, the contents of each data table described below differ from those in the first embodiment, so the differences will be explained below.

●記録書誌情報管理テーブル● ●Record bibliographic information management table●
図５３は、記録書誌情報管理テーブルの一例を示す概念図である。記憶部５０００には、図５３に示されているような記録書誌情報管理テーブルによって構成された記録書誌情報管理ＤＢ５３０２が構築されている。第１の実施形態における記録書誌情報管理ＤＢ５００２との相違点は、各記録識別情報のタブごとに、「参加者」及び「会議メモ」の項目が追加されている点である。なお、後述する図５８等に記載された記録閲覧編集画面の中の概要（議題）欄に表示される内容は、記録書誌情報管理テーブルで管理されている「記録名称」の項目の内容に対応する。 Fig. 53 is a conceptual diagram showing an example of a record bibliographic information management table. In the storage unit 5000, a record bibliographic information management DB 5302 configured by the record bibliographic information management table shown in Fig. 53 is constructed. The difference from the record bibliographic information management DB 5002 in the first embodiment is that the items "Participants" and "Meeting Notes" are added to each tab of each record identification information. Note that the contents displayed in the summary (topic) column in the record viewing and editing screen shown in Fig. 58 etc. described later correspond to the contents of the "Record Name" item managed in the record bibliographic information management table.

●テキスト情報管理テーブル● ●Text information management table●
図５４は、第３の実施形態に係るテキスト情報管理テーブルの一例を示す概念図である。記憶部５０００には、図５４に示されているようなテキスト情報管理テーブルによって構成されたテキスト情報管理ＤＢ５３０３が構築されている。第１の実施形態におけるテキスト情報管理ＤＢ５００３との相違点は、各記録識別情報のタブごとに、ブックマークの項目が追加されている点である。このブックマークは、記録閲覧編集画面のそれぞれのテキストの近傍に表示されるブックマークボタン（アイコン）を利用者が操作することにより関連付けられるフラグである。このブックマークは、例えば、通常時「False」の値（フラグ）で管理され、上述したように利用者が所定のテキストに対してブックマークボタン（アイコン）を操作することにより、「True」の値（フラグ）で管理される。 FIG. 54 is a conceptual diagram showing an example of a text information management table according to the third embodiment. In the storage unit 5000, a text information management DB 5303 configured by a text information management table as shown in FIG. 54 is constructed. The difference from the text information management DB 5003 in the first embodiment is that a bookmark item is added to each tab of each record identification information. This bookmark is a flag that is associated with a text when the user operates the bookmark button (icon) displayed near that text on the record viewing and editing screen. For example, this bookmark is normally managed with a value (flag) of "False", and is managed with a value (flag) of "True" once the user operates the bookmark button (icon) for a specific text as described above.

●キャプチャ画像管理テーブル● ●Capture image management table●
図５５は、第３の実施形態に係るキャプチャ画像管理テーブルの一例を示す概念図である。記憶部５０００には、図５５に示されているようなキャプチャ画像管理テーブルによって構成されたキャプチャ画像管理ＤＢ５３０４が構築されている。第１の実施形態におけるキャプチャ画像管理ＤＢ５００４との相違点は、各記録識別情報のタブごとに、公開フラグ及び削除フラグの各項目が削除されている点である。 Fig. 55 is a conceptual diagram showing an example of a capture image management table according to the third embodiment. A capture image management DB 5304 configured by the capture image management table as shown in Fig. 55 is constructed in the storage unit 5000. The difference from the capture image management DB 5004 in the first embodiment is that the public flag and deletion flag items have been removed from each tab of each record identification information.

<<通信端末の各機能構成>> <<Functional configuration of communication terminal>>
第３の実施形態では、通信端末３において以下の各機能に係る詳細が追加されるため、それらの機能について詳細に説明する。 In the third embodiment, details regarding the following functions are added to the communication terminal 3, and these functions will be described in detail.

通信端末３の表示制御部３４は、主に、ディスプレイ３１８に対するＣＰＵ３０１の処理によって実現され、通信端末３における各種画面及び情報(データ)の表示制御を行う。また、表示制御部３４は、例えば、ブラウザを用いて、ＨＴＭＬ等により作成された表示画面を、ディスプレイ３１８に表示させる。また表示制御部３４は、音声記録管理装置５が送信した分割編集後画面データに係る分割編集後画面をディスプレイ３１８に表示する。また表示制御部３４は、音声記録管理装置５が送信した分割編集後画面データに含まれるブックマーク情報を分割編集後画面に含めて、ディスプレイ３１８に表示する。本実施形態において表示制御部３４は、表示制御手段の一例として機能する。 The display control unit 34 of the communication terminal 3 is mainly realized by the processing of the CPU 301 on the display 318, and controls the display of various screens and information (data) on the communication terminal 3. The display control unit 34 also causes the display 318 to display a display screen created by HTML or the like using a browser, for example. The display control unit 34 also displays a split edited screen related to the split edited screen data transmitted by the voice recording management device 5 on the display 318. The display control unit 34 also displays the split edited screen on the display 318, including bookmark information contained in the split edited screen data transmitted by the voice recording management device 5. In this embodiment, the display control unit 34 functions as an example of a display control means.

音声再生部３６は、主に、スピーカ３１６及び音入出力Ｉ／Ｆ３１７に対するＣＰＵ３０１の処理によって実現され、通信端末３を利用する利用者に対して音声情報(音声データ)又は音情報(音データ)を再生する。また音声再生部３６は、音声記録管理装置５が送信した分割編集後音声データに係る分割編集後音声を再生する。また音声再生部３６は、所定のテキストデータを分割した分割日時よりも前の日時であって、分割日時に最も近い日時に開始された発話の終了日時が分割日時を跨ぐ場合に、発話の終了日時を分割日時として分割され音声記録管理装置５が送信した分割編集後音声データに係る分割編集後音声を再生する。本実施形態において音声再生部３６は、音声再生手段の一例として機能する。 The audio playback unit 36 is mainly realized by the processing of the CPU 301 on the speaker 316 and the audio input/output I/F 317, and plays audio information (audio data) or sound information (sound data) for the user using the communication terminal 3. The audio playback unit 36 also plays the divided and edited audio related to the divided and edited audio data transmitted by the audio recording management device 5. In addition, when the utterance that started before the division date and time at which the predetermined text data was divided, and closest to that division date and time, ends after the division date and time (that is, the utterance straddles the division date and time), the audio playback unit 36 plays the divided and edited audio related to the divided and edited audio data that was divided using the end date and time of that utterance as the division date and time and transmitted by the audio recording management device 5. In this embodiment, the audio playback unit 36 functions as an example of an audio playback means.
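The boundary rule for the divided audio described above can be sketched as follows. This is a minimal Python illustration, assuming that utterances are available as `(start, end)` pairs in seconds; the function name `effective_split_time` is hypothetical and does not appear in this specification.

```python
from typing import List, Tuple


def effective_split_time(utterances: List[Tuple[float, float]],
                         split_time: float) -> float:
    """Move the split point to the end of an utterance that straddles it."""
    # Utterances that started before the requested division time.
    before = [u for u in utterances if u[0] < split_time]
    if not before:
        return split_time
    # Among those, the utterance whose start is closest to the division time.
    _start, end = max(before, key=lambda u: u[0])
    # If it ends after the division time, it straddles the boundary,
    # so its end time is used as the division time instead.
    return end if end > split_time else split_time
```

Under this assumption, an utterance running from 12 s to 31 s with a requested division at 30 s would move the audio division point to 31 s, so the utterance is not cut in the middle.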

<<音声記録管理装置の各機能構成>> <<Functional configuration of the voice recording management device>>
第３の実施形態では、音声記録管理装置５において以下の各機能に係る詳細が追加されるため、それらの機能について詳細に説明する。 In the third embodiment, details regarding the following functions are added to the voice recording management device 5, and these functions will be described in detail.

音声記録管理装置５の送受信部５１は、主に、近距離通信Ｉ／Ｆ５０８及びネットワークＩ／Ｆ５１１に対するＣＰＵ５０１の処理によって実現され、通信ネットワーク１００を介して通信端末３との間でそれぞれ各種データ(又は情報)の送受信を行う。また、送受信部５１は、通信端末３が送信した編集要求として、編集後画面に表示された所定の領域を分割するための分割操作指示要求を受信する。また、送受信部５１は、分割操作指示要求に応じて、所定のテキストデータを分割処理した分割編集後テキストデータと所定の画像データを分割処理した分割編集後画像データとを含む分割編集後画面データを、通信端末３に対して送信する。また、送受信部５１は、領域分割要求に応じて、分割編集後画面データに含まれる所定のテキストデータを含む所定のブックマーク情報を通信端末３に対して送信する。送受信部５１は更に、所定の音声データを分割処理した分割編集後音声データを、通信端末３に対して送信する。本実施形態において送受信部５１は、送信手段及び受信手段のうち少なくとも一方の手段の一例として機能する。 The transmission/reception unit 51 of the voice recording management device 5 is mainly realized by the processing of the CPU 501 for the short-range communication I/F 508 and the network I/F 511, and transmits and receives various data (or information) to and from the communication terminal 3 via the communication network 100. The transmission/reception unit 51 also receives a split operation instruction request for splitting a predetermined area displayed on the post-edit screen as an editing request transmitted by the communication terminal 3. In response to the split operation instruction request, the transmission/reception unit 51 also transmits to the communication terminal 3 split-edited screen data including split-edited text data obtained by splitting the predetermined text data and split-edited image data obtained by splitting the predetermined image data. In response to the area division request, the transmission/reception unit 51 also transmits to the communication terminal 3 predetermined bookmark information including the predetermined text data contained in the split-edited screen data. The transmission/reception unit 51 further transmits to the communication terminal 3 split-edited audio data obtained by splitting the predetermined audio data. In this embodiment, the transmission/reception unit 51 functions as an example of at least one of the transmission means and the reception means.

＜記録（イベント）選択処理＞ <Record (event) selection process>
続いて、第３の実施形態に係る記録（イベント）選択処理について説明する。上述したように、第３の実施形態では、第１の実施形態と同様に適用される記録閲覧編集画面の生成処理(ステップＳ１０１－Ｓ１０６)は、予め実行されていることを前提とするため、ここでの説明を省略する。 Next, a record (event) selection process according to the third embodiment will be described. As described above, in the third embodiment, the process for generating a record viewing and editing screen (steps S101-S106) that is applied in the same manner as in the first embodiment is assumed to have been executed in advance, and therefore a description thereof will be omitted here.

●画面表示例● ●Screen display example●
図５６は、第３の実施形態に係る通信端末における記録選択時の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ８９の処理が実行されることにより、表示制御部３４によって記録選択画面３３４１が表示される。記録選択画面３３４１には、例えば、会議等のイベントの記録内容(議事録等)を示す日付、イベントタイトル、議事録共有ボタン(アイコン)３５３５が一揃えとして選択可能な表示形態で表示される。これにより、例えば、議事録を作成する通信端末３(Ａ)の利用者は、任意の日付及びイベントタイトルで表された所定のイベントに対応付けられた議事録共有ボタン(アイコン)３５３５を操作して選択することができる。本実施形態では、「2021/3/31 11:00:00-12:00:00」を日付情報として与えられた「ヘルスケア事業業績報告会」のイベントに対応する議事録共有ボタン(アイコン)３５３５が、利用者によって選択された場合が示されている。具体的には、通信端末３(Ａ)の利用者が、記録選択画面３３４１中の所定のイベントタイトルをマウスオーバー操作によってマウスポインタ(カーソル)３７０１を翳すと、マウスポインタ(カーソル)３７０１によって翳されたイベントタイトルに対応付けられた議事録共有ボタン(アイコン)３５３５が表示される。そこで、通信端末３(Ａ)の利用者は、議事録共有ボタン(アイコン)３５３５をクリック等で操作することによって、所定のＵＲＬとパスコードを含むダイアログにアクセスすることが可能となる（ダイアログ画面は第１の実施形態と同様のため、説明を省略する）。通信端末３(Ａ)の利用者は、このダイアログに所定のＵＲＬとパスコードを入力することにより、第１の実施形態と同様に、記録閲覧編集画面へのアクセスが可能になる。 FIG. 56 is an example of a screen display when selecting a record in a communication terminal according to the third embodiment. The display control unit 34 displays a record selection screen 3341 on the display 318 of the communication terminal 3 by executing the process of step S89 described above. The record selection screen 3341 displays, for example, a date indicating the recorded contents (minutes, etc.) of an event such as a meeting, an event title, and a minutes share button (icon) 3535 as a set in a selectable display form. As a result, for example, a user of the communication terminal 3 (A) that creates minutes can operate and select the minutes share button (icon) 3535 associated with a specific event represented by an arbitrary date and event title. This embodiment shows a case in which the user has selected the minutes share button (icon) 3535 corresponding to the event "Healthcare Business Performance Report Meeting", whose date information is "2021/3/31 11:00:00-12:00:00". Specifically, when the user of the communication terminal 3 (A) hovers the mouse pointer (cursor) 3701 over a specific event title in the record selection screen 3341 (mouse-over operation), the minutes share button (icon) 3535 associated with that event title is displayed. The user of the communication terminal 3 (A) can then access a dialog including a specific URL and a passcode by clicking or otherwise operating the minutes share button (icon) 3535 (the dialog screen is the same as in the first embodiment, and therefore its description is omitted). By inputting the specific URL and passcode into this dialog, the user of the communication terminal 3 (A) can access the record viewing and editing screen, as in the first embodiment.

＜記録閲覧画面の編集処理＞ <Editing process on the record viewing screen>
次に、第３の実施形態に係る記録閲覧画面の編集処理について説明する。図５７は、第３の実施形態に係る記録閲覧画面の編集処理の一例を示すシーケンス図である。まず、通信端末３の表示制御部３４は、ディスプレイ３１８にステップＳ１０６で生成された記録閲覧編集画面を表示し、操作受付部３２は、利用者(作成者)により編集操作指示の一例としての分割操作を受け付ける（ステップＳ３１１）。なお、第３の実施形態における「分割」処理も、第１の実施形態及び第２の実施形態で示した「編集」処理の一例として扱われる。 Next, the editing process of the record viewing screen according to the third embodiment will be described. Fig. 57 is a sequence diagram showing an example of the editing process of the record viewing screen according to the third embodiment. First, the display control unit 34 of the communication terminal 3 displays the record viewing and editing screen generated in step S106 on the display 318, and the operation acceptance unit 32 accepts a split operation by the user (creator) as an example of an editing operation instruction (step S311). Note that the "split" process in the third embodiment is also treated as an example of the "edit" process shown in the first and second embodiments.

●画面表示例● ●Screen display example●
図５８は、第３の実施形態に係る作成者の通信端末の記録閲覧編集画面の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ３１１の処理（Ｓ１０６と同様の処理）が実行されることにより、表示制御部３４によって記録閲覧編集画面３３６１が表示される。記録閲覧編集画面３３６１には、図２５に示した記録閲覧編集画面３１５１と同様に、例えば、「概要(議題)」、「参加者」、「会議メモ」の各入力欄、音声再生表示部３６０１及びブックマーク表示欄３６０３が表示される。記録閲覧編集画面３３６１には、更に、少なくとも一以上のテキスト表示欄が表示される。記録閲覧編集画面３３６１には、更に、画面キャプチャ処理によってキャプチャ処理された画面３、画面４及び画面５が、それぞれキャプチャ処理された時刻ごとに、各テキスト表示欄に表示された各テキストと対応付けて表示される。なお、記録閲覧編集画面３３６１が表示される通信端末３の利用者は、例えば、音声記録（議事録）の作成者である「理光太郎」である。 FIG. 58 is a screen display example of the record viewing and editing screen of the creator's communication terminal according to the third embodiment. The display control unit 34 displays the record viewing and editing screen 3361 on the display 318 of the communication terminal 3 by executing the process of step S311 (similar to the process of S106) described above. Like the record viewing and editing screen 3151 shown in FIG. 25, the record viewing and editing screen 3361 displays, for example, the input fields "Summary (topic)", "Participants", and "Meeting notes", an audio playback display section 3601, and a bookmark display field 3603. The record viewing and editing screen 3361 further displays one or more text display fields. The record viewing and editing screen 3361 further displays screens 3, 4, and 5 captured by the screen capture process, each at its capture time and in association with the texts displayed in the text display fields. The user of the communication terminal 3 on which the record viewing and editing screen 3361 is displayed is, for example, "Rikotaro", the creator of the audio record (minutes).

記録閲覧編集画面３３６１では、例えば、各テキスト表示欄の近傍にマウス３７０１が置かれる操作(マウスオーバー操作)が行われた場合、表示制御部３４によって、「音声再生」ボタン(アイコン)３５４３、「削除」ボタン(アイコン)３５４４及び「ブックマーク」ボタン（アイコン）３６０２が、マウスオーバー操作されたテキスト表示欄の近傍に表示される。第３の実施形態で新たに設けられた「ブックマーク」ボタン（アイコン）３６０２は、利用者によって選択された所望のテキストに対してブックマークとしてブックマーク表示欄３６０３に表示させるためのボタン（アイコン）である。利用者は、「ブックマーク」ボタン（アイコン）３６０２を操作することにより、利用者が選択した特定のテキストをブックマーク表示欄３６０３に表示させることができる。 On the record viewing and editing screen 3361, for example, when the mouse 3701 is placed near each text display field (mouse-over operation), the display control unit 34 displays a "Play audio" button (icon) 3543, a "Delete" button (icon) 3544, and a "Bookmark" button (icon) 3602 near the text display field where the mouse was over. The "Bookmark" button (icon) 3602, which is newly provided in the third embodiment, is a button (icon) for displaying the desired text selected by the user as a bookmark in the bookmark display field 3603. By operating the "Bookmark" button (icon) 3602, the user can display the specific text selected by the user in the bookmark display field 3603.

なお、利用者が「ブックマーク」ボタン（アイコン）３６０２を操作（クリック）すると、通信端末３は、後述するように、テキスト識別情報、記録識別情報及びブックマーク要求を音声記録管理装置５に送信する。音声記録管理装置５は、テキスト識別情報と記録識別情報に基づいてテキスト情報管理ＤＢ５３０３（図５４参照）を参照し、当該テキストを特定して「ブックマーク」の項目の項目値を「True」に設定する。その後、音声記録管理装置５は、設定が成功した通知を通信端末３に送信する。この通知を受信することにより、通信端末３は、当該テキストの「ブックマーク」ボタン（アイコン）３６０２の表示態様をオフ（ＯＦＦ）からオン（ＯＮ）に切り替え、ブックマーク表示欄３６０３に表示させることが可能になる。 When the user operates (clicks) the "Bookmark" button (icon) 3602, the communication terminal 3 transmits the text identification information, the record identification information, and a bookmark request to the voice recording management device 5, as described below. The voice recording management device 5 refers to the text information management DB 5303 (see FIG. 54) based on the text identification information and the record identification information, identifies the text, and sets the value of its "Bookmark" item to "True". The voice recording management device 5 then transmits a notification to the communication terminal 3 that the setting succeeded. On receiving this notification, the communication terminal 3 can switch the display mode of the "Bookmark" button (icon) 3602 for the text from off (OFF) to on (ON), and display the text in the bookmark display field 3603.
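The bookmark exchange described above can be sketched as a small request handler. In this Python sketch the nested-dictionary layout standing in for the text information management DB 5303, the function name `handle_bookmark_request`, and the shape of the returned notification are assumptions for illustration only.

```python
from typing import Dict


def handle_bookmark_request(text_db: Dict[str, Dict[str, dict]],
                            record_id: str, text_id: str) -> dict:
    """Set the bookmark flag of one text and return a notification."""
    # Look up the table (tab) for the record, then the text within it.
    entry = text_db.get(record_id, {}).get(text_id)
    if entry is None:
        # The specification does not describe a failure case; this
        # error notification is a hypothetical addition.
        return {"status": "error", "record_id": record_id, "text_id": text_id}
    # Bookmark flag: normally False, set to True on the button operation.
    entry["bookmark"] = True
    return {"status": "ok", "record_id": record_id, "text_id": text_id}
```

On an "ok" notification, the terminal side would switch the button display from OFF to ON and add the text to the bookmark display field.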

記録閲覧編集画面３３６１では更に、テキストグループ及びそのテキストグループに対応する画像を区切るための所定の時間間隔（例えば、30秒）で区切られた区切り線（日時表示部（「11:02:00」等））の近傍にマウス３７０１が置かれる操作(マウスオーバー操作)が行われた場合に、例えば、ハサミの形状をした分割ボタン（アイコン）３６０４が表示される。この状態で利用者は、例えば、分割ボタン（アイコン）３６０４をクリックすることで、表示制御部３４は、次に説明する会議ログの分割ダイアログを分割ボタン（アイコン）３６０４の近傍にポップアップ表示させる。 Furthermore, on the record viewing and editing screen 3361, dividing lines (date and time display sections such as "11:02:00") are placed at predetermined time intervals (e.g., 30 seconds) to separate the text groups and the images corresponding to those text groups. When the mouse 3701 is placed near one of these dividing lines (mouse-over operation), a split button (icon) 3604, for example in the shape of scissors, is displayed. When the user clicks the split button (icon) 3604 in this state, the display control unit 34 pops up the conference log split dialog, described next, near the split button (icon) 3604.
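As a rough illustration of dividing lines placed at a fixed interval, the following Python sketch lists the candidate division times between the start and end of a recording. The function name `divider_times` and the choice to exclude both endpoints are assumptions; the specification only gives 30 seconds as an example interval.

```python
from datetime import datetime, timedelta
from typing import List


def divider_times(start: datetime, end: datetime,
                  interval_s: int = 30) -> List[datetime]:
    """Return the dividing-line times strictly between start and end."""
    step = timedelta(seconds=interval_s)
    times = []
    t = start + step
    while t < end:
        times.append(t)
        t += step
    return times
```

Each returned time would correspond to one date and time display section on the screen, and therefore to one possible position of the split button.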

具体的な処理の一例として、音声記録管理装置５では、分割ボタン（アイコン）３６０４と対応する区切り線（日時表示部）で示される日時とが画面の画面データにおいて対応付けられている。この状態で利用者により分割ボタン（アイコン）３６０４がクリックされると、通信端末３はこの分割ボタン（アイコン）３６０４に対応する日時情報を取得する。そして、利用者によりダイアログの「分割する」ボタン（アイコン）がクリックされると（ステップＳ３１１）、取得した分割日時情報、音声記録の記録識別情報、及び分割操作指示要求を音声記録管理装置５に送信する（ステップＳ３１２）。 As an example of specific processing, in the audio recording management device 5, the split button (icon) 3604 and the date and time indicated by the corresponding dividing line (date and time display section) are associated in the screen data of the screen. When the user clicks the split button (icon) 3604 in this state, the communication terminal 3 acquires the date and time information corresponding to this split button (icon) 3604. Then, when the user clicks the "Split" button (icon) in the dialog (step S311), the acquired split date and time information, the recording identification information of the audio recording, and a split operation instruction request are sent to the audio recording management device 5 (step S312).

●画面表示例● ●Screen display example●
図５９は、第３の実施形態に係る作成者の通信端末の記録閲覧編集画面の他の画面表示例である。通信端末３のディスプレイ３１８には、分割ボタン（アイコン）３６０４への操作が実行されることにより、表示制御部３４によって会議ログ分割ダイアログ３３６２が分割ボタン（アイコン）３６０４の近傍にポップアップ表示される。利用者は、会議ログ分割ダイアログ３３６２の「分割する」ボタン（アイコン）をマウス３７０１でクリック等の操作を行うことができる。 FIG. 59 is another screen display example of the record viewing and editing screen of the creator's communication terminal according to the third embodiment. When the split button (icon) 3604 is operated, the display control unit 34 pops up a conference log split dialog 3362 near the split button (icon) 3604 on the display 318 of the communication terminal 3. The user can then perform an operation such as clicking the "Split" button (icon) of the conference log split dialog 3362 with the mouse 3701.

図５７に戻り、送受信部３１は、音声記録管理装置５に対して、分割操作指示要求を送信する（ステップＳ３１２）。これにより、音声記録管理装置５の送受信部５１は、通信端末３が送信した分割操作指示要求を受信する。このとき、分割操作指示要求には、以下の三つの情報のうち、少なくとも一つの情報が含まれる。一つは、対象のテキストを識別するテキスト識別情報、対象の画像を識別する画像識別情報と上述した分割操作ボタンに対して行われた分割操作ボタン情報が含まれる。なお、分割操作指示要求は、通信端末３が音声記録管理装置５に対して送信する記録閲覧画面の分割処理を行うための要求の一例である。 Returning to FIG. 57, the transmission/reception unit 31 transmits a split operation instruction request to the audio recording management device 5 (step S312). As a result, the transmission/reception unit 51 of the audio recording management device 5 receives the split operation instruction request transmitted by the communication terminal 3. At this time, the split operation instruction request includes at least one of the following three pieces of information: text identification information that identifies the target text, image identification information that identifies the target image, and split operation button information indicating the operation performed on the split operation button described above. Note that the split operation instruction request is an example of a request, transmitted by the communication terminal 3 to the audio recording management device 5, to perform split processing on the record viewing screen.

次に、音声記録管理装置５は、ステップＳ３１２で受信した分割操作指示要求に対する応答として分割操作指示応答を送信する（ステップＳ３１３）。これにより、通信端末３の送受信部３１は、音声記録管理装置５が送信した分割操作指示応答を受信する。このとき、分割操作指示応答には、分割操作指示の受付ＩＤが含まれる。これにより、通信端末３は、後述する図６０に示すダイアログ画面をディスプレイ３１８に表示させることができる。 Next, the audio recording management device 5 transmits a split operation instruction response as a response to the split operation instruction request received in step S312 (step S313). As a result, the transmission/reception unit 31 of the communication terminal 3 receives the split operation instruction response transmitted by the audio recording management device 5. At this time, the split operation instruction response includes the acceptance ID of the split operation instruction. This allows the communication terminal 3 to display a dialog screen shown in FIG. 60 (described later) on the display 318.
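The request and response exchanged in steps S312 and S313 can be pictured as two small messages. The following Python sketch is illustrative only: the class names `SplitRequest` and `SplitResponse`, their field names, and the use of a random UUID as the acceptance ID are assumptions, since the specification does not define a message format or how the acceptance ID is generated.

```python
import uuid
from dataclasses import dataclass
from typing import Optional


@dataclass
class SplitRequest:
    """Hypothetical split operation instruction request (step S312)."""
    record_id: str                     # recording identification information
    split_time: str                    # division date and time
    text_id: Optional[str] = None      # text identification information
    image_id: Optional[str] = None     # image identification information
    button_info: Optional[str] = None  # split operation button information


@dataclass
class SplitResponse:
    """Hypothetical split operation instruction response (step S313)."""
    acceptance_id: str


def accept_split_request(req: SplitRequest) -> SplitResponse:
    """Validate the request and issue an acceptance ID."""
    # The request carries at least one of the three optional items.
    if not (req.text_id or req.image_id or req.button_info):
        raise ValueError("request carries none of the three items")
    return SplitResponse(acceptance_id=uuid.uuid4().hex)
```

Returning only an acceptance ID lets the terminal show its confirmation dialog immediately while the division itself proceeds separately on the device side.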

次に、操作受付部３２は、ダイアログ画面に対する「ＯＫ」ボタン操作を受け付ける（ステップＳ３１４）。具体的には、操作受付部３２は、ステップＳ３１３で送受信された分割操作指示の受付ＩＤに基づいて音声記録管理装置５により表示されたダイアログ画面に対する利用者による「ＯＫ」ボタンの操作を受け付ける。なお、利用者によって「ＯＫ」ボタンへの操作が行われると、表示制御部３４は操作受付部３２を介して、分割開始通知ダイアログ３３６３及び会議ログ分割ダイアログ３３６２を消去し、元の画面に遷移させる。なお、ステップＳ３１４の処理において、通信端末３は、所定のアプリを起動して、通信端末３で生成したダイアログ画面を表示するようにしてもよい。通信端末３は更に、ステップＳ３１４の処理において、ステップＳ３１３で音声記録管理装置５が送信したダイアログ画面に係る画面データを通信端末３に表示させるようにしてもよい。 Next, the operation acceptance unit 32 accepts the operation of the "OK" button on the dialog screen (step S314). Specifically, the operation acceptance unit 32 accepts the operation of the "OK" button by the user on the dialog screen displayed by the voice recording management device 5 based on the acceptance ID of the split operation instruction transmitted and received in step S313. When the user operates the "OK" button, the display control unit 34 erases the split start notification dialog 3363 and the conference log split dialog 3362 via the operation acceptance unit 32 and transitions to the original screen. In the process of step S314, the communication terminal 3 may start a predetermined application and display the dialog screen generated by the communication terminal 3. In the process of step S314, the communication terminal 3 may further display the screen data related to the dialog screen transmitted by the voice recording management device 5 in step S313 on the communication terminal 3.

●画面表示例● ●Screen display example●
図６０は、第３の実施形態に係る作成者の通信端末の記録閲覧編集画面の他の画面表示例である。通信端末３のディスプレイ３１８には、図５９で示した「分割する」ボタン（アイコン）への操作が実行されることにより、表示制御部３４によって例えば、分割開始通知ダイアログ３３６３が会議ログ分割ダイアログ３３６２に重畳するようにポップアップ表示される。これにより、利用者は、分割開始通知ダイアログ３３６３の「ＯＫ」ボタン（アイコン）をマウス３７０１でクリック等の操作を行うことができる。 Fig. 60 shows another screen display example of the record viewing and editing screen of the creator's communication terminal according to the third embodiment. When the "Split" button (icon) shown in Fig. 59 is operated, the display control unit 34 pops up, for example, a split start notification dialog 3363 on the display 318 of the communication terminal 3 so that it is superimposed on the conference log split dialog 3362. This allows the user to perform an operation such as clicking the "OK" button (icon) of the split start notification dialog 3363 with the mouse 3701.

再度図５７に戻り、音声記録管理装置５は、記録閲覧編集画面の分割処理を行う（ステップＳ３２１）。ここで、記録閲覧編集画面の分割処理には、上述した分割処理が含まれる。また、この記録閲覧編集画面の分割処理においては、記録書誌情報管理ＤＢ５３０２（図５３参照）、テキスト情報管理ＤＢ５３０３（図５４参照）及びキャプチャ画像管理ＤＢ５３０４（図５５参照）がそれぞれ用いられる。なお、ステップＳ３２１の処理は、上述したステップＳ３１４の処理と非同期で実行されるため、どちらの処理が先に実行されてもかまわない。 Returning to FIG. 57, the voice recording management device 5 performs a split process of the record viewing and editing screen (step S321). Here, the split process of the record viewing and editing screen includes the split process described above. In addition, in this split process of the record viewing and editing screen, the record bibliographic information management DB 5302 (see FIG. 53), the text information management DB 5303 (see FIG. 54), and the capture image management DB 5304 (see FIG. 55) are each used. Note that the process of step S321 is executed asynchronously with the process of step S314 described above, so it does not matter which process is executed first.

<<記録閲覧編集画面の分割処理の詳細>>
ここで、ステップＳ３２１の記録閲覧編集画面の分割処理の詳細について説明する。図６１は、第３の実施形態に係る記録閲覧編集画面の分割処理の一例を示すフローチャートである。まず、音声記録管理装置５の送受信部５１は、音声記録の記録識別情報、分割時刻及び分割要求を受信する（ステップＳ３２１－１）。なお、ステップＳ３２１－１の処理は、上述したステップＳ３１２の処理に相当し、分割要求にはテキスト識別情報、画像識別情報、分割操作ボタン情報が含まれる。 <<Details on splitting the record viewing and editing screen>>
Here, the details of the division process of the record viewing and editing screen in step S321 will be described. Fig. 61 is a flow chart showing an example of the division process of the record viewing and editing screen according to the third embodiment. First, the transmission/reception unit 51 of the voice recording management device 5 receives the recording identification information of the voice recording, the division time, and a division request (step S321-1). Note that the process of step S321-1 corresponds to the process of step S312 described above, and the division request includes text identification information, image identification information, and division operation button information.

次に、算出特定部５３は、テキスト情報管理テーブル（テキスト情報管理ＤＢ５３０３（図５４参照））から記録識別情報に対応するものを特定し、開始時刻が分割時刻より前のレコードと分割時刻以降のレコードとを特定する（ステップＳ３２１－２）。具体的には、算出特定部５３は、テキスト識別情報及び分割時刻を検索キーとしてテキスト情報管理ＤＢ５３０３を検索することにより、開始時刻が分割時刻より前のレコードと、分割時刻以降のレコードを特定する。 The calculation identification unit 53 then identifies the text information management table (text information management DB 5303 (see FIG. 54)) that corresponds to the record identification information, and identifies records whose start times are before the division time and records whose start times are after the division time (step S321-2). Specifically, the calculation identification unit 53 searches the text information management DB 5303 using the text identification information and the division time as search keys, thereby identifying records whose start times are before the division time and records whose start times are after the division time.
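
As a non-limiting illustration, the identification performed in step S321-2 can be sketched as below. The sketch assumes that each record holds its start and end times as elapsed seconds from the start of the event. The names `TextRecord` and `partition_by_start` are hypothetical, and the sample times are assumed values chosen to be consistent with FIG. 62; none of them are taken from the embodiment.

```python
from dataclasses import dataclass


@dataclass
class TextRecord:
    text_id: str    # text identification information
    start_sec: int  # start time, as elapsed seconds from the start of the event
    end_sec: int    # end time, as elapsed seconds from the start of the event


def partition_by_start(records, split_sec):
    """Separate records that start before split_sec from those that start at or after it."""
    before = [r for r in records if r.start_sec < split_sec]
    after = [r for r in records if r.start_sec >= split_sec]
    return before, after


# Assumed sample data for one recording, divided at 2 min 00 s (120 s).
records = [
    TextRecord("TX3006", 119, 122),
    TextRecord("TX3007", 125, 132),
    TextRecord("TX3008", 138, 143),
    TextRecord("TX3009", 145, 148),
]
before, after = partition_by_start(records, 120)
print([r.text_id for r in before])  # ['TX3006']
print([r.text_id for r in after])   # ['TX3007', 'TX3008', 'TX3009']
```

A record is classified by its start time only, so an utterance that begins before the division time and ends after it falls into the first group.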

次に、設定登録部５８は、それぞれに基づいてテキスト情報管理テーブルを更新して登録する（ステップＳ３２１－３）。具体的には、設定登録部５８は、ステップＳ３２１－２で特定された開始時刻が分割時刻より前のレコードと終了時刻が分割時刻以降のレコードとに基づいて、新しく設定された分割前の記録識別情報のタブと分割後の記録識別情報のタブとで関連付けられた二つのテキスト情報管理テーブル（テキスト情報管理ＤＢ５３０３）を更新して登録する。 Next, the setting registration unit 58 updates and registers the text information management table based on each (step S321-3). Specifically, the setting registration unit 58 updates and registers two text information management tables (text information management DB 5303) associated with the newly set tabs for the pre-split record identification information and the post-split record identification information based on the records with start times before the split time and the records with end times after the split time identified in step S321-2.

図６２は、第３の実施形態に係る記録閲覧編集画面の分割処理後のテキスト情報管理テーブルの一例を示す概念図で、（a）は分割された一つ目の音声記録画面を構成するテキスト情報管理テーブルの概念図、(b)は分割された二つ目の音声記録画面を構成するテキスト情報管理テーブルの概念図である。図６２（a）では、新しく設定された分割後の記録識別情報「R5301」のタブに関連付けられたテキスト情報管理テーブルが示されている。具体的には、テキスト識別情報「TX3006」に対応する終了時刻が、設定登録部５８によって「2分00秒」に更新される。このとき、「2分00秒」は、図５８で示した日時表示部（「11:02:00」）で分割された場合の日時情報に対応する時間である。なお、テキスト情報識別情報に対応するそれぞれのブックマークの項目はすべて「False」が設定されており、このテーブルで管理されている発話内容については、ブックマーク処理が行われなかったことを示している。 Figure 62 is a conceptual diagram showing an example of a text information management table after the split processing of the record viewing and editing screen according to the third embodiment, where (a) is a conceptual diagram of the text information management table constituting the first split voice recording screen, and (b) is a conceptual diagram of the text information management table constituting the second split voice recording screen. Figure 62 (a) shows the text information management table associated with the tab of the newly set post-split record identification information "R5301". Specifically, the end time corresponding to the text identification information "TX3006" is updated to "2 minutes 00 seconds" by the setting registration unit 58. At this time, "2 minutes 00 seconds" is the time corresponding to the date and time information when split at the date and time display unit ("11:02:00") shown in Figure 58. Note that all of the bookmark items corresponding to the text information identification information are set to "False", indicating that the bookmark processing was not performed for the spoken content managed in this table.

図６２（ｂ）では、新しく設定された分割後の記録識別情報「R5302」のタブに関連付けられたテキスト情報管理テーブルが示されている。具体的には、テキスト識別情報「TX3007」、「TX3008」、「TX3009」、・・・にそれぞれ対応する開始時刻及び終了時刻が、設定登録部５８によって更新して登録される。より具体的には、テキスト識別情報「TX3007」に対応する開始時刻は「0分5秒」、終了時刻は「0分12秒」となる。これらの時刻は、「まず共有画面をご覧ください。」の内容の発話が開始された開始日時（開始時刻）及び終了された終了日時（終了時刻）を表している。同様に、テキスト識別情報「TX3008」に対応する開始時刻は「0分18秒」、終了時刻は「0分23秒」となる。これらの時刻は、「2020年度下期の売り上げは○○です。」の内容の発話が開始及び終了された各時刻を表している。同様に、テキスト識別情報「TX3009」に対応する開始時刻は「0分25秒」、終了時刻は「0分28秒」となる。これらの時刻は、「営業利益は△△です。」の内容の発話が開始及び終了された各時刻を表している。なお、テキスト情報識別情報に対応するブックマークの項目のうち、テキスト識別情報「TX3008」に対応するブックマークに「True」が設定されている。つまり、このテーブルで管理されている発話内容については、テキスト識別情報「TX3008」に対応するテキストに対してブックマーク処理が行われたことを示している。 In FIG. 62(b), the text information management table associated with the tab of the newly set post-division record identification information "R5302" is shown. Specifically, the start time and end time corresponding to each of the text identification information "TX3007", "TX3008", "TX3009", ... are updated and registered by the setting registration unit 58. More specifically, the start time corresponding to the text identification information "TX3007" is "0 minutes 5 seconds" and the end time is "0 minutes 12 seconds". These times represent the start date and time (start time) when the speech of the content "Please look at the shared screen first" started and the end date and time (end time) when the speech of the content "Please look at the shared screen first" ended. Similarly, the start time corresponding to the text identification information "TX3008" is "0 minutes 18 seconds" and the end time is "0 minutes 23 seconds". These times represent the times when the speech of the content "Sales for the second half of fiscal year 2020 are XX" started and ended. Similarly, the start time corresponding to text identification information "TX3009" is "0 minutes 25 seconds" and the end time is "0 minutes 28 seconds." These times represent the times when the speech of "Operating profit is △△" started and ended. 
Note that, among the bookmark items corresponding to text information identification information, the bookmark corresponding to text identification information "TX3008" is set to "True." In other words, this indicates that for the speech content managed in this table, bookmark processing was performed on the text corresponding to text identification information "TX3008."
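
The time updates shown in FIG. 62 may be sketched, for example, as follows. This is only one possible reading: it assumes that the end time of a record straddling the division time is cut back to the division time in the first table, and that the times in the second table are re-based so that the division time becomes 0 seconds. The function name `update_after_split`, the dictionary layout, and the pre-division times are assumptions made for illustration.

```python
def update_after_split(before, after, split_sec):
    """Return the rows of the two post-division text tables.

    Each row is a dict with the keys text_id, start, end (elapsed seconds) and bookmark.
    """
    # First table: a record that straddles the division time ends at the division time.
    first = [{**r, "end": min(r["end"], split_sec)} for r in before]
    # Second table: times are re-based so that the division time becomes 0 s.
    second = [
        {**r, "start": r["start"] - split_sec, "end": r["end"] - split_sec}
        for r in after
    ]
    return first, second


# Assumed pre-division rows; the division time is 2 min 00 s (120 s).
before = [{"text_id": "TX3006", "start": 119, "end": 122, "bookmark": False}]
after = [
    {"text_id": "TX3007", "start": 125, "end": 132, "bookmark": False},
    {"text_id": "TX3008", "start": 138, "end": 143, "bookmark": True},
    {"text_id": "TX3009", "start": 145, "end": 148, "bookmark": False},
]
first, second = update_after_split(before, after, 120)
print(first[0]["end"])                            # 120
print([(r["start"], r["end"]) for r in second])   # [(5, 12), (18, 23), (25, 28)]
```

The bookmark flag is copied unchanged, so the bookmark set on "TX3008" stays with the second table.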

図６１に戻り、算出特定部５３は、キャプチャ画像管理テーブル（キャプチャ画像管理ＤＢ５３０４（図５５参照））から記録識別情報に対応するものを特定し、取得時刻が分割時刻より前のレコードと、分割時刻以降のレコードとを特定する（ステップＳ３２１－４）。具体的には、算出特定部５３は、分割ボタン（アイコン）３６０４が操作された日時表示部に表示された日時に対応する取得時刻を検索キーとしてキャプチャ画像管理ＤＢ５３０４を検索することにより、対応する画像データパスを読み出して特定する。このとき、算出特定部５３は、例えば、記録書誌情報管理ＤＢ５００２（図５３参照）の開始日時と経過時間とに基づいて、実際の開始時刻を算出する。これは、テキスト情報管理ＤＢ５３０３（図５４参照）及びキャプチャ画像管理ＤＢ５３０４（図５５参照）において、会議等のイベント開始時刻からの経過時間が記録、管理されているためであり、算出特定部５３は、記録書誌情報管理ＤＢ５００２の開始日時と経過時間とに基づいて、実際の開始時刻を算出する。 Returning to FIG. 61, the calculation identification unit 53 identifies the record identification information from the capture image management table (capture image management DB 5304 (see FIG. 55)) and identifies records whose acquisition time is before the division time and records whose acquisition time is after the division time (step S321-4). Specifically, the calculation identification unit 53 searches the capture image management DB 5304 using as a search key the acquisition time corresponding to the date and time displayed in the date and time display unit when the division button (icon) 3604 is operated, thereby reading and identifying the corresponding image data path. At this time, the calculation identification unit 53 calculates the actual start time based on, for example, the start date and time and the elapsed time in the record bibliographic information management DB 5002 (see FIG. 53). This is because the elapsed time from the start time of an event such as a meeting is recorded and managed in the text information management DB 5303 (see FIG. 54) and the capture image management DB 5304 (see FIG. 55), and the calculation identification unit 53 calculates the actual start time based on the start date and time and the elapsed time in the record bibliographic information management DB 5002.
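
The conversion between elapsed time and actual time described above amounts to simple date arithmetic. A minimal sketch follows, assuming that the start date and time of the event is "2021/03/31 11:00:00"; the function names are hypothetical.

```python
from datetime import datetime, timedelta


def to_actual_time(event_start, elapsed_sec):
    """Actual time of a record, from the event start and the stored elapsed time."""
    return event_start + timedelta(seconds=elapsed_sec)


def to_elapsed_sec(event_start, actual_time):
    """Elapsed seconds from the event start for a time shown in a date and time display."""
    return int((actual_time - event_start).total_seconds())


event_start = datetime(2021, 3, 31, 11, 0, 0)
print(to_actual_time(event_start, 120))                               # 2021-03-31 11:02:00
print(to_elapsed_sec(event_start, datetime(2021, 3, 31, 11, 2, 0)))  # 120
```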

次に、設定登録部５８は、それぞれに基づいてキャプチャ画像管理テーブルを更新して登録する（ステップＳ３２１－５）。具体的には、設定登録部５８は、ステップＳ３２１－４で特定された分割時刻より前のレコードと、分割時刻以降のレコードとに基づいて、新しく設定された分割前の記録識別情報のタブと分割後の記録識別情報のタブとで関連付けられた二つのキャプチャ画像管理テーブル（キャプチャ画像管理ＤＢ５３０４）を更新して登録する。 Next, the setting registration unit 58 updates and registers the capture image management table based on each set of records (step S321-5). Specifically, based on the records before the division time and the records at or after the division time identified in step S321-4, the setting registration unit 58 updates and registers the two capture image management tables (capture image management DB 5304) associated with the newly set tabs for the pre-division and post-division record identification information.

図６３は、第３の実施形態に係る記録閲覧編集画面の分割処理後のキャプチャ画像管理テーブルの一例を示す概念図で、（a）は分割された一つ目の音声記録画面を構成するキャプチャ画像管理テーブルの概念図、(b)は分割された二つ目の音声記録画面を構成するキャプチャ画像管理テーブルの概念図である。図６３(a)では、新しく設定された分割後の記録識別情報「R5301」のタブに関連付けられたキャプチャ画像管理テーブルが示されている。具体的には、画像識別情報「IM0003」、「IM0004」、・・・に対応する各取得時刻及び各画像データパスが、設定登録部５８によって予め登録されていた内容で引き継がれている状態を示している。 Figure 63 is a conceptual diagram showing an example of a capture image management table after splitting processing of the record viewing and editing screen according to the third embodiment, where (a) is a conceptual diagram of a capture image management table constituting the first split voice recording screen, and (b) is a conceptual diagram of a capture image management table constituting the second split voice recording screen. Figure 63(a) shows a capture image management table associated with the tab of the newly set post-split record identification information "R5301". Specifically, it shows a state in which each acquisition time and each image data path corresponding to image identification information "IM0003", "IM0004", ... are inherited from the contents previously registered by the setting registration unit 58.

図６３(b)では、新しく設定された分割後の記録識別情報「R5302」のタブに関連付けられたキャプチャ画像管理テーブルが示されている。具体的には、画像識別情報「IM0005」に対応する取得時刻が、設定登録部５８によって「0分00秒」に更新される。より具体的には、分割された二つ目の音声記録に関しては、分割された日時を示す日時表示部が「11:02:00」を示していたことから、この日時に対応する取得時刻「2分00秒」は、設定登録部５８によって「0分00秒」に更新される。 Figure 63(b) shows the capture image management table associated with the tab for the newly set post-split recording identification information "R5302". Specifically, the acquisition time corresponding to the image identification information "IM0005" is updated to "0 minutes 00 seconds" by the setting registration unit 58. More specifically, for the second split audio recording, since the date and time display showing the split date and time showed "11:02:00", the acquisition time "2 minutes 00 seconds" corresponding to this date and time is updated to "0 minutes 00 seconds" by the setting registration unit 58.

再び図６１に戻り、算出特定部５３は、記録書誌情報管理テーブル（記録書誌情報管理ＤＢ５３０２（図５３参照））から記録識別情報に対応する記録書誌情報を特定する（ステップＳ３２１－６）。具体的には、算出特定部５３は、分割された日時を示す日時表示部が「11:02:00」を示していたことから、この日時を含む記録書誌情報を有する記録書誌情報のタブを特定する。 Returning to FIG. 61, the calculation and identification unit 53 identifies the recorded bibliographic information corresponding to the record identification information from the recorded bibliographic information management table (recorded bibliographic information management DB 5302 (see FIG. 53)) (step S321-6). Specifically, since the date and time display portion showing the split date and time shows "11:02:00", the calculation and identification unit 53 identifies the tab of the recorded bibliographic information having the recorded bibliographic information including this date and time.

次に、取得部５２は、音声データパスから音声データを取得する（ステップＳ３２１－７）。具体的には、取得部５２は、分割された日時を示す日時表示部が「11:02:00」を示していたことから、この日時を含む記録書誌情報を有する記録書誌情報のタブで管理されている音声データパスを取得する。この場合、取得される音声データパスは、例えば、「・・・/00005006/record.mp3」である。 Next, the acquisition unit 52 acquires the audio data from the audio data path (step S321-7). Specifically, since the date and time display portion showing the split date and time indicates "11:02:00", the acquisition unit 52 acquires the audio data path managed in the tab of the record bibliographic information having the record bibliographic information including this date and time. In this case, the acquired audio data path is, for example, ".../00005006/record.mp3".

次に、生成・処理部５７は、分割時刻より前の部分音声データと、分割時刻以降の部分音声データとを生成し、それぞれに対応するイベントＵＲＬを生成する（ステップＳ３２１－８）。 Next, the generation/processing unit 57 generates partial audio data before the division time and partial audio data after the division time, and generates event URLs corresponding to each (step S321-8).
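
The generation of the two pieces of partial audio data in step S321-8 can be illustrated as below. The embodiment stores the audio as "record.mp3"; the sketch instead uses uncompressed WAV data, purely so that it can run with the Python standard library `wave` module. The function names and the short silent test clip are assumptions for illustration and are not part of the embodiment.

```python
import io
import wave


def split_wav(data, split_sec):
    """Split WAV bytes into the part before split_sec and the part from split_sec onward."""
    with wave.open(io.BytesIO(data), "rb") as src:
        params = src.getparams()
        total = src.getnframes()
        split_frame = min(int(split_sec * src.getframerate()), total)
        head = src.readframes(split_frame)          # frames before the division time
        tail = src.readframes(total - split_frame)  # frames from the division time onward

    def pack(frames):
        buf = io.BytesIO()
        with wave.open(buf, "wb") as dst:
            dst.setnchannels(params.nchannels)
            dst.setsampwidth(params.sampwidth)
            dst.setframerate(params.framerate)
            dst.writeframes(frames)
        return buf.getvalue()

    return pack(head), pack(tail)


def wav_duration(data):
    """Duration in seconds of WAV bytes."""
    with wave.open(io.BytesIO(data), "rb") as w:
        return w.getnframes() / w.getframerate()


def silent_wav(seconds, rate=8000):
    """Build a mono, 16-bit silent clip to use as test input."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(rate)
        w.writeframes(b"\x00\x00" * (seconds * rate))
    return buf.getvalue()


head, tail = split_wav(silent_wav(5), 2)
print(wav_duration(head), wav_duration(tail))  # 2.0 3.0
```

Splitting on a frame boundary keeps both parts playable on their own, which is what allows each part to be registered under a separate audio data path.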

次に、設定登録部５８は、音声データのパスとイベントＵＲＬとに基づいて記録書誌情報管理テーブルを更新して登録し、このフローを抜ける（ステップＳ３２１－９）。 Next, the setting registration unit 58 updates and registers the recorded bibliographic information management table based on the path of the audio data and the event URL, and then exits this flow (step S321-9).

図６４Ａは、第３の実施形態に係る記録閲覧編集画面の分割処理後の記録書誌情報管理テーブルの一例を示す概念図で、分割された一つ目の音声記録画面を構成する記録書誌情報管理テーブルの概念図である。図６４Ａでは、新しく設定された分割後の記録識別情報「R5301」のタブに関連付けられた記録書誌情報管理テーブルが示されている。具体的には、設定登録部５８によって記録名称が「分割1_ヘルスケア事業業績報告会」に更新される。更に、設定登録部５８によって終了日時が「2021/03/31 11:02:00」に更新される。更に、設定登録部５８によって音声データパスが「・・・/00005301/record.mp3」に更新される。更に、イベントＵＲＬが、設定登録部５８によって「https://・・・/00005301」に更新される。 Figure 64A is a conceptual diagram showing an example of a record bibliographic information management table after the division process of the record viewing and editing screen according to the third embodiment, and is a conceptual diagram of the record bibliographic information management table constituting the first divided voice recording screen. In Figure 64A, the record bibliographic information management table associated with the tab of the newly set post-division record identification information "R5301" is shown. Specifically, the setting registration unit 58 updates the record name to "Division 1_Healthcare Business Performance Reporting Meeting". Furthermore, the setting registration unit 58 updates the end date and time to "2021/03/31 11:02:00". Furthermore, the setting registration unit 58 updates the voice data path to ".../00005301/record.mp3". Furthermore, the setting registration unit 58 updates the event URL to "https://.../00005301".

図６４Ｂは、第３の実施形態に係る記録閲覧編集画面の分割処理後の記録書誌情報管理テーブルの一例を示す概念図で、分割された二つ目の音声記録画面を構成する記録書誌情報管理テーブルの概念図である。図６４Ｂでは、新しく設定された分割後の記録識別情報「R5302」のタブに関連付けられた記録書誌情報管理テーブルが示されている。具体的には、設定登録部５８によって記録名称が「分割2_ヘルスケア事業業績報告会」に更新される。更に、設定登録部５８によって開始日時が「2021/03/31 11:02:00」に更新される。更に、設定登録部５８によって音声データパスが「・・・/00005302/record.mp3」に更新される。更に、イベントＵＲＬが、設定登録部５８によって「https://・・・/00005302」に更新される。 Figure 64B is a conceptual diagram showing an example of a record bibliographic information management table after the division process of the record viewing and editing screen according to the third embodiment, which is a conceptual diagram of the record bibliographic information management table constituting the second divided audio recording screen. In Figure 64B, the record bibliographic information management table associated with the tab of the newly set post-division record identification information "R5302" is shown. Specifically, the setting registration unit 58 updates the record name to "Division 2_Healthcare Business Performance Reporting Meeting". Furthermore, the setting registration unit 58 updates the start date and time to "2021/03/31 11:02:00". Furthermore, the setting registration unit 58 updates the audio data path to ".../00005302/record.mp3". Furthermore, the setting registration unit 58 updates the event URL to "https://.../00005302".
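
The updates shown in FIG. 64A and FIG. 64B can be summarized in a short sketch. The class `BiblioRecord`, the function `split_biblio`, the end date and time of the original recording, and the event URL of the original recording are assumptions for illustration. The elided portions of the paths and URLs are left elided, as in the figures.

```python
from dataclasses import dataclass, replace
from datetime import datetime


@dataclass(frozen=True)
class BiblioRecord:
    record_id: str
    name: str
    start: datetime
    end: datetime
    audio_path: str
    event_url: str


def split_biblio(orig, split_at, targets):
    """Derive the two post-division records from the original record.

    targets holds two (record_id, audio_path, event_url) tuples, one per new record.
    """
    (id1, path1, url1), (id2, path2, url2) = targets
    first = replace(orig, record_id=id1, name=f"分割1_{orig.name}",
                    end=split_at, audio_path=path1, event_url=url1)
    second = replace(orig, record_id=id2, name=f"分割2_{orig.name}",
                     start=split_at, audio_path=path2, event_url=url2)
    return first, second


orig = BiblioRecord(
    "R5006", "ヘルスケア事業業績報告会",
    datetime(2021, 3, 31, 11, 0, 0),
    datetime(2021, 3, 31, 11, 30, 0),   # assumed end of the original event
    ".../00005006/record.mp3",
    "https://.../00005006",             # assumed; not given in the description
)
first, second = split_biblio(
    orig,
    datetime(2021, 3, 31, 11, 2, 0),
    [("R5301", ".../00005301/record.mp3", "https://.../00005301"),
     ("R5302", ".../00005302/record.mp3", "https://.../00005302")],
)
print(first.name, first.end)      # 分割1_ヘルスケア事業業績報告会 2021-03-31 11:02:00
print(second.name, second.start)  # 分割2_ヘルスケア事業業績報告会 2021-03-31 11:02:00
```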

なお、上述したステップＳ３２１において特定、更新された各レコードを含む分割後の音声記録画面は、利用者が分割された所望のイベントを示すイベントＵＲＬとパスコードとを入力した場合（上述したステップＳ１０１の処理が実行された場合）に、通信端末３において表示される。 The divided voice recording screen including each record identified and updated in step S321 described above is displayed on the communication terminal 3 when the user inputs the event URL indicating the desired divided event and the passcode (when the processing of step S101 described above is executed).

＜分割後の記録選択画面表示＞
続いて、分割後の記録（イベント）の選択、及び選択されたそれぞれの選択画面の表示について説明する。 <Recording selection screen display after division>
Next, the selection of a divided record (event) and the display of the screen for each selected record will be described.

●画面表示例●
図６５は、第３の実施形態に係る通信端末における記録閲覧編集画面の分割処理後の記録選択時の画面表示例である。通信端末３のディスプレイ３１８には、上述したステップＳ８９の処理が実行されることにより、表示制御部３４によって記録選択画面３３４１が表示される。第３の実施形態に係る記録選択画面３３４１には、図５６で説明した内容に加えて、分割された二つのイベント新たに表示される。本実施形態では、「2021/3/31 11:00:00-11:02:00」を日付情報として与えられた「分割１_ヘルスケア事業業績報告会」と、「2021/3/31 11:02:00-11:02:00」を日付情報として与えられた「分割２_ヘルスケア事業業績報告会」の二つのイベントタイトル及びそれらに対応する議事録共有ボタン(アイコン)が新たに表示される。これにより、議事録を作成する通信端末３(Ａ)の利用者は、新たに表示されたイベントのいずれかのイベントタイトルをマウスオーバー操作によってマウスポインタ(カーソル)３７０１を翳すことにより、マウスポインタ(カーソル)３７０１によって翳されたイベントタイトルに対応付けられた議事録共有ボタン(アイコン)を操作することができる。そこで、通信端末３(Ａ)の利用者は、所望のイベントの議事録共有ボタン(アイコン)３５３５をクリック等で操作することによって、所定のＵＲＬとパスコードを含むダイアログにアクセスすることが可能となる。なお、ポップアップ表示されるダイアログ画面は第１の実施形態と同様のため、その説明を省略する。通信端末３(Ａ)の利用者は、このダイアログに所定のＵＲＬとパスコードを入力することにより、第１の実施形態と同様に、記録閲覧編集画面へのアクセスが可能になる。また、利用者は、所望のイベントタイトルを操作（クリック、タップ等）することにより、操作したイベントの音声記録画面を開くことができる。図６５では、「分割１_ヘルスケア事業業績報告会」のイベントに対する操作が利用者によって行われる例が示されている。ここで、音声記録管理装置５は、議事録共有ボタン(アイコン)を操作した利用者と分割された音声記録に参加した利用者の識別情報を管理することにより、議事録共有ボタン(アイコン)を操作した利用者が分割された議事録に係るイベントの閲覧を制限するようにしてもよい。具体的には、例えば、議事録共有ボタン(アイコン)を操作した利用者が分割された議事録で示されるイベントに参加していない場合は、音声記録管理装置５は、その利用者に対して分割された議事録の閲覧を禁止するようにしてもよい。なお、本実施形態においては、イベントに係る音声記録の分割は二つに限らず、三つ以上の分割を行うものであってもよい。 ●Screen display example●
FIG. 65 is an example of a screen display when selecting a record after the division process of the record viewing and editing screen in the communication terminal according to the third embodiment. When the process of step S89 described above is executed, the display control unit 34 displays a record selection screen 3341 on the display 318 of the communication terminal 3. In addition to the contents described in FIG. 56, the record selection screen 3341 according to the third embodiment newly displays the two divided events. In this embodiment, two event titles are newly displayed together with their corresponding minutes sharing buttons (icons): "Division 1_Healthcare Business Performance Reporting Meeting", given the date information "2021/3/31 11:00:00-11:02:00", and "Division 2_Healthcare Business Performance Reporting Meeting", given the date information "2021/3/31 11:02:00-11:02:00". The user of the communication terminal 3(A) who creates the minutes can hover the mouse pointer (cursor) 3701 over the event title of either newly displayed event and then operate the minutes sharing button (icon) associated with that event title. By clicking or otherwise operating the minutes sharing button (icon) 3535 of the desired event, the user of the communication terminal 3(A) can access a dialog containing a predetermined URL and passcode. The dialog screen that pops up is the same as in the first embodiment, so its description is omitted. By entering the predetermined URL and passcode into this dialog, the user of the communication terminal 3(A) can access the record viewing and editing screen, as in the first embodiment. The user can also open the voice recording screen of an event by operating (clicking, tapping, etc.) the desired event title. FIG. 65 shows an example in which the user operates the event "Division 1_Healthcare Business Performance Reporting Meeting". Here, the voice recording management device 5 may manage the identification information of the user who operated the minutes sharing button (icon) and of the users who participated in each divided voice recording, and thereby restrict that user from viewing the event related to the divided minutes. Specifically, for example, if the user who operated the minutes sharing button (icon) did not participate in the event indicated by the divided minutes, the voice recording management device 5 may prohibit that user from viewing the divided minutes. Note that in this embodiment, the voice recording related to an event is not limited to being divided into two parts and may be divided into three or more.
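
The optional viewing restriction described above could, for instance, be expressed as a membership check. This is a minimal sketch under the assumption that the device keeps, for each divided recording, the set of identifiers of the users who participated in it; the user identifiers, the mapping, and the function name `may_view` are all hypothetical.

```python
def may_view(user_id, participants_by_record, record_id):
    """Allow viewing only when the user participated in the divided recording."""
    return user_id in participants_by_record.get(record_id, set())


# Assumed participation data for the two divided recordings.
participants_by_record = {
    "R5301": {"user_a", "user_b"},
    "R5302": {"user_a"},
}
print(may_view("user_b", participants_by_record, "R5301"))  # True
print(may_view("user_b", participants_by_record, "R5302"))  # False
```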

●画面表示例●
図６６は、第３の実施形態に係る作成者以外の利用者の通信端末に表示される分割された一つ目の音声記録画面の画面表示例である。通信端末３のディスプレイ３１８には、上述した「分割１_ヘルスケア事業業績報告会」のイベントタイトルへの操作が利用者によって実行されることにより、表示制御部３４によって音声記録画面３３６１が表示される。音声記録画面３３６１には、図５８に示した記録閲覧編集画面３３６１と同様に、例えば、「概要(議題)」、「参加者」、「会議メモ」の各入力欄、音声再生表示部及びブックマーク表示欄３６０３が表示される。記録閲覧編集画面３３６１には、更に、少なくとも一以上のテキスト表示欄が表示される。記録閲覧編集画面３３６１では、図５８で示した11:01:30の日時表示部より前の各テキスト、及びそれらのテキストに対応付けられた画面３及び画面４が表示される。なお、記録閲覧編集画面３３６１が表示される通信端末３の利用者は、例えば、音声記録（議事録）の閲覧者である「海老名二郎」である。なお、音声記録画面３３６１に含まれるブックマーク表示欄３６０３には、表示される内容が無いことが示されている。 ●Screen display example●
FIG. 66 is a screen display example of the first divided voice recording screen displayed on the communication terminal of a user other than the creator according to the third embodiment. When the user operates the event title "Division 1_Healthcare Business Performance Reporting Meeting" described above, the display control unit 34 displays a voice recording screen 3361 on the display 318 of the communication terminal 3. On the voice recording screen 3361, as in the record viewing and editing screen 3361 shown in FIG. 58, for example, each input field of "Summary (topic)", "Participants", and "Meeting Notes", a voice playback display section, and a bookmark display section 3603 are displayed. On the record viewing and editing screen 3361, at least one text display section is further displayed. On the record viewing and editing screen 3361, each text before the date and time display section of 11:01:30 shown in FIG. 58, and screens 3 and 4 associated with those texts are displayed. Note that the user of the communication terminal 3 on which the record viewing and editing screen 3361 is displayed is, for example, "Ebina Jiro", who is the viewer of the voice recording (minutes). It should be noted that the bookmark display field 3603 included in the voice recording screen 3361 indicates that there is no content to be displayed.

●画面表示例●
図６７は、第３の実施形態に係る通信端末における記録閲覧編集画面の分割処理後の記録選択時の他の画面表示例である。図６７で表示される表示内容は、図６５で説明した記録選択画面３３４１の表示内容と同様であり、利用者による操作も同様の手順が可能であるため、説明を省略する。図６７では、「分割２_ヘルスケア事業業績報告会」のイベントに対する操作が利用者によって行われる例が示されている。ここで、音声記録管理装置５は、議事録共有ボタン(アイコン)を操作した利用者と分割された音声記録に参加した利用者の識別情報を管理することにより、議事録共有ボタン(アイコン)を操作した利用者が分割された議事録に係るイベントの閲覧を制限するようにしてもよい。具体的には、例えば、議事録共有ボタン(アイコン)を操作した利用者が分割された議事録で示されるイベントに参加していない場合は、音声記録管理装置５は、その利用者に対して分割された議事録の閲覧を禁止するようにしてもよい。なお、本実施形態においては、イベントに係る音声記録の分割は二つに限らず、三つ以上の分割を行うものであってもよい。 ●Screen display example●
FIG. 67 is another example of a screen display when selecting a record after the division process on the record viewing and editing screen in the communication terminal according to the third embodiment. The display contents displayed in FIG. 67 are the same as those displayed on the record selection screen 3341 described in FIG. 65, and the user can perform the same procedure, so the description will be omitted. FIG. 67 shows an example in which the user performs an operation on the event "Division 2_Healthcare Business Performance Reporting Meeting". Here, the voice recording management device 5 may restrict the user who operated the minutes sharing button (icon) from viewing the event related to the divided minutes by managing the identification information of the user who operated the minutes sharing button (icon) and the users who participated in the divided voice recording. Specifically, for example, if the user who operated the minutes sharing button (icon) did not participate in the event shown in the divided minutes, the voice recording management device 5 may prohibit the user from viewing the divided minutes. Note that in this embodiment, the division of the voice recording related to the event is not limited to two, and may be three or more divisions.

●画面表示例●
図６８は、第３の実施形態に係る作成者以外の利用者の通信端末に表示される分割された二つ目の音声記録画面の画面表示例である。通信端末３のディスプレイ３１８には、上述した「分割２_ヘルスケア事業業績報告会」のイベントタイトルへの操作が利用者によって実行されることにより、表示制御部３４によって音声記録画面３３６１が表示される。音声記録画面３３６１には、図５８に示した記録閲覧編集画面３３６１と同様に、例えば、「概要(議題)」、「参加者」、「会議メモ」の各入力欄、音声再生表示部及びブックマーク表示欄３６０３が表示される。記録閲覧編集画面３３６１には、更に、少なくとも一以上のテキスト表示欄が表示される。記録閲覧編集画面３３６１では、図５８で示した11:02:00の日時表示部以降の各テキスト、及びそれらのテキストに対応付けられた画面３及び画面４が表示される。なお、記録閲覧編集画面３３６１が表示される通信端末３の利用者は、例えば、音声記録（議事録）の閲覧者である「海老名二郎」である。なお、図６８に示した音声記録画面３３６１には、ブックマーク表示欄３６０３には、「2021/03/31 11:02:18」に発話された「2020年度下期の売上は○○です。」の内容のテキストがブックマークとして表示されている。これは、ブックマークされた部分は「分割２_ヘルスケア事業業績報告会」のイベントに含まれるため、この画面に反映される。 ●Screen display example●
FIG. 68 is a screen display example of the second divided voice recording screen displayed on the communication terminal of a user other than the creator according to the third embodiment. When the user operates the event title "Division 2_Healthcare Business Performance Reporting Meeting" described above, the display control unit 34 displays a voice recording screen 3361 on the display 318 of the communication terminal 3. On the voice recording screen 3361, as in the record viewing and editing screen 3361 shown in FIG. 58, for example, each input field of "Summary (topic)", "Participants", and "Meeting Notes", a voice playback display section, and a bookmark display section 3603 are displayed. On the record viewing and editing screen 3361, at least one text display section is further displayed. On the record viewing and editing screen 3361, each text after the date and time display section of 11:02:00 shown in FIG. 58, and screens 3 and 4 associated with those texts are displayed. Note that the user of the communication terminal 3 on which the record viewing and editing screen 3361 is displayed is, for example, "Ebina Jiro", who is the viewer of the voice recording (minutes). In addition, in the voice recording screen 3361 shown in Fig. 68, the text "Sales for the second half of fiscal year 2020 are XX" spoken on "2021/03/31 11:02:18" is displayed as a bookmark in the bookmark display field 3603. This is because the bookmarked part is included in the event "Division 2_Healthcare Business Performance Reporting", and is reflected on this screen.

上述したように、第３の実施形態では、編集後画面を分割した結果、ブックマークも分割されることが特徴になっている。更に、通信端末３は、音声記録管理装置５が送信した分割音声データに係る分割音声を、分割画像の表示とあわせて再生させることができる。 As described above, the third embodiment is characterized in that bookmarks are also divided as a result of dividing the screen after editing. Furthermore, the communication terminal 3 can play back the divided audio related to the divided audio data transmitted by the audio recording management device 5 together with the display of the divided image.
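
How the bookmarks follow the division can be sketched as follows, assuming that each divided recording builds its bookmark display field only from its own text records. The function name `bookmark_entries` and the row layout are hypothetical; the times and texts follow FIG. 62(b) and FIG. 68.

```python
from datetime import datetime, timedelta


def bookmark_entries(part_start, rows):
    """List (date and time, text) pairs for the bookmarked rows of one divided recording."""
    entries = []
    for row in rows:
        if row["bookmark"]:
            spoken_at = part_start + timedelta(seconds=row["start"])
            entries.append((spoken_at.strftime("%Y/%m/%d %H:%M:%S"), row["text"]))
    return entries


# Rows of the second divided recording, with times re-based to its own start.
second_rows = [
    {"text_id": "TX3007", "start": 5, "bookmark": False, "text": "まず共有画面をご覧ください。"},
    {"text_id": "TX3008", "start": 18, "bookmark": True, "text": "2020年度下期の売り上げは○○です。"},
    {"text_id": "TX3009", "start": 25, "bookmark": False, "text": "営業利益は△△です。"},
]
print(bookmark_entries(datetime(2021, 3, 31, 11, 2, 0), second_rows))
# [('2021/03/31 11:02:18', '2020年度下期の売り上げは○○です。')]
```

A divided recording with no bookmarked rows yields an empty list, which corresponds to the empty bookmark display field of FIG. 66.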

〔第３の実施形態の主な効果〕
以上説明したように本実施形態によれば、音声記録管理システム２の音声記録管理装置５は、通信端末３が送信した編集要求として、記録閲覧編集画面に表示された所定の領域を分割するための分割操作指示要求を受信し（Ｓ３１２）、分割操作指示要求に応じて、所定のテキストデータを分割処理して得られた分割編集後テキストデータと、所定の画像データを分割処理して得られた分割編集後画像データと、を含む分割編集後画面データを、通信端末３に対して送信する（Ｓ１０５）。その後、通信端末３は、音声記録管理装置５が送信した分割編集後画面データに係る分割編集後画面（記録閲覧編集画面）を通信端末３のディスプレイ３１８に表示させる（Ｓ１０６）。これにより、第１の実施形態の効果に加えて、作成した会議議事録等が表示される音声記録画面に対して会議等の参加者に無関係な画面を分割編集し、必要な情報だけをその情報を必要とする参加者に閲覧させることが可能になる。その結果、音声記録画面に表示される表示画面データ、音声再生される音声データ等の秘匿性を向上させることが可能になるという効果を奏する。 [Major Effects of the Third Embodiment]
As described above, according to this embodiment, the voice recording management device 5 of the voice recording management system 2 receives, as an editing request transmitted from the communication terminal 3, a split operation instruction request for splitting a predetermined area displayed on the record viewing and editing screen (S312). In response to that request, it transmits to the communication terminal 3 split edited screen data that includes split edited text data obtained by splitting the predetermined text data and split edited image data obtained by splitting the predetermined image data (S105). The communication terminal 3 then displays the split edited screen (record viewing and editing screen) for that data on its display 318 (S106). In addition to the effects of the first embodiment, this makes it possible to split off the portions of the voice recording screen, on which the created meeting minutes and the like are displayed, that are irrelevant to particular participants, so that each participant views only the information they need. As a result, the confidentiality of the screen data displayed on the voice recording screen and of the audio data played back can be improved.

更に、本実施形態によれば、音声記録管理装置５は、各分割編集後画面データに含まれる所定のブックマーク情報を含めた各分割編集後画面データを送信し、通信端末３は、分割された各分割編集後画面データに含まれるテキスト及び画面に対応付けられたブックマークをブックマーク表示欄３６０３に表示させる。これにより、第１の実施形態の効果に加えて、ブックマーク処理したテキストについても、分割された画面に対応させて表示又は非表示させることが可能になる。その結果、利用者に対して音声記録画面における編集の利便性をさらに向上させることが可能になるという効果も期待できる。 Furthermore, according to this embodiment, the voice recording management device 5 transmits each piece of split edited screen data together with the predetermined bookmark information it contains, and the communication terminal 3 displays, in the bookmark display field 3603, the bookmarks associated with the text and screens contained in each piece of split edited screen data. In addition to the effects of the first embodiment, this makes it possible to display or hide bookmarked text in accordance with the split screens. As a result, editing on the voice recording screen can be expected to become even more convenient for the user.

〔第３の実施形態の変形例〕
次に、図６９乃至図７１を用いて、第３の実施形態の変形例について説明する。第３の実施形態では、分割時刻を基準に音声データを分割していたが、分割日時（時刻）の直前の発話が分割日時（時刻）を跨いでいると、発話の途中で発話に伴う音声データが分割されてしまうことになる。一方テキストデータは、開始日時（時刻）を基準に分割されるため、音声データとテキストデータとの整合が取れなくなってしまう。 [Modification of the third embodiment]
Next, a modified example of the third embodiment will be described with reference to Fig. 69 to Fig. 71. In the third embodiment, the audio data is divided at the division time, so if the utterance immediately before the division date and time (time) straddles it, the audio data for that utterance is cut in the middle of the utterance. The text data, on the other hand, is divided based on the start date and time (time) of each utterance, so the audio data and the text data no longer correspond to each other.

そこで、第３の実施形態の変形例では、記録閲覧編集画面における切れ目のテキスト（音声）の終了日時（終了時刻）が分割日時（分割時刻）より後である場合に、分割日時（分割時刻）を当該テキスト（音声）の終了日時（終了時刻）に変更させて、変更後の終了日時（終了時刻）で、テキストデータ、画像データ及び音声データを分割する。このようにすることにより、記録閲覧編集画面の分割処理においても音声データを途切れさせることなく、適切に分割することができるようにする。なお、第３の実施形態の変形例においても、第３の実施形態と同様に、第１の実施形態で利用されたシステム構成、ハードウエア構成及び各ハードウエア資源、並びに、各ハードウエア資源を利用した各機能構成によって実現される。 Therefore, in a modified example of the third embodiment, if the end date and time (end time) of the text (audio) at the break on the record viewing and editing screen is later than the division date and time (division time), the division date and time (division time) is changed to the end date and time (end time) of the text (audio), and the text data, image data, and audio data are divided at the changed end date and time (end time). In this way, audio data can be appropriately divided without being interrupted even during the division process on the record viewing and editing screen. Note that, like the third embodiment, the modified example of the third embodiment is realized by the system configuration, hardware configuration, and each hardware resource used in the first embodiment, and each functional configuration using each hardware resource.

<<記録閲覧編集画面の分割処理の詳細>>
ここで、ステップＳ３２１の記録閲覧編集画面の分割処理の変形例における詳細について説明する。図６９は、第３の実施形態の変形例に係る記録閲覧編集画面の分割処理の一例を示すフローチャートである。まず、音声記録管理装置５の送受信部５１は、音声記録の記録識別情報、分割時刻及び分割要求を受信する（ステップＳ３２１－２１）。なお、ステップＳ３２１－２１の処理は、上述したステップＳ３１２の処理に相当し、分割要求にはテキスト識別情報、画像識別情報、分割操作ボタン情報が含まれる。 <<Details on splitting the record viewing and editing screen>>
Here, the details of the modified example of the division process of the record viewing and editing screen in step S321 will be described. Fig. 69 is a flowchart showing an example of the division process of the record viewing and editing screen according to the modified example of the third embodiment. First, the transmission/reception unit 51 of the voice recording management device 5 receives the recording identification information of the voice recording, the division time, and a division request (step S321-21). Note that the process of step S321-21 corresponds to the process of step S312 described above, and the division request includes text identification information, image identification information, and division operation button information.

Next, the acquisition unit 52 identifies, in the text information management table (text information management DB 5303 (see Fig. 54)), the table corresponding to the record identification information, and acquires the end time of the last record among the records whose start time is before the division time (step S321-22). Specifically, the acquisition unit 52 refers to the records for record identification information "R5006" in the text information management DB 5303 (see Fig. 54) and acquires the end time of the last record whose start time is before the division time. That is, the acquisition unit 52 acquires the end time "2 minutes 02 seconds" of the record "Today's agenda is the performance of the healthcare business.", whose utterance started at "1 minute 59 seconds".

Next, the judgment unit 55 judges whether the acquired end time is later than the division time (step S321-23). If the acquired end time is later than the division time (step S321-23: YES), the calculation identification unit 53 sets the acquired end time as the division time (step S321-24), and the process proceeds to step S321-25. Specifically, the calculation identification unit 53 sets the "2 minutes 02 seconds" acquired in step S321-22 as the division time.

On the other hand, if the acquired end time is not later than the division time (step S321-23: NO), the process proceeds directly to step S321-25.
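Steps S321-22 to S321-24 can be sketched as follows. This is only a minimal illustration under assumptions: the `TextRecord` class, the function name `adjust_division_time`, and the use of whole seconds from the start of the recording are hypothetical and are not taken from the embodiment. The record values follow the example above (an utterance from 1 minute 59 seconds to 2 minutes 02 seconds, with division requested at 2 minutes 00 seconds).

```python
from dataclasses import dataclass


@dataclass
class TextRecord:
    text_id: str
    start: int  # utterance start, in seconds from the start of the recording
    end: int    # utterance end, in seconds from the start of the recording


def adjust_division_time(records, division_time):
    """Return the division time after steps S321-22 to S321-24 (illustrative)."""
    # S321-22: among the records that start before the division time, take the last one.
    earlier = [r for r in records if r.start < division_time]
    if not earlier:
        return division_time
    last = max(earlier, key=lambda r: r.start)
    # S321-23 / S321-24: if that utterance ends after the division time,
    # move the division time to the end of the utterance.
    if last.end > division_time:
        return last.end
    return division_time


# The utterance runs from 1:59 (119 s) to 2:02 (122 s); division is requested at 2:00 (120 s).
records = [TextRecord("TX3106", 119, 122), TextRecord("TX3107", 125, 132)]
adjusted = adjust_division_time(records, 120)
```

With these values the division time moves from 120 seconds to 122 seconds, which corresponds to the 2-second offset described in the text.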

In step S321-25, the calculation identification unit 53 identifies, in the text information management table (text information management DB 5303 (see Fig. 54)), the table corresponding to the record identification information, and identifies the records whose start time is before the division time and the records whose start time is at or after the division time. This process is the same as step S321-2 in the flowchart of Fig. 61, so a detailed description is omitted. As described above, in the text information management DB 5303 shown in Fig. 54, the utterance "Today's agenda is the performance of the healthcare business." ends at "2 minutes 02 seconds", which is 2 seconds later than the division time "11:02:00". Accordingly, the end time "2 minutes 02 seconds" acquired in step S321-22 is adopted as the division time in step S321-24. In other words, the subsequent processes include an offset (guaranteed time) of 2 seconds, running from the originally requested division time to the end of that utterance's voice data.

Next, the setting registration unit 58 updates and registers the text information management tables based on the two sets of identified records (step S321-26). This process is the same as step S321-3 in the flowchart of Fig. 61, so a detailed description is omitted.

Fig. 70 is a conceptual diagram showing an example of the text information management tables after the division process of the record viewing and editing screen according to the modified example of the third embodiment: (a) shows the text information management table for the first divided voice recording screen, and (b) shows the text information management table for the second divided voice recording screen. Fig. 70(a) shows the text information management table associated with the tab of the newly set post-division record identification information "R5311". Specifically, the calculation identification unit 53 adds "2 seconds", the offset provided in step S321-24, to the end time corresponding to text identification information "TX3106", and the setting registration unit 58 accordingly updates that end time to "2 minutes 02 seconds". This "2 minutes 02 seconds" is the division time shown in the date and time display section of Fig. 58 ("11:02:00") plus the 2-second offset. As a result, the voice for "Today's agenda is the performance of the healthcare business." is guaranteed to play to the end without being cut off, even when the division process is executed on the record viewing and editing screen. Note that every bookmark item corresponding to the text identification information in this table is set to "False", which indicates that no bookmark processing was performed on the utterances managed in this table.

Fig. 70(b) shows the text information management table associated with the tab of the newly set post-division record identification information "R5312". Specifically, the setting registration unit 58 updates and registers the start and end times corresponding to text identification information "TX3107", "TX3108", "TX3109", and so on. More specifically, the start time corresponding to "TX3107" becomes "0 minutes 03 seconds" and the end time becomes "0 minutes 10 seconds". These are the new times at which the utterance "Please look at the shared screen first." starts and ends. In other words, although this utterance actually started at "2 minutes 05 seconds", the 2-second offset that guarantees the preceding utterance has already elapsed at the time of the division process, so the voice recording management device 5 sets "2 minutes 03 seconds", obtained by subtracting 2 seconds from "2 minutes 05 seconds", as the new utterance start time. Likewise, it sets "2 minutes 10 seconds", obtained by subtracting 2 seconds from "2 minutes 12 seconds", as the new utterance end time. However, in the text information management table managed under the tab of record identification information "R5302" in the text information management DB 5303, no utterance precedes this one, so "0 minutes 03 seconds" is set as the new utterance start time and "0 minutes 10 seconds" as the new utterance end time.

Similarly, the start time corresponding to text identification information "TX3108" becomes "0 minutes 16 seconds" and the end time becomes "0 minutes 21 seconds". These are the new times at which the utterance "Sales for the second half of fiscal year 2020 are ○○." starts and ends. The start time corresponding to text identification information "TX3109" becomes "0 minutes 23 seconds" and the end time becomes "0 minutes 26 seconds". These are the new times at which the utterance "Operating profit is △△." starts and ends. Among the bookmark items corresponding to the text identification information, the bookmark for "TX3108" is set to "True". This indicates that, among the utterances managed in this table, bookmark processing was performed on the text corresponding to text identification information "TX3108".
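The way the two text information management tables of Fig. 70 follow from the adjusted division time can be sketched as below. This is an illustration under assumptions, not the registered processing itself: the tuple layout and the function name `divide_text_records` are hypothetical, and the original times for "TX3108" and "TX3109" are not stated in the text, so they are back-computed here from the post-division values on the assumption that every record is shifted by the same amount.

```python
def divide_text_records(records, division_time):
    """Split (text_id, start, end) tuples at division_time; times are in seconds."""
    # Records that start before the division time stay in the first table as they are.
    first = [r for r in records if r[1] < division_time]
    # Records in the second table are re-expressed relative to the division time.
    second = [
        (text_id, start - division_time, end - division_time)
        for text_id, start, end in records
        if start >= division_time
    ]
    return first, second


records = [
    ("TX3106", 119, 122),  # 1:59 to 2:02
    ("TX3107", 125, 132),  # 2:05 to 2:12
    ("TX3108", 138, 143),  # back-computed, not stated in the text
    ("TX3109", 145, 148),  # back-computed, not stated in the text
]
first, second = divide_text_records(records, 122)  # adjusted division time 2:02
```

Dividing at 122 seconds gives "TX3107" the new times 0:03 and 0:10, "TX3108" 0:16 and 0:21, and "TX3109" 0:23 and 0:26, matching the values described for Fig. 70(b).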

Returning to Fig. 69, the calculation identification unit 53 identifies, in the capture image management table (capture image management DB 5304 (see Fig. 55)), the table corresponding to the record identification information, and identifies the records whose acquisition time is before the division time and the records whose acquisition time is at or after the division time (step S321-27). This process is the same as step S321-4 in the flowchart of Fig. 61, so a detailed description is omitted.

Next, the setting registration unit 58 updates and registers the capture image management tables based on the two sets of identified records (step S321-28). Specifically, based on the records before the division time and the records at or after the division time identified in step S321-27, the setting registration unit 58 updates and registers the two capture image management tables (capture image management DB 5304) associated with the tabs of the newly set record identification information for the portion before the division and for the portion after the division.

Next, the calculation identification unit 53 identifies, in the record bibliographic information management table (record bibliographic information management DB 5302 (see Fig. 53)), the record bibliographic information corresponding to the record identification information (step S321-29). Specifically, because the date and time display section indicating the division time showed "11:02:00", the calculation identification unit 53 identifies the record bibliographic information tab whose record bibliographic information covers this time.

Next, the acquisition unit 52 acquires the voice data from the voice data path (step S321-30). Specifically, because the date and time display section indicating the division time showed "11:02:00", the acquisition unit 52 acquires the voice data path managed under the record bibliographic information tab whose record bibliographic information covers this time. In this case, the acquired voice data path is, for example, ".../00005006/record.mp3".

Next, the generation/processing unit 57 generates partial voice data for the portion before the division time and partial voice data for the portion at or after the division time, and generates an event URL for each (step S321-31).

Next, the setting registration unit 58 updates and registers the record bibliographic information management table based on the voice data paths and the event URLs, and the flow ends (step S321-32).
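The generation of the two pieces of partial voice data in step S321-31 can be pictured with the following sketch. The embodiment stores the voice data as an MP3 file and does not state how the cut is made, so this example assumes, purely for illustration, that the voice data has already been decoded into a list of samples; the function name `split_samples` and its parameters are hypothetical.

```python
def split_samples(samples, sample_rate, division_seconds):
    """Split decoded samples into the parts before and from the division time."""
    # Convert the division time into a sample index and keep it inside the data.
    cut = int(round(division_seconds * sample_rate))
    cut = max(0, min(cut, len(samples)))
    return samples[:cut], samples[cut:]


# Toy data: 10 samples at 2 samples per second, divided at 3 seconds.
before, after = split_samples(list(range(10)), 2, 3)
```

Because the division time has already been moved to the end of the straddling utterance, a cut made this way falls between utterances rather than inside one.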

Fig. 71A is a conceptual diagram showing an example of the record bibliographic information management table after the division process of the record viewing and editing screen according to the third embodiment, namely the table for the first divided voice recording screen. Fig. 71A shows the record bibliographic information management table associated with the tab of the newly set post-division record identification information "R5301". Specifically, the setting registration unit 58 updates the record name to "Division 1_Healthcare Business Performance Reporting Meeting", the end date and time to "2021/03/31 11:02:02", the voice data path to ".../00005301/record.mp3", and the event URL to "https://.../00005301".

Fig. 71B is a conceptual diagram showing an example of the record bibliographic information management table after the division process of the record viewing and editing screen according to the third embodiment, namely the table for the second divided voice recording screen. Fig. 71B shows the record bibliographic information management table associated with the tab of the newly set post-division record identification information "R5302". Specifically, the setting registration unit 58 updates the record name to "Division 2_Healthcare Business Performance Reporting Meeting", the start date and time to "2021/03/31 11:02:02", the voice data path to ".../00005302/record.mp3", and the event URL to "https://.../00005302".
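The date and time values in Figs. 71A and 71B can be checked with a short sketch. The following is an assumption-laden illustration: the function name `divide_bibliography` and the dictionary keys are hypothetical, the recording start of 11:00:00 is inferred from the statement that "2 minutes 02 seconds" is 2 seconds after "11:02:00", and the end of the original recording is not given in the text, so a placeholder value is used. Voice data paths and event URLs are left out because the text elides them.

```python
from datetime import datetime, timedelta


def divide_bibliography(name, start, end, division_offset_s):
    """Return the name and date-time fields of the two divided records."""
    boundary = start + timedelta(seconds=division_offset_s)
    first = {"name": f"Division 1_{name}", "start": start, "end": boundary}
    second = {"name": f"Division 2_{name}", "start": boundary, "end": end}
    return first, second


first, second = divide_bibliography(
    "Healthcare Business Performance Reporting Meeting",
    datetime(2021, 3, 31, 11, 0, 0),   # inferred start of the recording
    datetime(2021, 3, 31, 11, 30, 0),  # placeholder end, not stated in the text
    122,                               # adjusted division time, 2 minutes 02 seconds
)
```

Both the end of the first record and the start of the second record come out as 2021/03/31 11:02:02, the value shown for the two tables.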

Note that the divided voice recording screens, which include the records identified and updated in step S321 described above, are displayed on the communication terminal 3 when the user enters the event URL indicating the desired divided event and the passcode (that is, when the process of step S101 is executed).

[Main effects of the modified example of the third embodiment]
As described above, according to this embodiment, the voice recording management device 5 identifies the table corresponding to the record identification information in the text information management table, acquires the end time of the last record among the records whose start time is before the division time (step S321-22), and, if the acquired end time is later than the division time, sets the acquired end time as the division time (step S321-24). This has the effect that the voice is guaranteed to play to the end without being cut off partway, even when the division process is executed on the record viewing and editing screen.

[Supplementary description of the embodiments]
Each function of the embodiments described above can be realized by one or more processing circuits. Here, "processing circuit" in this specification includes devices programmed by software to execute each function, such as a processor implemented by an electronic circuit. Such devices include, for example, a processor, as well as an ASIC (Application Specific Integrated Circuit), a DSP (Digital Signal Processor), an FPGA (Field Programmable Gate Array), an SOC (System On a Chip), a GPU (Graphics Processing Unit), and conventional circuit modules designed to execute each function described above.

Furthermore, the various texts and text information obtained in the embodiments described above may be obtained through the learning effect of machine learning using artificial intelligence (AI). In this case, the voice recognition server may use machine learning to obtain the various texts and text information from the voice information, or a database or the like separate from the voice recognition server may do so. Here, machine learning is a technology for giving a computer a human-like ability to learn: the computer autonomously creates, from training data loaded in advance, the algorithms needed for judgments such as data identification, and applies them to new data to make predictions. The learning method may be any of supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, and deep learning, or a combination of these; the learning method is not limited.

The voice recording management system, voice recording management device, voice recording management method, and program according to one embodiment of the present invention have been described above. However, the present invention is not limited to the embodiments described above and can be modified within the scope that a person skilled in the art could conceive, such as by adding, changing, or deleting embodiments. Any such aspect is included in the scope of the present invention as long as it achieves the functions and effects of the present invention.

■ Summary ■
Aspects of the present invention are, for example, as follows.

<First aspect>
A voice recording management system as a first aspect (e.g., voice recording management system 2; omitted hereafter) includes a voice recording management device (e.g., voice recording management device 5; omitted hereafter) that manages voice recording information obtained based on voice information, and one or more communication terminals (e.g., communication terminal 3; omitted hereafter) capable of displaying the voice recording information by communicating with the voice recording management device. The voice recording management device has: an acquisition means (e.g., acquisition unit 52) that acquires voice data representing the voice information transmitted by a first communication terminal among the one or more communication terminals, and screen data representing a screen displayed on the first communication terminal; and a transmission means (e.g., transmission/reception unit 51; omitted hereafter) that transmits, to the one or more communication terminals including the first communication terminal, predetermined text data representing predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included in the screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text. In response to an editing request that is transmitted by the first communication terminal and is directed at the predetermined text or the predetermined image, the transmission means transmits, to a second communication terminal different from the first communication terminal, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, together with edited voice data obtained by editing the predetermined voice data. The second communication terminal has a display control means (e.g., display control unit 34) that displays, on a display means, an edited screen related to the edited screen data transmitted by the voice recording management device, and a voice playback means (e.g., voice playback unit 36) that plays back edited voice related to the edited voice data transmitted by the voice recording management device.

According to the first aspect, when editing a voice recording generated based on voice information, it is sufficient to edit the text data included in the voice recording or the image data corresponding to that text data, which makes it possible to improve convenience in editing voice recordings.

<Second aspect>
In a voice recording management system as a second aspect, in the first aspect, the voice recording management device further has a reception means (e.g., transmission/reception unit 51; omitted hereafter) that receives, as the editing request, either a text hide request for hiding the predetermined text or an image hide request for hiding the predetermined image. When the received editing request is the text hide request, the transmission means transmits, to the second communication terminal, first hidden screen data including first hidden text data obtained by hiding the predetermined text data as the editing process and first hidden image data obtained by hiding the predetermined image data, together with first muted voice data obtained by muting the predetermined voice data as the editing process. When the received editing request is the image hide request, the transmission means transmits, to the second communication terminal, second hidden screen data including second hidden image data obtained by hiding the predetermined image data as the editing process and second hidden text data obtained by hiding one or more texts associated with the predetermined image, together with second muted voice data obtained by muting, as the editing process, one or more pieces of predetermined voice data associated with the one or more texts.

According to the second aspect, as with the first aspect, it is possible to improve convenience in editing voice recordings.
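The relationship between the two hide requests and the muted voice data in the second aspect can be sketched as follows. This is a minimal illustration under assumptions: the dictionary layout of `texts`, the function name `apply_hide_request`, the request kinds `"text"` and `"image"`, and the representation of voice data as a list of samples are all hypothetical and are not taken from the embodiments.

```python
def apply_hide_request(texts, samples, sample_rate, kind, target_id):
    """Return (hidden_text_ids, hidden_image_ids, muted_samples).

    texts: list of dicts with keys "id", "image_id", "start", "end" (seconds).
    kind:  "text" for a text hide request, "image" for an image hide request.
    """
    if kind == "text":
        # Text hide request: hide the text and the image associated with it.
        targets = [t for t in texts if t["id"] == target_id]
        hidden_image_ids = {t["image_id"] for t in targets}
    elif kind == "image":
        # Image hide request: hide the image and every text associated with it.
        targets = [t for t in texts if t["image_id"] == target_id]
        hidden_image_ids = {target_id}
    else:
        raise ValueError(f"unknown request kind: {kind}")

    hidden_text_ids = {t["id"] for t in targets}

    # Mute the voice interval of every hidden text.
    muted = list(samples)
    for t in targets:
        lo = max(0, int(t["start"] * sample_rate))
        hi = min(len(muted), int(t["end"] * sample_rate))
        muted[lo:hi] = [0] * (hi - lo)
    return hidden_text_ids, hidden_image_ids, muted


texts = [
    {"id": "T1", "image_id": "I1", "start": 0, "end": 2},
    {"id": "T2", "image_id": "I1", "start": 3, "end": 5},
    {"id": "T3", "image_id": "I2", "start": 6, "end": 8},
]
samples = [1] * 10  # toy voice data at 1 sample per second
by_text = apply_hide_request(texts, samples, 1, "text", "T2")
by_image = apply_hide_request(texts, samples, 1, "image", "I1")
```

In this sketch a text hide request mutes only the interval of the selected text, whereas an image hide request mutes the intervals of all texts associated with the image.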

<Third aspect>
In a voice recording management system as a third aspect, in the second aspect, the display control means displays a text hide operation section, which is operated to generate the text hide request, near the predetermined text displayed on the screen, and displays an image hide operation section, which is operated to generate the image hide request, near the predetermined image displayed on the screen.

According to the third aspect, in addition to the improved convenience in editing voice recordings according to the second aspect, it is possible to improve operability in editing voice recordings.

<Fourth aspect>
In a voice recording management system as a fourth aspect, in the second or third aspect, the voice recording management device further has a generation means that generates the first hidden screen data and the first muted voice data in response to the text hide request, and generates the second hidden screen data and the second muted voice data in response to the image hide request.

According to the fourth aspect, in addition to the improved convenience and operability in editing voice recordings according to the second or third aspect, it is possible to improve confidentiality during voice playback.

<Fifth aspect>
In a voice recording management system as a fifth aspect, in any of the second to fourth aspects, the first communication terminal further has an acquisition means that acquires screen data related to the screen at predetermined time intervals. When specific text data obtained based on the voice data exists straddling the acquisition time at which the screen data was acquired by the acquisition means, and the image hide request for the predetermined image acquired at that acquisition time is received, the display control means displays, on the display means, a third hidden screen related to third hidden screen data in which the specific text data is hidden in addition to the second hidden text data, and the voice playback means plays back voice in which the voice data indicated by each text of the second hidden text data and the specific text data has been muted.

According to the fifth aspect, in addition to the improved convenience and operability in editing voice recordings according to the second to fourth aspects, it is possible to further improve confidentiality during voice playback.
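The selection of texts to hide in the fifth aspect can be sketched as below. This is an illustration under assumptions: the dictionary layout, the function name `texts_to_hide`, and the rule that a text "straddles" the acquisition time when it starts before and ends after that time are hypothetical readings of the aspect, not details confirmed by the embodiments.

```python
def texts_to_hide(texts, image_id, acquired_at):
    """Return the ids of texts hidden by an image hide request (illustrative).

    texts: list of dicts with keys "id", "image_id", "start", "end" (seconds).
    acquired_at: time at which the hidden image was captured, in seconds.
    """
    # Texts associated with the hidden image (second hidden text data).
    associated = {t["id"] for t in texts if t["image_id"] == image_id}
    # Texts whose utterance straddles the capture time (specific text data).
    straddling = {t["id"] for t in texts if t["start"] < acquired_at < t["end"]}
    return associated | straddling


texts = [
    {"id": "T1", "image_id": "I1", "start": 2, "end": 5},
    {"id": "T2", "image_id": "I1", "start": 8, "end": 12},  # straddles t = 10
    {"id": "T3", "image_id": "I2", "start": 13, "end": 15},
]
hidden = texts_to_hide(texts, "I2", 10)  # image "I2" was captured at 10 s
```

Here "T2" is associated with the earlier image but is still hidden, because part of its utterance was spoken while the hidden image was on the screen.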

<Sixth aspect>
In a voice recording management system as a sixth aspect, in the fifth aspect, when an image display request to display the predetermined image again is received after the image hide request for the predetermined image has been received, the display control means displays a screen in which the specific text of the specific text data is kept hidden, and the voice playback means plays back voice in which the specific voice of the specific voice data indicated by the specific text is kept muted.

According to the sixth aspect, it is possible to further improve the confidentiality during voice playback according to the fifth aspect.

<Seventh aspect>
In a voice recording management system as a seventh aspect, in any of the second to sixth aspects, the text hide request and the image hide request are each a request either to delete the predetermined text data and the predetermined image data or to make them private. The voice recording management device further has a processing means (e.g., generation/processing unit 57) that, for a request to delete, deletes the predetermined text data and the predetermined image data, and, for a request to make private, hides the predetermined text data and the predetermined image data on the first hidden screen represented by the first hidden screen data or on the second hidden screen represented by the second hidden screen data, without deleting them.

According to the seventh aspect, it is possible to further improve the operability and confidentiality in editing voice recordings according to the second to sixth aspects.
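The difference between the two kinds of request in the seventh aspect can be sketched as follows. This is only an illustration under assumptions: the in-memory `store` dictionary, the `hidden` flag, and the function names are hypothetical stand-ins for the management tables of the embodiments.

```python
def process_hide_request(store, item_id, mode):
    """Apply a delete or make-private request to one stored item (illustrative)."""
    if mode == "delete":
        # Delete request: the data itself is removed.
        store.pop(item_id, None)
    elif mode == "private":
        # Make-private request: the data is kept but marked as hidden.
        store[item_id]["hidden"] = True
    else:
        raise ValueError(f"unknown mode: {mode}")


def visible_items(store):
    """Return the ids of items that would be shown on the screen."""
    return [item_id for item_id, item in store.items() if not item["hidden"]]


store = {
    "TX3107": {"text": "Please look at the shared screen first.", "hidden": False},
    "TX3108": {"text": "Sales for the second half of fiscal year 2020 are ○○.", "hidden": False},
    "TX3109": {"text": "Operating profit is △△.", "hidden": False},
}
process_hide_request(store, "TX3108", "private")
process_hide_request(store, "TX3109", "delete")
```

After these two requests only "TX3107" is visible; "TX3108" remains in the store and could be shown again later, while "TX3109" no longer exists.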

＜第８態様＞
第８態様としての音声記録管理システムは、第１態様乃至第７態様において、前記表示制御手段が、前記画面に表示された複数のテキストを含む一のテキストグループとして選択された場合に、前記テキストグループを表す要約を入力させるための要約入力操作部（例えば、要約入力ダイアログ３１８１。以下省略）を前記一のテキストグループの近傍に表示する。 <Eighth aspect>
In an eighth aspect, in the audio recording management system according to any of the first to seventh aspects, when a plurality of texts displayed on the screen are selected as one text group, the display control means displays, near that text group, a summary input operation section (e.g., summary input dialog 3181; examples omitted below) for inputting a summary representing the text group.

第８態様によれば、音声記録の編集時の操作性と視認性をさらに向上させることが可能になる。 According to the eighth aspect, it is possible to further improve operability and visibility when editing audio recordings.

＜第９態様＞
第９態様としての音声記録管理システムは、第８態様において、前記表示制御手段が、前記要約入力操作部とあわせて、前記テキスト非表示操作部（例えば、「非公開」ボタン(アイコン)３５４２、「削除」ボタン(アイコン)３５４４）を前記一のテキストグループの近傍に表示する。 <Ninth aspect>
A ninth aspect of the audio recording management system is the eighth aspect, in which the display control means displays the text hiding operation section (e.g., a "Private" button (icon) 3542, a "Delete" button (icon) 3544) in the vicinity of the one text group together with the summary input operation section.

第９態様によれば、第８態様に係る音声記録の編集における操作性と視認性をさらに向上させることが可能になる。 According to the ninth aspect, it is possible to further improve the operability and visibility when editing the audio recording according to the eighth aspect.

＜第１０態様＞
第１０態様としての音声記録管理システムは、第１態様乃至第９態様において、前記テキスト非表示要求又は前記画像非表示要求により非表示処理される前記第１の非表示テキストデータ、前記第２の非表示テキストデータ、前記第１の非表示画像データ、及び前記第２の非表示画像データはそれぞれ、所定の事業の業績情報、売上情報、利益情報及び個人情報を含む秘匿データである。 <Tenth aspect>
In the audio recording management system of the 10th aspect, in the 1st to 9th aspects, the first hidden text data, the second hidden text data, the first hidden image data, and the second hidden image data which are processed to be hidden by the text non-display request or the image non-display request are each confidential data including performance information, sales information, profit information, and personal information of a specified business.

第１０態様によれば、第１態様乃至第９態様に係る音声記録の編集時における秘匿データの種別を管理することが可能になる。 According to the tenth aspect, it becomes possible to manage the type of confidential data when editing the audio recording according to the first to ninth aspects.

＜第１１態様＞
第１１態様としての音声情報に基づいて得られた音声記録情報を管理する音声記録管理装置は、前記音声記録管理装置と通信することで前記音声記録情報を表示可能な一以上の通信端末のうち、第１の通信端末が送信した前記音声情報を表す音声データ、及び前記第１の通信端末に表示された画面を表す画面データを取得する取得手段と、取得された前記音声データに基づいて得られた所定のテキストを表す所定のテキストデータと、取得された前記画面データに係る前記画面に含まれる画像であり、前記所定のテキストに対応付けられた所定の画像を表す所定の画像データと、前記所定のテキストで示される所定の音声データとを、前記第１の通信端末を含む前記一以上の通信端末に送信する送信手段と、を有し、前記送信手段は、前記第１の通信端末が送信した編集要求であり、前記所定のテキスト又は前記所定の画像に対する編集要求に応じて、前記所定のテキストデータを編集処理した編集後テキストデータと前記所定の画像データを編集処理した編集後画像データとを含む編集後画面データと、前記所定の音声データを編集処理した編集後音声データとを、前記第１の通信端末と異なる第２の通信端末に対して送信する。 <Eleventh aspect>
An eleventh aspect of the present invention is an audio recording management device that manages audio recording information obtained based on audio information, the audio recording management device having an acquisition means for acquiring audio data representing the audio information transmitted by a first communication terminal among one or more communication terminals capable of displaying the audio recording information by communicating with the audio recording management device, and screen data representing a screen displayed on the first communication terminal, and a transmission means for transmitting predetermined text data representing a predetermined text obtained based on the acquired audio data, predetermined image data representing a predetermined image that is included on the screen related to the acquired screen data and corresponds to the predetermined text, and predetermined audio data indicated by the predetermined text to the one or more communication terminals including the first communication terminal, wherein the transmission means transmits edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited audio data obtained by editing the predetermined audio data to a second communication terminal different from the first communication terminal, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image.

第１１態様によれば、音声情報に基づいて生成された音声記録を編集する場合、音声記録に含まれるテキストデータ又はそのテキストデータに対応する画像データを編集すればよいので、音声記録の編集における利便性を向上させることが可能になる。 According to the eleventh aspect, when editing a voice recording generated based on voice information, it is only necessary to edit the text data included in the voice recording or the image data corresponding to the text data, thereby improving the convenience of editing the voice recording.

＜第１２態様＞
第１２態様としての音声情報に基づいて得られた音声記録情報を管理する音声記録管理装置が実行する音声記録管理方法は、前記音声記録管理装置と通信することで前記音声記録情報を表示可能な一以上の通信端末のうち、第１の通信端末が送信した前記音声情報を表す音声データ、及び前記第１の通信端末に表示された画面を表す画面データを取得する取得ステップと、取得された前記音声データに基づいて得られた所定のテキストを表す所定のテキストデータと、取得された前記画面データに係る前記画面に含まれる画像であり、前記所定のテキストに対応付けられた所定の画像を表す所定の画像データと、前記所定のテキストで示される所定の音声データとを、前記第１の通信端末を含む前記一以上の通信端末に送信する送信ステップと、を含み、前記送信ステップは、前記第１の通信端末が送信した編集要求であり、前記所定のテキスト又は前記所定の画像に対する編集要求に応じて、前記所定のテキストデータを編集処理した編集後テキストデータと前記所定の画像データを編集処理した編集後画像データとを含む編集後画面データと、前記所定の音声データを編集処理した編集後音声データとを、前記第１の通信端末と異なる第２の通信端末に対して送信する。 <Twelfth aspect>
In a twelfth aspect, a voice recording management method executed by a voice recording management device that manages voice recording information obtained based on voice information includes: an acquisition step of acquiring voice data representing the voice information transmitted by a first communication terminal among one or more communication terminals capable of displaying the voice recording information by communicating with the voice recording management device, and screen data representing a screen displayed on the first communication terminal; and a transmission step of transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included on the screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text, wherein, in the transmission step, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data, are transmitted to a second communication terminal different from the first communication terminal.

第１２態様によれば、音声情報に基づいて生成された音声記録を編集する場合、音声記録に含まれるテキストデータ又はそのテキストデータに対応する画像データを編集すればよいので、音声記録の編集における利便性を向上させることが可能になる。 According to the twelfth aspect, when editing a voice recording generated based on voice information, it is only necessary to edit the text data included in the voice recording or the image data corresponding to the text data, thereby improving the convenience of editing the voice recording.

＜第１３態様＞
第１３態様としての音声情報に基づいて得られた音声記録情報を管理する音声記録管理装置に以下の処理を実行させるプログラムは、前記音声記録管理装置と通信することで前記音声記録情報を表示可能な一以上の通信端末のうち、第１の通信端末が送信した前記音声情報を表す音声データ、及び前記第１の通信端末に表示された画面を表す画面データを取得する取得ステップと、取得された前記音声データに基づいて得られた所定のテキストを表す所定のテキストデータと、取得された前記画面データに係る前記画面に含まれる画像であり、前記所定のテキストに対応付けられた所定の画像を表す所定の画像データと、前記所定のテキストで示される所定の音声データとを、前記第１の通信端末を含む前記一以上の通信端末に送信する送信ステップと、を含み、前記送信ステップとして、前記第１の通信端末が送信した編集要求であり、前記所定のテキスト又は前記所定の画像に対する編集要求に応じて、前記所定のテキストデータを編集処理した編集後テキストデータと前記所定の画像データを編集処理した編集後画像データとを含む編集後画面データと、前記所定の音声データを編集処理した編集後音声データとを、前記第１の通信端末と異なる第２の通信端末に対して送信する、処理を実行させる。 <Thirteenth aspect>
In a thirteenth aspect, a program causes a voice recording management device that manages voice recording information obtained based on voice information to execute a process including: an acquisition step of acquiring voice data representing the voice information transmitted by a first communication terminal among one or more communication terminals capable of displaying the voice recording information by communicating with the voice recording management device, and screen data representing a screen displayed on the first communication terminal; and a transmission step of transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included on the screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text, wherein, as the transmission step, the program causes the device to transmit, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data, to a second communication terminal different from the first communication terminal.

第１３態様によれば、音声情報に基づいて生成された音声記録を編集する場合、音声記録に含まれるテキストデータ又はそのテキストデータに対応する画像データを編集すればよいので、音声記録の編集における利便性を向上させることが可能になる。 According to the thirteenth aspect, when editing a voice recording generated based on voice information, it is only necessary to edit the text data included in the voice recording or the image data corresponding to the text data, thereby improving the convenience of editing the voice recording.

＜第１４態様＞
第１４態様としての音声記録管理システムは、第１態様において、前記音声記録管理装置の前記受信手段が、前記編集要求として、前記編集後画面に表示された所定の領域を分割するための分割操作指示要求を受信し、前記送信手段が、前記分割操作指示要求に応じて、前記所定のテキストデータを分割処理して得られた分割編集後テキストデータと、前記所定の画像データを分割処理して得られた分割編集後画像データと、を含む分割編集後画面データを、前記第２の通信端末に対して送信し、前記第２の通信端末の表示制御手段が、前記音声記録管理装置が送信した前記分割編集後画面データに係る分割編集後画面を、前記表示手段に表示する。 <14th aspect>
In a fourteenth aspect, in the voice recording management system according to the first aspect, the receiving means of the voice recording management device receives, as the editing request, a split operation instruction request to split a specified area displayed on the edited screen; the transmitting means transmits to the second communication terminal, in response to the split operation instruction request, split edited screen data including split edited text data obtained by split processing the specified text data and split edited image data obtained by split processing the specified image data; and the display control means of the second communication terminal displays, on the display means, the split edited screen related to the split edited screen data transmitted by the voice recording management device.

第１４態様によれば、作成した会議議事録等が表示される音声記録画面に対して会議等の参加者に無関係な画面を分割編集し、必要な情報だけをその情報を必要とする参加者に閲覧させることが可能になる。その結果、音声記録画面に表示される表示画面データ、音声再生される音声データ等の秘匿性を向上させることが可能になる。 According to the 14th aspect, it is possible to split and edit the screens that are not relevant to the participants of the meeting, etc., on the audio recording screen on which the created meeting minutes, etc. are displayed, and to allow the participants who need the information to view only the necessary information. As a result, it is possible to improve the confidentiality of the display screen data displayed on the audio recording screen, the audio data played back, etc.
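
The split editing of the fourteenth aspect can be pictured with a short sketch. The code below is a hedged illustration, not the split processing actually performed by the device: it assumes that each text or image on the audio recording screen carries a time offset, and the names `Item` and `split_items` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Item:
    time: float   # seconds from the start of the recording (assumed unit)
    kind: str     # "text" or "image"
    content: str


def split_items(items: List[Item], split_time: float) -> Tuple[List[Item], List[Item]]:
    """Split screen items into those before and those at or after split_time."""
    before = [i for i in items if i.time < split_time]
    after = [i for i in items if i.time >= split_time]
    return before, after


items = [
    Item(0.0, "image", "slide-1"),
    Item(2.0, "text", "Project A status"),
    Item(60.0, "image", "slide-2"),
    Item(62.0, "text", "Project B budget"),
]
part_a, part_b = split_items(items, 60.0)
print([i.content for i in part_a])
print([i.content for i in part_b])
```

Under these assumptions, each part could then be sent only to the participants who need it, which is the confidentiality benefit described above.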

＜第１５態様＞
第１５態様としての音声記録管理システムは、第１態様において、前記音声記録管理装置の前記送信手段が、前記分割操作指示要求に応じて、前記分割編集後画面データに含まれる所定のテキストデータを含む所定のブックマーク情報を前記第２の通信端末に対して送信し、前記第２の通信端末の表示制御手段が、前記音声記録管理装置が送信した前記分割編集後画面データに含まれる前記所定のブックマーク情報を含めた前記分割編集後画面データに係る分割編集後画面を、前記表示手段に表示する。 <Fifteenth aspect>
In the voice recording management system of the fifteenth aspect, in the first aspect, the transmission means of the voice recording management device transmits specified bookmark information including specified text data contained in the split edited screen data to the second communication terminal in response to the split operation instruction request, and the display control means of the second communication terminal displays on the display means a split edited screen related to the split edited screen data including the specified bookmark information contained in the split edited screen data transmitted by the voice recording management device.

第１５態様によれば、第１４態様に係る音声記録画面に表示される表示画面データ、音声再生される音声データ等の秘匿性の向上に加えて、ブックマーク処理したテキストについても、分割された画面に対応させて表示又は非表示させることが可能になる。その結果、利用者に対して音声記録画面における編集の利便性をさらに向上させることが可能になる。 According to the fifteenth aspect, in addition to improving the confidentiality of the display screen data displayed on the voice recording screen according to the fourteenth aspect, the voice data played back, etc., it is also possible to display or hide bookmarked text in accordance with the divided screens. As a result, it is possible to further improve the convenience of editing on the voice recording screen for the user.

＜第１６態様＞
第１６態様としての音声記録管理システムは、第１態様において、前記第２の通信端末の音声再生手段は、前記所定のテキストデータを分割した分割日時よりも前の日時であって、前記分割日時に最も近い日時に開始された発話の終了日時が前記分割日時を跨ぐ場合に、前記発話の終了日時を前記分割日時として分割され前記音声記録管理装置が送信した分割編集後音声データに係る分割編集後音声を再生する。 <16th aspect>
In a sixteenth aspect, in the voice recording management system according to the first aspect, when the utterance that started before, and closest to, the division date and time at which the specified text data was divided has an end date and time that extends past the division date and time, the audio playback means of the second communication terminal plays back split edited audio related to split edited audio data that was split using the end date and time of that utterance as the division date and time and was transmitted by the voice recording management device.

第１６態様によれば、第１４態様及び第１５態様に係る音声記録画面における編集の利便性の向上に加えて、音声データの内容を保証することが可能になる。 According to the 16th aspect, in addition to improving the convenience of editing on the audio recording screen according to the 14th and 15th aspects, it is possible to guarantee the content of the audio data.
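
The adjustment of the division time in the sixteenth aspect can be sketched as follows. This is a minimal sketch under assumptions, not the device's implementation: times are modeled as seconds from the start of the recording rather than dates and times, and the names `Utterance` and `adjust_split_time` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Utterance:
    start: float  # seconds from the start of the recording (assumed unit)
    end: float


def adjust_split_time(utterances: List[Utterance], split_time: float) -> float:
    """Move the split to the end of an utterance that straddles split_time.

    Only the utterance that started before split_time and closest to it
    is examined, following the wording of the sixteenth aspect.
    """
    started_before = [u for u in utterances if u.start < split_time]
    if not started_before:
        return split_time
    nearest = max(started_before, key=lambda u: u.start)
    return nearest.end if nearest.end > split_time else split_time


utterances = [Utterance(0.0, 5.0), Utterance(8.0, 12.0), Utterance(13.0, 20.0)]
print(adjust_split_time(utterances, 10.0))
print(adjust_split_time(utterances, 12.5))
```

With a requested split at 10.0, the utterance running from 8.0 to 12.0 straddles the split, so the split moves to 12.0 and the utterance is not cut in the middle; a split at 12.5 falls between utterances and is left unchanged.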

１通信システム
２音声記録管理システム
３通信端末
５音声記録管理装置
７音声認識サーバ（クラウドサービス）
３１送受信部（受信手段の一例、送信手段の一例）
３２操作受付部（受付手段の一例）
３３音・画像取得部（取得手段の一例）
３４表示制御部（表示制御手段の一例）
３６音声再生部（音声再生手段の一例）
５１送受信部（受信手段の一例、送信手段の一例）
５２取得部（取得手段の一例）
５３算出特定部（算出手段の一例、特定手段の一例）
５４表示制御部（表示制御手段の一例）
５５判断部（判断手段の一例）
５６認証部（認証手段の一例）
５７生成・処理部（生成手段の一例）
５８設定登録部（設定手段の一例、登録手段の一例）
５９記憶読出部（記憶読出手段の一例）
1 Communication system
2 Voice recording management system
3 Communication terminal
5 Voice recording management device
7 Voice recognition server (cloud service)
31 Transmitting/receiving unit (an example of a receiving means, an example of a transmitting means)
32 Operation reception unit (an example of a reception means)
33 Sound/image acquisition unit (an example of an acquisition means)
34 Display control unit (an example of a display control means)
36 Audio playback unit (an example of audio playback means)
51 Transmitting/receiving unit (an example of a receiving means, an example of a transmitting means)
52 Acquisition unit (an example of an acquisition means)
53 Calculation determination unit (an example of a calculation means, an example of a determination means)
54 Display control unit (an example of a display control means)
55 Determination unit (an example of a determination means)
56 Authentication unit (an example of an authentication means)
57 Generation/processing unit (an example of a generation means)
58 Setting registration unit (an example of a setting means, an example of a registration means)
59 Memory readout unit (an example of a memory readout means)

特開２０１９‐１３９５７２号公報 JP 2019-139572 A

Claims

1. A voice recording management system comprising: a voice recording management device that manages voice recording information obtained based on voice information; and one or more communication terminals that can display the voice recording information by communicating with the voice recording management device, wherein
the voice recording management device includes:
an acquisition means for acquiring voice data representing the voice information transmitted by a first communication terminal among the one or more communication terminals, and screen data representing a screen displayed on the first communication terminal; and
a transmission means for transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image included on the screen related to the acquired screen data and associated with the predetermined text, and predetermined voice data represented by the predetermined text,
the transmission means transmits, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data, to a second communication terminal different from the first communication terminal, and
the second communication terminal includes:
a display control means for displaying, on a display means, an edited screen related to the edited screen data transmitted from the voice recording management device; and
a voice reproducing means for reproducing edited voice related to the edited voice data transmitted from the voice recording management device.

2. The voice recording management system according to claim 1, wherein
the voice recording management device further includes a receiving means for receiving, as the editing request, either a text non-display request for not displaying the predetermined text or an image non-display request for not displaying the predetermined image, and
the transmission means:
when the received editing request is the text non-display request, transmits to the second communication terminal, as the editing process, first non-display screen data including first non-display text data obtained by non-display processing of the predetermined text data and first non-display image data obtained by non-display processing of the predetermined image data, and first muted audio data obtained by muting the predetermined voice data; and
when the received editing request is the image non-display request, transmits to the second communication terminal, as the editing process, second non-display screen data including second non-display image data obtained by non-display processing of the predetermined image data and second non-display text data obtained by non-display processing of one or more texts associated with the predetermined image, and second muted audio data obtained by muting one or more pieces of predetermined voice data associated with the one or more texts.

3. The voice recording management system according to claim 2, wherein
the display control means displays a text non-display operation unit, operated to generate the text non-display request, near the predetermined text displayed on the screen, and displays an image non-display operation unit, operated to generate the image non-display request, near the predetermined image displayed on the screen.

4. The voice recording management system according to claim 2, wherein
the voice recording management device further comprises a generating means for generating the first non-display screen data and the first muted audio data in response to the text non-display request, and generating the second non-display screen data and the second muted audio data in response to the image non-display request.

5. The voice recording management system according to claim 2, wherein
the first communication terminal further comprises an acquisition means for acquiring screen data relating to the screen at a predetermined time interval,
the display control means, when the image non-display request for the predetermined image acquired at an acquisition time is received in a state in which specific text data obtained based on the voice data exists across the acquisition time at which the screen data is acquired by the acquisition means, displays on the display means a third non-display screen related to third non-display screen data obtained by non-display processing of the specific text data in addition to the second non-display text data, and
the voice reproducing means reproduces voice in which the voice data indicated by each text related to the second non-display text data and the specific text data has been muted.

6. The voice recording management system according to claim 5, wherein
the display control means displays a screen on which specific text related to the specific text data is kept in a non-display state when an image display request for displaying the specific image again is received after the image non-display request for the specific image is received, and
the voice reproducing means reproduces voice in which specific voice related to specific voice data indicated by the specific text is kept in a muted state.

7. The voice recording management system according to claim 2, wherein
the text non-display request and the image non-display request are each a request to delete or to make private the predetermined text data and the predetermined image data, and
the voice recording management device further comprises a processing means for, in the case of the request to delete, deleting the predetermined text data and the predetermined image data, and, in the case of the request to make private, hiding the predetermined text data and the predetermined image data, without deleting them, on a first non-display screen represented by the first non-display screen data or a second non-display screen represented by the second non-display screen data.

8. The voice recording management system according to claim 1, wherein
the display control means displays, when a plurality of texts displayed on the screen are selected as one text group, a summary input operation section for inputting a summary representing the text group near the one text group.

9. The voice recording management system according to claim 8, wherein
the display control means displays a text non-display operation section in the vicinity of the one text group together with the summary input operation section.

10. The voice recording management system according to claim 1, wherein
the first non-display text data, the second non-display text data, the first non-display image data, and the second non-display image data, which are subjected to non-display processing in response to a text non-display request or an image non-display request, are each confidential data including performance information, sales information, profit information, and personal information of a predetermined business.

11. A voice recording management device for managing voice recording information obtained based on voice information, comprising:
an acquisition means for acquiring voice data representing the voice information transmitted by a first communication terminal among one or more communication terminals capable of displaying the voice recording information by communicating with the voice recording management device, and screen data representing a screen displayed on the first communication terminal; and
a transmission means for transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image included on the screen related to the acquired screen data and associated with the predetermined text, and predetermined voice data represented by the predetermined text,
wherein the transmission means transmits, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data, to a second communication terminal different from the first communication terminal.

12. A voice recording management method executed by a voice recording management device that manages voice recording information obtained based on voice information, comprising:
an acquiring step of acquiring voice data representing the voice information transmitted by a first communication terminal among one or more communication terminals capable of displaying the voice recording information by communicating with the voice recording management device, and screen data representing a screen displayed on the first communication terminal; and
a transmission step of transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included on the screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text,
wherein the transmission step includes transmitting, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data, to a second communication terminal different from the first communication terminal.

13. A program for causing a voice recording management device that manages voice recording information obtained based on voice information to execute a process comprising:
an acquiring step of acquiring voice data representing the voice information transmitted by a first communication terminal among one or more communication terminals capable of displaying the voice recording information by communicating with the voice recording management device, and screen data representing a screen displayed on the first communication terminal; and
a transmission step of transmitting, to the one or more communication terminals including the first communication terminal, predetermined text data representing a predetermined text obtained based on the acquired voice data, predetermined image data representing a predetermined image that is included on the screen related to the acquired screen data and is associated with the predetermined text, and predetermined voice data indicated by the predetermined text,
wherein the transmission step includes transmitting, in response to an editing request transmitted by the first communication terminal for the predetermined text or the predetermined image, edited screen data including edited text data obtained by editing the predetermined text data and edited image data obtained by editing the predetermined image data, and edited voice data obtained by editing the predetermined voice data, to a second communication terminal different from the first communication terminal.

14. The voice recording management system according to claim 2, wherein
the receiving means receives, as the editing request, a split operation instruction request for splitting a predetermined area displayed on the edited screen,
the transmission means transmits, in response to the split operation instruction request, split-edited screen data including split-edited text data obtained by splitting the predetermined text data and split-edited image data obtained by splitting the predetermined image data, to the second communication terminal, and
the display control means of the second communication terminal displays, on the display means, a split-edited screen related to the split-edited screen data transmitted from the voice recording management device.

15. The voice recording management system according to claim 14, wherein
the transmission means transmits, in response to the split operation instruction request, predetermined bookmark information including predetermined text data included in the split-edited screen data to the second communication terminal, and
the display control means of the second communication terminal displays, on the display means, a split-edited screen related to the split-edited screen data including the predetermined bookmark information contained in the split-edited screen data transmitted from the voice recording management device.

16. The voice recording management system according to claim 14 or 15, wherein
the voice reproducing means of the second communication terminal, when the utterance that started before, and closest to, the split date and time at which the predetermined text data was split has an end date and time that extends past the split date and time, reproduces split-edited voice related to split-edited voice data that was split using the end date and time of the utterance as the split date and time and was transmitted by the voice recording management device.