JP7651833B2

JP7651833B2 - IMAGE EDITING DEVICE, IMAGE EDITING METHOD, AND IMAGE EDITING PROGRAM

Info

Publication number: JP7651833B2
Application number: JP2020172874A
Authority: JP
Inventors: 悠貴堀; 恵里渡辺; 拓郎安田; 健太郎萩田; 拓朗内藤; 宏昌田中; 輝憲小山
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2020-10-13
Filing date: 2020-10-13
Publication date: 2025-03-27
Anticipated expiration: 2040-10-13
Also published as: JP2022064243A

Description

本発明は、画像編集装置、画像編集方法、及び画像編集プログラムに関する。 The present invention relates to an image editing device, an image editing method, and an image editing program.

コミュニケーションの場面では、効率的に正確な情報伝達を行うために、言語情報によるコミュニケーションだけでは不十分であることがある。例えば、新しい企画やアイデアを考える際に行われる会議では、意思疎通のために視覚情報の活用が有効である。コミュニケーションの場面では、正確な視覚情報化の手段が望まれる。 In communication situations, verbal communication alone may not be sufficient to transmit information efficiently and accurately. For example, in meetings held to think up new plans or ideas, it is effective to use visual information to communicate. In communication situations, accurate means of visual information are desirable.

特許文献１には、グラフィックレコーディングシステムの議事録編集画面において、イラスト検索結果表示領域に表示されたイラストが選択され、選択されたイラストが議事録に貼り付けられて、イラストを交えた議事録が作成される。イラストを交えた議事録を眺めることで会議の振り返りを効率的に行うことができることが記載されている。 Patent Document 1 describes how an illustration displayed in an illustration search result display area is selected on the minutes editing screen of a graphic recording system, and the selected illustration is pasted into the minutes to create minutes that include illustrations. It describes how looking at the minutes that include illustrations allows for an efficient review of the meeting.

特許文献１に記載の技術では、グラフィックレコーディングシステムに設定されたイラストを選択することしかできず、設定されたイラストによっては、ユーザの意図を十分に情報伝達できない可能性がある。 The technology described in Patent Document 1 only allows the user to select an illustration set in the graphic recording system, and depending on the illustration set, it may not be possible to fully convey the user's intentions.

本発明は、上記に鑑みてなされたものであって、ユーザによって編集可能な視覚情報を提示できる画像編集装置、画像編集方法、及び画像編集プログラムを提供することを目的とする。 The present invention has been made in consideration of the above, and aims to provide an image editing device, an image editing method, and an image editing program that can present visual information that can be edited by a user.

上述した課題を解決し、目的を達成するために、本発明の１つの側面にかかる画像編集装置は、画像データを表示する表示手段と、前記表示手段に表示される画像データのうち第１のデータフォーマットを有する第１の画像データに第１の編集処理を施す第１の編集手段と、前記表示手段に表示される画像データのうち前記第１のデータフォーマットとは異なる第２のデータフォーマットを有する第２の画像データに第２の編集処理を施す第２の編集手段と、前記第１のデータフォーマットを前記第２のデータフォーマットに変換する変換手段と、を有し、前記表示手段は、前記第１の編集手段によって前記第１の編集処理が施された場合に、前記第１の画像データの表示が前記第１の編集処理後の画像データに表示が変更され、前記変換手段によって変換された前記第２の画像データに前記第２の編集処理が施された場合に、前記第２の画像データの表示が前記第２の編集処理後の表示に変更される。 In order to solve the above-mentioned problems and achieve the object, an image editing device according to one aspect of the present invention has a display means for displaying image data, a first editing means for performing a first editing process on first image data having a first data format among the image data displayed on the display means, a second editing means for performing a second editing process on second image data having a second data format different from the first data format among the image data displayed on the display means, and a conversion means for converting the first data format into the second data format, and when the first editing process is performed by the first editing means, the display of the first image data is changed to the image data after the first editing process, and when the second editing process is performed on the second image data converted by the conversion means, the display of the second image data is changed to the display after the second editing process.

本発明によれば、ユーザによって編集可能な視覚情報を提示できるという効果を奏する。 The present invention has the advantage of being able to present visual information that can be edited by the user.

図１は、実施形態にかかるビジュアルコミュニケーションシステムの構成を示す図である。FIG. 1 is a diagram showing a configuration of a visual communication system according to an embodiment. 図２は、実施形態にかかるビジュアルコミュニケーションシステムに適用されるコンピュータのハードウェア構成を示す図である。FIG. 2 is a diagram showing a hardware configuration of a computer applied to the visual communication system according to the embodiment. 図３は、実施形態における言語情報入力部の構成を示す図である。FIG. 3 is a diagram showing a configuration of a language information input unit in the embodiment. 図４は、実施形態におけるイラスト表示部の構成を示す図である。FIG. 4 is a diagram showing the configuration of an illustration display unit in the embodiment. 図５は、実施形態における画像データの変換を示すデータフロー図である。FIG. 5 is a data flow diagram illustrating the conversion of image data in an embodiment. 図６は、実施形態におけるイラスト蓄積部に蓄積されるイラスト情報のデータ構造を示す図である。FIG. 6 is a diagram showing a data structure of illustration information stored in the illustration storage unit in the embodiment. 図７は、実施形態におけるサムネイル蓄積部に蓄積されるサムネイル情報のデータ構造を示す図である。FIG. 7 is a diagram showing the data structure of thumbnail information stored in the thumbnail storage unit in the embodiment. 図８は、実施形態における描画操作部の構成を示す図である。FIG. 8 is a diagram showing the configuration of a drawing operation unit in the embodiment. 図９は、実施形態にかかるビジュアルコミュニケーションシステムの動作を示すフローチャートである。FIG. 9 is a flowchart showing the operation of the visual communication system according to the embodiment. 図１０は、実施形態における登録処理の流れを示すフローチャートである。FIG. 10 is a flowchart showing the flow of the registration process in the embodiment. 図１１は、実施形態におけるコミュニケーション支援処理の流れを示すフローチャートである。FIG. 11 is a flowchart showing the flow of the communication support process in the embodiment. 図１２は、実施形態におけるコミュニケーション支援処理の流れを示すフローチャートである。FIG. 12 is a flowchart showing the flow of the communication support process in the embodiment. 図１３は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の構成を示す図である。FIG. 13 is a diagram showing the configuration of a display screen of the visual communication system according to the embodiment. 図１４は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の概略動作を示す図である。FIG. 14 is a diagram showing an outline of the operation of a display screen of the visual communication system according to the embodiment. 図１５は、実施形態における２段階の編集処理の流れを示す図である。FIG. 15 is a diagram showing the flow of a two-stage editing process in this embodiment. 図１６は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 16 is a diagram showing a detailed operation of a display screen of the visual communication system according to the embodiment. 図１７は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 17 is a diagram showing a detailed operation of a display screen of the visual communication system according to the embodiment. 図１８は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 18 is a diagram showing a detailed operation of a display screen by the visual communication system in the embodiment. 図１９は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 19 is a diagram showing a detailed operation of a display screen of the visual communication system according to the embodiment. 図２０は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 20 is a diagram showing a detailed operation of a display screen of the visual communication system in the embodiment. 図２１は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 21 is a diagram showing a detailed operation of a display screen of the visual communication system according to the embodiment. 図２２は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 22 is a diagram showing a detailed operation of a display screen by the visual communication system in the embodiment. 図２３は、実施形態におけるビジュアルコミュニケーションシステムによる表示画面の詳細動作を示す図である。FIG. 23 is a diagram showing a detailed operation of a display screen by the visual communication system in the embodiment. 図２４は、実施形態の変形例にかかるオンライン会議システムの構成を示す図である。FIG. 24 is a diagram illustrating a configuration of an online conference system according to a modified example of the embodiment. 図２５は、実施形態の変形例におけるクライアント端末のカメラによって撮像されたカメラ画像を示す図である。FIG. 25 is a diagram showing a camera image captured by a camera of a client terminal in a modified example of the embodiment. 図２６は、実施形態の変形例における描画表示部が図２５のカメラ画像に重畳する画像を示す図である。FIG. 26 is a diagram showing an image superimposed on the camera image of FIG. 25 by a drawing display unit in a modification of the embodiment. 図２７は、実施形態の変形例における他のクライアント端末で表示される画像を示す図である。FIG. 27 is a diagram showing an image displayed on another client terminal in the modified example of the embodiment.

（実施形態）
実施形態にかかるビジュアルコミュニケーションシステムは、コミュニケーションを支援する機能を有する。コミュニケーションの場面では、効率的に正確な情報伝達を行うために、言語情報によるコミュニケーションだけでは不十分であることがある。例えば、新しい企画やアイデアを考える際に行われる会議では、意思疎通のために視覚情報の活用が有効である。視覚情報化の手段として手でアイデアの絵をスケッチすることが考えられるが、頭の中の情景そのものを０から視覚情報化することが容易でないことがある。そのため、誰でも扱いが簡単な言語情報を基に視覚情報に変換する第１の技術が知られている。 (Embodiment)
The visual communication system according to the embodiment has a function of supporting communication. In communication situations, communication by language information alone may not be sufficient to efficiently and accurately transmit information. For example, in a meeting held when thinking of new plans or ideas, it is effective to use visual information for communication. One possible way to visualize information is to sketch a picture of an idea by hand, but it may not be easy to visualize the scene in one's head from scratch. For this reason, a first technology is known that converts information into visual information based on language information that is easy for anyone to use.

しかし、言語情報を基に視覚情報に変換する第１の技術では、予め用意されたイラストが検索されるにすぎない。具体的に説明すると、頭の中に存在する伝えたい情景と一致する向きや組み合わせのイラストが存在せず適切に情報伝達ができない場合が多い。予め用意されたイラストの数を増加させていけば、頭の中の情景と一致するイラストが存在する確率は高まっていくが、イラストの数の増加に伴って、選択するための時間が長時間化してしまう。また、頭の中の情景を正確に再現するために、イラストを基に自分で編集しようとしても、簡易な編集機能しかなければ、編集後のイラストは、頭の中の情景に近いものとなりにくい。そのため、第１の技術では、会議などのコミュニケーションの場で正確に情報伝達ができない可能性がある。 However, the first technology, which converts language information into visual information, merely searches for illustrations prepared in advance. More specifically, there are many cases where the information cannot be properly transmitted because there are no illustrations with orientations or combinations that match the scene in one's head. Increasing the number of prepared illustrations increases the probability that an illustration matching the scene in one's head is present, but as the number of illustrations increases, the time required to select increases. Furthermore, even if one tries to edit the illustrations themselves in order to accurately reproduce the scene in one's head, if there is only a simple editing function, the edited illustration is unlikely to be close to the scene in one's head. For this reason, the first technology may not be able to accurately transmit information in communication situations such as meetings.

そこで、本実施形態では、ビジュアルコミュニケーションシステムにおいて、ユーザから受け付けた言語情報に対応するイラストを検索して表示し、表示されたイラストに対して表示形態を変更させながら２段階の編集処理を可能とすることで、コミュニケーションの場面における正確な情報伝達の支援を図る。 In this embodiment, the visual communication system searches for and displays illustrations that correspond to the language information received from the user, and enables a two-stage editing process in which the display form of the displayed illustration is changed, thereby supporting accurate information transmission in communication situations.

具体的には、ビジュアルコミュニケーションシステムは、言語情報を基に思い通りに編集可能な視覚情報に変換し、会議などのコミュニケーションの場で頭の中の情景の短時間での正確な意思疎通を可能にする。ビジュアルコミュニケーションシステムは、コミュニケーションの場における発話や文字入力などの言語情報をリアルタイムで解析し、解析すると同時に言語情報に関連するイラストの候補として１以上のサムネイル画像を特定エリアに表示する。ビジュアルコミュニケーションシステムは、言語情報とサムネイル画像とが対応付けられた第１の対応情報を有しており、言語情報を受け付けると、第１の対応情報におけるその言語情報に対応する１以上のサムネイル画像を特定して表示する。また、ビジュアルコミュニケーションシステムは、サムネイル画像とイラストとが対応付けられた第２の対応情報を有している。イラストが３次元画像である場合に、サムネイル画像を２次元画像とすることで、ビジュアルコミュニケーションシステムは、イラストを表示する場合に比べて、サムネイル画像を高速に表示できる。ユーザからの言語情報が受け付けられるたびに、表示画面上のサムネイル画像が新しく切り替わって高速に更新表示され得る。これにより、ユーザが会話を止めずにその中から、発話又は文字入力された際の頭の中の情景に近い１以上のサムネイル画像を選択できる。第２の対応情報に応じて、選択されたサムネイル画像に紐づけられた３次元イラストデータが検索され、特定された３次元イラストデータが表示される。これにより、３次元イラストデータに対して３次元的な編集処理（３次元的な移動、拡縮、回転）を行うことができる。その後に、ビジュアルコミュニケーションシステムは、３次元的な編集処理が可能な３次元イラストデータを２次元的な編集処理が可能な２次元イラストデータに変換（固定化）する。これに応じて、２次元イラストデータに対して、２次元的な編集処理（一部を自由に消したり上から付け加えたり色を塗ったりすること）が可能である。この言語情報の自動認識をトリガーとする２段階の編集処理により、リアルタイムのコミュニケーションの場で、頭の中の情景を短時間で正確に表現でき、迅速かつ正確な意思疎通が可能となる。 Specifically, the visual communication system converts language information into visual information that can be edited as desired, enabling accurate communication of the scene in one's head in a short time in a communication situation such as a conference. The visual communication system analyzes language information such as speech and character input in a communication situation in real time, and at the same time displays one or more thumbnail images in a specific area as candidates for illustrations related to the language information. The visual communication system has first correspondence information in which language information and thumbnail images are associated, and when language information is received, one or more thumbnail images corresponding to the language information in the first correspondence information are specified and displayed. In addition, the visual communication system has second correspondence information in which thumbnail images are associated with illustrations. When the illustration is a three-dimensional image, the visual communication system can display the thumbnail image at a higher speed than when displaying an illustration by making the thumbnail image a two-dimensional image. Each time language information from a user is received, the thumbnail image on the display screen can be switched to a new one and updated and displayed at a high speed. This allows the user to select one or more thumbnail images that are close to the scene in his or her head when the speech or character input is received from among them without stopping the conversation. According to the second correspondence information, three-dimensional illustration data associated with the selected thumbnail image is searched for, and the identified three-dimensional illustration data is displayed. This allows three-dimensional editing (three-dimensional movement, enlargement, reduction, and rotation) of the three-dimensional illustration data. The visual communication system then converts (fixes) the three-dimensional illustration data that can be three-dimensionally edited into two-dimensional illustration data that can be two-dimensionally edited. In response to this, two-dimensional editing (freely erasing parts, adding on top, or painting) of the two-dimensional illustration data is possible. This two-stage editing process triggered by automatic recognition of language information allows the scene in one's head to be accurately expressed in a short time in a real-time communication situation, enabling rapid and accurate communication.

より具体的には、ビジュアルコミュニケーションシステム４は、図１に示すように構成され得る。図１は、ビジュアルコミュニケーションシステム４の構成を示す図である。 More specifically, the visual communication system 4 may be configured as shown in FIG. 1. FIG. 1 is a diagram showing the configuration of the visual communication system 4.

ビジュアルコミュニケーションシステム４は、クライアント端末１、サーバ２、及び接続部３を有する。接続部３は、クライアント端末１及びサーバ２を互いに通信可能に接続する。 The visual communication system 4 has a client terminal 1, a server 2, and a connection unit 3. The connection unit 3 connects the client terminal 1 and the server 2 so that they can communicate with each other.

クライアント端末１は、言語情報入力部１００、イラスト表示部２００、描画操作部３００を有する。サーバ２は、描画表示部４００及び記憶部５００を有する。記憶部５００は、プログラム５００ａを格納する。 The client terminal 1 has a language information input unit 100, an illustration display unit 200, and a drawing operation unit 300. The server 2 has a drawing display unit 400 and a memory unit 500. The memory unit 500 stores a program 500a.

ビジュアルコミュニケーションシステム４は、ユーザからの起動要求をクライアント端末１で受け付けると、起動要求がクライアント端末１からサーバ２へ送信され、サーバ２で起動要求に応じてプログラム５００ａが記憶部５００から読み出される。ビジュアルコミュニケーションシステム４は、プログラム５００ａに従い、例えば図１に例示するように、クライアント端末１内に言語情報入力部１００、イラスト表示部２００、描画操作部３００を機能的に構成し、サーバ２内に描画表示部４００を機能的に構成する。 When the visual communication system 4 receives a startup request from a user at the client terminal 1, the startup request is transmitted from the client terminal 1 to the server 2, and the server 2 reads the program 500a from the storage unit 500 in response to the startup request. In accordance with the program 500a, the visual communication system 4 functionally configures a language information input unit 100, an illustration display unit 200, and a drawing operation unit 300 in the client terminal 1, and functionally configures a drawing display unit 400 in the server 2, as shown in FIG. 1, for example.

なお、ビジュアルコミュニケーションシステム４は、クライアント端末１内のイラスト表示部２００と描画操作部３００とをサーバ２または別のサーバ内に構成してもよい。あるいは、ビジュアルコミュニケーションシステム４は、言語情報入力部１００、イラスト表示部２００、描画操作部３００、描画表示部４００を含むすべての機能構成をクライアント端末１内で完結するように構成しても良い。あるいは、ビジュアルコミュニケーションシステム４は、イラスト表示部２００に含まれる複数の要素の一部（例えば、ユーザインタフェース及びそれに近い部分）をクライアント端末１内に構成し、残りの部分をサーバ２または別のサーバ内に構成してもよい。同様に、ビジュアルコミュニケーションシステム４は、描画操作部３００に含まれる複数の要素の一部（例えば、ユーザインタフェース及びそれに近い部分）をクライアント端末１内に構成し、残りの部分をサーバ２または別のサーバ内に構成してもよい。 In addition, the visual communication system 4 may configure the illustration display unit 200 and the drawing operation unit 300 in the client terminal 1 in the server 2 or another server. Alternatively, the visual communication system 4 may be configured so that all functional configurations including the language information input unit 100, the illustration display unit 200, the drawing operation unit 300, and the drawing display unit 400 are completed in the client terminal 1. Alternatively, the visual communication system 4 may configure a part of the multiple elements included in the illustration display unit 200 (e.g., the user interface and a part similar thereto) in the client terminal 1, and the remaining part in the server 2 or another server. Similarly, the visual communication system 4 may configure a part of the multiple elements included in the drawing operation unit 300 (e.g., the user interface and a part similar thereto) in the client terminal 1, and the remaining part in the server 2 or another server.

接続部３は、有線通信回線及び／又は無線通信回線であってもよく、いわゆる通信ネットワークであってもよいし、通信ケーブル等であってもよい。接続部３は、インターネット、移動体通信網、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）等のうち１つ以上を用いて構築されていてもよい。接続部３は、有線通信だけでなく、３Ｇ（３ｒｄＧｅｎｅｒａｔｉｏｎ）、４Ｇ（４ｔｈＧｅｎｅｒａｔｉｏｎ）、５Ｇ（５ｔｈＧｅｎｅｒａｔｉｏｎ）、Ｗｉ－Ｆｉ（ＷｉｒｅｌｅｓｓＦｉｄｅｌｉｔｙ）（登録商標）、ＷｉＭＡＸ（ＷｏｒｌｄｗｉｄｅＩｎｔｅｒｏｐｅｒａｂｉｌｉｔｙｆｏｒＭｉｃｒｏｗａｖｅＡｃｃｅｓｓ）またはＬＴＥ（ＬｏｎｇＴｅｒｍＥｖｏｌｕｔｉｏｎ）等の無線通信によるネットワークが含まれてもよい。プログラム５００ａがクライアント端末１内に格納され各機能構成がクライアント端末１内で完結するように構成される場合、接続部３は省略されてもよい。 The connection unit 3 may be a wired communication line and/or a wireless communication line, may be a so-called communication network, may be a communication cable, etc. The connection unit 3 may be constructed using one or more of the Internet, a mobile communication network, a LAN (Local Area Network), etc. The connection unit 3 may include not only wired communication but also a wireless communication network such as 3G (3rd Generation), 4G (4th Generation), 5G (5th Generation), Wi-Fi (Wireless Fidelity) (registered trademark), WiMAX (Worldwide Interoperability for Microwave Access), or LTE (Long Term Evolution). If the program 500a is stored in the client terminal 1 and each functional configuration is configured to be completed within the client terminal 1, the connection unit 3 may be omitted.

クライアント端末１において、言語情報入力手段としての言語情報入力部１００は、ユーザによる言語情報の入力を受け付ける。イラスト表示部２００は、ディスプレイに文字・イラスト等の画像を表示する。描画操作部３００は、ユーザによる描画操作を受け付ける。ここで、描画とは、手書きだけでなく、ディスプレイへの描画を目的としたディスプレイ上での選択動作等も含まれる。クライアント端末１の描画操作部３００は、サーバ２の描画表示部４００へ描画操作要求を送信する。サーバ２の描画表示部４００は、描画操作要求に応じて、表示画像の表示形態を変化させるように表示情報を更新してクライアント端末１へ送信する。クライアント端末１は、更新後の表示情報を受信し、その表示情報に応じた画像をディスプレイに表示する。これにより、ユーザによる描画操作の結果がクライアント端末１のディスプレイに表示される。 In the client terminal 1, the language information input unit 100, which serves as a language information input means, accepts input of language information by the user. The illustration display unit 200 displays images such as characters and illustrations on the display. The drawing operation unit 300 accepts drawing operations by the user. Here, drawing includes not only handwriting but also selection operations on the display for the purpose of drawing on the display. The drawing operation unit 300 of the client terminal 1 transmits a drawing operation request to the drawing display unit 400 of the server 2. In response to the drawing operation request, the drawing display unit 400 of the server 2 updates the display information so as to change the display form of the displayed image, and transmits the updated display information to the client terminal 1. The client terminal 1 receives the updated display information and displays an image corresponding to the display information on the display. As a result, the result of the drawing operation by the user is displayed on the display of the client terminal 1.

クライアント端末１、サーバ２は、それぞれ、図２に示すようなコンピュータ５でハードウェア的に構成されてもよい。図２は、ビジュアルコミュニケーションシステム４に適用されるコンピュータ５のハードウェア構成を示す図である。 The client terminal 1 and the server 2 may each be configured in hardware as a computer 5 as shown in FIG. 2. FIG. 2 is a diagram showing the hardware configuration of the computer 5 applied to the visual communication system 4.

コンピュータ５は、図２に示されているように、ＣＰＵ５０１、ＲＯＭ５０２、ＲＡＭ５０３、ＨＤ５０４、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）コントローラ５０５、ディスプレイ５０６、外部機器接続Ｉ／Ｆ（Ｉｎｔｅｒｆａｃｅ）５０８、ネットワークＩ／Ｆ５０９、データバス５１０、キーボード５１１、ポインティングデバイス５１２、ＤＶＤ－ＲＷ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋＲｅｗｒｉｔａｂｌｅ）ドライブ５１４、メディアＩ／Ｆ５１６、動作取得デバイス５１７、マイク５１８、スピーカ５１９、カメラ５２０を備えている。 As shown in FIG. 2, the computer 5 includes a CPU 501, a ROM 502, a RAM 503, a HD 504, a HDD (Hard Disk Drive) controller 505, a display 506, an external device connection I/F (Interface) 508, a network I/F 509, a data bus 510, a keyboard 511, a pointing device 512, a DVD-RW (Digital Versatile Disk Rewritable) drive 514, a media I/F 516, a motion acquisition device 517, a microphone 518, a speaker 519, and a camera 520.

これらのうち、ＣＰＵ５０１は、コンピュータ５全体の動作を制御する。ＲＯＭ５０２は、ＩＰＬ等のＣＰＵ５０１の駆動に用いられるプログラムを記憶する。ＲＡＭ５０３は、ＣＰＵ５０１のワークエリアとして使用される。ＨＤ５０４は、プログラム５００ａ等の各種データを記憶する。ＨＤＤコントローラ５０５は、ＣＰＵ５０１の制御にしたがってＨＤ５０４に対する各種データの読み出し又は書き込みを制御する。表示手段としてのディスプレイ５０６は、カーソル、メニュー、ウィンドウ、文字、又は画像などの各種情報を表示する。外部機器接続Ｉ／Ｆ５０８は、各種の外部機器を接続するためのインターフェースである。この場合の外部機器は、例えば、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリやプリンタ等である。ネットワークＩ／Ｆ５０９は、接続部３を利用してデータ通信をするためのインターフェースである。バスライン５１０は、図２に示されているＣＰＵ５０１等の各構成要素を電気的に接続するためのアドレスバスやデータバス等である。 Of these, the CPU 501 controls the operation of the entire computer 5. The ROM 502 stores programs used to drive the CPU 501, such as IPL. The RAM 503 is used as a work area for the CPU 501. The HD 504 stores various data such as the program 500a. The HDD controller 505 controls the reading or writing of various data from the HD 504 according to the control of the CPU 501. The display 506 as a display means displays various information such as a cursor, menu, window, character, or image. The external device connection I/F 508 is an interface for connecting various external devices. In this case, the external device is, for example, a USB (Universal Serial Bus) memory or a printer. The network I/F 509 is an interface for data communication using the connection unit 3. The bus line 510 is an address bus, a data bus, or the like for electrically connecting each component such as the CPU 501 shown in FIG. 2.

また、キーボード５１１は、文字、数値、各種指示などの入力のための複数のキーを備えた入力手段の一種である。ポインティングデバイス５１２は、各種指示の選択や実行、処理対象の選択、カーソルの移動などを行う入力手段の一種である。ＤＶＤ－ＲＷドライブ５１４は、着脱可能な記録媒体の一例としてのＤＶＤ－ＲＷ５１３に対する各種データの読み出し又は書き込みを制御する。なお、ＤＶＤ－ＲＷに限らず、ＤＶＤ－Ｒ等であってもよい。メディアＩ／Ｆ５１６は、フラッシュメモリ等の記録メディア５１５に対するデータの読み出し又は書き込み（記憶）を制御する。 The keyboard 511 is a type of input means equipped with multiple keys for inputting characters, numbers, various instructions, etc. The pointing device 512 is a type of input means for selecting and executing various instructions, selecting a processing target, moving the cursor, etc. The DVD-RW drive 514 controls the reading and writing of various data from the DVD-RW 513, which is an example of a removable recording medium. Note that this is not limited to a DVD-RW, and may be a DVD-R, etc. The media I/F 516 controls the reading and writing (storing) of data from the recording medium 515, such as a flash memory.

動作取得デバイス５１７は、ユーザの動作を検出し、電気信号に変える回路で、入力手段の一種である。ユーザの動作の検出は、光の反射を検出する光学式、ユーザに取り付けられた磁器センサ、機械式センサ、磁気センサの検出結果を用いるもの、またはユーザの画像を解析する方式等いずれか、またはそれらを組み合わせてもよい。 The motion acquisition device 517 is a circuit that detects the user's motion and converts it into an electrical signal, and is a type of input means. The user's motion may be detected by any of a number of methods, including an optical method that detects the reflection of light, a magnetic sensor, a mechanical sensor, or a magnetic sensor attached to the user, or a method that analyzes an image of the user, or a combination of these.

マイク５１８は、音を電気信号に変える内蔵型の回路である。スピーカ５１９は、電気信号を物理振動に変えて音楽や音声などの音を生み出す内蔵型の回路である。 The microphone 518 is a built-in circuit that converts sound into an electrical signal. The speaker 519 is a built-in circuit that converts the electrical signal into physical vibrations to produce sounds such as music or voice.

スピーカ５１９は、電気信号を物理振動に変えて音楽や音声などの音を生み出す内蔵型の回路である。 Speaker 519 is a built-in circuit that converts electrical signals into physical vibrations to produce sounds such as music and voice.

カメラ５２０は、被写体を撮像して画像データを得る内蔵型の撮像手段の一種である。 Camera 520 is a type of built-in imaging means that captures an image of a subject and obtains image data.

なお、サーバ２に適用されるコンピュータ５において、ＨＤ５０４は、記憶部５００に対応し、プログラム５００ａを格納する。クライアント端末１に適用されるコンピュータ５において、サーバ２から接続部３経由でプログラム５００ａがダウンロードされＨＤ５０４に格納され、ＣＰＵ５０１によりプログラム５００ａがＨＤ５０４から読み出され実行されることで、ＲＡＭ５０３内に図１に示すような機能構成が、コンパイル時に一括して又は処理の進行に応じて順次に展開され得る。また、サーバ２に適用されるコンピュータ５において、プログラム５００ａがＣＰＵ５０１により実行されることで、ＲＡＭ５０３内に図１に示すような機能構成が、コンパイル時に一括して又は処理の進行に応じて展開され得る。 In the computer 5 applied to the server 2, the HD 504 corresponds to the storage unit 500 and stores the program 500a. In the computer 5 applied to the client terminal 1, the program 500a is downloaded from the server 2 via the connection unit 3 and stored in the HD 504, and the program 500a is read from the HD 504 and executed by the CPU 501, so that the functional configuration as shown in FIG. 1 can be expanded in the RAM 503 either collectively at the time of compilation or sequentially as the processing progresses. In the computer 5 applied to the server 2, the program 500a is executed by the CPU 501, so that the functional configuration as shown in FIG. 1 can be expanded in the RAM 503 either collectively at the time of compilation or sequentially as the processing progresses.

また、サーバ２に適用されるコンピュータ５は、ユーザインタフェースに関する構成が省略されていてもよく、ディスプレイ５０６、キーボード５１１、及びポインティングデバイス５１２、動作取得デバイス５１７のうち少なくとも１つが省略されていてもよい。 In addition, the computer 5 applied to the server 2 may omit configuration related to the user interface, and at least one of the display 506, keyboard 511, pointing device 512, and action acquisition device 517 may be omitted.

また、クライアント端末１、サーバ２は、コンピュータ５以外にも、ＩＷＢ（ＩｎｔｅｒａｃｔｉｖｅＷｈｉｔｅＢｏａｒｄ：相互通信が可能な電子式の黒板機能を有する白板）、デジタルサイネージ等の出力装置、ＨＵＤ（ＨｅａｄＵｐＤｉｓｐｌａｙ）装置、産業機械、医療機器、ネットワーク家電、自動車（ＣｏｎｎｅｃｔｅｄＣａｒ）、携帯電話、スマートフォン、タブレット端末、ゲーム機、ＰＤＡ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＡｓｓｉｓｔａｎｔ）等であってもよい。 In addition to the computer 5, the client terminal 1 and the server 2 may be an output device such as an IWB (Interactive White Board: an electronic whiteboard capable of intercommunication), digital signage, a HUD (Head Up Display) device, industrial machinery, medical equipment, network home appliances, automobiles (Connected Cars), mobile phones, smartphones, tablet terminals, game consoles, PDAs (Personal Digital Assistants), etc.

図１に示す言語情報入力部１００は、機能的に、図３に示すように構成され得る。図３は、言語情報入力部１００の構成を示す図である。 The language information input unit 100 shown in FIG. 1 can be functionally configured as shown in FIG. 3. FIG. 3 is a diagram showing the configuration of the language information input unit 100.

言語情報入力部１００は、音声入力部１０１、文字入力部１０２、認識部１０３、及び送信部１０４を備える。これら各部は、ＨＤ５０４からＲＡＭ５０３上に展開されたプログラムに従ったＣＰＵ５０１からの命令によって動作することで実現される機能または手段である。 The language information input unit 100 includes a voice input unit 101, a character input unit 102, a recognition unit 103, and a transmission unit 104. Each of these units is a function or means realized by operating according to an instruction from the CPU 501 in accordance with a program loaded from the HD 504 onto the RAM 503.

音声入力部１０１は、マイク３１８によってユーザの音声が変換された音声信号、またはネットワークＩ／Ｆ５０９経由で受信された音声信号を言語情報として受け付ける。音声入力部１０１は、音声信号を認識部１０３へ供給する。 The voice input unit 101 accepts, as language information, a voice signal converted from the user's voice by the microphone 318, or a voice signal received via the network I/F 509. The voice input unit 101 supplies the voice signal to the recognition unit 103.

文字入力部１０２は、キーボード５４１、ポインティングデバイス５１２、動作取得デバイス５１７によってユーザから入力された文字信号、またはネットワークＩ／Ｆ経由で受信された文字信号を言語情報として受け付ける。文字入力部１０２は、文字信号を認識部１０３へ供給する。文字入力部１０２への文字の入力は、キーボード５４１へのタイピングまたは、ポインティングデバイス５１２や動作取得デバイス５１７による手書き入力を想定する。 The character input unit 102 accepts, as language information, character signals input by the user via the keyboard 541, the pointing device 512, or the motion acquisition device 517, or character signals received via the network I/F. The character input unit 102 supplies the character signals to the recognition unit 103. Characters are input to the character input unit 102 by typing on the keyboard 541, or by handwriting input using the pointing device 512 or the motion acquisition device 517.

認識部１０３は、音声入力部１０１又は文字入力部１０２から言語情報を受けると、言語情報に対して文字情報の認識を行う。 When the recognition unit 103 receives language information from the voice input unit 101 or the character input unit 102, it recognizes character information based on the language information.

認識部１０３は、音声信号を音声入力部１０１から受けると、音声信号に対して音声認識処理を行い文字情報へ変換する。認識部１０３は、文字ごとにテンプレート音声信号を有する。認識部１０３は、音声信号に対してテンプレート音声信号を用いたマッチング処理を行い、そのマッチングスコアに基づいて音声ごとに文字を認識できる。これにより、認識部１０３は、音声信号に対する認識結果として文字情報を生成する。 When the recognition unit 103 receives a voice signal from the voice input unit 101, it performs voice recognition processing on the voice signal and converts it into character information. The recognition unit 103 has a template voice signal for each character. The recognition unit 103 performs matching processing on the voice signal using the template voice signal, and can recognize characters for each voice based on the matching score. In this way, the recognition unit 103 generates character information as the recognition result for the voice signal.

認識部１０３は、文字信号を文字入力部１０２から受けると、文字信号に対する認識結果として文字情報を生成する。認識部１０３は、キーボード５４１へのタイピングによる文字情報を受けた場合、その文字情報を認識結果とする。認識部１０３は、ポインティングデバイス５１２や動作取得デバイス５１７による手書き文字画像を受けた場合、手書き文字画像に対してテキスト認識処理を行い文字情報へ変換する。認識部１０３は、文字ごとにテンプレート文字画像を有する。認識部１０３は、手書き文字画像に対してテンプレート文字画像を用いたマッチング処理を行い、そのマッチングスコアに基づいて手書き文字画像ごとに文字を認識できる。これにより、認識部１０３は、手書き文字画像に対する認識結果として文字情報を生成する。 When the recognition unit 103 receives a character signal from the character input unit 102, it generates character information as the recognition result for the character signal. When the recognition unit 103 receives character information typed on the keyboard 541, it takes the character information as the recognition result. When the recognition unit 103 receives a handwritten character image from the pointing device 512 or the motion acquisition device 517, it performs text recognition processing on the handwritten character image and converts it into character information. The recognition unit 103 has a template character image for each character. The recognition unit 103 performs matching processing on the handwritten character image using the template character image, and can recognize characters for each handwritten character image based on the matching score. In this way, the recognition unit 103 generates character information as the recognition result for the handwritten character image.

認識部１０３は、認識結果を送信部１０４へ供給する。送信部１０４は、認識結果をイラスト表示部２００に送信する。 The recognition unit 103 supplies the recognition result to the transmission unit 104. The transmission unit 104 transmits the recognition result to the illustration display unit 200.

図１に示すイラスト表示部２００は、機能的に、図４に示すように構成され得る。図４は、イラスト表示部２００の構成を示す図である。 The illustration display unit 200 shown in FIG. 1 can be functionally configured as shown in FIG. 4. FIG. 4 is a diagram showing the configuration of the illustration display unit 200.

イラスト表示部２００は、受信部２０１、サムネイル検索部２０２、サムネイル表示部２０３、サムネイル選択部２０４、イラスト検索部２０５、イラスト表示部２０６、文脈解析部２０７、優先度決定部２０８、サムネイル化部２１１、イラスト化部２１３、３次元データ入力部２１４、キーワード化部２１５、及び登録文字入力部２１６を備える。 The illustration display unit 200 includes a receiving unit 201, a thumbnail search unit 202, a thumbnail display unit 203, a thumbnail selection unit 204, an illustration search unit 205, an illustration display unit 206, a context analysis unit 207, a priority determination unit 208, a thumbnail conversion unit 211, an illustration conversion unit 213, a three-dimensional data input unit 214, a keyword conversion unit 215, and a registered character input unit 216.

これら各部は、ＨＤ５０４からＲＡＭ５０３上に展開されたプログラム５００ａに従ったＣＰＵ５０１からの命令によって動作することで実現される機能または手段である。 Each of these units is a function or means that is realized by operating according to instructions from the CPU 501 in accordance with the program 500a loaded onto the RAM 503 from the HD 504.

また、イラスト表示部２００は、選択傾向蓄積部２０９、サムネイル蓄積部２１０、イラスト蓄積部２１２を備える。これら各部は、ＲＯＭ５０２またはＲＡＭ５０３またはＨＤ５０４によって構築される。 The illustration display unit 200 also includes a selection tendency storage unit 209, a thumbnail storage unit 210, and an illustration storage unit 212. Each of these units is constructed using a ROM 502, a RAM 503, or a HD 504.

サムネイル蓄積部２１０及びイラスト蓄積部２１２へのデータ蓄積について図４及び図５を用いて説明する。図５は、画像データの変換を示すデータフロー図である。 The storage of data in the thumbnail storage unit 210 and the illustration storage unit 212 will be explained using Figures 4 and 5. Figure 5 is a data flow diagram showing the conversion of image data.

３次元データ入力部２１４は、３次元データが入力される。３次元データは、例えば図５（ａ）に示すようなポリゴンデータであり、複数の空間座標を含む。３次元データは、３次元画像データフォーマット（例えば、ＯＢＪフォーマット）に対応している。３次元データ入力部２１４は、３次元データをイラスト化部２１３へ供給する。 The three-dimensional data input unit 214 receives three-dimensional data. The three-dimensional data is, for example, polygon data as shown in FIG. 5(a) and includes multiple spatial coordinates. The three-dimensional data corresponds to a three-dimensional image data format (for example, OBJ format). The three-dimensional data input unit 214 supplies the three-dimensional data to the illustration unit 213.

イラスト化部２１３は、３次元データを３次元イラストデータに変換する。３次元イラストデータは、例えば図５（ｂ）に示すような３次元的な線画データであり、複数の空間座標を含む。３次元イラストデータは、３次元画像データフォーマット（例えば、ＯＢＪフォーマット）に対応している。イラスト化部２１３は、３次元データで示される３次元形状のエッジの３次元位置を特定して線画でつなぎ合わせることなどにより、３次元データから線画の情報を抽出し、３次元イラストデータを生成する。イラスト化部２１３は、３次元イラストデータをイラスト蓄積部２１２に追加的に格納する。 The illustration conversion unit 213 converts the three-dimensional data into three-dimensional illustration data. The three-dimensional illustration data is, for example, three-dimensional line drawing data as shown in FIG. 5(b) and includes multiple spatial coordinates. The three-dimensional illustration data corresponds to a three-dimensional image data format (for example, the OBJ format). The illustration conversion unit 213 extracts line drawing information from the three-dimensional data, for example by identifying the three-dimensional positions of the edges of the three-dimensional shape represented by the three-dimensional data and connecting them with line drawing, thereby generating three-dimensional illustration data. The illustration conversion unit 213 additionally stores the three-dimensional illustration data in the illustration storage unit 212.

これにより、イラスト蓄積部２１２には、図６に示すようなイラスト情報２１２ａが蓄積される。図６は、イラスト蓄積部２１２に蓄積されるイラスト情報２１２ａのデータ構造を示す図である。イラスト情報２１２ａは、３次元イラストデータとその識別情報とが１以上の３次元イラストデータについて対応付けられている。例えば、イラスト情報２１２ａは、識別情報欄２１２ａ１及びアクセス情報欄２１２ａ２を有する。識別情報欄２１２ａ１には、３次元イラストデータを識別するための情報が記録され、例えば３次元イラストデータのＩＤ番号が記録される。アクセス情報欄２１２ａ２には、３次元イラストデータにアクセスするための情報が記録され、例えば３次元イラストデータのファイル名が記録される。 As a result, illustration information 212a as shown in FIG. 6 is stored in the illustration storage unit 212. FIG. 6 is a diagram showing the data structure of illustration information 212a stored in illustration storage unit 212. In illustration information 212a, three-dimensional illustration data and its identification information are associated with one or more pieces of three-dimensional illustration data. For example, illustration information 212a has an identification information column 212a1 and an access information column 212a2. In identification information column 212a1, information for identifying the three-dimensional illustration data is recorded, for example, an ID number of the three-dimensional illustration data is recorded. In access information column 212a2, information for accessing the three-dimensional illustration data is recorded, for example, a file name of the three-dimensional illustration data is recorded.

サムネイル化部２１１は、イラスト蓄積部２１２に３次元イラストデータが追加されたタイミングで、又は、所定の周期ごとに、３次元イラストデータをイラスト蓄積部２１２から取得し、３次元イラストデータを２次元サムネイルデータに変換する。２次元サムネイルデータは、例えば図５（ｃ）に示すような２次元的な線画データであり、３次元イラストデータを縮小し２次元化されたことに相当する線画データである。２次元サムネイルデータは、２次元画像データフォーマット（例えば、ＢＭＰフォーマット）に対応している。２次元サムネイルデータは、３次元イラストデータの識別情報に関連付けられる。サムネイル化部２１１は、３次元イラストデータに含まれた複数の空間座標が所定の平面に投影された複数の平面座標を求め、求められた複数の平面座標に応じた２次元的な線画データを縮小することなどにより、３次元イラストデータから縮小及び２次元化された２次元サムネイルデータを生成する。サムネイル化部２１１は、２次元サムネイルデータを３次元イラストデータの識別情報に関連付けた形でサムネイル蓄積部２１０に追加的に格納する。 The thumbnail generating unit 211 acquires the three-dimensional illustration data from the illustration storage unit 212 when the three-dimensional illustration data is added to the illustration storage unit 212 or at predetermined intervals, and converts the three-dimensional illustration data into two-dimensional thumbnail data. The two-dimensional thumbnail data is, for example, two-dimensional line drawing data as shown in FIG. 5(c), and is line drawing data equivalent to the three-dimensional illustration data being reduced and two-dimensionalized. The two-dimensional thumbnail data corresponds to a two-dimensional image data format (for example, BMP format). The two-dimensional thumbnail data is associated with the identification information of the three-dimensional illustration data. The thumbnail generating unit 211 obtains a plurality of plane coordinates in which a plurality of spatial coordinates included in the three-dimensional illustration data are projected onto a predetermined plane, and reduces the two-dimensional line drawing data corresponding to the obtained plurality of plane coordinates, thereby generating two-dimensional thumbnail data that has been reduced and two-dimensionalized from the three-dimensional illustration data. The thumbnail generating unit 211 additionally stores the two-dimensional thumbnail data in the thumbnail storage unit 210 in a form associated with the identification information of the three-dimensional illustration data.

これにより、サムネイル蓄積部２１０には、図７に示すようなサムネイル情報２１０ａが蓄積される。図７は、サムネイル蓄積部２１０に蓄積されるサムネイル情報２１０ａのデータ構造を示す図である。サムネイル情報２１０ａは、キーワードと３次元イラストデータと２次元サムネイルデータとが１以上の２次元サムネイルデータについて対応付けられた情報である。サムネイル情報２１０ａは、第１の対応情報として、キーワードと２次元サムネイルデータとが対応付けられた情報を含む。図７に示されるように、一つの２次元サムネイルデータに対して複数のキーワードが設定されていてもよい。また、サムネイル情報２１０ａは、第２の対応情報として、２次元サムネイルデータと３次元イラストデータとが対応付けられた情報を含む。例えば、サムネイル情報２１０ａは、キーワード欄２１０ａ１、識別情報欄２１０ａ２及びアクセス情報欄２１０ａ３を有する。キーワード欄２１０ａ１には、２次元サムネイルデータが呼び出されるためのキーワードが記録されるが、キーワードが未登録の状態では空欄になっている。識別情報欄２１０ａ２には、２次元サムネイルデータに紐づけられた３次元イラストデータを識別するための情報が記録され、例えば３次元イラストデータのＩＤ番号が記録される。アクセス情報欄２１０ａ３には、２次元サムネイルデータにアクセスするための情報が記録され、例えば２次元サムネイルデータのファイル名が記録される。 As a result, thumbnail information 210a as shown in FIG. 7 is stored in the thumbnail storage unit 210. FIG. 7 is a diagram showing the data structure of thumbnail information 210a stored in the thumbnail storage unit 210. Thumbnail information 210a is information in which a keyword, three-dimensional illustration data, and two-dimensional thumbnail data are associated with one or more pieces of two-dimensional thumbnail data. Thumbnail information 210a includes information in which a keyword and two-dimensional thumbnail data are associated with each other as first association information. As shown in FIG. 7, multiple keywords may be set for one piece of two-dimensional thumbnail data. Furthermore, thumbnail information 210a includes information in which two-dimensional thumbnail data and three-dimensional illustration data are associated with each other as second association information. For example, thumbnail information 210a has a keyword column 210a1, an identification information column 210a2, and an access information column 210a3. A keyword for calling up two-dimensional thumbnail data is recorded in the keyword column 210a1, but it is blank when no keyword is registered. The identification information field 210a2 records information for identifying the 3D illustration data linked to the 2D thumbnail data, for example, an ID number for the 3D illustration data. The access information field 210a3 records information for accessing the 2D thumbnail data, for example, a file name for the 2D thumbnail data.

登録文字入力部２１６は、イラスト蓄積部２１２に３次元イラストデータが追加されたタイミングで、又は、所定の周期ごとに、３次元イラストデータに紐づけるべき文字情報が入力される。登録文字入力部２１６は、文字情報を３次元イラストデータの識別情報に関連付けられた形で受け付ける。このとき、登録文字入力部２１６は、点線の矢印で示すように、イラスト蓄積部２１２から文字情報が紐づけるべき候補となる複数の３次元イラストデータの識別情報を取得して、複数の３次元イラストデータの識別情報から識別情報が選択入力されてもよい。登録文字入力部２１６は、文字情報を３次元イラストデータの識別情報に関連付けられた形でキーワード化部２１５へ供給する。 The registered character input unit 216 receives character information to be linked to the three-dimensional illustration data when three-dimensional illustration data is added to the illustration storage unit 212, or at predetermined intervals. The registered character input unit 216 accepts the character information in a form associated with identification information of the three-dimensional illustration data. At this time, as indicated by the dotted arrow, the registered character input unit 216 may obtain identification information of multiple three-dimensional illustration data that are candidates to be linked to the character information from the illustration storage unit 212, and select and input identification information from the identification information of the multiple three-dimensional illustration data. The registered character input unit 216 supplies the character information to the keyword unit 215 in a form associated with the identification information of the three-dimensional illustration data.

キーワード化部２１５は、文字情報を３次元イラストデータの識別情報に関連付けられた形で受けると、サムネイル蓄積部２１０にアクセスして、３次元イラストデータの識別情報に対応したキーワード欄２１０ａ１に文字情報を追加的に書き込む。 When the keyword generation unit 215 receives the text information associated with the identification information of the three-dimensional illustration data, it accesses the thumbnail storage unit 210 and writes the text information into the keyword field 210a1 that corresponds to the identification information of the three-dimensional illustration data.

これにより、サムネイル蓄積部２１０に蓄積されるサムネイル情報２１０ａにおいて、キーワード欄２１０ａ１には、図７に示すように、１以上のキーワードが追加的に記録される。 As a result, in the thumbnail information 210a stored in the thumbnail storage unit 210, one or more keywords are additionally recorded in the keyword column 210a1, as shown in FIG. 7.

ここで、２次元サムネイルデータのデータサイズは、３次元イラストデータのデータサイズより大幅に小さい。言語情報からのイラスト検索時にデータサイズの軽いサムネイルを使用することで画像呼び出し時のタイムラグを最小限に抑えることを可能とする。 The data size of the 2D thumbnail data is significantly smaller than the data size of the 3D illustration data. By using thumbnails with a small data size when searching for illustrations based on language information, it is possible to minimize the time lag when retrieving images.

選択傾向蓄積部２０９へのデータ蓄積について図４を用いて説明する。受信部２０１は、言語情報認識結果を受信すると、言語情報認識結果を文脈解析部２０７へ供給する。文脈解析部２０７は、言語情報認識結果を基に文脈を解析する。また、サムネイル選択部２０４での選択情報を選択傾向蓄積部２０９でユーザの選択傾向として蓄積する。この文脈解析部２０７の解析結果と選択傾向蓄積部２０９の蓄積結果とを基に、優先度決定部２０８は、２次元サムネイルデータを表示する際の優先度を決定し、決定結果をサムネイル表示部２０３へ供給する。サムネイル表示部２０３は、優先度決定部２０８の決定結果に応じて、優先度の高い順に２次元サムネイルデータをディスプレイに表示させる。 The data storage in the selection tendency storage unit 209 will be described with reference to FIG. 4. When the receiving unit 201 receives the language information recognition result, it supplies the language information recognition result to the context analysis unit 207. The context analysis unit 207 analyzes the context based on the language information recognition result. The selection information in the thumbnail selection unit 204 is stored as the user's selection tendency in the selection tendency storage unit 209. Based on the analysis result of the context analysis unit 207 and the storage result of the selection tendency storage unit 209, the priority determination unit 208 determines the priority when displaying the two-dimensional thumbnail data, and supplies the determination result to the thumbnail display unit 203. The thumbnail display unit 203 displays the two-dimensional thumbnail data on the display in descending order of priority according to the determination result of the priority determination unit 208.

優先度の例としては、言語情報としての異国語に対応して決定された優先度であってもよい。優先度決定部２０８は、言語の種類に応じて、出現サムネイルが変化されるような優先度を決定しても良い。例えば、アフリカ語で「学校」という言葉を検知した際には、日本の一般的な学校を示すイラストではなくアフリカに多く見られる学校を示すイラストが優先的に表示されるように優先度が決定される。 An example of the priority may be a priority determined in response to a foreign language as language information. The priority determination unit 208 may determine a priority such that the thumbnails that appear are changed depending on the type of language. For example, when the word "school" is detected in an African language, the priority is determined such that an illustration showing a school that is commonly seen in Africa is preferentially displayed rather than an illustration showing a typical school in Japan.

イラストの表示方法について図４を用いて説明する。受信部２０１は、言語情報認識結果を受信すると、言語情報認識結果をサムネイル検索部２０２へ供給する。サムネイル検索部２０２は、言語情報認識結果を基に２次元サムネイルデータを検索する。サムネイル検索部２０２は、サムネイル蓄積部２１０にアクセスし、言語情報認識結果に応じた文字情報をキーワードとしてサムネイル情報２１０ａを検索し、その文字情報（例えば、キーワード）に対応した１以上の２次元サムネイルデータとそれに関連付けられた３次元イラストデータの識別情報とを検索結果として呼び出しサムネイル表示部２０３へ供給する。第１の表示制御手段としてのサムネイル表示部２０３は、検索された１以上の２次元サムネイルデータを接続部３経由で描画表示部４００（図１参照）へ供給する。これに応じて、描画表示部４００は、検索された１以上の２次元サムネイルデータをクライアント端末１のディスプレイの特定場所に表示させる。選択手段としてのサムネイル選択部２０４は、ユーザによる選択操作、一例としてディスプレイ上に表示された２次元サムネイルデータをポインティングデバイスで選択する操作に応じて、ディスプレイに表示された１以上の２次元サムネイルデータから２次元サムネイルデータを選択し、選択された２次元サムネイルデータに関連付けられた３次元イラストデータの識別情報をイラスト検索部２０５へ供給する。イラスト検索部２０５は、３次元イラストデータの識別情報を基に３次元イラストデータを検索する。イラスト検索部２０５は、イラスト蓄積部２１２にアクセスし、その識別情報に対応した３次元イラストデータを検索結果として呼び出しイラスト表示部２０６へ供給する。第２の表示制御手段としてのイラスト表示部２０６は、検索された３次元イラストデータを接続部３経由で描画表示部４００（図１参照）へ供給する。これに応じて、描画表示部４００は、検索された３次元イラストデータをクライアント端末１のディスプレイ上に表示させる。 The method of displaying the illustration will be described with reference to FIG. 4. When the receiving unit 201 receives the language information recognition result, it supplies the language information recognition result to the thumbnail search unit 202. The thumbnail search unit 202 searches for two-dimensional thumbnail data based on the language information recognition result. The thumbnail search unit 202 accesses the thumbnail storage unit 210, searches for thumbnail information 210a using character information corresponding to the language information recognition result as a keyword, and calls up one or more two-dimensional thumbnail data corresponding to the character information (e.g., a keyword) and identification information of the three-dimensional illustration data associated therewith as search results, and supplies them to the thumbnail display unit 203. The thumbnail display unit 203 as the first display control means supplies the searched one or more two-dimensional thumbnail data to the drawing display unit 400 (see FIG. 1) via the connection unit 3. In response to this, the drawing display unit 400 displays the searched one or more two-dimensional thumbnail data in a specific location on the display of the client terminal 1. The thumbnail selection unit 204 as a selection means selects two-dimensional thumbnail data from one or more two-dimensional thumbnail data displayed on the display in response to a selection operation by the user, for example, an operation of selecting two-dimensional thumbnail data displayed on the display with a pointing device, and supplies identification information of three-dimensional illustration data associated with the selected two-dimensional thumbnail data to the illustration search unit 205. The illustration search unit 205 searches for three-dimensional illustration data based on the identification information of the three-dimensional illustration data. The illustration search unit 205 accesses the illustration storage unit 212, retrieves the three-dimensional illustration data corresponding to the identification information as a search result, and supplies it to the illustration display unit 206. The illustration display unit 206 as a second display control means supplies the searched three-dimensional illustration data to the drawing display unit 400 (see FIG. 1) via the connection unit 3. In response to this, the drawing display unit 400 displays the searched three-dimensional illustration data on the display of the client terminal 1.

図１に示す描画操作部３００は、機能的に、図８に示すように構成され得る。図８は、描画操作部３００の構成を示す図である。 The drawing operation unit 300 shown in FIG. 1 can be functionally configured as shown in FIG. 8. FIG. 8 is a diagram showing the configuration of the drawing operation unit 300.

描画操作部３００は、イラスト編集部３０１、固定化部３０２、描画部３０３、手描き入力部３０４、及び出力部３０５を備える。 The drawing operation unit 300 includes an illustration editing unit 301, a fixation unit 302, a drawing unit 303, a hand-drawn input unit 304, and an output unit 305.

第１の編集手段（３次元データ編集入力手段）としてのイラスト編集部３０１は、３次元イラストデータをイラスト表示部２００から受け、３次元イラストデータに対する３次元的な編集処理を行う。イラスト編集部３０１は、３次元的な編集処理において、ユーザによる、３次元的な回転操作、３次元的な移動操作、３次元的な拡大操作、３次元的な縮小操作などを受け付け、それらの操作要求を描画部３０３、出力部３０５、接続部３経由で描画表示部４００（図１参照）へ供給する。３次元的な回転操作は、３次元イラストデータに含まれた複数の空間座標がそれらの相対的な位置関係を維持しながら所定の軸周りに３次元的に回転されるように変更される操作である。３次元的な拡大操作は、３次元イラストデータに含まれた複数の空間座標が所定の点から等しい距離割合で放射状に遠ざかるように変更される操作である。３次元的な縮小操作は、３次元イラストデータに含まれた複数の空間座標が所定の点に対して等しい距離割合で放射状に近づくように変更される操作である。これらの操作要求に応じて、描画表示部４００は、クライアント端末１の描画部３０３における３次元イラストデータの表示形態を変更する。これにより、クライアント端末１のディスプレイ上における３次元イラストデータの位置、大きさ、向きが３次元的に変更され得る。 The illustration editing unit 301 as the first editing means (three-dimensional data editing input means) receives three-dimensional illustration data from the illustration display unit 200 and performs three-dimensional editing processing on the three-dimensional illustration data. In the three-dimensional editing processing, the illustration editing unit 301 accepts three-dimensional rotation operations, three-dimensional movement operations, three-dimensional enlargement operations, three-dimensional reduction operations, etc. by the user, and supplies these operation requests to the drawing display unit 400 (see FIG. 1) via the drawing unit 303, the output unit 305, and the connection unit 3. The three-dimensional rotation operation is an operation in which multiple spatial coordinates included in the three-dimensional illustration data are changed so that they are rotated three-dimensionally around a predetermined axis while maintaining their relative positional relationship. The three-dimensional enlargement operation is an operation in which multiple spatial coordinates included in the three-dimensional illustration data are changed so that they move radially away from a predetermined point at an equal distance rate. The three-dimensional reduction operation is an operation in which multiple spatial coordinates included in the three-dimensional illustration data are changed so that they move radially closer to a predetermined point at an equal distance rate. In response to these operation requests, the drawing display unit 400 changes the display format of the three-dimensional illustration data in the drawing unit 303 of the client terminal 1. This allows the position, size, and orientation of the three-dimensional illustration data on the display of the client terminal 1 to be changed three-dimensionally.

変換手段としての固定化部３０２は、所定の操作（例えば、２次元的な編集処理が活性化される操作、より具体的には２次元的な編集処理が可能な状態へ移行する操作）に応じて、３次元的な編集処理が可能な３次元イラストデータを２次元的な編集処理が可能な２次元イラストデータへ変換（固定化）する。２次元イラストデータは、図５（ｄ）に示すような２次元的な線画データであり、３次元イラストデータに３次元的な編集処理が施され２次元化されたことに相当する線画データである。２次元イラストデータは、２次元画像データフォーマット（例えば、ＢＭＰフォーマット）に対応している。固定化部３０２は、３次元イラストデータに含まれた複数の空間座標がディスプレイの画面に対応した平面に投影された複数の平面座標を求め、求められた複数の平面座標に応じた２次元的な線画データを生成することなどにより、３次元イラストデータを２次元イラストデータへ変換して描画部３０３に固定する。 The fixing unit 302 as a conversion means converts (fixes) three-dimensional illustration data that can be edited in three dimensions into two-dimensional illustration data that can be edited in two dimensions in response to a predetermined operation (for example, an operation that activates two-dimensional editing, more specifically, an operation that transitions to a state where two-dimensional editing is possible). The two-dimensional illustration data is two-dimensional line drawing data as shown in FIG. 5(d), and is line drawing data that corresponds to three-dimensional illustration data that has been subjected to three-dimensional editing and turned into two dimensions. The two-dimensional illustration data corresponds to a two-dimensional image data format (for example, BMP format). The fixing unit 302 obtains a plurality of plane coordinates in which a plurality of spatial coordinates included in the three-dimensional illustration data are projected onto a plane corresponding to the display screen, and generates two-dimensional line drawing data corresponding to the obtained plurality of plane coordinates, thereby converting the three-dimensional illustration data into two-dimensional illustration data and fixing it to the drawing unit 303.

第２の編集手段（２次元データ編集入力手段）としての２次元データ編集入力部３０４は、２次元イラストデータに対する２次元的な編集処理を行う。２次元データ編集入力部３０４は、２次元的な編集処理において、ユーザによる、２次元的な回転操作、２次元的な移動操作、２次元的な拡大操作、２次元的な縮小操作、一部を消す操作、手書きの線画を付け加える操作、色を塗る操作などを受け付け、それらの操作要求を描画部３０３、出力部３０５、接続部３経由で描画表示部４００（図１参照）へ供給する。２次元的な回転操作は、２次元イラストデータに含まれた複数の平面座標がそれらの相対的な位置関係を維持しながら所定の点周りに２次元的に回転されるように変更される操作である。２次元的な拡大操作は、２次元イラストデータに含まれた複数の平面座標が所定の点から等しい距離割合で放射状に遠ざかるように変更される操作である。２次元的な縮小操作は、２次元イラストデータに含まれた複数の平面座標が所定の点に対して等しい距離割合で放射状に近づくように変更される操作である。一部を消す操作は、２次元イラストデータに含まれた複数の平面座標の一部が削除される操作である。手書きの線画を付け加える操作は、２次元イラストデータに含まれた複数の平面座標に、手書きの線画に対応した複数の平面座標が追加される操作である。色を塗る操作は、２次元イラストデータに含まれた複数の平面座標に、所定の色属性に紐づけられた複数の平面座標が追加される操作である。これらの操作要求に応じて、描画表示部４００は、クライアント端末１の描画部３０３における２次元イラストデータの表示形態を変更する。これにより、クライアント端末１のディスプレイ上における２次元イラストデータの位置、大きさ、向きが２次元的に変更されたり、２次元イラストデータの一部が消されたり、２次元イラストデータに手書きの線画が付け加えたり、２次元イラストデータに色が塗られたりする。 The two-dimensional data editing input unit 304 as the second editing means (two-dimensional data editing input means) performs two-dimensional editing processing on the two-dimensional illustration data. In the two-dimensional editing processing, the two-dimensional data editing input unit 304 accepts two-dimensional rotation operations, two-dimensional movement operations, two-dimensional enlargement operations, two-dimensional reduction operations, partial erasure operations, operations to add hand-drawn lines, coloring operations, and the like, performed by the user, and supplies these operation requests to the drawing display unit 400 (see FIG. 1) via the drawing unit 303, the output unit 305, and the connection unit 3. The two-dimensional rotation operation is an operation in which multiple planar coordinates included in the two-dimensional illustration data are changed so as to rotate two-dimensionally around a specified point while maintaining their relative positional relationship. The two-dimensional enlargement operation is an operation in which multiple planar coordinates included in the two-dimensional illustration data are changed so as to move radially away from a specified point at equal distance ratios. The two-dimensional reduction operation is an operation in which multiple plane coordinates included in the two-dimensional illustration data are changed so that they approach a predetermined point radially at an equal distance ratio. The partial erasure operation is an operation in which a portion of multiple plane coordinates included in the two-dimensional illustration data is deleted. The operation of adding handwritten lines is an operation in which multiple plane coordinates corresponding to handwritten lines are added to multiple plane coordinates included in the two-dimensional illustration data. The coloring operation is an operation in which multiple plane coordinates associated with a predetermined color attribute are added to multiple plane coordinates included in the two-dimensional illustration data. In response to these operation requests, the drawing display unit 400 changes the display form of the two-dimensional illustration data in the drawing unit 303 of the client terminal 1. As a result, the position, size, and orientation of the two-dimensional illustration data on the display of the client terminal 1 are changed two-dimensionally, a portion of the two-dimensional illustration data is erased, handwritten lines are added to the two-dimensional illustration data, or the two-dimensional illustration data is colored.

描画部３０３には、マウスや指やスタイラスペンやジェスチャーなどによって入力も可能である。描画部３０３の情報を出力部３０５にて描画表示部４００に出力する。これに応じて、描画表示部４００は、クライアント端末１の描画部３０３における手書き入力された線画等が追加される。これにより、クライアント端末１のディスプレイ上に手書き入力された線画等が表示される。 Input to the drawing unit 303 is also possible using a mouse, a finger, a stylus pen, gestures, etc. Information from the drawing unit 303 is output to the drawing display unit 400 by the output unit 305. In response to this, the drawing display unit 400 adds line drawings, etc. handwritten input in the drawing unit 303 of the client terminal 1. As a result, the handwritten input line drawings, etc. are displayed on the display of the client terminal 1.

これら各部は、ＨＤ５０４からＲＡＭ５０３上に展開されたプログラムに従ったＣＰＵ５０１からの命令によって動作することで実現される機能または手段である。 Each of these units is a function or means that is realized by operating according to instructions from the CPU 501 in accordance with a program loaded onto the RAM 503 from the HD 504.

次に、ビジュアルコミュニケーションシステム４の動作について図９を用いて説明する。図９は、ビジュアルコミュニケーションシステムの動作を示すフローチャートである。 Next, the operation of the visual communication system 4 will be described with reference to FIG. 9. FIG. 9 is a flowchart showing the operation of the visual communication system.

ビジュアルコミュニケーションシステム４は、コミュニケーションを支援するための準備として、所定の情報が登録される登録処理（Ｓ１）を行う。その後、ビジュアルコミュニケーションシステム４は、ユーザから起動要求があるまで（Ｓ２でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、ユーザから起動要求があると（Ｓ２でＹｅｓ）、プログラム５００ａを起動し、コミュニケーション支援処理（Ｓ３）を開始する。ビジュアルコミュニケーションシステム４は、ユーザから終了要求があるまで（Ｓ４でＮｏ）コミュニケーション支援処理（Ｓ３）を継続的に行う。ビジュアルコミュニケーションシステム４は、ユーザから終了要求があると（Ｓ４でＹｅｓ）、処理を終了する。 The visual communication system 4 performs a registration process (S1) in which predetermined information is registered in preparation for supporting communication. The visual communication system 4 then waits until a startup request is received from the user (No in S2). When a startup request is received from the user (Yes in S2), the visual communication system 4 starts the program 500a and starts the communication support process (S3). The visual communication system 4 continues to perform the communication support process (S3) until a termination request is received from the user (No in S4). When a termination request is received from the user (Yes in S4), the visual communication system 4 terminates the process.

次に、登録処理（Ｓ１）の詳細について図１０を用いて説明する。図１０は、登録処理の流れを示すフローチャートである。 Next, the details of the registration process (S1) will be explained using FIG. 10. FIG. 10 is a flowchart showing the flow of the registration process.

ビジュアルコミュニケーションシステム４は、プログラム５００ａが起動されると、３次元データの登録要求があるまで（Ｓ１１でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、３次元データの登録要求があると（Ｓ１１でＹｅｓ）、３次元データが入力される（Ｓ１２）。３次元データは、例えばポリゴンデータ（図５（ａ）参照）である。ビジュアルコミュニケーションシステム４は、３次元データをイラスト化する（Ｓ１３）。すなわち、ビジュアルコミュニケーションシステム４は、３次元データを３次元イラストデータに変換する。３次元イラストデータは、例えば３次元的な線画データ（図５（ｂ）参照）である。ビジュアルコミュニケーションシステム４は、３次元イラストデータをイラスト蓄積部２１２に追加的に格納する。これにより、イラスト蓄積部２１２には、イラスト情報２１２ａ（図６参照）が登録される。 When program 500a is started, visual communication system 4 waits until a request to register three-dimensional data is received (No in S11). When a request to register three-dimensional data is received (Yes in S11), the three-dimensional data is input to visual communication system 4 (S12). The three-dimensional data is, for example, polygon data (see FIG. 5(a)). The visual communication system 4 converts the three-dimensional data into three-dimensional illustration data (S13). That is, visual communication system 4 converts the three-dimensional data into three-dimensional illustration data. The three-dimensional illustration data is, for example, three-dimensional line drawing data (see FIG. 5(b)). The visual communication system 4 additionally stores the three-dimensional illustration data in illustration storage unit 212. As a result, illustration information 212a (see FIG. 6) is registered in illustration storage unit 212.

ビジュアルコミュニケーションシステム４は、３次元イラストデータをサムネイル化する（Ｓ１４）。すなわち、ビジュアルコミュニケーションシステム４は、３次元イラストデータを２次元サムネイルデータに変換する。２次元サムネイルデータは、例えば２次元的な線画データ（図５（ｃ）参照）である。ビジュアルコミュニケーションシステム４は、２次元サムネイルデータを３次元イラストデータの識別情報に関連付けた形でサムネイル蓄積部２１０に追加的に格納する。これにより、サムネイル蓄積部２１０に、サムネイル情報２１０ａ（図７参照）における２次元サムネイルデータと３次元イラストデータの識別情報とが登録される。 The visual communication system 4 thumbnails the three-dimensional illustration data (S14). That is, the visual communication system 4 converts the three-dimensional illustration data into two-dimensional thumbnail data. The two-dimensional thumbnail data is, for example, two-dimensional line drawing data (see FIG. 5(c)). The visual communication system 4 additionally stores the two-dimensional thumbnail data in the thumbnail storage unit 210 in a form associated with identification information of the three-dimensional illustration data. As a result, the two-dimensional thumbnail data and the identification information of the three-dimensional illustration data in the thumbnail information 210a (see FIG. 7) are registered in the thumbnail storage unit 210.

そして、ビジュアルコミュニケーションシステム４は、キーワードの登録要求があるまで（Ｓ１５でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、キーワードの登録要求があると（Ｓ１５でＹｅｓ）、文字情報が３次元イラストデータの識別情報に関連付けられた形で入力される（Ｓ１６）。ビジュアルコミュニケーションシステム４は、文字情報をキーワード化する（Ｓ１７）。すなわち、ビジュアルコミュニケーションシステム４は、サムネイル蓄積部２１０にアクセスして、３次元イラストデータの識別情報に対応したキーワード欄２１０ａ１に文字情報を追加的に書き込む。これにより、サムネイル蓄積部２１０に、サムネイル情報２１０ａ（図７参照）における文字情報（キーワード）が登録される。なお、登録処理とコミュニケーション支援処理は、図９に示したように両方行う必要はなく、その時点でサムネイル蓄積部２１０、イラスト蓄積部２１２に登録された情報に基づき、コミュニケーション支援処理のみ実行することも可能である。 Then, the visual communication system 4 waits until a keyword registration request is made (No in S15). When the visual communication system 4 receives a keyword registration request (Yes in S15), the text information is input in a form associated with the identification information of the three-dimensional illustration data (S16). The visual communication system 4 converts the text information into a keyword (S17). That is, the visual communication system 4 accesses the thumbnail storage unit 210 and additionally writes the text information in the keyword field 210a1 corresponding to the identification information of the three-dimensional illustration data. As a result, the text information (keyword) in the thumbnail information 210a (see FIG. 7) is registered in the thumbnail storage unit 210. Note that it is not necessary to perform both the registration process and the communication support process as shown in FIG. 9, and it is also possible to execute only the communication support process based on the information registered in the thumbnail storage unit 210 and the illustration storage unit 212 at that time.

次に、コミュニケーション支援処理（Ｓ３）の詳細について図１１、図１２を用いて説明する。図１１、図１２は、コミュニケーション支援処理の流れを示すフローチャートである。図１１の処理と図１２の処理とは、互いに並行して行われ得る。 Next, the details of the communication support process (S3) will be described with reference to Figs. 11 and 12. Figs. 11 and 12 are flowcharts showing the flow of the communication support process. The process of Fig. 11 and the process of Fig. 12 can be performed in parallel with each other.

図１１の処理において、ビジュアルコミュニケーションシステム４は、プログラム５００ａが起動され、ディスプレイに初期画面が表示されると、初期画面を介して、言語情報入力機能のＯＮ要求があるまで（Ｓ２１でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、言語情報入力機能のＯＮ要求があると（Ｓ２１でＹｅｓ）、言語情報の入力があるまで（Ｓ２２でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、言語情報の入力があると（Ｓ２２でＹｅｓ）、言語情報が音声情報であるか否かを判断する（Ｓ２３）。ビジュアルコミュニケーションシステム４は、言語情報が音声情報であれば（Ｓ２３でＹｅｓ）、音声情報に対して音声認識処理を行い文字情報へ変換し（Ｓ２４）、その文字情報を認識結果とする。ビジュアルコミュニケーションシステム４は、言語情報が音声情報でなければ（Ｓ２３でＮｏ）、言語情報が手書き文字画像であるか否かを判断する（Ｓ２５）。ビジュアルコミュニケーションシステム４は、言語情報が手書き文字画像であれば（Ｓ２５でＹｅｓ）、手書き文字画像に対してテキスト認識処理を行い文字情報へ変換し（Ｓ２６）、その文字情報を認識結果とする。ビジュアルコミュニケーションシステム４は、言語情報が手書き文字画像でなければ、すなわちキーボード５４１へのタイピングによる文字情報であれば（Ｓ２５でＮｏ）、その文字情報を認識結果とし、処理をＳ２７へ進める。 In the process of FIG. 11, when the program 500a is started and an initial screen is displayed on the display, the visual communication system 4 waits until there is a request to turn on the language information input function via the initial screen (No in S21). When there is a request to turn on the language information input function (Yes in S21), the visual communication system 4 waits until language information is input (No in S22). When language information is input (Yes in S22), the visual communication system 4 determines whether the language information is voice information (S23). If the language information is voice information (Yes in S23), the visual communication system 4 performs voice recognition processing on the voice information to convert it into character information (S24), and the character information is the recognition result. If the language information is not voice information (No in S23), the visual communication system 4 determines whether the language information is a handwritten character image (S25). If the language information is a handwritten character image (Yes in S25), the visual communication system 4 performs text recognition processing on the handwritten character image to convert it into character information (S26), and sets the character information as the recognition result. If the language information is not a handwritten character image, that is, if the language information is character information typed on the keyboard 541 (No in S25), the visual communication system 4 sets the character information as the recognition result, and the process proceeds to S27.

ビジュアルコミュニケーションシステム４は、サムネイル蓄積部２１０にアクセスし、認識結果である文字情報をキーワードとして２次元サムネイルデータを検索する（Ｓ２７）。ビジュアルコミュニケーションシステム４は、検索された１以上の２次元サムネイルデータをディスプレイの特定場所に表示させる（Ｓ２８）。 The visual communication system 4 accesses the thumbnail storage unit 210 and searches for two-dimensional thumbnail data using the character information that is the recognition result as a keyword (S27). The visual communication system 4 displays one or more pieces of searched two-dimensional thumbnail data in a specific location on the display (S28).

ビジュアルコミュニケーションシステム４は、言語情報入力機能のＯＦＦ要求があるまで（Ｓ２９でＮｏ）、Ｓ２２～Ｓ２８の処理が高速に繰り返され得る。すなわち、ビジュアルコミュニケーションシステム４は、ユーザからの言語情報が受け付けられるたびに、表示画面上のサムネイル画像を新しく切り替えて高速に更新表示できる。これにより、ユーザが会話を止めずにその中から、発話又は文字入力された際の頭の中の情景に近い１以上のサムネイル画像を選択できる。 The visual communication system 4 can rapidly repeat the processes of S22 to S28 until a request to turn off the language information input function is received (No in S29). In other words, the visual communication system 4 can rapidly update and display new thumbnail images on the display screen every time language information is received from the user. This allows the user to select one or more thumbnail images that are closest to the scene in their mind when they speak or input text, without having to stop the conversation.

ビジュアルコミュニケーションシステム４は、言語情報入力機能のＯＦＦ要求があると（Ｓ２９でＹｅｓ）、図１１の処理を終了する。 When a request to turn off the language information input function is received (Yes in S29), the visual communication system 4 ends the processing of FIG. 11.

図１２の処理において、ビジュアルコミュニケーションシステム４は、Ｓ２８で表示された１以上の２次元サムネイルデータのうちの２次元サムネイルデータが選択されるまで（Ｓ３１でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、Ｓ２８で表示された１以上の２次元サムネイルデータのうちの２次元サムネイルデータが選択されると（Ｓ３１でＹｅｓ）、イラスト蓄積部２１２にアクセスし、選択された２次元サムネイルデータに関連付けられた３次元イラストデータを検索する（Ｓ３２）。ビジュアルコミュニケーションシステム４は、検索された３次元イラストデータをディスプレイに表示させる（Ｓ３３）。 In the process of FIG. 12, the visual communication system 4 waits until 2D thumbnail data is selected from the one or more 2D thumbnail data displayed in S28 (No in S31). When 2D thumbnail data is selected from the one or more 2D thumbnail data displayed in S28 (Yes in S31), the visual communication system 4 accesses the illustration storage unit 212 and searches for 3D illustration data associated with the selected 2D thumbnail data (S32). The visual communication system 4 displays the searched 3D illustration data on the display (S33).

ビジュアルコミュニケーションシステム４は、３次元イラストデータに対する編集操作があるまで（Ｓ３４でＮｏ）待機し、３次元イラストデータに対する編集操作があると（Ｓ３４でＹｅｓ）、第１の編集処理を行う（Ｓ３５）。第１の編集処理は、３次元的な編集処理である。ビジュアルコミュニケーションシステム４は、第１の編集処理において、３次元的な回転操作、３次元的な移動操作、３次元的な拡大操作、３次元的な縮小操作などを受け付ける。それらの操作要求に応じて、ビジュアルコミュニケーションシステム４は、ディスプレイに表示された３次元イラストデータの表示形態を変更する（Ｓ３６）。ビジュアルコミュニケーションシステム４は、３次元イラストデータが２次元イラストデータに固定化されるまで（Ｓ３７でＮｏ）、Ｓ３４～Ｓ３６の処理を繰り返す。 The visual communication system 4 waits until an editing operation is performed on the three-dimensional illustration data (No in S34), and when an editing operation is performed on the three-dimensional illustration data (Yes in S34), it performs a first editing process (S35). The first editing process is a three-dimensional editing process. In the first editing process, the visual communication system 4 accepts three-dimensional rotation operations, three-dimensional movement operations, three-dimensional enlargement operations, three-dimensional reduction operations, and the like. In response to these operation requests, the visual communication system 4 changes the display form of the three-dimensional illustration data displayed on the display (S36). The visual communication system 4 repeats the processes of S34 to S36 until the three-dimensional illustration data is fixed to two-dimensional illustration data (No in S37).

ビジュアルコミュニケーションシステム４は、３次元イラストデータが２次元イラストデータに固定化されると（Ｓ３７でＹｅｓ）、２次元イラストデータに対する編集操作があるまで（Ｓ３８でＮｏ）待機する。ビジュアルコミュニケーションシステム４は、２次元イラストデータに対する編集操作があると（Ｓ３８でＹｅｓ）、第２の編集処理を行う（Ｓ３９）。第２の編集処理は、２次元的な編集処理である。ビジュアルコミュニケーションシステム４は、第２の編集処理において、２次元的な回転操作、２次元的な移動操作、２次元的な拡大操作、２次元的な縮小操作、一部を消す操作、手書きの線画を付け加える操作、色を塗る操作などを受け付ける。それらの操作要求に応じて、ビジュアルコミュニケーションシステム４は、ディスプレイに表示された２次元イラストデータの表示形態を変更する（Ｓ４０）。ビジュアルコミュニケーションシステム４は、編集完了の要求があるまで（Ｓ４１でＮｏ）、Ｓ３８～Ｓ４０の処理を繰り返す。すなわち、Ｓ３４～Ｓ３７のループによる１段階目の編集処理とＳ３８～Ｓ４１のループによる２段階目の編集処理とにより、発話又は文字入力された際の頭の中の情景に近いイラストを短時間で正確に表現できる。 When the three-dimensional illustration data is fixed to two-dimensional illustration data (Yes in S37), the visual communication system 4 waits until an editing operation is performed on the two-dimensional illustration data (No in S38). When an editing operation is performed on the two-dimensional illustration data (Yes in S38), the visual communication system 4 performs a second editing process (S39). The second editing process is a two-dimensional editing process. In the second editing process, the visual communication system 4 accepts two-dimensional rotation operations, two-dimensional movement operations, two-dimensional enlargement operations, two-dimensional reduction operations, partial erasure operations, operations to add hand-drawn lines, coloring operations, and the like. In response to these operation requests, the visual communication system 4 changes the display form of the two-dimensional illustration data displayed on the display (S40). The visual communication system 4 repeats the processes of S38 to S40 until a request to complete editing is received (No in S41). In other words, the first stage of editing processing in the loop of S34 to S37 and the second stage of editing processing in the loop of S38 to S41 can quickly and accurately express an illustration that closely resembles the scene that occurs in the user's mind when the speech or text is entered.

ビジュアルコミュニケーションシステム４は、編集完了の要求があると（Ｓ４１でＹｅｓ）、図１２の処理を終了する。 When a request to complete editing is received (Yes in S41), the visual communication system 4 ends the processing of FIG. 12.

次に、ビジュアルコミュニケーションシステム４によりクライアント端末１のディスプレイに表示される画面（ビジュアルコミュニケーションシステム４による表示画面）の構成について図１３を用いて説明する。図１３は、ビジュアルコミュニケーションシステム４による表示画面の構成を示す図である。 Next, the configuration of the screen displayed on the display of the client terminal 1 by the visual communication system 4 (display screen by the visual communication system 4) will be described with reference to FIG. 13. FIG. 13 is a diagram showing the configuration of the display screen by the visual communication system 4.

ビジュアルコミュニケーションシステム４による表示画面は、図１３に示すように、描画結果表示エリアＳ１００、操作パレットＳ２００、サムネイル表示エリアＳ３００を含む。 As shown in FIG. 13, the display screen of the visual communication system 4 includes a drawing result display area S100, an operation palette S200, and a thumbnail display area S300.

描画結果表示エリアＳ１００は、描画表示部４００（図１参照）の結果を出力するエリアである。操作パレットＳ２００は、音声入力部１０１、文字入力部１０２（図３参照）、手描き入力部３０４（図８参照）などの各入力部を呼び出すために使用する機能が配置されている。サムネイル表示エリアＳ３００は、認識部１０３（図３参照）によって認識された言語情報と、サムネイル表示部２０３（図４参照）によって出力された２次元サムネイルデータが表示される。 The drawing result display area S100 is an area where the results of the drawing display unit 400 (see FIG. 1) are output. The operation palette S200 has functions used to call up each input unit, such as the voice input unit 101, the character input unit 102 (see FIG. 3), and the hand-drawn input unit 304 (see FIG. 8). The thumbnail display area S300 displays the language information recognized by the recognition unit 103 (see FIG. 3) and the two-dimensional thumbnail data output by the thumbnail display unit 203 (see FIG. 4).

図５で示す各エリアのレイアウトおよび意匠形状はあくまで一例であり、権利範囲を制限するものではないものとする。例えばＳ３００に円状の枠で囲まれた８つのサムネイルが表示されているが、枠の有無と形状や表示数については制限しないものとする。また、認識された言語情報も表示しなくても良いものとする。 The layout and design shape of each area shown in Figure 5 is merely an example and does not limit the scope of rights. For example, eight thumbnails surrounded by circular frames are displayed in S300, but there are no restrictions on the presence or absence of frames, their shape, or the number displayed. In addition, recognized language information does not need to be displayed.

次に、ビジュアルコミュニケーションシステム４によりクライアント端末１のディスプレイに表示される画面（ビジュアルコミュニケーションシステム４による表示画面）の概略動作について図１４を用いて説明する。図１４は、ビジュアルコミュニケーションシステム４による表示画面の概略動作を示す図である。図１４では、言語情報を基に２次元サムネイルデータが呼び出される動作について示す。 Next, the general operation of the screen displayed on the display of the client terminal 1 by the visual communication system 4 (display screen by the visual communication system 4) will be described with reference to FIG. 14. FIG. 14 is a diagram showing the general operation of the display screen by the visual communication system 4. FIG. 14 shows the operation of calling up two-dimensional thumbnail data based on language information.

図１４（ａ）に示す言語情報認識ボタンＳ２０１を押す操作を検知すると、ビジュアルコミュニケーションシステム４は、言語情報認識モードに移行する。ビジュアルコミュニケーションシステム４は、言語情報認識モード時に、発話、手書き、タイピングなどによって言語情報（第１の言語情報）を取得すると、サムネイル表示エリアＳ３００にて言語情報認識結果および認識結果に紐づく１以上の２次元サムネイルデータが表示される。また、２次元データ編集ボタンＳ２０２を押す操作を検知すると、図１３の描画結果表示エリアＳ１００を編集可能な、描画結果編集モードに移行する。 When the visual communication system 4 detects an operation of pressing the language information recognition button S201 shown in FIG. 14(a), it transitions to a language information recognition mode. In the language information recognition mode, when the visual communication system 4 acquires language information (first language information) by speech, handwriting, typing, etc., it displays the language information recognition result and one or more pieces of two-dimensional thumbnail data linked to the recognition result in the thumbnail display area S300. In addition, when the visual communication system 4 detects an operation of pressing the two-dimensional data editing button S202, it transitions to a drawing result editing mode in which the drawing result display area S100 in FIG. 13 can be edited.

例えば、図１４（ａ）では、ビジュアルコミュニケーションシステム４は、描画結果表示エリアＳ１００上に描画操作された手書き文字を認識し、認識結果の文字情報とそれに紐づけられた１以上の２次元サムネイルデータとをサムネイル表示エリアＳ３００に表示した状態が示されている。 For example, FIG. 14(a) shows the visual communication system 4 recognizing handwritten characters drawn in the drawing result display area S100, and displaying the character information of the recognition result and one or more pieces of two-dimensional thumbnail data associated with the character information in the thumbnail display area S300.

ビジュアルコミュニケーションシステム４は、さらに、発話、手書き、タイピングなどによって言語情報（第２の言語情報）を取得すると、サムネイル表示エリアＳ３００にて表示されている１以上の２次元サムネイルデータの少なくとも一部が変更されて言語情報認識結果および認識結果に紐づく１以上の２次元サムネイルデータが更新表示される。 When the visual communication system 4 further acquires language information (second language information) by speech, handwriting, typing, etc., at least a portion of the one or more two-dimensional thumbnail data displayed in the thumbnail display area S300 is changed, and the language information recognition result and the one or more two-dimensional thumbnail data linked to the recognition result are updated and displayed.

例えば、図１４（ａ）の状態で図１４（ｂ）の様な発話を認識すると、ビジュアルコミュニケーションシステム４は、認識結果の文字情報に応じて、サムネイル表示エリアＳ３００の状態をリアルタイムに変化させる。図１４（ｂ）では、ビジュアルコミュニケーションシステム４が、新たな認識結果の文字情報とそれに紐づけられた１以上の２次元サムネイルデータとに基づきサムネイル表示エリアＳ３０１に更新表示した状態が示されている。 For example, when the visual communication system 4 recognizes an utterance such as that shown in FIG. 14(b) in the state shown in FIG. 14(a), the visual communication system 4 changes the state of the thumbnail display area S300 in real time according to the character information of the recognition result. FIG. 14(b) shows the state in which the visual communication system 4 updates and displays in the thumbnail display area S301 based on the character information of the new recognition result and one or more pieces of two-dimensional thumbnail data associated with it.

具体的には、新たに入力された言語情報（第２の言語情報）に対応するサムネイル画像に応じて、すでに表示されたサムネイル画像の表示のうち、少なくとも一部を変更してディスプレイ（表示手段）に表示させる。 Specifically, in accordance with the thumbnail image corresponding to the newly input language information (second language information), at least a portion of the display of the thumbnail image already displayed is changed and displayed on the display (display means).

すでに表示されたサムネイル画像の表示のうち、少なくとも一部を変更とは、例として、新たに入力された言語情報に対応するサムネイル画像をすでに表示されたサムネイル画像に追加して表示するためにすでに表示されているサムネイル画像の位置や大きさを変更したり、新たに入力された言語情報に対応するサムネイル画像を既に表示された画像に代えて表示するために削除したり、既に表示されているサムネイル画像の周囲の画像を変更する等である。 Changing at least a portion of the display of a thumbnail image that is already displayed means, for example, changing the position or size of a thumbnail image that is already displayed in order to add a thumbnail image that corresponds to newly input language information to the thumbnail image that is already displayed, deleting a thumbnail image that corresponds to newly input language information in order to display it in place of the image that is already displayed, or changing the image around the thumbnail image that is already displayed.

なお、発話だけでなく、新しい手書き文字やタイピングした文字を認識する度に、ビジュアルコミュニケーションシステム４は、サムネイル表示エリアＳ３００の表示を変化させても良い。 In addition to speech, the visual communication system 4 may change the display in the thumbnail display area S300 each time it recognizes new handwritten or typed characters.

また、ビジュアルコミュニケーションシステム４は、２次元サムネイルデータがサムネイル表示エリアＳ３００に表示しきれない場合は、古いものから順番に新しく検索された２次元サムネイルデータに置き換わるようにサムネイル表示エリアＳ３００に表示してもよい。 In addition, if the 2D thumbnail data cannot be displayed in the thumbnail display area S300, the visual communication system 4 may display the 2D thumbnail data in the thumbnail display area S300 in order from oldest to newest, replacing the oldest.

図１４では、言語情報認識モードへの切り替えスイッチを言語情報認識ボタンＳ２０１としているが、ボタンではなく特定単語の発話やコマンド入力等、手法は限定しないものとする。 In FIG. 14, the switch to switch to the language information recognition mode is the language information recognition button S201, but the method is not limited to this and can be, for example, speaking specific words or inputting commands instead of using a button.

図１４に例示されるように、ビジュアルコミュニケーションシステム４は、ユーザからの言語情報が受け付けられるたびに、表示画面上のサムネイル画像を新しく切り替えて高速に更新表示できる。これにより、ユーザが会話を止めずにその中から、発話又は文字入力された際の頭の中の情景に近い１以上のサムネイル画像を選択できる。 As illustrated in FIG. 14, the visual communication system 4 can quickly update and display new thumbnail images on the display screen every time language information is received from the user. This allows the user to select one or more thumbnail images that are closest to the scene in their mind when they speak or input text, without having to stop the conversation.

次に、ビジュアルコミュニケーションシステム４によるイラストの２段階の編集処理の流れについて図１５を用いて説明する。図１５は、２段階の編集処理の流れを示す図である。 Next, the flow of the two-stage illustration editing process by the visual communication system 4 will be explained with reference to FIG. 15. FIG. 15 is a diagram showing the flow of the two-stage editing process.

図１５（ａ）では、ビジュアルコミュニケーションシステム４は、言語情報「人」に応じて、「人」の文字情報ＬＩとそれに紐づけられた２次元サムネイルデータＳＭ１～ＳＭ６とがサムネイル表示エリアＳ３００に表示する。 In FIG. 15(a), the visual communication system 4 displays the character information LI of "person" and the two-dimensional thumbnail data SM1 to SM6 associated therewith in the thumbnail display area S300 in response to the linguistic information "person."

２次元サムネイルデータＳＭ５の選択操作を受けると、ビジュアルコミュニケーションシステム４は、図１５（ｂ）に示すように、２次元サムネイルデータＳＭ５に紐づけられた３次元イラストデータを呼び出して描画結果表示エリアＳ１００に表示する。 When the two-dimensional thumbnail data SM5 is selected, the visual communication system 4 calls up the three-dimensional illustration data linked to the two-dimensional thumbnail data SM5 and displays it in the drawing result display area S100, as shown in FIG. 15(b).

３次元的な編集処理において、ビジュアルコミュニケーションシステム４は、３次元的な回転操作、３次元的な移動操作、３次元的な拡大操作、３次元的な縮小操作などを受け付け、それらの操作要求に応じて、図１５（ｃ）に示すように、３次元イラストデータの表示形態を３次元的に変化させる。 In three-dimensional editing processing, the visual communication system 4 accepts three-dimensional rotation operations, three-dimensional movement operations, three-dimensional enlargement operations, three-dimensional reduction operations, and the like, and in response to these operation requests, changes the display form of the three-dimensional illustration data three-dimensionally, as shown in FIG. 15(c).

所定のトリガーとなる操作（例えば、２次元的な編集操作のためのボタン（例えば、図１４（ａ）に示す２次元データ編集ボタンＳ２０２）が押されることなど）を受けて、ビジュアルコミュニケーションシステム４は、図１５（ｄ）に示すように、３次元イラストデータを２次元イラストデータに固定化する。 When a predetermined triggering operation (e.g., pressing a button for a two-dimensional editing operation (e.g., the two-dimensional data editing button S202 shown in FIG. 14(a)) is received, the visual communication system 4 fixes the three-dimensional illustration data to two-dimensional illustration data, as shown in FIG. 15(d).

２次元的な編集処理において、ビジュアルコミュニケーションシステム４は、２次元的な回転操作、２次元的な移動操作、２次元的な拡大操作、２次元的な縮小操作、一部を消す操作、手書きの線画を付け加える操作、色を塗る操作などを受け付け、それらの操作要求に応じて、図１５（ｅ）に示すように、２次元イラストデータの表示形態を２次元的に変化させる。 In two-dimensional editing processing, the visual communication system 4 accepts two-dimensional rotation operations, two-dimensional movement operations, two-dimensional enlargement operations, two-dimensional reduction operations, partial erasure operations, operations to add hand-drawn line drawings, coloring operations, and the like, and in response to these operation requests, changes the display form of the two-dimensional illustration data two-dimensionally, as shown in FIG. 15(e).

図１５に例示されるように、１段階目の編集処理（図１５（ｂ）、図１５（ｃ））と２段階目の編集処理（図１５（ｅ））とにより、発話又は文字入力された際の頭の中の情景に近いイラストを短時間で正確に表現できる。 As shown in FIG. 15, the first stage of editing (FIG. 15(b) and FIG. 15(c)) and the second stage of editing (FIG. 15(e)) can quickly and accurately express an illustration that closely resembles the scene that appears in the user's mind when the speech or text is input.

次に、ビジュアルコミュニケーションシステム４によりクライアント端末１のディスプレイに表示される画面（ビジュアルコミュニケーションシステム４による表示画面）の詳細動作について図１６～図２３を用いて説明する。図１６～図２３は、それぞれ、ビジュアルコミュニケーションシステム４による表示画面の詳細動作を示す図である。図１４では、言語情報を基に２次元サムネイルデータが呼び出され、サムネイルを選択し、イラスト編集を行う動作について示す。 Next, the detailed operation of the screen (display screen by visual communication system 4) displayed on the display of client terminal 1 by visual communication system 4 will be described with reference to Figs. 16 to 23. Figs. 16 to 23 are diagrams each showing the detailed operation of the display screen by visual communication system 4. Fig. 14 shows the operation of calling up two-dimensional thumbnail data based on language information, selecting a thumbnail, and performing illustration editing.

図１６（ａ）に示す操作パレットＳ２００における音声入力ボタンＳ２０１１が押されたことを検知すると、ビジュアルコミュニケーションシステム４は、言語情報認識機能をＯＮさせ、図１６（ｂ）に示すように、音声入力が待機状態にあることを示すアイコンＳ３０２１をサムネイル表示エリアＳ３００に表示する。なお、操作パレットＳ２００におけるキー入力ボタン２０１２が押されると、キーボードによるタイピング入力が可能な状態になり、タイピング入力が待機状態であることを示すアイコンが表示される。これら、アイコンＳ３０２１やタイピング入力が待機状態であることを示すアイコンは、言語情報認状態表示アイコンの一例である。 When it is detected that the voice input button S2011 in the operation palette S200 shown in FIG. 16(a) has been pressed, the visual communication system 4 turns on the language information recognition function, and displays an icon S3021 indicating that voice input is in a standby state in the thumbnail display area S300, as shown in FIG. 16(b). When the key input button 2012 in the operation palette S200 is pressed, typing input via the keyboard becomes possible, and an icon indicating that typing input is in a standby state is displayed. These icons S3021 and the icon indicating that typing input is in a standby state are examples of language information recognition state display icons.

「人と乗り物」と発話されたことを検知すると、ビジュアルコミュニケーションシステム４は、図１７に示すように、「人と乗り物」の文字情報ＬＩ１とそれに紐づけられた２次元サムネイルデータＳＭ１１～ＳＭ１８とをサムネイル表示エリアＳ３００に表示する。なお、文字情報と紐づけられたサムネイルデータとは、「人と乗り物」という文字情報全体と紐づけられたサムネイルデータでもよいし、「人」「乗り物」等のキーワードに分解し、それぞれのキーワードと紐づけられたサムネイルデータからなる群であってもよい。 When it is detected that "people and vehicles" has been spoken, the visual communication system 4 displays the text information LI1 of "people and vehicles" and the two-dimensional thumbnail data SM11 to SM18 linked to it in the thumbnail display area S300, as shown in FIG. 17. Note that the thumbnail data linked to the text information may be thumbnail data linked to the entire text information of "people and vehicles," or may be a group of thumbnail data broken down into keywords such as "people" and "vehicles" and linked to each keyword.

さらに「動物」と発話されたことを検知すると、ビジュアルコミュニケーションシステム４は、図１８に示すように、「人と乗り物動物」の文字情報ＬＩ２とそれに紐づけられた２次元サムネイルデータＳＭ２１～ＳＭ２８とをサムネイル表示エリアＳ３００に更新表示する。すなわち、ビジュアルコミュニケーションシステム４は、言語情報の入力を検知する度に、リアルタイムで２次元サムネイルデータを更新させて表示する。なお、文字情報と紐づけられたサムネイルデータとは、「人と乗り物動物」という文字情報全体と紐づけられたサムネイルデータでもよいし、「人」「乗り物」「動物」等のキーワードに分解し、それぞれのキーワードと紐づけられたサムネイルデータからなる群であってもよい。 When it detects that "animals" has been further uttered, the visual communication system 4 updates and displays the text information LI2 of "people and vehicles and animals" and the two-dimensional thumbnail data SM21 to SM28 linked to it in the thumbnail display area S300, as shown in FIG. 18. That is, each time the visual communication system 4 detects the input of linguistic information, it updates and displays the two-dimensional thumbnail data in real time. Note that the thumbnail data linked to the text information may be thumbnail data linked to the entire text information of "people and vehicles and animals", or it may be a group of thumbnail data linked to each keyword that has been broken down into keywords such as "people", "vehicles", and "animals".

図１７、図１８に例示されるように、ビジュアルコミュニケーションシステム４は、ユーザからの言語情報が受け付けられるたびに、表示画面上のサムネイル画像を新しく切り替えて高速に更新表示できる。これにより、ユーザが会話を止めずにその中から、発話又は文字入力された際の頭の中の情景に近い１以上のサムネイル画像を選択できる。 As illustrated in Figures 17 and 18, the visual communication system 4 can quickly update and display new thumbnail images on the display screen every time language information is received from the user. This allows the user to select one or more thumbnail images that are closest to the scene in their mind when they speak or input text, without having to stop the conversation.

図１７の画面で２次元サムネイルデータＳＭ１４が選択されたことを検知すると、ビジュアルコミュニケーションシステム４は、図１９に示すように、２次元サムネイルデータＳＭ１４に紐づけられた３次元イラストデータＩＬ１を呼び出して描画結果表示エリアＳ１００に表示する。 When it is detected that the 2D thumbnail data SM14 has been selected on the screen of FIG. 17, the visual communication system 4 calls up the 3D illustration data IL1 linked to the 2D thumbnail data SM14 and displays it in the drawing result display area S100, as shown in FIG. 19.

３次元的な編集処理において、ビジュアルコミュニケーションシステム４は、３次元的な回転操作、３次元的な移動操作、３次元的な拡大操作、３次元的な縮小操作などを受け付け、それらの操作要求に応じて、図２０に示すように、３次元イラストデータＩＬ１の表示形態を３次元的に変化させる。 In the three-dimensional editing process, the visual communication system 4 accepts three-dimensional rotation operations, three-dimensional movement operations, three-dimensional enlargement operations, three-dimensional reduction operations, and the like, and in response to these operation requests, changes the display form of the three-dimensional illustration data IL1 three-dimensionally, as shown in FIG. 20.

２次元的な編集処理のためのボタン（加筆ボタンＳ２０１３，色塗りボタンＳ２０１４，消しゴムボタンＳ２０１５）のいずれかが押されたことを検知すると、ビジュアルコミュニケーションシステム４は、図２１に示すように、３次元イラストデータＩＬ１が２次元イラストデータＩＬ２に固定化される。本実施形態では一例として、この２次元的な編集処理のためのボタンが押されたことを言語報認識機能のＯＦＦ要求受付と判断し、２次元的な編集処理が可能な状態へと移行する。 When the visual communication system 4 detects that any of the buttons for two-dimensional editing (the add button S2013, the color button S2014, the eraser button S2015) has been pressed, the three-dimensional illustration data IL1 is fixed to two-dimensional illustration data IL2, as shown in FIG. 21. As an example, in this embodiment, the system determines that the button for two-dimensional editing has been pressed as a request to turn off the language recognition function, and transitions to a state in which two-dimensional editing is possible.

２次元的な編集処理において、ビジュアルコミュニケーションシステム４は、２次元的な回転操作、２次元的な移動操作、２次元的な拡大操作、２次元的な縮小操作、一部を消す操作、手書きの線画を付け加える操作、色を塗る操作などを受け付け、それらの操作要求に応じて、図２１、図２２、図２３に示すように、２次元イラストデータＩＬ２の表示形態を２次元的に変化させる。 In two-dimensional editing processing, the visual communication system 4 accepts two-dimensional rotation operations, two-dimensional movement operations, two-dimensional enlargement operations, two-dimensional reduction operations, partial erasure operations, operations to add hand-drawn line drawings, coloring operations, and the like, and in response to these operation requests, changes the display form of the two-dimensional illustration data IL2 two-dimensionally as shown in Figures 21, 22, and 23.

図２１では、加筆ボタンＳ２０１３により、２次元イラストデータＩＬ２の背景となる風景の線画が手書きで追加され、色塗りボタンＳ２０１４により、３次元イラストデータＩＬ２に色が塗られる。 In FIG. 21, the retouch button S2013 is used to add handwritten line drawings of the scenery that will form the background of the two-dimensional illustration data IL2, and the color button S2014 is used to add color to the three-dimensional illustration data IL2.

また図２１では、図１９、図２０においてサムネイル表示エリアＳ３００に表示されていた、言語情報認状態表示アイコン、文字情報、サムネイルデータは非表示となっている。サムネイル表示エリアＳ３００の各種情報を非表示とするタイミングは、２次元的な編集処理のためのボタン（加筆ボタンＳ２０１３，色塗りボタンＳ２０１４，消しゴムボタンＳ２０１５）のいずれかが押されたことを検知したタイミング、固定化が実行されたタイミング、ユーザにより２次元的な編集集処理が開始されたタイミング等、適宜選択できる。 In addition, in Fig. 21, the language information recognition status display icon, character information, and thumbnail data that were displayed in the thumbnail display area S300 in Figs. 19 and 20 are not displayed. The timing for hiding the various information in the thumbnail display area S300 can be appropriately selected, such as when it is detected that any of the buttons for two-dimensional editing processes (the Add button S2013, the Color button S2014, the Eraser button S2015) has been pressed, when fixation is performed, when two-dimensional editing processes are started by the user, etc.

また、２次元的な編集を実施した後も、音声入力ボタンＳ２０１１、キー入力ボタンＳ２０１２を押すことで、言語情報認識機能をＯＮさせ、サムネイル表示エリアＳ３００に各種情報を表示させて、２次元的な編集が行われた画像上に新たな３次元イラストデータを表示、編集し、固定化された新たな２次元イラストデータを追加可能である。 Even after performing two-dimensional editing, the voice input button S2011 or the key input button S2012 can be pressed to turn on the language information recognition function and display various information in the thumbnail display area S300, allowing new three-dimensional illustration data to be displayed and edited on the image that has been two-dimensionally edited, and new fixed two-dimensional illustration data to be added.

図２２では、消しゴムボタンＳ２０１５により、２次元イラストデータＩＬ２’における線画の一部が削除される。 In FIG. 22, the eraser button S2015 is used to delete part of the line drawing in the two-dimensional illustration data IL2'.

図２３では、加筆ボタンＳ２０１３により、２次元イラストデータＩＬ２”に線画が追加される。 In FIG. 23, line art is added to the two-dimensional illustration data IL2" by using the Add button S2013.

図１９～図２３に例示されるように、１段階目の編集処理（図１９、図２０）と２段階目の編集処理（図２１～図２３）とにより、発話又は文字入力された際の頭の中の情景に近いイラストを短時間で正確に表現できる。 As shown in Figures 19 to 23, the first stage of editing (Figures 19 and 20) and the second stage of editing (Figures 21 to 23) allow an illustration that closely resembles the scene that occurs in the user's mind when the speech or text is input to be accurately expressed in a short period of time.

以上のように、本実施形態では、ビジュアルコミュニケーションシステム４において、ユーザから受け付けた言語情報に対応するイラストを検索して表示し、表示されたイラストに対して表示形態を変更させながら２段階の編集処理を可能とする。これにより、コミュニケーションの場面における正確な情報伝達を支援できる。 As described above, in this embodiment, the visual communication system 4 searches for and displays illustrations that correspond to the language information received from the user, and enables a two-stage editing process while changing the display form of the displayed illustration. This can support accurate information transmission in communication situations.

なお、ビジュアルコミュニケーションシステム４の考え方は、オンライン会議システム２４に適用されてもよい。オンライン会議システム２４は、図２４に示すように構成され得る。図２４は、実施形態の変形例にかかるオンライン会議システム２４の構成を示す図であり、描画表示部４００として、オンライン会議ツールを利用する例を示す。 The concept of the visual communication system 4 may be applied to an online conference system 24. The online conference system 24 may be configured as shown in FIG. 24. FIG. 24 is a diagram showing the configuration of an online conference system 24 according to a modified example of the embodiment, and shows an example in which an online conference tool is used as the drawing display unit 400.

オンライン会議システム２４は、複数のクライアント端末２１ａ，２１ｂ、通信監理サーバ２２、及び接続部２３を有する。接続部２３は、複数のクライアント端末２１ａ，２１ｂ、通信監理サーバ２２を互いに通信可能に接続する。複数のクライアント端末２１ａ，２１ｂは、描画表示部４００により実現される表示画面を画面共有することができる。 The online conference system 24 has multiple client terminals 21a, 21b, a communication management server 22, and a connection unit 23. The connection unit 23 connects the multiple client terminals 21a, 21b and the communication management server 22 so that they can communicate with each other. The multiple client terminals 21a, 21b can share a display screen realized by the drawing display unit 400.

各クライアント端末２１ａ，２１ｂは、言語情報入力部１００ａ，１００ｂ、イラスト表示部２００ａ，２００ｂ、描画操作部３００ａ，３００ｂを有する。言語情報入力部１００ａ，１００ｂ、イラスト表示部２００ａ，２００ｂ、描画操作部３００ａ，３００ｂの機能及び動作は、それぞれ、実施形態における言語情報入力部１００、イラスト表示部２００、描画操作部３００の機能及び動作と同様である。 Each client terminal 21a, 21b has a language information input unit 100a, 100b, an illustration display unit 200a, 200b, and a drawing operation unit 300a, 300b. The functions and operations of the language information input unit 100a, 100b, the illustration display unit 200a, 200b, and the drawing operation unit 300a, 300b are similar to the functions and operations of the language information input unit 100, the illustration display unit 200, and the drawing operation unit 300 in the embodiment, respectively.

通信監理サーバ２２は、描画表示部４００及び記憶部５００に加えて、通信管理部６００を有する。描画表示部４００及び記憶部５００の機能及び動作は、それぞれ、実施形態における描画表示部４００及び記憶部５００の機能及び動作と同様である。 The communication management server 22 has a communication management unit 600 in addition to a drawing display unit 400 and a memory unit 500. The functions and operations of the drawing display unit 400 and the memory unit 500 are similar to those of the drawing display unit 400 and the memory unit 500 in the embodiment, respectively.

通信管理部６００は、会議参加者である複数のクライアント端末２１ａ，２１ｂそれぞれから受信した音声やカメラ画像を、他のクライアント端末に送信して管理する。描画表示部４００は、オンライン会議のカメラ画像に重畳して描画表示する。通信管理部６００は、重畳された画像を他のクライアント端末に送信する。 The communication management unit 600 transmits the audio and camera images received from each of the multiple client terminals 21a and 21b, which are conference participants, to the other client terminals and manages them. The drawing display unit 400 draws and displays the images by superimposing them on the camera images of the online conference. The communication management unit 600 transmits the superimposed images to the other client terminals.

重畳は画像の一部に重畳しても良いし、画像全体に重畳しても良い。また描画した画像だけでなく、Ｓ２００，Ｓ３００等も同時に重畳しても良い。 The overlay may be applied to a portion of the image, or to the entire image. In addition to the drawn image, S200, S300, etc. may also be overlaid at the same time.

例えば、図２５～図２７に示すように、ユーザが映っているカメラ画像と、動作取得デバイスにより取得されたユーザのジェスチャーによる描画を重畳させることもできる。図２５は、実施形態の変形例におけるクライアント端末２１ａのカメラ５２０によって撮像されたカメラ画像を示す図であり、クライアント端末２１ａのユーザが映っているカメラ画像を例示している。図２６は、実施形態の変形例における描画表示部４００が図２５のカメラ画像に重畳する画像を示す図である。図２６では、描画結果表示エリアＳ１００、操作パレットＳ２００、サムネイル表示エリアＳ３００のうち、操作パレットＳ２００が重畳されず、描画結果表示エリアＳ１００、サムネイル表示エリアＳ３００が重畳された例が示されている。図２７は、実施形態の変形例における他のクライアント端末２１ｂで表示される画像を示す図である。この場合、有るクライアント端末においてユーザが自分のディスプレイに対して行っている指先による手書きが、他のクライアント端末では、ユーザの指先によってカメラ画像内に描画が重畳されていく画像として表示される。 For example, as shown in Figs. 25 to 27, a camera image showing a user and a drawing made by a gesture of the user acquired by a motion acquisition device can be superimposed. Fig. 25 is a diagram showing a camera image captured by the camera 520 of the client terminal 21a in a modified embodiment, and illustrates a camera image showing the user of the client terminal 21a. Fig. 26 is a diagram showing an image superimposed on the camera image of Fig. 25 by the drawing display unit 400 in a modified embodiment. Fig. 26 shows an example in which the drawing result display area S100 and the thumbnail display area S300 are superimposed, but the operation palette S200 is not superimposed, among the drawing result display area S100, the operation palette S200, and the thumbnail display area S300. Fig. 27 is a diagram showing an image displayed on another client terminal 21b in a modified embodiment. In this case, handwriting with the fingertip of a user on his/her display at a certain client terminal is displayed as an image in which a drawing is superimposed on the camera image by the user's fingertip at the other client terminal.

このように会話しながらジェスチャー操作でイラストを用いたビジュアルコミュニケーションを実現することが可能である。 In this way, visual communication using illustrations can be achieved through gesture control while talking.

上記で説明した実施形態の、ビジュアルコミュニケーションシステム４、オンライン会議システム２４は、画像編集システムまたは画像表示システムの例である。またクライアント端末１、クライアント端末２１ａ、クライアント端末２１ｂ、サーバ２、通信管理端末２２は、画像編集装置または画像表示装置の例である。 In the embodiment described above, the visual communication system 4 and the online conference system 24 are examples of an image editing system or an image display system. Also, the client terminal 1, the client terminal 21a, the client terminal 21b, the server 2, and the communication management terminal 22 are examples of an image editing device or an image display device.

なお、上記で説明した実施形態の各機能は、一又は複数の処理回路によって実現することが可能である。ここで、本明細書における「処理回路」とは、電子回路により実装されるプロセッサのようにソフトウェアによって各機能を実行するようプログラミングされたプロセッサや、上記で説明した各機能を実行するよう設計されたＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）、ＤＳＰ（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ）、ＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）や従来の回路モジュール等のデバイスを含むものとする。 Each function of the embodiments described above can be realized by one or more processing circuits. Here, the term "processing circuit" in this specification includes a processor programmed to execute each function by software, such as a processor implemented by an electronic circuit, and devices such as an ASIC (Application Specific Integrated Circuit), DSP (Digital Signal Processor), FPGA (Field Programmable Gate Array), and conventional circuit modules designed to execute each function described above.

また、明細書中の対応テーブル（表）は、機械学習の学習効果によって生成されたものでもよい。ここで、機械学習とは、コンピュータに人のような学習能力を獲得させるための技術であり，コンピュータが，データ識別等の判断に必要なアルゴリズムを，事前に取り込まれる学習データから自律的に生成し，新たなデータについてこれを適用して予測を行う技術のことをいう。機械学習のための学習方法は、教師あり学習、教師なし学習、半教師学習、強化学習、深層学習のいずれかの方法でもよく、さらに、これらの学習方法を組み合わせた学習方法でもよく、機械学習のための学習方法は問わない。 The correspondence table in the specification may be one generated by the learning effect of machine learning. Here, machine learning is a technology for enabling a computer to acquire human-like learning capabilities, and refers to a technology in which a computer autonomously generates algorithms required for judgments such as data identification from learning data that is previously loaded, and applies these to new data to make predictions. The learning method for machine learning may be any of supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, and deep learning, or may be a combination of these learning methods; any learning method for machine learning is acceptable.

また、ビジュアルコミュニケーションシステム４又はオンライン会議システム２４で実行されるプログラム５００ａは、ＲＯＭ等に予め組み込まれて提供されてもよい。あるいは、プログラム５００ａは、インストール可能な形式又は実行可能な形式のファイルでＣＤ－ＲＯＭ、フレキシブルディスク（ＦＤ）、ＣＤ－Ｒ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）等のコンピュータで読み取り可能な記録媒体に記録して提供するように構成してもよい。あるいは、プログラム５００ａは、インターネット等のネットワークに接続されたコンピュータ上に格納され、ネットワーク経由でダウンロードされることにより提供するように構成しても良い。また、プログラム５００ａをインターネット等のネットワーク経由で提供または配布するように構成しても良い。 The program 500a executed by the visual communication system 4 or the online conference system 24 may be provided in advance in a ROM or the like. Alternatively, the program 500a may be provided by recording it in an installable or executable file format on a computer-readable recording medium such as a CD-ROM, a flexible disk (FD), a CD-R, or a digital versatile disk (DVD). Alternatively, the program 500a may be provided by being stored on a computer connected to a network such as the Internet and downloaded via the network. The program 500a may also be provided or distributed via a network such as the Internet.

１クライアント端末
２サーバ
３接続部
４ビジュアルコミュニケーションシステム
２１ａ，２１ｂクライアント端末
２２通信監理サーバ
２３接続部
２４オンライン会議システム Reference Signs List 1 Client terminal 2 Server 3 Connection unit 4 Visual communication system 21a, 21b Client terminal 22 Communication management server 23 Connection unit 24 Online conference system

特許第６３３９５２９号公報Patent No. 6339529

Claims

a first input means for receiving voice input information, text input information or motion input information;
a recognition means for recognizing character information from the voice input information, the text input information, or the action input information;
a search means for searching for a plurality of thumbnail images, each of which is associated with image data, by using the recognized character information;
a display means for displaying the retrieved thumbnail images;
a first editing means for performing a first editing process on first image data that is associated with a thumbnail image selected from among the plurality of thumbnail images displayed on the display means and has a first data format;
a second editing means for performing a second editing process on second image data having a second data format different from the first data format among the image data displayed on the display means;
a conversion means for converting the first data format into the second data format,
The display means includes:
when the first editing process is performed by the first editing means, a display of the first image data is changed to a display of image data after the first editing process;
when the second editing process is performed on the second image data converted by the conversion means, a display of the second image data is changed to a display after the second editing process.
Image editing device.

When none of the plurality of thumbnail images displayed on the display means is selected, and the first input means receives second voice input information, second text input information, or second action input information, and second character information is recognized by the recognition means for the second voice input information, the second text input information, or the second action input information, the search means searches for a plurality of second thumbnail images each associated with image data, using the recognized second character information, and the display means displays the searched second thumbnail image in place of the plurality of thumbnail images displayed on the display means.
The image editing device according to claim 1 .

a second input means for receiving a selection operation for designating a thumbnail image to be selected from the plurality of thumbnail images;
a selection means for selecting image data linked to a thumbnail image designated by the selection operation from among a plurality of image data linked to a plurality of thumbnail images displayed on the display means;
Further equipped with
The first editing means performs the first editing process on the first image data when the first image data is selected by the selection means from among a plurality of image data linked to a plurality of thumbnail images displayed on the display means.
The image editing device according to claim 1 .

2. The image editing device according to claim 1, wherein the conversion means converts the first data format into the second data format in response to a state in which the second editing process by the second editing means is enabled.

the first editing means performs the first editing process on the three-dimensional image data displayed on the display means;
the second editing means performs the second editing process on the two-dimensional image data displayed on the display means;
4. The image editing device according to claim 1 , wherein the conversion means converts a three-dimensional image data format into a two-dimensional image data format.

6. The image editing device according to claim 5, wherein the conversion means determines a plurality of planar coordinates projected onto a plane corresponding to the display means from a plurality of spatial coordinates contained in the three-dimensional image data, and generates two -dimensional image data including the plurality of planar coordinates.

the first editing process includes a process corresponding to a three-dimensional editing operation;
The image editing device according to claim 1 , wherein the second editing process includes a process corresponding to a two-dimensional editing operation.

receiving voice or text input information;
recognizing character information from the voice information or the text input information;
using the recognized character information to search for a plurality of thumbnail images, each of which is associated with an image data;
displaying the retrieved thumbnail images on a display means;
a step of performing a first editing process on first image data that is associated with a thumbnail image selected from the plurality of thumbnail images displayed on the display means and has a first data format;
a step of changing a display of the first image data to a display of image data after the first edit processing when the first edit processing is performed in the step of performing the first edit processing;
converting the first data format into a second data format;
a step of performing a second editing process on second image data having the second data format among the image data displayed on the display means;
a step of changing a display of the second image data to a display after the second editing process when the second editing process is applied to the second image data converted in the converting step in a step of applying the second editing process;
Image editing methods including.

receiving voice or text input information;
recognizing character information from the voice information or the text input information;
using the recognized character information to search for a plurality of thumbnail images, each of which is associated with an image data;
displaying the retrieved thumbnail images on a display means;
a step of performing a first editing process on first image data that is associated with a thumbnail image selected from the plurality of thumbnail images displayed on the display means and has a first data format;
a step of changing a display of the first image data to a display of image data after the first edit processing when the first edit processing is performed in the step of performing the first edit processing;
converting the first data format into a second data format;
a step of performing a second editing process on second image data having the second data format among the image data displayed on the display means;
a step of changing a display of the second image data to a display after the second editing process when the second editing process is applied to the second image data converted in the converting step in a step of applying the second editing process;
An image editing program that causes a computer to execute the following: