JP6891744B2

JP6891744B2 - Image map creation device, display device and program

Info

Publication number: JP6891744B2
Application number: JP2017174205A
Authority: JP
Inventors: 伊藤　直之; 直之伊藤; 拓也生駒; ジュリアンピアテック; 錬松山
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2017-09-11
Filing date: 2017-09-11
Publication date: 2021-06-18
Anticipated expiration: 2037-09-11
Also published as: JP2019049898A

Description

本発明は、画像マップ作成装置、表示装置及びプログラムに関する。 The present invention relates to an image map creating device, a display device and a program.

多数のデータからユーザが所望する目的に最適なデータを探し出すことは、容易ではない。そこで、従来、多数のデータから目的に最適なデータを探すことができるように、全体の俯瞰的な位置づけを把握可能なデータのマップを作成する技術が開示されている（例えば、特許文献１）。 It is not easy to find the most suitable data for the purpose desired by the user from a large amount of data. Therefore, conventionally, a technique for creating a data map capable of grasping an overall bird's-eye view position has been disclosed so that the optimum data for a purpose can be searched from a large number of data (for example, Patent Document 1). ..

特開２００８−８４１５１号公報Japanese Unexamined Patent Publication No. 2008-84151

ところで、多数のデータからユーザが所望する目的に最適なデータを探し出すニーズは、文書データに限らない。例えば、多数の画像データからユーザが所望する目的に最適な画像データを探し出したいニーズがある。 By the way, the need to find the most suitable data for the purpose desired by the user from a large amount of data is not limited to document data. For example, there is a need to find the most suitable image data for a purpose desired by a user from a large number of image data.

そこで、本発明は、ユーザが所望する目的に最適な画像を探し出すための画像マップ作成装置、表示装置及びプログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide an image map creation device, a display device, and a program for finding an optimum image for a purpose desired by a user.

本発明は、以下のような解決手段により、前記課題を解決する。
第１の発明は、画像を解析して複数の単語を抽出する単語抽出手段と、前記単語抽出手段により抽出した複数の単語をクラスタリングして分類語を生成する分類語生成手段と、前記分類語生成手段によって生成された分類語をカウントすることによりマップを作成するマップ作成手段と、を備える画像マップ作成装置である。
第２の発明は、第１の発明の画像マップ作成装置において、前記単語抽出手段により抽出した複数の単語を、複数のグループに分ける分類手段を備え、前記分類語生成手段は、前記分類手段により分けられた各グループの単語をクラスタリングして分類語を生成し、前記マップ作成手段は、前記分類手段により分けられた前記複数のグループうち少なくとも２つのグループの各分類語によって分類された前記マップを作成すること、を特徴とする画像マップ作成装置である。
第３の発明は、第１の発明又は第２の発明の画像マップ作成装置において、前記分類語生成手段は、前記単語抽出手段により抽出した複数の単語から、上位概念にあたる語を前記分類語として生成すること、を特徴とする画像マップ作成装置である。
第４の発明は、第１の発明から第３の発明までのいずれかの画像マップ作成装置において、前記分類語生成手段は、前記単語抽出手段により抽出した複数の単語の出現頻度に基づいて、前記分類語を生成すること、を特徴とする画像マップ作成装置である。
第５の発明は、第１の発明から第３の発明までのいずれかの画像マップ作成装置において、画像を解析して前記画像の特徴を表す複数の特徴画像を抽出し、前記画像の領域に対して抽出した前記特徴画像の占める割合を算出する画像割合算出手段を備え、前記単語抽出手段は、前記特徴画像の特徴を表す単語を抽出し、前記分類語生成手段は、前記画像割合算出手段により算出した割合に応じて、前記単語抽出手段により抽出した単語から前記分類語を生成すること、を特徴とする画像マップ作成装置である。
第６の発明は、第１の発明から第５の発明までのいずれかの画像マップ作成装置において、前記単語抽出手段は、前記画像の特徴ごとに関連付いた単語を参照して説明文を生成し、生成した説明文から複数の単語を抽出すること、を特徴とする画像マップ作成装置である。
第７の発明は、第１の発明から第６の発明までのいずれかの画像マップ作成装置において、前記分類語生成手段は、複数の画像の各々から前記単語抽出手段により抽出された複数の単語をクラスタリングして分類語を生成すること、を特徴とする画像マップ作成装置である。
第８の発明は、第１の発明から第７の発明までのいずれかの画像マップ作成装置に対して通信可能に接続された表示装置であって、前記マップ作成手段により作成された前記マップを出力するマップ出力手段と、前記マップ出力手段により出力された前記マップに示され、前記分類語に対応付けられた要素の選択を受け付ける要素受付手段と、前記要素受付手段により受け付けた前記要素に対応する画像を出力する画像出力手段と、を備えること、を特徴とする表示装置である。
第９の発明は、第８の発明の表示装置において、前記画像出力手段は、前記要素受付手段により受け付けた前記要素に対応する画像が複数ある場合に、複数の前記画像の類似度に応じて各画像を配置して、前記複数の画像を関連画像として出力すること、を特徴とする表示装置である。
第１０の発明は、第９の発明の表示装置において、前記類似度は、各画像の特徴量と、各画像に対応した単語との少なくとも一方に基づいて算出されるものであること、を特徴とする表示装置である。
第１１の発明は、第９の発明又は第１０の発明の表示装置において、前記関連画像は、類似度を、画像間の線の長さ若しくは太さ、又は、基準にした画像からの奥行方向の距離によって表したものであること、を特徴とする表示装置である。
第１２の発明は、第１の発明から第７の発明までのいずれかの画像マップ作成装置としてコンピュータを機能させるためのプログラムである。
第１３の発明は、第８の発明から第１１の発明までのいずれかの表示装置としてコンピュータを機能させるためのプログラムである。 The present invention solves the above problems by the following solutions.
The first invention comprises a word extraction means for analyzing an image to extract a plurality of words, a classification word generation means for clustering a plurality of words extracted by the word extraction means to generate a classification word, and the classification word. It is an image map creation device including a map creation means for creating a map by counting the taxonomy generated by the generation means.
The second invention includes, in the image map creating apparatus of the first invention, a classification means for dividing a plurality of words extracted by the word extraction means into a plurality of groups, and the classification word generation means is based on the classification means. The words of each of the divided groups are clustered to generate a classification word, and the map creation means creates the map classified by each classification word of at least two groups out of the plurality of groups divided by the classification means. It is an image map creation device characterized by creating.
The third invention is the image map creating apparatus of the first invention or the second invention, in which the classified word generating means uses a word corresponding to a higher concept as the classified word from a plurality of words extracted by the word extracting means. It is an image map creation device characterized by generating.
The fourth invention is in any of the image map creating devices from the first invention to the third invention, in which the classified word generating means is based on the appearance frequency of a plurality of words extracted by the word extracting means. It is an image map creation device characterized by generating the classification word.
In a fifth aspect of the present invention, in any of the image map creating devices from the first invention to the third invention, an image is analyzed to extract a plurality of feature images representing the features of the image, and the image is located in the area of the image. The image ratio calculating means for calculating the ratio occupied by the feature image extracted with respect to the feature image is provided, the word extracting means extracts a word representing the feature of the feature image, and the classified word generating means is the image ratio calculating means. The image map creating apparatus is characterized in that the classified word is generated from the word extracted by the word extracting means according to the ratio calculated by the above.
A sixth aspect of the present invention is the image map creating apparatus according to any one of the first to fifth inventions, wherein the word extraction means generates an explanatory text by referring to words associated with each feature of the image. It is an image map creation device characterized by extracting a plurality of words from the generated explanatory text.
According to a seventh aspect of the present invention, in any of the image map creating devices from the first invention to the sixth invention, the classified word generating means has a plurality of words extracted from each of the plurality of images by the word extracting means. It is an image map creation device characterized by generating classification words by clustering.
The eighth invention is a display device communicatively connected to any of the image map creating devices from the first invention to the seventh invention, and the map created by the map creating means is displayed. Corresponds to the map output means to be output, the element receiving means shown in the map output by the map output means and accepting the selection of the element associated with the classification term, and the element received by the element receiving means. The display device is provided with an image output means for outputting an image to be output.
According to a ninth aspect of the present invention, in the display device of the eighth invention, when there are a plurality of images corresponding to the elements received by the element receiving means, the image output means responds to the similarity of the plurality of images. It is a display device characterized in that each image is arranged and the plurality of images are output as related images.
A tenth aspect of the invention is characterized in that, in the display device of the ninth invention, the similarity is calculated based on at least one of a feature amount of each image and a word corresponding to each image. It is a display device.
The eleventh invention is the display device of the ninth invention or the tenth invention, in which the related image has a similarity based on the length or thickness of a line between images or the depth direction from an image as a reference. It is a display device characterized by being represented by the distance of.
A twelfth invention is a program for operating a computer as an image map creating device according to any one of the first to seventh inventions.
The thirteenth invention is a program for operating a computer as any display device from the eighth invention to the eleventh invention.

本発明によれば、ユーザが所望する目的に最適な画像を探し出すための画像マップ作成装置、表示装置及びプログラムを提供することができる。 According to the present invention, it is possible to provide an image map creation device, a display device, and a program for finding an optimum image for a desired purpose by a user.

本実施形態に係る画像マップ出力システムの全体概要図である。It is an overall schematic diagram of the image map output system which concerns on this embodiment. 本実施形態に係る画像マップ作成装置及び表示装置の機能ブロック図である。It is a functional block diagram of the image map creation device and the display device which concerns on this embodiment. 本実施形態に係る画像マップ作成装置で作成されるマップの例を示す図である。It is a figure which shows the example of the map created by the image map making apparatus which concerns on this embodiment. 本実施形態に係る画像マップ出力システムのマップ出力処理を示すフローチャートである。It is a flowchart which shows the map output processing of the image map output system which concerns on this embodiment. 本実施形態に係る表示装置での表示例を示す図である。It is a figure which shows the display example in the display device which concerns on this embodiment. 本実施形態に係る画像マップ作成装置のマップ作成処理を示すフローチャートである。It is a flowchart which shows the map creation process of the image map creation apparatus which concerns on this embodiment. 本実施形態に係る画像マップ作成装置によるマップ作成処理での具体例を示す図である。It is a figure which shows the specific example in the map making process by the image map making apparatus which concerns on this embodiment. 本実施形態に係る画像マップ作成装置によるマップ作成処理での具体例を示す図である。It is a figure which shows the specific example in the map making process by the image map making apparatus which concerns on this embodiment. 本実施形態に係る画像マップ出力システムの関連画像出力処理を示すフローチャートである。It is a flowchart which shows the related image output processing of the image map output system which concerns on this embodiment. 本実施形態に係る画像マップ作成装置の関連画像生成処理を示すフローチャートである。It is a flowchart which shows the related image generation processing of the image map making apparatus which concerns on this embodiment. 本実施形態に係る表示装置での表示例を示す図である。It is a figure which shows the display example in the display device which concerns on this embodiment.

以下、本発明を実施するための形態について、図を参照しながら説明する。なお、これは、あくまでも一例であって、本発明の技術的範囲はこれに限られるものではない。
（実施形態）
＜画像マップ出力システム１００＞
図１は、本実施形態に係る画像マップ出力システム１００の全体概要図である。
図２は、本実施形態に係る画像マップ作成装置１及び表示装置４の機能ブロック図である。
図３は、本実施形態に係る画像マップ作成装置１で作成されるマップ８０の例を示す図である。 Hereinafter, embodiments for carrying out the present invention will be described with reference to the drawings. It should be noted that this is only an example, and the technical scope of the present invention is not limited to this.
(Embodiment)
<Image map output system 100>
FIG. 1 is an overall schematic view of the image map output system 100 according to the present embodiment.
FIG. 2 is a functional block diagram of the image map creating device 1 and the display device 4 according to the present embodiment.
FIG. 3 is a diagram showing an example of a map 80 created by the image map creating device 1 according to the present embodiment.

図１に示す画像マップ出力システム１００は、画像ＤＢ（データベース）３に記憶された画像データを、画像の特徴に基づく分類語によって分類し、画像の分類を可視化したマップを作成するためのシステムである。また、画像マップ出力システム１００は、マップを構成する、画像データの集合を表す要素のうち１つの選択を受け付けた場合に、選択された要素に含まれる画像データを出力する。
以下の説明において、画像ＤＢ３に記憶される画像データは、例えば、美術館や博物館等で展示されている絵画等を、画像データ化したものであるとする。そして、画像データは、例えば、その画像を保有する美術館等を識別するための美術館ＩＤ（ＩＤｅｎｔｉｆｉｃａｔｉｏｎ）を含むデータを、メタデータとして有するものとする。
一例として、画像マップ出力システム１００では、美術館や博物館で保有する絵画に関するマップを、美術館や博物館ごとに作成するものを説明する。 The image map output system 100 shown in FIG. 1 is a system for classifying image data stored in an image DB (database) 3 by classification words based on image characteristics and creating a map that visualizes the classification of images. is there. Further, when the image map output system 100 accepts the selection of one of the elements representing the set of image data constituting the map, the image map output system 100 outputs the image data included in the selected element.
In the following description, it is assumed that the image data stored in the image DB 3 is, for example, an image data of a painting or the like exhibited in a museum or a museum. Then, the image data is assumed to have, for example, data including a museum ID (IDentification) for identifying a museum or the like that owns the image as metadata.
As an example, in the image map output system 100, a map related to a painting held in a museum or a museum is created for each museum or the museum.

画像マップ出力システム１００は、画像マップ作成装置１と、画像ＤＢ３と、表示装置４とを備える。画像マップ作成装置１と、画像ＤＢ３と、表示装置４とは、通信ネットワークＮを介して通信可能に接続されている。通信ネットワークＮは、例えば、インターネット回線等であり、有線であるか無線であるかを問わない。 The image map output system 100 includes an image map creation device 1, an image DB 3, and a display device 4. The image map creating device 1, the image DB 3, and the display device 4 are communicably connected via the communication network N. The communication network N is, for example, an Internet line or the like, regardless of whether it is wired or wireless.

＜画像マップ作成装置１＞
画像マップ作成装置１は、画像ＤＢ３に記憶された画像データを用いる。画像マップ作成装置１は、画像データが示す画像を解析して、画像の内容を表す複数の単語を抽出する。画像マップ作成装置１は、抽出した複数の単語を、画像データに関連付けて、画像ＤＢ３に記憶させてもよい。そして、画像マップ作成装置１は、抽出した複数の単語から、例えば、上位概念にあたる語を生成して、生成した語を、マップで用いる分類語にし、分類語を用いた、例えば、バブルチャートで表されたマップを作成する。 <Image map creation device 1>
The image map creating device 1 uses the image data stored in the image DB 3. The image map creating device 1 analyzes the image indicated by the image data and extracts a plurality of words representing the contents of the image. The image map creating device 1 may associate the extracted plurality of words with the image data and store them in the image DB 3. Then, the image map creation device 1 generates, for example, a word corresponding to a superordinate concept from the extracted plurality of words, uses the generated word as a classification word to be used in the map, and uses the classification word, for example, in a bubble chart. Create a represented map.

画像マップ作成装置１は、例えば、サーバである。画像マップ作成装置１は、その他、パーソナルコンピュータ（ＰＣ）等であってもよい。
図２（Ａ）に示すように、画像マップ作成装置１は、制御部１０と、記憶部２０と、通信インタフェース部２９とを備える。
制御部１０は、画像マップ作成装置１の全体を制御する中央処理装置（ＣＰＵ）である。制御部１０は、記憶部２０に記憶されているオペレーティングシステム（ＯＳ）やアプリケーションプログラムを適宜読み出して実行することにより、上述したハードウェアと協働し、各種機能を実行する。 The image map creating device 1 is, for example, a server. The image map creating device 1 may also be a personal computer (PC) or the like.
As shown in FIG. 2A, the image map creating device 1 includes a control unit 10, a storage unit 20, and a communication interface unit 29.
The control unit 10 is a central processing unit (CPU) that controls the entire image map creation device 1. The control unit 10 appropriately reads and executes the operating system (OS) and the application program stored in the storage unit 20 to cooperate with the above-mentioned hardware and execute various functions.

制御部１０は、画像受付部１１と、単語抽出部１２（単語抽出手段）と、分類部１３（分類手段）と、分類語生成部１４（分類語生成手段）と、マップ作成部１５（マップ作成手段）と、マップ登録部１６と、関連画像生成部１７とを備える。
画像受付部１１は、マップの作成対象になる画像データを受け付ける。画像受付部１１は、例えば、画像データのメタデータに有する美術館ＩＤに基づいて、画像ＤＢ３に記憶された、対象の美術館が保有する絵画の画像データを受け付ける。 The control unit 10 includes an image reception unit 11, a word extraction unit 12 (word extraction means), a classification unit 13 (classification means), a classification word generation unit 14 (classification word generation means), and a map creation unit 15 (map). A creation means), a map registration unit 16, and a related image generation unit 17.
The image receiving unit 11 receives image data to be created as a map. The image receiving unit 11 receives, for example, image data of a painting owned by the target museum stored in the image DB 3 based on the museum ID contained in the metadata of the image data.

単語抽出部１２は、受け付けた画像データの内容を表す単語を抽出する。
単語抽出部１２は、例えば、公知の説明文生成ソフトウェアである“ＤｅｎｓｅＣａｐ”を使用して、画像データに関する説明文を生成する。“ＤｅｎｓｅＣａｐ”を使用する場合、単語抽出部１２により生成した説明文は、英語で記載されている。なお、単語抽出部１２は、“ＤｅｎｓｅＣａｐ”の代わりに、公知の説明文生成ソフトウェアである“ＮｅｕｒａｌＴａｌｋ２”等を用いてもよい。但し、説明文を複数生成する“ＤｅｎｓｅＣａｐ”の方が、画像内を複数領域に分けて、各領域の説明文を生成するため、画像に関する説明文から多くの単語を得ることができる。 The word extraction unit 12 extracts a word representing the content of the received image data.
The word extraction unit 12 uses, for example, "DenseCap", which is known explanatory text generation software, to generate explanatory text regarding image data. When "DenseCap" is used, the explanatory text generated by the word extraction unit 12 is written in English. The word extraction unit 12 may use known explanatory text generation software "NeuralTalk2" or the like instead of "DenseCap". However, since "DenseCap" that generates a plurality of explanatory texts divides the image into a plurality of regions and generates explanatory texts for each region, many words can be obtained from the explanatory texts related to the image.

次に、説明文が英語で記載されている場合、単語抽出部１２は、英文の説明文を、日本語に翻訳する。単語抽出部１２は、例えば、公知の自動翻訳のソフトウェアを用いて、説明文を翻訳することができる。なお、その他として、単語抽出部１２は、通信ネットワークＮを介して接続された翻訳のウェブサービスを行うサーバ（図示せず）に対して通信をして、英語から日本語に翻訳してもよい。 Next, when the explanation is written in English, the word extraction unit 12 translates the explanation in English into Japanese. The word extraction unit 12 can translate the explanatory text using, for example, known automatic translation software. In addition, the word extraction unit 12 may translate from English to Japanese by communicating with a server (not shown) that provides a translation web service connected via the communication network N. ..

なお、“ＤｅｎｓｅＣａｐ”や“ＮｅｕｒａｌＴａｌｋ２”の場合、説明文の生成にあたって、現在の画像データを使用して学習した学習器を用いている。そのため、“ＤｅｎｓｅＣａｐ”等は、学習結果として、画像データから現在においてありふれた物を用いた説明文を生成する。他方、美術館や博物館等の絵画の画像データは、古い時代の絵画の画像データを含むため、“ＤｅｎｓｅＣａｐ”等では、誤変換が生じる場合がある。例えば、人物が手に持っている聖書を、現在においてありふれた物であるタブレットに、誤って変換する場合がある。よって、その場合には、生成された説明文に含まれるタブレットを、聖書に置き換えたり、絵画がモチーフとしている時代にはない物の説明については、削除したりする必要がある。 In the case of "DenseCap" and "NeuralTalk2", a learning device learned using the current image data is used to generate the explanatory text. Therefore, "DenseCap" or the like generates an explanatory text using an ordinary object from the image data as a learning result. On the other hand, since the image data of paintings in museums and the like includes the image data of paintings of old times, erroneous conversion may occur in "DenseCap" and the like. For example, a person's hand may erroneously convert the Bible into a tablet, which is now commonplace. Therefore, in that case, it is necessary to replace the tablet included in the generated explanation with the Bible, or delete the explanation of things that are not in the era when the painting is the motif.

その後、単語抽出部１２は、説明文から複数の単語を抽出する。具体的には、単語抽出部１２は、例えば、説明文に対して形態素解析を行うことによって、説明文から複数の単語を抽出する。ここで、単語抽出部１２が抽出する単語としては、名詞、形容詞、動詞等がある。
なお、上述では、“ＤｅｎｓｅＣａｐ”を用いた説明文の生成及び単語の抽出を説明したが、これに限定するものではない。 After that, the word extraction unit 12 extracts a plurality of words from the explanatory text. Specifically, the word extraction unit 12 extracts a plurality of words from the explanatory text, for example, by performing morphological analysis on the explanatory text. Here, the words extracted by the word extraction unit 12 include nouns, adjectives, verbs, and the like.
In the above description, the generation of the explanatory text and the extraction of words using "DenseCap" have been described, but the present invention is not limited to this.

分類部１３は、マップを作成する対象である複数の画像データから抽出した単語を、複数のグループに分ける。ここで、マップを作成する対象である複数の画像データは、例えば、対象の美術館が保有する複数の絵画の画像データである。
分類部１３は、例えば、マップを作成する対象である複数の画像データに基づいて取得した処理対象の単語から、品詞によってフィルタリングすることで、名詞、形容詞、動詞等のグループに分けることができる。
なお、分類部１３は、「彼」や「これ」等の代名詞や、「側」や「製」等の接尾名詞、「中」や「一部」等の副詞可能名詞等を除外するようにしてもよい。また、分類部１３による分類は、品詞によるものに限定されない。一例として、分類部１３による分類は、予め決められたカテゴリ（ジャンル）によるものであってもよい。 The classification unit 13 divides words extracted from a plurality of image data for which a map is created into a plurality of groups. Here, the plurality of image data for which the map is created is, for example, image data of a plurality of paintings owned by the target museum.
The classification unit 13 can be divided into groups such as nouns, adjectives, and verbs by filtering by part of speech from words to be processed that are acquired based on a plurality of image data for which a map is to be created, for example.
The classification unit 13 excludes pronouns such as "he" and "kore", suffixes such as "side" and "made", and adverb and count nouns such as "middle" and "part". You may. Further, the classification by the classification unit 13 is not limited to the part of speech. As an example, the classification by the classification unit 13 may be based on a predetermined category (genre).

分類語生成部１４は、処理対象の単語をクラスタリングして、分類語を生成する。
分類語生成部１４は、例えば、処理対象の単語を上位概念化して、分類語にする。一例として、単語抽出部１２によって「少女」、「男性」という単語が抽出された場合、分類語生成部１４は、「少女」、「男性」に共通した上位概念の語である「人」を、分類語として生成してもよい。
また、分類語生成部１４は、処理対象の単語の出現頻度に基づいて、出現頻度の高い単語を、分類語にしてもよい。具体的には、出現した単語と、出現数とを対応付けて記憶した上で、分類語生成部１４は、出現数が多い方から上位ｎ個（ｎは１以上の整数）の単語や、ある閾値以上の出現数の単語を、分類語として取得してもよい。 The classification word generation unit 14 clusters the words to be processed to generate the classification word.
For example, the classification word generation unit 14 superordinates the word to be processed into a classification word. As an example, when the words "girl" and "male" are extracted by the word extraction unit 12, the classification word generation unit 14 uses "person", which is a superordinate concept word common to "girl" and "male". , May be generated as a taxonomy.
Further, the classification word generation unit 14 may use a word having a high frequency of occurrence as a classification word based on the frequency of appearance of the word to be processed. Specifically, after storing the words that have appeared in association with the number of occurrences, the classification word generation unit 14 has the highest n words (n is an integer of 1 or more) from the one with the largest number of occurrences, or Words having a number of occurrences equal to or greater than a certain threshold may be acquired as classification words.

さらに、分類語生成部１４は、処理対象の単語の出現頻度の高い単語から、上位概念化する単語を選択して、上位概念の語を分類語にしてもよい。
さらにまた、分類語生成部１４は、処理対象の単語が分類部１３によって分類された場合には、分類されたグループごとにクラスタリングをして分類語を生成してもよい。
その他、分類語生成部１４は、単語の意味の類似性に基づいてクラスタリングして、分類語を生成してもよい。 Further, the classification word generation unit 14 may select a word to be super-conceptualized from the words having a high frequency of occurrence of the word to be processed, and use the word of the super-concept as the classification word.
Furthermore, when the words to be processed are classified by the classification unit 13, the classification word generation unit 14 may generate the classification words by clustering for each classified group.
In addition, the classification word generation unit 14 may generate classification words by clustering based on the similarity of word meanings.

マップ作成部１５は、分類部１３によりグループ分けがされた各グループを軸にして分類語を並べ、分類語に対応する位置に、対応する各画像データの数をカウントして円領域（要素）の大きさで表したバブルチャートであるマップ８０を作成する。
ここで、マップ作成部１５は、分類語に対応する位置が、複数の分類語を含む場合、つまり２軸以上の場合に、全ての分類語を含む説明文があるか否かを、各画像データについて判断する。そして、マップ作成部１５は、全ての分類語を含む説明文がある場合に、その画像データの画像ＩＤを、対応する円領域の要素ＩＤに関連付ける。ここで、画像ＩＤは、画像データを識別するための識別情報である。また、要素ＩＤは、マップ８０に含まれる円領域を識別するための識別情報である。 The map creation unit 15 arranges the classification words around each group grouped by the classification unit 13, counts the number of each corresponding image data at the position corresponding to the classification word, and makes a circle area (element). A map 80, which is a bubble chart represented by the size of, is created.
Here, the map creation unit 15 determines whether or not there is an explanatory text including all the classification words when the position corresponding to the classification word includes a plurality of classification words, that is, when there are two or more axes. Judge about the data. Then, when there is an explanatory text including all the classification words, the map creation unit 15 associates the image ID of the image data with the element ID of the corresponding circular region. Here, the image ID is identification information for identifying the image data. The element ID is identification information for identifying the circular area included in the map 80.

図３は、分類部１３によるグループ分けから２つのグループを選んだ場合のマップ８０の例を示す。マップ８０は、縦軸をクラスタ代表語とし、横軸を形容詞として、縦軸と横軸との交わる位置に、両方の分類語で分類される画像データの数を、円領域の大きさで表して配置した２次元マトリックスでの表示態様である。
ここで、１つの画像データに対して、複数の分類語が対応する場合、複数の円領域に、ある１つの画像データを含んでもよい。例えば、ある画像データに、「丸くて赤い果物」の説明文が含まれる場合、その画像データは、分類語として、「丸い」「果物」と、「赤い」「果物」とを含む。よって、「丸い」と「果物」とが交差する位置の円領域と、「赤い」と「果物」とが交差する位置の円領域との両方に、この画像データが含まれる。 FIG. 3 shows an example of the map 80 when two groups are selected from the grouping by the classification unit 13. In the map 80, the vertical axis is a cluster representative word, the horizontal axis is an adjective, and the number of image data classified by both classification words is represented by the size of a circular region at the intersection of the vertical axis and the horizontal axis. It is a display mode in the two-dimensional matrix arranged in the above.
Here, when a plurality of classification words correspond to one image data, a certain image data may be included in a plurality of circular regions. For example, when a certain image data includes a description of "round and red fruit", the image data includes "round" and "fruit" and "red" and "fruit" as classification words. Therefore, this image data is included in both the circular area at the position where the "round" and the "fruit" intersect and the circular area at the position where the "red" and the "fruit" intersect.

図２（Ａ）に戻り、マップ登録部１６は、マップ作成部１５により作成されたマップ８０を、マップ記憶部２２に記憶させる。また、マップ登録部１６は、マップ８０を構成する画像データの画像ＩＤを、円領域を表す要素ＩＤに対応付けて、マップ記憶部２２に記憶させる。
関連画像生成部１７は、マップ８０に含まれる円領域に関連付けられた画像データを、類似度に基づいて配置した、関連画像データを生成する。
ここで、類似度は、円領域に関連付けられた複数の画像データの中から２つの画像データの組合せを選択し、選択した各組合せの画像データの類似度合いを表す。
類似度は、２つの画像データの特徴量に基づいて算出してもよい。特徴量としては、例えば、２つの画像データを構成する色調の類似度合い等である。また、類似度は、２つの画像データに対応した単語の一致度合いや、類似度合いであってもよい。そして、類似度は、その度合いが高いほど、２つの画像データが似ていることを示す。 Returning to FIG. 2A, the map registration unit 16 stores the map 80 created by the map creation unit 15 in the map storage unit 22. Further, the map registration unit 16 stores the image ID of the image data constituting the map 80 in the map storage unit 22 in association with the element ID representing the circular area.
The related image generation unit 17 generates related image data in which image data associated with the circular region included in the map 80 is arranged based on the degree of similarity.
Here, the degree of similarity represents the degree of similarity of the image data of each selected combination by selecting a combination of two image data from a plurality of image data associated with the circular region.
The similarity may be calculated based on the features of the two image data. The feature amount is, for example, the degree of similarity of the color tones constituting the two image data. Further, the degree of similarity may be the degree of matching of words corresponding to the two image data or the degree of similarity. The degree of similarity indicates that the two image data are similar as the degree is higher.

関連画像生成部１７は、より具体的には、円領域に関連付けられた複数の画像データを含み、算出した類似度を、２つの画像データを結ぶ線の長さや、線の太さによって表した関連画像データを生成する。なお、関連画像データは、２つの画像データの類似度が閾値以上である場合に、２つの画像データを線で結ぶようにしてもよい。その他、関連画像データは、類似度を、基準にした画像からの奥行方向の距離によって表してもよい。 More specifically, the related image generation unit 17 includes a plurality of image data associated with the circular region, and expresses the calculated similarity by the length of the line connecting the two image data and the thickness of the line. Generate related image data. The related image data may be connected by a line when the similarity between the two image data is equal to or higher than the threshold value. In addition, the related image data may express the degree of similarity by the distance in the depth direction from the reference image.

記憶部２０は、制御部１０が各種の処理を実行するために必要なプログラム、データ等を記憶するためのハードディスク、半導体メモリ素子等の記憶領域である。
記憶部２０は、プログラム記憶部２１と、マップ記憶部２２とを備える。
プログラム記憶部２１は、各種のプログラムを記憶する記憶領域である。プログラム記憶部２１は、上述した公知の説明文生成ソフトウェアや、公知の自動翻訳ソフトウェアの他、マップ作成プログラム２１ａを記憶している。マップ作成プログラム２１ａは、画像マップ作成装置１の制御部１０が実行する各種機能を行うためのプログラムである。なお、マップ作成プログラム２１ａは、公知の説明文生成ソフトウェアや、公知の自動翻訳ソフトウェアを含むものであってもよい。 The storage unit 20 is a storage area for a hard disk, a semiconductor memory element, or the like for storing programs, data, and the like necessary for the control unit 10 to execute various processes.
The storage unit 20 includes a program storage unit 21 and a map storage unit 22.
The program storage unit 21 is a storage area for storing various programs. The program storage unit 21 stores the map creation program 21a in addition to the above-mentioned known explanatory text generation software and known automatic translation software. The map creation program 21a is a program for performing various functions executed by the control unit 10 of the image map creation device 1. The map creation program 21a may include known explanatory text generation software and known automatic translation software.

マップ記憶部２２は、マップ作成部１５によって作成されたマップ８０のデータを記憶する記憶領域である。
マップ記憶部２２は、マップ８０のデータの他、マップ８０を構成する円領域ごとに、円領域に含まれる各画像の画像ＩＤを記憶する。
通信インタフェース部２９は、通信ネットワークＮを介して画像ＤＢ３や、表示装置４との間の通信を行うためのインタフェースである。 The map storage unit 22 is a storage area for storing the data of the map 80 created by the map creation unit 15.
In addition to the data of the map 80, the map storage unit 22 stores the image ID of each image included in the circular area for each circular area constituting the map 80.
The communication interface unit 29 is an interface for performing communication with the image DB 3 and the display device 4 via the communication network N.

＜画像ＤＢ３＞
図１に示す画像ＤＢ３は、ハードディスク、半導体メモリ素子等の記憶領域を有する、例えば、画像のデータベースサーバである。画像ＤＢ３は、画像ＩＤに対応付けて、画像データを記憶する。画像ＤＢ３は、この例では、美術館ごとに、画像データを、物理的に又は論理的に分けて記憶してもよい。また、画像ＤＢ３は、図示しない他の端末等から画像データを受信して記憶してもよい。なお、画像ＤＢ３は、図示しないが、制御部、記憶部、通信インタフェース部等を備える。 <Image DB3>
The image DB 3 shown in FIG. 1 is, for example, an image database server having a storage area such as a hard disk and a semiconductor memory element. The image DB 3 stores image data in association with the image ID. In this example, the image DB 3 may physically or logically store the image data for each museum. Further, the image DB 3 may receive and store image data from another terminal or the like (not shown). Although not shown, the image DB 3 includes a control unit, a storage unit, a communication interface unit, and the like.

＜表示装置４＞
表示装置４は、例えば、タブレットに代表されるコンピュータの機能を併せ持った携帯型の装置である。表示装置４は、その他、スマートフォンや携帯電話機等でもよいし、ＰＣ等であってもよい。
表示装置４は、ユーザが所望する画像のマップ８０を表示させる。また、表示装置４は、表示されたマップ８０の要素を受け付け、要素に対応した画像データを、画像ＤＢ３から抽出して表示させる。
図２（Ｂ）に示すように、表示装置４は、制御部４０と、記憶部５０と、タッチパネルディスプレイ５７と、通信インタフェース部５９とを備える。 <Display device 4>
The display device 4 is, for example, a portable device having a computer function represented by a tablet. The display device 4 may also be a smartphone, a mobile phone, or the like, or a PC or the like.
The display device 4 displays a map 80 of an image desired by the user. Further, the display device 4 receives the displayed elements of the map 80, extracts the image data corresponding to the elements from the image DB 3, and displays the elements.
As shown in FIG. 2B, the display device 4 includes a control unit 40, a storage unit 50, a touch panel display 57, and a communication interface unit 59.

制御部４０は、表示装置４の全体を制御するＣＰＵである。制御部４０は、記憶部５０に記憶されているＯＳやアプリケーションプログラムを適宜読み出して実行することにより、上述したハードウェアと協働し、各種機能を実行する。
制御部４０は、マップ出力部４１（マップ出力手段）と、要素受付部４２（要素受付手段）と、画像出力部４３（画像出力手段）とを備える。
マップ出力部４１は、画像マップ作成装置１のマップ作成部１５により作成され、又は、画像マップ作成装置１のマップ登録部１６によりマップ記憶部２２に登録されたマップ８０のデータを、タッチパネルディスプレイ５７に出力する。これにより、表示装置４のユーザは、タッチパネルディスプレイ５７に表示されたマップ８０を視認できる。
要素受付部４２は、マップ８０が表示されている画面において、ユーザにより選択された１つの円領域を受け付ける。
画像出力部４３は、画像マップ作成装置１の関連画像生成部１７により生成された、受け付けた円領域に対応する関連画像データを、タッチパネルディスプレイ５７に出力する。 The control unit 40 is a CPU that controls the entire display device 4. The control unit 40 executes various functions in cooperation with the above-mentioned hardware by appropriately reading and executing the OS and application programs stored in the storage unit 50.
The control unit 40 includes a map output unit 41 (map output means), an element reception unit 42 (element reception means), and an image output unit 43 (image output means).
The map output unit 41 displays the data of the map 80 created by the map creation unit 15 of the image map creation device 1 or registered in the map storage unit 22 by the map registration unit 16 of the image map creation device 1 on the touch panel display 57. Output to. As a result, the user of the display device 4 can visually recognize the map 80 displayed on the touch panel display 57.
The element receiving unit 42 receives one circular area selected by the user on the screen on which the map 80 is displayed.
The image output unit 43 outputs the related image data corresponding to the received circular area generated by the related image generation unit 17 of the image map creation device 1 to the touch panel display 57.

記憶部５０は、制御部４０が各種の処理を実行するために必要なプログラム、データ等を記憶するための半導体メモリ素子等の記憶領域である。記憶部５０は、プログラム記憶部５１を備える。
プログラム記憶部５１は、各種のプログラムを記憶する記憶領域である。プログラム記憶部５１は、画像表示プログラム５１ａを記憶している。画像表示プログラム５１ａは、制御部４０が実行する各種機能を行うためのプログラムである。
タッチパネルディスプレイ５７は、液晶パネル等で構成される表示部としての機能と、ユーザからの指等によるタッチ入力を検出する入力部としての機能とを有する。
通信インタフェース部５９は、通信ネットワークＮを介して画像マップ作成装置１等との間で通信を行うためのインタフェースである。 The storage unit 50 is a storage area such as a semiconductor memory element for storing programs, data, and the like necessary for the control unit 40 to execute various processes. The storage unit 50 includes a program storage unit 51.
The program storage unit 51 is a storage area for storing various programs. The program storage unit 51 stores the image display program 51a. The image display program 51a is a program for performing various functions executed by the control unit 40.
The touch panel display 57 has a function as a display unit composed of a liquid crystal panel or the like and a function as an input unit for detecting a touch input by a finger or the like from a user.
The communication interface unit 59 is an interface for communicating with the image map creating device 1 and the like via the communication network N.

ここで、コンピュータとは、制御部、記憶装置等を備えた情報処理装置をいい、画像マップ作成装置１及び表示装置４は、各々制御部、記憶部等を備えた情報処理装置であり、コンピュータの概念に含まれる。 Here, the computer means an information processing device including a control unit, a storage device, and the like, and the image map creation device 1 and the display device 4 are information processing devices including a control unit, a storage unit, and the like, respectively, and are computers. Included in the concept of.

＜画像マップ出力システム１００の処理＞
次に、表示装置４にマップ８０を表示させる処理について説明する。
図４は、本実施形態に係る画像マップ出力システムのマップ出力処理を示すフローチャートである。
図５は、本実施形態に係る表示装置４での表示例を示す図である。
図６は、本実施形態に係る画像マップ作成装置１のマップ作成処理を示すフローチャートである。
図７及び図８は、本実施形態に係る画像マップ作成装置１によるマップ作成処理での具体例を示す図である。 <Processing of image map output system 100>
Next, a process of displaying the map 80 on the display device 4 will be described.
FIG. 4 is a flowchart showing a map output process of the image map output system according to the present embodiment.
FIG. 5 is a diagram showing a display example on the display device 4 according to the present embodiment.
FIG. 6 is a flowchart showing a map creation process of the image map creation device 1 according to the present embodiment.
7 and 8 are diagrams showing a specific example in the map creation process by the image map creation device 1 according to the present embodiment.

図４のステップＳ（以下、単に「Ｓ」という。）１０において、表示装置４の制御部４０は、画像表示プログラム５１ａを実行する。そして、制御部４０は、表示装置４のタッチパネルディスプレイ５７に、美術館選択画面７０を出力する。
図５（Ａ）に、美術館選択画面７０の一例を示す。制御部４０は、画像表示プログラム５１ａを実行した後に、画像マップ作成装置１に対して問合せを行って、画像マップ作成装置１が送信した美術館選択画面７０を出力してもよい。また、制御部４０は、画像表示プログラム５１ａに予め用意された美術館選択画面７０を出力してもよい。
表示装置４のユーザは、マップを表示させたい対象の美術館名の左横のチェックボックスにチェックを入れて、実行ボタンを選択する。そうすることで、Ｓ１１において、制御部４０は、美術館名の選択を受け付けて、画像マップ作成装置１にその美術館の美術館ＩＤを送信する。制御部４０は、美術館ＩＤ以外の美術館を特定可能なデータを送信しても、もちろんよい。
Ｓ１２において、画像マップ作成装置１の制御部１０は、美術館ＩＤを受信する。
Ｓ１３において、制御部１０は、マップ作成処理を行う。 In step S (hereinafter, simply referred to as “S”) 10 of FIG. 4, the control unit 40 of the display device 4 executes the image display program 51a. Then, the control unit 40 outputs the museum selection screen 70 to the touch panel display 57 of the display device 4.
FIG. 5A shows an example of the museum selection screen 70. After executing the image display program 51a, the control unit 40 may make an inquiry to the image map creation device 1 and output the museum selection screen 70 transmitted by the image map creation device 1. Further, the control unit 40 may output the museum selection screen 70 prepared in advance in the image display program 51a.
The user of the display device 4 checks the check box on the left side of the name of the museum to which the map is to be displayed, and selects the execute button. By doing so, in S11, the control unit 40 accepts the selection of the museum name and transmits the museum ID of the museum to the image map creation device 1. Of course, the control unit 40 may transmit data that can identify the museum other than the museum ID.
In S12, the control unit 10 of the image map creation device 1 receives the museum ID.
In S13, the control unit 10 performs a map creation process.

ここで、マップ作成処理について、図６に基づき説明する。
図６のＳ２０において、画像マップ作成装置１の制御部１０（画像受付部１１）は、受信した美術館ＩＤを有する画像データを、画像ＤＢ３から抽出する。
Ｓ２１において、制御部１０（単語抽出部１２）は、抽出した画像データから１つの画像データを選択し、画像データが示す画像の内容を言語化して単語を抽出する。
図７（Ａ）は、言語化する対象の画像データが示す画像６０の例である。
図７（Ｂ）は、画像６０に対して“ＤｅｎｓｅＣａｐ”を使用した場合の結果画像６１及びキャプション６２の例である。“ＤｅｎｓｅＣａｐ”では、画像６０の領域ごとに文を出力する。そして、結果画像６１が示すように、領域は、画像６０に複数生成されるので、文も複数生成される。なお、画像６０の画素数が６４０ｐｘ（ピクセル）である場合には、“ＤｅｎｓｅＣａｐ”は、約８０個のキャプション６２を生成する。 Here, the map creation process will be described with reference to FIG.
In S20 of FIG. 6, the control unit 10 (image reception unit 11) of the image map creation device 1 extracts the image data having the received museum ID from the image DB 3.
In S21, the control unit 10 (word extraction unit 12) selects one image data from the extracted image data, verbalizes the content of the image indicated by the image data, and extracts a word.
FIG. 7A is an example of the image 60 shown by the image data to be verbalized.
FIG. 7B is an example of the result image 61 and the caption 62 when "DenseCap" is used for the image 60. In "DenseCap", a sentence is output for each area of the image 60. Then, as the result image 61 shows, a plurality of regions are generated in the image 60, so that a plurality of sentences are also generated. When the number of pixels of the image 60 is 640 px (pixels), "DenseCap" generates about 80 captions 62.

図７（Ｃ）は、キャプション６２を翻訳した翻訳文（説明文）６３の例である。
制御部１０は、翻訳文６３を、画像データに対応付けて画像ＤＢ３に記憶してもよい。
図８（Ａ）は、図７（Ｃ）に示す翻訳文６３に対して形態素解析をして得られた結果データ６４を示す。
図８（Ｂ）は、結果データ６４に基づいて作成した単語と品詞とを対応付けた単語テーブル６５の例を示す。 FIG. 7C is an example of a translated sentence (explanatory sentence) 63 translated from the caption 62.
The control unit 10 may store the translated sentence 63 in the image DB 3 in association with the image data.
FIG. 8A shows result data 64 obtained by performing morphological analysis on the translated sentence 63 shown in FIG. 7C.
FIG. 8B shows an example of a word table 65 in which a word created based on the result data 64 and a part of speech are associated with each other.

図６に戻り、Ｓ２２において、制御部１０は、抽出した全ての画像データについて、言語化して単語を抽出する処理を行ったか否かを判断する。抽出した全ての画像データについて処理を行った場合（Ｓ２２：ＹＥＳ）には、制御部１０は、処理をＳ２３に移す。他方、抽出した全ての画像データについて処理を行っていない場合（Ｓ２２：ＮＯ）には、制御部１０は、処理をＳ２１に移し、未だ言語化をしていない画像データについて、言語化する処理を行う。 Returning to FIG. 6, in S22, the control unit 10 determines whether or not all the extracted image data has been verbalized to extract words. When processing is performed on all the extracted image data (S22: YES), the control unit 10 shifts the processing to S23. On the other hand, when all the extracted image data has not been processed (S22: NO), the control unit 10 shifts the processing to S21 and performs a process of verbalizing the image data that has not yet been verbalized. Do.

Ｓ２３において、制御部１０（分類部１３、分類語生成部１４）は、抽出した全ての画像データから抽出された単語を、２つのグループに分けて、各グループでクラスタリングをする。制御部１０は、例えば、名詞と、形容詞との２つのグループに分ける。これにより、２つのグループに含まれない動詞等は、除外される。また、制御部１０は、グループごとに、クラスタリングをして分類語を抽出する。その際、制御部１０は、単語の出現頻度が一定の閾値以上のものを、クラスタリングの対象としてもよい。また、分類語を、単語の出現頻度に応じて、所定数に制限してもよい。
Ｓ２４において、制御部１０（マップ作成部１５）は、２つのグループを縦軸及び横軸にし、各軸の分類語が交わる位置に、カウントすることで得た分類語を含む画像データの件数を円領域の大きさで表したバブルチャートのマップ８０を作成する。
Ｓ２５において、制御部１０（マップ登録部１６）は、作成したマップ８０のデータと、円領域を示す要素ＩＤと画像データのＩＤとの対応付けを、マップ記憶部２２に記憶させる。その後、制御部１０は、本処理を終了し、処理を図４に移す。 In S23, the control unit 10 (classification unit 13, classification word generation unit 14) divides the words extracted from all the extracted image data into two groups and clusters them in each group. The control unit 10 is divided into two groups, for example, a noun and an adjective. As a result, verbs and the like that are not included in the two groups are excluded. Further, the control unit 10 performs clustering for each group and extracts the classification words. At that time, the control unit 10 may target words whose frequency of appearance is equal to or higher than a certain threshold value for clustering. Further, the number of classified words may be limited to a predetermined number according to the frequency of occurrence of words.
In S24, the control unit 10 (map creation unit 15) sets the two groups on the vertical axis and the horizontal axis, and counts the number of image data including the classification words obtained by counting at the positions where the classification words of each axis intersect. A map 80 of a bubble chart represented by the size of a circular area is created.
In S25, the control unit 10 (map registration unit 16) stores the correspondence between the created map 80 data, the element ID indicating the circular area, and the image data ID in the map storage unit 22. After that, the control unit 10 ends this process and shifts the process to FIG.

なお、この例では、画像データを言語化してからマップ８０を作成し、マップ記憶部２２に記憶させるものを例に説明したが、これに限定されない。例えば、予め画像ＤＢ３に記憶された画像データに対して言語化する処理までを行って、画像データに単語を関連付けておき、マップ作成処理として、Ｓ２３以降の処理を行うようにしてもよい。 In this example, the map 80 is created after the image data is verbalized and stored in the map storage unit 22 as an example, but the present invention is not limited to this. For example, the image data stored in the image DB 3 in advance may be verbalized, words may be associated with the image data, and the map creation process may be performed after S23.

図４に戻り、Ｓ１４において、制御部１０は、作成したマップ８０のデータを、表示装置４に対して送信する。その後、制御部１０は、本処理を終了する。
Ｓ１５において、表示装置４の制御部４０（マップ出力部４１）は、マップ８０のデータを受信して、タッチパネルディスプレイ５７にマップ８０を含むマップ表示画面７１を出力する。
図５（Ｂ）は、タッチパネルディスプレイ５７に表示されたマップ表示画面７１の例を示す。マップ表示画面７１に表示されたマップ８０は、分類語によって、選択した美術館の絵画の特徴が分かりやすく示されるので、ユーザが所望の画像を探しやすいものにできる。
その後、制御部４０は、本処理を終了する。 Returning to FIG. 4, in S14, the control unit 10 transmits the created map 80 data to the display device 4. After that, the control unit 10 ends this process.
In S15, the control unit 40 (map output unit 41) of the display device 4 receives the data of the map 80 and outputs the map display screen 71 including the map 80 to the touch panel display 57.
FIG. 5B shows an example of the map display screen 71 displayed on the touch panel display 57. In the map 80 displayed on the map display screen 71, the characteristics of the painting of the selected museum are shown in an easy-to-understand manner by the classification word, so that the user can easily find the desired image.
After that, the control unit 40 ends this process.

次に、マップ８０に示される円領域を１つ選択することで、円領域に含まれる画像データを出力する処理について説明する。
図９は、本実施形態に係る画像マップ出力システム１００の関連画像出力処理を示すフローチャートである。
図１０は、本実施形態に係る画像マップ作成装置１の関連画像生成処理を示すフローチャートである。
図１１は、本実施形態に係る表示装置４での表示例を示す図である。 Next, a process of outputting image data included in the circle area by selecting one circle area shown in the map 80 will be described.
FIG. 9 is a flowchart showing a related image output process of the image map output system 100 according to the present embodiment.
FIG. 10 is a flowchart showing a related image generation process of the image map creating device 1 according to the present embodiment.
FIG. 11 is a diagram showing a display example in the display device 4 according to the present embodiment.

例えば、図５（Ｂ）に示すように、マップ表示画面７１の１つの円領域（バブル）をユーザが選択することで、図９のＳ３０において、表示装置４の制御部４０（要素受付部４２）は、マップ８０上の円領域に対する選択を受け付ける。
Ｓ３１において、制御部４０は、選択された円領域を識別する要素ＩＤを、画像マップ作成装置１に対して送信する。
Ｓ３２において、画像マップ作成装置１の制御部１０は、要素ＩＤを受信する。
Ｓ３３において、制御部１０（関連画像生成部１７）は、関連画像生成処理を行う。 For example, as shown in FIG. 5B, when the user selects one circular area (bubble) on the map display screen 71, in S30 of FIG. 9, the control unit 40 (element reception unit 42) of the display device 4 is selected. ) Accepts selections for the circular area on the map 80.
In S31, the control unit 40 transmits an element ID that identifies the selected circular region to the image map creation device 1.
In S32, the control unit 10 of the image map creation device 1 receives the element ID.
In S33, the control unit 10 (related image generation unit 17) performs the related image generation process.

ここで、関連画像生成処理について、図１０に基づき説明する。
Ｓ４０において、制御部１０は、要素ＩＤに対応付けられた画像データを、画像ＤＢ３から抽出する。具体的には、制御部１０は、マップ記憶部２２を参照して、要素ＩＤに対応付けられた画像データの画像ＩＤを特定する。そして、制御部１０は、画像ＩＤに対応した画像データを、画像ＤＢ３から抽出する。
Ｓ４１において、制御部１０は、抽出した画像データの類似度を算出する。複数の画像データが抽出された場合には、制御部１０は、２つの画像データの各組合せに対して、類似度を算出する。 Here, the related image generation process will be described with reference to FIG.
In S40, the control unit 10 extracts the image data associated with the element ID from the image DB 3. Specifically, the control unit 10 refers to the map storage unit 22 to specify the image ID of the image data associated with the element ID. Then, the control unit 10 extracts the image data corresponding to the image ID from the image DB 3.
In S41, the control unit 10 calculates the similarity of the extracted image data. When a plurality of image data are extracted, the control unit 10 calculates the similarity for each combination of the two image data.

Ｓ４２において、制御部１０は、算出した類似度に応じて各画像データを配置した関連画像９０を生成する。
図１１は、関連画像９０を含む関連画像画面７２の例を示す。関連画像９０は、図５（Ｂ）でユーザにより選択された円領域に含まれる７つの画像データを、類似度に基づいて配置したものになっている。ここで、関連画像９０は、ユーザにより選択された円領域に含まれる７つの画像データのうち、太さの異なる線によって２つの画像データが結び付けられている。この線の太さは、２つの画像データの類似度を表す。
図１０に戻り、その後、制御部１０は、本処理を終了し、処理を図９に移す。 In S42, the control unit 10 generates a related image 90 in which each image data is arranged according to the calculated similarity.
FIG. 11 shows an example of a related image screen 72 including the related image 90. The related image 90 is formed by arranging seven image data included in the circular region selected by the user in FIG. 5B based on the degree of similarity. Here, in the related image 90, of the seven image data included in the circular region selected by the user, two image data are connected by lines having different thicknesses. The thickness of this line represents the similarity between the two image data.
Returning to FIG. 10, after that, the control unit 10 ends this process and shifts the process to FIG.

図９に戻り、Ｓ３４において、制御部１０は、生成した関連画像９０のデータを、表示装置４に対して送信する。その後、制御部１０は、本処理を終了する。
Ｓ３５において、表示装置４の制御部４０は、関連画像９０のデータを受信し、タッチパネルディスプレイ５７に関連画像９０を表示させる。その後、制御部４０は、本処理を終了する。 Returning to FIG. 9, in S34, the control unit 10 transmits the generated data of the related image 90 to the display device 4. After that, the control unit 10 ends this process.
In S35, the control unit 40 of the display device 4 receives the data of the related image 90 and causes the touch panel display 57 to display the related image 90. After that, the control unit 40 ends this process.

このように、本実施形態の画像マップ出力システム１００によれば、以下のような効果がある。
（１）画像マップ作成装置１は、画像データに関する説明文に形態素解析を行って、得られた単語から分類語を抽出し、分類語によるマップ８０を作成する。よって、抽出した単語から抽出する分類語は、画像の特徴を表すものにできる。また、マップ８０は、画像データの特徴を分類語により示したものであるので、マップ８０を参照することで、ユーザが所望する目的に最適な画像を探し出すことができる。
（２）画像マップ作成装置１は、複数の単語を複数のグループに分け、各グループの単語をクラスタリングして分類語を生成し、分けられた複数のグループうち少なくとも２つのグループの各分類語によって分類されたマップ８０を作成する。よって、マップ８０を２次元以上のものにできる。 As described above, the image map output system 100 of the present embodiment has the following effects.
(1) The image map creation device 1 performs morphological analysis on the explanatory text relating to the image data, extracts the classified words from the obtained words, and creates a map 80 using the classified words. Therefore, the classification word extracted from the extracted word can represent the feature of the image. Further, since the map 80 shows the features of the image data by the classification word, it is possible to find the most suitable image for the purpose desired by the user by referring to the map 80.
(2) The image map creation device 1 divides a plurality of words into a plurality of groups, clusters the words in each group to generate a classification word, and uses each classification word of at least two groups out of the divided plurality of groups. Create a classified map 80. Therefore, the map 80 can be made into two or more dimensions.

（３）画像マップ作成装置１は、抽出した複数の単語から上位概念を抽出して分類語を生成する。よって、上位概念の語によるマップ８０を作成でき、分類語の数を最適な数に抑えることができる。
（４）画像マップ作成装置１は、複数の画像データの各々から抽出された複数の単語をクラスタリングして分類語を生成するので、複数の画像データに共通の分類語を生成でき、分類しやすくできる。
（５）画像マップ作成装置１は、マップ８０に示され、分類語に対応付けられた円領域の選択を受け付けたことに応じて、円領域に対応する画像データを出力するので、分類語に対応する画像データを見ることができる。 (3) The image map creating device 1 extracts a superordinate concept from a plurality of extracted words and generates a classified word. Therefore, the map 80 using the words of the superordinate concept can be created, and the number of the classified words can be suppressed to the optimum number.
(4) Since the image map creation device 1 generates a classification word by clustering a plurality of words extracted from each of the plurality of image data, it is possible to generate a common classification word for the plurality of image data, and it is easy to classify. it can.
(5) The image map creation device 1 outputs the image data corresponding to the circle area according to the acceptance of the selection of the circle area associated with the classification word shown in the map 80. You can see the corresponding image data.

（６）画像マップ作成装置１は、円領域に対応する画像データが複数ある場合に、複数の画像データの類似度に応じて各画像データを配置して、複数の画像データを含む関連画像９０を表示装置４に出力するので、表示装置４では、類似度に対応した画像データの配置によって、各画像データの類似度合いを分かりやすく示すことができる。
（７）類似度は、各画像データの特徴量と、各画像データに対応した単語との少なくとも一方に基づいて算出されるので、各画像データの特徴を反映させたものにできる。 (6) When there are a plurality of image data corresponding to the circular region, the image map creating device 1 arranges each image data according to the similarity of the plurality of image data, and the related image 90 including the plurality of image data. Is output to the display device 4, so that the display device 4 can show the degree of similarity of each image data in an easy-to-understand manner by arranging the image data corresponding to the degree of similarity.
(7) Since the similarity is calculated based on at least one of the feature amount of each image data and the word corresponding to each image data, the feature of each image data can be reflected.

以上、本発明の実施形態について説明したが、本発明は上述した実施形態に限定されるものではない。また、実施形態に記載した効果は、本発明から生じる最も好適な効果を列挙したに過ぎず、本発明による効果は、実施形態に記載したものに限定されない。なお、上述した実施形態及び後述する変形形態は、適宜組み合わせて用いることもできるが、詳細な説明は省略する。 Although the embodiments of the present invention have been described above, the present invention is not limited to the above-described embodiments. Moreover, the effects described in the embodiments are merely a list of the most preferable effects arising from the present invention, and the effects according to the present invention are not limited to those described in the embodiments. The above-described embodiment and the modified form described later can be used in combination as appropriate, but detailed description thereof will be omitted.

（変形形態）
（１）本実施形態では、２つのグループに分けて各グループでクラスタリングするものを例に説明したが、これに限定されない。グループ分けせず、全体でクラスタリングしてもよいし、３つ以上のグループに分けて各グループでクラスタリングしてもよい。
（２）本実施形態では、バブルチャートのマップを作成するものを例に説明したが、これに限定されない。全体の俯瞰的な位置づけを把握可能なものであれば、他の方法によって表示可能なマップであってもよい。
（３）本実施形態では、複数の画像データに対するマップを作成するものを説明したが、これに限定されない。１つの画像データに対するマップを作成するものであってもよい。その場合、マップに表示される円領域内の数は、全て１になる。 (Transformed form)
(1) In the present embodiment, the example of dividing into two groups and clustering in each group has been described as an example, but the present invention is not limited to this. It may be clustered as a whole without being divided into groups, or it may be divided into three or more groups and clustered in each group.
(2) In the present embodiment, the example of creating a map of a bubble chart has been described, but the present invention is not limited to this. A map that can be displayed by other methods may be used as long as the overall position can be grasped from a bird's-eye view.
(3) In the present embodiment, a map for creating a map for a plurality of image data has been described, but the present invention is not limited to this. A map for one image data may be created. In that case, the number in the circle area displayed on the map is all 1.

（４）本実施形態では、抽出した複数の単語から上位概念を抽出して、分類語を生成するものを例に説明したが、これに限定されない。例えば、抽出した複数の単語の出現頻度に基づいて、分類語を生成してもよい。また、例えば、画像データ全体の面積に対して、抽出した単語の元になる画像（特徴画像）の領域が占める割合を算出する画像割合算出手段を制御部が備え、例えば、割合の高い語を分類語として取得する等、算出した割合に応じて分類語を取得してもよい。
（５）本実施形態では、画像ＤＢは、通信ネットワークを介して接続されているものとして説明したが、これに限定されない。例えば、本実施形態であれば、画像マップ作成装置の記憶部に有していてもよい。 (4) In the present embodiment, a superordinate concept is extracted from a plurality of extracted words to generate a classified word as an example, but the present embodiment is not limited to this. For example, a classification word may be generated based on the frequency of occurrence of a plurality of extracted words. Further, for example, the control unit is provided with an image ratio calculation means for calculating the ratio of the area of the image (feature image) that is the source of the extracted word to the total area of the image data. The classification word may be acquired according to the calculated ratio, such as acquisition as the classification word.
(5) In the present embodiment, the image DB has been described as being connected via a communication network, but the present invention is not limited to this. For example, in the present embodiment, it may be stored in the storage unit of the image map creating device.

（６）本実施形態では、美術館や博物館で保有する絵画の画像データを対象とした例を説明したが、これに限定されない。画像化されているデータであればよく、例えば、特許図面の画像データであってもよいし、写真等の画像データであってもよい。
（７）本実施形態では、画像データのメタデータとして、その画像データを保有する美術館等の美術館ＩＤを有するものとして説明したが、これに限定されない。画像データが絵画の場合には、例えば、絵画の画家や、絵画のモチーフの年代等を、メタデータに含んでもよい。また、画像データが、写真の場合には、例えば、撮影日や撮影場所等を、メタデータに含んでもよい。
そして、類似度を、メタデータを使用して算出してもよい。例えば、絵画の場合、同じ画家のものであれば、類似度を高くしてもよい。 (6) In the present embodiment, an example in which image data of a painting held in a museum or a museum is targeted has been described, but the present invention is not limited to this. It may be any data that has been imaged, and may be, for example, image data of a patent drawing or image data such as a photograph.
(7) In the present embodiment, it has been described that the metadata of the image data has a museum ID of a museum or the like that holds the image data, but the present invention is not limited to this. When the image data is a painting, for example, the painter of the painting, the age of the motif of the painting, and the like may be included in the metadata. Further, when the image data is a photograph, for example, the shooting date, the shooting location, and the like may be included in the metadata.
Then, the similarity may be calculated using the metadata. For example, in the case of paintings, if they belong to the same painter, the degree of similarity may be increased.

（８）本実施形態では、関連画像を表示する際に、類似度を線の太さで表すものを例に説明したが、これに限定されない。例えば、類似度を、各画像間の距離によって表すようにしてもよい。その際、各画像間の距離を、画像の大きさによって表してもよい。具体例としては、マップから関連画像を表示させる際には、本実施形態で説明したものを出力し、さらに、ユーザが表示された関連画像から１つの画像を選択した場合に、選択した画像を中心にして、類似度を、奥行方向で表して出力するようにしてもよい。そのようにすることで、選択した画像との類似度合いを分かりやすく示すことができる。 (8) In the present embodiment, when displaying the related image, the similarity is expressed by the thickness of the line as an example, but the present invention is not limited to this. For example, the similarity may be expressed by the distance between each image. At that time, the distance between each image may be expressed by the size of the image. As a specific example, when displaying a related image from a map, the one described in this embodiment is output, and when one image is selected from the displayed related images by the user, the selected image is displayed. The similarity may be expressed in the depth direction and output at the center. By doing so, the degree of similarity with the selected image can be shown in an easy-to-understand manner.

１画像マップ作成装置
３画像ＤＢ
４表示装置
１０，４０制御部
１２単語抽出部
１４分類語生成部
１５マップ作成部
１７関連画像生成部
２０，５０記憶部
２１ａマップ作成プログラム
２２マップ記憶部
４１マップ出力部
４２要素受付部
４３画像出力部
５１ａ画像表示プログラム
５７タッチパネルディスプレイ
８０マップ
９０関連画像
１００画像マップ出力システム 1 Image map creation device 3 Image DB
4 Display device 10, 40 Control unit 12 Word extraction unit 14 Classification word generation unit 15 Map creation unit 17 Related image generation unit 20, 50 Storage unit 21a Map creation program 22 Map storage unit 41 Map output unit 42 Element reception unit 43 Image output Part 51a Image display program 57 Touch panel display 80 Map 90 Related images 100 Image map output system

Claims

A word extraction method that analyzes images to extract multiple words,
A classification word generation means for generating a classification word by clustering a plurality of words extracted by the word extraction means,
A map creation means that creates a map by counting the classification words generated by the classification word generation means, and
An image map creation device equipped with.

In the image map creating apparatus according to claim 1,
A classification means for dividing a plurality of words extracted by the word extraction means into a plurality of groups is provided.
The classification word generation means clusters the words of each group divided by the classification means to generate a classification word.
The map creation means creates the map classified by each classification word of at least two groups out of the plurality of groups divided by the classification means.
An image map creation device characterized by.

In the image map creating apparatus according to claim 1 or 2.
The classification word generation means generates a word corresponding to a superordinate concept as the classification word from a plurality of words extracted by the word extraction means.
An image map creation device characterized by.

In the image map creating apparatus according to any one of claims 1 to 3.
The classification word generation means generates the classification word based on the appearance frequency of a plurality of words extracted by the word extraction means.
An image map creation device characterized by.

In the image map creating apparatus according to any one of claims 1 to 3.
An image ratio calculation means for analyzing an image, extracting a plurality of feature images representing the features of the image, and calculating the ratio of the extracted feature image to the region of the image is provided.
The word extraction means extracts a word representing a feature of the feature image and obtains a word.
The classification word generation means generates the classification word from the words extracted by the word extraction means according to the ratio calculated by the image ratio calculation means.
An image map creation device characterized by.

In the image map creating apparatus according to any one of claims 1 to 5.
The word extraction means generates a descriptive text by referring to the words associated with each feature of the image, and extracts a plurality of words from the generated descriptive text.
An image map creation device characterized by.

In the image map creating apparatus according to any one of claims 1 to 6.
The classification word generation means generates a classification word by clustering a plurality of words extracted by the word extraction means from each of a plurality of images.
An image map creation device characterized by.

A display device communicatively connected to the image map creating device according to any one of claims 1 to 7.
A map output means for outputting the map created by the map creation means, and a map output means.
An element receiving means that is shown in the map output by the map output means and accepts the selection of the element associated with the classification word.
An image output means that outputs an image corresponding to the element received by the element reception means, and an image output means.
To prepare
A display device characterized by.

In the display device according to claim 8,
When there are a plurality of images corresponding to the elements received by the element receiving means, the image output means arranges each image according to the similarity of the plurality of images, and uses the plurality of images as related images. To output,
A display device characterized by.

In the display device according to claim 9,
The similarity is calculated based on at least one of the feature amount of each image and the word corresponding to each image.
A display device characterized by.

In the display device according to claim 9 or 10.
The related images represent the similarity by the length or thickness of the lines between the images or the distance in the depth direction from the reference image.
A display device characterized by.

A program for operating a computer as the image map creating device according to any one of claims 1 to 7.

A program for operating a computer as a display device according to any one of claims 8 to 11.