JP7329933B2

JP7329933B2 - Similar image retrieval device, similar image retrieval method and similar image retrieval program

Info

Publication number: JP7329933B2
Application number: JP2019038608A
Authority: JP
Inventors: 浩一黄瀬; 雅一岩村; 基岩田; 力工藤; 一視白井
Original assignee: Toshiba Digital Solutions Corp; University Public Corporation Osaka
Current assignee: Toshiba Digital Solutions Corp; University Public Corporation Osaka
Priority date: 2019-03-04
Filing date: 2019-03-04
Publication date: 2023-08-21
Anticipated expiration: 2039-03-04
Also published as: JP2020144440A

Description

この発明は、検索質問として画像が与えられた場合、分類情報が付与された複数の画像の中から、画像の形状に基づく類似性に加えて分類情報に基づく類似性を考慮して類似する画像を見出す装置、方法およびプログラムに関する。 When an image is given as a search query, the present invention selects similar images from among a plurality of images to which classification information has been assigned, considering the similarity based on the classification information in addition to the similarity based on the shape of the image. Apparatus, method and program for finding

近年、画像を検索質問として画像を検索する技術が注目されている。自動運転、セキュリティシステムあるいはロボティクス等、多くの分野への適用が期待されている。その多くは、画像の図形的、形状的特徴のみに基づいて検索を行うものである。
しかし、画像の中には分類体系が定められ、その分類体系に基づく分類情報が例えば人手によって付与されて整理、蓄積されてきた分野がある。一つの具体例として商標の審査、特に図形商標の審査が挙げられる。 In recent years, a technique for retrieving an image using an image as a retrieval query has attracted attention. It is expected to be applied to many fields such as autonomous driving, security systems, and robotics. Many of them perform searches based only on the graphical and shape features of the image.
However, there are fields in which a classification system has been established for images, and classification information based on the classification system has been manually assigned, sorted, and accumulated. One specific example is the examination of trademarks, especially figure trademarks.

商標は事業者や企業が自己の提供する商品やサービスに付ける標識であり、ブランドイメージを示す重要な役割を担っている。独占排他的な商標権を得るためには、特許庁に申請し、審査を受けて登録が認められる必要がある。商標の審査とは、申請された商標に対して、外観、観念、称呼の観点から、既に登録された商標（登録商標）等の中に類似したものがないかを確認する作業である。もし、登録商標の中に類似したものが見つかれば拒絶される。これはビジネスの機会損失につながる。そこで、特許庁への出願前に、商標審査官が行うようなシステムを用いて申請者が類似商標を検索できれば、商標出願後に拒絶されるリスクを低減でき、ビジネスの機会損失を防ぐことができると考えられる。
商標審査のプロセスを考えてみる。大量にあるすべての登録商標に対して類似性を人手で評価することは非現実的である。特に、図形やデザイン性を有する文字を含む商標について類似商標の検索は容易でない。そのため、特許庁では独自の商標検索システムを使って、登録商標を絞り込んでから、類似性を評価している。 Trademarks are signs attached to products and services that businesses and companies provide, and play an important role in showing brand images. In order to obtain exclusive trademark rights, it is necessary to apply to the Patent Office, undergo examination, and be approved for registration. Trademark examination is the process of confirming whether or not there are already registered trademarks (registered trademarks) that are similar to the registered trademark in terms of appearance, concept, and pronunciation. If a similar one is found among registered trademarks, it will be rejected. This leads to lost business opportunities. Therefore, if an applicant can search for similar trademarks using a system similar to that used by trademark examiners before filing an application with the Patent Office, it will be possible to reduce the risk of rejection after the trademark application and prevent loss of business opportunities. it is conceivable that.
Consider the trademark examination process. It is unrealistic to manually evaluate the similarity of all registered trademarks in large quantities. In particular, it is not easy to search for similar trademarks that include graphics or design characters. Therefore, the JPO uses its own trademark search system to narrow down the registered trademarks and then evaluates the similarity.

この明細書では、外観、観念、称呼のうち最も基本的で多数の事例を有する外観の類似を対象として説明を進める。外観が類似した商標を絞り込むには、商標の図形要素を分類したウィーン分類が用いられている。具体的には、出願商標に対して専門家が付与したウィーン分類コードをシステムに入力し登録商標のウィーン分類コードと照合する。審査を経た登録商標には、既にウィーン分類コードが付与されている。そして、同じ分類の図形が描かれた商標を出力する。このようにして絞り込まれた商標を対象に、商標審査官は出願商標との類似性を判別している。 In this specification, explanations will be directed to the most basic of the appearances, concepts, and appellations, and the analogy of appearances, which has a large number of instances. To narrow down trademarks that are similar in appearance, the Wiener Classification, which classifies the graphic elements of trademarks, is used. Specifically, the Vienna classification code given by the expert to the applied trademark is entered into the system and compared with the Vienna classification code of the registered trademark. Registered trademarks that have undergone examination have already been assigned a Vienna Classification Code. Then, the trademarks on which the figures of the same classification are drawn are output. The trademark examiner determines the similarity of the trademarks narrowed down in this way to the applied trademark.

従来から、商標の外観類似を検索する手法が幾つか提案されている。しかし、これは主に形状の類似性から類似商標を検索している。しかし、例えば、形状が異なっていても、同じ分類の図形が描かれていれば、その商標を類似しているとみなすケースがある。つまり外観類似の商標を検索する手法は、図形の形状のみならず分類も考慮しなければならない。 Conventionally, several methods have been proposed for retrieving appearance similarities of trademarks. However, it mainly searches for similar trademarks based on shape similarity. However, there are cases where, for example, even if the shapes are different, the trademarks are considered to be similar if the figures of the same classification are drawn. In other words, the method of retrieving similar-looking trademarks must consider not only the shape of figures but also their classification.

図形の分類も考慮した検索には、図形の分類に基づいた類似性を評価する必要がある。登録商標に関しては、我が国を含む多くの国の登録商標にウィーン分類コードが付与されている。一方、出願商標は新規の商標であるからウィーン分類コードが付与されていない。出願商標にウィーン分類コードを付与するには、専門家が持つ知識や経験則が不可欠である。しかし、我々が知る限り、ウィーン分類コードを自動的に付与する研究事例はない。
この問題を解決できる可能性のある技術としてディープラーニングがある。ディープラーニングを用いた画像分類では、人手を介さずに分類するうえで最適な特徴を抽出するため、分類精度が飛躍的に向上している。そこでディープラーニングを用いた画像分類についての関連研究を述べる。 Retrieval that also considers the classification of figures requires similarity evaluation based on the classification of figures. Regarding registered trademarks, the Vienna Classification Code is assigned to registered trademarks in many countries, including Japan. On the other hand, since the applied trademark is a new trademark, it has not been given a Vienna classification code. Expert knowledge and rules of thumb are indispensable for assigning a Vienna Classification Code to an applied trademark. However, as far as we know, there is no case study that automatically assigns the Vienna classification code.
Deep learning is a technology that has the potential to solve this problem. Image classification using deep learning extracts the optimal features for classification without human intervention, dramatically improving classification accuracy. Therefore, we describe related research on image classification using deep learning.

Saitoらの研究に、イラストに分類を付与するIllustration2vecがある（例えば、非特許文献１参照）。この研究では、イラストに描かれている要素を1,539個の分類で表し、入力したイラストに対して自動で分類を複数付与するネットワークを作成している。例えば、青い目で金髪の1人の少女が描かれたイラストをこのネットワークに入力すると、「青い目」、「１人」、「金髪」「少女」といった分類を付与することができる。また、Panらは、食品が映った画像を41種類の中から１種類に分類するDeepFoodの研究を行っている（例えば、非特許文献２参照）。各食品に対して100画像あるデータセットを用いて、three-fold cross validationで評価したところ、約85％以上の精度で分類することができている。このように近年、ディープラーニングを用いた画像分類の研究は目覚ましい成果を出している。また、Heらは、自然画像に映っている物体を1,000クラスの物体のなかから認識するResNetの研究を行っている（例えば、非特許文献３参照）。ResNetは、Large Scale Visual Recognition Challenge 2015という画像認識コンペティションにおいて第一位の精度を出しており、その認識エラー率が約4％と非常に低いことから画像分類手法の中でも代表的な手法となっている。 Research by Saito et al. includes Illustration2vec, which assigns classification to illustrations (see, for example, Non-Patent Document 1). In this research, the elements drawn in the illustration are represented by 1,539 classifications, and a network is created that automatically assigns multiple classifications to the input illustration. For example, if an illustration of a blue-eyed, blonde girl is input into the network, it can be classified as "blue-eyed," "single," "blonde," and "girl." In addition, Pan et al. are conducting research on DeepFood, which classifies images of food into one type out of 41 types (see, for example, Non-Patent Document 2). Using a data set of 100 images for each food, three-fold cross validation was used to classify the foods with an accuracy of about 85% or more. In this way, in recent years, research on image classification using deep learning has produced remarkable results. In addition, He et al. are conducting research on ResNet that recognizes objects in natural images from among 1,000 classes of objects (see, for example, Non-Patent Document 3). ResNet achieved the highest accuracy in the Large Scale Visual Recognition Challenge 2015 image recognition competition, and its recognition error rate is very low at about 4%, making it a representative method among image classification methods. there is

M Saito, and M Yusuke. “Illustration2vec: a semantic vector representation of illustrations.” SIGGRAPH Asia 2015 Technical Briefs. ACM, (2015), pp. 5:1-5:4M Saito, and M Yusuke. “Illustration2vec: a semantic vector representation of illustrations.” SIGGRAPH Asia 2015 Technical Briefs. ACM, (2015), pp. 5:1-5:4 C. Liu, Y. Cao, Y. Luo, G. Chen, V. Vokkarane, and Y. Ma, “Deepfood: ディープラーニング-based food image recognition for computer-aided dietary assessment,” International Conference on Smart Homes and Health Telematics. Springer, Cham, (2016), pp. 37-48C. Liu, Y. Cao, Y. Luo, G. Chen, V. Vokkarane, and Y. Ma, “Deepfood: Deep learning-based food image recognition for computer-aided dietary assessment,” International Conference on Smart Homes and Health Telematics. Springer, Cham, (2016), pp. 37-48 K. He, X. Zhang, S. Ren and J. Sun “Deep residual learning for image recognition.” Proceedings of the IEEE conference on computer vision and pattern recognition. (2016), pp.770-778K. He, X. Zhang, S. Ren and J. Sun “Deep residual learning for image recognition.” Proceedings of the IEEE conference on computer vision and pattern recognition. (2016), pp.770-778

前述のように、ウィーン分類コードが付与されていない出願商標に外観が類似した登録商標を検索するには、出願商標に相応しいウィーン分類コードを付与したうえで検索を行うことが好ましい。しかし、未付与の商標にウィーン分類コードを付与する作業は、一般の企業や事業者である申請者が行うには手間がかかり、また安定性に欠ける。ウィーン分類コードを自動的に付与することができれば、誰もが簡単に使え、より効率的な類似商標の検索手法が実現できる。
この発明は、以上のような事情を考慮してなされたものであって、予め分類情報が付与された複数の画像の中から、画像の形状に基づく類似性に加えて分類情報に基づく類似性を考慮して検索質問画像に類似する画像を見出す手法を提供するものである。例えば、外観が類似した商標を検索する手法として、図形の形状に基づく類似性に加えて図形の分類に基づく類似性を考慮した検索手法が実現される。 As described above, in order to search for registered trademarks similar in appearance to an applied trademark to which no Vienna classification code has been assigned, it is preferable to perform the search after assigning a suitable Vienna classification code to the applied trademark. However, the task of assigning Vienna Classification Codes to unassigned trademarks is troublesome and lacks stability for applicants who are ordinary companies or businesses. If the Vienna classification code can be assigned automatically, anyone can use it easily, and a more efficient search method for similar trademarks can be realized.
The present invention has been made in view of the circumstances described above, and is capable of determining similarity based on classification information in addition to similarity based on image shape from among a plurality of images to which classification information has been assigned in advance. is considered to provide a method for finding images similar to the query image. For example, as a method for retrieving trademarks that are similar in appearance, a search method that considers similarity based on the classification of figures in addition to similarity based on the shape of figures is implemented.

この発明は、
（１）検索質問画像から１以上の形状的特徴を抽出するクエリ特徴抽出部と、予め１以上の分類情報が付与されかつ１以上の形状的特徴が抽出された複数の参照画像を参照し、検索質問画像と形状的特徴の照合を行う形状照合部と、前記照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出部と、各参照画像に付与された分類情報に基づいて検索質問画像につき１以上の分類情報を推定し付与する分類付与部と、検索質問画像に付与された分類情報と各参照画像に付与された分類情報との一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出部と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力部と、を備える類似画像検索装置を提供する。 This invention
(1) A query feature extraction unit that extracts one or more shape features from a search query image, and a plurality of reference images to which one or more classification information is assigned in advance and from which one or more shape features are extracted, A shape matching unit that compares a search query image with shape features, a shape similarity calculation unit that calculates the shape similarity of each reference image to the search query image based on the matching, and a classification given to each reference image A classification assigning unit for estimating and assigning one or more pieces of classification information to a search question image based on information, and based on the degree of matching between the classification information assigned to the search question image and the classification information assigned to each reference image. a classification similarity calculation unit that calculates the classification similarity of each reference image to a search question image; and an output unit that outputs one or more reference images similar to the search question image based on the shape similarity and the classification similarity. To provide a similar image search device comprising:

（２）コンピュータが実行する処理であって、検索質問画像から１以上の形状的特徴を抽出するクエリ特徴抽出処理と、予め１以上の分類情報が付与されかつ１以上の形状的特徴が抽出された複数の参照画像を参照し、検索質問画像と形状的特徴の照合を行う形状照合処理と、前記照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出処理と、各参照画像に付与された分類情報に基づいて検索質問画像につき１以上の分類情報を推定し付与する分類付与処理と、検索質問画像に付与された分類情報と各参照画像に付与された分類情報との一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出処理と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力処理と、を備える類似画像検索方法を提供する。 (2) A process executed by a computer, which includes a query feature extraction process for extracting one or more shape features from a search question image, and one or more classification information given in advance and one or more shape features extracted. A shape matching process of referring to a plurality of reference images and matching the search question image and shape features, and a shape similarity calculation process of calculating the shape similarity of each reference image to the search question image based on the matching. , a classification assignment process for estimating and assigning one or more classification information to a search question image based on the classification information assigned to each reference image; and the classification information assigned to the search question image and the classification assigned to each reference image. a classification similarity calculation process for calculating the classification similarity of each reference image to a search question image based on the degree of matching with information; and one or more similar to the search question image based on the shape similarity and the classification similarity and an output process for outputting the reference image of the similar image retrieval method.

（３）コンピュータに処理を実行させるプログラムであって、コンピュータに処理を実行させるプログラムであって、検索質問画像から１以上の形状的特徴を抽出するクエリ特徴抽出処理と、予め１以上の分類情報が付与されかつ１以上の形状的特徴が抽出された複数の参照画像を参照し、検索質問画像と形状的特徴の照合を行う形状照合処理と、前記照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出処理と、各参照画像に付与された分類情報に基づいて検索質問画像につき１以上の分類情報を推定し付与する分類付与処理と、検索質問画像に付与された分類情報と各参照画像に付与された分類情報との一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出処理と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力処理と、を備える類似画像検索プログラムを提供する。 (3) A program for causing a computer to execute processing, which is a program for causing a computer to execute processing, which is a query feature extraction process for extracting one or more shape features from a search question image, and one or more classification information in advance and a plurality of reference images from which one or more shape features are extracted, shape matching processing for matching the search question image and shape features, and each reference image for the search question image based on the matching A shape similarity calculation process for calculating the shape similarity of each reference image, a classification assignment process for estimating and assigning one or more classification information to the search question image based on the classification information assigned to each reference image, and a classification assignment process for assigning to the search question image a classification similarity calculation process for calculating a classification similarity of each reference image to a search query image based on the degree of matching between the classification information assigned to each reference image and the classification information assigned to each reference image, and the shape similarity and the classification and output processing for outputting one or more reference images similar to a query image based on similarity.

この発明による類似画像検索装置は、照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出部と、検索質問画像に分類情報を付与する分類付与部と、分類情報の一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出部と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力部と、を備えるので、画像の形状に基づく類似性に加えて分類情報に基づく類似性を考慮して検索質問画像に類似する画像を見出すことができる。
この発明による類似画像検索方法および類似画像検索プログラムも同様である。 A similar image retrieval apparatus according to the present invention includes a shape similarity calculation unit that calculates the shape similarity of each reference image to a search question image based on matching, a classification imparting unit that assigns classification information to the search question image, and a classification information a classification similarity calculation unit that calculates the classification similarity of each reference image to the search question image based on the degree of matching of the reference image, and one or more references similar to the search question image based on the shape similarity and the classification similarity and an output unit that outputs an image, so that an image similar to the search query image can be found by considering the similarity based on the classification information in addition to the similarity based on the shape of the image.
The same applies to the similar image retrieval method and similar image retrieval program according to the present invention.

この実施形態に係るウィーン分類コードの分類体系を示す説明図である。FIG. 4 is an explanatory diagram showing the classification system of the Wien classification code according to this embodiment; この実施形態に係る登録商標に付与されているウィーン分類コードの一例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of a Vienna classification code assigned to a registered trademark according to this embodiment; この実施形態に係る登録商標に付与されているウィーン分類コードの異なる一例を示す説明図である。FIG. 10 is an explanatory diagram showing a different example of the Vienna classification code assigned to the registered trademark according to this embodiment; この実施形態に係る登録商標に付与されているウィーン分類コードのさらに異なる一例を示す説明図である。FIG. 11 is an explanatory diagram showing a further different example of the Vienna classification code assigned to the registered trademark according to this embodiment; この実施形態に係る外観類似商標検索の処理の流れの概要を示す説明図である。FIG. 4 is an explanatory diagram showing an overview of the flow of processing for searching for similar-appearance trademarks according to this embodiment; 図５に示す処理のうちウィーン分類コード付与の詳細を示す説明図である。FIG. 6 is an explanatory diagram showing details of giving a Wien classification code among the processes shown in FIG. 5; この実施形態において分類類似度ａの算出例を示す説明図である。It is explanatory drawing which shows the calculation example of the classification similarity a in this embodiment. この実施形態における実験例で、ウィーン分類コードの大分類の学習に用いたデータの数を大分類毎に示すグラフである。10 is a graph showing the number of data used for learning the large classification of the Wien classification code for each large classification in an experimental example in this embodiment. この実施形態において予測精度が最も高い閾値パラメータＮと重みパラメータｍおよびその周辺の値を用いた結果を示すグラフである。It is a graph which shows the result using the threshold parameter N and the weight parameter m with the highest prediction accuracy in this embodiment, and its peripheral value. この実施形態において、機械付与されたウィーン分類コードの具体例を示す図である。FIG. 10 is a diagram showing a specific example of a Wien classification code given by machine in this embodiment; この実施形態において、機械付与されたウィーン分類コードの異なる例を示す図である。FIG. 4 shows different examples of machine-assigned Wien Classification codes in this embodiment; この実施形態において、機械付与されたウィーン分類コードのさらに異なる例を示す図である。FIG. 11 shows yet another example of a machine-assigned Wien Classification code in this embodiment; この実施形態による手法と専門家による付与の検索手法と既存手法との11点補完平均適合率を示すグラフである。It is a graph which shows the 11-point complement average precision of the method by this embodiment, the search method of the assignment|providing by an expert, and the existing method. この実施形態による外観類似商標の検索結果の一例を示す説明図である。FIG. 10 is an explanatory diagram showing an example of search results of appearance-similar trademarks according to this embodiment; この実施形態による外観類似商標検索の異なる例を示す説明図である。FIG. 10 is an explanatory diagram showing a different example of appearance-similar trademark search according to this embodiment;

以下、図面を用いてこの発明をさらに詳述する。なお、以下の説明は、すべての点で例示であって、この発明を限定するものと解されるべきではない。
以下、分類情報が付与され、整理、蓄積されてきた画像の一例としてウィーン分類コードが付与された登録商標を挙げ、具体的な画像検索について述べる。
≪ウィーン分類とコード体系≫
この発明に関連するウィーン分類について、初めに述べておく。
ウィーン分類とは、1973年にオーストリアのウィーンで作成された商標に含まれる図形要素を形状などの特徴によって分類した国際指標である。わが国では2004年度からウィーン分類を商標審査に利用している。 The present invention will be described in further detail below with reference to the drawings. It should be noted that the following description is illustrative in all respects and should not be construed as limiting the present invention.
As an example of images that have been assigned classification information and have been sorted and accumulated, a registered trademark assigned with a Vienna classification code will be given below, and a specific image search will be described.
≪Vienna Classification and Coding System≫
First, the Wien classification related to this invention will be described.
The Vienna Classification is an international index created in 1973 in Vienna, Austria, that classifies figurative elements contained in trademarks according to features such as shape. In Japan, the Vienna Classification has been used for trademark examination since 2004.

ウィーン分類は、図１に示すとおり大分類、中分類、小分類の木構造をもち、それぞれ29/144/1887の分類から成りたっている。例えば、大分類は表１に示すように、29の区分に分けられている。
大分類1（天体、自然現象、地図）のように物体や現象を表す分類や、大分類２６（幾何学図形及び立体的幾何学図形）のように図形の形状を表す分類などがある。また大分類１（天体、自然現象、地図）に属する中分類に着目すると、中分類1.1（星、彗星）、中分類1.3（太陽）、中分類1.5（地球、地球儀、惑星）、中分類1.7（月）のように枝分かれし、中分類1.1（星、彗星）では、小分類1.1.1（星）、小分類1.1.2（一つの星）、小分類1.1.3（二つの星）さらに細かく分けられている。つまりウィーン分類では下層の分類ほど図形の複雑な要素を示すことができる。
基本的にウィーン分類コードは、大分類、中分類、小分類の組み合わせで表現されるが、日本では独自に小分類をもう一段階細かく分類した細分類を設けている。 As shown in Fig. 1, the Wien Classification has a tree structure of large, medium, and small classifications, each of which consists of 29/144/1887 classifications. For example, major categories are divided into 29 categories as shown in Table 1.
There are classifications such as major category 1 (celestial bodies, natural phenomena, maps) representing objects and phenomena, and major category 26 (geometric figures and three-dimensional geometric figures) representing shapes of figures. Focusing on the major categories belonging to major category 1 (celestial bodies, natural phenomena, and maps), major category 1.1 (stars and comets), major category 1.3 (sun), major category 1.5 (earth, globes and planets), and major category 1.7 (Moon), major category 1.1 (stars, comets), minor category 1.1.1 (stars), minor category 1.1.2 (one star), minor category 1.1.3 (two stars) finely divided. In other words, in the Wien classification, the lower the classification, the more complicated elements of the figure can be indicated.
Basically, the Vienna Classification Code is expressed as a combination of large, medium, and small classifications, but Japan has its own fine classifications, which are one level finer than the small classifications.

実際に、登録商標に付与されているウィーン分類コードの例を図２から図４に示す。各図には、上側に対象となる商標を示し、下側の表の各行が商標に付与されたウィーン分類コードを示している。また、表の１列目に付与されたウィーン分類コードを示し、２列目に大分類、３列目に中分類、４列目に小分類、５列目に細分類を示している。 Examples of Vienna classification codes actually assigned to registered trademarks are shown in FIGS. 2 to 4. FIG. Each figure shows the trademark in question on the top, and the Vienna classification code given to the trademark on each row of the table on the bottom. The first column of the table shows the assigned Wien classification code, the second column shows the major classification, the third column shows the middle classification, the fourth column shows the minor classification, and the fifth column shows the fine classification.

図２で扱う商標（登録商標第１４０６５８９号）は、「ＴＡＭＩＹＡ」と書かれた登録商標でありアルファベットの上部に正方形の中に星が描かれた図が２つ並んでいる。この登録商標の図形を表した部分は、正方形の中に星が描かれた部分であるため、大分類１（天体、自然現象、地図）と大分類２６（幾何学図形及び立体的幾何学図形）に属する７個のウィーン分類コードが付与されている。その中で、「星」を表現しているウィーン分類コードは、大分類１（天体、自然現象、地図）、中分類１（星、彗星）、小分類１（星）となり1.1.1と表される。またこの星に対して、1.1.1（星）と1.1.3（二つの星）と内容が重複されているウィーン分類コードが付与されている。このような内容が重複しているウィーン分類コードの付与は、基本的には検索漏れを防ぐために行われていると考えられるが、その判断は専門家の知識や経験則によるものである。 The trademark (Registered Trademark No. 1406589) dealt with in FIG. 2 is a registered trademark written as "TAMIYA", and two figures in which a star is drawn in a square are arranged above the alphabet. Since the part representing the figure of this registered trademark is the part where a star is drawn in a square, ) have been given 7 Viennese classification codes. Among them, the Vienna classification code that expresses "stars" is major category 1 (celestial bodies, natural phenomena, maps), major category 1 (stars, comets), minor category 1 (stars), and is listed as 1.1.1. be done. This star has also been assigned the Vienna Classification Code, which overlaps with 1.1.1 (star) and 1.1.3 (two stars). The assignment of such duplicate Vienna classification codes is considered to be basically done to prevent search omissions, but such judgments are based on expert knowledge and empirical rules.

次に、図３に文字を図形のように変形した登録商標（登録商標第３２２０４４８号）と付与されたウィーン分類コードを示す。この例では、「Ｈｉｓａｍｉｔｓｕ」と書かれた「Ｈ」の部分に3本の横線が引かれており、特殊な書体となっている。以上のことから、ウィーン分類コード27.5.1.8（H,h）が付与されている。また、ウィーン分類コード27.5.21（一文字）が付与されているように、ウィーン分類コードが付与されている箇所は、「Ｈ」の部分だけであり、残りの「ｉｓａｍｉｔｓｕ」にはウィーン分類コードが付与されていない。このように、どの文字が図形要素を表しているかを判断する基準も専門家の知識や経験則による。 Next, FIG. 3 shows the registered trademark (registered trademark No. 3220448) and the Vienna classification code, which is obtained by transforming characters into figures. In this example, three horizontal lines are drawn in the "H" part written as "Hisamitsu", which is a special typeface. For these reasons, the Vienna classification code 27.5.1.8 (H,h) has been assigned. In addition, as the Vienna classification code 27.5.21 (one letter) is given, the part where the Vienna classification code is given is only the "H" part, and the remaining "isamitsu" has the Vienna classification code. Not granted. In this way, the criteria for judging which character represents a graphic element also depend on expert knowledge and empirical rules.

最後に図４にキャラクターを用いた登録商標（登録商標第４６２４３７２号）と付与されたウィーン分類コードを示す。この登録商標には、円に手足の生えたキャラクターが描かれており、７個のウィーン分類コードが付与されている。ウィーン分類コード4.5.5に着目すると、「その他の確認できない物の擬人化、その他人間の姿をした空想の物」とある。ここで示す「その他」とは、中分類4.5の下層の他のウィーン分類コード（例えば4.5.1（擬人化した植物、擬人化した植物の組合せ）や、4.5.4（人間の容姿のロボット）など）で表現できないものを指す。このように「その他」を含むウィーン分類コードは、対象となる層の他のウィーン分類コードも考慮する必要があるため、専門家の知識が必要となる。 Finally, FIG. 4 shows the registered trademark (registered trademark No. 4624372) using characters and the given Vienna classification code. The registered trademark depicts a circular character with limbs and has been given seven Vienna Classification codes. Focusing on the Vienna Classification Code 4.5.5, it states "personification of other unidentifiable things and other imaginary things in human form." "Others" here refers to other Vienna classification codes under the middle classification 4.5 (e.g. 4.5.1 (anthropomorphic plants, combinations of anthropomorphic plants) and 4.5.4 (human-looking robots) etc.). Such a Wien Classification code that includes "Others" requires expert knowledge because other Wien Classification codes of the target layer must also be considered.

≪外観類似商標検索の概要≫
図５は、この実施形態に係る外観類似商標検索の処理の流れの概要を示す説明図である。図５に示すように、外観類似商標検索の処理は、大別すると、登録処理１１と検索処理２１に分けられる。
この実施形態において、登録処理１１および検索処理２１は、ＣＰＵ（中央処理装置）やメモリ等のハードウェア資源を有するコンピュータが、メモリに予め格納された処理プログラムを実行することによって実現される。
登録処理１１は、前記コンピュータがデータベースに商標画像を登録する処理である。コンピュータは、登録すべき商標画像（図５に示す登録商標１３、以下、データベース画像ともいう）から局所特徴を抽出する（図５に示す特徴抽出処理１５）。そして、抽出された局所特徴を、登録商標１３に付与されたウィーン分類コードと関連付けてデータベース１７に登録する。
検索処理２１は，コンピュータが検索質問２３の図形としての形状とウィーン分類とに基づく類似性をそれぞれ測り、それらの類似性から外観が類似した商標を検索する処理である。
ここで、検索処理を実行するコンピュータは、２１登録処理１１を実行するコンピュータと同一のものであってもよいが、異なるものであってもよい。 ≪Overview of Similar Appearance Trademark Search≫
FIG. 5 is an explanatory diagram showing an overview of the flow of processing for searching for similar-appearance trademarks according to this embodiment. As shown in FIG. 5, the appearance-similar trademark search process can be roughly divided into a registration process 11 and a search process 21 .
In this embodiment, the registration process 11 and the search process 21 are realized by a computer having hardware resources such as a CPU (Central Processing Unit) and memory executing a processing program stored in advance in the memory.
Registration processing 11 is processing for the computer to register the trademark image in the database. The computer extracts local features from the trademark image to be registered (registered trademark 13 shown in FIG. 5, hereinafter also referred to as database image) (feature extraction processing 15 shown in FIG. 5). Then, the extracted local features are registered in the database 17 in association with the Vienna classification code assigned to the registered trademark 13 .
The search process 21 is a process in which the computer measures the similarity based on the shape as a graphic of the search question 23 and the Wien classification, and searches for trademarks with similar appearances based on the similarity.
Here, the computer that executes the search process may be the same as the computer that executes the 21 registration process 11, or may be different.

さらに検索処理２１は、図形の形状に基づく類似性を測る処理と、図形の分類に基づく類似性を測る処理に分けられる。
はじめに図形の形状に基づく類似性を測る処理について述べる。
コンピュータは、まずクエリ（図５に示す検索質問２３）となる商標画像（以下、クエリ画像）から局所特徴を抽出する（図５に示すクエリ特徴抽出処理２５）。次に、データベース１７に登録されたそれぞれのデータベース画像について、そのデータベース画像から抽出された局所特徴とクエリ画像から得られた各局所特徴とを照合する（図５に示す形状照合処理２７）。そして、照合された各局所特徴が信頼性のある特徴かどうかを評価する検証処理を行う（図５に示す検証処理２９）。
検証処理２９においてコンピュータは、局所的には類似するが大域的には類似しない箇所の局所特徴の照合を信頼性の低いものとみなし、照合された局所特徴を形状類似度の算出に含めないように廃棄する処理を行う。言い換えると、各局所特徴の位置関係を考慮することにより非類似商標の検索を抑制する。
より詳細には、照合された局所特徴の信頼性を、他の照合された局所特徴との相対的な位置関係を基準に評価することにより判断し、信頼性の低いものを廃棄する。
ここでいう信頼性は、照合された局所特徴が形状類似度の決定に貢献するか否かを指す。各局所特徴は、画像の局所的な領域に着目した特徴であることから、大域的には似ていない画像の局所特徴と対応づくことがある。そのような局所特徴は、形状類似度の決定に貢献するものといえずノイズでしかない。検証処理２９においてコンピュータは、そのようなノイズを除去する。
最後にコンピュータは、信頼性のある局所特徴と評価された照合数に基づいて、データベース画像の図形の形状に基づく類似度ｓ（以下、形状類似度ｓとする）を測る（図５に示す形状類似度算出処理３１）。
検証処理２９および形状類似度算出処理３１の詳細については工藤らの研究（R. Kudo, M Iwata, M. Iwamura, and K Koichi, “Proposal of Similar Image Retrieval System Using Large Scale Trademark Image Database,” Journal of the Imaging Society of Japan, 57, pp.403-412 (2018)）を参照されたい。 Further, the search processing 21 is divided into processing for measuring similarity based on the shapes of figures and processing for measuring similarities based on the classification of figures.
First, the process of measuring the similarity based on the shape of figures will be described.
The computer first extracts local features from a trademark image (hereinafter, query image) that serves as a query (search question 23 shown in FIG. 5) (query feature extraction processing 25 shown in FIG. 5). Next, for each database image registered in the database 17, the local feature extracted from the database image and each local feature obtained from the query image are matched (shape matching process 27 shown in FIG. 5). Then, verification processing is performed to evaluate whether each collated local feature is a reliable feature (verification processing 29 shown in FIG. 5).
In the verification process 29, the computer considers the matching of local features in locations that are locally similar but not globally similar to be unreliable, and does not include the matched local features in the calculation of the shape similarity. to dispose of. In other words, the search for dissimilar trademarks is suppressed by considering the positional relationship of each local feature.
More specifically, the reliability of the matched local features is determined by evaluating the relative positional relationship with other matched local features, and those with low reliability are discarded.
Reliability here refers to whether or not the matched local features contribute to the determination of shape similarity. Since each local feature is a feature focused on a local region of an image, it may correspond to a local feature of an image that is globally dissimilar. Such local features do not contribute to the determination of shape similarity and are merely noise. In verification process 29 the computer removes such noise.
Finally, the computer measures the similarity s (hereinafter referred to as shape similarity s) based on the shape of the figure in the database image based on the number of matches evaluated as reliable local features (shape similarity s shown in FIG. 5). Similarity calculation processing 31).
Details of the verification process 29 and the shape similarity calculation process 31 can be found in the research by Kudo et al. of the Imaging Society of Japan, 57, pp.403-412 (2018)).

局所特徴を用いた手法は、大域的にも部分的にも類似した商標の検索を行うことができる。しかし、部分的に似ている商標が類似商標とは限らないため、この手法は同時に非類似商標の検索の原因にもなる。
この問題を解決するために、この実施形態では局所特徴に基づく検索手法に対して、非類似商標の検索を抑制する処理と類似商標の検索を促進する処理を導入した。前者の非類似商標の抑制については、局所特徴の位置関係を考慮することにより対応している。後者の類似商標を促進する処理については擬似適合フィードバック（Pseudo relevance feedback、詳細はR. Kudo, M. Iwata, M. Iwamura, and K. Kise, “Improvement of Trademark Retrieval Using Pseudo Relevance Feedback,” 11th Indian Conference on Vision, Graphics and Image Processing, (2018) 参照）を導入すること対応している。
しかし、これらの手法を用いてもなおすべての類似商標を検索できるわけではない。図形の形状の類似度のみで商標の外観類似を評価するだけでなく、図形の分類に基づく類似性も考慮する必要がある。 Techniques using local features can search for globally and partially similar trademarks. However, since partially similar trademarks are not necessarily similar trademarks, this method also causes searches for dissimilar trademarks.
In order to solve this problem, in this embodiment, a process of suppressing retrieval of dissimilar trademarks and a process of promoting retrieval of similar trademarks are introduced into the retrieval method based on local features. The former suppression of dissimilar trademarks is handled by considering the positional relationship of local features. Pseudo relevance feedback, see R. Kudo, M. Iwata, M. Iwamura, and K. Kise, “Improvement of Trademark Retrieval Using Pseudo Relevance Feedback,” 11th Indian Conference on Vision, Graphics and Image Processing, (2018)).
However, even with these methods, it is still not possible to retrieve all similar trademarks. It is necessary not only to evaluate the appearance similarity of trademarks based on the degree of similarity in shape of figures, but also to consider the similarity based on the classification of figures.

次に、図形の分類に基づく類似性を測る処理について述べる。コンピュータはまず、クエリ画像にウィーン分類コードを付与する（図５に示すウィーン分類コード付与処理３３による機械付与）。一以上のウィーン分類コードが付与される。次に、クエリ画像のウィーン分類コードとデータベース画像のウィーン分類コードの一致した数から図形の分類に基づく類似度ａ（以下、分類類似度ａとする）を測る（図５に示す分類類似度算出処理３５）。これらの処理を経て、得られた形状類似度ｓと分類類似度ａとに基づいて、総合的な類似度が高い順に出力する画像を決定する。
前述のように、類似商標の検索精度を向上させるために、この実施形態では擬似適合フィードバックを採用している。以下、各処理の詳細について述べる。 Next, processing for measuring similarity based on the classification of graphics will be described. The computer first assigns a Wien classification code to the query image (machine assignment by the Wien classification code assignment process 33 shown in FIG. 5). One or more Vienna classification codes are assigned. Next, the similarity a (hereinafter referred to as classification similarity a) based on the classification of the figure is measured from the number of matches between the Wien classification code of the query image and the Wien classification code of the database image (classification similarity calculation shown in FIG. 5). process 35). Based on the obtained shape similarity s and classification similarity a through these processes, an image to be output is determined in descending order of overall similarity.
As mentioned above, in order to improve the search accuracy of similar trademarks, this embodiment employs pseudo-match feedback. The details of each process will be described below.

〔ウィーン分類コード付与処理〕
コンピュータは、ディープラーニング（Deep Learning）の手法を適用して、クエリ画像としての商標にウィーン分類コードの付与を行う。ウィーン分類コード付与処理は、学習処理と付与処理の２つの段階に分けられる。
それらのうち学習処理では、既にウィーン分類コードが付与されている登録商標を用いて、各登録商標の図形を入力とし、その登録商標に付与されたウィーン分類コードの1,644分類の小分類を出力とした場合の入出力関係について学習し、学習済みモデルを生成する。この実施形態では日本独自の指標である細分類は学習の対象としないものとする。すなわち、細分類を含んだウィーン分類コードが付与されている登録商標については、細分類を除いた大分類・中分類・小分類までを学習に用いる。例えば、ウィーン分類コード2.9.1.2（トランプのハート）は2.9.1（心臓）として学習を行う。 [Vienna classification code assigning process]
The computer applies a deep learning technique to assign a Wien classification code to the trademark as the query image. The Wien classification code assigning process is divided into two stages, a learning process and an assigning process.
Among them, in the learning process, registered trademarks that have already been assigned a Vienna classification code are used, the figure of each registered trademark is input, and 1,644 small classifications of the Vienna classification code assigned to the registered trademark are output. It learns the input-output relationship when In this embodiment, fine classification, which is an index unique to Japan, is not subject to learning. That is, for a registered trademark assigned a Vienna classification code that includes a detailed classification, the major, middle, and minor classifications excluding the detailed classification are used for learning. For example, the Vienna classification code 2.9.1.2 (Trump's Heart) learns as 2.9.1 (Heart).

ウィーン分類コード付与処理３３の学習処理に用いるディープラーニングのネットワークの種類は、画像認識で代表的なネットワークであるResNet（Residual Network）とする。ResNetは、50層、101層、152層の3種の層を用いたモデルが一般的だが、この実施形態では最もシンプルなResNet50を用いる。
ResNet50で扱うloss関数は、通常シングルラベルを扱うsoftmax cross entropyが用いられる。しかし、一つの登録商標に対して唯一つのウィーン分類コードが付与、即ちラベル付けされるだけでなく複数が付与されることがあるので、この実施形態ではマルチラベルを扱うためsigmoid cross entropyを用いる。sigmoid cross entropyを用いることにより、対象となるそれぞれのウィーン分類コードが存在するか、もしくは存在しないという二値のマルチラベル分類を行う。またsigmoid関数を通すことで、出力されるベクトルの要素は[-∞, ∞]から[0, 1]の値に正規化される。正規化された値が１に近いほど、対応したウィーン分類コードが商標に付与されている可能性が高いと考えられる。またウィーン分類コードの付与精度を向上させる手法として、学習に用いるデータの正規化やData augmentationを用いる。 The type of deep learning network used for the learning process of the Wien classification code assigning process 33 is ResNet (Residual Network), which is a typical network for image recognition. ResNet is generally a model that uses three types of layers, 50 layers, 101 layers, and 152 layers, but this embodiment uses the simplest ResNet50.
The loss function handled by ResNet50 usually uses softmax cross entropy that handles single labels. However, one registered trademark may be assigned not only one Wien classification code, ie, labeled, but also multiple labels, so this embodiment uses sigmoid cross entropy to handle multi-labels. By using sigmoid cross entropy, we perform binary multi-label classification where each Wien classification code of interest exists or does not exist. Also, by passing the sigmoid function, the elements of the output vector are normalized from [-∞, ∞] to [0, 1]. The closer the normalized value is to 1, the more likely it is that the corresponding Vienna Classification code has been assigned to the trademark. In addition, normalization of data used for learning and data augmentation are used as techniques for improving the accuracy of assigning the Wien classification code.

次に、ウィーン分類コード付与処理３３の付与処理について述べる。付与処理の流れを図６に示す。コンピュータはまず、ウィーン分類コードを付与すべき新規の商標（検索質問２３）を上述の学習済みモデルに入力する。そして、前記学習済みモデルから入力した商標に対するウィーン分類コードの出力を得る。前記学習済みモデルは、出力として1,644次元のベクトルを出力する。出力するベクトルの各次元の要素は、学習を行ったウィーン分類コードの各小分類と対応している。図６に示す検索質問は、新規の商標に代えて、登録商標第４７７２２３４号を入力した例を示している。コンピュータは、各次元について、要素が閾値Ｎ以上の場合その要素に対応するウィーン分類コードを検索質問２３の商標に対して付与する。例えば、ウィーン分類コード2.5.6（乳児）に対応するベクトルの要素が0.78の場合、Ｎ＝0.5ならば検索質問２３にウィーン分類コード2.5.6を付与する。しかし、Ｎ＝0.8ならば、そのウィーン分類コードを付与しない（図６参照）。ここで用いる閾値パラメータＮは、後述の実験により決定する。 Next, the assignment processing of the Vienna classification code assignment processing 33 will be described. FIG. 6 shows the flow of provision processing. The computer first inputs a new trademark (search query 23) to which a Vienna classification code is to be assigned into the trained model described above. Then, the output of the Vienna classification code for the input trademark is obtained from the trained model. The trained model outputs a 1,644-dimensional vector as output. Each dimensional element of the output vector corresponds to each small classification of the Wien classification code that has been trained. The search question shown in FIG. 6 shows an example in which registered trademark No. 4772234 is entered instead of a new trademark. For each dimension, the computer assigns the Wien classification code corresponding to the element to the trademark of the search query 23 if the element is equal to or greater than the threshold N. For example, if the element of the vector corresponding to the Wien classification code 2.5.6 (infant) is 0.78, then if N=0.5, the query 23 is given the Wien classification code 2.5.6. However, if N=0.8, the Wien classification code is not assigned (see FIG. 6). The threshold parameter N used here is determined by experiments described later.

〔分類類似度算出処理〕
図５に示す分類類似度算出処理３５で、コンピュータは、ウィーン分類コードを用いてデータベース画像の分類類似度ａを算出する。すなわちクエリ画像とデータベース画像に付与された同じウィーン分類コードの数から分類類似度aを算出する。この場合、該当するウィーン分類コードの数をそのまま分類類似度aと定義すると、データベース画像に付与されたウィーン分類コードの数が多いほど、分類類似度aが高くなる。よって、この実施形態で用いる分類類似度aは、データベース画像に付与されたウィーン分類コード数の値で正規化する。
つまりクエリ画像の商標xに付与されたウィーン分類コードの集合がXであり、データベース画像yに付与されたウィーン分類コードの集合がYである場合、分類類似度aを、以下のように定義する。ただし、|Y|はデータベース画像yに付与されたウィーン分類コードの数を表す。 [Classification similarity calculation process]
In the classification similarity calculation process 35 shown in FIG. 5, the computer calculates the classification similarity a of the database image using the Wien classification code. That is, the classification similarity a is calculated from the number of the same Wien classification codes given to the query image and the database image. In this case, if the number of corresponding Wien classification codes is defined as the classification similarity a as it is, the more the number of Wien classification codes given to the database image, the higher the classification similarity a. Therefore, the classification similarity a used in this embodiment is normalized by the number of Wien classification codes assigned to database images.
In other words, if the set of Wien classification codes assigned to the trademark x in the query image is X and the set of Wien classification codes assigned to the database image y is Y, then the classification similarity a is defined as follows: . where |Y| represents the number of Wien classification codes assigned to the database image y.

なお、正規化をする際に、平方根をとったデータベース画像のウィーン分類コード数の値で除算することは、事前実験による知見から得たものである。
分類類似度aの算出例を図７に示す。このクエリ画像にはウィーン分類コードが3個付与されており、データベース画像（登録商標第４４８８６４７号）にはウィーン分類コードが5個付与されている。また同じウィーン分類コードが付与された数は2個ある。すなわち図７の例では、データベース画像の分類類似度aは2/√5となる。 It should be noted that dividing by the value of the number of Wien classification codes of the database image whose square root is taken in the normalization is obtained from knowledge obtained from prior experiments.
FIG. 7 shows a calculation example of the classification similarity a. Three Wien classification codes are assigned to this query image, and five Wien classification codes are assigned to the database image (Registered Trademark No. 4488647). Also, there are two numbers with the same Vienna classification code. That is, in the example of FIG. 7, the classification similarity a of the database image is 2/√5.

〔出力処理〕
以上のプロセスを経た後に、コンピュータはクエリ画像と照合されたデータベース画像をすべて出力する（図５に示す出力処理３７）。出力処理３７で、コンピュータはデータベース画像の形状類似度ｓと分類類似度ａをもとに総合類似度ｓ’を算出し、総合類似度ｓ’の値が高い順に出力を行う。算出には以下の式を用いる。
（２）式に示すように、データベース画像の形状類似度ｓと分類類似度ａの重みを示す定数パラメータｍに対して形状類似度ｓと分類類似度ａとは相補的な関係にある。なお、重みパラメータｍの値は、後述の実験により決定する。 [Output processing]
After going through the above process, the computer outputs all the database images matched with the query image (output process 37 shown in FIG. 5). In the output process 37, the computer calculates the overall similarity s' based on the shape similarity s and the classification similarity a of the database image, and outputs the values in descending order of the overall similarity s'. The following formula is used for calculation.
As shown in the formula (2), the shape similarity s and the classification similarity a have a complementary relationship with respect to the constant parameter m indicating the weight of the shape similarity s and the classification similarity a of the database image. Note that the value of the weight parameter m is determined by an experiment to be described later.

〔擬似適合フィードバック（Pseudo relevance feedback）〕
擬似適合フィードバック（図５の擬似適合フィードバック処理３９参照）は、検索質問２３を拡張して検索漏れを低減する手法である。まず、元のクエリ画像（以下、初期クエリ）を使用して、出力処理を行い、総合類似度ｓ’の高い順に上位Ｎ個の画像（初期出力）を得る。その後、上位Ｎ個の前記画像を新たな検索質問２３として再度検索を行う。再度の検索によって得られた出力を用いて初期出力を更新し、最終出力４１を得る。この手法を用いることにより、初期検索で欠落した類似商標を最終出力４１に含めることができる。以下の式を用いて各商標に対する最終の総合類似度ｓ’’を得る。
（３）式で、s’₀は、初期クエリを用いた場合の総合類似度s’を示す。ｓ’_kは、検索結果ｋ位の商標を検索質問に用いた場合の総合類似度s’を示す。 [Pseudo relevance feedback]
Pseudo-relevant feedback (see Pseudo-relevant feedback process 39 in FIG. 5) is a technique for extending the search query 23 to reduce search omissions. First, using the original query image (hereinafter referred to as initial query), output processing is performed to obtain the top N images (initial output) in descending order of total similarity s′. After that, the top N images are used as a new search question 23 and the search is performed again. The output obtained by the second search is used to update the initial output and obtain the final output 41 . By using this technique, similar trademarks missing in the initial search can be included in the final output 41 . The following formula is used to obtain the final overall similarity score s'' for each trademark.
In equation (3), _s'0 indicates the total similarity s' when the initial query is used. s' _k indicates the overall similarity s' when the k-ranked trademark in the search results is used in the search query.

≪実験≫
この実施形態による手法の性能を調査するために２種類の実験を行った。
第１の実験では、この手法の検索に用いる適切な検索パラメータ、具体的には閾値パラメータＮおよび重みパラメータｍを調べる。商標検索の性質上、類似商標の検索漏れを避けることが最も重要である。そこで検索パラメータを変化させてその傾向を調査し、この手法の検索漏れが最も少なくなる場合の検索パラメータ（以下、最適な検索パラメータとする。）を求めた。 ≪Experiment≫
Two experiments were performed to investigate the performance of the approach according to this embodiment.
In a first experiment, we examine the appropriate search parameters, specifically the threshold parameter N and the weight parameter m, to use in searching for this approach. Due to the nature of trademark searches, it is most important to avoid missing similar trademarks. Therefore, we investigated the trend by changing the search parameters, and obtained the search parameters (hereinafter referred to as the optimum search parameters) for the case where the search omission of this method is minimized.

第２の実験では、この手法の有効性を示すために、形状類似度ｓのみで外観が類似した商標を検索する手法（以下、既存手法とする。）とこの実施形態による手法の検索結果の比較を行った。また、検索質問２３にウィーン分類コードが正確に付与された際に、この実施形態による手法の検索精度がどの程度向上するかについても調べた。具体的な方法として、図５に示すウィーン分類コード付与処理３３を取り除き、代わりに専門家が手動で付与したウィーン分類コードを適用して分類類似度ａを算出した検索手法（以下、専門家による付与の検索手法とする。）とこの実施形態による手法の検索結果の比較を行った。 In the second experiment, in order to demonstrate the effectiveness of this method, we compared the search results of a method of searching for trademarks similar in appearance only by the shape similarity s (hereinafter referred to as the existing method) and the method according to this embodiment. made a comparison. We also investigated how much the search accuracy of the method according to this embodiment improves when the Wien classification code is correctly assigned to the search query 23 . As a specific method, the Wien classification code assigning process 33 shown in FIG. The search results of the given search method) and the method according to this embodiment were compared.

〔第１の実験：検索に用いる適切なパラメータを調査する実験〕
（実験で使用したデータセット）
日本の登録商標の画像623,101枚をデータベース画像とし、4,222枚をクエリ画像とした。まず4,222枚中1,000枚のクエリ画像を用いて最適な検索パラメータを求め、その後残りの3,222枚でウィーン分類の機械付与が正確に行われているかとこの手法の有効性を確かめた。
1,000枚のクエリ画像の正解画像数は14,226枚であり、残りの3,222枚のクエリ画像の正解画像数は45,422枚であった。この正解画像は、商標サーチャーと大学院２８名の協力のもと、１枚ずつ目視で評価して定めた。この際、「クエリ画像に対して、解像度の違いや色の違いは考慮せず、画像の一部または全体に、人が見て明らかに同一の形状が存在していると判断できる画像」を選択させた。また、本実験で使用されるデータセットは、完全に同一な商標はなく、解像度や色などの差異が認められた商標を用いた。 [First Experiment: Experiment to Investigate Appropriate Parameters Used for Search]
(Dataset used in the experiment)
623,101 images of Japanese registered trademarks were used as database images, and 4,222 images were used as query images. First, 1,000 out of 4,222 query images were used to find the optimal search parameters, and then the remaining 3,222 images were used to confirm the accuracy of machine assignment of Wien classification.
The number of correct images for the 1,000 query images was 14,226, and the number of correct images for the remaining 3,222 query images was 45,422. This correct image was determined by visual evaluation one by one with the cooperation of a trademark searcher and 28 graduate students. In this case, "an image in which it can be judged that the same shape clearly exists in part or all of the image without considering the difference in resolution and color with respect to the query image". made me choose. In addition, the data set used in this experiment does not have completely identical trademarks, and trademarks with differences in resolution, color, etc. were used.

（実験条件）
実験で使用したCPUはAMD Opteron（登録商標） Processor 6238でメモリは512GBの計算機である。またGPUはNVIDIA Corporation GP102 を9枚使用した。また形状類似度ｓを求める際に、用いたパラメータは以下のものとする。各種パラメータの詳細は工藤らの研究（R. Kudo, M Iwata, M. Iwamura, and K Koichi, “Proposal of Similar Image Retrieval System Using Large Scale Trademark Image Database,” Journal of the Imaging Society of Japan, 57, pp.403-412 (2018)）を参考にされたい。
形状類似度ｓの算出に用いたパラメータは以下の通り。
距離の概算値により検索する局所特徴の数 C＝1000
ｋ近傍探索におけるパラメータ k＝10
今回の実験で用いたパラメータは、前述の工藤らの研究で使用したパラメータk＝50からk＝10に変更している。正解画像の追加やプログラムの実装の修正により、最も高速・高精度に検索できるパラメータに変更したためである。 (Experimental conditions)
The CPU used in the experiment was an AMD Opteron (registered trademark) Processor 6238 and the memory was a computer with 512 GB. Also, 9 NVIDIA Corporation GP102 GPUs were used. The following parameters are used when obtaining the shape similarity s. For details of various parameters, refer to the research by Kudo et al. (R. Kudo, M Iwata, M. Iwamura, and K Koichi, “Proposal of Similar Image Retrieval System Using Large Scale Trademark Image Database,” Journal of the Imaging Society of Japan, 57, pp.403-412 (2018)).
The parameters used to calculate the shape similarity s are as follows.
Number of local features to search by estimated distance C = 1000
Parameter k = 10 in k-neighbor search
The parameters used in this experiment were changed from the parameter k=50 used in the study by Kudo et al. to k=10. This is because the parameters have been changed to enable the fastest and most accurate search by adding correct images and correcting the implementation of the program.

ResNet50の学習には、日本の登録商標694,428枚を用いた。学習に用いた登録商標には、ウィーン分類コードが最大で40個、最小で1個付与されていた。さらにウィーン分類コードの付与されていた数の平均は8.1であり標準偏差は5.75であった。より詳細に学習データを分析するために、図８に各大分類の学習に用いたデータ数を載せる。図８は、横軸に大分類を示しており、縦軸に学習に用いたウィーン分類コードの数を示している。このグラフから、大分類２６（幾何学図形及び立体的幾何学図形）の学習データ数が2,332,188個と多く、最も学習データ数が少なかった大分類１２（家具、衛生施設）は5,617個であったことがわかる。つまり、登録された商標においてウィーン分類コードには偏りが生じており、不均衡なデータを学習することとなった。 ResNet50 was trained using 694,428 Japanese registered trademarks. A maximum of 40 Vienna classification codes and a minimum of 1 were assigned to the registered trademarks used for training. Furthermore, the average number assigned to the Vienna classification code was 8.1 with a standard deviation of 5.75. In order to analyze the learning data in more detail, FIG. 8 shows the number of data used for learning each major classification. In FIG. 8, the horizontal axis indicates the major classification, and the vertical axis indicates the number of Wien classification codes used for learning. From this graph, the number of learning data for major category 26 (geometric figures and three-dimensional geometric figures) is large at 2,332,188, and the smallest number of learning data for major category 12 (furniture, sanitary facilities) is 5,617. I understand. In other words, there was a bias in the Vienna Classification Code for registered trademarks, resulting in learning imbalanced data.

またResNet50で用いた学習パラメータは以下のとおりである。
バッチサイズ：512
epoch数：100
optimizer ：Adam(alpha＝0.0001、beta1＝0.9, beta2＝0.999, eps＝1.0 x 10-8, eta＝1.0)
alphaの減少： 60epochおよび90epochにてalpha の値を1/10
データの正規化：学習に用いる画像のChannel毎に学習データセットの平均値を引き1/255スケーリング
Data augmentation：Random Crop, Horizontal Flip
ResNet50のパラメータの初期化：Heらの研究に従う（K. He, X. Zhang, S. Ren and J. Sun “Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification,” arXiv preprint arXiv, (2015) 参照）。 The learning parameters used in ResNet50 are as follows.
Batch size: 512
Number of epochs: 100
optimizer : Adam(alpha=0.0001, beta1=0.9, beta2=0.999, eps=1.0 x 10-8, eta=1.0)
Decrease alpha: 1/10 alpha value at 60epoch and 90epoch
Data normalization: 1/255 scaling by subtracting the average value of the learning data set for each image channel used for learning
Data augmentation: Random Crop, Horizontal Flip
Parameter initialization of ResNet50: Following the work of He et al. (K. He, X. Zhang, S. Ren and J. Sun “Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification,” arXiv preprint arXiv, (2015 ) reference).

（最適な検索パラメータを求める実験の詳細）
この実験では、1,000枚のクエリ画像を対象に、この手法の最適な検索パラメータをグリッドサーチによって求めることで、各検索パラメータを変化させた場合の傾向を調べた。
さらに、最適な検索パラメータNを用いた場合の機械付与のパフォーマンスを調べるために、専門家が付けたウィーン分類コードとの差を3,222枚のクエリに対して確認した。 (Details of the experiment to find the optimal search parameters)
In this experiment, the optimal search parameters for this method were obtained by grid search using 1,000 query images, and the tendency when each search parameter was changed was investigated.
Furthermore, to examine the performance of the machine assignment when using the optimal search parameter N, we checked the difference with the Wien classification code assigned by experts for 3,222 queries.

［実験条件］
最も検索漏れが少ない指標として、再現率（Recall）が100％の場合の適合率（Precision）を使用した。この実験において再現率は、検索質問とされた商標のうちであるウィーン分類コードが付与されるべきすべての商標のうちで、そのウィーン分類コードが機械付与されたものの割合といえる。再現率100％は、各分類について、その分類のウィーン分類コードが付与されるべき商標には、漏れなくそのウィーン分類コードが機械付与されたことを意味する。一方、この実験における適合率は、あるウィーン分類コードが機械付与されたすべての商標のうちで、本来そのウィーン分類コードが付与されるべきでないものを除いた割合といえる。再現率と適合率の算出には、11点補完平均適合率（Edited by K. Kita, K. Tuda and M. Shishibori, “Jyouhoukensaku arugorizumu,” Kyoritsu Shuppan (2002), pp.20-21参照）を採用した。この実験で対象とする検索パラメータは以下の２点であり、加えて各検索パラメータに関し、調査する値を以下に示す。
ウィーン分類コードの機械付与に関する閾値パラメータＮ
調査する値：0.1から1.0まで0.1間隔で設定
データベース画像の形状類似度ｓと分類類似度ａの重みパラメータｍ
調査する値：0.01から0.2まで0.01間隔で設定 [Experimental conditions]
As the index with the least search omissions, the precision when the recall is 100% was used. In this experiment, the recall rate can be said to be the ratio of the trademarks to which the Vienna classification code was assigned by machine, out of all the trademarks that should have been given the Vienna classification code, among the trademarks that were used as search queries. A recall rate of 100% means that, for each class, the Vienna Classification Code was automatically assigned to all trademarks that should have been given the Vienna Classification Code of that class. On the other hand, the precision rate in this experiment can be said to be the ratio of trademarks to which a certain Vienna classification code should not have been assigned, out of all the trademarks to which a certain Vienna classification code should have been assigned. 11-point imputed mean precision (Edited by K. Kita, K. Tuda and M. Shishibori, “Jyouhoukensaku arugorizumu,” Kyoritsu Shuppan (2002), pp.20-21) was used to calculate recall and precision. adopted. The search parameters targeted in this experiment are the following two points, and in addition, the values to be investigated for each search parameter are shown below.
Threshold Parameter N for Machine Assignment of Wien Classification Codes
Value to be investigated: Weight parameter m of shape similarity s and classification similarity a of database images set in 0.1 intervals from 0.1 to 1.0
Value to investigate: set from 0.01 to 0.2 in 0.01 intervals

［実験結果・考察］
まず、再現率が100％の場合の適合率が最も高い値を示す閾値パラメータＮと重みパラメータｍを上記の値で調べた。適合率が最も高い値を示す閾値パラメータＮとその周辺の値を用いた結果を図９に示す。
この図では、縦軸は再現率が100％の場合の適合率をプロットしており、横軸は重みパラメータｍを示している。適合率が最も高い値を示すＮとｍの値は、それぞれ0.3と0.1であった。検索にこれらのパラメータを用いることで、この実施形態における手法は約６７％の適合率を得ることができた。
次に、各パラメータＮおよびｍを変化させた場合の適合率の傾向について調べる。
まず、閾値パラメータＮの各設定値について、ｍを変化させた場合のそれぞれ最も高い値を示すピーク適合率を比較すると、最大のピーク適合率はＮ＝0.3，ｍ＝0.1の場合における66.5％である。これに対して、最少のピーク適合率はＮ＝0.5，ｍ＝0.09の場合における66.75％であって、その差が0.2％以内に収まっている。このことから、閾値パラメータＮに対するピーク適合率の変化については大きな差異が見られなかった。
一方、重みパラメータｍを大きくすると、どの閾値パラメータＮ対してもｍ＝0.15付近で適合率が減少傾向になるが、その度合いはＮの値が大きいほうが緩やかである。つまりＮの値が大きい程、重みパラメータｍの値を大きくすることの影響度が小さい。 [Experimental results and discussion]
First, the threshold parameter N and the weight parameter m, which indicate the highest precision when the recall is 100%, were examined using the above values. FIG. 9 shows the result of using the threshold parameter N that indicates the highest matching rate and its surrounding values.
In this figure, the vertical axis plots the precision when the recall is 100%, and the horizontal axis shows the weight parameter m. The values of N and m showing the highest precision were 0.3 and 0.1, respectively. Using these parameters in the search, the technique in this embodiment was able to obtain a precision of about 67%.
Next, the tendency of the precision when each parameter N and m is changed is examined.
First, when comparing the peak matching ratios showing the highest values when m is changed for each set value of the threshold parameter N, the maximum peak matching ratio is 66.5% when N = 0.3 and m = 0.1. be. On the other hand, the minimum peak matching ratio is 66.75% when N=0.5 and m=0.09, and the difference is within 0.2%. From this, no significant difference was found in the change of the peak matching rate with respect to the threshold parameter N.
On the other hand, when the weight parameter m is increased, the matching rate tends to decrease near m=0.15 for any threshold parameter N, but the degree of decrease is gentler as the value of N increases. That is, the larger the value of N, the smaller the influence of increasing the value of the weight parameter m.

次に、閾値パラメータＮ＝0.3に定めた場合にウィーン分類が正確に付与されているかを3,222枚のクエリに対して調べた。機械付与できた割合は再現率が50.2％の場合であって、適合率が50.6％であった。ウィーン分類コードの機械付与は十分高い性能を達成しているとはいえないかもしれない。
機械付与されたウィーン分類コードの具体例を図１０から図１２示し、考察する。これらの例では、対象となるクエリ画像とそれに付与されたウィーン分類コードを示す。専門家が付与したウィーン分類コードを実線で囲み、機械付与したウィーン分類コードを破線で囲んで示している。
図１０は、専門家が付与したウィーン分類コードをすべて正確に機械付与できた例を示している。図１０に示す検索質問画像（登録商標４１７０４７２号）においては、人間の顔を持つ三日月が描かれており、専門家がつけた８個のウィーン分類コードをすべて正確に機械付与できている。このようにすべてのウィーン分類コードが正確に付与できた例は519枚あった。 Next, 3,222 queries were examined to see if the Wien classification was correctly assigned when the threshold parameter N=0.3. The rate of machine assignment was 50.2% for recall and 50.6% for precision. Machine assignment of the Wien Classification code may not achieve sufficiently high performance.
A specific example of a machine-assigned Wien Classification code is shown and discussed in FIGS. 10-12. These examples show the query image of interest and the Wien classification code assigned to it. The expert-assigned Wien classification code is shown enclosed by a solid line, and the machine-assigned Vienna classification code is shown enclosed by a dashed line.
FIG. 10 shows an example in which all expert-given Wien Classification codes could be correctly machine-given. In the search query image (Registered Trademark No. 4170472) shown in FIG. 10, a crescent moon with a human face is depicted, and all eight Wien classification codes assigned by experts can be accurately machine-assigned. In this way, there were 519 examples in which all Vienna classification codes could be correctly assigned.

図１１は、専門家が付与したウィーン分類コードが１つも機械付与できなかった例を示す。図１１に示す検索質問画像（登録商標４７６４２０６号）においては、動物のキャラクターが描かれている。専門家がつけた７個のウィーン分類コードが、機械付与では１つも付与されなかった。このように、ウィーン分類コードが１1個も機械付与されなかった例は54枚あった。
図１２は、専門家が付与したウィーン分類コードに対して、機械付与では漏れが生じたり誤ったウィーン分類コードが付与された例を示す。図１２に示す検索質問画像（登録商標第４３４１４２１号）においては、馬を引き連れた男が描かれている。専門家がつけた１６個のウィーン分類コードのうち９個は機械付与できているが、残りの７個は付与されず漏れが生じている。さらに、６個のウィーン分類コードが誤って機械付与されている。このように、１個も機械付与されなかった例を除き、付け漏れや誤った付与がおこなわれた例は、2,649枚あった。 FIG. 11 shows an example in which none of the expert-given Wien Classification codes could be machine-given. In the search question image (registered trademark No. 4764206) shown in FIG. 11, an animal character is drawn. None of the 7 expertly assigned Wien Classification codes were assigned by machine assignment. Thus, there were 54 cases in which no 11 Vienna classification codes were machine assigned.
FIG. 12 shows an example in which omissions or erroneous Wien classification codes are assigned to the Wien classification codes assigned by an expert in machine assignment. In the search question image (Registered Trademark No. 4341421) shown in FIG. 12, a man with a horse is depicted. Of the 16 Wien classification codes assigned by experts, 9 have been machine assigned, but the remaining 7 have not been assigned and omissions have occurred. In addition, 6 Wien classification codes are erroneously machine assigned. In this way, there were 2,649 cases of omission or incorrect application, excluding cases in which none were applied mechanically.

次に、図１２の具体例を精査する。機械付与できなかったウィーン分類コードは７個あるが、03.03.01（馬、ラバ）と02.01.20（馬・ラバ又はロバに乗ったあるいは連れた男性）のように共通に付与できたウィーン分類コードと重複した内容が多い。また誤って機械付与されたウィーン分類コードの６個中４個が大分類２９（色）に関する内容であり29.01.01（赤、ピンク、オレンジ）のように商標を正しく表現できているとも思えるものもあったが、専門家はそれを付与していなかった。
以上の点から、専門家が重複したウィーン分類コードを付与する傾向と、色のウィーン分類コードの付与する傾向に関して、学習が十分でないと考えられる。また26.01.16（その他の図形要素を内包する円または楕円）が機械付与されているが、「その他」を含んだ内容であり対象となる層の他のウィーン分類コードも考慮する必要があるため、正確に付与することが難しいと考えられる。参考までにウィーン分類コード26.01.16の専門家が付与した数は288枚で、機械付与した数は260枚であったが、正確に付与できたものは１枚もなかった。 Next, the specific example of FIG. 12 is scrutinized. There are 7 Vienna Classification codes that could not be assigned by machine, but there are 03.03.01 (horses, mules) and 02.01.20 (men riding or accompanied by horses, mules or donkeys) that could be assigned in common. A lot of code and duplicate content. In addition, 4 out of 6 Vienna Classification Codes that were erroneously assigned by machine are related to Major Classification 29 (colors), such as 29.01.01 (red, pink, orange), which seems to represent the trademark correctly. There was also, but the experts did not grant it.
From the above points, it is considered that experts do not sufficiently learn the tendency to assign duplicate Wien classification codes and the tendency to assign color Wien classification codes. Also, 26.01.16 (circle or ellipse containing other graphic elements) is machine assigned, but it contains "other" and other Vienna classification codes of the target layer must also be considered. , is considered to be difficult to assign accurately. For reference, 288 were given by experts with the Vienna classification code 26.01.16, and 260 were given by machine, but none of them could be given correctly.

〔第２の実験：この手法の有効性を示す実験〕
この実験では、最適な検索パラメータを用いたこの手法の有効性を示すために、3,222枚のクエリ画像を用いて、既存手法との比較を行った。
（実験条件）
この比較では、第１の実験と同様、11点補完平均適合率（Edited by K. Kita, K. Tuda and M. Shishibori, “Jyouhoukensaku arugorizumu,” Kyoritsu Shuppan (2002), pp.20-21参照）を採用した。加えて、この手法と既存手法の検索に有意差があるかを調べるために、平均適合率（Average Precision）を算出し、統計的な検定を行った。平均適合率はランキングの上位にどれだけ適合する画像が出現しているかを測る指標である。用いた検定法は、ウィルコクソンの符号付順位和検定（Frank Wilcoxon, “Individual comparisons by ranking methods,” Biometrics bulletin, 1, pp.80-83 (1945) 参照）である。ウィルコクソンの符号付順位和検定は、２つのデータ間に対応関係がある際に用いられ、データの中央値に差があれば、有意差が認められる手法である。 [Second experiment: Experiment showing the effectiveness of this method]
In this experiment, 3,222 query images were used to demonstrate the effectiveness of this method with optimal search parameters, and a comparison with existing methods was performed.
(Experimental conditions)
In this comparison, as in the first experiment, the 11-point imputed average precision (see Edited by K. Kita, K. Tuda and M. Shishibori, “Jyouhoukensaku arugorizumu,” Kyoritsu Shuppan (2002), pp.20-21) It was adopted. In addition, in order to investigate whether there is a significant difference between the search of this method and the existing method, we calculated the average precision (Average Precision) and performed a statistical test. The average matching rate is an index that measures how many matching images appear at the top of the ranking. The test used is the Wilcoxon signed rank sum test (see Frank Wilcoxon, "Individual comparisons by ranking methods," Biometrics bulletin, 1, pp. 80-83 (1945)). Wilcoxon's signed rank sum test is used when there is correspondence between two data, and is a method that recognizes a significant difference if there is a difference in the median values of the data.

またウィーン分類コードが正確に付与されるとどの程度検索精度が向上するかを調べるために、専門家による付与の検索手法との比較も行った。専門家による付与の検索手法の最適な重みパラメータｍは、第１の実験と同様にグリッドサーチによって調べた結果、0.13であった。よって、専門家による付与の検索手法では、パラメータｍを0.13とする。最後に、具体的な検索例を示し、考察する。 In addition, in order to investigate how much the retrieval accuracy improves when the Vienna classification code is assigned correctly, we compared it with the retrieval method assigned by experts. The optimal weight parameter m for the retrieval method given by experts was 0.13 as a result of examining by grid search in the same manner as in the first experiment. Therefore, the parameter m is set to 0.13 in the search method for granting by experts. Finally, a specific search example will be shown and discussed.

（実験結果・考察）
この実施形態による手法と専門家による付与の検索手法と既存手法との11点補完平均適合率の結果を図１３に示す。図１３のような再現率－適合率曲線（Precision-Recall曲線あるいはPR曲線ともいう）は、機械学習の分野において予測の正確さを表す指標としてよく用いられる。実験対象の新規な商標に類似する登録商標を、類似度が高いと予測される順にランキングした場合、実際に類似度の高い類似商標を上位に固められたかという、検索の正確さを表すものといえる。
この結果から、専門家による付与の検索手法、この実施形態による手法、既存手法の順にすべての再現率において適合率が高くなっていることがわかる。
また、この実施形態による手法と既存手法を再現率が100％の場合の適合率を比較する。再現率が100％の場合に、この実施形態による手法は適合率が66.5％であり、既存手法の63.5％よりも適合率が約3.0％向上している。また既存手法とこの手法の平均適合率に検定法を適用した結果、Ｐ値は7.71×10^(-85)となった。Ｐ値が有意水準１％を下回っているため、この手法と既存手法の平均適合率は有意差があるといえる。
以上により、ディープラーニングにより機械付与したウィーン分類コードを検索に用いることは有効であるといえる。 (Experimental results and discussion)
FIG. 13 shows the results of the 11-point complement average precision of the method according to this embodiment, the search method given by experts, and the existing method. A recall-precision curve (also referred to as a Precision-Recall curve or a PR curve) as shown in FIG. 13 is often used as an indicator of prediction accuracy in the field of machine learning. When registered trademarks similar to the new trademark being tested were ranked in the order of predicted similarity, it shows the accuracy of the search, as to whether similar trademarks with a high degree of similarity were actually grouped at the top. I can say.
From this result, it can be seen that the precision rate is high in all recall rates in order of the retrieval method given by the expert, the method according to this embodiment, and the existing method.
Also, the precision of the method according to this embodiment and the existing method when the recall is 100% will be compared. When the recall rate is 100%, the precision rate of the method according to this embodiment is 66.5%, which is about 3.0% higher than the 63.5% of the existing method. As a result of applying the test method to the average precision of the existing method and this method, the P value was 7.71×10 ^(-85) . Since the P-value is below the significance level of 1%, it can be said that there is a significant difference between the average precision of this method and the existing method.
From the above, it can be said that it is effective to use the Wien classification code machine-assigned by deep learning for retrieval.

さらに専門家による付与の検索手法と比較すると、再現率が100％の場合に専門家による付与の検索手法の適合率が68.5％であって、この実施形態による手法よりも適合率が約2.0％高かった。以上のことから、ウィーン分類コードの付与精度をより向上させることができれば、この実施形態による手法の適合率を専門家による付与の検索手法の適合率まで引き上げることができると考えられる。 Furthermore, when compared with the search method given by experts, the precision rate of the search method given by experts is 68.5% when the recall rate is 100%, which is about 2.0% higher than the method according to this embodiment. it was high. From the above, if the accuracy of assigning the Wien classification code can be further improved, it is thought that the precision of the method according to this embodiment can be raised to the precision of the search method of assigning by experts.

次に、検索処理２１の結果について具体例を図１４と図１５に示す。図１４および図１５は、図全体を上下および左右に４分割している。左上にクエリ画像を示し、右上に既存手法、右下にこの実施形態による手法、左下に専門家による付与の検索手法の上位15位までの検索結果を示す。正解画像は赤枠で囲んでいる。
図１４は、既存手法に比べてこの手法が優れていた例である。図１４の既存手法では、検索順位１５位までに正解画像が６枚検索されている。それに対して、専門家による付与の検索手法とこの手法では検索順位１５位までにそれぞれ正解画像が１１枚検索されている。クエリ画像に着目すると、描かれているのは上部に王冠、両側に獅子を携え中央に矢と剣を交差させた紋章である。一方、専門家による付与の検索手法とこの手法の検索できた正解画像には、剣と矢を交差させた紋章や王冠や盾などが描かれている。これらの正解画像が出力できた理由として、クエリ画像の商標について、専門家による付与と火回付与の両方で24.1.19（盾持ちを有する盾）や24.9.5（一つの王冠）などのウィーン分類コードが付与されていたことが挙げられる。 14 and 15 show specific examples of the results of the search processing 21. FIG. 14 and 15 are divided vertically and horizontally into four parts. The upper left shows the query image, the upper right shows the existing method, the lower right shows the method according to this embodiment, and the lower left shows the top 15 search results of the search methods assigned by experts. The correct image is surrounded by a red frame.
FIG. 14 shows an example in which this method is superior to the existing method. In the existing method of FIG. 14, 6 correct images are retrieved up to the 15th place in the retrieval ranking. On the other hand, 11 correct images were retrieved up to the 15th place in the retrieval rankings by the retrieval method given by the expert and by this method. Focusing on the query image, what is drawn is a crest with a crown at the top, lions on both sides, and an arrow and a sword crossed in the center. On the other hand, the method of searching for grants by experts and the correct images retrieved by this method include emblems with crossed swords and arrows, crowns, shields, and the like. The reason why these correct images could be output is that for the trademark of the query image, there are Vienna A classification code was given.

一方、図１５はこの手法に比べて既存手法が優れていた例である。ここでは、既存手法の検索精度が最も高かった例を示している。既存手法は正解画像が１位と２位に検索できており、この手法と専門家による付与の検索手法では、２枚目の正解画像がそれぞれ５位と１１位に検索されている。このようにこの手法と専門家による付与の検索手法では、２枚目の正解画像の検索順位が下がった。その理由は、両手法において、クエリ画像に付与したウィーン分類コードと２枚目の正解画像のウィーン分類コードが１個も一致しなかったためである。このように既存手法の検索精度がよかったクエリ画像数とこの手法の検索精度がよかったクエリ画像数を表２に示す。表２から、機械付与したウィーン分類コードを検索に加えることによって、検索精度が改善されたクエリ画像が多くなったことがわかる。
≪まとめ≫ On the other hand, FIG. 15 shows an example in which the existing method is superior to this method. Here, an example in which the search accuracy of the existing method was the highest is shown. According to the existing method, the correct image was retrieved at 1st and 2nd place, and in this method and the retrieval method given by the expert, the second correct image was retrieved at 5th and 11th place, respectively. As described above, in this method and the search method with the assignment by the expert, the search ranking of the second correct image was lowered. The reason is that the Wien classification code assigned to the query image and the Wien classification code of the second correct image do not match even one in both methods. Table 2 shows the number of query images for which the existing method has good retrieval accuracy and the number of query images for which this method has good retrieval accuracy. From Table 2, it can be seen that adding the machine-assigned Wien classification code to the search resulted in more query images with improved search accuracy.
≪Summary≫

この明細書では、外観が類似した商標を検索するために、図形の形状に基づく類似性に加えて図形の分類に基づく類似性も評価する検索手法を提案した。この発明の特徴は、図形の分類に基づく類似性を評価する際に、ディープラーニングによる機械付与したウィーン分類コードを用いることである。そこでまず、ウィーン分類コードを正確に機械付与できていたかを調べた。その結果、再現率が50.2％の場合に適合率が50.4％であった。このことから、ウィーン分類コードの機械付与は十分高い精度が達成できているとはいえないかもしれない。 This specification proposes a search method that evaluates similarity based on the classification of figures in addition to similarity based on the shape of figures in order to search for trademarks that are similar in appearance. A feature of the present invention is the use of deep learning machine-assigned Wien classification codes in evaluating the classification-based similarity of figures. Therefore, we first examined whether the Wien classification code could be accurately machine-assigned. As a result, the precision was 50.4% when the recall was 50.2%. From this, it may be said that the machine assignment of the Wien classification code does not achieve sufficiently high accuracy.

しかし、この実施形態による手法と既存手法の検索精度を比較した結果、この実施形態による手法は再現率が100％の場合に、適合率が約3.0％向上している。また、この実施形態による手法の機械付与の精度を向上させることで、検索精度をさらに約2.0％向上させ得る。この実施形態による手法と既存手法の平均適合率に検定法を適用した結果、有意差があることが実証された。以上のことから、図形の形状に基づく類似性に加えて図形の分類に基づく類似性も評価する検索手法の有効性が示された。 However, as a result of comparing the search accuracy of the method according to this embodiment and the existing method, the precision rate of the method according to this embodiment is improved by about 3.0% when the recall rate is 100%. Also, by improving the accuracy of machine assignment of the technique according to this embodiment, the search accuracy can be further improved by about 2.0%. As a result of applying the test method to the average precision of the method according to this embodiment and the existing method, it was demonstrated that there is a significant difference. From the above, the effectiveness of the search method that evaluates the similarity based on the classification of figures in addition to the similarity based on the shape of figures was demonstrated.

以上に述べたように、
（i）この発明による類似画像検索装置は、検索質問画像から１以上の形状的特徴を抽出するクエリ特徴抽出部と、予め１以上の分類情報が付与されかつ１以上の形状的特徴が抽出された複数の参照画像を参照し、検索質問画像と形状的特徴の照合を行う形状照合部と、前記照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出部と、各参照画像に付与された分類情報に基づいて検索質問画像につき１以上の分類情報を推定し付与する分類付与部と、検索質問画像に付与された分類情報と各参照画像に付与された分類情報との一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出部と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力部と、を備えることを特徴とする。
この発明において、形状的特徴は、画像の形状の特徴を表すものであって、画像検索に適用可能な特徴量である。その具体的な例としては、ＳＩＦＴやそれに改良を加えた局所特徴が挙げられる。形状的特徴は必ずしも局所的な特徴を表す局所特徴に限定されるものでない。しかし、近年の画像検索の進歩は局所特徴量の貢献によるところが少なくない。
また、分類情報は、画像にラベル付けされた情報であって、付与された分類情報によってその画像が属するカテゴリが決まるものである。分類情報の具体例として、上述の実施形態においては図形商標に付与されたウィーン分類コードについて述べたが、それに限定されるものでない。ただし、この明細書における分類情報は、画像の形状的な類似性とは異なる観点を含んでカテゴリが決められるものである。例えば、画像がそれを観る者に与える観念、印象、意味あるいは象徴などを含んでカテゴリが決まる。そうでなければ、形状的特徴のみに基づく類似画像検索で足りるからである。
さらにまた、参照画像は、検索質問画像に対する類似画像の検索対象となる画像である。参照画像は、予め分類情報が付与されている。その具体的として、前述の実施形態では登録商標を挙げたが、これに限るものでない。なお、この明細書において、参照画像は予め分類情報が付与されているものとしているが、これに対し検索質問画像は未だ分類情報が付与されていない状態で検索に用いられるものとしている。
上述の実施形態および図５に示すクエリ特徴抽出処理は、この発明に係るクエリ特徴抽出部の機能に対応する。同様に、形状照合処理はこの発明に係る形状照合部の機能に対応し、形状類似度算出処理はこの発明に係る形状照合部の機能に対応する。さらに、上述の実施形態および図５に示すウィーン分類コード付与処理３３はこの発明に係る分類付与部の機能に対応し、分類類似度算出処理はこの発明に係る分類類似度算出部に対応する。同様に、出力処理はこの発明に係る出力部の機能に対応する。
上述の実施形態は、各参照画像に対してあらかじめウィーン分類コードが付与されていることを前提としている。その前提を満たした商標に対する検索のみに適用可能である。しかし、図５に示すウィーン分類コード付与処理３３を参照画像に対して適用することも考えられる。そうしてウィーン分類コードが付与された画像を対象としてデータベースを作成することも考えられる。そのようにすれば、予め分類情報が付与されていない未登録商標や商標以外の画像、例えばイラストや一般の物体の画像などの検索に対しても、この実施形態で述べた手法を拡張し適用できるものと考えられる。 As mentioned above,
(i) A similar image retrieval device according to the present invention comprises a query feature extraction unit for extracting one or more shape features from a query image, and one or more classification information assigned in advance and one or more shape features extracted. A shape matching unit that refers to a plurality of reference images and compares the search question image and shape features, and a shape similarity calculation unit that calculates the shape similarity of each reference image to the search question image based on the matching. , a classification assigning unit that estimates and assigns one or more classification information per search question image based on the classification information assigned to each reference image; and the classification information assigned to the search question image and the classification assigned to each reference image. a classification similarity calculation unit that calculates the classification similarity of each reference image to a search question image based on the degree of matching with information; and one or more similar to the search question image based on the shape similarity and the classification similarity and an output unit for outputting the reference image.
In the present invention, the shape feature represents the shape feature of an image and is a feature amount applicable to image retrieval. Specific examples thereof include SIFT and local features obtained by improving it. Shape features are not necessarily limited to local features representing local features. However, recent advances in image retrieval are largely due to the contribution of local features.
Classification information is information labeled with an image, and the category to which the image belongs is determined by the given classification information. As a specific example of the classification information, the Vienna classification code given to the graphic trademark was described in the above-described embodiment, but it is not limited to this. However, in the classification information in this specification, categories are determined from a viewpoint different from the shape similarity of images. For example, the category is determined by including the idea, impression, meaning or symbol that the image gives to the viewer. Otherwise, similar image retrieval based only on shape features will suffice.
Furthermore, the reference image is an image to be searched for images similar to the query image. The reference image is provided with classification information in advance. As a specific example, the registered trademark was mentioned in the above-described embodiment, but it is not limited to this. In this specification, it is assumed that the reference image is given classification information in advance, but the search question image is used for retrieval without being given classification information.
The above embodiment and the query feature extraction processing shown in FIG. 5 correspond to the function of the query feature extraction unit according to the present invention. Similarly, the shape matching process corresponds to the function of the shape matching section according to the present invention, and the shape similarity calculation process corresponds to the function of the shape matching section according to the present invention. Furthermore, the Vienna classification code assigning process 33 shown in the above embodiment and FIG. 5 corresponds to the function of the classification assigning unit according to the invention, and the classification similarity calculation process corresponds to the classification similarity calculation part according to the invention. Similarly, output processing corresponds to the function of the output unit according to the invention.
The above-described embodiment assumes that each reference image is given a Wien classification code in advance. Applicable only to searches for trademarks that meet that premise. However, it is also conceivable to apply the Wien classification code assigning process 33 shown in FIG. 5 to the reference image. It is also conceivable to create a database for images to which the Wien classification code has been assigned. By doing so, the method described in this embodiment can be extended and applied to searches for unregistered trademarks and images other than trademarks that have not been given classification information in advance, such as illustrations and images of general objects. It is considered possible.

さらに、この発明の好ましい態様について述べる。
（ii）前記参照画像が登録商標の画像であり、各参照画像に付与された分類情報がウィーン分類コードであり、検索質問画像および各参照画像に係る形状的特徴がそれぞれの画像から複数抽出される局所特徴であり、前記形状照合部は、検索質問画像に係る各局所特徴と各参照画像に係る各局所特徴とをそれぞれ照合し、照合された形状的特徴について類否を判断するものであってもよい。
このようにすれば、外観が類似した商標を検索する手法として、図形の形状に基づく類似性に加えて図形の分類に基づく類似性を考慮した検索が実現できる。 Furthermore, preferred embodiments of the present invention will be described.
(ii) the reference image is a registered trademark image, the classification information assigned to each reference image is a Vienna classification code, and a plurality of shape features related to the search question image and each reference image are extracted from each image; The shape matching unit compares each local feature related to the query image with each local feature related to each reference image, and determines the similarity of the matched shape features. may
In this way, as a method of searching for trademarks similar in appearance, it is possible to realize a search that considers similarity based on the classification of figures in addition to similarity based on the shape of figures.

（iii）前記出力部は、各参照画像について、形状類似度と分類類似度とを相補的に重み付けした総合類似度に基づいて検索質問画像に対する各参照画像の類否を判断してもよい。
このようにすれば、総合類似度を用いて、図形の形状に基づく類似性に加えて図形の分類に基づく類似性を考慮した検索が実現できる。 (iii) The output unit may determine the similarity of each reference image to the search question image based on a comprehensive similarity obtained by complementary weighting shape similarity and classification similarity for each reference image.
In this way, using the total similarity, it is possible to realize a search that takes into account the similarity based on the classification of the figure in addition to the similarity based on the shape of the figure.

（iv）前記形状照合部が照合に用いた局所特徴が、信頼性のある特徴か否かを検証する検証部と、検索質問画像に対し出力部が出力した１以上の登録商標画像の少なくとも一部を新たな検索質問として、その新たな検索質問に対する類似画像を得る擬似適合フィードバック部とをさらに備え、前記出力部は、前記検索質問に対する出力として得られた登録商標画像に擬似適合フィードバック処理で得られた登録商標画像を加えて最終出力とするものであってもよい。
このようにすれば、検証部によって信頼性のない特徴を排除し、擬似適合フィードバックによって類似画像の検索漏れを低減することによって検索の精度をより高めることができる。
上述の実施形態および図５に示す検証処理は上記検証部の機能に対応する。同様に、擬似適合フィードバック処理は、上記擬似適合フィードバック部の機能に対応する。 (iv) at least one of a verification unit that verifies whether the local features used for matching by the shape matching unit are reliable features, and one or more registered trademark images output by the output unit in response to the search query image; as a new search question, a pseudo-match feedback unit that obtains a similar image for the new search question, and the output unit performs pseudo-match feedback processing on the registered trademark image obtained as an output for the search question. The final output may be obtained by adding the obtained registered trademark image.
In this way, the verification unit eliminates unreliable features, and the pseudo-match feedback reduces retrieval omissions for similar images, thereby further enhancing retrieval accuracy.
The above embodiment and the verification process shown in FIG. 5 correspond to the function of the verification unit. Similarly, the pseudo-adaptive feedback process corresponds to the function of the pseudo-adaptive feedback section described above.

（v）前記分類付与部は、各参照画像とそれに対し付与された分類情報とを教師データとして学習することで画像入力に対して分類情報を出力するよう構築された機械学習モデルを含んで構成されてもよい。
このようにすれば、これまでに分類情報が付与された参照画像を学習することによって未だ分類情報が付与されていない任意の画像に分類情報を機械付与できる。そして、機械付与された分類情報を用い、画像の形状に基づく類似性に加えて分類情報に基づく類似性を考慮して類似画像を見出すことができる。 (v) The classification imparting unit includes a machine learning model constructed to output classification information for an image input by learning each reference image and the classification information assigned thereto as teacher data. may be
In this way, classification information can be mechanically assigned to any image to which no classification information has yet been assigned by learning reference images to which classification information has been assigned. Then, using the machine-given classification information, similar images can be found by considering the similarity based on the classification information in addition to the similarity based on the shape of the image.

（vi）また、この発明の好ましい一態様が、コンピュータが実行する処理であって、検索質問画像から１以上の形状的特徴を抽出するクエリ特徴抽出処理と、予め１以上の分類情報が付与されかつ１以上の形状的特徴が抽出された複数の参照画像を参照し、検索質問画像と形状的特徴の照合を行う形状照合処理と、前記照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出処理と、各参照画像に付与された分類情報に基づいて検索質問画像につき１以上の分類情報を推定し付与する分類付与処理と、検索質問画像に付与された分類情報と各参照画像に付与された分類情報との一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出処理と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力処理と、を備える類似画像検索方法であってもよい。 (vi) A preferred aspect of the present invention is a process executed by a computer, comprising query feature extraction processing for extracting one or more shape features from a search question image, and referring to a plurality of reference images from which one or more shape features have been extracted, shape matching processing for matching the search question image with the shape features, and shape similarity of each reference image to the search question image based on the matching. A shape similarity calculation process for calculating the degree of similarity, a classification assignment process for estimating and assigning one or more classification information to the search question image based on the classification information assigned to each reference image, and a classification assigned to the search question image Classification similarity calculation processing for calculating the classification similarity of each reference image to a search query image based on the degree of matching between information and classification information assigned to each reference image; output processing for outputting one or more reference images similar to the query image based on the similar image retrieval method.

（vii）あるいはまた、この発明の好ましい一態様がコンピュータに処理を実行させるプログラムであって、コンピュータに処理を実行させるプログラムであって、検索質問画像から１以上の形状的特徴を抽出するクエリ特徴抽出処理と、予め１以上の分類情報が付与されかつ１以上の形状的特徴が抽出された複数の参照画像を参照し、検索質問画像と形状的特徴の照合を行う形状照合処理と、前記照合に基づいて検索質問画像に対する各参照画像の形状類似度を算出する形状類似度算出処理と、各参照画像に付与された分類情報に基づいて検索質問画像につき１以上の分類情報を推定し付与する分類付与処理と、検索質問画像に付与された分類情報と各参照画像に付与された分類情報との一致の度合いに基づいて検索質問画像に対する各参照画像の分類類似度を算出する分類類似度算出処理と、前記形状類似度および前記分類類似度に基づいて検索質問画像に類似する１以上の参照画像を出力する出力処理と、を備える類似画像検索プログラムであってもよい。
この発明の好ましい態様には、上述した複数の態様のうちの何れかを組み合わせたものも含まれる。
前述した実施の形態の他にも、この発明について種々の変形例があり得る。それらの変形例は、この発明の範囲に属さないと解されるべきものではない。この発明には、請求の範囲と均等の意味および前記範囲内でのすべての変形とが含まれるべきである。 (vii) Alternatively, a preferred aspect of the present invention is a program for causing a computer to execute processing, which is a program for causing a computer to execute processing, wherein query features for extracting one or more shape features from a query image an extraction process; a shape matching process for matching a search question image with a shape feature by referring to a plurality of reference images to which one or more classification information has been assigned in advance and from which one or more shape features have been extracted; A shape similarity calculation process for calculating the shape similarity of each reference image with respect to the search question image based on and estimating and assigning one or more classification information to the search question image based on the classification information assigned to each reference image Classification assignment processing and classification similarity calculation for calculating the classification similarity of each reference image to the query image based on the degree of matching between the classification information assigned to the query image and the classification information assigned to each reference image and an output process of outputting one or more reference images similar to the query image based on the shape similarity and the classification similarity.
Preferred aspects of the present invention include combinations of any of the aspects described above.
Various modifications of the present invention are possible in addition to the above-described embodiments. Those modifications should not be construed as not belonging to the scope of the present invention. The present invention should include all modifications within the scope of the claims, the meaning of equivalents, and the scope.

１１：登録処理、１３：登録商標、１５：特徴抽出処理、１７：データベース、２１：検索処理、２３：検索質問、２５：クエリ特徴抽出処理、２７：形状照合処理、２９：検証処理、３１：形状類似度算出処理、３３：ウィーン分類コード付与処理、３５：分類類似度算出処理、３７：出力処理、３９：擬似適合フィードバック処理、４１：最終出力 11: Registration process 13: Registered trademark 15: Feature extraction process 17: Database 21: Search process 23: Search question 25: Query feature extraction process 27: Shape matching process 29: Verification process 31: Shape similarity calculation process 33: Vienna classification code assignment process 35: Classification similarity calculation process 37: Output process 39: Pseudo-match feedback process 41: Final output

Claims

a query feature extraction unit that extracts one or more local features from a search question image that is a trademark image;
Search by referring to registered trademarks as reference images stored in a database with one or more Vienna classification codes assigned in advance as classification information relating to the shape features of the graphic elements of the image and one or more local features extracted. a shape matching unit that matches the query image and local features;
A shape similarity calculation unit that calculates the shape similarity of each reference image with respect to the search query image based on the collation;
a classification assigning unit that estimates and assigns one or more Wien classification codes as classification information for each search query image based on the classification information assigned to each reference image;
a classification similarity calculation unit that calculates the classification similarity of each reference image to the search question image based on the degree of matching between the classification information assigned to the search question image and the classification information assigned to each reference image;
an output unit that outputs one or more reference images similar to the search query image based on the shape similarity and the classification similarity;
with
The classification similarity is calculated by the following formula
However, a: Classification similarity, X: a set of Vienna classification codes assigned to trademark images by the classification assigning unit, Y: a set of Vienna classification codes assigned to each registered trademark stored in the database

calculated by
The output unit uses the shape similarity s calculated by the shape similarity calculation unit and the classification similarity a, the following formula
where m is a constant indicating the weight of shape similarity s and classification similarity a

A similar image search device that calculates a total similarity s' with each reference image by and outputs a predetermined number of reference images in descending order of the calculated total similarity s'.

The shape matching unit compares each local feature related to the query image with each local feature related to each reference image, and determines the similarity of each reference image to the query image with respect to the matched shape features. Item 2. The similar image search device according to Item 1.

a verification unit that verifies whether the local feature used for matching by the shape matching unit is a reliable feature;
Among the predetermined number of reference images output by the output unit for the search question image, a predetermined number of reference images in descending order of overall similarity s' are used as a new search question, and a similar image for the new search question is obtained. and a pseudo adaptive feedback unit for obtaining
The output unit adds the registered trademark image obtained by the pseudo-matching feedback process to the reference image as an initial output obtained as an output for the initial search question, which is the search question, as a final output, and a predetermined number of images that are the final output The following formula is used for the total similarity when outputting the reference image of
However, s' ₀ is the total similarity s' when using the initial search question, s' _k is the total similarity s' calculated using the initial output of rank k as the search question, and N is the pseudo-matching feedback process. number of initial outputs applied to

3. The similar image retrieval device according to claim 2, wherein

The classification imparting unit includes a machine learning model configured to output classification information for an image input by learning each reference image and the classification information imparted thereto as teacher data. Item 4. The similar image search device according to any one of items 1 to 3.

A process executed by a computer,
a query feature extraction process for extracting one or more local features from a search question image that is a trademark image;
Search by referring to registered trademarks as reference images stored in a database with one or more Vienna classification codes assigned in advance as classification information relating to the shape features of the graphic elements of the image and one or more local features extracted. Shape matching processing for matching the query image and local features,
A shape similarity calculation process for calculating the shape similarity of each reference image with respect to the search question image based on the collation;
a classification assignment process for estimating and assigning one or more Wien classification codes as classification information for each search query image based on the classification information assigned to each reference image;
Classification similarity calculation processing for calculating the classification similarity of each reference image to a search question image based on the degree of matching between the classification information given to the search question image and the classification information given to each reference image;
an output process of outputting one or more reference images similar to the search query image based on the shape similarity and the classification similarity;
with
The classification similarity is calculated by the following formula
However, a: Classification similarity, X: a set of Vienna classification codes assigned to trademark images by the classification process, and Y: a set of Vienna classification codes assigned to each registered trademark stored in the database.

calculated by
The output process uses the shape similarity s calculated by the shape similarity calculation process and the classification similarity a, the following formula
where m is a constant indicating the weight of shape similarity s and classification similarity a

A similar image retrieval method for calculating a total similarity s' with each reference image by and outputting a predetermined number of reference images in descending order of the calculated total similarity s'.

A program that causes a computer to perform processing,
a query feature extraction process for extracting one or more local features from a search question image that is a trademark image;
Search by referring to registered trademarks as reference images stored in a database with one or more Vienna classification codes assigned in advance as classification information relating to the shape features of the graphic elements of the image and one or more local features extracted. Shape matching processing for matching the query image and local features,
A shape similarity calculation process for calculating the shape similarity of each reference image with respect to the search question image based on the collation;
a classification assignment process for estimating and assigning one or more Wien classification codes as classification information for each search query image based on the classification information assigned to each reference image;
Classification similarity calculation processing for calculating the classification similarity of each reference image to a search question image based on the degree of matching between the classification information given to the search question image and the classification information given to each reference image;
an output process of outputting one or more reference images similar to the search query image based on the shape similarity and the classification similarity;
with
The classification similarity is calculated by the following formula
However, a: Classification similarity, X: a set of Vienna classification codes assigned to trademark images by the classification process, Y: a set of Vienna classification codes assigned to each registered trademark stored in the database

calculated by
The output process uses the shape similarity s calculated by the shape similarity calculation process and the classification similarity a, the following formula
where m is a constant indicating the weight of shape similarity s and classification similarity a

A similar image search program for calculating a total similarity s' with each reference image, and outputting a predetermined number of reference images in descending order of the calculated total similarity s'.