JP7822915B2

JP7822915B2 - Information processing device, information processing method, and information processing program

Info

Publication number: JP7822915B2
Application number: JP2022185304A
Authority: JP
Inventors: 望吉成; 伸裕鍜治
Original assignee: Individual
Current assignee: Individual
Priority date: 2022-11-18
Filing date: 2022-11-18
Publication date: 2026-03-03
Anticipated expiration: 2042-11-18
Also published as: JP2024074178A

Description

本発明は、情報処理装置、情報処理方法及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing method, and an information processing program.

従来、インターネット検索等を行う際に、ユーザが入力した文字列である検索クエリに関する支援を行うための技術が知られている。このような技術の一例として、入力された検索クエリが、誤記として入力され得る文字列に該当する場合に、当該誤記として入力され得る文字列に対応付けられている文字列に変換し、変換した後の文字列を検索クエリとして情報を検索する技術が提供されている。 Technologies have been known in the past to provide assistance with search queries, which are character strings entered by users when conducting internet searches, etc. One example of such technology is a technique that, when an entered search query corresponds to a character string that could be entered as a typographical error, converts it into a character string associated with that character string that could be entered as a typographical error, and then uses the converted character string as a search query to search for information.

特許第６５２９４５６号公報Patent No. 6529456

しかしながら、上述した技術では、検索クエリの適切なスペース区切りの位置を特定できるとは言えない場合がある。 However, the above-mentioned techniques may not be able to identify the appropriate space separator position in a search query.

例えば、上述した技術では、誤記の文字列を、対応する正規の文字列に変換しているに過ぎず、検索クエリの適切なスペース区切りの位置を特定できるとは言えない。 For example, the above-mentioned technology only converts typographical errors into corresponding correct strings, and cannot be said to be able to identify the appropriate space separators in a search query.

本願は、上記に鑑みてなされたものであって、検索クエリの適切なスペース区切りの位置を特定することを目的とする。 This application was made in consideration of the above, and aims to identify appropriate space delimiter positions in a search query.

本願に係る情報処理装置は、利用者が入力した検索クエリに含まれるスペース区切りを削除することで、当該検索クエリが示す文字列を生成する生成部と、前記生成部により生成された文字列に基づいて、前記検索クエリを分類する分類部と、同一のグループに分類された各検索クエリが入力された回数に基づいて、当該各検索クエリから生成された前記文字列におけるスペース区切りの位置を特定する特定部とを有することを特徴とする。 The information processing device according to the present application is characterized by having a generation unit that generates a character string indicated by a search query entered by a user by deleting space separators included in the search query; a classification unit that classifies the search queries based on the character strings generated by the generation unit; and an identification unit that identifies the position of space separators in the character strings generated from each search query based on the number of times each search query classified into the same group has been entered.

実施形態の一態様によれば、検索クエリの適切なスペース区切りの位置を特定することができるという効果を奏する。 According to one aspect of the embodiment, it is possible to identify the appropriate space delimiter position in a search query.

図１は、実施形態に係る情報処理の一例を示す図である。FIG. 1 is a diagram illustrating an example of information processing according to an embodiment. 図２は、実施形態に係る情報処理装置１０の構成例を示す図である。FIG. 2 is a diagram illustrating an example of the configuration of the information processing device 10 according to the embodiment. 図３は、実施形態に係る検索クエリ情報データベース３１の一例を示す図である。FIG. 3 is a diagram illustrating an example of the search query information database 31 according to the embodiment. 図４は、実施形態に係るグループ情報データベース３２の一例を示す図である。FIG. 4 is a diagram showing an example of the group information database 32 according to the embodiment. 図５は、実施形態に係る情報処理の手順の一例を示すフローチャートである。FIG. 5 is a flowchart illustrating an example of a procedure for information processing according to the embodiment. 図６は、情報処理装置１０の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 6 is a hardware configuration diagram showing an example of a computer that realizes the functions of the information processing device 10.

以下に、本願に係る情報処理装置、情報処理方法及び情報処理プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ説明する。なお、この実施形態により本願に係る情報処理装置、情報処理方法及び情報処理プログラムが限定されるものではない。また、以下の実施形態において、同一の部位には同一の符号を付し、重複する説明は省略される。 Below, modes for implementing an information processing device, information processing method, and information processing program according to the present application (hereinafter referred to as "embodiments") will be described with reference to the drawings. Note that the information processing device, information processing method, and information processing program according to the present application are not limited to these embodiments. Furthermore, in the following embodiments, identical components will be designated by the same reference numerals, and duplicate explanations will be omitted.

〔１．実施形態〕
図１を用いて、本実施形態の情報処理装置等により実現される情報処理を説明する。図１は、実施形態に係る情報処理の一例を示す図である。なお、図１では、本願に係る情報処理装置の一例である情報処理装置１０によって、実施形態に係る情報処理などが実現されるものとする。 1. Embodiment
Information processing implemented by an information processing device or the like according to this embodiment will be described with reference to Fig. 1. Fig. 1 is a diagram showing an example of information processing according to this embodiment. Note that in Fig. 1, the information processing according to this embodiment is implemented by an information processing device 10, which is an example of the information processing device according to the present application.

図１に示すように、実施形態に係る情報処理システム１は、情報処理装置１０と、利用者端末１００とを含む。情報処理装置１０及び利用者端末１００は、ネットワークＮ（例えば、図２参照）を介して有線または無線により相互に通信可能に接続される。ネットワークＮは、例えば、インターネットなどのＷＡＮ（Wide Area Network）である。なお、図１に示した情報処理システム１には、複数台の情報処理装置１０及び複数台の利用者端末１００が含まれていてもよい。 As shown in FIG. 1, the information processing system 1 according to the embodiment includes an information processing device 10 and a user terminal 100. The information processing device 10 and the user terminal 100 are connected to each other via a network N (see, for example, FIG. 2) via a wired or wireless connection so that they can communicate with each other. The network N is, for example, a wide area network (WAN) such as the Internet. Note that the information processing system 1 shown in FIG. 1 may include multiple information processing devices 10 and multiple user terminals 100.

図１に示す情報処理装置１０は、情報処理を行う情報処理装置であり、例えば、サーバ装置やクラウドシステム等により実現される。例えば、情報処理装置１０は、検索サービスを提供し、利用者が検索サービスにおいて入力した検索クエリの履歴を示す情報を自装置の記憶部で管理する。そして、情報処理装置１０は、検索クエリの履歴に基づいて、検索クエリの適切なスペース区切りの位置を特定する。 The information processing device 10 shown in FIG. 1 is an information processing device that performs information processing, and is realized, for example, by a server device or a cloud system. For example, the information processing device 10 provides a search service and manages information indicating the history of search queries entered by users in the search service in a storage unit of the information processing device itself. Then, the information processing device 10 identifies appropriate space delimiter positions for the search query based on the search query history.

なお、情報処理装置１０が提供するサービスは、例えば、検索サービスや、ショッピングサービス、決済サービス、路線検索サービス、地図提供サービス、旅行サービス、飲食店紹介サービス、天気予報サービス、スケジュール管理サービス、ニュース提供サービス、オークションサービス、動画コンテンツ配信サービス、金融取引（株取引等）サービスなどのサービスであってもよい。 The services provided by the information processing device 10 may be, for example, a search service, shopping service, payment service, route search service, map provision service, travel service, restaurant introduction service, weather forecast service, schedule management service, news provision service, auction service, video content distribution service, financial transaction (stock trading, etc.) service, etc.

ここで、従来、利用者が検索サービス等において検索クエリを入力する際、利用者の入力ミス等により、検索クエリにスペース区切りが入力されない場合や、検索クエリを音声入力で行ったことにより、スペース区切りが入力されない場合がある。このような場合、形態素解析技術を用いることにより、利用者が入力した検索クエリに対しスペース区切りを自動的に入力する方法が考えられる。しかしながら、このような方法を用いると、例えば、検索クエリに、最近新たに生じた固有名詞（例えば、漫画や映画、楽曲などのコンテンツの名称）が含まれる場合、固有名詞が過剰に区切られてしまう場合がある。 Here, conventionally, when a user enters a search query into a search service, etc., there are cases where spaces are not entered into the search query due to the user's input error, or where spaces are not entered because the search query is entered by voice. In such cases, a method can be considered that uses morphological analysis technology to automatically enter spaces into the search query entered by the user. However, when using such a method, for example, if the search query includes recently created proper nouns (e.g., the names of content such as manga, movies, or songs), the proper nouns may be excessively separated.

そこで、本実施形態の情報処理装置等は、検索クエリの適切なスペース区切りの位置を特定することを目的として、以下の情報処理を実行する。 The information processing device of this embodiment therefore performs the following information processing to identify appropriate space delimiter positions in a search query.

図１に示す利用者端末１００は、利用者によって利用される情報処理装置である。利用者端末１００は、例えば、スマートフォンや、タブレット型端末や、ノート型ＰＣ（Personal Computer）や、デスクトップＰＣや、携帯電話機や、ＰＤＡ（Personal Digital Assistant）等により実現される。なお、図１に示す例において、利用者端末１００は、利用者によって利用されるスマートフォンである場合を示す。 The user terminal 100 shown in Figure 1 is an information processing device used by a user. The user terminal 100 may be realized, for example, by a smartphone, tablet terminal, notebook PC (Personal Computer), desktop PC, mobile phone, or PDA (Personal Digital Assistant). In the example shown in Figure 1, the user terminal 100 is a smartphone used by a user.

以下、図１を用いて、情報処理装置１０が行う情報処理の一例について説明する。なお、以下の説明において、利用者端末１００を利用する利用者に応じて、利用者端末１００を利用者端末１００－１～１００－Ｎ（Ｎは任意の自然数）として説明する。例えば、利用者端末１００－１は、利用者ＩＤ「ＵＩＤ＃１」により識別される利用者（利用者Ｕ１）により使用される利用者端末１００である。また、以下では、利用者端末１００－１～１００－Ｎについて、特に区別なく説明する場合には、利用者端末１００と記載する。また、以下の説明では、利用者端末１００を利用者と同一視する場合がある。すなわち、以下では、利用者Ｕ１を利用者端末１００－１と読み替えることもできる。 An example of information processing performed by information processing device 10 will be described below using FIG. 1. In the following description, user terminal 100 will be described as user terminals 100-1 to 100-N (N is any natural number) depending on the user using user terminal 100. For example, user terminal 100-1 is the user terminal 100 used by a user (user U1) identified by user ID "UID#1." In the following description, user terminals 100-1 to 100-N will be referred to as user terminal 100 when there is no particular distinction between them. In the following description, user terminal 100 may be considered the same as the user. In other words, user U1 may be read as user terminal 100-1.

まず、情報処理装置１０は、利用者が検索サービスにおいて入力した検索クエリが示す文字列を生成する（ステップＳ１）。例えば、情報処理装置１０は、検索クエリに含まれるスペース区切りを削除することにより、文字列を生成する。 First, the information processing device 10 generates a character string indicated by a search query entered by a user in a search service (step S1). For example, the information processing device 10 generates the character string by deleting space separators included in the search query.

具体的な例を挙げると、情報処理装置１０は、検索クエリ「ＡＡＢＢＣＣ」において、「ＡＡ」及び「ＢＢ」の間に存在するスペース区切り、並びに、「ＢＢ」及び「ＣＣ」の間に存在するスペース区切りを削除することにより、検索クエリ「ＡＡＢＢＣＣ」から文字列「ＡＡＢＢＣＣ」を生成する。また、検索クエリ「ＡＡＢＢＣＣ」において、「ＡＡ」及び「ＢＢＣＣ」の間に存在するスペース区切りを削除することにより、検索クエリ「ＡＡＢＢＣＣ」から文字列「ＡＡＢＢＣＣ」を生成する。なお、検索クエリ「ＡＡＢＢＣＣ」にはスペース区切りが存在しないため、情報処理装置１０は、検索クエリ「ＡＡＢＢＣＣ」をそのまま示す文字列「ＡＡＢＢＣＣ」を生成する。 As a specific example, the information processing device 10 generates the character string "AABBCC" from the search query "AA BB CC" by deleting the space separator between "AA" and "BB" and the space separator between "BB" and "CC" in the search query "AA BB CC." Furthermore, the information processing device 10 generates the character string "AABBCC" from the search query "AA BBCC" by deleting the space separator between "AA" and "BBCC" in the search query "AA BBCC." Note that, because there are no space separators in the search query "AABBCC," the information processing device 10 generates the character string "AABBCC" that directly represents the search query "AABBCC."

続いて、情報処理装置１０は、検索クエリから生成した文字列に基づいて、検索クエリを分類する（ステップＳ２）。例えば、情報処理装置１０は、スペース区切りを削除した後の文字列が一致する検索クエリを、同一のグループに分類する。具体的な例を挙げると、情報処理装置１０は、検索クエリ「ＡＡＢＢＣＣ」、「ＡＡＢＢＣＣ」及び「ＡＡＢＢＣＣ」のそれぞれから生成された文字列が「ＡＡＢＢＣＣ」であるため、これらの検索クエリをグループ＃１に分類する。また情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」、「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」のそれぞれから生成された文字列が「ＤＤＥＥＦＦ」であるため、これらの検索クエリをグループ＃２に分類する。 Next, the information processing device 10 categorizes the search queries based on the character strings generated from the search queries (step S2). For example, the information processing device 10 categorizes search queries that have matching character strings after removing spaces into the same group. To give a specific example, the information processing device 10 categorizes the search queries "AABBCC," "AA BB CC," and "AA BBCC" into group #1 because the character string generated from each of the search queries is "AABBCC." Furthermore, the information processing device 10 categorizes the search queries "DDEEFF," "DDEE FF," and "DD EE FF" into group #2 because the character string generated from each of the search queries is "DDEEFF."

続いて、情報処理装置１０は、同一のグループに分類された各検索クエリのうち、検索サービスにおける利用者の入力回数が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定する（ステップＳ３）。例えば、情報処理装置１０は、グループ＃１において、検索クエリ「ＡＡＢＢＣＣ」の入力回数が最も多いため、文字列「ＡＡＢＢＣＣ」の適切なスペース区切りの位置が、「ＡＡ」及び「ＢＢ」の間、並びに、「ＢＢ」及び「ＣＣ」の間であると特定する。 Next, the information processing device 10 identifies the positions of space separators in the character string based on the search query that has been most frequently entered by users in the search service among the search queries classified into the same group (step S3). For example, since the search query "AA BB CC" has been entered most frequently in group #1, the information processing device 10 identifies that appropriate positions of space separators in the character string "AABBCC" are between "AA" and "BB" and between "BB" and "CC."

また、例えば、情報処理装置１０は、グループ＃２において、検索クエリ「ＤＤＥＥＦＦ」の入力回数が最も多いため、文字列「ＤＤＥＥＦＦ」の適切なスペース区切りの位置が、「ＤＤＥＥ」及び「ＦＦ」の間であると特定する。 Furthermore, for example, since the search query "DDEE FF" is entered most frequently in group #2, the information processing device 10 determines that the appropriate space delimiter position for the string "DDEEFF" is between "DDEE" and "FF."

続いて、情報処理装置１０は、ステップＳ３において特定された文字列におけるスペース区切りの位置に基づいて、文字列が入力された場合に、スペース区切りを追加、若しくは、スペース区切りを削除した文字列を出力するモデルの学習を行う（ステップＳ４）。例えば、情報処理装置１０は、検索サービスにおいて利用者が入力した検索クエリが入力された場合に、当該検索クエリにスペース区切りを追加、若しくは、当該検索クエリに存在するスペース区切りを削除した文字列を出力するモデルの学習を行う。 Next, the information processing device 10 trains a model that, when a string is input, adds a space separator or deletes the space separator based on the position of the space separator in the string identified in step S3, and outputs the string (step S4). For example, when a search query entered by a user in a search service is input, the information processing device 10 trains a model that adds a space separator to the search query or deletes the space separator from the search query.

なお、情報処理装置１０は、モデルの生成及び学習は、機械学習に関する種々の従来技術を適宜用いて行われてもよい。例えば、モデルの生成は、ＳＶＭ（Support Vector Machine）等の教師あり学習の機械学習に関する技術を用いて行われてもよい。また、モデルの生成は、教師なし学習の機械学習に関する技術を用いて行われてもよい。また、モデルの生成は、深層学習（ディープラーニング）の技術を用いて行われてもよい。また、モデルの生成は、ＤＮＮ（Deep Neural Network）やＲＮＮ（Recurrent Neural Network）やＣＮＮ（Convolutional Neural Network）等の種々のディープラーニングの技術を適宜用いて行われてもよい。なお、上記モデルの生成及び学習に関する記載は例示であり、モデルの生成は、取得可能な情報等に応じて適宜選択された学習手法により行われてもよい。 Note that the information processing device 10 may generate and learn models using various conventional machine learning techniques, as appropriate. For example, models may be generated using supervised machine learning techniques such as SVM (Support Vector Machine). Models may also be generated using unsupervised machine learning techniques. Models may also be generated using deep learning techniques. Models may also be generated using various deep learning techniques, such as DNN (Deep Neural Network), RNN (Recurrent Neural Network), and CNN (Convolutional Neural Network). Note that the above descriptions of model generation and learning are merely examples, and models may be generated using a learning method selected appropriately depending on the available information, etc.

続いて、情報処理装置１０は、検索サービスにおいて利用者が入力した検索クエリを利用者端末１００から受け付ける（ステップＳ５）。続いて、情報処理装置１０は、ステップＳ３において特定したスペース区切りの位置や、ステップＳ４において学習を行ったモデルを用いて、利用者が入力した検索クエリを編集し、編集した検索クエリに基づく検索結果を利用者端末１００に提供する（ステップＳ６）。例えば、利用者Ｕ１が検索クエリ「ＡＡＢＢＣＣ」を入力した場合（言い換えると、スペース区切りを削除した場合に、グループ＃１に分類される検索クエリを入力した場合）、情報処理装置１０は、当該検索クエリを「ＡＡＢＢＣＣ」と編集し、編集後の検索クエリに基づく検索結果を提供する。また、利用者Ｕ２が検索クエリ「ＤＤＥＥＦＦ」を入力した場合（言い換えると、スペース区切りを削除した場合に、グループ＃２に分類される検索クエリを入力した場合）、情報処理装置１０は、当該検索クエリを「ＤＤＥＥＦＦ」と編集し、編集後の検索クエリに基づく検索結果を提供する。 Next, the information processing device 10 receives from the user terminal 100 a search query entered by the user in the search service (step S5). Next, the information processing device 10 edits the search query entered by the user using the positions of the space separators identified in step S3 and the model trained in step S4, and provides search results based on the edited search query to the user terminal 100 (step S6). For example, if user U1 enters the search query "AA BBCC" (in other words, if the user enters a search query that would be classified as group #1 if the space separators were deleted), the information processing device 10 edits the search query to "AA BB CC" and provides search results based on the edited search query. Also, if user U2 enters the search query "DD EE FF" (in other words, if the user enters a search query that would be classified as group #2 if the space separators were deleted), the information processing device 10 edits the search query to "DDEE FF" and provides search results based on the edited search query.

また、適切なスペース区切りの位置が特定されていない文字列「ＧＧＨＨＩＩ」を示す検索クエリ（例えば、「ＧＧＨＨＩＩ」）を利用者Ｕ３が入力した場合、情報処理装置１０は、当該検索クエリをモデルに入力し、出力された文字列「ＧＧＨＨＩＩ」に基づく検索結果を提供する。 Furthermore, if user U3 inputs a search query (e.g., "GG HHII") indicating the string "GGHHII" without identifying the appropriate space separator, the information processing device 10 inputs the search query into the model and provides search results based on the output string "GGHH II."

続いて、情報処理装置１０は、検索結果を提供した後の利用者の行動に関する行動情報を利用者端末１００から取得する（ステップＳ７）。例えば、情報処理装置１０は、モデルから出力された文字列「ＧＧＨＨＩＩ」に基づく検索結果を提供した利用者Ｕ３の行動情報を、利用者端末１００－３から取得する。具体的な例を挙げると、情報処理装置１０は、「ＧＧＨＨＩＩ」に基づく検索結果として提供されたコンテンツを利用者Ｕ３が選択したか否かを示す行動情報を取得する。 Next, the information processing device 10 acquires behavioral information related to the user's behavior after the search results were provided from the user terminal 100 (step S7). For example, the information processing device 10 acquires behavioral information of user U3, who was provided with search results based on the character string "GGHH II" output from the model, from the user terminal 100-3. As a specific example, the information processing device 10 acquires behavioral information indicating whether user U3 selected the content provided as a search result based on "GGHH II."

続いて、情報処理装置１０は、利用者の行動情報に基づいて、モデルの再学習を行う（ステップＳ８）。例えば、情報処理装置１０は、利用者Ｕ３の行動情報が、「ＧＧＨＨＩＩ」に基づく検索結果として提供されたコンテンツを、利用者Ｕ３が選択したことを示す場合、情報処理装置１０は、文字列「ＧＧＨＨＩＩ」を編集した文字列「ＧＧＨＨＩＩ」を正例としてモデルの学習を行う。また、利用者Ｕ３の行動情報が、「ＧＧＨＨＩＩ」に基づく検索結果として提供されたコンテンツを、利用者Ｕ３が選択しなかったことを示す場合、情報処理装置１０は、文字列「ＧＧＨＨＩＩ」を編集した文字列「ＧＧＨＨＩＩ」を負例としてモデルの学習を行う。 Next, the information processing device 10 retrains the model based on the user's behavioral information (step S8). For example, if the behavioral information of user U3 indicates that user U3 selected content provided as a search result based on "GGHH II", the information processing device 10 trains the model using the edited string "GGHH II" of the string "GGHHII" as a positive example. Also, if the behavioral information of user U3 indicates that user U3 did not select content provided as a search result based on "GGHH II", the information processing device 10 trains the model using the edited string "GGHHII" of the string "GGHH II" as a negative example.

以上のように、実施形態に係る情報処理装置１０は、同一のグループに分類された各検索クエリのうち、利用者により最も多く入力された検索クエリに基づき、当該各検索クエリが示す文字列における、スペース区切りの位置を特定する。すなわち、実施形態に係る情報処理装置１０は、検索クエリの適切なスペース区切りの位置を特定することができる。 As described above, the information processing device 10 according to the embodiment identifies the position of space separators in the character string indicated by each search query based on the search query most frequently entered by users among the search queries classified into the same group. In other words, the information processing device 10 according to the embodiment can identify the appropriate position of space separators in the search query.

〔２．その他の処理例〕
なお、上述した処理は一例に過ぎず、情報処理装置１０は、様々な情報を用いて様々な処理を行ってもよい。この点について、以下例示を列挙する。 [2. Other processing examples]
The above-described process is merely an example, and the information processing device 10 may perform various processes using various information. In this regard, examples are listed below.

〔２－１．スペース区切りの特定について〕
図１の例において、情報処理装置１０は、グループ内において、各検索クエリが入力された割合を算出し、割合が最も多い検索クエリのスペース区切りの位置に基づき、文字列の適切なスペース区切りの位置を特定してもよい。 [2-1. Identifying space delimiters]
In the example of Figure 1, the information processing device 10 may calculate the proportion of times each search query was input within the group, and identify the appropriate position of a space separator in a character string based on the position of the space separator in the search query with the highest proportion.

また、情報処理装置１０は、同一のグループに分類された各検索クエリのうち、入力を行ったユニークユーザの数が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定してもよい。 In addition, the information processing device 10 may identify the position of space separators in the character string based on the search query that has been entered by the largest number of unique users among the search queries classified into the same group.

また、情報処理装置１０は、同一のグループに分類された各検索クエリのうち、入力が行われたユニークブラウザの数が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定してもよい。 In addition, the information processing device 10 may identify the position of the space separator in the character string based on the search query that has been entered by the largest number of unique browsers among the search queries classified into the same group.

また、情報処理装置１０は、同一のグループに分類された各検索クエリによる検索結果を示すコンテンツが選択された回数に基づいて、文字列におけるスペース区切りの位置を特定してもよい。例えば、情報処理装置１０は、同一のグループに分類された検索クエリのうち、検索結果として利用者に提供されたコンテンツが選択された回数（ＣＴＲ（Click Through Rate）等）が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定する。 In addition, the information processing device 10 may identify the position of a space separator in a character string based on the number of times that content indicating search results for each search query classified into the same group was selected. For example, the information processing device 10 identifies the position of a space separator in a character string based on the search query that, among search queries classified into the same group, has the most number of times that content provided to users as search results has been selected (e.g., CTR (Click Through Rate)).

また、情報処理装置１０は、同一のグループに分類された各検索クエリに対応する広告コンテンツのコンバージョンの回数に基づいて、文字列におけるスペース区切りの位置を特定してもよい。例えば、情報処理装置１０は、同一のグループに分類された検索クエリのうち、検索結果として利用者に提供された広告コンテンツが選択された回数（ＣＶＲ（Conversion Rate）等）が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定する。 The information processing device 10 may also identify the position of space separators in a character string based on the number of conversions of advertising content corresponding to each search query classified into the same group. For example, the information processing device 10 identifies the position of space separators in a character string based on the search query that, among search queries classified into the same group, has the highest number of times advertising content provided to users as search results has been selected (e.g., CVR (Conversion Rate)).

なお、情報処理装置１０は、上記の処理を複数組み合わせ、文字列の適切なスペース区切りの位置を特定してもよい。例えば、情報処理装置１０は、同一のグループに分類された各検索クエリが入力された回数や、グループ内における各検索クエリが入力された割合、各検索クエリの入力を行ったユニークユーザの数、各検索クエリの入力が行われたユニークブラウザの数、各検索クエリによる検索結果を示すコンテンツが選択された回数、各検索クエリに対応する広告コンテンツのコンバージョンの回数などに基づいて、文字列の適切なスペース区切りの位置を特定してもよい。 The information processing device 10 may combine multiple of the above processes to identify appropriate space delimiter positions in a character string. For example, the information processing device 10 may identify appropriate space delimiter positions in a character string based on the number of times each search query classified into the same group was entered, the percentage of times each search query was entered within the group, the number of unique users who entered each search query, the number of unique browsers that entered each search query, the number of times content showing search results for each search query was selected, the number of conversions of advertising content corresponding to each search query, etc.

〔２－２．所定の閾値以上に入力された検索クエリが複数存在する場合について〕
図１の例において、情報処理装置１０は、同一のグループに分類された各検索クエリのうち、入力回数が所定の閾値以上である検索クエリが複数存在する場合（言い換えると、同程度に利用者から多く入力された検索クエリが複数存在する場合）、スペース区切りの数が最も多い検索クエリに基づいて、文字列におけるスペース区切りの位置を特定してもよい。例えば、入力回数の閾値が「１００」と設定されている場合、グループ＃２では、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」が該当する。このような場合、情報処理装置１０は、スペース区切りの位置が最も多い検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。 [2-2. When there are multiple search queries with values equal to or greater than a certain threshold]
In the example of FIG. 1 , if there are multiple search queries classified into the same group that have been entered more than a predetermined threshold number of times (in other words, if there are multiple search queries entered by users at similar frequencies), the information processing device 10 may identify the positions of space separators in the character string based on the search query with the most space separators. For example, if the input frequency threshold is set to "100," group #2 includes the search queries "DDEE FF" and "DD EE FF." In such a case, the information processing device 10 identifies the positions of space separators in the character string "DDEEFF" based on the search query "DD EE FF" that has the most space separator positions.

なお、情報処理装置１０は、同一のグループに分類された各検索クエリのうち、入力回数が所定の閾値以上である検索クエリが複数存在する場合、スペース区切りの数が最も少ない検索クエリに基づいて、文字列におけるスペース区切りの位置を特定してもよい。例えば、情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」のうち、スペース区切りの位置が最も少ない検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。 Note that, when there are multiple search queries classified into the same group that have been entered a predetermined number of times or more, the information processing device 10 may identify the positions of space separators in the character string based on the search query with the fewest number of space separators. For example, of the search queries "DDEE FF" and "DD EE FF," the information processing device 10 may identify the positions of space separators in the character string "DDEEFF" based on the search query "DDEE FF" that has the fewest space separator positions.

情報処理装置１０は、同一のグループに分類された各検索クエリのうち、入力回数が所定の閾値以上である検索クエリが複数存在する場合、スペースにより区切られた各部分文字列が入力された回数に基づき抽出される検索クエリに基づいて、文字列におけるスペース区切りの位置を特定してもよい。例えば、情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」に含まれる部分検索クエリ「ＤＤ」、「ＥＥ」及び「ＤＤＥＥ」（言い換えると、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」とで、スペース区切りの位置が異なる部分に対応する部分検索クエリ）の入力回数をそれぞれ集計する。そして、部分検索クエリ「ＤＤ」、「ＥＥ」の各入力回数よりも、部分検索クエリ「ＤＤＥＥ」の入力回数の方が多い場合、情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。一方、部分検索クエリ「ＤＤ」、「ＥＥ」のうち、いずれかの入力回数が、部分検索クエリ「ＤＤＥＥ」の入力回数よりも多い場合、情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。 When there are multiple search queries classified into the same group whose input counts are equal to or greater than a predetermined threshold, the information processing device 10 may identify the positions of space separators in the character string based on search queries extracted based on the number of times each substring separated by a space is input. For example, the information processing device 10 tallies the input counts of the partial search queries "DD," "EE," and "DDEE" contained in the search queries "DDEE FF" and "DD EE FF" (in other words, the partial search queries corresponding to the parts of the search queries "DDEE FF" and "DD EE FF" where the positions of the space separators differ). If the input count of the partial search query "DDEE" is greater than the input count of the partial search queries "DD" and "EE," the information processing device 10 identifies the positions of the space separators in the character string "DDEEFF" based on the search query "DDEE FF." On the other hand, if the number of times either the partial search queries "DD" or "EE" is entered is greater than the number of times the partial search query "DDEE" is entered, the information processing device 10 identifies the positions of space delimiters in the character string "DDEEFF" based on the search query "DD EE FF."

〔２－３．一の検索クエリ（単一の検索クエリ）が複数のグループに分類可能な場合について〕
図１の例において、情報処理装置１０は、一の検索クエリが複数のグループに分類可能である場合、一の検索クエリが入力された回数と、複数のグループに分類される各検索クエリが入力された回数とに基づいて、一の検索クエリを分類してもよい。例えば、シソーラスを用いて、検索クエリの一部を同義語や類義語に変換した場合に、一の検索クエリが複数のグループに分類可能である場合、情報処理装置１０は、分類可能なグループのうち、分類されている検索クエリの入力回数（例えば、入力回数の平均値）と、当該一の検索クエリの入力回数との差異が所定の閾値未満であるグループに、当該一の検索クエリを分類する。また、一の検索クエリが複数のグループに分類可能である場合、情報処理装置１０は、分類可能なグループのうち、分類されている検索クエリの入力回数と、当該一の検索クエリの入力回数との差異が所定の閾値以上であるグループに、当該一の検索クエリを分類する。 [2-3. When a single search query can be classified into multiple groups]
In the example of FIG. 1 , if a search query can be classified into multiple groups, the information processing device 10 may classify the search query based on the number of times the search query is input and the number of times each search query classified into the multiple groups is input. For example, if a search query can be classified into multiple groups when part of the search query is converted into synonyms or similar words using a thesaurus, the information processing device 10 classifies the search query into one of the possible groups in which the difference between the number of inputs of the classified search query (e.g., the average number of inputs) and the number of inputs of the search query is less than a predetermined threshold. Furthermore, if a search query can be classified into multiple groups, the information processing device 10 classifies the search query into one of the possible groups in which the difference between the number of inputs of the classified search query and the number of inputs of the search query is equal to or greater than a predetermined threshold.

〔２－４．モデルについて〕
図１の例において、情報処理装置１０は、文字列が入力された場合に、スペースにより区切られた文字列を出力するモデル（以下、「モデル＃１」と記載する場合がある）と、文字列が入力された場合に、スペース区切りを削除した文字列を出力するモデル（以下、「モデル＃２」と記載する場合がある）とを個別に生成し、学習を行ってもよい。 [2-4. About the model]
In the example of Figure 1, the information processing device 10 may separately generate and train a model (hereinafter, sometimes referred to as "model #1") that outputs a string separated by spaces when a string is input, and a model (hereinafter, sometimes referred to as "model #2") that outputs a string with the space separators removed when a string is input.

なお、情報処理装置１０は、モデル＃１及び＃２とで、それぞれ異なる学習用のデータを用いて学習を行ってもよい。例えば、情報処理装置１０は、同一のグループに分類された各検索クエリの入力回数に基づいて特定した文字列のスペース区切りの位置（言い換えると、検索クエリの入力回数を指標として特定した文字列のスペース区切りの位置）と、当該文字列とを示す情報を用いてモデル＃１の学習を行う。そして、情報処理装置１０は、モデル＃１の学習に用いた情報とは異なる指標（例えば、ＣＴＲや、ＣＶＲ）を用いて特定した文字列のスペース区切りの位置と、当該文字列とを示す情報を用いて、モデル＃２の学習を行う。 In addition, the information processing device 10 may perform training using different training data for models #1 and #2. For example, the information processing device 10 trains model #1 using the positions of space separators in character strings identified based on the number of times each search query is entered in the same group (in other words, the positions of space separators in character strings identified using the number of times the search query is entered as an index) and information indicating the character strings. Then, the information processing device 10 trains model #2 using the positions of space separators in character strings identified using an index (e.g., CTR or CVR) different from the information used to train model #1 and information indicating the character strings.

また、情報処理装置１０は、モデル＃１に用いたものとは異なる手法により特定された文字列のスペース区切りの位置と、当該文字列とを示す情報を用いてモデル＃２の学習を行う。具体的な例を挙げると、情報処理装置１０は、同一のグループに分類された検索クエリのうち、検索結果として利用者に提供されたコンテンツが選択された回数が最も多い検索クエリに基づいて特定した文字列のスペース区切りの位置と、当該文字列を示す情報を用いてモデル＃２の学習を行う。また、情報処理装置１０は、同一のグループに分類された各検索クエリに対応する広告コンテンツのコンバージョンの回数に基づいて特定した文字列のスペース区切りの位置と、当該文字列を示す情報を用いてモデル＃２の学習を行う。 The information processing device 10 also trains model #2 using the positions of space separators in character strings identified by a method different from that used for model #1 and information indicating the character strings. To give a specific example, the information processing device 10 trains model #2 using the positions of space separators in character strings identified based on the search query for which content provided to users as search results was selected the most frequently among search queries classified into the same group and information indicating the character strings. The information processing device 10 also trains model #2 using the positions of space separators in character strings identified based on the number of conversions of advertising content corresponding to each search query classified into the same group and information indicating the character strings.

〔２－５．所定の文字を削除した文字列の生成について〕
図１の例において、情報処理装置１０は、スペース区切り以外に、検索クエリに含まれる所定の文字を削除することにより、文字列を生成してもよい。例えば、情報処理装置１０は、スペース区切りと、スペースで区切られた部分検索クエリの長音符とを削除することで、検索クエリが示す文字列を生成する。具体的な例を挙げると、情報処理装置１０は、検索クエリ「チェリージュース旬」や、「チェリージュース旬」から、スペース区切りと、単語「チェリー」の語末の長音符等を削除することにより、文字列「チェリジュース旬」を生成する。 [2-5. Generating a string by deleting specific characters]
1 , the information processing device 10 may generate a character string by deleting predetermined characters included in the search query in addition to space separators. For example, the information processing device 10 generates a character string indicated by the search query by deleting space separators and macrons from partial search queries separated by spaces. As a specific example, the information processing device 10 generates the character string "cherry juice in season" by deleting space separators and macrons at the end of the word "cherry" from the search queries "cherry juice in season" and "cherry juice in season."

なお、情報処理装置１０は、検索クエリに含まれるスペース区切りに加え、検索クエリに含まれる長音符を削除した場合に、検索クエリを所定のグループに分類することが可能である場合に、検索クエリの長音符を削除してもよい。例えば、検索クエリ「チェリージュース旬」に含まれる単語「チェリー」の語末の長音符等を削除することにより、当該検索クエリを、文字列「チェリジュース旬」のグループに分類可能である場合、情報処理装置１０は、単語「チェリー」の語末の長音符を削除する。 In addition to the space separators included in the search query, the information processing device 10 may also delete the long note from the search query if deleting the long note included in the search query makes it possible to classify the search query into a predetermined group. For example, if deleting the long note at the end of the word "cherry" included in the search query "cherry juice in season" makes it possible to classify the search query into the group of the character string "cherry juice in season," the information processing device 10 deletes the long note at the end of the word "cherry."

また、情報処理装置１０は、検索クエリに含まれるスペース区切りに加え、検索クエリに含まれる長音符をすべて削除し、文字列を生成してもよい。例えば、検索クエリ「チェリージュース旬」から、スペース区切りと、長音符とを削除し、文字列「チェリジュス旬」を生成してもよい。また、情報処理装置１０は、スペース区切りを残し、文字列「チェリジュス旬」を生成してもよい。 The information processing device 10 may also generate a string by deleting all of the long notes included in the search query, in addition to the space separators included in the search query. For example, the information processing device 10 may delete the space separators and long notes from the search query "cherry juice shun" to generate the string "cherry juice shun." The information processing device 10 may also leave the space separators and generate the string "cherry juice shun."

〔３．情報処理装置の構成〕
次に、図２を用いて、情報処理装置１０の構成について説明する。図２は、実施形態に係る情報処理装置１０の構成例を示す図である。図２に示すように、情報処理装置１０は、通信部２０と、記憶部３０と、制御部４０とを有する。 3. Configuration of information processing device
Next, the configuration of the information processing device 10 will be described with reference to Fig. 2. Fig. 2 is a diagram showing an example of the configuration of the information processing device 10 according to the embodiment. As shown in Fig. 2, the information processing device 10 includes a communication unit 20, a storage unit 30, and a control unit 40.

（通信部２０について）
通信部２０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。そして、通信部２０は、ネットワークＮと有線または無線で接続され、利用者端末１００等との間で情報の送受信を行う。 (Regarding the communication unit 20)
The communication unit 20 is realized by, for example, a network interface card (NIC), etc. The communication unit 20 is connected to the network N by wire or wirelessly, and transmits and receives information to and from the user terminal 100, etc.

（記憶部３０について）
記憶部３０は、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。図２に示すように、記憶部３０は、検索クエリ情報データベース３１と、グループ情報データベース３２と、モデルデータベース３３とを有する。 (Regarding the storage unit 30)
The storage unit 30 is realized by, for example, a semiconductor memory element such as a random access memory (RAM) or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 2 , the storage unit 30 has a search query information database 31, a group information database 32, and a model database 33.

（検索クエリ情報データベース３１について）
検索クエリ情報データベース３１は、利用者に入力された検索クエリに関する各種の情報を記憶する。ここで、図３を用いて、検索クエリ情報データベース３１が記憶する情報の一例を説明する。図３は、実施形態に係る検索クエリ情報データベース３１の一例を示す図である。図３の例において、検索クエリ情報データベース３１は、「検索クエリＩＤ」、「グループＩＤ」、「検索クエリ」、「文字列」、「入力回数」、「検索結果情報」、「コンバージョン情報」といった項目を有する。 (Regarding the search query information database 31)
The search query information database 31 stores various information related to search queries entered by users. An example of information stored in the search query information database 31 will now be described with reference to FIG. 3. FIG. 3 is a diagram showing an example of the search query information database 31 according to an embodiment. In the example of FIG. 3, the search query information database 31 has items such as a "search query ID," a "group ID," a "search query," a "character string," a "number of inputs,""search result information," and "conversion information."

「検索クエリＩＤ」は、検索クエリを識別するための識別情報を示す。「グループＩＤ」は、グループを識別するための識別情報を示す。「検索クエリ」は、検索クエリを示す情報（テキスト情報等）を示す。「文字列」は、検索クエリから生成された文字列を示す。「入力回数」は、検索クエリが入力された回数を示す。「検索結果情報」は、検索クエリが入力された際の検索結果に関する情報を示し、例えば、検索結果として入力されたコンテンツが選択された回数などを示す情報などが格納される。「コンバージョン情報」は、検索クエリに対応する広告コンテンツのコンバージョンに関する情報を示す。 "Search query ID" indicates identification information for identifying a search query. "Group ID" indicates identification information for identifying a group. "Search query" indicates information (such as text information) that indicates the search query. "String" indicates a string generated from the search query. "Number of inputs" indicates the number of times the search query was input. "Search result information" indicates information about the search results when a search query was input, and stores information indicating, for example, the number of times content input as a search result was selected. "Conversion information" indicates information about conversions of advertising content corresponding to the search query.

すなわち、図３では、検索クエリＩＤ「ＱＩＤ＃１」により識別される検索クエリが、グループＩＤ「ＧＩＤ＃１」により識別されるグループに分類された、当該検索クエリが「検索クエリ＃１」であり、文字列が「文字列＃１」、入力回数が「入力回数＃１」、検索結果情報が「検索結果情報＃１」、コンバージョン情報が「コンバージョン情報＃１」である例を示す。 In other words, Figure 3 shows an example in which a search query identified by search query ID "QID#1" is classified into a group identified by group ID "GID#1", the search query is "search query #1", the character string is "character string #1", the number of inputs is "number of inputs #1", the search result information is "search result information #1", and the conversion information is "conversion information #1".

なお、検索クエリ情報データベース３１が記憶する情報は上記のものに限定されず、各種の情報を記憶してよい、例えば、検索クエリ情報データベース３１は、検索クエリの入力を行ったユニークユーザの数や、検索クエリの入力が行われたユニークブラウザの数などを示す情報を記憶してもよい。 Note that the information stored in the search query information database 31 is not limited to the above, and various types of information may be stored. For example, the search query information database 31 may store information indicating the number of unique users who have input search queries, the number of unique browsers that have input search queries, etc.

（グループ情報データベース３２について）
グループ情報データベース３２は、検索クエリが分類されるグループに関する各種の情報を記憶する。ここで、図４を用いて、グループ情報データベース３２が記憶する情報の一例を説明する。図４は、実施形態に係るグループ情報データベース３２の一例を示す図である。図４の例において、グループ情報データベース３２は、「グループＩＤ」、「検索クエリ情報」、「文字列」、「スペース区切り情報」といった項目を有する。 (Regarding the group information database 32)
The group information database 32 stores various information related to groups into which search queries are classified. An example of information stored in the group information database 32 will now be described with reference to FIG. 4. FIG. 4 is a diagram illustrating an example of the group information database 32 according to an embodiment. In the example of FIG. 4, the group information database 32 has items such as "group ID,""search query information,""characterstring," and "space delimiter information."

「グループＩＤ」は、グループを識別するための識別情報を示す。「検索クエリ情報」は、グループに含まれる検索クエリに関する情報（検索クエリＩＤ等）を示す。「文字列」は、グループに分類された検索クエリから生成された文字列を示す。「スペース区切り情報」は、文字列のスペース区切りの位置を示す。 "Group ID" indicates identification information for identifying a group. "Search query information" indicates information about the search queries included in the group (such as the search query ID). "String" indicates the string generated from the search queries classified into the group. "Space separator information" indicates the position of the space separator in the string.

すなわち、図４では、グループＩＤ「ＧＩＤ＃１」により期別されるグループの検索クエリ情報が「検索クエリ情報＃１」、文字列が「文字列＃１」、スペース区切り情報が「スペース区切り情報＃１」である例を示す。 In other words, Figure 4 shows an example in which the search query information for a group classified by group ID "GID#1" is "search query information #1," the character string is "character string #1," and the space separator information is "space separator information #1."

（モデルデータベース３３について）
モデルデータベース３３は、文字列が入力された場合に、スペース区切りを追加、若しくは、スペース区切りを削除した文字列を出力するモデルを記憶する。 (Regarding the model database 33)
The model database 33 stores a model that, when a character string is input, outputs a character string with a space delimiter added or deleted.

（制御部４０について）
制御部４０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等によって、情報処理装置１０内部の記憶装置に記憶されている各種プログラムがＲＡＭを作業領域として実行されることにより実現される。また、制御部４０は、コントローラであり、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現される。実施形態に係る制御部４０は、図３に示すように、生成部４１と、分類部４２と、特定部４３と、学習部４４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。 (Regarding the control unit 40)
The control unit 40 is a controller, and is realized, for example, by a CPU (Central Processing Unit) or an MPU (Micro Processing Unit) executing various programs stored in a storage device inside the information processing device 10 using RAM as a work area. The control unit 40 is also a controller, and is realized, for example, by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array). As shown in FIG. 3 , the control unit 40 according to the embodiment has a generation unit 41, a classification unit 42, an identification unit 43, and a learning unit 44, and realizes or executes the functions and actions of information processing described below.

（生成部４１について）
生成部４１は、利用者が入力した検索クエリに含まれるスペース区切りを削除することで、当該検索クエリが示す文字列を生成する。例えば、図１の例において、生成部４１は、検索クエリ「ＡＡＢＢＣＣ」において、「ＡＡ」及び「ＢＢ」の間に存在するスペース区切り、並びに、「ＢＢ」及び「ＣＣ」の間に存在するスペース区切りを削除することにより、検索クエリ「ＡＡＢＢＣＣ」から文字列「ＡＡＢＢＣＣ」を生成し、検索クエリ情報データベース３１に格納する。 (Regarding the generation unit 41)
The generation unit 41 generates a character string indicated by the search query entered by the user by deleting space separators included in the search query. For example, in the example of Fig. 1 , the generation unit 41 deletes the space separator between "AA" and "BB" and the space separator between "BB" and "CC" in the search query "AA BB CC" to generate a character string "AABBCC" from the search query "AA BB CC" and stores the character string in the search query information database 31.

また、生成部４１は、さらに、検索クエリに含まれる所定の文字を削除することで、当該検索クエリが示す文字列を生成してもよい。例えば、図１の例において、生成部４１は、スペース区切りと、スペースで区切られた部分検索クエリの語末の長音符とを削除することで、検索クエリが示す文字列を生成する。 The generation unit 41 may also generate a character string indicated by the search query by deleting predetermined characters included in the search query. For example, in the example of FIG. 1, the generation unit 41 generates a character string indicated by the search query by deleting space separators and macrons at the ends of partial search queries separated by spaces.

（分類部４２について）
分類部４２は、生成部４１により生成された文字列に基づいて、検索クエリを分類する。例えば、図１の例において、分類部４２は、検索クエリ「ＡＡＢＢＣＣ」、「ＡＡＢＢＣＣ」及び「ＡＡＢＢＣＣ」のそれぞれから生成された文字列が「ＡＡＢＢＣＣ」であるため、これらの検索クエリをグループ＃１に分類する。具体的な例を挙げると、分類部４２は、検索クエリ「ＡＡＢＢＣＣ」、「ＡＡＢＢＣＣ」及び「ＡＡＢＢＣＣ」を、グループ＃１に対応付けてグループ情報データベース３２に格納する。 (Regarding the classification unit 42)
The classification unit 42 classifies the search queries based on the character strings generated by the generation unit 41. For example, in the example of FIG. 1 , the character string generated from each of the search queries "AABBCC,""AA BB CC," and "AA BBCC" is "AABBCC," so the classification unit 42 classifies these search queries into group #1. As a specific example, the classification unit 42 associates the search queries "AABBCC,""AA BB CC," and "AA BBCC" with group #1 and stores them in the group information database 32.

また、分類部４２は、一の検索クエリが複数のグループに分類可能である場合、一の検索クエリが入力された回数と、当該複数のグループに分類される各検索クエリが入力された回数とに基づいて、一の検索クエリを分類してもよい。例えば、図１の例において、シソーラスを用いて、検索クエリの一部を同義語や類義語に変換した場合に、一の検索クエリが複数のグループに分類可能である場合、分類部４２は、分類可能なグループのうち、分類されている検索クエリの入力回数と、当該一の検索クエリの入力回数との差異が所定の閾値未満であるグループに、当該一の検索クエリを分類する。また、一の検索クエリが複数のグループに分類可能である場合、分類部４２は、分類可能なグループのうち、分類されている検索クエリの入力回数と、当該一の検索クエリの入力回数との差異が所定の閾値以上であるグループに、当該一の検索クエリを分類する。 Furthermore, when a search query can be classified into multiple groups, the classification unit 42 may classify the search query based on the number of times the search query is input and the number of times each search query classified into the multiple groups is input. For example, in the example of FIG. 1, when a thesaurus is used to convert part of the search query into synonyms or similar words, if the search query can be classified into multiple groups, the classification unit 42 classifies the search query into one of the possible groups into which the difference between the number of times the search query is input and the number of times the search query is input is less than a predetermined threshold. Furthermore, when a search query can be classified into multiple groups, the classification unit 42 classifies the search query into one of the possible groups into which the difference between the number of times the search query is input and the number of times the search query is input is equal to or greater than a predetermined threshold.

（特定部４３について）
特定部４３は、同一のグループに分類された各検索クエリが入力された回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。例えば、図１の例において、特定部４３は、グループ＃１において、検索クエリ「ＡＡＢＢＣＣ」の入力回数が最も多いため、文字列「ＡＡＢＢＣＣ」の適切なスペース区切りの位置が、「ＡＡ」及び「ＢＢ」の間、並びに、「ＢＢ」及び「ＣＣ」の間であると特定する。 (Regarding the identification unit 43)
The identification unit 43 identifies the positions of space separators in the character string generated from each search query classified into the same group based on the number of times each search query has been input. For example, in the example of Fig. 1, the identification unit 43 identifies that the appropriate positions of space separators in the character string "AABBCC" are between "AA" and "BB" and between "BB" and "CC" because the search query "AA BB CC" has been input the most in group #1.

また、特定部４３は、同一のグループに分類された各検索クエリが入力された割合であって、同一のグループ内での割合に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、特定部４３は、グループ内において、各検索クエリが入力された割合を算出し、割合が最も多い検索クエリのスペース区切りの位置に基づき、文字列の適切なスペース区切りの位置を特定する。 The identification unit 43 may also identify the position of space separators in the character string generated from each search query based on the proportion of search queries that are input within the same group, that is, the proportion within the same group. For example, in the example of FIG. 1, the identification unit 43 calculates the proportion of search queries that are input within the group, and identifies the appropriate position of space separators in the character string based on the position of space separators in the search query with the highest proportion.

また、特定部４３は、同一のグループに分類された各検索クエリを入力したブラウザの数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、特定部４３は、同一のグループに分類された各検索クエリのうち、入力が行われたユニークブラウザの数が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定する。 The identification unit 43 may also identify the position of space separators in the character string generated from each search query classified into the same group, based on the number of browsers that input each search query. For example, in the example of FIG. 1, the identification unit 43 identifies the position of space separators in the character string, based on the search query that has been input by the largest number of unique browsers, among the search queries classified into the same group.

また、特定部４３は、さらに、同一のグループに分類された各検索クエリによる検索結果を示すコンテンツが選択された回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、特定部４３は、同一のグループに分類された検索クエリのうち、検索結果として利用者に提供されたコンテンツが選択された回数が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定する。 The identification unit 43 may further identify the position of space separators in the character string generated from each search query based on the number of times that content representing search results for each search query classified in the same group was selected. For example, in the example of FIG. 1, the identification unit 43 identifies the position of space separators in the character string based on the search query for which content provided to users as search results was selected the most frequently, among the search queries classified in the same group.

また、特定部４３は、さらに、同一のグループに分類された各検索クエリに対応する広告コンテンツのコンバージョンの回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、特定部４３は、同一のグループに分類された検索クエリのうち、検索結果として利用者に提供された広告コンテンツが選択された回数が最も多い検索クエリに基づいて、文字列のスペース区切りの位置を特定する。 The identification unit 43 may further identify the position of space separators in the character string generated from each search query based on the number of conversions of advertising content corresponding to each search query classified into the same group. For example, in the example of FIG. 1, the identification unit 43 identifies the position of space separators in the character string based on the search query for which advertising content provided to users as search results has been selected the most frequently, among the search queries classified into the same group.

また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が最も多い検索クエリのスペース区切りの位置に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、特定部４３は、グループ＃１において、検索クエリ「ＡＡＢＢＣＣ」の入力回数が最も多いため、文字列「ＡＡＢＢＣＣ」の適切なスペース区切りの位置が、「ＡＡ」及び「ＢＢ」の間、並びに、「ＢＢ」及び「ＣＣ」の間であると特定する。 The identification unit 43 may also identify the positions of space separators in the character string generated from each search query based on the positions of space separators in the search query that has been entered most frequently among the search queries classified into the same group. For example, in the example of FIG. 1, the identification unit 43 identifies that the appropriate positions of space separators in the character string "AABBCC" are between "AA" and "BB" and between "BB" and "CC" because the search query "AA BB CC" has been entered the most frequently in group #1.

また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が所定の閾値以上である検索クエリが複数存在する場合、入力された回数が所定の閾値以上である検索クエリのうち、スペース区切りの数が最も多い検索クエリに基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、入力回数の閾値が「１００」と設定されている場合、グループ＃２では、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」が該当する。このような場合、特定部４３は、スペース区切りの位置が最も多い検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。 Furthermore, when there are multiple search queries classified into the same group that have been input a predetermined number of times or more, the identification unit 43 may identify the positions of space separators in the character string generated from each search query based on the search query with the greatest number of space separators among the search queries that have been input a predetermined number of times or more. For example, in the example of FIG. 1, if the input count threshold is set to "100," group #2 would include the search queries "DDEE FF" and "DD EE FF." In such a case, the identification unit 43 identifies the positions of space separators in the character string "DDEEFF" based on the search query "DD EE FF" that has the greatest number of space separator positions.

また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が所定の閾値以上である検索クエリが複数存在する場合、入力された回数が所定の閾値以上である検索クエリのうち、スペース区切りの数が最も少ない検索クエリに基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、入力回数の閾値が「１００」と設定されている場合、グループ＃２では、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」が該当する。このような場合、特定部４３は、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」のうち、スペース区切りの位置が最も少ない検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。 Furthermore, when there are multiple search queries classified into the same group that have been input a predetermined number of times or more, the identification unit 43 may identify the positions of space separators in the character string generated from each search query based on the search query with the fewest number of space separators among the search queries that have been input a predetermined number of times or more. For example, in the example of FIG. 1, if the input count threshold is set to "100," group #2 corresponds to the search queries "DDEE FF" and "DD EE FF." In such a case, the identification unit 43 identifies the positions of space separators in the character string "DDEEFF" based on the search query "DDEE FF" that has the fewest space separators among the search queries "DDEE FF" and "DD EE FF."

また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が所定の閾値以上である検索クエリが複数存在する場合、入力された回数が所定の閾値以上である検索クエリのうち、スペースにより区切られた各部分文字列が入力された回数に基づき抽出される検索クエリに基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定してもよい。例えば、図１の例において、入力回数の閾値が「１００」と設定されている場合、グループ＃２では、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」が該当する。このような場合、特定部４３は、検索クエリ「ＤＤＥＥＦＦ」及び「ＤＤＥＥＦＦ」に含まれる部分検索クエリ「ＤＤ」、「ＥＥ」及び「ＤＤＥＥ」の入力回数をそれぞれ集計する。そして、部分検索クエリ「ＤＤ」、「ＥＥ」の各入力回数よりも、部分検索クエリ「ＤＤＥＥ」の入力回数の方が多い場合、情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。一方、部分検索クエリ「ＤＤ」、「ＥＥ」のうち、いずれかの入力回数が、部分検索クエリ「ＤＤＥＥ」の入力回数よりも多い場合、情報処理装置１０は、検索クエリ「ＤＤＥＥＦＦ」に基づき、文字列「ＤＤＥＥＦＦ」のスペース区切りの位置を特定する。 Furthermore, when there are multiple search queries classified into the same group that have been input a predetermined number of times or more, the identification unit 43 may identify the position of space separators in the character strings generated from each search query based on search queries extracted based on the number of times each substring separated by a space was input from the search queries that have been input a predetermined number of times or more. For example, in the example of Figure 1, if the input count threshold is set to "100," group #2 corresponds to the search queries "DDEE FF" and "DD EE FF." In such a case, the identification unit 43 tallies the input counts of the partial search queries "DD," "EE," and "DDEE" included in the search queries "DDEE FF" and "DD EE FF," respectively. If the number of times the partial search query "DDEE" is entered is greater than the number of times each of the partial search queries "DD" and "EE" is entered, the information processing device 10 identifies the positions of space separators in the character string "DDEEFF" based on the search query "DDEE FF". On the other hand, if the number of times either the partial search queries "DD" or "EE" is entered is greater than the number of times the partial search query "DDEE" is entered, the information processing device 10 identifies the positions of space separators in the character string "DDEEFF" based on the search query "DD EE FF".

（学習部４４について）
学習部４４は、特定部４３により特定された文字列におけるスペース区切りの位置に基づいて、文字列が入力された場合に、スペースにより区切られた文字列を出力するモデルの学習を行う。例えば、図１の例において、学習部４４は、文字列が入力された場合に、スペースにより区切られた文字列を出力するモデルの学習を行う。 (Regarding the learning unit 44)
The learning unit 44 learns a model that outputs a character string separated by spaces when a character string is input, based on the positions of space separators in the character string identified by the identification unit 43. For example, in the example of Fig. 1, the learning unit 44 learns a model that outputs a character string separated by spaces when a character string is input.

また、学習部４４は、さらに、モデルにより出力された文字列を用いた検索結果に関する情報に基づいて、モデルの学習を行ってもよい。例えば、図１の例において、学習部４４は、利用者Ｕ３の行動情報が、「ＧＧＨＨＩＩ」に基づく検索結果として提供されたコンテンツを、利用者Ｕ３が選択したことを示す場合、学習部４４は、文字列「ＧＧＨＨＩＩ」を編集した文字列「ＧＧＨＨＩＩ」を正例としてモデルの学習を行う。また、利用者Ｕ３の行動情報が、「ＧＧＨＨＩＩ」に基づく検索結果として提供されたコンテンツを、利用者Ｕ３が選択しなかったことを示す場合、学習部４４は、文字列「ＧＧＨＨＩＩ」を編集した文字列「ＧＧＨＨＩＩ」を負例としてモデルの学習を行う。 The learning unit 44 may also train the model based on information regarding search results using the character string output by the model. For example, in the example of FIG. 1, if the behavioral information of user U3 indicates that user U3 selected content provided as a search result based on "GGHH II", the learning unit 44 trains the model using the edited character string "GGHH II" of the character string "GGHHII" as a positive example. Also, if the behavioral information of user U3 indicates that user U3 did not select content provided as a search result based on "GGHH II", the learning unit 44 trains the model using the edited character string "GGHHII" of the character string "GGHH II" as a negative example.

また、学習部４４は、特定部４３により特定された文字列におけるスペース区切りの位置に基づいて、文字列が入力された場合に、スペース区切りを削除した文字列を出力するモデルの学習を行ってもよい。例えば、図１の例において、学習部４４は、文字列が入力された場合に、スペース区切りを削除した文字列を出力するモデルの学習を行う。 The learning unit 44 may also train a model that, when a string is input, outputs a string from which space separators have been deleted, based on the positions of space separators in the string identified by the identification unit 43. For example, in the example of Figure 1, the learning unit 44 trains a model that, when a string is input, outputs a string from which space separators have been deleted.

〔４．情報処理のフロー〕
図５を用いて、実施形態に係る情報処理装置１０の情報処理の手順について説明する。図５は、実施形態に係る情報処理の手順の一例を示すフローチャートである。 [4. Information processing flow]
The procedure of information processing by the information processing apparatus 10 according to the embodiment will be described with reference to Fig. 5. Fig. 5 is a flowchart showing an example of the procedure of information processing according to the embodiment.

図５に示すように、情報処理装置１０は、利用者が入力した検索クエリに含まれるスペース区切りを削除することで、当該検索クエリが示す文字列を生成する（ステップＳ１０１）。続いて、情報処理装置１０は、生成された文字列に基づいて、検索クエリを分類する（ステップＳ１０２）。続いて、情報処理装置１０は、同一のグループに分類された各検索クエリが入力された回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する（ステップＳ１０３）。続いて、情報処理装置１０は、利用者から検索クエリを受け付けたか否かを判定する（ステップＳ１０４）。検索クエリを受け付けていない場合（ステップＳ１０４；Ｎｏ）、情報処理装置１０は、検索クエリを受け付けるまで待機する。 As shown in FIG. 5, the information processing device 10 generates a character string indicated by a search query entered by a user by deleting space separators included in the search query (step S101). Next, the information processing device 10 classifies the search queries based on the generated character strings (step S102). Next, the information processing device 10 identifies the positions of space separators in the character strings generated from each search query based on the number of times each search query classified into the same group has been entered (step S103). Next, the information processing device 10 determines whether a search query has been received from the user (step S104). If a search query has not been received (step S104; No), the information processing device 10 waits until a search query is received.

一方、検索クエリを受け付けた場合（ステップＳ１０４；Ｙｅｓ）、特定したスペース区切りの位置と、受け付けた検索クエリとに基づき、検索結果を提供し（ステップＳ１０５）、処理を終了する。 On the other hand, if a search query has been received (step S104; Yes), search results are provided based on the identified space separator position and the received search query (step S105), and the process ends.

〔５．変形例〕
上述の実施形態は一例を示したものであり、種々の変更及び応用が可能である。 5. Modifications
The above-described embodiment is merely an example, and various modifications and applications are possible.

〔５－１．検索クエリが入力された期間について〕
上述の実施形態において、情報処理装置１０は、利用者が所定の期間（例えば、直近１か月や、直近６か月など）に入力した検索クエリを用いて、上述の処理を実行してもよい。なお、利用者が所定の期間において入力した検索クエリの量が、所定の量に達しない場合、情報処理装置１０は、利用者が入力した検索クエリの量が所定の量に達するように、期間を延長（例えば、直近１か月から直近３か月に延長、直近６か月から直近１年に延長など）してもよい。 [5-1. Regarding the period when the search query was entered]
In the above-described embodiment, the information processing device 10 may perform the above-described process using search queries input by a user in a predetermined period (e.g., the last month, the last six months, etc.). Note that if the amount of search queries input by a user in a predetermined period does not reach a predetermined amount, the information processing device 10 may extend the period (e.g., from the last month to the last three months, or from the last six months to the last year, etc.) so that the amount of search queries input by the user reaches the predetermined amount.

〔５－２．文字列の生成について〕
情報処理装置１０が検索クエリから文字列を生成する手法は、上述したものに限定されず、任意のものが用いられてよい。例えば、文字列「ＢＢＣＣＡＡ」のグループが存在する場合、情報処理装置１０は、検索クエリ「ＡＡＢＢＣＣ」からスペース区切りを削除するとともに、部分検索クエリの順番を変更し、当該検索クエリから文字列「ＢＢＣＣＡＡ」を生成してもよい。 [5-2. Generating strings]
The method by which the information processing device 10 generates a character string from a search query is not limited to the above-described method, and any method may be used. For example, if a group of the character string "BBCCAA" exists, the information processing device 10 may delete the space separator from the search query "AA BB CC" and change the order of the partial search queries to generate the character string "BBCCAA" from the search query.

〔５－３．処理態様について〕
上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、逆に、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文章中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 5-3. Processing Mode
Of the processes described in the above embodiments, all or part of the processes described as being performed automatically can be performed manually, and conversely, all or part of the processes described as being performed manually can be performed automatically using known methods. In addition, the information, including the processing procedures, specific names, various data, and parameters shown in the above text and drawings, can be changed as desired unless otherwise specified. For example, the various information shown in each drawing is not limited to the information shown in the drawings.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Furthermore, the components of each device shown in the figure are functional concepts and do not necessarily have to be physically configured as shown. In other words, the specific form of distribution and integration of each device is not limited to that shown, and all or part of them can be functionally or physically distributed and integrated in any unit depending on various loads, usage conditions, etc.

また、上記してきた各実施形態は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Furthermore, the above-mentioned embodiments can be combined as appropriate to the extent that the processing content is not contradictory.

なお、情報処理装置１０は、スペル訂正を行うモデルを構築するために、上述した処理を各種他の処理と組みあわせて実施してもよい。例えば、情報処理装置１０は、訂正前文字列として「しょっぷＹＹ港町」と訂正後文字列として「ショップＹＹ港町」といったスペル訂正データを取得する。かかる処理は、例えば、入力された検索クエリのログと、あらかじめ作成されたデータベースとを比較し、異なる文字の数、発音、入力時に隣接するキーボードの数等が所定の閾値以上類似する文字列のペアを、訂正前文字列および訂正後文字列として抽出することにより取得される。情報処理装置１０は、このようなスペル訂正データに対して、上述したモデルによりスペース区切りを追加する。例えば、情報処理装置１０は、スペル訂正データの訂正後文字列「ショップＹＹ港町」を「ショップＹＹ港町」というデータに変換し、「しょっぷＹＹ港町」を「ショップＹＹ港町」といった文字列に訂正する訂正モデルを学習してもよい。すなわち、情報処理装置１０は、生成したモデルを用いて、訂正モデルの学習に用いる訂正後文字列にスペース区切りを入力し、訂正前文字列から訂正後文字列に訂正を行う訂正モデルを生成してもよい。 The information processing device 10 may combine the above-described process with various other processes to build a model for spelling correction. For example, the information processing device 10 acquires spelling correction data such as "shop YY port town" as an uncorrected string and "shop YY port town" as a corrected string. This process is performed, for example, by comparing the log of the input search query with a pre-created database and extracting pairs of strings that are similar enough to or exceed a predetermined threshold in terms of the number of different characters, pronunciation, number of adjacent keyboard keys used during input, etc., as the uncorrected string and the corrected string. The information processing device 10 adds space separators to such spelling correction data using the above-described model. For example, the information processing device 10 may convert the corrected string "shop YY port town" in the spelling correction data into data such as "shop YY port town" and train a correction model that corrects "shop YY port town" to a string such as "shop YY port town." That is, the information processing device 10 may use the generated model to input a space delimiter into the corrected string used to train the correction model, and generate a correction model that corrects the pre-correction string to the corrected string.

〔６．効果〕
上述してきたように、実施形態に係る情報処理装置１０は、生成部４１と、分類部４２と、特定部４３と、学習部４４とを有する。生成部４１は、利用者が入力した検索クエリに含まれるスペース区切りを削除することで、当該検索クエリが示す文字列を生成する。また、生成部４１は、さらに、検索クエリに含まれる所定の文字を削除することで、当該検索クエリが示す文字列を生成する。分類部４２は、生成部４１により生成された文字列に基づいて、検索クエリを分類する。また、分類部４２は、一の検索クエリが複数のグループに分類可能である場合、一の検索クエリが入力された回数と、当該複数のグループに分類される各検索クエリが入力された回数とに基づいて、一の検索クエリを分類する。特定部４３は、同一のグループに分類された各検索クエリが入力された回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が最も多い検索クエリのスペース区切りの位置に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。学習部４４は、特定部４３により特定された文字列におけるスペース区切りの位置に基づいて、文字列が入力された場合に、スペースにより区切られた文字列を出力するモデルの学習を行う。また、学習部４４は、特定部４３により特定された文字列におけるスペース区切りの位置に基づいて、文字列が入力された場合に、スペース区切りを削除した文字列を出力するモデルの学習を行う。また、学習部４４は、さらに、モデルにより出力された文字列を用いた検索結果に関する情報に基づいて、モデルの学習を行う。 6. Effects
As described above, the information processing device 10 according to the embodiment includes a generation unit 41, a classification unit 42, an identification unit 43, and a learning unit 44. The generation unit 41 generates a character string indicated by a search query entered by a user by deleting space separators included in the search query. The generation unit 41 also generates a character string indicated by the search query by deleting predetermined characters included in the search query. The classification unit 42 classifies the search query based on the character string generated by the generation unit 41. If a search query can be classified into multiple groups, the classification unit 42 classifies the search query based on the number of times the search query was entered and the number of times each search query classified into the multiple groups was entered. The identification unit 43 identifies the position of a space separator in a character string generated from each search query based on the number of times each search query classified into the same group was entered. The identification unit 43 also identifies the position of a space separator in a character string generated from each search query based on the position of a space separator in the search query that has been entered the most frequently among the search queries classified into the same group. The learning unit 44 trains a model that outputs a character string separated by spaces when a character string is input, based on the positions of space separators in the character string identified by the identification unit 43. The learning unit 44 also trains a model that outputs a character string from which the space separators have been deleted when a character string is input, based on the positions of space separators in the character string identified by the identification unit 43. The learning unit 44 also trains the model based on information related to search results using the character string output by the model.

これにより、実施形態に係る情報処理装置１０は、同一のグループに分類された各検索クエリのうち、利用者により最も多く入力された検索クエリに基づき、当該各検索クエリが示す文字列における、スペース区切りの位置を特定することができるため、検索クエリの適切なスペース区切りの位置を特定することができる。 As a result, the information processing device 10 according to the embodiment can identify the position of space separators in the character strings indicated by each search query based on the search query most frequently entered by users among the search queries classified into the same group, thereby identifying the appropriate position of space separators in the search query.

また、実施形態に係る情報処理装置１０において、例えば、特定部４３は、同一のグループに分類された各検索クエリが入力された割合であって、同一のグループ内での割合に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、同一のグループに分類された各検索クエリを入力したブラウザの数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、さらに、同一のグループに分類された各検索クエリによる検索結果を示すコンテンツが選択された回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、さらに、同一のグループに分類された各検索クエリに対応する広告コンテンツのコンバージョンの回数に基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が所定の閾値以上である検索クエリが複数存在する場合、入力された回数が所定の閾値以上である検索クエリのうち、スペース区切りの数が最も多い検索クエリに基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が所定の閾値以上である検索クエリが複数存在する場合、入力された回数が所定の閾値以上である検索クエリのうち、スペース区切りの数が最も少ない検索クエリに基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。また、特定部４３は、同一のグループに分類された各検索クエリのうち、入力された回数が所定の閾値以上である検索クエリが複数存在する場合、入力された回数が所定の閾値以上である検索クエリのうち、スペースにより区切られた各部分文字列が入力された回数に基づき抽出される検索クエリに基づいて、当該各検索クエリから生成された文字列におけるスペース区切りの位置を特定する。 In the information processing device 10 according to the embodiment, for example, the identification unit 43 identifies the position of space separators in the character string generated from each search query based on the proportion of search queries entered within the same group. The identification unit 43 also identifies the position of space separators in the character string generated from each search query based on the number of browsers that entered each search query entered within the same group. The identification unit 43 further identifies the position of space separators in the character string generated from each search query based on the number of times that content indicating search results for each search query entered within the same group was selected. The identification unit 43 also identifies the position of space separators in the character string generated from each search query based on the number of conversions of advertising content corresponding to each search query entered within the same group. When there are multiple search queries entered within the same group that have been entered a predetermined number of times or more, the identification unit 43 identifies the position of space separators in the character string generated from each search query based on the search query with the greatest number of space separators among the search queries entered a predetermined number of times or more. Furthermore, when there are multiple search queries that have been input a predetermined number of times or more, among the search queries classified into the same group, the identification unit 43 identifies the positions of space separators in the character string generated from each search query based on the search query with the fewest number of space separators among the search queries that have been input a predetermined number of times or more. Furthermore, when there are multiple search queries that have been input a predetermined number of times or more, among the search queries classified into the same group, the identification unit 43 identifies the positions of space separators in the character string generated from each search query based on the search queries extracted based on the number of times each substring separated by a space was input, among the search queries that have been input a predetermined number of times or more.

これにより、実施形態に係る情報処理装置１０は、各種の指標を用いて、検索クエリのスペース区切りの位置を特定することができるため、スペース区切りの位置の特定の精度を向上させることができる。 As a result, the information processing device 10 according to the embodiment can use various indices to identify the positions of space separators in a search query, thereby improving the accuracy of identifying the positions of space separators.

〔７．ハードウェア構成〕
また、上述してきた各実施形態に係る情報処理装置１０は、例えば、図６に示すような構成のコンピュータ１０００によって実現される。以下、情報処理装置１０を例に挙げて説明する。図６は、情報処理装置１０の機能を実現するコンピュータの一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ１１００、ＲＯＭ１２００、ＲＡＭ１３００、ＨＤＤ１４００、通信インターフェイス（Ｉ／Ｆ）１５００、入出力インターフェイス（Ｉ／Ｆ）１６００、及びメディアインターフェイス（Ｉ／Ｆ）１７００を有する。 7. Hardware Configuration
The information processing device 10 according to each of the above-described embodiments is realized, for example, by a computer 1000 configured as shown in Fig. 6. The following description will be given using the information processing device 10 as an example. Fig. 6 is a hardware configuration diagram showing an example of a computer that realizes the functions of the information processing device 10. The computer 1000 has a CPU 1100, a ROM 1200, a RAM 1300, a HDD 1400, a communication interface (I/F) 1500, an input/output interface (I/F) 1600, and a media interface (I/F) 1700.

ＣＰＵ１１００は、ＲＯＭ１２００又はＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１２００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 The CPU 1100 operates based on programs stored in the ROM 1200 or HDD 1400, and controls each component. The ROM 1200 stores a boot program executed by the CPU 1100 when the computer 1000 starts up, as well as programs that depend on the computer 1000's hardware.

ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラム、及び、かかるプログラムによって使用されるデータ等を記憶する。通信インターフェイス１５００は、通信網５００（実施形態のネットワークＮに対応する）を介して他の機器からデータを受信してＣＰＵ１１００へ送り、また、通信網５００を介してＣＰＵ１１００が生成したデータを他の機器へ送信する。 HDD 1400 stores programs executed by CPU 1100 and data used by such programs. Communication interface 1500 receives data from other devices via communication network 500 (corresponding to network N in the embodiment) and sends it to CPU 1100, and also transmits data generated by CPU 1100 to other devices via communication network 500.

ＣＰＵ１１００は、入出力インターフェイス１６００を介して、ディスプレイやプリンタ等の出力装置、及び、キーボードやマウス等の入力装置を制御する。ＣＰＵ１１００は、入出力インターフェイス１６００を介して、入力装置からデータを取得する。また、ＣＰＵ１１００は、入出力インターフェイス１６００を介して生成したデータを出力装置へ出力する。 The CPU 1100 controls output devices such as displays and printers, and input devices such as keyboards and mice, via the input/output interface 1600. The CPU 1100 acquires data from input devices via the input/output interface 1600. The CPU 1100 also outputs data generated via the input/output interface 1600 to output devices.

メディアインターフェイス１７００は、記録媒体１８００に格納されたプログラム又はデータを読み取り、ＲＡＭ１３００を介してＣＰＵ１１００に提供する。ＣＰＵ１１００は、かかるプログラムを、メディアインターフェイス１７００を介して記録媒体１８００からＲＡＭ１３００上にロードし、ロードしたプログラムを実行する。記録媒体１８００は、例えばＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。 Media interface 1700 reads programs or data stored on recording medium 1800 and provides them to CPU 1100 via RAM 1300. CPU 1100 loads the programs from recording medium 1800 onto RAM 1300 via media interface 1700 and executes the loaded programs. Recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or a PD (Phase Change Rewritable Disc), a magneto-optical recording medium such as an MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory.

例えば、コンピュータ１０００が情報処理装置１０として機能する場合、コンピュータ１０００のＣＰＵ１１００は、ＲＡＭ１３００上にロードされたプログラムを実行することにより、制御部４０の機能を実現する。また、ＨＤＤ１４００には、情報処理装置１０の記憶装置内の各データが格納される。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムを記録媒体１８００から読み取って実行するが、他の例として、他の装置から所定の通信網を介してこれらのプログラムを取得してもよい。 For example, when the computer 1000 functions as the information processing device 10, the CPU 1100 of the computer 1000 executes programs loaded onto the RAM 1300 to realize the functions of the control unit 40. In addition, the HDD 1400 stores various data within the storage device of the information processing device 10. The CPU 1100 of the computer 1000 reads and executes these programs from the recording medium 1800, but as another example, these programs may be obtained from another device via a specified communications network.

〔８．その他〕
以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 [8. Other]
Although some of the embodiments of the present application have been described in detail above with reference to the drawings, these are merely examples, and the present invention can be implemented in other forms that include the embodiments described in the Disclosure of the Invention section and that have been modified and improved in various ways based on the knowledge of those skilled in the art.

また、上述した情報処理装置１０は、機能によっては外部のプラットフォーム等をＡＰＩ（Application Programming Interface）やネットワークコンピューティングなどで呼び出して実現するなど、構成は柔軟に変更できる。 Furthermore, the configuration of the information processing device 10 described above can be flexibly changed, such as by calling external platforms using APIs (Application Programming Interfaces) or network computing, depending on the function.

また、特許請求の範囲に記載した「部」は、「手段」や「回路」などに読み替えることができる。例えば、生成部は、生成手段や生成回路に読み替えることができる。 Furthermore, the term "unit" in the claims can be read as "means" or "circuit." For example, a generation unit can be read as a generation means or a generation circuit.

１０情報処理装置
２０通信部
３０記憶部
３１検索クエリ情報データベース
３２グループ情報データベース
３３モデルデータベース
４０制御部
４１生成部
４２分類部
４３特定部
４４学習部
１００利用者端末 REFERENCE SIGNS LIST 10 Information processing device 20 Communication unit 30 Storage unit 31 Search query information database 32 Group information database 33 Model database 40 Control unit 41 Generation unit 42 Classification unit 43 Identification unit 44 Learning unit 100 User terminal

Claims

a generation unit that generates a character string indicated by a search query input by a user by deleting space separators included in the search query;
a classification unit that classifies the search queries based on the character strings generated by the generation unit;
an identification unit that identifies the position of a space delimiter in the character string generated from each search query classified into the same group based on the number of times each search query has been input ,
The identification unit
Furthermore, the position of the space delimiter in the character string generated from each search query is identified based on the number of times that content representing search results for each search query classified into the same group has been selected.
1. An information processing device comprising:

The identification unit
The information processing device according to claim 1, characterized in that the position of a space separator in the character string generated from each search query is identified based on the ratio of input search queries classified into the same group, the ratio within the same group.

The identification unit
The information processing device according to claim 1 , further comprising: identifying a position of a space delimiter in the character string generated from each search query classified into the same group based on the number of browsers that input each search query.

The identification unit
The information processing device according to claim 1 , further comprising: identifying a position of a space delimiter in the character string generated from each search query based on the number of conversions of advertising content corresponding to each search query classified into the same group.

The identification unit
The information processing device according to claim 1, characterized in that the position of a space separator in the character string generated from each search query is identified based on the position of a space separator in the search query that has been entered most frequently among the search queries classified into the same group.

The identification unit
2. The information processing device according to claim 1, wherein, when there are multiple search queries among the search queries classified into the same group that have been input a number of times equal to or greater than a predetermined threshold, the position of space separators in the character strings generated from each of the search queries is identified based on the search query that has the largest number of space separators among the search queries that have been input a number of times equal to or greater than the predetermined threshold.

The identification unit
2. The information processing device according to claim 1, wherein, when there are multiple search queries that have been entered a predetermined number of times or more among the search queries classified into the same group, the position of space separators in the character strings generated from each of the search queries is identified based on the search query that has the fewest number of space separators among the search queries that have been entered a predetermined number of times or more.

The identification unit
2. The information processing device according to claim 1, wherein, when there are multiple search queries among the search queries classified into the same group that have been input a predetermined threshold or more in number of times, the information processing device identifies the position of space separators in the character strings generated from each of the search queries based on search queries extracted based on the number of times each substring separated by a space has been input from the search queries that have been input a predetermined threshold or more in number of times.

2. The information processing device according to claim 1, further comprising: a first learning unit configured to learn a model that outputs a character string separated by spaces when a character string is input, based on the positions of space separators in the character string identified by the identification unit.

The first learning unit
The information processing apparatus according to claim 9 , further comprising: learning the model based on information about a search result using a character string output by the model.

2. The information processing device according to claim 1, further comprising: a second learning unit configured to learn a model that outputs a character string from which space separators have been deleted when a character string is input, based on the positions of space separators in the character string identified by the identification unit.

The second learning unit
The information processing apparatus according to claim 11 , further comprising: learning the model based on information about a search result using a character string output by the model.

The generation unit
The information processing device according to claim 1 , further comprising: a processor configured to generate a character string indicated by a search query by deleting predetermined characters included in the search query.

1. A computer-implemented information processing method, comprising:
a generating step of generating a character string indicated by the search query by deleting space separators included in the search query entered by the user;
a classification step of classifying the search queries based on the character strings generated by the generation step;
and a specifying step of specifying the positions of space delimiters in the character strings generated from each of the search queries classified into the same group based on the number of times each of the search queries has been input,
The identifying step includes:
Furthermore, the position of the space delimiter in the character string generated from each search query is identified based on the number of times that content representing search results for each search query classified into the same group has been selected.
1. An information processing method comprising:

a generation step of generating a character string indicated by a search query input by a user by deleting space separators included in the search query;
a classification step of classifying the search queries based on the character strings generated by the generation step;
and a step of identifying the positions of space delimiters in the character strings generated from each search query classified into the same group based on the number of times each search query has been input;
The identification procedure includes:
Furthermore, the position of the space delimiter in the character string generated from each search query is identified based on the number of times that content representing search results for each search query classified into the same group has been selected.
An information processing program characterized by :