JP4307220B2

JP4307220B2 - Content recommendation target user selection apparatus and method, program, and content recommendation system

Info

Publication number: JP4307220B2
Application number: JP2003377206A
Authority: JP
Inventors: 俊介土井; 由紀吉田; 豪東野
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2003-11-06
Filing date: 2003-11-06
Publication date: 2009-08-05
Anticipated expiration: 2023-11-06
Also published as: JP2005141486A

Description

本発明は、あるコンテンツをユーザに推薦する際に、その推薦対象となるユーザ（被推薦ユーザ）を自動的に選出する技術に係わり、特に、利用実績のない新規コンテンツの被推薦ユーザを効率的に選出するのに好適な技術に関するものである。 The present invention relates to a technique for automatically selecting a user (recommended user) to be recommended when recommending a certain content to a user, and in particular, efficiently recommending a recommended user of a new content that has not been used. The present invention relates to a technique suitable for selection.

従来、インターネット等において、あるコンテンツをユーザに推薦する際に、その推薦対象となる被推薦ユーザを自動的に選出する技術としては、例えば、非特許文献１に記載の技術（一般的にcontent-based filteringと呼ばれている）がある。 Conventionally, as a technique for automatically selecting a recommended user to be recommended when recommending a certain content to a user on the Internet or the like, for example, the technique described in Non-Patent Document 1 (generally content- based filtering).

この技術は、あるユーザの嗜好情報と類似している場合に、コンテンツを推薦するものであり、この技術では、被推薦ユーザへのアンケート結果やこれまでのコンテンツ利用実績などから当該被推薦ユーザの嗜好情報を生成し、推薦対象コンテンツの説明文や説明キーワードと、被推薦ユーザの嗜好情報との類似度の大きさから、推薦対象コンテンツを当該被推薦ユーザに推薦するか否かを判別する。 This technology recommends content when it is similar to the preference information of a certain user. In this technology, based on the results of questionnaires to the recommended user and the past use of the content, Preference information is generated, and it is determined whether or not the recommendation target content is recommended to the recommended user based on the degree of similarity between the description text or the description keyword of the recommendation target content and the preference information of the recommended user.

また、例えば、非特許文献２に記載の技術（一般的にcollaborative filtering（協調フィルタリング方式）と呼ばれている）では、複数のユーザの利用履歴を用いて、被推薦ユーザと利用傾向が類似している他のユーザを協調ユーザとして選出し、協調ユーザに利用実績があり、被推薦ユーザに利用実績が無いコンテンツを推薦する。 Further, for example, in the technique described in Non-Patent Document 2 (generally called collaborative filtering (collaborative filtering method)), the usage tendency of a plurality of users is similar to that of the recommended user. Other users who are present are selected as collaborative users, and the collaborative users have a usage record and the recommended users have no usage record.

特許文献１（「レコメンドエンジン、レコメンド方法、レコメンドプログラム」）においては、対象ユーザの協調ユーザを選出し、選出した協調ユーザが利用していて対象ユーザが利用していないコンテンツを対象ユーザに推薦し、推薦できない場合は利用ランキングを表示する技術、すなわち、前記非特許文献２の技術を用いたレコメンド（推薦）技術が記載されている In Patent Document 1 (“Recommendation Engine, Recommendation Method, Recommendation Program”), a target user's collaborative user is selected, and content that the selected collaborative user uses but the target user does not use is recommended to the target user. In this case, a technique for displaying a usage ranking when recommendation is not possible, that is, a recommendation (recommendation) technique using the technique of Non-Patent Document 2 is described.

また、特許文献２（「情報推薦サーバ装置」）においては、被推薦ユーザとプロファイルが類似した協調ユーザを選出し、選出した協調ユーザのプロファイルと推薦対象コンテンツとの類似性によって推薦を決定する技術、および、ユーザには複数のプロファイルを持たせ、協調ユーザのプロファイルと類似していないプロファイルを用い、そのプロファイルと推薦対象コンテンツとが類似している場合、当該推薦対象コンテンツを被推薦ユーザに推薦する技術が記載されている。 Patent Document 2 (“Information Recommendation Server Device”) selects a collaborative user whose profile is similar to that of the recommended user, and determines a recommendation based on the similarity between the profile of the selected collaborative user and the recommended content. If the user has a plurality of profiles and uses a profile that is not similar to the profile of the collaborative user, and the profile is similar to the recommendation target content, the recommendation target content is recommended to the recommended user. The technology to do is described.

この特許文献２に記載の技術の特徴は、コンテンツやユーザのプロファイルはベクトルで記述することが可能であるため、協調ユーザが利用したことのないコンテンツであっても、被推薦ユーザに推薦することが可能なことである。この技術では、前記非特許文献１および非特許文献２の両方の技術を用いている。 The feature of the technique described in Patent Document 2 is that content and user profiles can be described by vectors, so even content that has not been used by a collaborative user is recommended to a recommended user. Is possible. In this technique, the techniques of both Non-Patent Document 1 and Non-Patent Document 2 are used.

しかし、非特許文献１に記載の技術（「content based filtering」）の場合、被推薦ユーザが回答したアンケート結果や利用実績と類似したコンテンツのみが推薦されがちになるとの第１の問題点がある。 However, in the case of the technique described in Non-Patent Document 1 (“content based filtering”), there is a first problem that only content similar to a questionnaire result or usage record answered by the recommended user tends to be recommended. .

また、非特許文献２に記載の技術（「collaborative filtering」）の場合、複数のユーザの利用履歴を用いて、被推薦ユーザと利用傾向が類似している他のユーザを協調ユーザとして選出し、協調ユーザに利用実績があり、被推薦ユーザに利用実績が無いコンテンツを推薦する技術であるため、協調ユーザも利用実績の無いコンテンツは推薦できないとの第２の問題点がある。さらに、誰も利用実績の無い、新しいコンテンツの場合（例えば、新しいコンテンツを一斉に適したユーザに対してリコメンドする場合）は推薦できないといった第３の問題点がある。 In the case of the technique described in Non-Patent Document 2 (“collaborative filtering”), other users who have similar usage trends to the recommended user are selected as cooperative users using the usage histories of a plurality of users. Since this is a technique for recommending content that has been used by a collaborative user and not used by a recommended user, there is a second problem in that content that has not been used by a collaborative user cannot be recommended. Furthermore, there is a third problem in that nobody has a track record of use, and new content cannot be recommended (for example, when new content is recommended to users who are suitable all at once).

また、多くのユーザ中から、ある推薦コンテンツの推薦対象ユーザを選出する場合、前記非特許文献１および非特許文献２のいずれの技術においても、処理量がユーザ数に比例して大きくなる傾向があるという第４の問題点がある。 In addition, when selecting a recommendation target user of a certain recommended content from among many users, the processing amount tends to increase in proportion to the number of users in both the non-patent document 1 and the non-patent document 2. There is a fourth problem.

その結果、特許文献１に記載の技術の場合、前記第２，第３，第４の問題点があり、特許文献２に記載の技術の場合、前記第４の問題点がある。 As a result, the technique described in Patent Document 1 has the second, third, and fourth problems, and the technique described in Patent Document 2 has the fourth problem.

特開２００２−３３４２５７号公報JP 2002-334257 A 特開２００２−２１５６６５号公報JP 2002-215665 A Marko Balabonavic and Yoav Shoham．"A Content-Based，Collaborative Recommendation"，Communication of ACM, Vol.40，No.3，PP.66-72(1997)Marko Balabonavic and Yoav Shoham. "A Content-Based, Collaborative Recommendation", Communication of ACM, Vol.40, No.3, PP.66-72 (1997) P．Resnick，N．lacovou，M．Suchak，P．Bergstrom，and J．Riedl．"GroupLens：Open Architecture for Collaborative Filtering of Netnews"，Conference on Computer Supported Cooperative Work，PP．175-186(1994)P. Resnick, N. lacovou, M.M. Suchak, P. Bergstrom, and J.M. Riedl. "GroupLens: Open Architecture for Collaborative Filtering of Netnews", Conference on Computer Supported Cooperative Work, PP. 175-186 (1994)

解決しようとする問題点は、従来の技術では、（１）被推薦ユーザが回答したアンケート結果や利用実績と類似したコンテンツのみが推薦されがちになる点と、（２）協調ユーザも利用実績の無いコンテンツは推薦できない点、（３）誰も利用実績の無い、新しいコンテンツの場合は推薦できない点、（４）多くのユーザ中から、ある推薦コンテンツの推薦対象ユーザを選出する場合、処理量がユーザ数に比例して大きくなる傾向がある点である。 The problems to be solved are that, in the conventional technology, (1) only content similar to the questionnaire results and usage results answered by the recommended users tends to be recommended, and (2) the cooperative users also have usage results. No content can be recommended, (3) Nobody has a track record of use, new content cannot be recommended, and (4) When selecting recommended users of recommended content from many users, the amount of processing is large. This is a point that tends to increase in proportion to the number of users.

上記目的を達成するため、本発明では、スコア生成手段（スコア生成ブロック２）と学習手段（学習ブロック３）および選出手段（選出ブロック４）の３つの機能を用いてコンテンツの推薦対象ユーザを選出する。 In order to achieve the above object, in the present invention, content recommendation target users are selected using the three functions of score generation means (score generation block 2), learning means (learning block 3) and selection means (selection block 4). To do.

すなわち、図１に示すように、まず、スコア生成ブロック２において、操作情報入力手段２ａにより、操作情報１０（ユーザの操作履歴やアンケート結果等の情報）を入力し、コンテンツ利用ログ格納部２ｂで順次にログ情報として記憶し、このログ情報に対して書式変換手段２ｃによってコンテンツの利用履歴に対する重み付けや正規化、データの書式変換等を実施し、ユーザ毎にコンテンツのメタデータであるキーワードそれぞれに対してスコアが記録されたユーザ・キーワード間スコアデータを生成し、ユーザ・キーワード間スコア格納部２ｄで記憶装置に格納する。 That is, as shown in FIG. 1, first, in the score generation block 2, the operation information input means 2a is used to input operation information 10 (information such as user operation history and questionnaire results), and the content usage log storage unit 2b. The log information is sequentially stored as log information, and the format conversion means 2c performs weighting and normalization of the content usage history, data format conversion, and the like for each log keyword. On the other hand, score data between the user and the keyword in which the score is recorded is generated and stored in the storage device by the score storing unit 2d between the user and the keyword.

次に、学習ブロック３において、キーワード関係グループ抽出手段３ａにより、ユーザ・キーワード間スコア格納部２ｄで格納したユーザ・キーワード間スコアデータを読み込み、関係が得られたキーワードをグループ化して抽出し、その結果をキーワード関係グループ抽出結果格納部３ｂで記憶装置に格納する。さらに、グループ別ユーザ別スコア算出手段３ｃにより、キーワード関係グループ抽出結果格納部３ｂで格納したキーワード関係グループ抽出結果とユーザ・キーワード間スコア格納部２ｄで格納したユーザ・キーワード間スコアデータを基に、キーワード関係グループ毎に各ユーザのスコアをしかるべき演算を行って算出し、その結果を、グループ別ユーザ別スコア格納部３ｄにより記憶装置に格納する。 Next, in the learning block 3, the keyword relation group extracting means 3a reads the user / keyword score data stored in the user / keyword score storage unit 2d, and groups and extracts the keywords from which the relation is obtained, The result is stored in the storage device by the keyword relation group extraction result storage unit 3b. Furthermore, based on the keyword-related group extraction result stored in the keyword-related group extraction result storage unit 3b and the user-keyword score data stored in the user-keyword score storage unit 2d by the group-specific user score calculation means 3c, The score of each user is calculated by performing an appropriate calculation for each keyword-related group, and the result is stored in the storage device by the group-specific user-specific score storage unit 3d.

そして、選出ブロック４においては、コンテンツキーワード情報入力手段４ａにより、推薦対象コンテンツのメタデータであるキーワードの集合を入力し、ユーザ別スコア算出手段４ｂにより、キーワード関係グループ抽出結果格納部３ｂとグループ別ユーザ別スコア格納部３ｄを参照し、しかるべき演算と処理を行ってユーザ別のスコアを算出する。その後、推薦対象ユーザ情報出力手段４ｃにより、選出条件（４ｄ）に合致するスコアを有する単数もしくは複数のユーザを推薦対象ユーザ１２（被推薦ユーザ）として選出する。 In the selection block 4, a set of keywords, which are metadata of recommended content, is input by the content keyword information input unit 4a, and the keyword-related group extraction result storage unit 3b and the group-specific group are calculated by the user-specific score calculation unit 4b. The user-specific score is stored by referring to the user-specific score storage unit 3d and performing appropriate calculations and processing. Thereafter, the recommendation target user information output means 4c selects one or a plurality of users having scores that match the selection condition (4d) as the recommendation target user 12 (recommended user).

本発明によれば、コンテンツの利用履歴（ユーザの操作履歴やアンケート結果等の情報）を基にして生成した「ユーザ・キーワード間スコアデータ」を基にしてキーワード間で関係があるグループ（キーワード関係グループ）を抽出し、その「キーワード関係グループ」における各ユーザのスコアの大きさからコンテンツの被推薦ユーザの選出を行っている。つまりは、他のユーザの動向を反映した「キーワード関係グループ」を用いてコンテンツの推薦を行っているので、第１の問題点（被推薦ユーザが回答したアンケート結果や利用実績と類似したコンテンツのみが推薦されがちになる）は解決される。 According to the present invention, a group (keyword relationship) having a relationship between keywords based on “user / keyword score data” generated based on content usage history (information such as user operation history and questionnaire results). Group) is extracted, and recommended users of content are selected based on the score of each user in the “keyword related group”. In other words, content recommendation is performed using a “keyword-related group” that reflects the trends of other users, so the first problem (only content similar to the questionnaire results and usage results answered by the recommended user) Is likely to be recommended) is resolved.

また、他のユーザの動向を反映したキーワード関係グループを用いて、コンテンツの被推薦ユーザの選出を行っており、協調ユーザの選出は行わない。そのため、選出した協調ユーザの履歴を基にリコメンドを行うために発生する第２問題点（協調ユーザの利用実績が無いコンテンツは推薦できない）と第３の問題点（誰も利用実績の無い、新しいコンテンツの場合は推薦できない）は解決される。 Moreover, the recommended user of the content is selected using a keyword-related group reflecting the trends of other users, and the cooperative user is not selected. Therefore, the second problem that occurs to make recommendations based on the history of the selected collaborative user (content that does not have a collaborative user's usage record cannot be recommended) and the third problem (nobody has a use record, new It cannot be recommended for content).

また、本発明では、学習ブロック（３）において、予めキーワード関係グループ毎に全ユーザのスコアを算出して保持しており、ユーザそれぞれについて評価する必要がない。すなわち、コンテンツを推薦する対象ユーザ（被推薦ユーザ）を選出する際に行う演算は、予め定めた数のキーワード関係グループとの類似度算出処理である。そのため、非特許文献１に記載の技術（「content based filtering」）のように全ユーザとの類似度の算出演算を行う場合や、非特許文献２に記載の技術（「collaborative filtering」）のように全ユーザに対して類似ユーザの選出処理を行う場合と比較して、ユーザ数が予め定めた数のキーワード関係グループ数より多い場合は、本発明における算出処理量のほうが少なくなる。さらに、本発明ではユーザ数に比例して処理量は大きくならない。その結果、第４の問題点（多くのユーザ中から、ある推薦コンテンツの推薦対象ユーザを選出する場合、処理量がユーザ数に比例して大きくなる傾向がある）は解決される。 In the present invention, in the learning block (3), the scores of all users are calculated and held in advance for each keyword relation group, and it is not necessary to evaluate each user. That is, the calculation performed when selecting a target user (recommended user) who recommends content is a similarity calculation process with a predetermined number of keyword-related groups. Therefore, as in the technique described in Non-Patent Document 1 (“content based filtering”), the calculation of similarity with all users is performed, or the technique described in Non-Patent Document 2 (“collaborative filtering”). When the number of users is larger than the predetermined number of keyword-related groups as compared to the case where similar users are selected for all users, the amount of calculation processing in the present invention is smaller. Furthermore, in the present invention, the amount of processing does not increase in proportion to the number of users. As a result, the fourth problem (when a recommended target user of a recommended content is selected from many users, the processing amount tends to increase in proportion to the number of users) is solved.

以下、図を用いて本発明を実施するための最良の形態例を説明する。 The best mode for carrying out the present invention will be described below with reference to the drawings.

図１は、本発明に係わるコンテンツ推薦対象ユーザ選出装置の構成例を示すブロック図であり、図２は、本発明に係わるコンテンツ推薦対象ユーザ選出装置が組み込まれたコンテンツ推薦システムの構成例を示すブロック図、図３は、図２におけるコンテンツ推薦システムの詳細構成を示すブロック図、図４は、図１におけるスコア生成ブロックの操作情報入力手段により入力される操作情報の具体例を示す説明図、図５は、図１におけるスコア生成ブロックのコンテンツ利用ログ格納部で格納されるコンテンツ利用ログの具体例を示す説明図、図６は、図１におけるスコア生成ブロックの書式変換手段により出力されるユーザ・キーワード間スコアの具体例を示す説明図、図７は、図１における学習ブロックの処理動作例を示すフローチャート、図８は、図１における学習ブロックのキーワード関係グループ抽出手段により抽出されキーワード関係グループ抽出結果格納部で格納されるキーワード関係グループ生成結果の具体例を示す説明図、図９は、図１における学習ブロックのグループ別ユーザ別スコア算出手段により算出されグループ別ユーザ別スコア格納部で格納されるグループ別ユーザ別スコア生成結果の具体例を示す説明図、図１０は、図１における選出ブロックの処理動作例を示すフローチャート、図１１は、図３におけるコンテンツ推薦システムのコンテンツ配信制御装置に入力される推薦対象コンテンツ情報の具体例を示す説明図、図１２は、本発明に係わるコンテンツ推薦対象ユーザ選出装置に入力されるコンテンツキーワード情報の具体例を示す説明図、図１３は、図１における選出ブロックの他の処理動作例を示すフローチャートである。 FIG. 1 is a block diagram showing a configuration example of a content recommendation target user selection device according to the present invention, and FIG. 2 shows a configuration example of a content recommendation system incorporating a content recommendation target user selection device according to the present invention. FIG. 3 is a block diagram showing the detailed configuration of the content recommendation system in FIG. 2, FIG. 4 is an explanatory diagram showing a specific example of operation information input by the operation information input means of the score generation block in FIG. FIG. 5 is an explanatory diagram showing a specific example of the content usage log stored in the content usage log storage unit of the score generation block in FIG. 1, and FIG. 6 is a user output by the format conversion means of the score generation block in FIG. An explanatory diagram showing a specific example of the score between keywords, FIG. 7 is a flowchart showing an example of processing operation of the learning block in FIG. FIG. 8 is an explanatory view showing a specific example of the keyword relation group generation result extracted by the keyword relation group extraction means of the learning block in FIG. 1 and stored in the keyword relation group extraction result storage unit, and FIG. FIG. 10 is an explanatory diagram showing a specific example of the group-specific user score generation result calculated by the group-specific user score calculation means of the learning block and stored in the group-specific user score storage unit, FIG. FIG. 11 is an explanatory diagram showing a specific example of recommendation target content information input to the content distribution control device of the content recommendation system in FIG. 3, and FIG. 12 is a content recommendation target user according to the present invention. Explanatory drawing which shows the specific example of the content keyword information input into a selection apparatus, 13 is a flow chart showing another processing operation example of the select block in Figure 1.

まず、図２および図３に基づき、コンテンツ推薦システムについて説明する。図２に示すように、本例のコンテンツ推薦システムは、コンテンツ推薦対象ユーザ選出装置１、コンテンツ配信制御装置２０、コンテンツデータベース（図中、「コンテンツＤＢ」と記載）２１、ネットワーク２２、端末２３ａ〜２３ｃからなる。 First, the content recommendation system will be described with reference to FIGS. As shown in FIG. 2, the content recommendation system of this example includes a content recommendation target user selection device 1, a content distribution control device 20, a content database (denoted as “content DB” in the figure) 21, a network 22, and terminals 23a to 23a. 23c.

コンテンツ推薦対象ユーザ選出装置１、コンテンツ配信制御装置２０、端末２３ａ〜２３ｃのそれぞれは、ＣＰＵ（Central Processing Unit）や主メモリ、表示装置、入力装置、外部記憶装置等を有するコンピュータ構成からなり、光ディスク駆動装置等を介してＣＤ−ＲＯＭ等の記憶媒体に記録されたプログラムやデータを外部記憶装置内にインストールした後、この外部記憶装置から主メモリに読み込みＣＰＵで処理することにより、各処理部の機能を実行する。 Each of the content recommendation target user selection device 1, the content distribution control device 20, and the terminals 23a to 23c has a computer configuration including a CPU (Central Processing Unit), a main memory, a display device, an input device, an external storage device, and the like. After installing a program or data recorded in a storage medium such as a CD-ROM via a drive device or the like into the external storage device, the program is read from the external storage device into the main memory and processed by the CPU, so that each processing unit Perform the function.

本例のコンテンツ推薦システムは、コンテンツデータベース２１に格納されているコンテンツを、コンテンツ配信制御装置２０が各端末２３ａ〜２３ｃに送信するものであり、コンテンツ推薦対象ユーザ選出装置１は、コンテンツ配信制御装置２０がコンテンツ情報を送信する対象の端末（または推薦対象ユーザ）を選出する。 In the content recommendation system of this example, the content distribution control device 20 transmits the content stored in the content database 21 to each of the terminals 23a to 23c. The content recommendation target user selection device 1 is a content distribution control device. 20 selects a target terminal (or recommendation target user) to transmit content information.

本例では、コンテンツ推薦対象ユーザ選出装置１とコンテンツ配信制御装置２０と被推薦ユーザが利用している端末２３ａ〜２３ｃは、ネットワーク２２で接続されている。また、本例では、コンテンツデータベース２１はコンテンツ配信制御装置２０と接続されている。 In this example, the content recommendation target user selection device 1, the content distribution control device 20, and terminals 23 a to 23 c used by the recommended user are connected via a network 22. In this example, the content database 21 is connected to the content distribution control device 20.

尚、本例では、ユーザを識別するユーザＩＤと端末２３ａ〜２３ｃを識別するＩＤとは一意に対応付けされており、被推薦ユーザが判明すれば、そのユーザが使用している端末２３ａ〜２３ｃを一意に判別し、その端末２３ａ〜２３ｃに推薦対象コンテンツを推薦できるものとする。 In this example, the user ID for identifying the user and the ID for identifying the terminals 23a to 23c are uniquely associated. If the recommended user is found, the terminals 23a to 23c used by the user are identified. Is uniquely determined, and the recommendation target content can be recommended to the terminals 23a to 23c.

もし、ユーザを識別するユーザＩＤと端末２３ａ〜２３ｃを識別するＩＤとが一意に対応付けされていない場合は、別途、ユーザＩＤとそのユーザが使用している端末２３ａ〜２３ｃを識別するＩＤとを対応付けたテーブルを用意して、それを参照すればよい。 If the user ID for identifying the user and the ID for identifying the terminals 23a to 23c are not uniquely associated with each other, a user ID and an ID for identifying the terminals 23a to 23c used by the user are separately provided. It is sufficient to prepare a table in which is associated with each other and refer to it.

図３において、図２に示したコンテンツ推薦システムの詳細を示しその動作を説明する。コンテンツ推薦システムは、コンテンツ配信制御装置２０がコンテンツデータベース２１から推薦対象コンテンツ情報２４を取得し、コンテンツ推薦対象ユーザ選出装置１に対して推薦対象コンテンツ情報２４のメタデータであるキーワードの集合としてのコンテンツキーワード情報１１を送信し、それに対応してコンテンツ推薦対象ユーザ選出装置１から、推薦対象コンテンツ情報２４を送信するユーザとして適している被推薦ユーザを特定する推薦対象ユーザ情報１２を取得する。 3, details of the content recommendation system shown in FIG. 2 are shown and the operation thereof will be described. In the content recommendation system, the content distribution control device 20 acquires the recommendation target content information 24 from the content database 21, and the content as a set of keywords that are metadata of the recommendation target content information 24 for the content recommendation target user selection device 1. The keyword information 11 is transmitted, and the recommendation target user information 12 for identifying the recommended user suitable as the user who transmits the recommendation target content information 24 is acquired from the content recommendation target user selection device 1 correspondingly.

そして、コンテンツ配信制御装置２０は、コンテンツ推薦対象ユーザ選出装置１から取得した推薦対象ユーザ情報１２で特定される推薦対象ユーザ（被推薦ユーザ）が利用している例えば端末２３ａ，２３ｃに対して推薦対象コンテンツ情報２４を送信する。 Then, the content distribution control device 20 recommends, for example, the terminals 23a and 23c used by the recommendation target user (recommended user) specified by the recommendation target user information 12 acquired from the content recommendation target user selection device 1. The target content information 24 is transmitted.

コンテンツ推薦対象ユーザ選出装置１は、スコア生成ブロック２と学習ブロック３および選出ブロック４の各処理機能ブロックを有し、スコア生成ブロック２においては、端末２ｄから操作情報１０（ユーザの操作履歴やコンテンツの利用履歴やアンケート結果等の情報）を随時入力し、学習ブロック３が用いるユーザ・キーワード間スコアデータを生成してユーザ・キーワード間スコア格納部２ｄで格納する。 The content recommendation target user selection device 1 has processing function blocks of a score generation block 2, a learning block 3, and a selection block 4. In the score generation block 2, the operation information 10 (user operation history and content) is received from the terminal 2d. Information such as usage history and questionnaire results) is input as needed, and user / keyword score data used by the learning block 3 is generated and stored in the user / keyword score storage unit 2d.

学習ブロック３においては、スコア生成ブロック２で生成し格納したユーザ・キーワード間スコアデータを読み込み、関連性があるキーワードをグループ化したキーワード関係グループと、そのグループ別にユーザ毎のスコアを記録したグループ別ユーザ別スコアデータを、しかるべき契機で随時生成し、それぞれ、キーワード関係グループ抽出結果格納部３ｂ、グループ別ユーザ別スコア格納部３ｄで格納する。 In the learning block 3, the user-keyword score data generated and stored in the score generation block 2 is read, and the keyword relation group in which related keywords are grouped, and the score for each user is recorded for each group. User-specific score data is generated as needed at appropriate times, and stored in the keyword-related group extraction result storage unit 3b and the group-specific user score storage unit 3d, respectively.

選出ブロック４においては、コンテンツ情報入力手段４ａにより、コンテンツ配信制御装置２０から送信されたコンテンツキーワード情報１１を入力し、学習ブロック３におけるキーワード関係グループ抽出結果格納部３ｂとグループ別ユーザ別スコア格納部３ｄでの格納内容を参照して、推薦対象ユーザ情報出力手段４ｃによりしかるべき演算と処理によって推薦対象ユーザ（被推薦ユーザ）を選出し、選出結果を推薦対象ユーザ情報１２としてコンテンツ配信制御装置２０に対して出力する。 In the selection block 4, the content keyword information 11 transmitted from the content distribution control device 20 is input by the content information input means 4a, and the keyword-related group extraction result storage unit 3b and the group-specific score storage unit in the learning block 3 With reference to the contents stored in 3d, the recommendation target user (recommended user) is selected by the appropriate calculation and processing by the recommendation target user information output means 4c, and the selection result is used as the recommendation target user information 12, and the content distribution control apparatus 20 Output for.

このコンテンツ推薦対象ユーザ選出装置１の詳細を、図１に基づき以下に説明する。図１に示すように、コンテンツ推薦対象ユーザ選出装置１は、スコア生成ブロック２と学習ブロック３および選出ブロック４の各処理機能ブロックを有し、このスコア生成ブロック２は、操作情報入力手段２ａ、コンテンツ利用ログ格納部２ｂ、書式変換手段２ｃ、ユーザ・キーワード間スコア格納部２ｄを有し、また、学習ブロック３は、キーワード関係グループ抽出手段３ａ、キーワード関係グループ抽出結果格納部３ｂ、グループ別ユーザ別スコア算出手段３ｃ、グループ別ユーザ別スコア格納部３ｄを有し、そして、選出ブロック４は、コンテンツキーワード情報入力手段４ａ、ユーザ別スコア算出手段４ｂ、推薦対象ユーザ情報出力手段４ｃ、選出条件記憶部（図中「選出条件」と記載）４ｄを有している。 Details of the content recommendation target user selection device 1 will be described below with reference to FIG. As shown in FIG. 1, the content recommendation target user selection device 1 has processing function blocks of a score generation block 2, a learning block 3, and a selection block 4. The score generation block 2 includes operation information input means 2a, It has a content usage log storage unit 2b, a format conversion unit 2c, and a user / keyword score storage unit 2d. The learning block 3 includes a keyword relationship group extraction unit 3a, a keyword relationship group extraction result storage unit 3b, and a user by group. There is a separate score calculation means 3c, a group-specific user score storage section 3d, and the selection block 4 is a content keyword information input means 4a, a user-specific score calculation means 4b, a recommendation target user information output means 4c, a selection condition storage. Part (described as “selection condition” in the figure) 4d.

このコンテンツ推薦対象ユーザ選出装置１の各ブロックについて詳細を説明する。スコア生成ブロック２においては、操作情報入力手段２ａにより、図４に具体例を示す操作情報１０を読み込み、コンテンツ利用ログ格納部２ｂにおいて、図５に具体例を示す内容のコンテンツ利用ログ情報を記録する。 Details of each block of the content recommendation target user selection device 1 will be described. In the score generation block 2, the operation information input means 2a reads the operation information 10 shown in the specific example in FIG. 4, and the content use log storage unit 2b records the content use log information having the specific example in FIG. To do.

さらに、書式変換手段２ｃにより、コンテンツ利用ログ格納部２ｂで記録した図５に例示するコンテンツ利用ログ情報を参照して、コンテンツの利用履歴に対する重み付けや正規化、データの書式変換等を実施し、スコア付け等の処理を行い、ユーザ毎にコンテンツのメタデータであるキーワードそれぞれに対してスコアが記録されたユーザ・キーワード間スコアデータを生成し、ユーザ・キーワード間スコア格納部２ｄにおいて、図６に具体例を示す内容で記憶装置に格納する。 Further, the format conversion means 2c refers to the content usage log information illustrated in FIG. 5 recorded in the content usage log storage unit 2b, performs weighting and normalization for content usage history, data format conversion, etc. Processing such as scoring is performed to generate user / keyword score data in which a score is recorded for each keyword that is content metadata for each user. In the user / keyword score storage unit 2d, FIG. The content is stored in the storage device with a specific example.

例えば、書式変換手段２ｃは、図５に例示するコンテンツ利用ログにおいて、操作内容が「削除」となったキーワードがある場合、図６「ユーザ・キーワード間スコア」の該当ユーザの該当キーワードのスコアを「−１０」する。同様に、コンテンツ利用ログにおいて、操作内容が「アイコン保存」となったキーワードがある場合、「ユーザ・キーワード間スコア」の該当ユーザの該当キーワードのスコアを「＋１０」する。 For example, if there is a keyword whose operation content is “deleted” in the content usage log illustrated in FIG. 5, the format conversion unit 2 c calculates the score of the corresponding keyword of the corresponding user in the “user-keyword score” in FIG. 6. “−10”. Similarly, when there is a keyword whose operation content is “save icon” in the content usage log, the score of the corresponding keyword of the corresponding user of “user-keyword score” is “+10”.

尚、この書式変換手段２ｃでは、操作情報１０の種類や目的に応じて、スコアのつけ方や、変換の手法等をカスタマイズすることができる。 The format conversion means 2c can customize the score assignment method, conversion method, and the like according to the type and purpose of the operation information 10.

また、図５のコンテンツ利用ログにおいて、異なった利用シーンにおいて取得した操作情報１０が混在することも考えられる。例えば、あるテレビ番組の視聴履歴における「途中で視聴中断した」情報や、端末上のコンテンツアイコンの操作履歴における「アイコンを削除した」情報、書籍の利用履歴データにおける「１０分間アイコンを所持していた」情報などが混在する場合がある。 Further, in the content usage log of FIG. 5, operation information 10 acquired in different usage scenes may be mixed. For example, information on “intermediate viewing has been interrupted” in the viewing history of a TV program, information on “icon deleted” in the operation history of a content icon on the terminal, and “10-minute icon in book usage history data” Information may be mixed.

この場合は、予め書式変換手段２ｃにおいて、それぞれの利用シーン毎にスコアを定義しておくことで、異なる利用シーンであっても「ユーザ・キーワード間スコア」を生成することができる。例えば、「アイコン保存＝＋１０点」、「アイコン削除＝−１０点」、「アイコン消滅＝−５点」、「ａ分間アイコン所有＝＋log１０（ａ）点」、「コンテンツ起動＝＋２０点」、「コンテンツ途中中断＝−５点」、「コンテンツ完全利用＝＋５点」と定義しておく。 In this case, by defining a score for each usage scene in the format conversion means 2c in advance, it is possible to generate a “user-keyword score” even for different usage scenes. For example, “icon save = + 10 points”, “icon deletion = −10 points”, “icon disappearance = −5 points”, “a minute icon ownership = + log 10 (a) points”, “content activation = + 20 points”, “ It is defined that “intermediate content interruption = −5 points” and “content complete use = + 5 points”.

次に、学習ブロック３においては、キーワード関係グループ抽出手段３ａにより、ユーザ・キーワード間スコア格納部２ｄで格納したユーザ・キーワード間スコアデータ（図６参照）を読み込み、関係があるキーワード同士をしかるべき演算を行って抽出し、関係が得られた各キーワードをグループ化し、そのグループにグループＩＤ（識別子）を付与し、その結果を、図８に例示する内容で、キーワード関係グループ抽出結果格納部３ｂにおいて格納する。 Next, in the learning block 3, the keyword-related group extracting means 3a reads the user-keyword score data (see FIG. 6) stored in the user-keyword score storage unit 2d, and the relevant keywords should be determined appropriately. The keywords extracted by performing the operation are grouped, the group ID (identifier) is given to the group, and the result is the keyword relationship group extraction result storage unit 3b with the contents illustrated in FIG. Store in.

さらに、グループ別ユーザ別スコア算出手段３ｃにより、キーワード関係グループ抽出結果格納部３ｂで格納したキーワード関係グループ抽出結果とユーザ・キーワード間スコア格納部２ｄで格納したユーザ・キーワード間スコアデータを基に、キーワード関係グループ毎に各ユーザのスコアをしかるべき演算を行って算出し、その結果を、図９に例示する内容でグループ別ユーザ別スコア格納部３ｄにおいて格納する。 Furthermore, based on the keyword-related group extraction result stored in the keyword-related group extraction result storage unit 3b and the user-keyword score data stored in the user-keyword score storage unit 2d by the group-specific user score calculation means 3c, The score of each user is calculated by performing an appropriate calculation for each keyword-related group, and the result is stored in the group-by-user score storage unit 3d with the contents illustrated in FIG.

選出ブロック４においては、コンテンツキーワード情報入力手段４ａにより、図２，３におけるコンテンツ配信制御装置２０等の外部装置から、図１１に例示する推薦対象コンテンツ情報２４に対応する図１２に例示するコンテンツキーワード集合１１を入力し、ユーザ別スコア算出手段４ｂにより、キーワード関係グループ抽出結果格納部３ｂとグループ別ユーザ別スコア格納部３ｄのそれぞれで格納された情報を読み出し、各情報を用いてしかるべき演算と処理を行ってユーザ別のスコアを算出する。 In the selection block 4, the content keyword illustrated in FIG. 12 corresponding to the recommendation target content information 24 illustrated in FIG. 11 is received from the external device such as the content distribution control device 20 in FIGS. The set 11 is input, and the user-specific score calculation means 4b reads the information stored in each of the keyword-related group extraction result storage unit 3b and the group-specific user score storage unit 3d, and performs an appropriate calculation using each information. Processing is performed to calculate a score for each user.

その後、推薦対象ユーザ情報出力手段４ｃにより、ユーザ別スコア算出手段４ｂで算出されたスコアが選出条件４ｄに合致する単数もしくは複数のユーザ（被推薦ユーザ）を選出し、推薦対象ユーザ情報１２として、図２，３におけるコンテンツ配信制御装置２０等の外部装置に出力する。 Thereafter, the recommendation target user information output means 4c selects one or a plurality of users (recommended users) whose scores calculated by the user-specific score calculation means 4b match the selection condition 4d, and the recommended target user information 12 is Output to an external device such as the content distribution control device 20 in FIGS.

図４に例示する操作情報では、ユーザＩＤ「ｕｓｅｒ-ｃ」のユーザが、そのユーザが使用している端末（２３ｄ）上に表示されている「子猫さしあげます」というコンテンツを起動する起動アイコンに対し「保存」操作を１５時５５分に行った場合の例である。ここでは、「子猫さしあげます」というコンテンツには予め「Ｋ１」および「Ｋ３４」というキーワード（メタデータ）が付与されているとする。 In the operation information illustrated in FIG. 4, the user with the user ID “user-c” uses an activation icon that activates the content “Sick Kitten” displayed on the terminal (23 d) used by the user. In this example, the “save” operation is performed at 15:55. Here, it is assumed that the keywords “K1” and “K34” are preliminarily assigned to the content “I will raise a kitten”.

このような操作情報（１０）は、端末（２３ｄ）の画面上の操作から取得することが可能である。例えば、端末（２３ｄ）の画面上にさまざまなコンテンツを起動するためのアイコンが表示されており、ユーザは必要に応じてそのアイコン押下することでコンテンツを利用したり、アイコンを削除したりフォルダに保存することが可能である場合、そのアイコンに対する操作を操作情報（１０）として取得することができる。 Such operation information (10) can be acquired from an operation on the screen of the terminal (23d). For example, icons for activating various contents are displayed on the screen of the terminal (23d), and the user can use the contents by pressing the icons as needed, delete the icons, If it can be saved, the operation for the icon can be acquired as the operation information (10).

また、この操作情報（１０）として、視聴履歴、操作履歴、入力履歴、取得履歴、利用履歴、行動履歴といった様々なデータを用いることが可能である。 As the operation information (10), various data such as a viewing history, an operation history, an input history, an acquisition history, a usage history, and an action history can be used.

尚、本例では、操作情報（１０）において、コンテンツを説明するためのメタデータであるキーワードが予め付与されており、そのキーワードを用いているが、コンテンツを説明するためのキーワードが付与されておらず、代わりにコンテンツの説明文が付与されている場合は、その説明文に対して形態素解析等を行い当該コンテンツのメタデータであるキーワードを抽出するための別途処理が必要となる。 In this example, in the operation information (10), a keyword that is metadata for explaining the content is assigned in advance, and the keyword is used, but a keyword for explaining the content is assigned. If a description of the content is given instead, a separate process is required to extract a keyword that is metadata of the content by performing morphological analysis on the description.

図５に例示するコンテンツ利用ログは、随時入力した操作情報（１０）から、学習ブロック３が利用可能なユーザ・キーワード間スコアデータの生成に必要な項目を抜粋して蓄積したものであり、「時刻」、「ユーザＩＤ」、「操作対象キーワード」、「操作内容」の各項目からなる。 The content usage log illustrated in FIG. 5 is an accumulation of excerpts of items necessary for generating score data between users and keywords that can be used by the learning block 3 from the operation information (10) input as needed. Each item includes “time”, “user ID”, “operation target keyword”, and “operation content”.

図６に例示するユーザ・キーワード間スコアデータは、図１の書式変換手段２ｃによって、例えば図５に示すコンテンツ利用ログからスコア付け処理を行い、学習ブロック３で参照する形式で保存されたものであり、本例のユーザ・キーワード間スコアデータは、ユーザとキーワードの２次元配列変数であり、ユーザ毎およびキーワード毎にそれぞれにスコア（数値）が記録されている。 The score data between the user and the keyword illustrated in FIG. 6 is stored in a format referred to in the learning block 3 by performing scoring processing from the content use log shown in FIG. Yes, the score data between the user and the keyword in this example is a two-dimensional array variable of the user and the keyword, and a score (numerical value) is recorded for each user and each keyword.

以下、この図６に示すコンテンツ利用ログに基づく学習ブロック３における処理動作例を、図７に基づき説明する。尚、本図７の例は、キーワード関係グループ抽出手段３ａにおけるキーワード関係グループを生成する演算にクラスタリング処理を用いる。 Hereinafter, an example of processing operation in the learning block 3 based on the content usage log shown in FIG. 6 will be described with reference to FIG. In the example of FIG. 7, clustering processing is used for the operation for generating the keyword relationship group in the keyword relationship group extraction unit 3 a.

このように、キーワード関係グループを生成する演算にクラスタリング処理を用いた場合、キーワード関係グループ抽出手段３ａにおいて、まず、ユーザ・キーワード間スコア格納部２ｄで格納されたユーザ・キーワード間スコアデータを読み込み（ステップＳ７０１）、当該ユーザ・キーワード間スコアデータに対するクラスタリング処理を実施してキーワード集合のクラスタを生成する（ステップＳ７０２）。そして、生成したキーワード集合のクラスタを、それぞれキーワード関係グループとしてキーワード関係グループ抽出結果格納部３ｂに渡し格納する（ステップＳ７０３）。 As described above, when the clustering process is used for the operation for generating the keyword relation group, the keyword relation group extraction unit 3a first reads the user / keyword score data stored in the user / keyword score storage unit 2d ( In step S701, a clustering process is performed on the user-keyword score data to generate a cluster of keyword sets (step S702). Then, the generated cluster of the keyword set is transferred to and stored in the keyword relationship group extraction result storage unit 3b as a keyword relationship group (step S703).

生成されたキーワード関係グループは、例えば、図８に示すように、「グループＡ＝｛Ｋ１，Ｋ２，Ｋ８｝、「グループＢ＝｛Ｋ３，Ｋ９，Ｋ１０，Ｋ１２，Ｋ１５｝」、…といった具合になる。 For example, as shown in FIG. 8, the generated keyword-related groups are “group A = {K1, K2, K8},“ group B = {K3, K9, K10, K12, K15} ”, and so on. Become.

尚、このクラスタリング処理の際は、生成クラスタ数、データの正規化の有無といったクラスタリング条件を設定ファイルにて記述しておいたり、ハードコーディングによって定義しておくことができる。また、生成されたキーワード関係グループ以外にも、予め登録しておいたキーワード関係グループを用いることも可能である。 In this clustering process, clustering conditions such as the number of generated clusters and the presence / absence of data normalization can be described in a setting file or can be defined by hard coding. In addition to the generated keyword relation group, a keyword relation group registered in advance can be used.

次に、学習ブロック３では、グループ別ユーザ別スコア算出手段３ｃにより、キーワード関係グループ別にユーザ別のスコアであるグループ別ユーザ別スコアデータを算出する。すなわち、グループ別ユーザ別スコア算出手段３ｃは、ユーザ・キーワード間スコア格納部２ｄからユーザ・キーワード間スコアデータを１レコードずつ読み込む（ステップＳ７０４）。 Next, in the learning block 3, the group-specific user score calculation means 3c calculates group-specific user score data that is a user-specific score for each keyword-related group. That is, the group-by-user score calculation means 3c reads the user / keyword score data from the user / keyword score storage unit 2d one record at a time (step S704).

例えば、キーワードを「ｋｅｙ」、ユーザを「ｕ」，ユーザ・キーワード間スコアデータを「ＳＣＯＲＥ（ｋｅｙ，ｕ）」と表す場合、図６に示すデータを１レコード読み込むと、「ＳＣＯＲＥ（Ｋ１，ｕｓｅｒ-ａ）＝２」、「ＳＣＯＲＥ（Ｋ１，ｕｓｅｒ-ｂ）＝−５」、「ＳＣＯＲＥ（Ｋ１，ｕｓｅｒ-ｃ）＝２」、「ＳＣＯＲＥ（Ｋ１，ｕｓｅｒ-ｄ）＝１」、「ＳＣＯＲＥ（Ｋ１，ｕｓｅｒ-ｅ）＝２」、「ＳＣＯＲＥ（Ｋ１，ｕｓｅｒ-ｆ）＝０」となる。 For example, when the keyword is “key”, the user is “u”, and the user-keyword score data is “SCORE (key, u)”, when one record of the data shown in FIG. 6 is read, “SCORE (K1, user) -a) = 2 "," SCORE (K1, user-b) =-5 "," SCORE (K1, user-c) = 2 "," SCORE (K1, user-d) = 1 "," SCORE ( K1, user-e) = 2 "and" SCORE (K1, user-f) = 0 ".

さらに、グループ別ユーザ別スコア算出手段３ｃは、このようにして読み込んだキーワード（ｋｅｙ）が、ステップＳ７０２で生成してステップＳ７０３においてキーワード関係グループ抽出結果格納部３ｂで格納したキーワード関係グループのいずれかに含まれているかを判別し、ｋｅｙを含むキーワード関係グループのグループＩＤである「ｇｒ」を取得し、（ステップＳ７０５）、そして、キーワード関係グループＩＤ「ｇｒ」のユーザｕの「グループ別ユーザ別スコア」ＳＣ（ｇｒ，ｕ）に、ステップＳ７０４で読み込んだスコアＳＣＯＲＥ（ｋｅｙ，ｕ）を加算する（「ＳＣ（ｇｒ，ｕ）＝ＳＣ（ｇｒ，ｕ）＋ＳＣＯＲＥ（ｋｅｙ，ｕ）」）（ステップＳ７０５）。 Further, the group-specific user score calculation means 3c uses any one of the keyword-related groups generated in step S702 and stored in the keyword-related group extraction result storage unit 3b in step S703. And “gr”, which is the group ID of the keyword related group including the key, is acquired (step S705), and “by user by group” of the user u of the keyword related group ID “gr” The score SCORE (key, u) read in step S704 is added to the score “SC (gr, u) (“ SC (gr, u) = SC (gr, u) + SCORE (key, u) ”) (step S705).

例えば、ステップＳ７０２で生成してステップＳ７０３においてキーワード関係グループ抽出結果格納部３ｂで格納したキーワード関係グループが図８に例示するキーワード関係グループであり、ステップＳ７０４で、図６に例示するユーザ・キーワード間スコアデータにおける最初の１レコード読み込んだ場合、ステップＳ７０４で読み込んだキーワードＫ１は、図８においてキーワード関係グループＡに含まれているため、図９に示すグループ別ユーザ別スコアデータにおけるグループＡ項目のユーザ毎のスコアにそれぞれキーワードＫ１の各ユーザのスコアを加算する。 For example, the keyword relation group generated in step S702 and stored in the keyword relation group extraction result storage unit 3b in step S703 is the keyword relation group illustrated in FIG. 8, and in step S704, between the user and the keyword illustrated in FIG. When the first record in the score data is read, since the keyword K1 read in step S704 is included in the keyword relation group A in FIG. 8, the user of the group A item in the group-specific user score data shown in FIG. The score of each user of the keyword K1 is added to each score.

この例の場合、グループ別ユーザ別スコアＳＣ（ｇｒ，ｕ）の演算は、「ＳＣ（グループＡ，ｕｓｅｒ-ａ）＝ＳＣ（グループＡ，ｕｓｅｒ-ａ）＋２」、「ＳＣ（グループＡ，ｕｓｅｒ-ｂ）＝ＳＣ（グループＡ，ｕｓｅｒ-ｂ）−５」、「ＳＣ（グループＡ，ｕｓｅｒ-ｃ）＝ＳＣ（グループＡ，ｕｓｅｒ-ｃ）＋２」、「ＳＣ（グループＡ，ｕｓｅｒ-ｄ）＝ＳＣ（グループＡ，ｕｓｅｒ-ｄ）＋１」、「ＳＣ（グループＡ，ｕｓｅｒ-ｅ）＝ＳＣ（グループＡ，ｕｓｅｒ-ｅ）＋２」、「ＳＣ（グループＡ，ｕｓｅｒ-ｆ）＝ＳＣ（グループＡ，ｕｓｅｒ-ｆ）＋０」となる。 In this example, the calculation of the group-specific user score SC (gr, u) is “SC (group A, user-a) = SC (group A, user-a) +2”, “SC (group A, user). -b) = SC (group A, user-b) -5 "," SC (group A, user-c) = SC (group A, user-c) +2 "," SC (group A, user-d) " = SC (group A, user-d) +1 "," SC (group A, user-e) = SC (group A, user-e) + 2 "," SC (group A, user-f) = SC (group A, user-f) +0 ".

グループ別ユーザ別スコア算出手段３ｃでは、ユーザ・キーワード間スコア格納部２ｄから全てのレコードを読み込み終わった場合は、次のステップＳ７０７の処理を行い、、まだの場合は再びステップＳ７０４からの処理を繰り返し実行する。尚、この際の分岐条件としては、ユーザ・キーワード間スコア格納部２ｄからすべてのレコードを読み込み終わった場合以外にも、予め設定しておいた条件に合致した場合なども考えられる。 In the group-by-user score calculation means 3c, when all the records have been read from the user / keyword score storage unit 2d, the process of the next step S707 is performed. If not, the process from step S704 is performed again. Run repeatedly. In addition, as a branching condition at this time, not only the case where all records have been read from the user / keyword score storage unit 2d, but also a case where the preset condition is met.

ステップＳ７０７において、グループ別ユーザ別スコア算出手段３ｃは、ステップＳ７０５で生成したグループ別ユーザ別スコアＳＣ（ｇｒ，ｕ）を、グループ別ユーザ別スコア格納部３ｄに渡して格納する。 In step S707, the group-specific user score calculation means 3c passes the group-specific user score SC (gr, u) generated in step S705 to the group-specific user score storage unit 3d for storage.

図８に示すキーワード関係グループデータは、キーワード関係グループ抽出手段３ａにおいて抽出されたキーワード関係グループを記述したデータである。本例のキーワード関係グループデータでは、キーワード関係グループ毎に、そのグループＩＤと、それぞれグループに含まれるキーワードを記述している。 The keyword relation group data shown in FIG. 8 is data describing the keyword relation group extracted by the keyword relation group extraction means 3a. In the keyword-related group data of this example, for each keyword-related group, the group ID and the keyword included in each group are described.

尚、キーワード関係グループデータとしては、キーワード関係グループ抽出手段３ａで抽出したキーワード関係グループ以外に、予め定義しておいたキーワード関係グループも格納することが可能である。例えば、図８に例示するように、キーワード関係グループ抽出手段３ａで抽出したキーワード関係グループ以外に、全てのキーワードをグループにした「全キーワード」というキーワード関係グループを、予め登録しておくこともできる。 In addition to the keyword relationship group extracted by the keyword relationship group extraction unit 3a, a keyword relationship group defined in advance can be stored as the keyword relationship group data. For example, as illustrated in FIG. 8, in addition to the keyword related group extracted by the keyword related group extracting unit 3a, a keyword related group called “all keywords” in which all keywords are grouped can be registered in advance. .

図９に示すグループ別ユーザ別スコアデータは、キーワード関係グループ抽出手段３ａで生成したキーワード関係グループ別に、ユーザそれぞれのスコアを格納したデータであり、ここでは、キーワード関係グループ抽出手段３ａで生成されたキーワード関係グループ「グループＡ」、「グループＢ」、「グループＣ」、「グループＤ」と、すべてのキーワードが含まれる「全キーワード」それぞれにおいて、ユーザ「ｕｓｅｒ-a」、「ｕｓｅｒ-b」、「ｕｓｅｒ-ｃ」、「ｕｓｅｒ-d」、「ｕｓｅｒ-e」、「ｕｓｅｒ-f」のスコアがそれぞれ対応付けて格納されている。 The group-specific user-specific score data shown in FIG. 9 is data in which the score of each user is stored for each keyword-related group generated by the keyword-related group extracting unit 3a. Here, the score data generated by the keyword-related group extracting unit 3a is used. In each of the keyword related groups “group A”, “group B”, “group C”, “group D” and “all keywords” including all keywords, the users “user-a”, “user-b”, The scores of “user-c”, “user-d”, “user-e”, and “user-f” are stored in association with each other.

尚、図８の例で説明したように、「全キーワード」は、キーワード関係グループ抽出手段３ａで抽出されたキーワード関係グループではなく、別途定義されていたキーワード関係グループである。また、グループ別ユーザ別スコアは、学習ブロック３の実行時は空であり、グループ別ユーザ別スコア算出手段３ａにおいてスコアが加算されて生成される。 As described in the example of FIG. 8, “all keywords” is not a keyword relationship group extracted by the keyword relationship group extraction unit 3a, but a keyword relationship group defined separately. The group-specific user-specific score is empty when the learning block 3 is executed, and is generated by adding the score in the group-specific user-specific score calculation means 3a.

次に、このように学習ブロック３で生成した図８に示すキーワード関係グループデータと図９に示すグループ別ユーザ別スコアデータに基づく選出ブロック４における処理動作例を、図１０に基づき説明する。 Next, an example of processing operation in the selection block 4 based on the keyword-related group data shown in FIG. 8 generated in the learning block 3 and the group-specific user-specific score data shown in FIG. 9 will be described based on FIG.

まず、コンテンツキーワード情報入力手段４ａにおいて、推薦対象のコンテンツのメタデータであるキーワードの集合としてのコンテンツキーワード情報を入力する（ステップＳ８０１）。次に、ユーザ別スコア算出手段４ｂにおいて、キーワード関係グループ抽出結果格納部３ｂからキーワード関係グループデータを読み込む（ステップＳ８０２）。 First, in the content keyword information input means 4a, content keyword information as a set of keywords that is metadata of content to be recommended is input (step S801). Next, the keyword-related group calculation unit 4b reads keyword-related group data from the keyword-related group extraction result storage unit 3b (step S802).

さらに、ユーザ別スコア算出手段４ｂにおいて、ステップＳ８０１で入力したコンテンツキーワード情報と、ステップＳ８０２で読み込んだキーワード関係グループデータの各グループとの類似度を所定の算出式を用いて算出する（ステップＳ８０３）。類似度はキーワード関係グループのグループｇｒごとに算出する。 Further, the score calculation unit 4b for each user calculates the similarity between the content keyword information input in step S801 and each group of the keyword-related group data read in step S802 using a predetermined calculation formula (step S803). . The similarity is calculated for each group gr of the keyword related group.

類似度の算出式の例としては、「類似度α（ｇｒ）＝（コンテンツキーワード情報と、キーワード関係グループｇｒとの両方に含まれるキーワード数）÷（キーワード関係グループｇｒに含まれるキーワード数）」がある。また、この他にも、コンテンツキーワード情報やキーワード関係グループは、それぞれキーワード集合のため、キーワードベクトルと考えて、キーワードベクトル同士の内積、余弦演算で類似度を求めても良い。 As an example of the similarity calculation formula, “similarity α (gr) = (number of keywords included in both content keyword information and keyword relationship group gr) / ÷ (number of keywords included in keyword relationship group gr)” There is. In addition, since the content keyword information and the keyword relation group are each a set of keywords, the content keyword information and the keyword relation group may be considered as a keyword vector, and the similarity may be obtained by an inner product of the keyword vectors or a cosine calculation.

上述の算出式を用いた場合、例えば、ステップＳ８０１で図１２に示すコンテンツキーワード情報を入力し、ステップＳ８０２で図８に示すキーワード関係グループデータを読み込んだ場合、グループＡに属するキーワード数は「３（＝Ｋ１，Ｋ２，Ｋ８）」で、図１２におけるコンテンツキーワード情報とグループＡとの両方に含まれるキーワード数は「０」なので、「類似度α（グループＡ）＝０÷３＝０」となる。 When the above calculation formula is used, for example, when the content keyword information shown in FIG. 12 is input in step S801 and the keyword-related group data shown in FIG. 8 is read in step S802, the number of keywords belonging to group A is “3. (= K1, K2, K8) ”and the number of keywords included in both the content keyword information and group A in FIG. 12 is“ 0 ”, so“ similarity α (group A) = 0 ÷ 3 = 0 ”. Become.

同様に、グループＢに属するキーワード数は「５（＝Ｋ３，Ｋ９，Ｋ１０，Ｋ１２，Ｋ１５）」、コンテンツキーワード情報とグループＢとの両方に含まれるキーワード数は「２（＝Ｋ３，Ｋ１５）」なので、「類似度α（グループＢ）＝２÷５＝０．４」となり、グループＣに属するキーワード数は「２」、コンテンツキーワード情報とグループＣとの両方に含まれるキーワード数は「１」なので、「類似度α（グループＣ）＝１÷２＝０．５」、グループＤに属するキーワード数は「５」、コンテンツキーワード情報とグループＤとの両方に含まれるキーワード数は「１」なので、「類似度α（グループＤ）＝１÷５＝０．２」となり、全キーワード関係グループに属するキーワード数は「１５」、コンテンツキーワード情報と全キーワード関係グループとの両方に含まれるキーワード数は「４」なので、「類似度α（全キーワード）＝４÷１５＝０．２６７」となる。 Similarly, the number of keywords belonging to group B is “5 (= K3, K9, K10, K12, K15)”, and the number of keywords included in both the content keyword information and group B is “2 (= K3, K15)”. Therefore, “similarity α (group B) = 2 ÷ 5 = 0.4”, the number of keywords belonging to group C is “2”, and the number of keywords included in both the content keyword information and group C is “1”. Therefore, “similarity α (group C) = 1 ÷ 2 = 0.5”, the number of keywords belonging to group D is “5”, and the number of keywords included in both the content keyword information and group D is “1”. , “Similarity α (group D) = 1 ÷ 5 = 0.2”, the number of keywords belonging to all keyword-related groups is “15”, content keyword information and all keywords Number of keywords included in both the engagement groups since "4", and "similarity alpha (total keyword) = 4 ÷ 15 = 0.267".

このようにして類似度を算出した後、ユーザ別スコア算出手段４ｂは、類似度が一番大きいキーワード関係グループのグループＩＤを選出する（ステップＳ８０４）。上述の例の場合、類似度が一番大きいキーワード関係グループのグループＩＤは、グループＣ（類似度＝０．５）である。 After calculating the similarity in this way, the score calculation unit 4b for each user selects the group ID of the keyword-related group with the highest similarity (step S804). In the case of the above-described example, the group ID of the keyword-related group having the highest similarity is group C (similarity = 0.5).

そして、ユーザ別スコア算出手段４ｂは、グループ別ユーザ別スコア格納部３ｄで格納したグループ別ユーザ別スコア「ＳＣ（グループｇｒ，ユーザｕ）」を参照して、ステップＳ８０４で選出したキーワード関係グループのユーザ別のスコアＳ（ｕ）を取得する。 Then, the user-specific score calculation means 4b refers to the group-specific user score “SC (group gr, user u)” stored in the group-specific user-specific score storage unit 3d, and determines the keyword-related group selected in step S804. The score S (u) for each user is acquired.

例えば、上述の例のように、ステップＳ８０４においてグループＣが選出された場合、図９におけるグループ別ユーザ別スコアデータからグループＣのユーザ別スコアを読み込み、その結果、取得するユーザ別スコアＳ（ｕ）は、「Ｓ（ｕｓｅｒ- ａ）＝ＳＣ（グループＣ，ｕｓｅｒ-ａ）＝−６」、「Ｓ（ｕｓｅｒ-ｂ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｂ）＝＋７」、「Ｓ（ｕｓｅｒ-ｃ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｃ）＝−８」、「Ｓ（ｕｓｅｒ-ｄ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｄ）＝＋９」、「Ｓ（ｕｓｅｒ-ｅ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｅ）＝−１３」、「Ｓ（ｕｓｅｒ-ｆ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｆ）＝−１０」となる。 For example, as in the above example, when the group C is selected in step S804, the read user-specific group score C from a group by subscriber score data in Figure 9, so that the user-specific scores S (u to obtain ) “S (user-a) = SC (group C, user-a) = − 6”, “S (user-b) = SC (group C, user-b) = + 7”, “S (user -c) = SC (group C, user-c) =-8 "," S (user-d) = SC (group C, user-d) = + 9 "," S (user-e) = SC (group C, user-e) = − 13 ”and“ S (user-f) = SC (group C, user-f) = − 10 ”.

推薦対象ユーザ情報出力手段４ｃでは、このようにしてユーザ別スコア算出手段４ｂで算出されたユーザ別スコアＳ（ｕ）から、予め選出条件記憶部４ｄにおいて設定された選出条件を満たしたスコアを有するユーザを選出し（ステップＳ８０６）、選出したユーザの識別子（ユーザＩＤ）を出力する（ステップＳ８０７）。 The recommendation target user information output unit 4c has a score that satisfies the selection condition set in advance in the selection condition storage unit 4d from the user-specific score S (u) calculated by the user-specific score calculation unit 4b in this way. A user is selected (step S806), and an identifier (user ID) of the selected user is output (step S807).

例えば、ステップＳ８０５で取得したユーザ別スコアが上述した「Ｓ（ｕｓｅｒ-ａ）＝ＳＣ（グループＣ，ｕｓｅｒ-ａ）＝−６」、「Ｓ（ｕｓｅｒ-ｂ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｂ）＝＋７」、「Ｓ（ｕｓｅｒ-ｃ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｃ）＝−８」、「Ｓ（ｕｓｅｒ-ｄ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｄ）＝＋９」、「Ｓ（ｕｓｅｒ-ｅ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｅ）＝−１３」、「Ｓ（ｕｓｅｒ-ｆ）＝ＳＣ（グループＣ，ｕｓｅｒ-ｆ）＝−１０」の場合で、選出対象ユーザの設定条件が「スコアが０以上のユーザ」である場合、その選出条件を満たした「ｕｓｅｒ-ｂ」と「ｕｓｅｒ-ｄ」を選出して出力する。 For example, the user-specific scores acquired in step S805 are “S (user-a) = SC (group C, user-a) = − 6” and “S (user-b) = SC (group C, user- b) = + 7 ”,“ S (user-c) = SC (group C, user-c) = − 8 ”,“ S (user-d) = SC (group C, user-d) = + 9 ”,“ In the case of S (user-e) = SC (group C, user-e) = − 13 ”and“ S (user-f) = SC (group C, user-f) = − 10 ”, the selection target user When the setting condition is “user with score of 0 or more”, “user-b” and “user-d” that satisfy the selection condition are selected and output.

尚、選出条件記憶部４ｄに設定する選出条件は、予めハードコーディングしておいても良い。また、設定ファイルで記述しておいても良い。さらに、選択条件としては、ユーザ数やグループに含まれるキーワード数をはじめとする値を変数とした数式で記述しておいても良い。 The selection conditions set in the selection condition storage unit 4d may be hard-coded in advance. It may also be described in a configuration file. Furthermore, the selection condition may be described by a mathematical expression using values such as the number of users and the number of keywords included in the group as variables.

図１１に示す推薦対象コンテンツ情報は、ユーザに推薦するコンテンツに関する情報を記述しており、本例では、コンテンツ名（「ＧｏＧｏトラ物語第３話」）やそのコンテンツへアクセスするためのＵＲＬ（リンク先ＵＲＬ）、そのコンテンツのアイコンのＵＲＬ（アイコンＵＲＬ）、さらに、そのコンテンツの内容を表現する単数もしくは複数のキーワード（Ｋ３，Ｋ４，Ｋ１３，Ｋ１５）が登録されている。尚、このような推薦対象コンテンツ情報のキーワードは、そのコンテンツの説明文を形態素解析したものを用いることも可能である。 The recommendation target content information shown in FIG. 11 describes information related to the content recommended to the user. In this example, the content name (“GoGo Tora Story Episode 3”) and the URL (link) for accessing the content are shown. The destination URL), the URL of the icon of the content (icon URL), and one or more keywords (K3, K4, K13, K15) expressing the contents of the content are registered. In addition, as a keyword of such recommendation target content information, it is possible to use a morphological analysis of the description of the content.

図１２に示すコンテンツキーワード情報は、図１１の推薦対象コンテンツ情報からキーワードだけを取り出した情報であり、「Ｋ３，Ｋ４，Ｋ１３，Ｋ１５」の各キーワードからなる。 The content keyword information shown in FIG. 12 is information obtained by extracting only the keyword from the recommendation target content information shown in FIG. 11, and includes the keywords “K3, K4, K13, K15”.

図１における選出ブロック４の処理動作例としては、図１０で示したものに限定されるものではなく、例えば、図１３に示す処理動作でも良い。この図１３に示す処理動作例は、図１０の処理フローにおけるステップＳ８０４とステップＳ８０５の処理部分のみ、図１３におけるステップＳ１３０４として別手法にしたものであり、図１３におけるステップＳ１３０１〜Ｓ１３０３、および、ステップＳ１３０５，Ｓ１３０６の処理は、それぞれ、図１０におけるステップＳ８０１〜Ｓ８０３、および、ステップＳ８０６，Ｓ８０７の処理に相当する。以下、その相違するステップＳ１３０４の処理部分のみ説明する。 The processing operation example of the selection block 4 in FIG. 1 is not limited to the processing operation shown in FIG. 10, and for example, the processing operation shown in FIG. In the processing operation example shown in FIG. 13, only the processing part of steps S804 and S805 in the processing flow of FIG. 10 is changed to step S1304 in FIG. 13, and steps S1301 to S1303 in FIG. The processes in steps S1305 and S1306 correspond to the processes in steps S801 to S803 and steps S806 and S807 in FIG. 10, respectively. Only the processing part of step S1304 that is different will be described below.

ここで、キーワード関係グループを「ｇｒ」、コンテンツキーワード情報とキーワード関係グループｇｒとの類似度を「α（ｇｒ）」、グループ別ユーザ別スコアを「ＳＣ（ｇｒ，ｕ）」とする。本図１３のステップＳ１３０４の処理において、ユーザ別スコア「Ｓ（ｕ）」は、「Ｓ（ｕ）＝Σｇｒ｛ＳＣ（ｇｒ，ｕ）×α（ｇｒ）｝として算出する。この手法を用いることによって、複数のキーワード関係グループの特徴を反映したユーザ選出が可能となる。 Here, it is assumed that the keyword-related group is “gr”, the similarity between the content keyword information and the keyword-related group gr is “α (gr)”, and the group-specific user score is “SC (gr, u)”. 13, the user-specific score “S (u)” is calculated as “S (u) = Σgr {SC (gr, u) × α (gr)}. This method is used. This makes it possible to select users reflecting the characteristics of a plurality of keyword-related groups.

以上、図１〜図１３を用いて説明したように、本例では、ネットワークを介して配信するコンテンツの推薦対象となるユーザ（被対象ユーザ）をコンピュータ処理して選出する際、まず、スコア生成ブロック２において、ユーザのコンテンツに対する操作内容と当該コンテンツのメタデータであるキーワード（図４および図５に示す操作情報およびコンテンツ利用ログ情報）に基づき、ユーザ別、かつキーワード別に操作内容に応じたスコアを累計して表（図６に示すユーザ・キーワード間スコア情報）を生成する。 As described above with reference to FIGS. 1 to 13, in this example, when a user (target user) to be recommended for content distributed via a network is selected by computer processing, first, score generation is performed. In block 2, based on the operation content of the user's content and the keyword (operation information and content usage log information shown in FIGS. 4 and 5) of the content, a score corresponding to the operation content for each user and for each keyword Is accumulated to generate a table (score information between users and keywords shown in FIG. 6).

次に、学習ブロック３において、図６に示すユーザ・キーワード間スコア情報に基づき、キーワードをグループ分けすると共に（図８のキーワード関係グループ情報）、この図８のキーワード関係グループ情報と図６のユーザ・キーワード間スコア情報に基づき、グループ別、かつ、ユーザ別に、グループ配下の各キーワードのスコアを累計して表（図９のグループ別ユーザ別スコア情報）を生成する。 Next, in the learning block 3, the keywords are grouped based on the user / keyword score information shown in FIG. 6 (keyword related group information in FIG. 8), and the keyword related group information in FIG. 8 and the user in FIG. Based on the inter-keyword score information, the score of each keyword under the group is accumulated for each group and for each user to generate a table (score information for each user in FIG. 9).

そして、選出ブロック４において、推薦対象のコンテンツのメタデータであるキーワード群と図８のキーワード関係グループ情報における各グループのキーワード群との類似度を算出し、例えば、最も類似するキーワード群のグループを特定し、このグループに図９に示すグループ別ユーザ別スコア情報で対応付けられた各ユーザのスコアの内、例えばスコアが一定値以上である等の所定条件に合致するスコアのユーザを求め、求めたユーザを被対象ユーザとして選出する。 Then, in the selection block 4, the similarity between the keyword group that is the metadata of the content to be recommended and the keyword group of each group in the keyword-related group information in FIG. 8 is calculated. For example, the group of the most similar keyword group is determined. A user having a score that matches a predetermined condition such as, for example, the score being equal to or higher than a certain value among the scores of each user identified and associated with this group by the group-specific user score information shown in FIG. Selected users as target users.

このように、本例の構成および処理によれば、全ユーザの利用履歴を基にして生成したキーワード関係グループを用いる協調的な手法を用いたユーザ推薦（リコメンド）を行うので、被推薦ユーザにとって利用実績が無いコンテンツであっても、当該被推薦ユーザへの推薦が可能となる。また、コンテンツをキーワードベクトルとして表現するため、全ユーザの利用実績がない新しいコンテンツであっても、当該ユーザへの推薦が可能となる。さらに、あるコンテンツを、多くのユーザ中から推薦に適したユーザを選出する場合であっても、協調ユーザを選出する必要がなく、コンテンツが属するグループを選択する処理を行うだけなので、処理がユーザ数に関連して大きくならず、高速なユーザ選出処理が可能となる。 As described above, according to the configuration and processing of this example, user recommendation (recommendation) is performed using a collaborative method using a keyword relation group generated based on the usage history of all users. Even content that has not been used can be recommended to the recommended user. In addition, since the content is expressed as a keyword vector, it is possible to recommend new content that has not been used by all users. Furthermore, even when selecting a user suitable for recommendation from a large number of users, there is no need to select a collaborative user and only the process of selecting a group to which the content belongs is performed. The user selection process can be performed at high speed without increasing in relation to the number.

尚、本発明は、図１〜図１３を用いて説明した例に限定されるものではなく、その要旨を逸脱しない範囲において種々変更可能である。例えば、本例では、キーワード関係グループ抽出手段３ａにおける、関係あるキーワード同士をグループ化する手法として、クラスタリング処理手法を用いているが、このクラスタリング手法以外にも、データマイニングの相関ルール抽出手法によってグループを生成する手法等を用いることでも良い。 In addition, this invention is not limited to the example demonstrated using FIGS. 1-13, In the range which does not deviate from the summary, various changes are possible. For example, in this example, a clustering processing technique is used as a technique for grouping related keywords in the keyword relation group extracting unit 3a. However, in addition to this clustering technique, grouping is performed by an association rule extraction technique for data mining. It is also possible to use a method for generating.

本発明に係わるコンテンツ推薦対象ユーザ選出装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the content recommendation object user selection apparatus concerning this invention. 本発明に係わるコンテンツ推薦対象ユーザ選出装置が組み込まれたコンテンツ推薦システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the content recommendation system incorporating the content recommendation object user selection apparatus concerning this invention. 図２におけるコンテンツ推薦システムの詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of the content recommendation system in FIG. 図１におけるスコア生成ブロックの操作情報入力手段により入力される操作情報の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the operation information input by the operation information input means of the score generation block in FIG. 図１におけるスコア生成ブロックのコンテンツ利用ログ格納部で格納されるコンテンツ利用ログの具体例を示す説明図である。It is explanatory drawing which shows the specific example of the content utilization log stored in the content utilization log storage part of the score generation block in FIG. 図１におけるスコア生成ブロックの書式変換手段により出力されるユーザ・キーワード間スコアの具体例を示す説明図である。It is explanatory drawing which shows the specific example of the score between user and keywords output by the format conversion means of the score generation block in FIG. 図１における学習ブロックの処理動作例を示すフローチャートである。It is a flowchart which shows the processing operation example of the learning block in FIG. 図１における学習ブロックのキーワード関係グループ抽出手段により抽出されキーワード関係グループ抽出結果格納部で格納されるキーワード関係グループ生成結果の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the keyword relation group production | generation result extracted by the keyword relation group extraction means of the learning block in FIG. 1, and stored in the keyword relation group extraction result storage part. 図１における学習ブロックのグループ別ユーザ別スコア算出手段により算出されグループ別ユーザ別スコア格納部で格納されるグループ別ユーザ別スコア生成結果の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the score production | generation result classified by user classified by group computed by the score calculation means classified by user classified by group of the learning block in FIG. 1 and stored in the score storage part classified by user classified by group. 図１における選出ブロックの処理動作例を示すフローチャートである。It is a flowchart which shows the processing operation example of the selection block in FIG. 図３におけるコンテンツ推薦システムのコンテンツ配信制御装置に入力される推薦対象コンテンツ情報の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the recommendation object content information input into the content delivery control apparatus of the content recommendation system in FIG. 本発明に係わるコンテンツ推薦対象ユーザ選出装置に入力されるコンテンツキーワード情報の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the content keyword information input into the content recommendation object user selection apparatus concerning this invention. 図１における選出ブロックの他の処理動作例を示すフローチャートである。It is a flowchart which shows the other process operation example of the selection block in FIG.

Explanation of symbols

１：コンテンツ推薦対象ユーザ選出装置、２：スコア生成ブロック、２ａ：操作情報入力手段、２ｂ：コンテンツ利用ログ格納部、２ｃ：書式変換手段、２ｄ：ユーザ・キーワード間スコア格納部、３：学習ブロック、３ａ：キーワード関係グループ抽出手段、３ｂ：キーワード関係グループ抽出結果格納部、３ｃ：グループ別ユーザ別スコア算出手段、３ｄ：グループ別ユーザ別スコア格納部、４：選出ブロック、４ａ：コンテンツキーワード情報入力手段、４ｂ：ユーザ別スコア算出手段、４ｃ：推薦対象ユーザ情報出力手段、４ｄ：選択条件記憶部、１０：操作情報、１１：コンテンツキーワード情報、１２：推薦対象ユーザ情報、２０：コンテンツ配信制御装置、２１：コンテンツデータベース（「コンテンツＤＢ」）、２２：ネットワーク、２３ａ〜２３ｄ：端末、２４：推薦対象コンテンツ。 1: content recommendation target user selection device, 2: score generation block, 2a: operation information input means, 2b: content use log storage section, 2c: format conversion means, 2d: score storage section between user and keyword, 3: learning block 3a: Keyword-related group extraction means, 3b: Keyword-related group extraction result storage section, 3c: User-specific score calculation means by group, 3d: User-specific score storage section by group, 4: Selection block, 4a: Content keyword information input Means 4b: user-specific score calculation means 4c: recommendation target user information output means 4d: selection condition storage unit 10: operation information 11: content keyword information 12: recommendation target user information 20: content distribution control device , 21: content database (“content DB”), 22: network Work, 23a~23d: terminal, 24: recommendation target content.

Claims

A content recommendation target user selection device for selecting users who are content recommendation targets,
Each time a user uses a content, a keyword that is metadata of the content, a user operation content and user identification information for the content, and a content usage log in which each operation content is associated with each user and each keyword First means for generating and recording information in a storage device;
A score corresponding to each operation content in the content usage log information is obtained for all content usage log information recorded in the storage device, and the obtained score is accumulated for each user and each keyword, and the accumulated score is Second means for generating user-keyword score information associated with each user and each keyword and recording it in the storage device;
A third means for calculating the respective associations of the keywords to divide them into groups, generating keyword-related group information in which each keyword is associated with each group, and registering the information in a storage device;
Referring to the keyword-related group information and the user-keyword score information, for each group, the cumulative score of each user associated with each keyword in the same group is further accumulated, and the accumulated score A fourth means for generating score information for each group and for each group and for each user and storing them in the storage device;
One or more keywords that are metadata of content to be recommended are acquired, one or more groups including the acquired keywords are specified with reference to the keyword-related group information, and each user associated with the specified group A fifth means for obtaining a score for each user by referring to the group-specific score information for each user and, if there are a plurality of identified groups, for each user,
A content recommendation target user selection device, comprising: a sixth means for selecting a user whose calculated score satisfies a predetermined score condition as a user who is a recommendation target of the recommended content.

A content recommendation target user selection device for selecting users who are content recommendation targets,
Each time a user uses a content, a keyword that is metadata of the content, a user operation content and user identification information for the content, and a content usage log in which each operation content is associated with each user and each keyword First means for generating and recording information in a storage device;
A score corresponding to each operation content in the content usage log information is obtained for all content usage log information recorded in the storage device, and the obtained score is accumulated for each user and each keyword, and the accumulated score is Second means for generating user-keyword score information associated with each user and each keyword and recording it in the storage device;
A third means for calculating the respective associations of the keywords to divide them into groups, generating keyword-related group information in which each keyword is associated with each group, and registering the information in a storage device;
Referring to the keyword-related group information and the user-keyword score information, for each group, the cumulative score of each user associated with each keyword in the same group is further accumulated, and the accumulated score A fourth means for generating score information for each group and for each group and for each user and storing them in the storage device;
A fifth means for acquiring a keyword set which is metadata of content to be recommended, and specifying a keyword set group most similar to the acquired keyword set with reference to the keyword-related group information;
A sixth user who refers to the group-specific user score information and selects a user whose cumulative score associated with the specified group satisfies a predetermined score condition as a user who is a recommendation target of the recommended content. And a content recommendation target user selection device.

A content recommendation target user selection device for selecting users who are content recommendation targets,
Each time a user uses a content, a keyword that is metadata of the content, a user operation content and user identification information for the content, and a content usage log in which each operation content is associated with each user and each keyword First means for generating and recording information in a storage device;
A score corresponding to each operation content in the content usage log information is obtained for all content usage log information recorded in the storage device, and the obtained score is accumulated for each user and each keyword, and the accumulated score is Second means for generating user-keyword score information associated with each user and each keyword and recording it in the storage device;
A third means for calculating the respective associations of the keywords to divide them into groups, generating keyword-related group information in which each keyword is associated with each group, and registering the information in a storage device;
Referring to the keyword-related group information and the user-keyword score information, for each group, the cumulative score of each user associated with each keyword in the same group is further accumulated, and the accumulated score A fourth means for generating score information for each group and for each group and for each user and storing them in the storage device;
Obtain a keyword set that is metadata of content to be recommended, calculate the similarity between the acquired keyword set and the keyword set included in each group read from the keyword-related group information for each group,
Using the calculated similarity for each group, a weighting operation is performed on each score associated with each user for each group in the group-specific user score information, and the sum of the weighted scores for each user A fifth means for obtaining
A content recommendation target user selection device comprising: a sixth means for selecting a user whose total score satisfies a predetermined score condition as a user who is a recommendation target of the recommended content.

The content recommendation target user selection device according to any one of claims 1 to 3, wherein the third means reads the user / keyword score information and uses the user / keyword score information. perform one of the keywords of the clustering operations or data mining association rules extraction, the content recommendation target user selection apparatus characterized by performing a grouping for each keyword.

The content recommendation target user selection device according to any one of claims 1 to 4,
The fourth means is
Read each user included in the user-keyword score information and each group included in the keyword-related group information, and generate a table in which each is two-dimensionally arranged,
Read the score of each user for each keyword sequentially with reference to the user-keyword score information, and each time the read keyword group is identified with reference to the keyword-related group information,
A content recommendation target user selection device, wherein the score information for each group user is generated by adding the score of each read user in association with the specified group and each read user in the table.

6. The content recommendation target user selection device according to claim 1, wherein the first means associates a keyword, which is metadata of content used by the user, with the content in advance. obtained from granted keyword, even properly, the content recommendation target user selection apparatus characterized by obtaining a description that is previously assigned to the content by morphological analysis.

The program for functioning a computer as each means in the content recommendation object user selection apparatus in any one of Claims 1-6.

A content recommendation system for recommending content to a user via a computer network,
The content recommendation target user selection device according to any one of claims 1 to 6, comprising:
A content recommendation system, wherein a user who recommends the content is selected by the content recommendation target user selection device, and the content is recommended for the selected user.

A content recommendation target user selection method for an apparatus that performs selection of a user to be recommended for content distributed via a network by programmed computer processing,
As a process execution procedure of the programmed computer,
The content recommendation object user selection method characterized by including the process procedure which each means in the content recommendation object user selection apparatus in any one of Claims 1-6 performs .