JP4388142B2

JP4388142B2 - Information processing system and recording medium storing program for causing computer to perform processing in this system

Info

Publication number: JP4388142B2
Application number: JP30687196A
Authority: JP
Inventors: 徹大川; 泰雄佐藤; 隆一松倉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1995-11-22
Filing date: 1996-11-18
Publication date: 2009-12-24
Anticipated expiration: 2016-11-18
Also published as: JPH09204444A

Description

【０００１】
【発明の技術分野】
本発明は、画像データ、音声データ等のコード化されない非コード化データを含む文書を処理する情報処理システムに係り、詳しくは、このような非コード化データに基づいた検索処理を可能とする情報処理システムに関する。
【０００２】
【従来の技術】
文書の作成、編集、検索等の処理を行なう情報処理システムでは、文字、記号列等のコード化されたコード化データ、及び画像、音声等のコード化されない非コード化データを処理の対象としている。このようなシステムでの検索処理は、ある文書においてその文書を構成するコード化された文字、記号列からある特定の文字、記号（列）を検索したり、蓄積した文書群からある特定のコード化された文字、記号（列）（キーワード）を含む文書を検索するものである。
【０００３】
【発明が解決しようとする課題】
しかし、従来の情報処理システムでは、画像データや音声データ等の非コード化データに基づいた検索処理を行なうことができない。例えば、ある文書から特定の画像、音声を検索したり、また、蓄積された文書群からある特定の画像や音声を含む文書を検索することができない。
【０００４】
そこで、本発明の課題は、画像データや音声データ等の非コード化データに基づいた検索処理を行なうことのできる情報処理システムを提供することである。
【０００５】
【課題を解決するための手段】
上記課題を解決するため、本発明は、コード化されていない音声データを含む文書を処理する情報処理システムにおいて、検索指令がなされた場合に、指定された文書に含まれる音声データを抽出する抽出手段と、前記抽出手段にて抽出された音声データから前記音声データの特徴を表す主周波数成分を含むラベル情報を生成するラベル情報生成手段と、前記検索指令と供に提供される検索キーとしてのラベル情報である主周波数成分情報と前記ラベル情報生成手段にて生成されたラベル情報である主周波数成分情報とが実質的に同一であるか否かを判定する判定手段とを有し、前記判定手段が前記両ラベル情報の主周波数成分情報が実質的に同一であると判定した場合には、当該判定結果に基づいた検索結果が得られるように構成される。
【００１２】
このような情報処理システムでは、検索指令がなされた場合に、指定された文書から音声データが抽出され、その抽出された音声データから音声データの特徴を表す主周波数成分を含むラベル情報が生成される。そして、その生成されたラベル情報が検索キーとして提供されたラベル情報と実質的に同一であると判定されると、その判定結果に基づいた検索結果が得られる。
【００１３】
この判定結果に基づいた検索結果とは、その判定結果から合理的に得られる検索結果である。例えば、検索キーとして提供されたラベル情報と実質的に同一となるラベル情報の元となる音声データや、その音声データを含む当該指定された文書が検索結果として得られる。
【００１６】
更に、文書の所定の領域に記述した音声データに基づいて分類された文書を検索することができるという観点から、本発明は、上記抽出手段が、文書の予め定めた領域に記述された音声データを抽出するように構成される。
【００１７】
このような情報処理システムでは、常に、文書の予め定めた領域から音声データが抽出される。その抽出された音声データからラベル情報が生成される。そして、そのラベル情報が検索キーとして提供されるラベル情報と実質的に同一の場合、その文書を検索結果として得ることができる。従って、同じ分類に属する文書の当該予め定めた領域に同じ音声データを記述しておけば、検索結果として、当該同じ分類に属する文書を得ることができる。
【００１９】
【発明の実施の形態】
以下、本発明の実施の形態を図面に基づいて説明する。
本発明の実施の一形態に係る情報処理システムは、コンピュータシステムにて構築され、その機能的な構成は、例えば、図１に示すようになる。この情報処理システムでは、画像データ及び音声データの少なくとも一方で構成されるメディアデータを含む文書が処理（作成、編集、検索等）される。図１において、情報処理システム１は、文書管理ユニット１０、管理ユニット１１、表示ユニット１２、音声出力ユニット１３、入力ユニット１４、入出力制御ユニット１５、特徴抽出ユニット１６、圧縮ユニット１７、表示制御ユニット１８及び検索ユニット１９を有している。
【００２０】
文書管理ユニット１０は、当該システムにて扱われる文書（メディアデータを含む）を管理する。管理ユニット１１は、文書管理ユニット１０にて管理される文書と後述するようなラベル情報との関係を管理する。表示ユニット１２は、文書管理ユニット１０によって管理される文書等を表示する。音声出力ユニット１３は、文書管理ユニット１０によって管理される文書に含まれるメディアデータが音声データにて構成される場合、その音声データに基づいた音声を出力する。入力ユニット１４は、メディアデータを含む文書や、検索キーとなるデータを当該システムに入力する。
【００２１】
入出力制御ユニット１５は、表示ユニット１２、音声出力ユニット１３及び入力ユニット１４との間のインタフェース処理を行なう。文書管理ユニット１０にて管理される文書が入出力制御ユニット１５を介して表示ユニット１２に提供され、該文書が表示ユニット１２に表示される。文書管理ユニット１０にて管理される文書に含まれる音声データ（メディアデータ）が入出力制御ユニット１５を介して音声出力ユニット１３に提供され、該音声データに基づいた音声が音声出力ユニット１３から出力される。また、入力ユニット１４によって入力された文書が入出力制御ユニット１５を介して文書管理ユニット１０に提供され、その提供された文書が文書管理ユニット１０にて管理される。
【００２２】
特徴抽出ユニット１６は、文書管理ユニット１０にて管理される文書からメディアデータを抽出し、そのメディアデータから更にそのメディアデータの特徴を表す特徴データを抽出する。その特徴データがラベル情報として管理ユニット１１に提供される。例えば、あるパターンを有するメディアデータの場合、そのパターンの持つ周波数成分量が特徴データとして算出される。また、例えば、メディアデータが画像データにて構成される場合、その画像に含まれる画像の塊（文字、記号等）の数で定義されるブロック数が特徴データとして抽出される。更にまた、例えば、メディアデータが音声データにて構成される場合、その音声データの主周波数成分値（例えば、８ＫＨｚ等）が特徴データとして抽出される。
【００２３】
文書管理ユニット１０にて管理される文書から抽出されるメディアデータが画像データにて構成される場合、圧縮ユニット１７は、その抽出された画像データの一部または全部を縮小（所定の大きさに正規化）した縮小データを生成し、または、画像データを単純化して表す骨組みデータを算出する。この骨組みデータは、例えば、画像データを単純化した、三角形、四角形、丸等を表す。そして、圧縮ユニット１７にて生成または算出された縮小データまたは骨組みデータは、ラベル情報として管理ユニット１１に提供される。
【００２４】
上記特徴抽出ユニット１６や圧縮ユニット１７からのラベル情報の提供を受ける管理ユニット１１は、そのラベル情報の元となるメディアデータを含む文書と当該ラベル情報との対応関係を管理する。即ち、ラベル情報を指定すると、管理ユニット１１にて管理される対応関係に基づいて、その指定されたラベル情報に対応する文書が特定される。
【００２５】
表示制御ユニット１８は、管理ユニット１１にて各文書との関係が管理されるラベル情報の一覧表を作成し、そのラベル情報の一覧表を表示ユニット１２に表示させる。この表示ユニット１２に表示されたラベル情報の一覧表は、検索処理において検索キーを選択するために用いられる。
【００２６】
検索ユニット１９は、入力ユニット１４から入力される検索キーに対応した文書を文書管理ユニット１０にて管理される文書群から検索する。具体的には、検索キーと実質的に同一なラベル情報を管理ユニット１１にて文書との対応関係が管理されているラベル情報から検索し、検索されたラベル情報と対応関係のある文書を特定する。そして、その特定された文書が検索結果として得られる。
【００２７】
上述したように、表示制御ユニット１８の制御に基づいて表示ユニット１２に管理ユニット１１において文書との対応関係が管理されるラベル情報の一覧表を表示させてもよい。この場合、入力ユニット１４によって、その一覧表から１つのラベル情報を選択する操作を行なうと、そのラベル情報が検索ユニット１９に提供される。検索ユニット１９は、この提供されたラベル情報を検索キーとして上記と同様の処理を行なう。
【００２８】
上記のように文書管理ユニット１０にて管理される文書とラベル情報との対応関係を管理する管理ユニット１１を特に設けなくてもよい。この場合、検索ユニット１９は、文書管理ユニット１０にて管理される文書を特定し、その特定された文書から抽出されるメディアデータから特徴抽出ユニット１６または圧縮ユニット１７にて生成されるラベル情報を得る。そして、検索ユニット１９は、更に、このラベル情報と入力された検索キーとが実質的に同一であるか否かを判定する。
【００２９】
これらが実質的に同一である場合、検索ユニット１９は、このラベル情報の元となるメディアデータを検索結果として出力することができる。この処理は、特定された文書から検索キーに一致するメディアデータを検索する処理に対応する。また、このラベル情報の元となるメディアデータを含む文書を検索結果として出力することができる。この処理は、検索キーに一致するラベル情報の元となるメディアデータを含む文書を文書管理ユニット１０にて管理される文書の中から検索する処理に対応する。
【００３０】
上述したような機能を有する情報処理システムによれば、コード化されていないメディアデータの特徴を表すラベル情報が該メディアデータを含む文書と対応付けられるので、該ラベル情報に基づいた検索処理が可能となる。その結果、該ラベル情報の元となるメディアデータ（非コード化データ）に基づいた検索処理が実質的に可能となる。
【００３１】
次に、更に具体的なシステムについて説明する。
図２は、情報処理システムのハードウエア構成を示す。このシステムは、例えば、手書き入力可能な携帯用コンピュータ端末装置内に構成される。図２において、システムは、それぞれバスに接続されたＣＰＵ（中央演算処理ユニット）２０、ＲＯＭ（読み出し専用メモリ）２１、メモリユニット２２、手書き入力装置２３、キーボード装置２４、表示装置２５及びファイル格納ユニット２６を有している。
【００３２】
ＲＯＭ２１には、種々のプログラムが格納されている。ＲＯＭ２１内に格納されたプログラムには、本発明にかかるシステムでの処理を行うためのプログラムが含まれている。即ち、このＲＯＭ２１は、本発明に係る記録媒体に対応する。ＣＰＵ２０は、ＲＯＭ２１に格納されたプログラムに従って、文書作成処理、文書編集処理、文書管理処理、文書検索処理等の種々の処理を実行する。
【００３３】
メモリユニット２２は、ＣＰＵ２０での処理にて得られたデータを格納したり、作業メモリ等として使用される。手書き入力装置２３は、タッチペンを用いて手書き線画像や手書き文字等の入力に使用される。キーボード装置２４は、キー操作によって文字、記号のコード入力を行うために使用される。表示装置２５は、例えば、液晶表示パネルで構成され、システム内で作成、編集される文書等を表示する。ファイル格納ユニット２６は、例えば、ＥＥＰＲＯＭや、バックアップ付きのＲＡＭ等で構成され、システム内で作成された文書、テーブル等を格納する。
【００３４】
上記のようなシステムでは、手書き入力装置２３から入力される手書き線画像及び手書き文字や、キーボード装置２４からコード入力される文字、記号によって文書が作成、編集される。そして、更に、作成、編集された文書の特定領域に手書き入力装置２３を用いてマークが記述される。文書の特定領域は、例えば、図３（ａ）に示すように文書の先頭部分に予め定められる。このマークは、例えば、文書の大まかな分類（重要度のランク等）のために用いることができ、図５に示すように、１個の○印（５図（ａ））、２個の○印（５図（ｂ））、３個の○印（５図（ｃ））等で表される。
【００３５】
上記のようにして、文書の作成、編集及びマークの記入が終了すると、ＣＰＵ２０は、図４の手順に従って文書（画像データを含み、以下、画像データ文書という）の保存処理を行なう。
図４において、ＣＰＵ２０は、画像データ文書の特定領域にある画像データを取り出し（Ｓ１）、その表示画像が所定の大きさとなるように、その画像データの正規化処理を行なう（Ｓ２）。そして、ＣＰＵ２０は、更に、正規化された画像データから特徴を抽出する（Ｓ３）（図１に示す特徴抽出手段１６に対応）。例えば、正規化された画像データで表される画像に含まれる塊の数、即ち、ブロック数が特徴として抽出される。
【００３６】
図５（ａ）、（ｂ）、（ｃ）でそれぞれ示されるマークが記述された場合、抽出されるブロック数は、それぞれ「１」、「２」、「３」である。そして、ＣＰＵ２０は、そのブロック数を当該画像データ文書に対応したラベル情報として生成する（Ｓ４）。その後、ＣＰＵ２０は、作成、編集した画像データ文書をファイル格納ユニット２６に格納する（Ｓ５）。画像データ文書（文書名）と格納位置との関係を表すテーブルがファイル格納ユニット２６内に構築され、両者の関係がシステム内で管理される（図１に示す文書管理手段１０に対応）。
【００３７】
また、例えば、図７に示すように、ラベル情報（この例の場合、ブロック数（ａ、ｂ、．．．））と画像データ文書（文書名、ＡＡＡ、ＢＢＢ等）との関係を記述したラベル管理テーブルがファイル格納ユニット２６内に構築されている。上記のようにラベル情報が作成されると、その作成されたラベル情報が当該作成、編集された画像データ文書に対応するように、ラベル管理テーブル内に記述される（Ｓ６）。このラベル管理テーブルを用いて、画像データ文書とラベル情報との関係が管理される（図１に示す管理手段１１に対応）。
【００３８】
上記のように、画像データ文書の作成、編集がなされる毎に、その特定領域に記述された画像データ（マーク）に基づいてラベル情報が作成され、当該画像データ文書がファイル格納ユニット２６に格納されると供に、当該画像データ文書とラベル情報との関係がラベル管理テーブルに記述される。
【００３９】
ユーザが、例えば、最も重要度の高い文書と決めた、３個の○印が付された文書（図５（ｃ）参照）を検索するため、検索キーとして、ブロック数”３”をキーボード装置２４または、手書き入力装置２３から入力すると、ＣＰＵ２０は、例えば、図８に示す手順に従って検索処理を実行する（図１に示す検索手段１９に対応）。
【００４０】
ＣＰＵ２０は、内部係数等の初期化を行なった後に（Ｓ１１）、入力された検索キー（ブロック数”３”）を取得する（Ｓ１２）。更に、ＣＰＵ２０は、”０”に初期化されたカウンタ値ｉをインクリメント（ｉ＝１）して（Ｓ１３）、ｉ番目（１番目）の文書に対応するラベル情報を上記ラベル管理テーブル（図７参照）から読みだす（Ｓ１４）。
【００４１】
そして、ＣＰＵ２０は、取得した検索キーと読みだしたラベル情報とを比較し（Ｓ１５）、それらが一致しているか否かを判定する（Ｓ１６）。それらが一致している場合、ＣＰＵ２０は、そのｉ番目（１番目）の画像データ文書のＩＤ（文書番号）を所定のレジスタにセットする（Ｓ１７）。検索キーと読みだしたラベル情報が一致しなかった場合、または、画像データ文書のＩＤをレジスタにセットした後に、ＣＰＵ２０は、カウンタ値ｉがファイル格納ユニット２６に格納される全ての文書数ｉ₀に達したか否か（ｉ≧ｉ₀）を判定し（Ｓ１８）、達していない場合、次の文書に対応するラベル情報と検索キーとの比較を行なう。
【００４２】
以後、カウンタ値ｉが全文書数ｉ₀に達するまで、上述した処理を繰り返し、カウンタ値ｉが全文書数ｉ₀に達すると（全文書に対応するラベル情報と検索キーとの比較が終わると）、ＣＰＵ２０は、その時点で、レジスタにセットされている画像データ文書のＩＤを表示装置２５に一覧表として表示させる（Ｓ１９）。即ち、表示装置２５には、例えば、検索キーで指定されるブロック数”３”の文書（特定領域に３つの○印が記述された文書）の一覧表が表示される。
【００４３】
ユーザは、その一覧表内の文書群から１つの文書を、手書き入力装置２３を用いて指定すると、その指定された文書がファイル格納ユニット２６から読みだされて、表示装置２５に表示される。
上記のようなシステムでは、文書に含まれる画像データ（手書き○印）の特徴からラベル情報が作成され、そのラベル情報と文書との対応関係が管理される。そして、検索キーとしてラベル情報が提供された時に、管理されている対応関係に基づいて文書が検索される。その結果、画像データ（手書き○印）に基づいた文書の検索処理が可能となる。
【００４４】
なお、取得した画像データからブロック数を特徴として抽出する場合、塊が抽出できるものであれば、同一のシステムにおいて、種々の画像を用いることができる。例えば、図６（ａ）、（ｂ）、（ｃ）に示すように、１個の△印、２個の△印、３個の△印を用いることができる。この場合、夫々のブロック数は、○印の場合と同様に、「１」、「２」、「３」である。
【００４５】
また、次のような画像データの特徴を抽出してラベル情報を作成することも可能である。即ち、特定領域の画像データに二次元フーリエ解析を施して、その時に求まる周波数成分量や、特定領域の画像データの持つ色成分等を画像データの特徴として抽出し、この特徴に基づいてラベル情報を作成することができる。
【００４６】
上記例では、取得した画像データから特徴を抽出してラベル情報を作成したが、画像データを縮小等して得られる縮小画像をラベル情報として用いることができる。
特定領域に記述される画像として、例えば、図９に示すような○、×、△、□が用いられる例を説明する。
【００４７】
文書の作成、編集の終了時に、上記４種類のいずれかのマークが手書き文字入力装置２３によって当該文書の特定領域（先頭領域）に記入される。この場合、ＣＰＵ２０は、実質的に図４に示す手順を同様の手順に従って、処理を実行する。即ち、特定領域の画像データ（○、×、△、□のいずれか）を取り出した後（Ｓ１）に、その画像データの正規化処理を実行して（Ｓ２）、所定の大きさの画像データ（一般に縮小画像データ）を生成する（サムネール処理）（図１に示す圧縮ユニット１７に対応）。そして、その正規化された画像データに基づいてラベル画像情報が作成される（Ｓ４）。作成、編集された文書は、ファイル格納ユニット２６に格納される（Ｓ５）。
【００４８】
ラベル画像情報と文書との関係は、次のようにして管理される。
作成されたラベル画像情報（縮小画像データ）そのものがファイル格納ユニット２６に格納される。そして、このラベル画像情報の格納位置（ポインタ値）が、例えば、図１０に示すように、ラベル画像情報とその格納位置との関係を表すラベル格納位置テーブルに記述される。このラベル格納位置テーブルは、ファイル格納ユニット２６に構築されている。
【００４９】
上記のように、ラベル画像情報の格納位置がラベル格納位置テーブルに記述された後に、更に、作成、編集された文書（ＡＡＡ、ＢＢＢ等）とラベル画像情報の関係が、例えば、図１１に示すようなラベル管理テーブルに記述される（図４に示すＳ６）。
【００５０】
上記のようにしてラベル画像情報との対応が付けられながらファイル格納ユニット２６内に蓄積された文書の検索処理は、例えば、図１２に示す手順に従って行なわれる。
ユーザが、例えば、○、×、△、□のうちのいずれかのマークを付けた文書を検索するために、手書き入力装置２３を用いてそのマークを検索キーとして手書き入力すると、ＣＰＵ２０は、各係数値の初期化を行なった（Ｓ２１）後に、入力されたマークの画像データを取得する（Ｓ２２）。そして、ＣＰＵ２０は、取得した画像データの正規化処理を行ない（Ｓ２３）、その結果得られた正規化画像データから検索すべきラベル画像情報に対応した検索データを生成する（Ｓ２４）。
【００５１】
このように検索データが生成されると、ＣＰＵ２０は、ラベル格納位置テーブル（図１０参照）を参照して、ラベル画像情報をファイル格納ユニット２６から読み出す（Ｓ２５）。そして、ＣＰＵ２０は、上記のように生成された検索データと読み出したラベル画像情報を比較し（Ｓ２６）、それらが一致するか否かを判定する（Ｓ２７）。この比較及び一致判定は、一般的な画像認識処理により行なわれ、検索データに対して所定の類似度を有する（実質的に同一な）ラベル画像情報は、当該検索データに一致するラベル画像情報と判定される。
【００５２】
一致しない場合、ラベル格納位置テーブルにて管理されるラベル画像情報について、上記と同様に検索データとの比較及び一致判定の処理が繰り返し行なわれる（Ｓ３６、Ｓ２５、Ｓ２６、Ｓ２７）。そして、検索データに一致するラベル画像情報があると、ＣＰＵ２０は、そのラベル画像情報のＩＤを検索すべきラベル画像情報（検索ラベル）のＩＤとして所定のレジスタにセットする（Ｓ２８）。
【００５３】
その後、ＣＰＵ２０は、初期化されたカウンタ値ｉ（ｉ＝０）をインクリメント（ｉ＝１）して（Ｓ２９）、ｉ番目（１番目）の文書に対応するラベル画像情報のＩＤを上記ラベル管理テーブル（図１１参照）から読みだす（Ｓ３０）。そして、ＣＰＵ２０は、上記所定のレジスタにセットした検索ラベルのＩＤと読みだしたラベル画像情報のＩＤとを比較し（Ｓ３１）、それらが一致しているか否かを判定する（Ｓ３２）。それらが一致している場合、ＣＰＵ２０は、そのｉ番目（１番目）の画像データ文書のＩＤ（文書番号、文書名等）を所定のレジスタにセットする（Ｓ３３）。検索ラベルのＩＤと読みだしたラベル画像情報のＩＤが一致しなかった場合、または、画像データ文書のＩＤをレジスタにセットした後に、ＣＰＵ２０は、カウンタ値ｉがファイル格納ユニット２６に格納される全ての文書数ｉ₀に達したか否か（ｉ≧ｉ₀）を判定し（Ｓ３４）、達していない場合、次の文書に対応するラベル画像情報のＩＤと検索ラベルのＩＤとの比較を行なう。
【００５４】
以後、カウンタ値ｉが全文書数ｉ₀に達するまで、上述した処理を繰り返し、カウンタ値ｉが全文書数ｉ₀に達すると（全文書に対応するラベル画像情報と検索ラベルとの比較が終わると）、ＣＰＵ２０は、その時点で、レジスタにセットされている画像データ文書のＩＤを表示装置２５に一覧表として表示させる（Ｓ３５）。即ち、表示装置２５には、例えば、○印が特定領域に記述された文書の一覧表が表示される。
【００５５】
ユーザは、その一覧表内の文書群から１つの文書を、手書き入力装置２３を用いて指定すると、その指定された文書がファイル格納ユニット２６から読みだされて、表示装置２５に表示される。
なお、上記図１２に示す手順に従った処理において、全てのラベル画像情報が検索データに一致しなかった場合（Ｓ３６、ＹＥＳ）、ＣＰＵ２０は、該当する画像データ文書が存在しない旨のメッセージを表示装置２５に表示させた後に（Ｓ３７）、検索処理を終了する。
【００５６】
上記のようなシステムでは、文書に含まれる画像データ（手書き○印、×印、△印、□印）を縮小したラベル画像情報が作成され、そのラベル画像情報と文書との対応関係が管理される。そして、検索キーとして手書きの画像データが提供されたときに、管理されている対応関係に基づいて、その画像データに対応した文書が検索される。
【００５７】
上記の例では、検索すべき文書に記述した画像データ（マーク等）をユーザが直接入力したが、その画像データの種類が多くなると、それらの画像データをユーザが覚えておくことが難しくなる。このような場合、容易に画像データの指定を行なえるようにするため、上述したような検索処理を行なう前に、ＣＰＵ２０は、例えば、図１３に示すような手順に従った処理を行なう。
【００５８】
図１３において、ＣＰＵ２０は、ラベル格納位置テーブル（図１０参照）を参照して、各ラベル画像情報をファイル格納ユニット２６から読みだし、各ラベル画像情報（縮小画像）の一覧表を表示装置２５に表示させる（Ｓ４１）。そして、ユーザが手書き入力装置２３を用いて、表示されたラベル画像情報群から１つのラベル画像情報を指定する操作を行なうと、ＣＰＵ２０は、その指定されたラベル画像情報を取得する（Ｓ４２）。以後、ＣＰＵ２０は、その取得したラベル画像情報を検索データとして、図１２に示す手順と同様の手順に従った検索処理Ｐ₁₀₀を実行する。
【００５９】
また、上述した各例においては、画像データ文書の先頭部分を特定領域として、その部分に記述された画像データからラベル情報を作成したが、これに限られることはない。例えば、図３（ｂ）に示すように、画像データ文書の先頭の行や、図３（ｃ）に示すように、画像データ文書に含まれる画像実データの先頭部分を特定領域として予め設定してもよい。
【００６０】
更に、上述した各例においては、ラベル情報の元となる画像は、ユーザが例えば、文書の分類に使用するためにわざわざ記述したマークであったが、これに限られることはない。例えば、ユーザが作成した文書そのものの所定領域に含まれる画像データ（イメージとしてメモリユニット２２に展開された文書データの所定領域部分）を縮小したり、文書全体（１頁分）を表す画像データ（イメージとしてメモリユニット２２に展開された文書データ）を縮小することで、ラベル情報を作成することも可能である。
【００６１】
このような場合、作成される文書に応じて種々のラベル情報が作成されるので、文書検索に際して、ラベル情報に対応した検索データを指定することが難しい。それを解決するために、検索処理の前に、前述したような、図１３に示す手順に従った処理が実行される。即ち、作成されたラベル情報（縮小画像）の一覧表が表示装置２５に表示され、その一覧表のラベル情報群から選択されたラベル情報を検索データとして、対応する文書の検索が行なわれる。
【００６２】
上記各例においては、一つの文書に対応して一つのラベル情報が作成されたが、これに限られない。一つの文書に対して複数のラベル情報を作成し、それらのラベル情報を一つの文書に対応付けられるように管理することも可能である。
例えば、図１４に示すように、作成した画像データ文書（ＡＡＡ）において、画像データ部分α₁、α₂、．．．α_nを指定すると、指定された画像データ部分の縮小、正規化処理（サムネール処理）がなされて、対応するラベル情報ａ₁、ａ₂、．．．、ａ_nが作成される。この作成されたラベル情報は、作成された文書と供にファイル格納ユニット２６に格納される。そして、例えば、図１５に示すようなラベル管理テーブルに、画像データ文書（ＡＡＡ、ＢＢＢ等）とラベル情報との関係が記述される。
【００６３】
この場合も、前記と同様の処理に従って（図１２参照）、あるラベル情報が検索データとして指定されたときに、該ラベル管理テーブルを参照して、対応する画像データ文書が検索される。
上記各例においては、ラベル情報を用いて文書検索を行なうものであった。次に、各文書内の画像データの検索処理について説明する。
【００６４】
このような検索処理は、例えば、図１６に示す手順に従って行なわれる。
図１６において、ユーザが手書き入力装置２３を用いて検索キーとなる画像データ（α）を入力すると、ＣＰＵ２０は、種々の係数の初期化を行なった後に（Ｓ５１）、入力された画像データ（α）を取得する（Ｓ５２）。そして、ＣＰＵ２０は、取得した画像データ（α）の正規化処理を行ない（Ｓ５３）、その結果得られた正規化画像データ（縮小画像）から検索すべきラベル画像情報に対応した検索データを生成する（Ｓ５４）。
【００６５】
その後、ＣＰＵ２０は、ファイル格納ユニット２６から画像データ文書を読みだし、メモリユニット２２に展開する（Ｓ５５）。そして、ＣＰＵ２０は、メモリユニット２２に展開された画像データ文書を走査して（Ｓ５６）画像データ部分が存在するか否かを判定する（Ｓ５７）。読みだした文書の最終端に達する前に（Ｓ５８）、画像データ部分が存在することを判定すると、ＣＰＵ２０は、その画像データ部分を抽出する（Ｓ５９）。
【００６６】
このように画像データ部分が抽出されると、ＣＰＵ２０は、その抽出された画像データ部分の正規化処理を行ない（Ｓ６０）、その結果得られる正規化画像データからラベル情報（ａ）（縮小画像）を作成する（Ｓ６１）。そして、ＣＰＵ２０は、上記のようにして手書き入力画像から生成した検索データとラベル情報（ａ）が一致するか否かを判定する（Ｓ６２）。この一致判定の処理では、前述した例（図１２参照）と同様に、検索データと所定の類似度を有する（実質的に同一）ラベル情報は、当該検索データと一致するものと判定される。
【００６７】
当該検索データとラベル情報（ａ）が一致していると判定した場合、ＣＰＵ２０は、抽出された画像データ部分が検索データに該当することを表示装置２５に表示させる（Ｓ６４）。その後、ＣＰＵ２０は、処理中断のための操作入力があったか否かを判定する（Ｓ６５）。そのような処理中断のための操作入力がなければ、再度、画像データ文書の走査を開始して、画像データ部分を抽出するための処理及び抽出された画像データ部分が検索データに一致するか否かの判定処理を続行する（Ｓ５６乃至Ｓ６５）。
【００６８】
なお、抽出された画像データ部分が検索データと一致しなかった場合、そのまま、画像データ文書の走査を続行する（Ｓ６３→Ｓ５６）。
上記の処理を繰り返し、読みだした文書の最終端まで走査が終了すると、ＣＰＵ２０は、更に、前述した処理の過程で、検索データに一致する画像データ部分が存在したかどうかを判別し（Ｓ６６）、一致する画像データが読みだした文書内に存在しなかった場合には、ＣＰＵ２０は、該当文書データ部分が存在しなかった旨のメッセージを表示装置２５に表示させる（Ｓ６７）。
【００６９】
その後、ＣＰＵ２０は、次の文書についての処理の要求がなされているか否かを判定し、そのような要求がなされている場合には、次の画像データ文書を、新たな検索対象の文書として、ファイル格納ユニット２６から読み出してメモリユニット２２に展開する（Ｓ６８→Ｓ５５）。以後、新たな画像データ文書に対して、上述したのと同様の処理が実行される。
【００７０】
なお、抽出された画像データ部分が検索データに該当することが表示装置２５に表示された後に、中断操作がなされたことを判定すると、ＣＰＵ２０は、当該文書についての検索処理を終了し、次の文書についての検索処理を行なうべきか否かの判定を行なう（Ｓ６８）。
【００７１】
上記のような検索処理によれば、手書き入力装置２３から入力された画像データと一致する画像データが各画像データ文書から検索される。
上述したラベル情報に基づいて文書検索処理を行なうシステムでは、ラベル情報と画像データ文書との関係を記述したラベル管理テーブルを用いて検索処理を行なっていたが、これに限定されない。特にラベル管理テーブルを用いることなしに、検索データが与えられる毎に、図１６に示す手順と同様の手順に従って、文書の所定領域の画像データを読み出して、この読みだした画像データと検索データとの比較処理、一致判定処理を行なってもよい。
【００７２】
上述したように、例えば、図１７に示すように、画像データが「富士山」を表す場合、図１８（ａ）に示すように、その「富士山」の縮小データをラベル情報とするほか、図１８（ｂ）に示すように、「富士山」の骨組みデータである台形の線画をラベル情報として登録する（ラベル管理テーブルに記述する）ことも可能である。
【００７３】
また、上述した各実施例は、画像データからラベル情報を作成するものであったが、これに限定されない。音声データや、音声データと画像データとの複合データ等からラベル情報を作成することもできる。
また更に、本発明に係るシステムが携帯用コンピュータ端末装置内に構築した例を説明したが、これに限られない。通常のデスクトップ型、ラップトップ型のパーソナルコンピュータ内に構築することもできる。この場合、ファイル格納ユニット２６は、例えば、ハードディスク装置で構成される。そして、各種プログラムは、ＣＤ−ＲＯＭ、フロッピーディスク、各種メモリカード等の記録媒体によって、当該システムに提供することができる。
【００７４】
なお、上記各例において、図４に示すステップＳ１の処理は抽出手段（抽出プログラム手段）に対応し、ステップＳ２乃至ステップＳ４での処理はラベル情報生成手段（ラベル情報生成プログラム手段）に対応し、図７、図１０及び図１１、更に、図１５に夫々示されるテーブルは、管理手段に対応する。図８及び図１２の各手順に従った処理は、検索手段（検索プログラム手段）に対応する。
【００７５】
また、図１６に示すステップＳ５６乃至Ｓ５９の処理は抽出手段（抽出プログラム手段）に対応し、図１６に示すステップＳ６０及びＳ６１の処理はラベル情報生成手段（ラベル情報生成プログラム手段）に対応し、図１６に示すステップＳ６２及びＳ６３の処理は判定手段（判定プログラム手段）に対応する。
【００７６】
更に、図４に示すステップＳ６の処理は、文書とラベル情報の対応関係を管理手段に提供するためのプログラム手段に対応し、図１３に示す手順の処理はラベル一覧表示制御手段に対応する。
【００７７】
【発明の効果】
以上、説明してきたように、請求項１乃至２記載の本願発明によれば、文書の一部の音声データからラベル情報を作成し、そのラベル情報と文書との対応関係に基づいて文書検索等を行なうようにしたため、音声データに基づいた検索処理を行なうことができるシステムが実現できる。
【図面の簡単な説明】
【図１】本発明の実施の形態に係るシステムの機能的な構成を示すブロック図である。
【図２】本発明の実施の形態に係るシステムのハードウエア構成の一例を示すブロック図である。
【図３】画像データを抽出すべき画像データ文書の特定領域の例を示す図である。
【図４】作成した文書を保存する際の処理手順の一例を示すフローチャートである。
【図５】文書の特定領域に記述すべき画像の例を示す図である。
【図６】文書の特定領域に記述すべき画像の他の例を示す図である。
【図７】文書とラベル情報との関係を記述したラベル管理テーブルを示す図である。
【図８】文書検索処理の手順の例を示すフローチャートである。
【図９】文書の特定領域に記述すべき画像の更に他の例を示す図である。
【図１０】ラベル情報とその格納位置との関係を記述したラベル格納位置テーブルを示す図である。
【図１１】ラベル情報と文書との対応関係を記述したラベル管理テーブルを示す図である。
【図１２】文書検索処理の手順の一例を示すフローチャートである。
【図１３】ラベル情報の一覧表を表示するための処理の手順の一例を示すフローチャートである。
【図１４】作成、編集された画像データ文書内の画像データとラベル情報の対応を示す図である。
【図１５】文書とラベル情報との対応関係を記述したラベル管理テーブルを示す図である。
【図１６】画像データの検索処理の手順の一例を示すフローチャートである。
【図１７】画像データの一例を示す図である。
【図１８】ラベル情報の例を示す図である。
【符号の説明】
１情報処理システム
１０文書管理ユニット
１１管理ユニット
１２表示ユニット
１３音声出力ユニット
１４入力ユニット
１５入出力制御ユニット
１６特徴抽出ユニット
１７圧縮ユニット
１８表示制御ユニット
１９検索ユニット
２０ＣＰＵ
２１ＲＯＭ
２２メモリユニット
２３手書き入力装置
２４キーボード
２５表示装置
２６ファイル格納ユニット
[0001]
[Technical field of the invention]
The present invention relates to an information processing system that processes documents containing non-coded data, such as image data and audio data, and more specifically to an information processing system that enables search processing based on such non-coded data.
[0002]
[Prior art]
An information processing system that performs processing such as document creation, editing, and search handles both coded data, such as characters and symbol strings, and non-coded data, such as images and audio. Search processing in such a system either searches the coded characters and symbol strings that make up a given document for a specific character or symbol (string), or searches an accumulated group of documents for documents containing a specific coded character or symbol (string), that is, a keyword.
[0003]
[Problems to be solved by the invention]
However, the conventional information processing system cannot perform a search process based on uncoded data such as image data and audio data. For example, it is not possible to search for a specific image or sound from a certain document, or to search for a document containing a specific image or sound from an accumulated document group.
[0004]
Therefore, an object of the present invention is to provide an information processing system capable of performing search processing based on non-coded data such as image data and audio data.
[0005]
[Means for Solving the Problems]
To solve the above problem, the present invention provides an information processing system for processing a document containing uncoded audio data, comprising: extraction means for extracting, when a search command is issued, the audio data contained in a specified document; label information generating means for generating, from the audio data extracted by the extraction means, label information including a main frequency component that represents a characteristic of the audio data; and determining means for determining whether the main frequency component information that is the label information supplied as a search key together with the search command is substantially the same as the main frequency component information that is the label information generated by the label information generating means. When the determining means determines that the main frequency component information of the two pieces of label information is substantially the same, a search result based on that determination is obtained.
[0012]
In such an information processing system, when a search command is issued, audio data is extracted from the specified document, and label information including the main frequency component that represents the characteristics of that audio data is generated from the extracted audio data. When the generated label information is determined to be substantially the same as the label information provided as the search key, a search result based on that determination is obtained.
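The extract, label, and compare flow for audio data can be sketched as follows. This is a minimal illustration, not the patented implementation: the naive DFT, the function names `main_frequency_hz` and `substantially_same`, and the 50 Hz tolerance are all assumptions, since the text does not say how the main frequency component is computed or how close two labels must be to count as substantially the same.

```python
import cmath
import math


def main_frequency_hz(samples, sample_rate_hz):
    """Label generation (assumed method): frequency of the strongest non-DC DFT bin."""
    n = len(samples)
    if n == 0:
        raise ValueError("no audio samples")
    best_bin, best_magnitude = 0, -1.0
    for k in range(1, n // 2 + 1):  # skip bin 0, which only carries the DC offset
        coefficient = sum(
            s * cmath.exp(-2j * math.pi * k * t / n) for t, s in enumerate(samples)
        )
        if abs(coefficient) > best_magnitude:
            best_bin, best_magnitude = k, abs(coefficient)
    return best_bin * sample_rate_hz / n


def substantially_same(key_hz, label_hz, tolerance_hz=50.0):
    """Determination step; the tolerance value is a hypothetical choice."""
    return abs(key_hz - label_hz) <= tolerance_hz


# Hypothetical audio clip taken from a document: an 8 kHz tone sampled at 32 kHz.
clip = [math.sin(2 * math.pi * 8000 * t / 32000) for t in range(64)]
label = main_frequency_hz(clip, 32000)
```

A search key of 8000 Hz would then match this clip through `substantially_same(8000.0, label)`, and the clip or its containing document would be returned as the search result.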
[0013]
The search result based on this determination is a search result that can reasonably be derived from it. For example, the audio data from which label information substantially the same as the label information provided as the search key was generated, or the specified document containing that audio data, is obtained as the search result.
[0016]
Furthermore, so that documents classified on the basis of audio data described in a predetermined area of the document can be searched, the present invention is configured such that the extraction means extracts the audio data described in a predetermined area of the document.
[0017]
In such an information processing system, audio data is always extracted from the predetermined area of the document, and label information is generated from the extracted audio data. When that label information is substantially the same as the label information provided as the search key, the document can be obtained as a search result. Therefore, if the same audio data is described in the predetermined area of documents belonging to the same classification, the documents belonging to that classification can be obtained as the search result.
[0019]
[Detailed description of the invention]
Hereinafter, embodiments of the present invention will be described with reference to the drawings.
An information processing system according to an embodiment of the present invention is built on a computer system, and its functional configuration is, for example, as shown in FIG. 1. In this information processing system, documents containing media data composed of at least one of image data and audio data are processed (created, edited, searched, etc.). In FIG. 1, the information processing system 1 includes a document management unit 10, a management unit 11, a display unit 12, an audio output unit 13, an input unit 14, an input/output control unit 15, a feature extraction unit 16, a compression unit 17, a display control unit 18, and a search unit 19.
[0020]
The document management unit 10 manages the documents (including media data) handled in the system. The management unit 11 manages the relationship between the documents managed by the document management unit 10 and the label information described later. The display unit 12 displays documents and the like managed by the document management unit 10. When the media data included in a document managed by the document management unit 10 is composed of audio data, the audio output unit 13 outputs sound based on that audio data. The input unit 14 inputs documents containing media data, and data serving as a search key, to the system.
[0021]
The input/output control unit 15 performs interface processing among the display unit 12, the audio output unit 13, and the input unit 14. A document managed by the document management unit 10 is provided to the display unit 12 via the input/output control unit 15 and displayed on the display unit 12. Audio data (media data) included in a document managed by the document management unit 10 is provided to the audio output unit 13 via the input/output control unit 15, and sound based on that audio data is output from the audio output unit 13. A document input by the input unit 14 is provided to the document management unit 10 via the input/output control unit 15, and the provided document is managed by the document management unit 10.
[0022]
The feature extraction unit 16 extracts media data from the document managed by the document management unit 10, and further extracts feature data representing the feature of the media data from the media data. The feature data is provided to the management unit 11 as label information. For example, in the case of media data having a certain pattern, the frequency component amount that the pattern has is calculated as the feature data. For example, when the media data is composed of image data, the number of blocks defined by the number of image chunks (characters, symbols, etc.) included in the image is extracted as feature data. Furthermore, for example, when the media data is composed of audio data, the main frequency component value (for example, 8 kHz) of the audio data is extracted as the feature data.
[0023]
When the media data extracted from a document managed by the document management unit 10 is composed of image data, the compression unit 17 either generates reduced data by shrinking part or all of the extracted image data (normalizing it to a predetermined size), or calculates frame data that represents the image data in simplified form. This frame data represents, for example, a triangle, a quadrangle, a circle, or a similar shape that simplifies the image data. The reduced data or frame data generated or calculated by the compression unit 17 is provided to the management unit 11 as label information.
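The size normalization performed by the compression unit 17 can be illustrated with nearest-neighbor sampling. The sampling method, the function name `normalize_bitmap`, and the nested-list bitmap representation are assumptions made for this sketch; the text only states that the image is reduced to a predetermined size.

```python
def normalize_bitmap(bitmap, size):
    """Resample a rectangular bitmap (list of rows) to size x size pixels.

    Nearest-neighbor sampling is an assumed technique, chosen for brevity.
    """
    rows, cols = len(bitmap), len(bitmap[0])
    return [
        [bitmap[r * rows // size][c * cols // size] for c in range(size)]
        for r in range(size)
    ]


# Hypothetical 4x4 handwritten mark with ink (1) only in its top-left quadrant.
mark = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
thumbnail = normalize_bitmap(mark, 2)
```

Because every mark ends up at the same size, two thumbnails can later be compared pixel by pixel or by an image recognition routine.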
[0024]
The management unit 11, which receives the label information from the feature extraction unit 16 or the compression unit 17, manages the correspondence between that label information and the document containing the media data from which the label information was derived. That is, when label information is designated, the document corresponding to the designated label information is identified on the basis of the correspondence managed by the management unit 11.
[0025]
The display control unit 18 creates a list of label information whose relationship with each document is managed by the management unit 11 and causes the display unit 12 to display the list of label information. The list of label information displayed on the display unit 12 is used for selecting a search key in the search process.
[0026]
The search unit 19 searches the document group managed by the document management unit 10 for documents corresponding to the search key input from the input unit 14. Specifically, it searches the label information whose correspondence with documents is managed by the management unit 11 for label information substantially the same as the search key, and identifies the documents that correspond to the label information found. The identified documents are obtained as the search result.
[0027]
As described above, under the control of the display control unit 18, the display unit 12 may display a list of the label information whose correspondence with documents is managed in the management unit 11. In this case, when an operation selecting one piece of label information from the list is performed with the input unit 14, that label information is provided to the search unit 19. The search unit 19 performs the same processing as described above using the provided label information as the search key.
[0028]
The management unit 11, which manages the correspondence between the documents managed by the document management unit 10 and the label information as described above, need not be provided. In that case, the search unit 19 identifies a document managed by the document management unit 10 and obtains the label information that the feature extraction unit 16 or the compression unit 17 generates from the media data extracted from that document. The search unit 19 then determines whether this label information and the input search key are substantially the same.
[0029]
If they are substantially the same, the search unit 19 can output the media data that is the basis of this label information as a search result. This process corresponds to a process of searching for media data matching the search key from the specified document. In addition, a document including media data that is the source of the label information can be output as a search result. This processing corresponds to processing for searching a document including media data that is the source of label information matching the search key from documents managed by the document management unit 10.
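The variant without a management unit, in which labels are generated at search time, can be sketched as below. The names `search_without_table`, `make_label`, and `same`, and the dictionary layout of the document store, are hypothetical; the default exact-equality test stands in for the unspecified "substantially the same" determination.

```python
def search_without_table(documents, make_label, search_key, same=lambda a, b: a == b):
    """Generate a label for each media item on the fly and compare it with the key.

    documents: {document name: [media item, ...]} (assumed layout).
    Returns both kinds of result the text describes: matching media items
    (with their document names) and the names of documents containing a match.
    """
    media_hits, document_hits = [], []
    for name, media_items in documents.items():
        matched = [m for m in media_items if same(make_label(m), search_key)]
        if matched:
            media_hits.extend((name, m) for m in matched)
            document_hits.append(name)
    return media_hits, document_hits


# Toy data: each media item is a string of marks and its label is the mark count.
store = {"AAA": ["ooo", "x"], "BBB": ["oo"]}
media, docs = search_without_table(store, make_label=len, search_key=3)
```

This trades the storage cost of a label table for repeated label generation on every search.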
[0030]
According to the information processing system having the functions described above, label information representing the characteristics of uncoded media data is associated with the document containing that media data, so search processing based on the label information becomes possible. As a result, search processing based on the media data (non-coded data) from which the label information was derived becomes possible in practice.
[0031]
Next, a more specific system will be described.
FIG. 2 shows the hardware configuration of the information processing system. This system is built, for example, in a portable computer terminal device that accepts handwriting input. In FIG. 2, the system has a CPU (central processing unit) 20, a ROM (read-only memory) 21, a memory unit 22, a handwriting input device 23, a keyboard device 24, a display device 25, and a file storage unit 26, each connected to a bus.
[0032]
Various programs are stored in the ROM 21. The programs stored in the ROM 21 include a program for performing the processing in the system according to the present invention. That is, the ROM 21 corresponds to the recording medium according to the present invention. The CPU 20 executes various processes, such as document creation, document editing, document management, and document search, in accordance with the programs stored in the ROM 21.
[0033]
The memory unit 22 stores data obtained by processing in the CPU 20 and is used as a working memory. The handwriting input device 23 is used for inputting handwritten line images, handwritten characters, and the like using a touch pen. The keyboard device 24 is used for inputting characters and symbols by key operation. The display device 25 is composed of, for example, a liquid crystal display panel, and displays a document or the like created and edited in the system. The file storage unit 26 is composed of, for example, an EEPROM, a RAM with a backup, or the like, and stores documents, tables and the like created in the system.
[0034]
In the system described above, a document is created and edited using handwritten line images and handwritten characters entered from the handwriting input device 23 and characters and symbols entered as codes from the keyboard device 24. A mark is then written in a specific area of the created and edited document using the handwriting input device 23. The specific area is determined in advance, for example at the beginning of the document as shown in FIG. 3(a). This mark can be used, for example, for rough classification of documents (importance rank, etc.) and, as shown in FIG. 5, is represented by one circle (FIG. 5(a)), two circles (FIG. 5(b)), three circles (FIG. 5(c)), and so on.
[0035]
When document creation, editing, and mark entry are completed as described above, the CPU 20 performs storage processing for the document (which contains image data and is hereinafter referred to as an image data document) according to the procedure shown in FIG. 4.
In FIG. 4, the CPU 20 extracts the image data in the specific area of the image data document (S1) and normalizes that image data so that the displayed image has a predetermined size (S2). The CPU 20 then extracts a feature from the normalized image data (S3) (corresponding to the feature extraction means 16 shown in FIG. 1). For example, the number of separate chunks contained in the image represented by the normalized image data, that is, the number of blocks, is extracted as the feature.
[0036]
When the marks shown in FIGS. 5A, 5B, and 5C are described, the numbers of extracted blocks are “1”, “2”, and “3”, respectively. Then, the CPU 20 generates the number of blocks as label information corresponding to the image data document (S4). Thereafter, the CPU 20 stores the created and edited image data document in the file storage unit 26 (S5). A table representing the relationship between the image data document (document name) and the storage position is constructed in the file storage unit 26, and the relationship between the two is managed in the system (corresponding to the document management means 10 shown in FIG. 1).
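One plausible way to obtain the block count in step S3 is to count connected groups of ink pixels. The sketch below assumes that interpretation; the 8-neighbor connectivity rule, the `#` ink character, and the function name `count_blocks` are illustrative choices, not details given in the text.

```python
def count_blocks(bitmap):
    """Count connected groups of ink pixels ('#') in a list of equal-length strings."""
    rows = len(bitmap)
    cols = len(bitmap[0]) if rows else 0
    seen = set()
    blocks = 0
    for r in range(rows):
        for c in range(cols):
            if bitmap[r][c] != "#" or (r, c) in seen:
                continue
            blocks += 1  # found ink that belongs to no block counted so far
            seen.add((r, c))
            stack = [(r, c)]
            while stack:  # flood fill over the 8 neighbors of each pixel
                y, x = stack.pop()
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (
                            0 <= ny < rows
                            and 0 <= nx < cols
                            and bitmap[ny][nx] == "#"
                            and (ny, nx) not in seen
                        ):
                            seen.add((ny, nx))
                            stack.append((ny, nx))
    return blocks


# Hypothetical specific area holding three small ring-shaped marks.
three_marks = [
    "###..###..###",
    "#.#..#.#..#.#",
    "###..###..###",
]
label = count_blocks(three_marks)
```

A ring is a single connected stroke, so each circle contributes one block regardless of its exact shape, which is why triangles or other closed marks would give the same counts.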
[0037]
Also, as shown for example in FIG. 7, a label management table describing the relationship between label information (in this example, the number of blocks (a, b, ...)) and image data documents (document names AAA, BBB, etc.) is built in the file storage unit 26. When label information is created as described above, it is written into the label management table so that it corresponds to the created and edited image data document (S6). The relationship between image data documents and label information is managed using this label management table (corresponding to the management means 11 shown in FIG. 1).
[0038]
As described above, every time an image data document is created and edited, label information is created based on the image data (mark) described in the specific area, and the image data document is stored in the file storage unit 26. In addition, the relationship between the image data document and the label information is described in the label management table.
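A label management table of this kind can be modeled as a pair of dictionaries. This is only a sketch: the class name `LabelManagementTable`, its methods, and the choice to overwrite the old label when a document is saved again are assumptions, since the text does not describe the table's internal layout.

```python
from collections import defaultdict


class LabelManagementTable:
    """Keeps the document-to-label relationship and a reverse index for search."""

    def __init__(self):
        self._label_by_document = {}
        self._documents_by_label = defaultdict(list)

    def register(self, document_name, label):
        """Record the label for a saved document (step S6), replacing any old label."""
        old = self._label_by_document.get(document_name)
        if old is not None:
            self._documents_by_label[old].remove(document_name)
        self._label_by_document[document_name] = label
        self._documents_by_label[label].append(document_name)

    def documents_for(self, label):
        """Return the names of the documents currently associated with a label."""
        return list(self._documents_by_label.get(label, []))


table = LabelManagementTable()
table.register("AAA", 3)
table.register("BBB", 1)
table.register("CCC", 3)
table.register("BBB", 3)  # BBB was edited and its mark now has three circles
```

After these calls, `table.documents_for(3)` lists AAA, CCC, and BBB, and label 1 no longer points at any document.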
[0039]
For example, to search for documents marked with three circles (see FIG. 5(c)), which the user has designated as the most important documents, the user enters the block count "3" as a search key from the keyboard device 24 or the handwriting input device 23. The CPU 20 then executes search processing according to, for example, the procedure shown in FIG. 8 (corresponding to the search means 19 shown in FIG. 1).
[0040]
After initializing internal coefficients and the like (S11), the CPU 20 acquires the input search key (number of blocks “3”) (S12). The CPU 20 then increments the counter value i, which was initialized to “0” (i = 1) (S13), and reads the label information corresponding to the i-th (first) document from the label management table (see FIG. 7) (S14).
[0041]
Then, the CPU 20 compares the acquired search key with the read label information (S15), and determines whether or not they match (S16). If they match, the CPU 20 sets the ID (document number) of the i-th (first) image data document in a predetermined register (S17). If the acquired search key does not match the read label information, or after the ID of the image data document has been set in the register, the CPU 20 determines whether or not the counter value i has reached the total number i₀ of documents stored in the file storage unit 26 (i ≧ i₀) (S18). If it has not, the label information corresponding to the next document is compared with the search key.
[0042]
Thereafter, the same processing is repeated until the counter value i reaches the total number of documents i₀. When the counter value i reaches the total number of documents i₀ (that is, when the comparison between the label information corresponding to all documents and the search key is completed), the CPU 20 displays the IDs of the image data documents set in the register at that time on the display device 25 as a list (S19). In other words, a list of documents having the number of blocks “3” specified by the search key (documents in which three circles are described in the specific area) is displayed on the display device 25.
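The loop of FIG. 8 can be sketched as follows. This is only an illustrative model: the label management table is assumed to be a list of (document ID, number of blocks) pairs, and the function name `search_by_block_count` is hypothetical.

```python
def search_by_block_count(label_table, search_key):
    """Return the IDs of documents whose label information equals the search key.

    label_table: list of (document_id, block_count) pairs, a hypothetical
    in-memory form of the label management table of FIG. 7.
    """
    matched_ids = []          # plays the role of the "predetermined register"
    total = len(label_table)  # total number of documents, i0
    i = 0                     # counter value, initialized to 0
    while i < total:          # S18: stop once i has reached i0
        i += 1                                # S13: increment the counter
        doc_id, label = label_table[i - 1]    # S14: read the i-th label
        if label == search_key:               # S15, S16: compare and test
            matched_ids.append(doc_id)        # S17: remember the document ID
    return matched_ids        # S19: the caller displays this list

table = [("AAA", 3), ("BBB", 1), ("CCC", 3), ("DDD", 2)]
print(search_by_block_count(table, 3))  # ['AAA', 'CCC']
```

Because the comparison is a plain equality test on a small integer, the search never has to open the stored documents.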
[0043]
When the user designates one document from the document group in the list by using the handwriting input device 23, the designated document is read from the file storage unit 26 and displayed on the display device 25.
In the system as described above, label information is created from the characteristics of image data (handwritten circles) included in the document, and the correspondence between the label information and the document is managed. Then, when the label information is provided as a search key, the document is searched based on the managed correspondence. As a result, a document search process based on image data (handwritten circles) can be performed.
[0044]
When the number of blocks is extracted from the acquired image data as a feature, various images can be used in the same system as long as blocks can be extracted from them. For example, as shown in FIGS. 6A, 6B, and 6C, one Δ mark, two Δ marks, and three Δ marks can be used. In this case, the number of blocks is “1”, “2”, and “3”, respectively, as in the case of the circles.
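This passage does not say how the blocks are extracted. One plausible way, shown here only as an assumption, is to treat the specific area as a binary bitmap and count groups of connected ink pixels (8-connected component counting by flood fill). The function name `count_blocks` and the bitmap representation are hypothetical.

```python
from collections import deque

def count_blocks(bitmap):
    """Count 8-connected groups of ink pixels (value 1) in a binary bitmap."""
    rows, cols = len(bitmap), len(bitmap[0])
    seen = [[False] * cols for _ in range(rows)]
    blocks = 0
    for r in range(rows):
        for c in range(cols):
            if bitmap[r][c] != 1 or seen[r][c]:
                continue
            blocks += 1                      # found a new, unvisited block
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:                     # flood-fill the whole block
                y, x = queue.popleft()
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and bitmap[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
    return blocks

# Three small separate marks drawn side by side (toy data).
marks = [
    [0, 1, 1, 0, 0, 1, 1, 0, 0, 1, 0],
    [1, 0, 0, 1, 0, 1, 0, 1, 0, 1, 1],
    [0, 1, 1, 0, 0, 1, 1, 0, 0, 1, 0],
]
print(count_blocks(marks))  # 3
```

Under this assumption the shape of each mark does not matter, which is consistent with the statement that circles and Δ marks yield the same block counts.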
[0045]
It is also possible to create label information by extracting the following features of image data. That is, two-dimensional Fourier analysis is performed on the image data of the specific area, and the frequency component amounts obtained at that time, the color components of the image data of the specific area, and the like are extracted as the features of the image data, from which the label information can be created.
[0046]
In the above example, the label information is created by extracting features from the acquired image data. However, a reduced image obtained by reducing the image data can be used as the label information.
As images described in the specific area, an example in which the marks ◯, ×, Δ, and □ shown in FIG. 9 are used will be described.
[0047]
At the end of the creation and editing of the document, any one of the four types of marks is entered in the specific area (first area) of the document using the handwriting input device 23. In this case, the CPU 20 executes processing in accordance with substantially the same procedure as that shown in FIG. 4. That is, after the image data (one of ○, ×, Δ, □) in the specific area is taken out (S1), normalization processing of the image data is executed (S2), and image data of a predetermined size (generally, reduced image data) is generated (thumbnail processing) (corresponding to the compression unit 17 shown in FIG. 1). Then, label image information is created based on the normalized image data (S4). The created and edited document is stored in the file storage unit 26 (S5).
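The specification does not detail the normalization step. As a rough sketch under stated assumptions, the mark can be cropped to the bounding box of its ink pixels and resampled to a fixed size by nearest-neighbour sampling. The function name `normalize`, the binary bitmap representation, and the default size are all hypothetical.

```python
def normalize(bitmap, size=8):
    """Crop a binary bitmap to its ink bounding box and resample to size x size."""
    ink = [(r, c) for r, row in enumerate(bitmap) for c, v in enumerate(row) if v]
    if not ink:
        return [[0] * size for _ in range(size)]  # blank area -> blank thumbnail
    top = min(r for r, _ in ink)
    bottom = max(r for r, _ in ink)
    left = min(c for _, c in ink)
    right = max(c for _, c in ink)
    height, width = bottom - top + 1, right - left + 1
    # Nearest-neighbour sampling: each output pixel copies one source pixel.
    return [
        [bitmap[top + (y * height) // size][left + (x * width) // size]
         for x in range(size)]
        for y in range(size)
    ]

page = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
thumbnail = normalize(page, size=4)
print(len(thumbnail), len(thumbnail[0]))  # 4 4
```

Cropping before resampling makes the result independent of where in the specific area the mark was drawn. Nearest-neighbour sampling can drop thin strokes when an image is strongly reduced, so a real system would likely use a more careful reduction.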
[0048]
The relationship between the label image information and the document is managed as follows.
The created label image information (reduced image data) itself is stored in the file storage unit 26. Then, the storage position (pointer value) of the label image information is described in a label storage position table representing the relationship between the label image information and its storage position, for example, as shown in FIG. 10. This label storage position table is constructed in the file storage unit 26.
[0049]
After the storage position of the label image information has been described in the label storage position table as described above, the relationship between the created and edited document (AAA, BBB, etc.) and the label image information is described in the label management table, as shown in FIG. 11 (S6 shown in FIG. 4).
[0050]
The search processing for documents stored in the file storage unit 26 in association with the label image information as described above is performed, for example, according to the procedure shown in FIG. 12.
For example, when the user handwrites one of the marks ○, ×, Δ, □ as a search key using the handwriting input device 23 in order to search for documents bearing that mark, the CPU 20 initializes the coefficient values (S21) and then acquires the image data of the input mark (S22). Then, the CPU 20 normalizes the acquired image data (S23), and generates search data corresponding to the label image information to be searched from the normalized image data obtained as a result (S24).
[0051]
When the search data is generated in this way, the CPU 20 reads the label image information from the file storage unit 26 with reference to the label storage position table (see FIG. 10) (S25). Then, the CPU 20 compares the search data generated as described above with the read label image information (S26), and determines whether or not they match (S27). This comparison and match determination is performed by a general image recognition process, and label image information having a predetermined similarity to the search data (substantially the same as the search data) is determined to be the label image information that matches the search data.
[0052]
If they do not match, the comparison and match determination are repeated in the same manner for the other label image information managed in the label storage position table (S36, S25, S26, S27). When label image information that matches the search data is found, the CPU 20 sets the ID of that label image information in a predetermined register as the ID of the label image information to be searched for (search label) (S28).
[0053]
Thereafter, the CPU 20 increments the initialized counter value i (i = 0) (i = 1) (S29), and reads the ID of the label image information corresponding to the i-th (first) document from the label management table (see FIG. 11) (S30). Then, the CPU 20 compares the ID of the search label set in the predetermined register with the ID of the read label image information (S31), and determines whether or not they match (S32). If they match, the CPU 20 sets the ID (document number, document name, etc.) of the i-th (first) image data document in a predetermined register (S33). If the ID of the search label does not match the ID of the read label image information, or after the ID of the image data document has been set in the register, the CPU 20 determines whether or not the counter value i has reached the total number i₀ of documents stored in the file storage unit 26 (i ≧ i₀) (S34). If it has not, the ID of the label image information corresponding to the next document is compared with the ID of the search label.
[0054]
Thereafter, the same processing is repeated until the counter value i reaches the total number of documents i₀. When the counter value i reaches the total number of documents i₀ (that is, when the comparison between the label image information corresponding to all documents and the search label is completed), the CPU 20 displays the IDs of the image data documents set in the register at that time on the display device 25 as a list (S35). That is, a list of, for example, documents in which a circle is described in the specific area is displayed on the display device 25.
[0055]
When the user designates one document from the document group in the list by using the handwriting input device 23, the designated document is read from the file storage unit 26 and displayed on the display device 25.
In the process according to the procedure shown in FIG. 12, if none of the label image information matches the search data (S36, YES), the CPU 20 displays a message on the display device 25 indicating that no corresponding image data document exists (S37), and then terminates the search process.
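The two-stage search of FIG. 12 (first find the stored label that matches the handwritten mark, then list the documents carrying that label) might look like the sketch below. The text only refers to "a general image recognition process", so the pixel-agreement ratio and the 0.8 threshold used here are assumptions, as are the names `similarity` and `search_by_mark` and the dictionary and list forms of the two tables.

```python
def similarity(a, b):
    """Fraction of pixels on which two equally sized binary thumbnails agree."""
    total = len(a) * len(a[0])
    same = sum(pa == pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return same / total

def search_by_mark(search_data, label_images, label_table, threshold=0.8):
    """Return matching document IDs, or None when no label matches (S36, S37).

    label_images: {label_id: thumbnail}, a hypothetical form of the label
                  storage position table plus the stored label images (FIG. 10)
    label_table:  [(document_id, label_id), ...], a hypothetical form of the
                  label management table (FIG. 11)
    """
    search_label = None
    for label_id, thumbnail in label_images.items():         # S25
        if similarity(search_data, thumbnail) >= threshold:  # S26, S27
            search_label = label_id                          # S28
            break
    if search_label is None:
        return None
    # S29 to S35: collect every document whose label ID equals the search label.
    return [doc for doc, label_id in label_table if label_id == search_label]

labels = {
    "circle": [[1, 1, 1], [1, 0, 1], [1, 1, 1]],
    "cross":  [[1, 0, 1], [0, 1, 0], [1, 0, 1]],
}
table = [("AAA", "circle"), ("BBB", "cross"), ("CCC", "circle")]
query = [[1, 1, 1], [1, 0, 1], [1, 1, 0]]  # a slightly imperfect circle
print(search_by_mark(query, labels, table))  # ['AAA', 'CCC']
```

Only the first stage needs image comparison; the second stage is an exact ID comparison, which is why the text stores label IDs rather than images in the label management table.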
[0056]
In the above system, label image information obtained by reducing image data (a handwritten ○ mark, × mark, Δ mark, or □ mark) included in a document is created, and the correspondence between the label image information and the document is managed. Then, when handwritten image data is provided as a search key, a document corresponding to the image data is searched for based on the managed correspondence.
[0057]
In the above example, the user directly inputs image data (such as a mark) described in the document to be searched. However, as the types of image data increase, it becomes difficult for the user to remember the image data. In such a case, the CPU 20 performs, for example, a process according to the procedure shown in FIG. 13 before performing the search process as described above so that the image data can be easily specified.
[0058]
In FIG. 13, the CPU 20 refers to the label storage position table (see FIG. 10), reads each piece of label image information from the file storage unit 26, and displays a list of the label image information (reduced images) on the display device 25 (S41). When the user performs an operation of designating one piece of label image information from the displayed group using the handwriting input device 23, the CPU 20 acquires the designated label image information (S42). Thereafter, the CPU 20 uses the acquired label image information as search data and executes a search process P₁₀₀ according to a procedure similar to that shown in FIG. 12.
[0059]
In each of the above-described examples, the head portion of the image data document is used as the specific area, and the label information is created from the image data described in that portion. However, the present invention is not limited to this. For example, the specific area may be the head line of the image data document, as shown in FIG. 3B, or the head part of the actual image data included in the image data document (see FIG. 3).
[0060]
Furthermore, in each of the above-described examples, the image that is the basis of the label information is a mark that the user has deliberately described for use in, for example, document classification, but the invention is not limited to this. For example, label information can also be created by reducing image data included in a predetermined area of the document itself created by the user (the predetermined-area portion of the document data developed as an image in the memory unit 22), or by reducing image data representing the entire document for one page (the one page of document data developed as an image in the memory unit 22).
[0061]
In such a case, since various label information is created according to the created document, it is difficult to specify search data corresponding to the label information when searching for a document. In order to solve this, the process according to the procedure shown in FIG. 13 as described above is executed before the search process. That is, a list of created label information (reduced images) is displayed on the display device 25, and a corresponding document is searched using the label information selected from the label information group of the list as search data.
[0062]
In each of the above examples, one piece of label information is created for each document, but the present invention is not limited to this. It is also possible to create a plurality of pieces of label information for one document and manage them so that they are associated with that document.
For example, as shown in FIG. 14, when image data portions α₁, α₂, ..., αₙ are specified in the created image data document (AAA), each specified image data portion is reduced and normalized (thumbnail processing), and the corresponding label information a₁, a₂, ..., aₙ is created. The created label information is stored in the file storage unit 26 together with the created document. The relationship between the image data documents (AAA, BBB, etc.) and the label information is described in the label management table, for example, as shown in FIG. 15.
[0063]
Also in this case, according to the same process as described above (see FIG. 12), when certain label information is designated as search data, the corresponding image data document is searched with reference to the label management table.
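A label management table that associates several labels with one document can be sketched as a mapping from document name to a list of label IDs. This is an illustrative model only: the name `search_multi_label` is hypothetical, and the sketch assumes the search data has already been resolved to a label ID as in the procedure of FIG. 12.

```python
def search_multi_label(label_table, search_label):
    """Return the documents that carry the given label among their labels.

    label_table: {document_name: [label_id, ...]}, a hypothetical in-memory
    form of the label management table of FIG. 15.
    """
    return [doc for doc, labels in label_table.items() if search_label in labels]

table = {
    "AAA": ["a1", "a2", "a3"],
    "BBB": ["b1", "b2"],
}
print(search_multi_label(table, "a2"))  # ['AAA']
```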
In each of the above examples, the document search is performed using the label information. Next, search processing for image data in each document will be described.
[0064]
Such a search process is performed, for example, according to the procedure shown in FIG.
In FIG. 16, when the user inputs image data (α) as a search key using the handwriting input device 23, the CPU 20 initializes various coefficients (S51) and then acquires the input image data (α) (S52). Then, the CPU 20 normalizes the acquired image data (α) (S53), and generates search data corresponding to the label image information to be searched from the normalized image data (reduced image) obtained as a result (S54).
[0065]
Thereafter, the CPU 20 reads the image data document from the file storage unit 26 and develops it in the memory unit 22 (S55). Then, the CPU 20 scans the image data document developed in the memory unit 22 (S56) and determines whether or not an image data portion exists (S57). If it is determined that the image data portion exists before reaching the final end of the read document (S58), the CPU 20 extracts the image data portion (S59).
[0066]
When the image data portion is extracted in this way, the CPU 20 normalizes the extracted image data portion (S60), and creates label information (a) (a reduced image) from the normalized image data obtained as a result (S61). Then, the CPU 20 determines whether or not the search data generated from the handwritten input image as described above matches the label information (a) (S62). In this match determination, as in the above-described example (see FIG. 12), label information having a predetermined similarity to the search data (substantially the same as the search data) is determined to match the search data.
[0067]
If it is determined that the search data matches the label information (a), the CPU 20 causes the display device 25 to display that the extracted image data portion corresponds to the search data (S64). Thereafter, the CPU 20 determines whether or not there is an operation input for interrupting the process (S65). If there is no such operation input, scanning of the image data document is resumed, and the processing of extracting image data portions and determining whether or not each extracted image data portion matches the search data is continued (S56 to S65).
[0068]
If the extracted image data portion does not match the search data, scanning of the image data document is continued as it is (S63 → S56).
When the above processing is repeated and scanning is completed up to the final end of the read document, the CPU 20 further determines whether or not there is an image data portion that matches the search data in the above-described process (S66). If the matching image data does not exist in the read document, the CPU 20 displays a message indicating that the corresponding document data portion does not exist on the display device 25 (S67).
[0069]
Thereafter, the CPU 20 determines whether or not a request for processing the next document has been made. If such a request has been made, the next image data document is read from the file storage unit 26 as a new search target document and developed in the memory unit 22 (S68 → S55). Thereafter, the same processing as described above is performed on the new image data document.
[0070]
If, after the display device 25 has displayed that the extracted image data portion corresponds to the search data, the CPU 20 determines that an interruption operation has been performed, the CPU 20 ends the search process for that document and determines whether or not the search process should be performed on the next document (S68).
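The in-document scan of FIG. 16 can be sketched as a generator that walks through the parts of one document and reports each image part matching the search data. The document model (a sequence of tagged parts), the name `find_matching_parts`, and the two callables passed in are assumptions made for illustration.

```python
def find_matching_parts(document_parts, search_data, make_label, matches):
    """Scan one document and yield the positions of image parts that match.

    document_parts: sequence of (kind, payload) pairs where kind is "text" or
                    "image" (a hypothetical document model)
    make_label:     turns an image part into label information (S60, S61)
    matches:        True when two labels are substantially the same (S62)
    """
    for index, (kind, payload) in enumerate(document_parts):  # S56: scan
        if kind != "image":                                   # S57: image part?
            continue
        label = make_label(payload)                           # S59 to S61
        if matches(search_data, label):                       # S62
            yield index                                       # S64: report the hit

document = [
    ("text", "Report"),
    ("image", [[1, 0], [0, 1]]),
    ("text", "Details"),
    ("image", [[1, 1], [1, 1]]),
    ("image", [[1, 0], [0, 1]]),
]
query = [[1, 0], [0, 1]]
hits = list(find_matching_parts(
    document, query,
    make_label=lambda image: image,   # toy labels: the image itself
    matches=lambda a, b: a == b,      # toy rule: exact equality
))
print(hits)  # [1, 4]
```

Because the function is a generator, the caller can stop iterating after any hit, which corresponds to the interruption check in S65. An empty result corresponds to the "no matching portion" message of S67.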
[0071]
According to the search processing as described above, image data that matches the image data input from the handwriting input device 23 is searched from each image data document.
In the system that performs the document search process based on the label information described above, the search process is performed using the label management table that describes the relationship between the label information and the image data documents. However, the present invention is not limited to this. In particular, without using a label management table, every time search data is given, the image data of a predetermined area of each document may be read according to the same procedure as shown in FIG. 16, and the comparison and match determination between the read image data and the search data may be performed.
[0072]
For example, when the image data represents “Mt. Fuji” as shown in FIG. 17, in addition to using reduced data of “Mt. Fuji” as the label information as described above and as shown in FIG. 18A, it is also possible to register a trapezoidal line drawing, which is the outline data of “Mt. Fuji”, as the label information (described in the label management table), as shown in FIG. 18B.
[0073]
Moreover, although label information is created from image data in each of the examples described above, the invention is not limited to this. Label information can also be created from audio data, composite data of audio data and image data, or the like.
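The claims characterize the label information for audio data as a main frequency component and require a test of whether two such components are "substantially the same". How the component is computed and how close two values must be are not stated in this passage, so the following is only a sketch: it assumes a naive discrete Fourier transform over sampled audio and a fixed tolerance in hertz. The names `main_frequency` and `substantially_same` and the tolerance value are hypothetical.

```python
import cmath
import math

def main_frequency(samples, sample_rate):
    """Return the frequency in Hz of the strongest non-DC component (naive DFT)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    best_bin, best_magnitude = 0, -1.0
    for k in range(1, n // 2 + 1):             # skip the DC term at k = 0
        coeff = sum(s * cmath.exp(-2j * math.pi * k * t / n)
                    for t, s in enumerate(samples))
        if abs(coeff) > best_magnitude:
            best_bin, best_magnitude = k, abs(coeff)
    return best_bin * sample_rate / n

def substantially_same(f1, f2, tolerance_hz=10.0):
    """Assumed matching rule: frequencies within a fixed tolerance are the same."""
    return abs(f1 - f2) <= tolerance_hz

rate, n = 4000, 400                            # 10 Hz frequency resolution
tone = [math.sin(2 * math.pi * 440 * t / rate) for t in range(n)]
label = main_frequency(tone, rate)             # label information for the clip
print(substantially_same(label, 445.0))        # True under the assumed tolerance
```

The naive transform costs time proportional to the square of the number of samples. It is used here only to keep the sketch self-contained, and a fast Fourier transform would be the usual choice in practice.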
Furthermore, although an example in which the system according to the present invention is built in a portable computer terminal device has been described, the invention is not limited to this. It can also be built in an ordinary desktop or laptop personal computer. In this case, the file storage unit 26 is composed of, for example, a hard disk device. Various programs can be provided to the system on a recording medium such as a CD-ROM, a floppy disk, or various memory cards.
[0074]
In each of the above examples, the processing in step S1 shown in FIG. 4 corresponds to the extraction means (extraction program means), and the processing in steps S2 to S4 corresponds to the label information generation means (label information generation program means). The tables shown in FIGS. 7, 10, 11, and 15 each correspond to the management means. The processing according to each of the procedures in FIGS. 8 and 12 corresponds to the search means (search program means).
[0075]
Also, the processing of steps S56 to S59 shown in FIG. 16 corresponds to the extraction means, the processing of steps S60 and S61 shown in FIG. 16 corresponds to the label information generation means, and the processing of steps S62 and S63 shown in FIG. 16 corresponds to the determination means.
[0076]
Further, the process of step S6 shown in FIG. 4 corresponds to a program means for providing the management means with the correspondence between the document and the label information, and the process of the procedure shown in FIG. 13 corresponds to the label list display control means.
[0077]
[Effect of the invention]
As described above, according to the invention described in claims 1 to 2 of this application, label information is created from audio data and a document search is performed based on the correspondence between the label information and the document, so that a system capable of performing search processing based on audio data can be realized.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a functional configuration of a system according to an embodiment of the present invention.
FIG. 2 is a block diagram showing an example of a hardware configuration of a system according to the embodiment of the present invention.
FIG. 3 is a diagram illustrating an example of a specific area of an image data document from which image data is to be extracted.
FIG. 4 is a flowchart illustrating an example of a processing procedure when a created document is stored.
FIG. 5 is a diagram illustrating an example of an image to be described in a specific area of a document.
FIG. 6 is a diagram illustrating another example of an image to be described in a specific area of a document.
FIG. 7 is a diagram showing a label management table describing a relationship between a document and label information.
FIG. 8 is a flowchart illustrating an example of a procedure of document search processing.
FIG. 9 is a diagram showing still another example of an image to be described in a specific area of a document.
FIG. 10 is a diagram showing a label storage position table describing the relationship between label information and its storage position.
FIG. 11 is a diagram showing a label management table describing the correspondence between label information and documents.
FIG. 12 is a flowchart illustrating an example of a procedure of document search processing.
FIG. 13 is a flowchart illustrating an example of a processing procedure for displaying a list of label information.
FIG. 14 is a diagram showing a correspondence between image data in a created and edited image data document and label information.
FIG. 15 is a diagram showing a label management table describing the correspondence between documents and label information.
FIG. 16 is a flowchart illustrating an example of a procedure of image data search processing.
FIG. 17 is a diagram illustrating an example of image data.
FIG. 18 is a diagram illustrating an example of label information.
[Explanation of symbols]
1 Information processing system
10 Document management unit
11 Management unit
12 Display unit
13 Audio output unit
14 Input unit
15 Input / output control unit
16 Feature extraction unit
17 Compression unit
18 Display control unit
19 Search unit
20 CPU
21 ROM
22 Memory unit
23 Handwriting input device
24 Keyboard
25 Display device
26 File storage unit

Claims

In an information processing system that processes documents containing unencoded audio data,
An extraction means for extracting audio data included in a designated document when a search command is issued;
Label information generating means for generating label information including a main frequency component representing the characteristics of the audio data from the audio data extracted by the extracting means;
Determination means for determining whether or not main frequency component information, which is label information provided as a search key together with the search command, and main frequency component information, which is label information generated by the label information generating means, are substantially the same;
An information processing system configured to obtain a search result based on the determination result when the determination means determines that the main frequency component information of both pieces of label information is substantially the same.

The information processing system according to claim 1, wherein the extraction means extracts audio data recorded in a predetermined area of the document.