JPH0612539B2

JPH0612539B2 - Kanji / Kana conversion device

Info

Publication number: JPH0612539B2
Application number: JP57174573A
Authority: JP
Inventors: 和明田中; 誠糸賀; 順司津田
Original assignee: Hitachi Microcomputer System Ltd; Hitachi Ltd
Current assignee: Hitachi Microcomputer System Ltd; Hitachi Ltd
Priority date: 1982-10-06
Filing date: 1982-10-06
Publication date: 1994-02-16
Anticipated expiration: 2009-02-16
Also published as: JPS5965342A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は、漢字仮名混りデータの漢字に読み仮名を自動
付与する漢字仮名変換方式に関するものである。TECHNICAL FIELD The present invention relates to a kanji-kana conversion system for automatically giving a reading kana to kanji in kanji-kana mixed data.

[Prior art]

従来の漢字仮名変換方式においては、漢字の字単位の読
み仮名付けが基本であり、一字単位に漢字の読みが一意
に決められている。そしてそれ以外の読みとなる場合
は、その漢字前後の文字を合わせた熟語の例外辞書が用
意されており、変換処理において変換対象となる漢字が
例外辞書に登録されていれば、その読みを優先させる方
式をとつている。しかし、この方式では、幾つかの漢字
熟語が組み合わされた漢字文字列において、必ずしも各
熟語単位で読みが付与されるとは限らないため、本来意
図された熟語ではない漢字文字列の部分を熟語と判断し
て読みが付与される可能性があつた。例えば、「開発作
業」という熟語に読みを付与する場合、各漢字の読み
が、開（カイ）、発（ハツ）、作（サ）、業（ギヨウ）
と決められ、例外辞書には、発作（ホツサ）という熟語
が登録されていると、開発作業（カイホツサギヨウ）と
誤まつて読み仮名付けられる。In the conventional kanji-kana conversion method, the kana reading of kanji is basically used for each character, and the kanji reading is uniquely determined for each character. If the reading is other than that, an exception dictionary is prepared for the compound words that combine the characters before and after the kanji, and if the kanji to be converted in the conversion process is registered in the exception dictionary, that reading is prioritized. The method of letting is adopted. However, in this method, in a kanji character string in which several kanji compound words are combined, the reading is not always given in each compound word unit, so the part of the kanji character string that is not the originally intended compound word is not There is a possibility that a reading will be added. For example, when adding a reading to the idiom “development work”, the reading of each kanji will be open (kai), utterance (hatsu), work (sa), and kaku (guiyo).
If the idiom of seizure (hotssa) is registered in the exception dictionary, it will be mistaken for development work (kaihotsagiyo) and will be given a pseudonym.

一方漢字仮名混り文を音声で出力する際には漢字の読み
仮名を付与することが必要となるが、このような用途に
利用する場合には音調に抑揚をつけるため、熟語単位で
み仮名付けを行うことが望ましい。しかしながら従来の
漢字仮名変換方式は字単位での変換を基本にしているた
め、上記のような用途には不向きであつた。On the other hand, when outputting mixed sentences of kanji and kana by voice, it is necessary to add reading kana of kanji, but when using it for such purposes, in order to add tones to the tones, kana It is desirable to make a mark. However, the conventional kanji / kana conversion method is based on the conversion on a character-by-character basis, and thus is not suitable for the above-mentioned applications.

更に、漢字の読み仮名付与方法としては、漢字文字列の
先頭から漢字仮名辞書に登録されている漢字（一字の漢
字または、二字以上の漢字熟語）の中で、最長一致する
もの（語基と呼ぶ。）を逐次見つけ、その語基の読みを
付与する方式が考えられるが、次の要因により誤まつた
読みを付与する可能性がある。In addition, as a method for assigning Kanji reading kana, the longest matching kanji (one kanji or two or more kanji compound words) registered in the kanji kana dictionary from the beginning of the kanji character string (word It is conceivable that the reading of the word base will be added and the reading of the word base will be added, but the wrong reading may be added due to the following factors.

(a) 接辞や、漢字仮名辞書に未登録な熟語の出現によ
り語基の認定を誤まることがある。(a) Occurrence of an affix or a idiom that has not been registered in the Kanji / Kana dictionary may cause incorrect recognition of the word base.

(b) 同一熟語が、複数個の読みを持つことがある。(b) The same compound word may have multiple readings.

[Object of the Invention]

本発明の目的は、漢字仮名混りデータに読み仮名を自動
付与する方式において、読み仮名を付与する対象となる
漢字文字列が、複数個の読みを有する場合、及び漢字文
字列が複数通りに分割でき、その分割された文字列が漢
字仮名辞書に登録されている場合に極めて高精度に読み
仮名付けを行い得る方式を提供することにある。An object of the present invention is to automatically give a reading kana to mixed kanji kana data, in a case where the kanji character string to which the reading kana is attached has a plurality of readings, and when there are a plurality of kanji character strings. An object of the present invention is to provide a method that can be divided and that can perform reading kana naming with extremely high accuracy when the divided character string is registered in a kanji kana dictionary.

[Outline of Invention]

本発明方式は読み仮名付与の単位となる語基の認定と、
読み仮名の選択とを次の手順で確認しつつ行なうことに
特徴がある。According to the method of the present invention, recognition of a word base that is a unit of giving a kana,
The feature is that it is performed while confirming the selection of the reading kana in the following procedure.

(1) 最長一致により仮認定された漢字語基（前方語）
に対する複数個の読み仮名の中から、その語基の品詞や
前後の文字の属性などを手掛りに、読み仮名付与規則に
基づきその読み仮名を付与する。(1) Kanji word base tentatively recognized by the longest match (forward word)
Based on the part-of-speech of the word base and the attributes of the preceding and following characters, the reading kana is given from the plurality of reading kana for the.

(2) (1)で仮認定された語基に続く漢字文字列に対し
て、同様に語基（後方語）を仮認定し、その読み仮名を
付与する。(2) For the Kanji character string following the word base tentatively recognized in (1), similarly, the word base (backward word) is tentatively recognized and its phonetic kana is given.

(3) 前方語、後方語の字数や、品詞情報などを手掛り
に、変換処理制御規則に基づき、語基間の接続関係を調
べ、語基分割の妥当性をチエツクして読みを確定する。(3) Based on the number of characters of the front word and the rear word, the part-of-speech information, etc., based on the conversion processing control rule, the connection relation between the word bases is checked, and the validity of the word base division is checked to determine the reading.

例えば、後方語が、１文字であると認定された場合、そ
れが接辞（この場合、接尾語）ならば、前方語は正しい
語基に分割されていると考えられるが、接辞でない場
合、前方語は本来意図された意味を持つ語基として認定
されていない可能性があり、再度、語基の認定をやり直
す。For example, if a trailing word is identified as a single letter, if it is an affix (in this case, a suffix), then the leading word is considered to be split into the correct base, but if it is not, then the leading word is The word may not have been certified as a base having the originally intended meaning, and the base should be certified again.

Example of Invention

以下、本発明を図面を参照して詳細に説明する。本発明
の一実施例のハードウエア構成を第１図に示す。第１図
における１はプロセツサ、２は磁気テープや磁気デイス
ク等の漢字仮名混りデータ格納メモリ、３は漢字仮名変
換された漢字とその読み仮名データの格納メモリ、４は
プログラム格納メモリ、５はワーク・エリア、６は漢字
仮名辞書メモリ、７は変換処理制御規則テーブル・メモ
リ、８は読み仮名付与規則テーブル・メモリ、９はライ
ンプリンタを表わしている。漢字仮名辞書メモリ６に
は、第２図に示す様な形式の辞書が格納されている。つ
まり、１字以上の漢字文字列Ａを見出し語として、その
読み仮名文字列Ｂ、語属性コードＣ、頻度情報Ｄとが格
納されている。ある漢字文字列に対して複数個の読みが
存在する場合や、ある漢字文字列とその読み仮名文字列
に対して複数個の語属性コードが存在する場合には、そ
れぞれに１組の構成をとるものとする。語属性コードＣ
は、漢字文字列Ａとその読み仮名文字列Ｂとが有する文
法情報や、音読み訓読みといつた語属性を示す符号であ
り、その例を第３図に示す。頻度情報は、漢字文字列と
その読み仮名文字列および語属性コードが漢字仮名変換
時に用いられた頻度が格納されている。Hereinafter, the present invention will be described in detail with reference to the drawings. A hardware configuration of one embodiment of the present invention is shown in FIG. In FIG. 1, 1 is a processor, 2 is a storage memory for kanji / kana mixed data such as magnetic tape or magnetic disk, 3 is storage memory for kanji / kana converted kanji and its kana reading data, 4 is program storage memory, 5 is A work area, 6 is a kanji kana dictionary memory, 7 is a conversion processing control rule table memory, 8 is a reading kana giving rule table memory, and 9 is a line printer. The kanji / kana dictionary memory 6 stores a dictionary having a format as shown in FIG. That is, the kana character string A of one or more characters is used as an entry word, and its phonetic kana character string B, word attribute code C, and frequency information D are stored. If there are multiple readings for a given kanji character string, or if there are multiple word attribute codes for a given kanji character string and its reading kana character string, configure one set for each. Shall be taken. Word attribute code C
Is a code indicating the grammatical information of the Kanji character string A and its phonetic kana character string B, and the pronunciation of the pronunciation and the word attribute, and an example thereof is shown in FIG. The frequency information stores the kanji character string, its reading kana character string, and the frequency with which the word attribute code was used during kanji kana conversion.

変換処理制御規則テーブル・メモリ７は、第４図に示す
様な論理的構成を有する。各変換処理制御規則は、条件
と、その条件が満足された場合に実行される処理との組
み合わせである。The conversion processing control rule table memory 7 has a logical structure as shown in FIG. Each conversion process control rule is a combination of a condition and a process executed when the condition is satisfied.

各行が、各変換処理制御規則に対応し、右端を除く各列
が条件を示す。Each row corresponds to each conversion processing control rule, and each column except the right end indicates the condition.

配列内のＹｅｓ，Ｎｏ，−は、それぞれ各規則での各条
件が（Ｙｅｓ）「満足されなければならない」、（Ｎ
ｏ）「満足されてはならない」、（−）「いずれでもよ
い」、ことを示す。右端の欄は、各規則の条件が満足さ
れたとき実施すべき処理の内容を示す。Yes, No, and-in the array indicate that each condition in each rule is (Yes) "must be satisfied", (N
o) Indicates that "it must not be satisfied", and (-) "any is acceptable". The rightmost column shows the contents of processing to be executed when the conditions of each rule are satisfied.

各規則は、規則番号が小さいほど優先される。The smaller the rule number, the higher the priority of each rule.

変換処理制御規則番号１を例により説明する。The conversion processing control rule number 1 will be described as an example.

「前方語に読み仮名を付与できたか」という条件を満足
し、「前方語直後の文字は漢字か」という条件を満足し
なければ、「前方語の読み仮名を確定し、当該文字列に
対する読み仮名付け処理を終了する。」上記規則テーブル（第４図）の物理的な実施例を第５図
に示す。ここで、配列内の各値は、“２”がＹｅｓ、
“１”１がＮｏ、“０”がＰassを意味する。従つて第
５図の行ａ，ｂ，ｃの各値は、第４図の規則番号１，
２，３の条件にそれぞれ対応する。第６図の各値は、第
５図の各列に対応して条件の判定を行なう条件サブルー
チン番号を示す。例えば条件番号１の「漢字文字列はあ
るか」という判定を行う処理プログラムにはサブルーチ
ン番号１１００が付与されている。第７図の各値は、第
５図の各行に対応して各条件が満足されたとき実行され
る処理サブ・ルーチン番号を示す。例えば第７図の「１
０」は、第４図における規則番号１の条件が全部満足し
たときに行われる処理、つまり「前方後を確定する」と
いう処理を実行するプログラムのサブルーチン番号を示
す。読み仮名付与規則テーブル・メモリは、第８図に示
す様な論理的構成を有する。If the condition "Is the phonetic kana given to the forward word given?" Is satisfied and the condition "Is the character immediately after the forward word a kanji?" Is not satisfied, "Define the phonetic kana of the forward word and read it for the character string. The pseudonymization process is completed. "A physical example of the rule table (Fig. 4) is shown in Fig. 5. Here, "2" is Yes for each value in the array,
“1” 1 means No and “0” means Pass. Therefore, the values in rows a, b, and c in FIG.
It corresponds to a few conditions, respectively. Each value in FIG. 6 indicates a condition subroutine number for determining the condition corresponding to each column in FIG. For example, a subroutine number 1100 is added to the processing program for determining whether there is a Chinese character string of condition number 1. Each value in FIG. 7 indicates a processing sub-routine number executed when each condition is satisfied corresponding to each row in FIG. For example, "1" in FIG.
"0" indicates the subroutine number of the program that executes the process performed when all the conditions of rule number 1 in FIG. 4 are satisfied, that is, the process of "determining the front-back". The reading kana name giving rule table memory has a logical structure as shown in FIG.

第８図の見方や物理的実施方法は、変換処理制御規則テ
ーブルと同様である。The view and the physical implementation method of FIG. 8 are the same as those of the conversion processing control rule table.

プログラムは、第９図のようなモジユール構成をとる。
漢字仮名変換メインモジユール２０では、各規則テーブ
ル（第４図、第８１図）をプロセッサ１へロードする処
理、漢字仮名変換の対象となる漢字データを漢字データ
格納メモリ２から取出す処理、漢字データを、非漢字と
漢字との間で区切り、漢字仮名変換処理単位（ここで
は、文節と呼ぶ）を設定する処理、処理単位毎に変換処
理制御モジユールを参照する処理、漢字仮名変換された
漢字とその読み仮名データを、仮名データ格納メモリ３
へ出力する処理および、処理実行中のエラーに対するエ
ラーメツセージをラインプリンタへ出力する処理を行な
う。The program has a module structure as shown in FIG.
In the Kanji / Kana conversion main module 20, a process of loading each rule table (FIGS. 4 and 81) into the processor 1, a process of extracting Kanji data to be subjected to Kanji / Kana conversion from the Kanji data storage memory 2, a Kanji data Is separated between non-Kanji and Kanji, and the process of setting the Kanji Kana conversion processing unit (called a phrase here), the process of referring to the conversion process control module for each processing unit, and the Kanji Kana converted Kanji The reading kana data is stored in the kana data storage memory 3
To output to the line printer and an error message for an error during execution of the process.

変換処理制御モジユール２１は、変換処理制御規則テー
ブル（第４図）に従い、第６図で示したサブルーチン番
号に対応する条件サブルーチン２２を参照し、リターン
・コードが、規則表の値とすべて一致したものであつて
且つ規則番号の一番小さな規則に相当するサブルーチン
番号（第７図参照）の実行処理サブルーチン２３を起動
する。本実施例では、漢字仮名変換の対象となる漢字文
字列が、漢字仮名辞書に登録されている複数の漢字文字
列（語基とよぶ）から構成されている場合、語基の認定
誤りを防止するため、連続する２つの語基（前方語およ
び後方語とよぶ）の接続関係を調べて妥当性を確認しつ
つその語基の読み仮名を付与する方式としている。それ
ゆえ、前方語に対してその読み仮名を付与ることや、後
方語に対してその読み仮名を付与することも、変換処理
制御規則の一条件としている。それぞれ、前方語に対す
る読み仮名付与条件サブルーチン２２ａ、後方語に対す
る読み仮名付与条件サブルーチン２２ｂと呼ぶ。The conversion processing control module 21 refers to the conditional subroutine 22 corresponding to the subroutine number shown in FIG. 6 according to the conversion processing control rule table (FIG. 4), and the return code matches all the values in the rule table. The execution processing subroutine 23 of the subroutine number (see FIG. 7) corresponding to the rule having the smallest rule number is started. In the present embodiment, when the kanji character string to be converted into kanji kana is composed of a plurality of kanji character strings (called word bases) registered in the kanji kana dictionary, a mistaken recognition of the word base is prevented. For this reason, the method of assigning a phonetic alphabet for the word base while checking the validity by checking the connection relationship between two consecutive word bases (referred to as a forward word and a backward word). Therefore, the addition of the phonetic kana to the forward word and the addition of the phonetic kana to the backward word are also conditions for the conversion processing control rule. They are called a phonetic kana addition condition subroutine 22a for the forward word and a phonetic kana addition condition subroutine 22b for the backward word, respectively.

各サブルーチン２２ａ，２２ｂは、変換処理制御モジユ
ール２１と同様に、読み仮名付与規則テーブルに基づ
き、条件サブルーチン２４と、実行処理サブルーチン２
５とを起動して、複数個存在する読み仮名の中から、最
適なものを選択する。Similar to the conversion processing control module 21, each of the subroutines 22a and 22b is based on the reading kana provision rule table, and the condition subroutine 24 and the execution processing subroutine 2
5 and are started, and the optimum one is selected from a plurality of reading kana.

次に、変換処理手順について、「新規則の追加」という
漢字仮名混りデータに読み仮名を付与する場合を例にと
つて説明する。Next, the conversion processing procedure will be described by taking as an example the case of giving a reading kana to the kanji kana mixed data of "adding a new rule".

初めに、漢字仮名変換メインモジュール２０の処理内容
について、第２０図のフローチヤートに基づき説明す
る。First, the processing contents of the kanji / kana conversion main module 20 will be described based on the flow chart of FIG.

漢字仮名変換メインモジュールは、起動されると、変換
処理制御規則テーブル（第４図）を変換処理制御規則テ
ーブルメモリ７から、プロセツサ１上にロードし、ワー
クエリアCRULEに格納する（１０１）とともに、読み仮
名付与規則テーブル（第８図）を読み仮名付与規則テー
ブル・メモリ８から、プロセツサ１上にロードし、ワー
クエリアYRULEに格納する（１０２）。次に漢字仮名混
りデータ格納メモリ２から、漢字仮名変換の対象となる
データ「新規則の追加」を読み込む（１０３）。もし対
象となるデータが漢字仮名混りデータ格納メモリ２にな
いと漢字仮名変換メインルーチンの処理を終了する。読
み込まれた漢字仮名混りデータは、非漢字から漢字への
変化点で分割され、ワークエリアKANJIに格納される
（１０４）。上記の例では、ワークエリアKANJI(1)に
「新規則の」が格納され、ワークエリアKANJI(2)に「追
加」が格納される。When the Kanji / Kana conversion main module is started, the conversion processing control rule table (FIG. 4) is loaded from the conversion processing control rule table memory 7 onto the processor 1 and stored in the work area CRULE (101). The reading kana provision rule table (FIG. 8) is loaded from the reading kana provision rule table memory 8 onto the processor 1 and stored in the work area YRULE (102). Next, the data "addition of new rule" which is the target of kanji / kana conversion is read from the kanji / kana mixed data storage memory 2 (103). If the target data is not in the kanji / kana mixed data storage memory 2, the process of the kanji / kana conversion main routine is terminated. The read kanji / kana mixed data is divided at the change points from non-kanji to kanji and stored in the work area KANJI (104). In the above example, "new rule" is stored in the work area KANJI (1), and "additional" is stored in the work area KANJI (2).

漢字仮名混りデータが分割されたものを、ここでは文節
と呼ぶが、その文節の個数を、パラメータＮＵＭに格納
する（１０５）。上記の例では２が格納される。Ｋに１
個目の分節を示す１を格納（１０６）し、第Ｋ番目の分
節KANJI(K)をバツフアＡに格納する（１０７）。上記の
例では、第１４図のように漢字仮名変換対象文字列が格
納される。エラーコードERCODEに０を格納（１０８）
後、変換処理制御モジユール２１を起動し、読み仮名処
理を行ない、結果をバツフアＢに格納する（１０９）。
上記の例では、第１５図のように漢字とその読みが格納
される。処理終了後、エラーコードが０でなければ、
（１１０）、エラーメツセージをラインプリンタに出力
して（１１１）、漢字仮名変換処理メインモジュールの
処理を終了する。The data obtained by dividing the kanji / kana mixed data is called a phrase here, and the number of the phrase is stored in the parameter NUM (105). In the above example, 2 is stored. 1 for K
The value 1 indicating the segment is stored (106), and the Kth segment KANJI (K) is stored in buffer A (107). In the above example, a kanji-kana conversion target character string is stored as shown in FIG. Store 0 in error code ERCODE (108)
After that, the conversion processing control module 21 is activated to perform the reading kana processing, and the result is stored in the buffer B (109).
In the above example, kanji and their readings are stored as shown in FIG. After processing, if the error code is not 0,
(110), the error message is output to the line printer (111), and the process of the Kanji-kana conversion main module is completed.

エラーコードが０ならば（１１０）、バツフアＢの内容
を仮名データ格納メモリ３に出力する（１１２）。Ｋに
１を加えた後次の文節（１１３）があるかを判定し（１
１４）、存在すれば１０７以降の処理を繰返し、存在し
なければ、１０３以降の処理を繰返す。上記例では、第
２番目の文節「追加」について、１０７以降の処理が行
なわれ、その読み仮名が仮名データ格納メモリに出力さ
れ、その後１０３で、漢字仮名変換の対象となる漢字仮
名データがないため、漢字仮名変換メインモジュールの
処理を終了する。If the error code is 0 (110), the contents of buffer B are output to the kana data storage memory 3 (112). After adding 1 to K, it is judged whether there is a next clause (113) (1
14) If it exists, the processing from 107 onward is repeated, and if it does not exist, the processing from 103 onward is repeated. In the above example, with respect to the second phrase “addition”, the processing after 107 is performed, the reading kana is output to the kana data storage memory, and then 103, there is no kanji kana data to be converted to kanji kana. Therefore, the processing of the kanji-kana conversion main module is ended.

変換処理制御方法について、第１１図のフローチヤート
に基づき説明する。A conversion processing control method will be described based on the flow chart of FIG.

変換処理制御モジユール２１は、起動されると、再処理
カウンタAGAINに０を格納し（２０１）、Ｉに第１番目
の変換処理制御規則を示す１を格納し（２０２）、Ｊに
第１番目の変換処理制御条件を示す１を格納し（２０
３）、リターンコードに０を格納し（２０４）、第Ｊ番
目の変換処理制御条件サブルーチンを起動する（２０
５）。２０５を処理中にエラーが発生したかどうかを判
定し（２０６）、エラーがあれば変換処理制御モジユー
ル２１の処理を終了する。エラーがなければ、変換処理
制御規則CRULE(I,J)の値が０か、又は、変換処理制御規
則CRULE(I,J)と２０５の処理結果のリターンコードRCOD
Eとが等しいかを判定し（２０７）、満足すれば２０８
を、満足しなければ２１０を実行する。２０８でＪに１
を加えた（２０８）後、第Ｊ番目の変換処理制御条件が
あるかを判定して（２０９）、満足すれば、２０４以降
の処理を繰返し、満足しなければ、第Ｉ番目の変換処理
制御実行処理サブルーチンを起動する（２１３）。２１
３の処理中にエラーが発生したかどうかを判定して（２
１４）、エラーがあれば変換処理制御モジユール２１の
処理を終了し、エラーがなければ、２１３の処理で、再
処理カウンタAGAINに０以外の値が格納されたかを判定
し（２１５）、０以外の値が格納されていれば、２０２
以降の処理を繰返す。When the conversion processing control module 21 is activated, it stores 0 in the reprocessing counter AGAIN (201), stores 1 in I indicating the first conversion processing control rule (202), and stores the first in J. The conversion processing control condition of 1 is stored (20
3), 0 is stored in the return code (204), and the Jth conversion processing control condition subroutine is activated (20
5). It is determined whether an error has occurred during the processing of 205 (206), and if there is an error, the processing of the conversion processing control module 21 is ended. If there is no error, the value of the conversion processing control rule CRULE (I, J) is 0, or the return code RCOD of the processing result of the conversion processing control rules CRULE (I, J) and 205.
It is judged whether E and E are equal (207), and if satisfied, 208
If not satisfied, 210 is executed. 1 for J at 208
After adding (208), it is judged whether or not there is a Jth conversion processing control condition (209), and if satisfied, the processing after 204 is repeated. The execution processing subroutine is started (213). 21
Determine whether an error occurred during the processing of 3 (2
14) If there is an error, the processing of the conversion processing control module 21 is terminated, and if there is no error, it is judged in the processing of 213 whether a value other than 0 is stored in the reprocessing counter AGAIN (215). If the value of
The subsequent processing is repeated.

再処理カウンタAGAINのフラグが０ならば、バツフアＡ
に格納されている漢字のすべてに読みが付与されている
かを判定して（２１６）、読みがすべての漢字に付与さ
れていれば変換処理制御モジユール２１の処理を終了
し、読みがついていない漢字があれば、２０１以降の処
理を繰返す。If the flag of reprocessing counter AGAIN is 0, buffer A
It is judged whether all the kanji stored in the kanji have been given readings (216). If the readings have been given to all kanji, the conversion process control module 21 ends the processing, and the kanji without readings are added. If there is, the processing after 201 is repeated.

２１０で、Ｉに１を加えた後、第Ｉ番目の変換処理制御
規則があるかを判定して（２１１）、満足すれば、２０
３以降の処理を繰返し、満足しなければエラーコードER
CODEに１を格納して（２１２）変換処理制御モジユール
２１の処理を終了する。At 210, after adding 1 to I, it is judged whether there is an I-th conversion processing control rule (211), and if satisfied, 20
Repeat the processing after 3 and if not satisfied, error code ER
1 is stored in CODE (212) and the process of the conversion process control module 21 is ended.

本実施例では、変換処理制御規則の条件サブルーチン２
２として、バツフアＡの先頭文字が漢字かを判定するサ
ブルーチン（第４図の条件番号１）と、前方語に読みを
付与するサブルーチン（条件番号２）と、前方語直後の
文字が漢字かを判定するサブルーチン（条件番号３）
と、再処理中かを判定するサブルーチン（条件番号４）
と前方語が２文字以上かを判定するサブルーチン（条件
番号５）と前方語が接頭語かを判定するサブルーチン
（条件番号６）と、後方語に読みを付与するサブルーチ
ン（条件番号７）と、後方語が２文字以上かを判定する
サブルーチン（条件番号８）と、後方語が接頭語又は接
尾語かどうかを判定するサブルーチン（条件番号９）
と、後方語の直後の文字が漢字かを判定するサブルーチ
ン（条件番号１０）があり、処理実行後、条件を満足し
ていれば、リターンコードRCODEに２を格納し、条件を
満足していなければ、リターンコードRCODEに１を格納
する。In the present embodiment, the conditional subroutine 2 of the conversion processing control rule
2, a subroutine (condition number 1 in FIG. 4) for determining whether the first character of the buffer A is a kanji character, a subroutine (condition number 2) for giving a reading to the forward word, and a character immediately after the forward word are kanji characters. Subroutine to judge (condition number 3)
And a subroutine for determining whether reprocessing is in progress (condition number 4)
And a subroutine (condition number 5) for determining whether the forward word has two or more characters, a subroutine (condition number 6) for determining whether the forward word is a prefix, and a subroutine for adding reading to the backward word (condition number 7). Subroutine for determining if the backward word is two or more characters (condition number 8) and subroutine for determining whether the backward word is a prefix or suffix (condition number 9)
Then, there is a subroutine (condition number 10) that determines whether the character immediately after the backward word is Kanji. If the condition is satisfied after executing the process, store 2 in the return code RCODE and satisfy the condition. For example, 1 is stored in the return code RCODE.

又、変換処理制御規則の実行処理サブルーチン２３とし
て、バツフアＥ内の前方語とその読み仮名をバツフアＢ
に格納するサブルーチン（第４図の規則番号１の処理参
照）と、バツフアＥ内の前方語とその読み仮名およびバ
ツフアＦ内の後方語とその読み仮名をバツフアＢに格納
するサブルーチン（規則番号２，７，１１参照）と、バ
ツフアＥ内の前方語とその読み仮名をバツフアＢに格納
し、バツフアＡ内の読み仮名未付与文字列をバツフアＡ
の先頭に移すサブルーチン（規則番号３，６，８，１
２）と、バツフアＥの前方語とその読み仮名およびバツ
フアＦの後方語とその読み仮名をバツフアＢに格納し、
バツフアＡ内の読み仮名未付与文字列をバツフアＡの先
頭に移すサブルーチン（規則番号４，９）と、再処理カ
ウンタAGAINに１を加え、バツフアＥの前方語とその読
み仮名およびバツフアＦの後方語とその読み仮名を第２
０図に例示するような、バツフアＧに格納するサブルー
チン（規則番号５，１０）と、バツフアＧの漢字文字列
とその読み仮名をバツフアＢに格納し、バツフアＡ内の
読み仮名未付与文字列をバツフアＡの先頭に移すサブル
ーチン（規則番号１３，１４）と、バツフアＡ内の非漢
字文字列があれば、バツフアＢに格納するサブルーチン
（規則番号１５）がある。Further, as the execution processing subroutine 23 of the conversion processing control rule, the forward word in the buffer E and its reading kana are buffer B.
In the buffer E (see the processing of rule number 1 in FIG. 4), and a subroutine for storing the forward word and its reading kana in buffer E and the backward word in buffer F and its reading kana in buffer B (rule number 2). , 7 and 11), the forward word in the buffer E and its phonetic kana are stored in the buffer B, and the character string without the phonetic kana in the buffer A is stored in the buffer A.
Subroutine to move to the beginning of (rule number 3, 6, 8, 1
2), the forward word of the buffer E and its phonetic kana and the backward word of the buffer F and its phonetic kana are stored in the buffer B,
Subroutine (rule number 4, 9) for moving the character string without reading kana in buffer A to the beginning of buffer A and adding 1 to the reprocessing counter AGAIN, and the forward word of buffer E and its reading kana and the back of buffer F Second word and its phonetic kana
As shown in FIG. 0, a subroutine (rule numbers 5 and 10) for storing in buffer G, a kanji character string of buffer G and its phonetic kana are stored in buffer B, and a character string not yet given in phonetic A in buffer A To the beginning of buffer A (rule numbers 13 and 14), and if there is a non-Kanji character string in buffer A, there is a subroutine to store it in buffer B (rule number 15).

尚、処理中に、不合理な事象が発生したら、エラーコー
ドERCODEに０以外の数値を格納して処理を終了する。If an irrational event occurs during processing, a value other than 0 is stored in the error code ERCODE and the processing ends.

上記例の「新規則」では、漢字仮名辞書６に、「新規
則」という熟語が登録されておらず、「新規」、「規
則」、「新」、「規」、「則」が登録されているとする
と、第１回目には、第４図に示す変換処理制御規則番号
５が適用され、「新規」と「則」とで読み仮名を付与し
て、仮にバツフアＧに格納され、再処理カウンタAGAIN
に１が格納される。再処理カウンタが０でないことから
（２１５）、再度、変換処理制御規則に基づき変換処理
が行なわれ、変換処理規則番号１１が適用され、「新」
と「規則」とで読み仮名が付与され、バツフアＢに格納
される（第１５図参照）。その後、変換処理制御モジユ
ール２１の処理が終了する。In the “new rule” of the above example, the idiom “new rule” is not registered in the kanji kana dictionary 6, but “new”, “rule”, “new”, “rule”, and “rule” are registered. Then, in the first time, the conversion processing control rule number 5 shown in FIG. 4 is applied, the reading kana is given by “new” and “rule”, and it is temporarily stored in the buffer G, Processing counter AGAIN
1 is stored in. Since the reprocessing counter is not 0 (215), the conversion process is performed again based on the conversion process control rule, the conversion process rule number 11 is applied, and "new" is applied.
And a "rule" are added to the reading kana and stored in the buffer B (see FIG. 15). After that, the process of the conversion process control module 21 ends.

上記例「追加」については、漢字仮名辞書に、「追加」
という熟語が登録されているとすると、第１回目に、変
換処理制御規則番号１が適用され「追加」に読み仮名
「ツイカ」が付与されて、バツフアＢに格納後、変換処
理制御モジユール２１の処理が終了する。For the above example "Add", add "Add" to the Kanji Kana dictionary.
In the first time, the conversion processing control rule number 1 is applied, the phonetic kana “Tsuika” is added to “Add”, the result is stored in the buffer B, and then the conversion processing control module 21 is added. The process ends.

次に、前方語や後方語に対する読み仮名付与方法につい
て、第１２図および第１３図のフローチヤートに基づき
説明する。Next, a method of giving reading kana to the forward word and the backward word will be described with reference to the flowcharts of FIGS. 12 and 13.

前方語に対する読み仮名付与サブルーチンは、第１２図
に示すように、起動されると、再照合カウンタFAGAINに
０を格納し（１１０１）、再処理カウンタAGAINが０か
を判定し（１１０２）して、０ならば１１０４以降の処
理を実行し、０でなければ、再処理カウンタFAGAINに再
照合カウンタAGAINの値を代入（１１０３）後、１１０
４以降の処理を実行する。１１０４では、ＦＩに、第１
番目の読み仮名付与規則（第８図参照）を示す１を格納
する（１１０４）とともに、ＦＪに、第１番目の読み仮
名付与条件を示す１を格納（１１０５）後、リターンコ
ードYRCODEに０を格納（１１０６）して、第ＦＪ番目の
読み仮名付与条件サブルーチンを起動する（１１０
７）。As shown in FIG. 12, the reading kana assigning subroutine for the forward word, when activated, stores 0 in the rematch counter FAGAIN (1101) and determines whether the reprocessing counter AGAIN is 0 (1102). , 0, the processes after 1104 are executed, and if it is not 0, the value of the re-matching counter AGAIN is assigned to the re-processing counter FAGAIN (1103), and then 110
The processing after 4 is executed. At 1104, the FI first
After storing 1 indicating the 1st reading kana provision rule (see FIG. 8) (1104) and storing 1 indicating the 1st reading kana provision condition in FJ (1105), 0 is set to the return code YRCODE. It is stored (1106) and the FJ-th reading kana provision condition subroutine is activated (110).
7).

１１０７を処理中にエラーが生じたかを判定して（１１
０８）、エラーがあれば前方語に対する読み仮名付与サ
ブルーチンの処理を終了し、エラーがなければ、読み仮
名付与規則YRULE(FI,FJ)の値が０か、又はYRULE(FI,FJ)
と、１１０７の処理結果のリターンコードYRCODEとの値
が等しいかを判定して（１１０９）、満足すれば１１１
０以降の処理を実行し、満足しなければ１１１２以降の
処理を実行する。It is determined whether an error has occurred during processing 1107 (11
08), if there is an error, the processing of the reading kana addition subroutine for the forward word is terminated, and if there is no error, the value of the reading kana addition rule YRULE (FI, FJ) is 0 or YRULE (FI, FJ)
And the return code YRCODE of the processing result of 1107 are equal (1109), and if satisfied, 111
The processing after 0 is executed, and if not satisfied, the processing after 1112 is executed.

１１１０では、ＦＪに１を加えた（１１１０）後、第Ｆ
Ｊ番目の読み仮名付与規則条件があるかを判定して（１
１１１）、満足すれば、１１０６以降の処理を繰返し、
満足しなければ、第ＦＩ番目の読み仮名付与実行処理サ
ブルーチンを起動する（１１１５）。１１１５では、付
与した読み仮名データが、第１７図に例示するような、
バツフアＤに格納される。At 1110, after adding 1 to FJ (1110), the Fth
Judge whether there is a Jth reading kana provision rule condition (1
111), if satisfied, repeat the processing after 1106,
If not satisfied, the FIth reading kana addition execution processing subroutine is started (1115). In 1115, the added reading kana data is as shown in FIG.
It is stored in the buffer D.

１１１５の処理中にエラーが発生したかどうかを判定し
て（１１１６）、エラーがあれば、前方語に対する読み
仮名付与サブルーチンの処理を終了し、エラーがなけれ
ば、バツフアＤのデータを、第１８図に例示するよう
な、バツフアＥに移した（１１１７）後、１１１５の処
理で、再照合カウンタFAGAINに０以外の値が格納されて
いないかを判定して（１１１８）、０以外の値が格納さ
れていれば、１１０２以降の処理を繰返し、０が格納さ
れていれば、前方語に対する読み仮名付与処理を終了す
る。It is determined whether or not an error has occurred during the processing of 1115 (1116), and if there is an error, the processing of the reading kana assigning subroutine for the preceding word is ended. If there is no error, the data of buffer D is After moving to the buffer E (1117) as illustrated in the figure, in the processing of 1115, it is determined whether or not a value other than 0 is stored in the re-verification counter FAGAIN (1118). If it is stored, the processing from 1102 onward is repeated, and if 0 is stored, the reading kana addition processing for the forward word is ended.

１１１２で、ＦＩに１を加えた（１１１２）後、第ＦＩ
番目の読み仮名付与規則があるかを判定して（１１１
３）、存在すれば、１１０５以降の処理を繰返し、存在
しなければ、エラーコードERCODEに、２を格納して前方
語に対する読み仮名付与サブルーチンの処理を終了す
る。At 1112, after adding 1 to the FI (1112), the FI
It is judged whether there is the second reading kana assignment rule (111
3) If it exists, the processing after 1105 is repeated. If it does not exist, 2 is stored in the error code ERCODE and the processing of the reading kana addition subroutine for the preceding word is completed.

後方語に対する読み仮名付与サブルーチンは、第１３図
に示すように起動されると、再照合カウンタBAGAINに０
を格納し（１２０１）、パラメータＢＩに、第１番目の
読み仮名付与規則を示す１を格納し（１２０２）、パラ
メータＢＪに、第１番目の読み仮名付与条件を示す１を
格納（１２０３）後、リターンコードYRCODEに０を格納
（１２０４）して、第ＢＪ番目の読み仮名付与条件サブ
ルーチンを起動する（１２０５）。１２０５を処理中
に、エラーが発生したかを判定して（１２０６）、エラ
ーがあれば後方語に対する読み仮名付与サブルーチンの
処理を終了し、エラーがなければ、読み仮名付与規則YR
ULE(BI,BJ)の値が０か、又は、YRULE(BI,BJ)と、１２０
５の処理結果のリターンコードYRCODEとの値が等しいか
を判定して（１２０７）、満足すれば１２０８以降の処
理を実行し、満足しなければ、１２１１以降の処理を実
行する。When the reading kana addition subroutine for the backward word is activated as shown in FIG. 13, the rematch counter BAGAIN is set to 0.
Is stored (1201), the parameter BI is set to 1 indicating the first reading kana provision rule (1202), and the parameter BJ is set to 1 indicating the first reading kana provision condition (1203). , 0 is stored in the return code YRCODE (1204), and the BJ-th reading kana provision condition subroutine is activated (1205). During processing of 1205, it is judged whether an error has occurred (1206), and if there is an error, the processing of the reading kana addition subroutine for the backward word is ended. If there is no error, the reading kana addition rule YR
The value of ULE (BI, BJ) is 0, or YRULE (BI, BJ) is 120
It is determined whether the value of the return code YRCODE of the processing result of 5 is equal (1207), and if satisfied, the processing of 1208 and subsequent steps is executed, and if not satisfied, the processing of 1211 and subsequent steps is executed.

１２０８では、ＢＪに１を加えた（１２０８）後、第Ｂ
Ｊ番目の読み仮名付与条件があるかを判定して（１２０
９）、満足すれば、１２０４以降の処理を繰り返し、満
足しなければ、第ＢＩ番目の読み仮名付与実行処理サブ
ルーチンを起動する（１２１０）。１２１０では、付与
した読み仮名データが、バツフアＤに格納される。In 1208, after adding 1 to BJ (1208),
It is determined whether or not there is a J-th reading kana provision condition (120
9) If it is satisfied, the processing from 1204 onward is repeated, and if it is not satisfied, the BI-th reading kana addition execution processing subroutine is started (1210). At 1210, the added reading kana data is stored in the buffer D.

１２１０の処理中にエラーが発生したかどうかを判定し
て（１２１４）、エラーがあれば、後方語に対する読み
仮名付与サブルーチンの処理を終了し、エラーがなけれ
ば、バツフアＤのデータを、第１９図に例示するよう
な、バツフアＦに移した（１２１５）後、１２１０の処
理で、再照合カウンタBAGAINに０以外の値が格納されて
いないかを判定して（１２１６）、０以外の値が格納さ
れていれば、１２０２以降の処理を繰返し、０が格納さ
れていれば、後方語に対する読み仮名付与処理を終了す
る。It is determined whether or not an error has occurred during the processing of 1210 (1214), and if there is an error, the processing of the reading kana assigning subroutine for the backward word is ended. If there is no error, the data of buffer D is set to the 19th. As shown in the figure, after moving to the buffer F (1215), it is judged in the process of 1210 whether a value other than 0 is stored in the rematch counter BAGAIN (1216). If it is stored, the processing from 1202 onward is repeated, and if 0 is stored, the phonetic provision processing for the backward word is ended.

１２１１で、ＢＩに１を加えた（１２１１）後、第ＢＩ
番目の読み仮名付与規則があるかを判定して（１２１
２）、存在すれば、１２０３以降の処理を繰返し、存在
しなければ、エラーコードERCODEに、３を格納して後方
語に対する読み仮名付与サブルーチンの処理を終了す
る。At 1211, after adding 1 to BI (1211), the BI
It is determined whether there is the second rule for assigning reading kana (121
2) If it exists, the processing after 1203 is repeated, and if it does not exist, 3 is stored in the error code ERCODE and the processing of the reading kana giving subroutine for the backward word is ended.

本実施例では、読み仮名付与規則の条件サブルーチンと
して、漢字仮名辞書に登録されている漢字文字列の中
で、バツフアＡの漢字仮名変換文字列の先頭から、最長
一致するものあるいは、もし、再照合フラグが０以外の
値ならば、最長一致文字数から再照合フラグの値を引い
た文字数のものを取出し、第１６図に示すバツフアＣに
格納後、それが２文字以上の文字列かを判定するサブル
ーチン（第８図条件番号１参照）と、バツフアＣの漢字
文字列が１文字かを判定するサブルーチン（条件番号
２）と、バツフアＣの漢字文字列の中で、五段動詞と認
定できるものがあるかを判定するサブルーチン（条件番
号３）と、バツフアＣの漢字文字列の中で、接頭語と認
定できるものがあるかを判定するサブルーチン（条件番
号４）と、バツフアＣの漢字文字列の中で、接尾語と認
定できるものがあるかを判定するサブルーチン（条件番
号５）と、バツフアＣの漢字文字列の中で、形容詞と認
定できるものがあるかを判定するサブルーチン（条件番
号６）と、バツフアＣの漢字文字列の中で、五段動詞、
接頭語、接尾語、形容詞、音読み、訓読み以外のものが
あるかを判定するサブルーチン（条件番号７）と、バツ
フアＣの漢字文字列が、音読みと訓読みの両方の読みが
あるかを判定するサブルーチン（条件番号８）と、バツ
フアＡ内の最長一致した文字列の直後の文字が漢字かを
判定するサブルーチン（条件番号９）があり、処理実行
後、条件を満足していれば、リターンコードYRCODEに２
が格納され、条件を満足していなければ、YRCODEに１が
格納される。In this embodiment, as a conditional subroutine of the rule for assigning phonetic kana, among the kanji character strings registered in the kanji kana dictionary, the one that has the longest match from the beginning of the kanji kana converted character string of buffer A or if If the collation flag is a value other than 0, the number of characters obtained by subtracting the value of the re-collation flag from the number of longest matching characters is taken out, stored in buffer C shown in FIG. 16, and it is determined whether or not it is a character string. The subroutine (see FIG. 8, condition number 1) and the subroutine (condition number 2) that determines whether the Kanji character string of buffer C is one character, and the Kanji character string of buffer C can be identified as a five-stage verb. There is a subroutine (condition number 3) that determines whether there is something, and a subroutine (condition number 4) that determines whether or not there is one that can be recognized as a prefix in the Kanji character string of buffer C. Subroutine (condition number 5) that determines whether or not there is a kanji character string that can be recognized as a suffix, and subroutine that determines whether or not any of the kanji character strings of buffer C can be recognized as an adjective ( Condition number 6) and the Kanji character string of Bathua C
Subroutine (condition number 7) that determines whether there is anything other than a prefix, suffix, adjective, on-reading, and kun-yomi, and a subroutine that determines whether the Kanji character string of buffer C has both on-yomi and kun-yomi There is a subroutine (condition number 9) that determines whether the character immediately after the longest matching character string in buffer A is Kanji (condition number 9), and if the condition is satisfied after the processing is executed, the return code YRCODE To 2
Is stored, and if the condition is not satisfied, 1 is stored in YRCODE.

又、読み仮名付与規則の実行処理サブルーチンとして、
バツフアＣの漢字文字列の中で、語属性が五段動詞であ
るものを、バツフアＤに格納するサブルーチン（第８図
の規則番号１，７に対応する処理）と、バツフアＣの漢
字文字列の中で、語属性が接頭語であるものを、バツフ
アＤに格納するサブルーチン（規則番号２，８）と、語
属性が接尾語であるものを、バツフアＤに格納するサブ
ルーチン（規則番号３，９）と、バツフアＣの漢字文字
列の中で、語属性が形容詞であるものを、バツフアＤに
格納するサブルーチン（規則番号４，１０）と、バツフ
アＣの漢字文字列の中で、語属性が五段動詞、接頭語、
接尾語、形容詞、音読み、訓読み以外であるものを、バ
ツフアＤに格納するサブルーチン（規則番号５，１１）
と、再照合フラグに１を加算するサブルーチン（規則番
号６）と、バツフアＣの漢字文字列の中で、語属性が音
読みであるものを、バツフアＤに格納するサブルーチン
（規則番号１２）と、バツフアＣの漢字文字列の中で、
語属性が訓読みであるものを、バツフアＤに格納するサ
ブルーチン（規則番号１３）と、語属性が音読み又は訓
読みであるものを、バツフアＤに格納するサブルーチン
（規則番号１４）とがある。尚、各実行処理サブルーチ
ンの処理中、バツフアＤに格納すべきものが複数個存在
する場合は、バツフアＣの各漢字文字列の頻度の最大の
ものを選ぶものとし、頻度が同じならば、バツフアＣの
上段に位置するものを選ぶものとする。Also, as an execution processing subroutine of the reading kana provision rule,
A subroutine (a process corresponding to the rule numbers 1 and 7 in FIG. 8) for storing a character whose word attribute is a five-stage verb in the buffer C's Kanji character string and the buffer C's Kanji character string Among these, a subroutine (rule number 2, 8) for storing the one having the word attribute as the prefix in the buffer D and a subroutine (rule number 3, for storing the one for which the word attribute is the suffix as the suffix) 9) and a subroutine (rule numbers 4 and 10) for storing in the buffer D the character whose word attribute is an adjective in the Chinese character string of buffer C and the word attribute in the Chinese character string of buffer C. Is a five-verb, prefix,
Subroutine for storing in the buffer D anything other than suffixes, adjectives, on-reading and kun-reading (rule numbers 5 and 11)
And a subroutine (rule number 6) for adding 1 to the re-matching flag, and a subroutine (rule number 12) for storing in the buffer D a Kanji character string of the buffer C whose word attribute is on reading. In the character string of Bathua C,
There is a sub-routine (rule number 13) for storing the word attribute of kun-yomi in buffer D, and a subroutine (rule number 14) for storing the word attribute of on-reading or kun-reading in buffer D. During the processing of each execution processing subroutine, if there is more than one to be stored in the buffer D, the one having the highest frequency of each Kanji character string in the buffer C is selected, and if the frequency is the same, the buffer C is selected. The one located in the upper row shall be selected.

又、各条件サブルーチンおよび各実行処理サブルーチン
を処理中に、不合理な事象が発生したら、エラーコード
ERCODEに０以外の数値を格納して処理を終了する。Also, if an unreasonable event occurs during processing of each condition subroutine and each execution subroutine, an error code
Store a value other than 0 in ERCODE and end the process.

〔The invention's effect〕

本発明によれば、次のような効果が得られる。 According to the present invention, the following effects can be obtained.

(1) 漢字仮名混り文の漢字文字列に、語基単位で読み
仮名を付与することができる。(1) You can add reading kana in word base units to the kanji character strings of mixed kanji kana sentences.

(2) 複数個の読み仮名を持つ漢字文字列に対する読み
仮名選定基準を、条件と実行処理とを１組とした規則と
して登録する方式としたことにより、規則の変更や追加
・削除は、規則テーブルの修正、条件サブルーチンの追
加・削除、実行サブルーチンの追加・削除という形態を
とり、変換処理プロシジヤの変更が不要なため、規則の
改良および拡張が容易に行なえる。(2) By using the method of registering the phonetic kana selection criteria for a kanji character string having a plurality of kana characters as a rule with a set of conditions and execution processes, rules can be changed or added / deleted. It takes the form of table correction, addition / deletion of conditional subroutines, addition / deletion of execution subroutines, and since it is not necessary to change the conversion processing procedure, the rules can be easily improved and expanded.

(3) 変換処理手順についても、変換処理制御規則とし
て規則化することにより、規則の改良および拡張が容易
に行なえる。(3) The conversion processing procedure can also be easily improved and expanded by formulating it as a conversion processing control rule.

(4) 変換処理を制御する規則と、読み仮名を付与する
規則とを分離したことにより、漢字仮名変換規則の数を
少なくすることができる。(4) By separating the rule for controlling the conversion process and the rule for giving the reading kana, the number of kanji and kana conversion rules can be reduced.

実施例は、変換処理制御規則が１５個、読み仮名付与規
則が１４個、合計２９個の規則で構成されているが、分
離されていないと、前方語と後方語に読み仮名を付与す
ることから約１００倍の１４×１４×１５＝２９４０個
もの規則を必要とする。In the embodiment, the conversion processing control rules are 15 and the reading rules are 14 in total, 29 rules in total, but if they are not separated, the reading words are given to the forward and backward words. From about 100 times as many as 14 × 14 × 15 = 2940 rules are required.

[Brief description of drawings]

第１図は、本発明の一実施例の漢字仮名変換方式のハー
ドウエア構成図、第２図は、漢字仮名辞書の項目を示す
図、第３図は、漢字仮名辞書の一項目である語属性の例
を示す図、第４図は、漢字仮名変換処理の制御規則の論
理的な構成例を示す図、第５図は、漢字仮名変換制御規
則のメモリ内での格納のされ方を示す図、第６図は、漢
字仮名変換制御規則の各条件に対応するサブルーチン番
号のメモリ内での格納のされ方を示す図、第７図は、漢
字仮名変換制御規則の各実行処理に対応するサブルーチ
ンの番号のメモリ内での格納のされ方を示す図、第８図
は、読み仮名付与規則の論理的な構成例を示す図、第９
図は、本発明実施例のソフトウエア・モジユール構成を
例示する図、第１０図(A)，(B)は、漢字仮名変換メイン
・モジユールの処理手順を示す図、第１１図(A)，(B)
は、変換処理制御規則に基づく、変換処理制御モジユー
ルの処理手順を示す図、第１２図(A)，(B)は、読み仮名
付与規則に基づく、前方語の読み仮名付与処理手順を示
す図、第１３図(A)，(B)は、読み仮名付与規則に基づ
く、後方語の読み仮名付与処理手順を示す図、第１４図
は、バツフアＡの構成を例示した図、第１５図は、バツ
フアＢの構成を例示した図、第１６図は、バツフアＣの
構成を例示した図、第１７図は、バツフアＤの構成を例
示した図、第１８図は、バツフアＥの構成を例示した
図、第１９図は、バツフアＦの構成を例示した図、第２
０図は、バツフアＧの構成を例示した図である。FIG. 1 is a hardware configuration diagram of a kanji / kana conversion system according to an embodiment of the present invention, FIG. 2 is a diagram showing items of a kanji / kana dictionary, and FIG. 3 is a word which is one item of a kanji / kana dictionary. FIG. 4 is a diagram showing an example of attributes, FIG. 4 is a diagram showing an example of a logical configuration of a control rule for Kanji-Kana conversion processing, and FIG. 5 is a diagram showing how the Kanji-Kana conversion control rule is stored in the memory. 6 and 6 show how the subroutine numbers corresponding to the respective conditions of the Kanji-Kana conversion control rule are stored in the memory, and FIG. 7 corresponds to each execution process of the Kanji-Kana conversion control rule. FIG. 8 is a diagram showing how the numbers of the subroutines are stored in the memory, FIG. 8 is a diagram showing a logical configuration example of the reading kana naming rule, and FIG.
FIG. 10 is a diagram illustrating a software module configuration of an embodiment of the present invention, FIGS. 10 (A) and (B) are diagrams showing a processing procedure of a kanji / kana conversion main module, FIG. 11 (A), (B)
Is a diagram showing a processing procedure of the conversion processing control module based on the conversion processing control rule, and FIGS. 12 (A) and 12 (B) are diagrams showing a processing procedure of giving the pronunciation of a forward word based on the reading kana character assignment rule. , FIG. 13 (A) and FIG. 13 (B) are diagrams showing a processing procedure for assigning the pronunciation of a backward word based on the rules for assigning the pronunciation kana, FIG. 14 is a diagram illustrating the configuration of the buffer A, and FIG. FIG. 16 illustrates the configuration of buffer C, FIG. 17 illustrates the configuration of buffer C, FIG. 17 illustrates the configuration of buffer D, and FIG. 18 illustrates the configuration of buffer E. FIG. 19 and FIG. 19 are views exemplifying the structure of the buffer F, and FIG.
FIG. 0 is a diagram illustrating the configuration of the buffer G.

フロントページの続き (72)発明者津田順司神奈川県川崎市麻生区王禅寺1099番地株式会社日立製作所システム開発研究所内 (56)参考文献特開昭56−92677（ＪＰ，Ａ)Front page continued (72) Inventor Junji Tsuda 1099 Ozenji, Aso-ku, Kawasaki-shi, Kanagawa Inside the Hitachi, Ltd. System Development Laboratory (56) References JP-A-56-92677 (JP, A)

Claims

[Claims]

1. A dictionary in which a word consisting of a kanji character string and at least one or more reading kana character strings and word attributes of the word are stored in association with each other, and the word attribute of the word consisting of a kanji character string can be recognized. First storage means for storing a first rule consisting of a condition and a process of selecting a reading kana of a word consisting of the kanji character string in accordance with the condition of the satisfaction of the condition, and at least two input kanji character strings. Means for temporarily dividing into two words and giving a reading kana character string to the words temporarily divided based on the dictionary and the first rule; word attributes of two consecutive words in the temporarily divided kanji character string; Second storage means for storing a second rule consisting of a condition relating to whether or not to give a reading kana, a process of determining a reading kana character string to be executed corresponding to the condition of the satisfaction of the condition, and a process of redoing the temporary division. And the above Means for determining the satisfaction of the condition defined by the second rule for two words, checking the validity of the division of the two consecutive words with reference to the second rule, and dividing by the checking means. And a means for determining the reading kana character string based on the second rule when it is determined to be valid.

2. The kanji / kana conversion device according to claim 1, wherein the means for assigning the kana character string includes the means for assigning the kana when the checking means determines that the tentative division is inappropriate. Based on the processing prescribed in 2 rules,
A kanji / kana conversion device, wherein the temporary division is performed again.

3. The kanji-kana conversion device according to claim 1, wherein the first rule is associated with a plurality of conditions relating to approval or disapproval of word attributes of words, and a condition in which the plurality of conditions are satisfied. An apparatus for converting kana to kana, comprising a plurality of processes, wherein the first storage means stores the first rule in a table format.

4. The kanji / kana conversion device according to claim 1, wherein the second rule includes a plurality of conditions regarding word attributes of two consecutive words and whether or not to give a reading kana.
A kanji / kana conversion device, comprising a plurality of processes associated with the established condition of the plurality of conditions, wherein the second storage means stores the second rule in a table format.