JP3288066B2

JP3288066B2 - Learning processing method and system for network configuration data processing device

Info

Publication number: JP3288066B2
Application number: JP05104292A
Authority: JP
Inventors: 茂美長田; 一繁佐賀; 正雄比屋根
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1992-03-10
Filing date: 1992-03-10
Publication date: 2002-06-04
Anticipated expiration: 2017-06-04
Also published as: JPH05250347A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は，ネットワーク構成のデ
ータ変換機能に従って所望の情報処理装置を学習により
構成するための，ニューロ・コンピュータと呼ばれるよ
うなネットワーク構成データ処理装置の学習処理方法お
よびシステムに関し，特に，正しい事例からだけではな
く，そうであってはならないことを示す反例からでも，
ネットワーク構成データ処理装置を所望の情報処理装置
として構成できるようにしたネットワーク構成データ処
理装置の学習処理方法およびシステムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a learning method and system for a network-structured data processing apparatus such as a neurocomputer for configuring a desired information processing apparatus by learning in accordance with a network-structured data conversion function. , Especially not only from the correct case, but also from counter-examples showing that it should not be,
The present invention relates to a learning processing method and system for a network configuration data processing device that can configure the network configuration data processing device as a desired information processing device.

【０００２】近年，人工知能（ＡＩ），ファクトリオー
トメーション（ＦＡ），オフィスオートメーション（Ｏ
Ａ）の進展に伴い，人間にとって使い易く，人間と共存
できる知的な“柔らかい”システムへの要求が高まって
いる。この期待に応えるため，従来のノイマン型コンピ
ュータと異なるタイプのネットワーク構成の並列分散処
理方式の適応的なデータ処理装置が提案されている。In recent years, artificial intelligence (AI), factory automation (FA), office automation (O)
With the development of A), there is an increasing demand for intelligent "soft" systems that are easy to use for humans and can coexist with humans. In order to meet this expectation, an adaptive data processing apparatus of a parallel distributed processing method having a network configuration different from that of the conventional Neumann computer has been proposed.

【０００３】階層ネットワーク構造をとるデータ処理装
置の内部状態値の学習処理方式の中では，特に，誤差逆
伝播法と呼ばれる学習方式（ D.E.Rumelhart, G.E.Hint
on,and R.J.Williams,"Learning Internal Representat
ions by Error Propagation." PARALLEL DISTRIBUTED
PROCESSING, Vol.1, pp.318-364, The MIT Press,198
6）が，その実用性の高さから注目されている。[0003] Among learning methods of internal state values of a data processing device having a hierarchical network structure, particularly, a learning method (DERumelhart, GEHint) called an error back propagation method.
on, and RJWilliams, "Learning Internal Representat
ions by Error Propagation. "PARALLEL DISTRIBUTED
PROCESSING, Vol.1, pp.318-364, The MIT Press, 198
6) is attracting attention because of its practicality.

【０００４】この階層ネットワーク構造をとるデータ処
理装置では，基本ユニットと呼ぶ一種の演算要素のノー
ドと内部状態値に相当する重み値をもつ内部結合とから
階層ネットワークを構成している。In a data processing apparatus having a hierarchical network structure, a hierarchical network is composed of nodes of a kind of arithmetic element called a basic unit and an internal connection having a weight corresponding to an internal state value.

【０００５】図８に，基本ユニット１の構成を示す。こ
の基本ユニット１は，多入力１出力系となっており，複
数の入力に対し，それぞれの内部結合の重み値を乗算す
る乗算処理部２と，それらの全乗算結果を加算する累算
処理部３と，この累算値に非線形の閾値処理を施して１
つの最終出力を出力する閾値処理部４とを備える。FIG. 8 shows the configuration of the basic unit 1. The basic unit 1 has a multi-input, one-output system, and a multiplication unit 2 for multiplying a plurality of inputs by respective internal connection weights, and an accumulation unit for adding all multiplication results thereof. 3 and this accumulated value is subjected to non-linear threshold processing to obtain 1
A threshold processing unit 4 that outputs two final outputs.

【０００６】そして，このような構成の多数の基本ユニ
ット１が，図９に示すように，入力信号値をそのまま分
配して動作する入力ユニット１−ｈを入力層として，階
層的に接続されることにより階層ネットワークが構成さ
れ，これによって，入力パターン（入力信号）を，対応
する出力パターン（出力信号）に変換するというデータ
処理機能を発揮する。As shown in FIG. 9, a number of basic units 1 having such a configuration are hierarchically connected to each other with an input unit 1-h operating by directly distributing an input signal value as an input layer. As a result, a hierarchical network is formed, thereby exhibiting a data processing function of converting an input pattern (input signal) into a corresponding output pattern (output signal).

【０００７】この階層ネットワークに対し，誤差逆伝播
法では，学習用に用意された入力パターンに対して出力
される階層ネットワークからの出力パターンが教師パタ
ーン（教師信号）となるべく，所定の学習アルゴリズム
に従って，階層ネットワークの内部結合の重み値を決定
していく。そして，この処理により重み値が決定される
と，想定していなかった入力信号が入力されたとして
も，この階層ネットワークから，もっともらしい適切な
出力信号を出力するという“柔らかい”データ処理機能
が実現されることになる。With respect to this hierarchical network, the error backpropagation method uses a predetermined learning algorithm so that an output pattern from the hierarchical network output for an input pattern prepared for learning becomes a teacher pattern (teacher signal). , The weight value of the internal connection of the hierarchical network is determined. When the weight value is determined by this processing, even if an unexpected input signal is input, a "soft" data processing function that outputs a plausible and appropriate output signal from this hierarchical network is realized. Will be done.

【０００８】このような階層ネットワーク構造のデータ
処理装置に代表されるネットワーク構成のデータ処理装
置を使用して，所望の情報処理装置を構築していくため
には，データ処理機能を規定するネットワーク構造の内
部状態値を所望の情報処理機能が実現されるべく学習処
理により求めていく必要がある。In order to construct a desired information processing apparatus by using a data processing apparatus having a network configuration represented by such a data processing apparatus having a hierarchical network structure, a network structure for defining a data processing function is required. It is necessary to obtain the internal state value by learning processing so that a desired information processing function is realized.

【０００９】[0009]

【従来の技術】階層ネットワーク構造をとるデータ処理
装置では，ｈ層を前段層とし，ｉ層を後段層とすると，
基本ユニットの累算処理部では下記の式(1) の演算を実
行し，閾値処理部では下記の式(2) の演算を実行する。2. Description of the Related Art In a data processing device having a hierarchical network structure, if an h layer is a former layer and an i layer is a latter layer,
The accumulation processing section of the basic unit executes the operation of the following equation (1), and the threshold processing section executes the operation of the following equation (2).

【００１０】ｘ_pi＝Σｙ_phｗ_ih （Σはｈの総和） (1) ｙ_pi＝１／（１＋exp(−ｘ_pi＋θ_i) ） (2) 但し，ｈ：ｈ層のユニット番号ｉ：ｉ層のユニット番号ｐ：入力信号のパターン番号 θ_i：ｉ層のｉ番ユニットの閾値ｗ_ih：ｈ−ｉ層の内部結合の重みｘ_pi：ｈ層の各ユニットからｉ層のｉ番ユニットへの入
力の積和ｙ_ph：ｐ番目パターンの入力信号に対するｈ層のｈ番ユ
ニットからの出力ｙ_pi：ｐ番目パターンの入力信号に対するｉ層のｉ番ユ
ニットからの出力誤差逆伝播法では，この重み値ｗ_ihと閾値θ_iとを誤差
のフィードバックにより適応的に自動調節して学習する
ことになる。この式(1) ，(2) から明らかなように，重
み値ｗ_ihと閾値θ_iとの調節は同時に実行される必要が
あるが，この作業は，相互に干渉する難しい作業とな
る。そこで，本出願人は，先に出願の「特願昭 62-3334
84号（昭和62年12月28日出願，“ネットワーク構成デー
タ処理装置”）」で開示したように，入力側のｈ層に常
に“１”を出力するとともにその出力に対して閾値θ_i
を重み値として割りつける基本ユニット１を設けること
で，閾値θ_iを重み値ｗ_ihの中に組み込んで，閾値θ_i
を重み値として扱うようにすることを提案した。こうす
ることによって，上述の式(1) および式(2) は，ｘ_pi＝Σｙ_phｗ_ih （Σはｈの総和） (3) ｙ_pi＝１／（１＋exp(−ｘ_pi) ） (4) 但し，ｈ：ｈ層のユニット番号ｉ：ｉ層のユニット番号ｐ：入力信号のパターン番号ｗ_ih：ｈ−ｉ層の内部結合の重みｘ_pi：ｈ層の各ユニットからｉ層のｉ番ユニットへの入
力の積和ｙ_ph：ｐ番目パターンの入力信号に対するｈ層のｈ番ユ
ニットからの出力ｙ_pi：ｐ番目パターンの入力信号に対するｉ層のｉ番ユ
ニットからの出力と表されることになる。[0010] x _pi = Σy _ph w _ih (the sum of Σ is _{h) (1) y pi =} 1 / (1 + exp (-x pi + θ i)) (2) However, h: unit of the h layer number i: i Layer unit number p: Input signal pattern number θ _i : Threshold of the i-th unit of the i-th layer w _ih : Weight of the internal connection of the h-i layer x _pi : From each unit of the h-th layer to the i-th unit of the i-th layer Y _ph : The output from the h-th unit in the h-th layer for the p-th pattern input signal y _pi : The output from the i-th unit in the i-th layer for the p-th pattern input signal The weight value w _ih and the threshold value θ _i are adaptively and automatically adjusted based on error feedback, and learning is performed. As is clear from the equations (1) and (2), the adjustment of the weight value w _ih and the threshold value θ _i needs to be performed simultaneously, but this operation is a difficult operation that interferes with each other. Therefore, the applicant of the present application has filed the application “Japanese Patent Application No. 62-3334
No. 84 (filed on December 28, 1987, "Network Configuration Data Processing Apparatus"), "1" is always output to the h layer on the input side, and a threshold θ _i
The by providing the basic unit 1 allocates a weight value, by incorporating a threshold theta _i in the weight value w _ih, threshold theta _i
It is proposed to treat as a weight value. By doing so, the above-mentioned formula (1) and _{_{_{(2), x pi = Σy ph w ih}}} (Σ is the sum of _{h) (3) y pi =} 1 / (1 + exp (-x pi)) (4 ) Where h: unit number of the h layer i: unit number of the i layer p: pattern number of the input signal w _ih : weight of the internal connection of the hi layer x _pi : number i of the i layer from each unit of the h layer Y _ph : output from the h-th unit in the h-th layer for the p-th pattern input signal y _pi : output from the i-th unit in the i-th layer for the p-th pattern input signal become.

【００１１】誤差逆伝播法では，重み値の学習処理のた
めに，最初に，次式に従い，出力層（ｊ層とする）から
の出力パターンと該出力パターンのとるべき信号となる
教師パターンとの誤差の二乗和である誤差ベクトルＥ_p
を求めて，その誤差ベクトルＥ_pの総和Ｅを階層ネット
ワークのデータ処理機能の誤差として算出する。In the error backpropagation method, in order to perform a weight value learning process, an output pattern from an output layer (referred to as a j-layer) and a teacher pattern to be a signal to be taken by the output pattern are firstly calculated according to the following equation. Error vector E _p, which is the sum of squares of the error
The seeking, calculates the total sum E of the error vector E _p as the error of the data processing functions of the hierarchical network.

【００１２】Ｅ＝ΣＥ_p＝ΣΣ（ｙ_pj−ｄ_pj）²／２ (5) 但し，Ｅ_pのΣはｐの総和，右辺のΣΣはそれぞれｐ，
ｊの総和Ｅ_p：ｐ番目パターンの入力信号に対しての誤差ベクト
ルｙ_pj：ｐ番目パターンの入力信号に対するｊ層のｊ番ユ
ニットからの出力ｄ_pj：ｐ番目パターンの入力信号に対するｊ層のｊ番ユ
ニットへの教師信号次に，階層ネットワークがｈ層−ｉ層−ｊ層の３層構造
をとる例で説明するならば，次式に従い，誤差ベクトル
の総和Ｅを漸近的に０とすべく重み値の１回当たりの更
新量Δｗ_jiとΔｗ_ihを算出する。[0012] _{E = ΣE p = ΣΣ (y} pj -d pj) 2/2 (5) where the sum of the Σ of E _p p, respectively Shigumashiguma the right side p,
j sum E _p: p-th pattern error vector y _pj with respect to the input signal: p th output of the j-th unit of the j layer for the input signal patterns d _pj: j-layer with respect to the input signal of the p-th pattern Teacher signal to the j-th unit Next, assuming that the hierarchical network has a three-layer structure of h layer-i layer-j layer, the sum E of error vectors is asymptotically set to 0 according to the following equation. in order to calculate the update amount [Delta] w _ji and [Delta] w _ih per one weight value.

【００１３】 Δｗ_ji(t) ＝−ε∂Ｅ(t) ／∂ｗ_ji(t) ＋ζｗ_ji(t−1) (6) Δｗ_ih(t) ＝−ε∂Ｅ(t) ／∂ｗ_ih(t) ＋ζｗ_ih(t−1) (7) 但し， ε ：学習定数 ζ ：モーメンタムｔ：更新回数この重み値の更新を反復することにより，誤差ベクトル
の総和Ｅを極小値に収束させ，そのときの重み値ｗ_ji，
ｗ_ih（閾値θ_i，θ_jも含まれる）を学習するようにし
ている。[0013] _{Δw ji (t) = -ε∂E (} t) / ∂w ji (t) + ζw ji (t-1) (6) Δw ih (t) = -ε∂E (t) / ∂w ih _{(t) + ζw ih (t} -1) (7) where, epsilon: learning constant zeta: momentum t: by repeating the updating of the number of updates the weight value, converges the sum E of the error vector to the minimum value, the _Time weight value w _ji ,
w _ih (threshold _values θ _i and θ _j are also included) is learned.

【００１４】この誤差逆伝播法による重み値の学習処理
では，用意された学習用の学習信号（入力パターンと教
師パターンとの対）のグループに対して誤差逆伝播法を
適用していけば，誤差ベクトルの総和Ｅを極小値に収束
できることが証明されている。すなわち，教師パターン
に相当する出力パターンを出力できるようになる。In the weight value learning process by the error backpropagation method, if the error backpropagation method is applied to a group of prepared learning signals (pairs of input pattern and teacher pattern) for learning, It has been proven that the sum E of the error vectors can be converged to a minimum value. That is, an output pattern corresponding to the teacher pattern can be output.

【００１５】このように，誤差逆伝播法に代表されるよ
うなネットワーク構成のデータ処理装置におけるネット
ワーク構造の重み値の学習処理方式にあっては，学習用
の学習信号を予め用意して重み値の学習を実行するよう
に処理することになる。As described above, in the learning processing method of the weight value of the network structure in the data processing apparatus having the network configuration represented by the error back propagation method, a learning signal for learning is prepared in advance and the weight value is adjusted. Is performed to execute learning.

【００１６】したがって，ネットワーク構成のデータ処
理装置で所望の情報処理装置を構築する場合，従来で
は，予め充分な数の学習信号，すなわち，入力パターン
と“望ましい”出力パターンを意味する教師パターンを
用意し，重み値の学習を実行するように処理していた。Therefore, when a desired information processing apparatus is constructed by a data processing apparatus having a network configuration, a sufficient number of learning signals, that is, an input pattern and a teacher pattern meaning a "desirable" output pattern are prepared in advance. In this case, the weight value learning is performed.

【００１７】[0017]

【発明が解決しようとする課題】しかしながら，学習信
号として入力パターンとそれに対する“望ましい”出力
パターン，すなわち正しい事例のみを扱うこのような従
来技術では，ある入力パターンに対して，こうであって
はならないといった否定の形で与えられる“望ましくな
い”出力パターン，すなわち，いわゆる反例を学習でき
ないという問題点があった。However, in such a conventional technique which handles only an input pattern and a "desirable" output pattern corresponding to the input signal as a learning signal, that is, only a correct case, for a certain input pattern, There is a problem in that it is not possible to learn an "undesirable" output pattern given in a negative form, that is, no counterexample.

【００１８】実際に現実世界における情報処理を考えて
みた場合，ある入力パターンに対して，必ずしも“望ま
しい”出力パターンが定義できるとは限らず，ある入力
パターンに対しては，むしろ“望ましくない”出力パタ
ーンを定義する方がはるかに簡単であることがある。When actually considering information processing in the real world, it is not always possible to define a “desirable” output pattern for a certain input pattern, but rather “undesirable” for a certain input pattern. Defining output patterns can be much easier.

【００１９】本発明はかかる事情に鑑みてなされたもの
であって，学習信号として正しい事例のみならず反例も
使用することによって，所望の高機能な情報処理装置を
構築できるようにする新たなネットワーク構成データ処
理装置の学習処理方式を提供することを目的とする。The present invention has been made in view of such circumstances, and a new network for constructing a desired high-performance information processing apparatus by using not only correct examples but also counterexamples as learning signals. An object of the present invention is to provide a learning processing method for a configuration data processing device.

【００２０】[0020]

【課題を解決するための手段】図１は，本発明の原理説
明図である。図中，１０はネットワーク構成データ処理
装置であって，所望の情報処理装置を構築するもの，１
１はネットワーク構成データ処理装置１０が備えるパタ
ーン変換部であって，ネットワーク構造を備えて該ネッ
トワーク構造とその構造のもつ内部状態値とにより規定
されるデータ変換機能に従って，入力パターンに対応す
る出力パターンを算出し出力するもの，１２はネットワ
ーク構成データ処理装置１０が備える内部状態値格納部
であって，パターン変換部１１がデータ変換処理の実行
時に必要とする内部状態値を管理するものである。FIG. 1 is a diagram illustrating the principle of the present invention. In the figure, reference numeral 10 denotes a network configuration data processing device for constructing a desired information processing device;
Reference numeral 1 denotes a pattern conversion unit provided in the network configuration data processing device 10, which has a network structure and outputs an output pattern corresponding to an input pattern according to a data conversion function defined by the network structure and an internal state value of the structure. Numeral 12 is an internal state value storage unit provided in the network configuration data processing device 10 for managing an internal state value required when the pattern conversion unit 11 executes data conversion processing.

【００２１】このパターン変換部１１が階層ネットワー
ク構造をとるときには，複数の入力とこれらの入力に対
して乗算されるべき内部状態値（重み値）とを受け取っ
て積和を得るとともに，この得られた積和値を閾値関数
によって変換して最終出力を得るよう処理する基本ユニ
ット１を基本単位として，次のように構成される。When the pattern converter 11 has a hierarchical network structure, it receives a plurality of inputs and an internal state value (weight value) to be multiplied with these inputs to obtain a product sum, and obtains the product sum. The basic unit 1 that converts the sum of product values by a threshold function to obtain a final output is configured as a basic unit as follows.

【００２２】入力パターンを受け取って出力する複数個
の入力ユニット１−ｈにより入力層が構成され，複数個
の基本ユニット１−ｉにより１つまたは複数段設けられ
る中間層が構成され，複数個の基本ユニット１−ｊによ
り出力層が構成されるとともに，入力ユニット１−ｈと
基本ユニット１−ｉとの間，基本ユニット１−ｉの相互
間，基本ユニット１−ｉと基本ユニット１−ｊとの間で
接続がなされ，かつ，この各接続に対応して設定される
内部状態値に従って，階層ネットワーク構造が実現され
る。An input layer is constituted by a plurality of input units 1-h for receiving and outputting an input pattern, and an intermediate layer provided in one or a plurality of stages is constituted by a plurality of basic units 1-i. An output layer is formed by the basic units 1-j, and between the input units 1-h and the basic units 1-i, between the basic units 1-i, between the basic units 1-i and the basic units 1-j. Are connected to each other, and a hierarchical network structure is realized according to an internal state value set corresponding to each connection.

【００２３】２０は学習パターン提示装置であって，学
習対象の入力パターン群をパターン変換部１１に提示す
るとともに，各入力パターンに対してとるべき“望まし
い”パターン（これを実教師パターンという），とって
はならない“望ましくない”パターン（これを反面教師
パターンという）等を示す複数種類からなる教師パター
ン群を次に示す内部状態値学習装置３０に提示するも
の，３０は例えば誤差逆伝播法を実装する内部状態値学
習装置であって，学習パターン提示装置２０による入力
パターンの提示に従って出力されるパターン変換部１１
からの出力パターン群と，学習パターン提示装置２０か
ら提示される実教師パターン群や反面教師パターン群等
の複数種類からなる教師パターン群との間に誤差を定義
するとともに，それらの教師パターン群の種類に応じて
誤差値を算出し，その誤差値に基づいて内部状態値を順
次更新していくものである。Reference numeral 20 denotes a learning pattern presenting apparatus which presents a group of input patterns to be learned to the pattern conversion unit 11 and a "desirable" pattern to be taken for each input pattern (this is called an actual teacher pattern). A plurality of types of teacher pattern groups showing “undesirable” patterns (which are referred to as “teacher patterns”) which should not be taken are presented to the internal state value learning device 30 shown below. An internal state value learning device to be mounted, the pattern conversion unit 11 being output in accordance with the presentation of an input pattern by the learning pattern presentation device 20.
An error is defined between the output pattern group from the learning pattern group and a plurality of types of teacher pattern groups such as a real teacher pattern group and a reverse teacher pattern group presented by the learning pattern presentation device 20. An error value is calculated according to the type, and the internal state value is sequentially updated based on the error value.

【００２４】この内部状態値学習装置３０により，例え
ばパターン変換部１１からの出力パターン群と反面教師
パターン群との誤差はできるだけ大きく保ちながら，パ
ターン変換部１１からの出力パターン群と実教師パター
ン群との誤差値が許容範囲となる内部状態値を学習して
いくことにより，正しい事例（実教師パターン群）のみ
ならず，反例（反面教師パターン群）からもネットワー
ク構成データ処理装置１０を所望の高機能な情報処理装
置として構築できるようになる。By the internal state value learning device 30, for example, the error between the output pattern group from the pattern conversion unit 11 and the teacher pattern group is kept as large as possible, while the output pattern group from the pattern conversion unit 11 and the actual teacher pattern group are maintained. By learning the internal state value in which the error value of the network configuration data is within an allowable range, the network configuration data processing device 10 can be set to a desired state not only from a correct case (actual teacher pattern group) but also from a counter example (opposite teacher pattern group). It can be constructed as a high-performance information processing device.

【００２５】[0025]

【作用】本発明では，学習パターン提示装置２０が，実
教師パターン，反面教師パターン等の複数種類の教師パ
ターン群をパターン変換部１１に提示し，内部状態値学
習装置３０が教師パターン群の種類に応じて，きめ細か
な内部状態値の更新，学習を実行する。According to the present invention, the learning pattern presenting device 20 presents a plurality of types of teacher pattern groups, such as a real teacher pattern and a reverse teacher pattern, to the pattern converter 11, and the internal state value learning device 30 displays the type of the teacher pattern group. In accordance with, detailed internal state value updating and learning are executed.

【００２６】このように，本発明によれば，学習パター
ンとして，学習対象の入力パターンとそれに対する“望
ましい”出力パターンの組からなる正しい事例群からだ
けではなく，学習対象の入力パターンとそれに対する
“望ましくない”出力パターンの組からなる反例群等の
複数種類の事例群からも，ネットワーク構成データ処理
装置１０を所望の情報処理装置として構築できる。As described above, according to the present invention, the learning pattern includes not only the correct case group including the set of the input pattern to be learned and the "desirable" output pattern for the learning pattern, but also the input pattern to be learned and the corresponding The network configuration data processing apparatus 10 can be constructed as a desired information processing apparatus from a plurality of types of case groups such as a group of counterexamples including a set of “undesirable” output patterns.

【００２７】また，学習パターン提示装置２０がパター
ン変換部１１に提示する教師パターン群の提示方法（学
習カリキュラム）を，例えばまず最初に，実教師パター
ン群を提示し，パターン変換部１１からの出力パターン
群と実教師パターン群との誤差がある許容範囲となる内
部状態値を内部状態値学習装置３０で学習した後，反面
教師パターン群を提示し，パターン変換部１１からの出
力パターン群と反面教師パターン群との誤差を大きく保
ちながら，実教師パターン群との誤差をさらに厳しい許
容範囲内のものとするように内部状態値学習装置３０で
学習するように，段階的に提示していくことによって，
ネットワーク構成データ処理装置１０の内部状態値の更
新回数，すなわち，学習量を低減できるので，学習効率
を向上させることができる。The method of presenting a teacher pattern group (learning curriculum) that the learning pattern presenting device 20 presents to the pattern conversion unit 11 is, for example, first, an actual teacher pattern group is presented, and the output from the pattern conversion unit 11 is output. The internal state value learning device 30 learns an internal state value within an allowable range having an error between the pattern group and the actual teacher pattern group, and then presents a reverse teacher pattern group. While the error with the teacher pattern group is kept large, it is presented stepwise so that the internal state value learning device 30 learns so that the error with the actual teacher pattern group is within a stricter allowable range. By
Since the number of updates of the internal state value of the network configuration data processing device 10, that is, the learning amount can be reduced, the learning efficiency can be improved.

【００２８】[0028]

【実施例】以下，階層ネットワーク構成をとるデータ処
理装置により構築される情報処理装置の例に従って，本
発明の実施例を詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be described below in detail with reference to an example of an information processing device constructed by a data processing device having a hierarchical network configuration.

【００２９】図２に，本発明の一実施例のシステム構成
図を示す。図中，６０は学習機能を備える階層ネットワ
ーク構成のデータ処理装置であって，ある入力パターン
が入力されたときに出力パターンを算出して出力するも
の，７０は学習信号管理提示装置であって，階層ネット
ワーク構成データ処理装置６０を所望の情報処理装置と
して構成するために必要となる学習信号（入力パターン
と実教師パターン，反面教師パターン等からなる教師パ
ターンとの対で構成される）を管理するとともに，その
学習信号を階層ネットワーク構成データ処理装置６０の
学習機能に対して提示するものである。FIG. 2 shows a system configuration diagram of an embodiment of the present invention. In the figure, reference numeral 60 denotes a data processing device having a hierarchical network configuration having a learning function, which calculates and outputs an output pattern when a certain input pattern is input, and 70 denotes a learning signal management / presentation device. A learning signal (composed of a pair of an input pattern and a teacher pattern including an actual teacher pattern and a teacher pattern) necessary for configuring the hierarchical network configuration data processing device 60 as a desired information processing device is managed. At the same time, the learning signal is presented to the learning function of the hierarchical network configuration data processing device 60.

【００３０】図３に階層ネットワーク構成データ処理装
置６０の詳細な一構成例を示す。図中，６１は図１で説
明したような階層ネットワーク構造をとる階層ネットワ
ーク部であって，重み値により規定されるデータ変換機
能に従って入力パターンに対応する出力パターンを算出
して出力するもの，６２はパターン入力部であって，学
習信号管理提示装置７０から提示される学習信号の内の
入力パターンを階層ネットワーク部６１に入力するとと
もに教師パターン（ここでは実教師パターン，反面教師
パターンの２種類を例にとる）を後述する重み値変更部
６６に入力するもの，６３はパターン出力部であって，
階層ネットワーク部６１からの出力パターンを重み値変
更部６６に出力するもの，６４は学習パラメータ格納部
であって，誤差逆伝播法の学習パラメータε，ζ等を格
納するもの，６５は重み値格納部であって，階層ネット
ワーク部６１の内部結合の重み値を管理するもの，６６
は重み値変更部であって，パターン入力部６２からの教
師パターン（実教師パターン，反面教師パターン）とパ
ターン出力部６３からの出力パターンとを受けて，階層
ネットワーク部６１の内部結合の重み値を，目的とする
収束値に近づくように更新，学習するとともに，学習中
の重み値に基づく実教師パターン群および反面教師パタ
ーン群に対する誤差値を学習信号管理提示装置７０に出
力するものである。FIG. 3 shows a detailed configuration example of the hierarchical network configuration data processing device 60. In the figure, reference numeral 61 denotes a hierarchical network unit having a hierarchical network structure as described in FIG. 1, which calculates and outputs an output pattern corresponding to an input pattern according to a data conversion function defined by a weight value; Is a pattern input unit for inputting an input pattern of a learning signal presented from the learning signal management / presentation device 70 to the hierarchical network unit 61 and for inputting a teacher pattern (here, two types of a real teacher pattern and a reverse teacher pattern). ) Is input to a weight value changing unit 66 described later, and 63 is a pattern output unit.
A learning parameter storage unit for outputting an output pattern from the hierarchical network unit 61 to the weight value changing unit 66, a learning parameter storage unit 64 for storing learning parameters ε, ζ, etc. of the backpropagation method, and 65, a weight value storage unit , Which manages the weight value of the internal connection of the hierarchical network unit 61, 66
Is a weight value changing unit, which receives a teacher pattern (actual teacher pattern, on the other hand, a teacher pattern) from the pattern input unit 62 and an output pattern from the pattern output unit 63, and receives the weight value of the internal connection of the hierarchical network unit 61. Are updated and learned so as to approach the target convergence value, and error values for the actual teacher pattern group and the reverse teacher pattern group based on the weight value during learning are output to the learning signal management / presentation device 70.

【００３１】図４に，学習信号管理提示装置７０の詳細
な一構成例を示す。図中，７２は学習信号格納部であっ
て，学習信号（入力パターンと実教師パターンの対，あ
るいは入力パターンと反面教師パターンの対で構成され
る）を実教師パターンと反面教師パターンとを区別でき
る形で格納するもの，７３は学習信号提示部であって，
学習信号格納部７２から学習信号を読み出して階層ネッ
トワーク構成データ処理装置６０に提示するもの，７４
は学習パラメータ設定部であって，階層ネットワーク構
成データ処理装置６０に対して学習パラメータε，ζを
設定するもの，７５は学習収束判定部であって，階層ネ
ットワーク構成データ処理装置６０からの学習中の重み
値に基づく実教師パターン群および反面教師パターン群
に対する誤差値を受けて，その誤差値が許容範囲内に入
ったか否かを判定してその判定結果を学習信号提示部７
３に通知するものである。FIG. 4 shows a detailed configuration example of the learning signal management and presentation device 70. In the figure, reference numeral 72 denotes a learning signal storage unit which discriminates a learning signal (consisting of a pair of an input pattern and a real teacher pattern or a pair of an input pattern and a reverse teacher pattern) between a real teacher pattern and a reverse teacher pattern. What is stored in a form that can be stored, 73 is a learning signal presentation unit,
Reading a learning signal from the learning signal storage unit 72 and presenting it to the hierarchical network configuration data processing device 60;
Is a learning parameter setting unit that sets learning parameters ε and に対して for the hierarchical network configuration data processing device 60, and 75 is a learning convergence determination unit that is performing learning from the hierarchical network configuration data processing device 60. Receiving the error values for the actual teacher pattern group and the reverse teacher pattern group based on the weight values of the two, determines whether or not the error value is within an allowable range, and outputs the determination result to the learning signal presentation unit 7.
3 is notified.

【００３２】次に，このように構成される本発明の実施
例が実行する，階層ネットワーク構成データ処理装置６
０を所望の情報処理装置として構築するための処理内容
について詳細に説明する。Next, the hierarchical network configuration data processing device 6 executed by the embodiment of the present invention configured as described above.
A detailed description will be given of the contents of processing for constructing a desired information processing apparatus as 0.

【００３３】階層ネットワーク構成データ処理装置６０
の入力パターンがｌ個の要素からなるベクトルＩ_p＝
（ｉ_p1，ｉ_p2，…，ｉ_pl）で，出力パターンがｍ個の要
素からなるベクトルＯ_p＝（ｏ_p1，ｏ_p2，…，ｏ_pm）で
ある場合を想定する。Hierarchical network configuration data processing device 60
Is a vector I _p =
_{_{(I p1, i p2, ...}} , i pl) in the output pattern vectors _{_{O p = (o p1, o}} p2, ..., o pm) consisting of m elements assume a case is.

【００３４】この階層ネットワーク構成データ処理装置
６０を所望の情報処理装置として構築しようとする設計
者は，まず最初に，所望の情報処理装置の入出力関係を
規定するための教師パターン群を作成する。入力パター
ンＩ_yp＝（ｉ_yp1，ｉ_yp2，…，ｉ_ypl）に対する実教
師パターン群として，Ｔ_yp＝（ｔ_yp1，ｔ_yp2，…，ｔ
_ypm）を，入力パターンＩ_np＝（ｉ_np1，ｉ_np2，…，
ｉ_npl）に対する反面教師パターン群として，Ｔ_np＝
（ｔ_np1，ｔ_np2，…，ｔ_npm）を用意する。A designer who intends to construct the hierarchical network configuration data processing device 60 as a desired information processing device first creates a teacher pattern group for defining the input / output relationship of the desired information processing device. . As a real teacher pattern group for the input pattern I _yp = (i _yp1 , i _yp2 ,..., I _ypl ), T _yp = (t _yp1 , t _yp2,.
the _ypm), the input pattern _{_{I np = (i np1, i}} np2, ...,
i _npl ), on the other hand, T _np =
(T _np1 , t _np2 ,..., T _npm ) are prepared.

【００３５】この階層ネットワーク構成データ処理装置
６０の入出力パターンの要素数に従って，階層ネットワ
ーク構成データ処理装置６０の階層ネットワーク部６１
として，入力パターンの要素数と一致する入力ユニット
数からなる入力層と，出力パターンの要素数と一致する
基本ユニット数からなる出力層と，実用的な段数と基本
ユニット数をもつ中間層から構成されるものを用意す
る。以下では，中間層は１段のものを想定する。In accordance with the number of elements of the input / output pattern of the hierarchical network configuration data processing device 60, the hierarchical network section 61 of the hierarchical network configuration data processing device 60
Consists of an input layer consisting of the number of input units matching the number of elements in the input pattern, an output layer consisting of the number of basic units matching the number of elements of the output pattern, and an intermediate layer having a practical number of stages and the number of basic units Prepare what is to be done. In the following, it is assumed that the intermediate layer has one stage.

【００３６】このようにして，階層ネットワーク部６１
の準備を完了すると，続いて，設計者は，作成した教師
パターン群を学習信号管理提示装置７０の学習信号格納
部７２に格納する。Thus, the hierarchical network unit 61
After completing the preparation, the designer stores the created teacher pattern group in the learning signal storage unit 72 of the learning signal management / presentation device 70.

【００３７】学習信号管理提示装置７０に教師パターン
群が格納されると，階層ネットワーク構成データ処理装
置６０を所望の情報処理装置として機能させようとする
設計者は，次に，学習信号管理提示装置７０に対し，格
納された教師パターン群を学習信号として，階層ネット
ワーク部６１の重み値の学習に入るように指示する。When the group of teacher patterns is stored in the learning signal management / presentation device 70, the designer who wants the hierarchical network configuration data processing device 60 to function as a desired information processing device is next required. It instructs 70 to start learning the weight value of the hierarchical network unit 61 using the stored teacher pattern group as a learning signal.

【００３８】この指示を受け取ると，学習信号管理提示
装置７０の学習信号提示部７３は，学習信号格納部７２
から実教師パターン群および反面教師パターン群を１つ
ずつ読み出して，学習信号として階層ネットワーク構成
データ処理装置６０に提示していくように処理する。Upon receiving this instruction, the learning signal presenting unit 73 of the learning signal managing and presenting device 70 sends the learning signal storing unit 72
, The real teacher pattern group and the reverse teacher pattern group are read one by one, and are processed so as to be presented to the hierarchical network configuration data processing device 60 as a learning signal.

【００３９】この学習信号の提示を受け取ると，階層ネ
ットワーク構成データ処理装置６０のパターン入力部６
２は，提示される学習信号の入力パターンを階層ネット
ワーク部６１に対して入力していくとともに，実教師パ
ターンあるいは反面教師パターンを重み値変更部６６に
対して入力する。そして，階層ネットワーク部６１は，
この入力される入力パターンを重み値格納部６５の重み
値を用いてデータ変換していくことにより，対応する出
力パターンを算出してパターン出力部６３に出力するよ
う処理するとともに，パターン出力部６３は，この出力
パターンを重み値変更部６６に対して出力するよう処理
する。When the presentation of the learning signal is received, the pattern input unit 6 of the hierarchical network configuration data processing device 60
2 inputs the input pattern of the presented learning signal to the hierarchical network unit 61 and also inputs the actual teacher pattern or the reverse teacher pattern to the weight value changing unit 66. Then, the hierarchical network unit 61
By performing data conversion on the input pattern to be input using the weight value of the weight value storage unit 65, a corresponding output pattern is calculated and output to the pattern output unit 63, and the pattern output unit 63 is processed. Performs processing to output this output pattern to the weight value changing unit 66.

【００４０】このようにして，パターン入力部６２から
実教師パターンあるいは反面教師パターンを受け取ると
ともに，パターン出力部６３から学習用の入力パターン
対応の出力パターンを受け取ると，重み値変更部６６
は，実教師パターンに対しては，次式に従って実教師パ
ターンと学習用の入力パターン対応の出力パターンとの
間の誤差ベクトルＥ_ypを算出するとともに，提示される
実教師パターン毎に算出されるこの誤差ベクトルＥ_yの
総和を算出する。As described above, when the real teacher pattern or the reverse teacher pattern is received from the pattern input unit 62 and the output pattern corresponding to the learning input pattern is received from the pattern output unit 63, the weight value changing unit 66
Is calculated for an actual teacher pattern according to the following equation, and the error vector E _yp between the actual teacher pattern and the output pattern corresponding to the input pattern for learning is calculated for each presented actual teacher pattern. calculating the sum of the error vector E _y.

【００４１】Ｅ_y＝ΣＥ_yp＝ΣΣ（ｏ_ypj−ｔ_ypj）²／２ (8) 但し，Ｅ_ypのΣはｐの総和，右辺のΣΣはそれぞれｐ，
ｊの総和Ｅ_yp ：ｐ番目の実教師パターンの入力信号に対しての
誤差ベクトルｏ_ypj：ｐ番目の実教師パターンの入力信号に対するｊ
層のｊ番ユニットからの出力ｔ_ypj：ｐ番目の実教師パターンの入力信号に対するｊ
層のｊ番ユニットへの実教師信号同時に，反面教師パターンに対しては，次式に従って反
面教師パターンと学習用の入力パターン対応の出力パタ
ーンとの間の誤差ベクトルＥ_npを算出するとともに，提
示される反面教師パターン毎に算出されるこの誤差ベク
トルＥ_nの総和を算出する。[0041] _{_{E y = ΣE yp = ΣΣ (}} o ypj -t ypj) 2/2 (8) However, the sum of Σ of E _yp p, respectively ΣΣ of the right-hand side p,
j sum E _yp : error vector for input signal of p-th real teacher pattern o _ypj : j for input signal of p-th real teacher pattern
Output from the j-th unit of the layer t _ypj : j for the input signal of the p-th real teacher pattern
The real teacher signal to the j-th unit of the layer. At the same time, for the reverse teacher pattern, the error vector _Enp between the reverse teacher pattern and the output pattern corresponding to the input pattern for learning is calculated and presented according to the following equation. is the calculated total sum of the error vector E _n calculated in contrary every teacher pattern.

【００４２】Ｅ_n＝ΣＥ_np＝ΣΣ｛１−（ｏ_npj−ｔ_npj）｝²／２ (9) 但し，Ｅ_npのΣはｐの総和，右辺のΣΣはそれぞれｐ，
ｊの総和Ｅ_np ：ｐ番目の反面教師パターンの入力信号に対して
の誤差ベクトルｏ_npj：ｐ番目の反面教師パターンの入力信号に対する
ｊ層のｊ番ユニットからの出力ｔ_npj：ｐ番目の反面教師パターンの入力信号に対する
ｊ層のｊ番ユニットへの反面教師信号 (9) 式は，反面教師パターンと学習用の入力パターン対
応の出力パターンとの距離の逆数を誤差ベクトルとした
もので，例えば，(10)式のように，誤差ベクトルを定義
することもできる。[0042] _{_{E n = ΣE np = ΣΣ {}} 1- (o npj -t npj)} 2/2 (9) where the sum of the Σ of E _np p, respectively Shigumashiguma the right side p,
Sum of j E _np : error vector for input signal of p-th _reverse teacher pattern o _npj : output from j-th unit in layer j for input signal of p-th _reverse teacher pattern t _npj : p-th _reverse On the other hand, the inverse teacher signal to the j-th unit in the j-th layer with respect to the input signal of the teacher pattern is expressed by using the error vector as the reciprocal of the distance between the reverse teacher pattern and the output pattern corresponding to the input pattern for learning. , (10), an error vector can be defined.

【００４３】Ｅ_n＝ΣＥ_np＝ΣΣ１／〔１＋ exp｛（ｏ_npj−ｔ_npj）²＋α｝〕 (10) 但し，α：定数そして，重み値変更部６６は，この誤差Ｅ_y，Ｅ_nを用
い，次式に従って重み値の更新量Δｗ_ji(t) ，ｗ_ih(t)
を算出する。[0043] E _n = ΣE _np = ΣΣ1 / _{_{[1+ exp {(o npj -t npj}} ) 2 + α} ] (10) where, alpha: constant Then, the weight value changing unit 66, the error E _y, E _n And the update values of the weight values Δw _ji (t) and w _ih (t) according to the following equations:
Is calculated.

【００４４】 Δｗ_ji(t) ＝ｋ₁Δｗ_yji(t) ＋ｋ₂Δｗ_nji(t) (11) Δｗ_ih(t) ＝ｋ₁Δｗ_yih(t) ＋ｋ₂Δｗ_nih(t) (12) Δｗ_yji(t) ＝−ε∂Ｅ(t) ／∂ｗ_ji(t) ＋ζｗ_ji(t−1) (13) Δｗ_yih(t) ＝−ε∂Ｅ(t) ／∂ｗ_ih(t) ＋ζｗ_ih(t−1) (14) Δｗ_nji(t) ＝−｛−ε' ∂Ｅ(t) ／∂ｗ_ji(t) ＋ζ' ｗ_ji(t−1)｝ (15) Δｗ_nih(t) ＝−｛−ε' ∂Ｅ(t) ／∂ｗ_ih(t) ＋ζ' ｗ_ih(t−1)｝ (16) 但し， Δｗ_yji(t) ，Δｗ_yih(t) ：実教師パターン群に基づ
く重み値の更新量 Δｗ_nji(t) ，Δｗ_nih(t) ：反面教師パターン群に基
づく重み値の更新量ｋ₁，ｋ₂ ：定数 ε，ε' ：学習定数 ζ，ζ' ：モーメンタムｔ：更新回数この算出した更新量に従って，次の更新サイクルのため
の重み値である，ｗ_ji(t) ＝ｗ_ji(t−1)＋Δｗ_ji(t) (17) ｗ_ih(t) ＝ｗ_ih(t−1)＋Δｗ_ih(t) (18) を求めて重み値格納部６５に格納する。[0044] _{_{Δw ji (t) = k 1}} Δw yji (t) + k 2 Δw nji (t) (11) Δw ih (t) = k 1 Δw yih (t) + k 2 Δw nih (t) (12) Δw _{yji (t) = -ε∂E (t} ) / ∂w ji (t) + ζw ji (t-1) (13) Δw yih (t) = -ε∂E (t) / ∂w ih (t) + ζw _{ih (t-1) (14} ) Δw nji (t) = - {- ε '∂E (t) / ∂w ji (t) + ζ' w ji (t-1)} (15) Δw nih (t) = - {- ε '∂E ( t) / ∂w ih (t) + ζ' w ih (t-1)} (16) _{where, Δw yji (t), Δw} yih (t): the real teacher pattern group Update amount of weight value based on Δw _nji (t), Δw _nih (t): update amount of weight value based on teacher pattern group k ₁ , k ₂ : constant ε, ε ': learning constant ζ, ζ': momentum t : according to the updated amount update number of times of the calculation is a weighted value for the next update _{cycle, w ji (t) = w} ji (t-1) + Δw ji (t) (17) w ih (t) = w _{ih (t-1) + Δw} ih (t) (18) the seeking weight value storage section 6 And stores it in.

【００４５】一方，学習信号管理提示装置７０の学習収
束判定部７５は，重み値変更部６６から学習用の入力パ
ターン対応の出力パターン群の実教師パターン群および
反面教師パターン群に対する誤差Ｅ_y，Ｅ_nを受け取る
と，この誤差Ｅ_y，Ｅ_nが許容できるものになったか否
かを判定することにより，学習が収束したのか否かを判
定して，その判定結果を学習信号提示部７３に通知す
る。On the other hand, the learning convergence judging unit 75 of the learning signal management / presentation device 70 outputs the errors E _y , Upon receiving the E _n, the error E _y, by determining whether it is intended to E _n is acceptable, it is determined whether or not the learning has converged, the learning signal presenting unit 73 of the determination result Notice.

【００４６】学習収束判定部７５から非収束の通知を受
け取ると，学習信号提示部７３は，再度同じ実教師パタ
ーン群および反面教師パターン群を学習信号として提示
していく処理を実行することにより，学習の収束を実現
するよう処理していくことになる。Upon receiving the notification of non-convergence from the learning convergence determining unit 75, the learning signal presenting unit 73 executes a process of presenting the same real teacher pattern group and reverse teacher pattern group as learning signals again. Processing will be performed to achieve convergence of learning.

【００４７】このように，本発明では，ネットワーク構
成データ処理装置に対する実教師パターン群および反面
教師パターン群の学習信号に基づいて，内部状態値を決
定することにより，ネットワーク構成データ処理装置を
所望の情報処理装置として機能させることができる。As described above, according to the present invention, the internal state value is determined based on the learning signals of the real teacher pattern group and the reverse teacher pattern group for the network configuration data processing device, thereby enabling the network configuration data processing device to have a desired configuration. It can function as an information processing device.

【００４８】また，最初に，学習信号管理提示装置７０
の学習信号提示部７３が，学習信号格納部７２に格納さ
れた教師パターン群の提示法として，実教師パターン群
のみを学習信号として提示するフェーズ，反面教師パタ
ーン群のみを学習信号として提示するフェーズ，両者を
学習信号として提示するフェーズとを随時必要に応じて
切り替えて提示する方法をとることによって，学習を進
めれば，ネットワーク構成データ処理装置６０の学習の
効率を上げることができる。このフェーズの切り替え
は，所定の設定値に基づいて自動で行ってもよく，また
学習者が学習状況のモニタ機能を有するシミュレータ等
による表示を観測し，適宜行うことができるようにして
もよい。First, the learning signal management / presentation device 70
Is a phase in which only the actual teacher pattern group is presented as a learning signal, and a phase in which only the teacher pattern group is presented as a learning signal, as a method of presenting the teacher pattern group stored in the learning signal storage section 72. By adopting a method in which the phase of presenting both as a learning signal is switched as necessary and presented, the learning efficiency of the network configuration data processing device 60 can be increased if the learning is advanced. The switching of the phase may be performed automatically based on a predetermined set value, or the learner may observe a display by a simulator or the like having a function of monitoring the learning situation and perform the switching appropriately.

【００４９】例えば，最初に，実教師パターン群のみを
使用し，学習収束判定部７５の誤差の許容範囲は緩やか
なものに設定して学習を進め，学習が収束した後に，学
習信号として実教師パターン群と反面教師パターン群を
使用し，学習収束判定部７５の誤差の許容範囲を厳しく
して，学習をさらに進めるという方法をとることによっ
て，ネットワーク構成データ処理装置６０の学習の効率
を上げることができる。For example, first, only the actual teacher pattern group is used, and the learning convergence judging section 75 sets the allowable range of the error to a gradual one, and proceeds with learning. On the other hand, by using a group of patterns and a group of supervised teachers, by using a method in which the tolerance of the error of the learning convergence determination unit 75 is strict and the learning is further advanced, the learning efficiency of the network configuration data processing device 60 is improved. Can be.

【００５０】図５は，本発明による実験結果を示す。本
実験では，階層ネットワーク部６１として，１入力ユニ
ットからなる入力層，３基本ユニットからなる中間層，
１基本ユニットからなる出力層の３層の階層ネットワー
クを準備した。FIG. 5 shows an experimental result according to the present invention. In this experiment, as the hierarchical network unit 61, an input layer consisting of one input unit, an intermediate layer consisting of three basic units,
A three-layer hierarchical network of an output layer composed of one basic unit was prepared.

【００５１】この階層ネットワーク構成データ処理装置
に対して使用した学習信号を，図中，白丸○（実教師パ
ターン群）および黒丸●（反面教師パターン群）で示
す。例えば，図５（イ）の左端の○は，“０．０という
入力信号に対して０．２５という出力信号を出力せよ”
という実教師信号を意味し，図５（ロ）の左側の●は，
“０．３７５という入力信号に対して０．２という出力
信号を出力してはならない”という反面教師信号を意味
している。The learning signals used for the hierarchical network configuration data processing device are indicated by white circles (real teacher pattern group) and black circles (contrary teacher pattern group) in the figure. For example, the circle at the left end in FIG. 5A indicates “output an output signal of 0.25 for an input signal of 0.0”.
Means a real teacher signal, and the black circle on the left side of FIG.
On the other hand, it means that an output signal of 0.2 should not be output for an input signal of 0.375.

【００５２】図中の実線は，このような学習信号を上述
した方法で階層ネットワーク構成データ処理装置６０に
学習させた後，すべての入力に対する階層ネットワーク
構成データ処理装置６０の出力結果を図示したものであ
る。図中のＬｏｏｐは，学習の繰り返し回数を表してい
る。The solid line in the figure shows the output result of the hierarchical network configuration data processing device 60 for all inputs after the learning signal is learned by the hierarchical network configuration data processing device 60 by the above-described method. It is. Loop in the figure represents the number of times of learning.

【００５３】例えば，図５の（イ）に示すように，従来
と同様な実教師パターン群のみを用いて学習を行った結
果では，０．５の入力信号の前後に，いわゆるアンダー
シュート，オーバーシュートに相当するような歪みが現
れた。そこで，この部分を効率よく修正するために，図
５の（ロ）に黒丸●で示す反面教師パターン群を加え
て，学習を行った。その結果，図５の（ロ）に実線で示
すような望ましい出力が得られるようになった。For example, as shown in FIG. 5 (a), as a result of learning using only the same actual teacher pattern group as before, the so-called undershoot and overshoot appear before and after the input signal of 0.5. A distortion equivalent to a shot appeared. Therefore, in order to efficiently correct this part, learning was performed by adding a group of teacher patterns indicated by black circles in FIG. As a result, a desired output as shown by a solid line in FIG.

【００５４】図６は，他の学習パターンについて，従来
技術により学習したようすを示している。図に示すよう
に，６つの入力パターンに対する実教師パターン群を用
いて，４５３３回学習を繰り返した結果，各入力に対し
て実線で示すような出力が得られるようになった。FIG. 6 shows a state in which another learning pattern is learned by the conventional technique. As shown in the figure, as a result of repeating the learning 4533 times using the real teacher pattern group for the six input patterns, an output as shown by a solid line is obtained for each input.

【００５５】図７は，図６と同じケースについて，本発
明を適用して学習を行った実験結果を示している。最初
に実教師パターン群のみを用いて，１０００回学習を繰
り返した。その結果，図７の（イ）に示すような入出力
特性が得られた。そこで，図７の（イ）に示す×印の位
置の入力信号に対する出力信号を，反面教師パターンと
して追加しさらに学習を続けると，１９５４回の学習で
結果は図７の（ロ）に示すようになった。図６に示す結
果と，図７の（ロ）に示す結果とを比較すると明らかな
ように，反面教師パターン群の利用により，内部状態値
を望ましい値に高速に収束できることがわかる。FIG. 7 shows the results of an experiment in which learning was performed by applying the present invention to the same case as in FIG. First, learning was repeated 1000 times using only the actual teacher pattern group. As a result, input / output characteristics as shown in FIG. 7A were obtained. Therefore, if the output signal corresponding to the input signal at the position of the mark x shown in FIG. 7A is added as a teacher pattern and learning is further continued, the result is obtained as shown in FIG. Became. As is clear from the comparison between the result shown in FIG. 6 and the result shown in FIG. 7B, it can be seen that the internal state value can be quickly converged to a desired value by using the teacher pattern group.

【００５６】この実施例の説明では，実教師パターン群
と反面教師パターン群の２種類の教師パターン群により
学習を行う場合を例に説明したが，実教師パターン群以
外の教師パターン群を，例えば出力パターン群が絶対に
とってはならない値のもの，出力パターン群ができるだ
けとるべきではない値ものというように，望ましくない
程度に応じて，さらに複数種類に分け，その望ましくな
いレベルに応じて，内部状態値の更新量を調節するよう
にしてもよい。In the description of this embodiment, the case where learning is performed using two types of teacher pattern groups, a real teacher pattern group and a reverse teacher pattern group, has been described as an example. Depending on the degree of undesirability, such as those for which the output pattern group should not take values and those for which the output pattern group should not be possible, the internal state is determined according to the undesired levels. The update amount of the value may be adjusted.

【００５７】本発明は，階層ネットワーク構造のデータ
処理装置に限らず，他のネットワーク構造を持つデータ
処理装置の学習処理にも，同様に適用することができ
る。また，ネットワークの内部状態値の学習方法も誤差
逆伝播法に限らず適用できることは明らかである。さら
に，教師パターン群とネットワーク構成データ処理装置
６０からの出力パターン群との誤差の定義法，該誤差に
基づく内部状態値の更新量の決定法，教師パターン群の
提示の方法等も，上述した実施例の方法に限らず，本発
明を実施することが可能である。The present invention is not limited to a data processing device having a hierarchical network structure, and can be similarly applied to learning processing of a data processing device having another network structure. Also, it is clear that the learning method of the internal state value of the network can be applied not only to the back propagation method. Further, the method of defining the error between the teacher pattern group and the output pattern group from the network configuration data processing device 60, the method of determining the update amount of the internal state value based on the error, the method of presenting the teacher pattern group, and the like are also described above. The present invention can be implemented without being limited to the method of the embodiment.

【００５８】[0058]

【発明の効果】以上説明したように，本発明によれば，
学習パターンとして，学習対象の入力パターンとそれに
対する“望ましい”出力パターンの組からなる正しい事
例群からだけではなく，学習対象の入力パターンとそれ
に対する“望ましくない”出力パターンの組からなる反
例群等の複数種類の事例群からも，ネットワーク構成デ
ータ処理装置を所望の高機能な情報処理装置として構築
できるようになる。また，事例群の提示方法として正し
い事例群のみを使用するフェーズ，反例群のみを使用す
るフェーズ，両者を使用するフェーズ等というように随
時切り替えて提示する方法をとることにより，さらに学
習の効率を向上させることができるようになる。As described above, according to the present invention,
As a learning pattern, not only a correct case group consisting of a set of an input pattern to be learned and a "desirable" output pattern corresponding thereto, but also a counterexample group consisting of a set of an input pattern to be learned and a "desirable" output pattern corresponding thereto. From the plural types of case groups, the network configuration data processing device can be constructed as a desired high-performance information processing device. In addition, the efficiency of learning can be further improved by changing the presentation method such as a phase using only correct case groups, a phase using only counterexample groups, and a phase using both as the case group presentation method. Can be improved.

[Brief description of the drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】本発明の一実施例のシステム構成図である。FIG. 2 is a system configuration diagram of an embodiment of the present invention.

【図３】図２に示す階層ネットワーク構成データ処理装
置の一構成例を示す図である。FIG. 3 is a diagram illustrating a configuration example of a hierarchical network configuration data processing device illustrated in FIG. 2;

【図４】図２に示す学習信号管理提示装置の一構成例を
示す図である。FIG. 4 is a diagram illustrating an example of a configuration of a learning signal management and presentation device illustrated in FIG. 2;

【図５】本発明による実験結果の説明図である。FIG. 5 is an explanatory diagram of an experimental result according to the present invention.

【図６】従来技術による実験結果の説明図である。FIG. 6 is an explanatory diagram of an experimental result according to a conventional technique.

【図７】本発明による実験結果の説明図である。FIG. 7 is an explanatory diagram of an experimental result according to the present invention.

【図８】基本ユニットの基本構成図である。FIG. 8 is a basic configuration diagram of a basic unit.

【図９】階層ネットワークの基本構成図である。FIG. 9 is a basic configuration diagram of a hierarchical network.

[Explanation of symbols]

１０ネットワーク構成データ処理装置１１パターン変換部１２内部状態値格納部２０学習パターン提示装置３０内部状態値学習装置６０階層ネットワーク構成データ処理装置７０学習信号管理提示装置 Reference Signs List 10 network configuration data processing device 11 pattern conversion unit 12 internal state value storage unit 20 learning pattern presentation device 30 internal state value learning device 60 hierarchical network configuration data processing device 70 learning signal management presentation device

フロントページの続き (56)参考文献特開平５−334276（ＪＰ，Ａ) 特開平５−20291（ＪＰ，Ａ) 特開平５−165801（ＪＰ，Ａ) 特開平２−170265（ＪＰ，Ａ) 山田貢己・他「教師信号と出力との差が増大する学習方式」，1991年電子情報通信学会秋季大会講演論文集（分冊６），日本，社団法人電子情報通信学会・発行，特許庁情報館受入日：1991年 11月13日，ｐｐ．12 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06N 1/00 - 7/00 G06G 7/60 G05B 13/00 - 13/04 G10L 3/00 - 9/20 G06K 9/00 - 9/82 G06T 7/00 - 7/60 ＣＳＤＢ（日本国特許庁) ＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of the front page (56) References JP-A-5-334276 (JP, A) JP-A-5-20291 (JP, A) JP-A-5-165801 (JP, A) JP-A-2-170265 (JP) , A) Mitsumi Yamada, et al. “A learning method in which the difference between the teacher signal and output increases”, Proc. Of the 1991 IEICE Autumn Meeting, Volume 6, Japanese Society of Electronics, Information and Communication Engineers, Japan , Japan Patent Office Information Center: November 13, 1991, pp. 12 (58) Field surveyed (Int.Cl. ⁷ , DB name) G06N 1/00-7/00 G06G 7/60 G05B 13/00-13/04 G10L 3/00-9/20 G06K 9/00- 9/82 G06T 7/00-7/60 CSDB (Japan Patent Office) JICST file (JOIS)

Claims

(57) [Claims]

1. A network configuration data processing device for calculating and outputting an output pattern corresponding to an input pattern and having a network structure in which a plurality of operation elements for performing data conversion specified by an internal state value on an input are connected. a learning processing method for determining by learning the internal <br/> unit status value of the location, in response to the input pattern group of the learning object to be presented to the network configuration data processing equipment, a pattern to take the output pattern group A plurality of types of teacher pattern groups including a real teacher pattern group to be instructed and a pattern group for instructing a pattern that the output pattern group should not take are given as learning information, and the network configuration data based on the input pattern group to be learned is provided. an output pattern group of the processing instrumentation placed et the error value between the teacher pattern group, calculated for each type of teacher pattern group The update amount of the internal state value is determined based on a plurality of error values obtained according to the type of the teacher pattern group, and while the internal state value is sequentially updated using the update amount, the value related to the error value is determined. double but to a predetermined allowable range, a learning processing method for Rene Ttowaku configuration data processing device to determine the learning the internal state value, used for learning of the network configuration data processing device
Several types of teacher patterns are processed by the network configuration data processing.
Output pattern group from the logical device and the actual teacher pattern group
According to the error value of the actual teacher pattern group, then other
Of the teacher patterns including the teacher patterns
A learning processing method for a network configuration data processing device, wherein the learning is advanced by sequentially updating internal state values .

2. The learning processing method for a network configuration data processing device according to claim 1, wherein said learning to be presented to said network configuration data processing device is performed.
As a method of presenting the teacher pattern group,
Phase that uses only the training,
Phase using only pharma pattern groups, using both
Has a phase, the learning processing method for network configuration data processing apparatus characterized by learning while switching these phase at any time.

3. A learning processing method for a network configuration data processing device according to claim 1, wherein said network configuration is based on an input pattern group to be learned.
Output patterns from the data processing device and the teacher pattern
The output pattern group and the plurality of types
The distance is defined between each of the teacher pattern groups, and at least the output pattern group and the actual teacher pattern group
To reduce the distance between and the output pattern group
On the other hand, the distance between the teacher pattern group and the
Thus, the internal state of the network configuration data processing device
The values are sequentially updated, and the error value between the output pattern group and the actual teacher pattern group is allowed.
A learning processing method for a network configuration data processing device, wherein learning is performed so as to fall within a range .

4. A learning processing method for a network configuration data processing device according to claim 1, wherein said network configuration is based on a group of input patterns to be learned.
Output patterns from the data processing device and the teacher pattern
The output pattern group and the actual teacher
The distance between the turn group and the above-mentioned teacher pattern group
At least an output pattern group and the actual teacher pattern group
To reduce the distance between and the output pattern group
On the other hand, the distance between the teacher pattern group and the
Thus, the internal state of the network configuration data processing device
The values are sequentially updated, and the distance and the distance between the output pattern group and the actual teacher pattern group are updated .
And the distance between the output pattern group and the reverse pattern group
The learning processing method of the network configuration data processing device, wherein learning is performed so as to be within an allowable range .

5. A network configuration data processing device for calculating and outputting an output pattern corresponding to an input pattern and having a network structure in which a plurality of operation elements for performing data conversion specified by an internal state value on an input are connected. And a learning processing system for a network configuration data processing apparatus that learns the internal state value based on a group of input patterns to be learned and a group of teacher patterns predetermined for the group of input patterns. , and the addition to presenting an input pattern group of the learning object to be input to the network structure data processing equipment, in accordance with their input pattern group, the network configuration data processing equipment
As a teacher pattern group for instructing pressurization et output pattern group, the plurality including a real teacher pattern group that instructs the pattern to take the output pattern group, and although the teacher pattern group for instructing the pattern output pattern group must not take wherein the type of teacher pattern group and the learning pattern presentation means to presentation as learning information, and the network configuration data processing instrumentation placed these output pattern group according to the input pattern group of the learning object given depending on the learning pattern presentation hand stage the error value between the teacher pattern group, calculated for each type of teacher pattern group, based on the plurality of error values obtained in accordance with the type of the teacher pattern group, the internal state of the network configuration data processing equipment While determining the update amount of the value and sequentially updating the internal state value using the update amount, at least the output pattern group and the training A learning processing system of Ne Ttowa <br/> click configuration data processor error value and a internal state value learning hand stage performing learning control such that the allowable range of the teacher pattern group, the learning pattern The presenting means includes the network configuration data.
Of multiple types of teacher patterns used for learning data processor
Is the output pattern from the network configuration data processor.
According to the error value between the pattern group and the actual teacher pattern group,
The actual teacher pattern group, and then the other teacher pattern group
Means for presenting in the order of a group of teacher patterns, wherein said internal state value learning means comprises:
Updates the internal state values sequentially based on the information presented by
A learning processing system for a network configuration data processing device, characterized in that learning is advanced .