JP2877413B2

JP2877413B2 - Surplus neuron decision processing method

Info

Publication number: JP2877413B2
Application number: JP2026277A
Authority: JP
Inventors: 竜介益岡; 信雄渡部; 旭川村; 有理大和田; 和雄浅川
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-02-06
Filing date: 1990-02-06
Publication date: 1999-03-31
Anticipated expiration: 2014-03-31
Also published as: JPH03230285A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明はニューロンの内部結合により構成されるネッ
トワーク構造に従ってデータ処理を実行するネットワー
ク構成データ処理装置においての、非活性な余剰ニュー
ロンを決定するための余剰ニューロン決定処理方式に関
するものである。DETAILED DESCRIPTION OF THE INVENTION [Industrial Application Field] The present invention determines an inactive surplus neuron in a network-structured data processing device that executes data processing according to a network structure formed by internal connections of neurons. In the surplus neuron determination processing method.

従来の逐次処理コンピュータ（ノイマン型コンピュー
タ）では、使用方法や環境の変化に応じてデータ処理機
能を調節することができないので、所定の演算処理を実
行するニューロンの内部結合により構成されるネットワ
ーク構造のデータ変換機能を用いる並列分散処理方式の
適応的なデータ処理装置が提案されてきている。このネ
ットワーク構成のデータ処理装置では、明示的なプログ
ラムを作成することなく、学習用に用意された入力信号
（入力パターン）の提示に対して出力されるネットワー
ク構造からの出力信号（出力パターン）が、教師信号
（教師パターン）と一致するべく所定の学習アルゴリズ
ムに従ってネットワーク構造の内部結合の重み値を決定
していく。そして、この学習処理により重み値が決定さ
れると、想定していなかった入力信号が入力されること
になっても、このネットワーク構造からそれらしい出力
信号を出力するという“柔らかい”データ処理機能が実
現されることになる。In a conventional sequential processing computer (Neumann-type computer), the data processing function cannot be adjusted according to a change in the method of use or the environment. An adaptive data processing apparatus of a parallel distributed processing method using a data conversion function has been proposed. In this data processing device having a network configuration, an output signal (output pattern) from a network structure output in response to the presentation of an input signal (input pattern) prepared for learning without creating an explicit program. The weight value of the internal connection of the network structure is determined according to a predetermined learning algorithm so as to match the teacher signal (teacher pattern). When the weight value is determined by the learning process, a "soft" data processing function of outputting an appropriate output signal from the network structure even if an unexpected input signal is input is provided. Will be realized.

このような構造のネットワーク構成データ処理装置に
おいて、ネットワーク構造の演算処理速度を速めたり、
ネットワーク構造のハードウェア量を削減していくため
にも、ネットワーク構造中の活性化していない余剰的な
ニューロンがどれなのかを特定できるための手段を用意
していく必要があるのである。In the network configuration data processing device having such a structure, the arithmetic processing speed of the network structure can be increased,
In order to reduce the amount of hardware in the network structure, it is necessary to provide a means for identifying the extra inactive neurons in the network structure.

[Conventional technology]

階層ネットワーク構成をとるネットワーク構成データ
処理装置に従って従来技術を説明する。The prior art will be described according to a network configuration data processing device having a hierarchical network configuration.

階層ネットワーク構成をとるデータ処理装置では、基
本ユニット（ニューロンの一態様をなす）と呼ぶ一種の
ノードと、内部状態値に相当する重み値を持つ内部結合
とから階層ネットワークを構成している。第12図に、基
本ユニット１の基本構成を示す。この基本ユニット１
は、多入力一出力系となっており、複数の入力に対し夫
々の内部結合の重み値を乗算する乗算処理部２と、それ
らの全乗算結果を加算する累算処理部３と、この累算値
に非線型の閾値処理を施して一つの最終出力を出力する
閾値処理部４とを備える。In a data processing device having a hierarchical network configuration, a hierarchical network is configured from a kind of node called a basic unit (which forms an aspect of a neuron) and an internal connection having a weight value corresponding to an internal state value. FIG. 12 shows the basic configuration of the basic unit 1. This basic unit 1
Is a multi-input, one-output system, a multiplication unit 2 for multiplying a plurality of inputs by respective weight values of the internal connection, an accumulation unit 3 for adding all the multiplication results thereof, A threshold processing unit 4 that performs a non-linear threshold process on the calculated value and outputs one final output.

ｈ層を前段層としｉ層を後段層とすると、この累算処
理部３では下記の（１）式の演算を実行し、閾値処理部
４では下記の（２）式の演算を実行する。Assuming that the h-th layer is the first layer and the i-th layer is the second layer, the accumulation processing unit 3 executes the operation of the following expression (1), and the threshold processing unit 4 executes the operation of the following expression (2).

但し、 h :h層のユニット番号 i :i層のユニット番号 p :入力信号のパターン番号 θ_i:i層のｉ番ユニットの閾値 W_ih:h−ｉ層間の内部結合の重み値 x_pi:h層の各ユニットからｉ層のｉ番ユニットへの入力
の積和 y_ph:p番目パターンの入力信号に対するｈ層のｈ番ユニ
ットからの出力 y_pi:p番目パターンの入力信号に対するｉ層のｉ番ユニ
ットからの出力そして、階層ネットワーク構成データ処理装置では、
このような構成の多数の基本ユニット１が、入力信号値
をそのまま分配して出力する入力ユニット１′を入力層
として、第13図に示すように階層的に接続されることで
階層ネットワーク部を構成して、入力パターンを対応す
る出力パターンに変換するという並列的なデータ処理機
能を発揮することになる。 Here, h: unit number of the h layer i: unit number of the i layer p: input signal pattern number θ _i : threshold value of the i unit of the i layer W _ih : weight value of the internal coupling between the _{hi and} i layers x _pi : y _ph : The output from the h-th unit of the h-th layer with respect to the input signal of the p-th pattern y _pi : The output of the i-th layer with respect to the input signal of the p-th pattern Output from the i-th unit Then, in the hierarchical network configuration data processing device,
A large number of basic units 1 having such a configuration are hierarchically connected as shown in FIG. 13 by using an input unit 1 'which distributes and outputs an input signal value as it is as an input layer, thereby forming a hierarchical network unit. When configured, a parallel data processing function of converting an input pattern into a corresponding output pattern is exhibited.

ネットワーク構成のデータ処理装置では、データ変換
機能を規定するところのネットワーク構造の重み値を学
習処理により求めていく必要がある。階層ネットワーク
構成データ処理装置の重み値の学習処理方式として、特
に、バック・プロパゲーション法と呼ばれる学習処理方
式がその実用性の高さから注目されている。In a data processing device having a network configuration, it is necessary to obtain a weight value of a network structure that defines a data conversion function by a learning process. As a learning processing method of the weight value of the hierarchical network configuration data processing device, a learning processing method called a back propagation method has been particularly noted because of its high practicality.

バック・プロパゲーション法では、階層ネットワーク
の重み値W_ihと閾値θ_ｉとを誤差のフィードバックによ
り適応的に自動調節して学習することになる。（１）
（２）式から明らかなように、重み値W_ihと閾値θ_ｉと
の調節は同時に実行される必要があるが、この作業は、
相互に干渉する難しい作業となる。そこで、本出願人
は、先に出願の「特願昭62−333484号（昭和62年12月28
日出願、“ネットワーク構成データ処理装置”）」で開
示したように、入力側のｈ層に常に“1"を出力するとと
もにその出力に対して閾値θ_ｉを重み値として割り付け
るユニットを設けることで、閾値θ_ｉを重み値W_ihの中
に組み込んで閾値θ_ｉを重み値として扱うようにするこ
とを提案した。このようにすることで、上述の（１）
（２）式は、で表されることになる。In the back propagation method, learning is performed by adaptively and automatically adjusting the weight value W _ih and the threshold value θ _i of the hierarchical network by error feedback. (1)
As is apparent from the equation (2), the adjustment of the weight value W _ih and the threshold θ _i needs to be performed at the same time.
It is a difficult task to interfere with each other. Therefore, the applicant of the present application has filed a Japanese Patent Application No. 62-333484 (December 28, 1987)
As disclosed in Japanese Patent Application, “Network Configuration Data Processing Apparatus”), by providing a unit that always outputs “1” to the h layer on the input side and assigns a threshold value θ _i to the output as a weight value. It was suggested to treat the threshold theta _i as the weighting value incorporates threshold theta _i in the weight value W _ih. By doing so, the above (1)
Equation (2) is Will be represented by

次に、この（３）（４）式の記述形式のものに従っ
て、バック・プロパゲージョン法による重み値の学習処
理方式について説明する。ここで、この説明は、階層ネ
ットワーク部が、第13図に示すｈ層−ｉ層−ｊ層という
３層構造の階層ネットワークをもつもので行う。Next, a description will be given of a weight value learning processing method by the back propagation method according to the description format of the expressions (3) and (4). Here, the description will be made on the assumption that the hierarchical network unit has a hierarchical network having a three-layer structure of the h layer-i layer-j layer shown in FIG.

（３）（４）式からの類推によって次の（５）（６）
式が得られる。すなわち、但し、 j :j層のユニット番号 W_ji:i−ｊ層間の内部結合の重み値 x_pj:i層の各ユニットからｊ層のｊ番ユニットへの入力
の積和 y_pj:p番目パターンの入力信号に対するｊ層のｊ番ユニ
ットからの出力バック・プロパゲージョン法では、学習用の入力パタ
ーンが提示されたときに出力される出力層からの出力パ
ターンy_pjと、その出力パターンy_piのとるべき信号であ
る教師パターンd_pj（ｐ番目パターンの入力信号に対す
るｊ層ｊ番目ユニットへの教師信号）とが定まると、先
ず最初に、出力パターンy_pjと教師パターンd_pjとの差分
値〔d_pj−y_pj〕を算出し、次に、 α_pj＝y_pj（１−y_pj）（d_pj−y_pj）を算出して、続いて、但し、ε：学習定数 ζ：モーメンタム t:学習回数に従って、ｉ層−ｊ層間の重み値の更新量ΔW_ij（ｔ）
を算出する。ここで、「ζΔW_ji（ｔ−１）」という前
回の更新サイクル時に決定された重み値の更新量に係る
ものを加算するのは学習の高速化を図るためである。By analogy with equations (3) and (4), the following equations (5) and (6)
An expression is obtained. That is, Where j: the unit number of the j-th layer W _ji : the weight value of the internal connection between the ij-layers x _pj : the product sum of the inputs from each unit of the i-th layer to the j-th unit of the j-th layer y _pj : the p-th pattern Output from the j-th unit of the j-th layer with respect to the input signal In the back propagation method, an output pattern y _pj from the output layer that is output when an input pattern for learning is presented, and an output pattern y _pi When the teacher pattern d _pj (teacher signal for the p-th pattern input signal to the j-th layer j-th unit) to be taken is determined, first, the difference value between the output pattern y _pj and the teacher pattern d _pj [ d _pj −y _pj ], then α _pj = y _pj (1−y _pj ) (d _pj −y _pj ), and then: Where ε: learning constant ζ: momentum t: learning frequency Δw _ij (t)
Is calculated. Here, for adding those pertaining to the update amount of the weight values determined in the previous update cycle of "ζΔW _ji (t-1)" is order to speed up the learning.

続いて、算出したα_pjを用いて、先ず最初に、を算出し、次に、に従って、ｈ層−ｉ層間の重み値の更新量ΔW_ih（ｔ）
を算出する。Subsequently, using the calculated α _pj , first, , Then , The update amount ΔW _ih (t) of the weight value between the h layer and the i layer
Is calculated.

続いて、この算出した更新量に従って次の更新サイク
ルのための重み値 W_ji（ｔ）＝W_ji（ｔ−１）＋ΔW_ji（ｔ） W_ih（ｔ）＝W_ih（ｔ−１）＋ΔW_ih（ｔ）を決定していく方法を繰り返していくことで、学習用の
入力パターンを提示したときに出力される出力層からの
出力パターンy_pjと、その出力パターンy_pjのとるべき信
号である教師パターンd_pjとが一致することになる重み
値W_ji,W_ih（閾値θ_i,θ_ｊも含まれる）を学習するよう
処理する。Subsequently, the weight value W _ji for according to the update amount calculated as described above in the next update cycle _{(t) = W ji (t} -1) + ΔW ji (t) W ih (t) = W ih (t-1) + ΔW By repeating the method of determining _ih (t), an output pattern y _pj from the output layer that is output when an input pattern for learning is presented, and a signal to be taken by the output pattern y _pj Processing is performed to learn weight values W _ji , W _ih (including threshold values θ _i , θ _j ) at which a certain teacher pattern d _pj matches.

そして、階層ネットワーク部がｇ層−ｈ層−ｉ層−ｊ
層という４層構造の階層ネットワークをもつときには、
先ず最初に、を算出し、次に、に従ってｇ層−ｈ層間の重み値の更新量ΔW_hg（ｔ）を
算出するというように、前段側の層間の重み値の更新量
ΔＷを、出力側の後段から求まる値とネットワーク出力
データとを使いながら決定していくことで、すべての重
み値と閾値を学習するよう処理している。Then, the hierarchical network unit is composed of g layer-h layer-i layer-j
If you have a 4-layer hierarchical network called layers,
First of all, , Then The update amount ΔW _hg (t) of the weight value between the g layer and the h layer is calculated according to the following formula. By determining while using, all weight values and threshold values are processed to be learned.

[Problems to be solved by the invention]

しかしながら、従来では、内部結合の重み値が学習さ
れると、その重み値を用いてそのままデータ処理に入る
よう処理していた。また、重み値の学習処理を大型計算
機で行ってその求められた重み値をネットワーク構造の
実ハードウェアに移植する方法をとる場合にあっては、
学習により求められた重み値をそのまま移植してデータ
処理に入るよう処理していた。However, conventionally, when the weight value of the internal connection is learned, processing is performed so as to directly enter the data processing using the weight value. Also, in the case where a method of performing weight value learning processing by a large computer and porting the obtained weight value to real hardware having a network structure is used,
The weight value obtained by the learning is transplanted as it is and the data processing is performed.

これから、従来技術では、データ処理に対して有効な
働きをしていないニューロンが存在していてもそのこと
を全く考慮することなくデータ処理を実行することにな
る。このような余剰なニューロンの存在は、ハードウェ
ア量を大きくするばかりか、演算速度の増加をもたらし
てしまい何の利点ももたらさない。From this, in the related art, even if there is a neuron that does not work effectively for data processing, the data processing is executed without considering the fact at all. The presence of such extra neurons not only increases the amount of hardware but also increases the operation speed, and does not bring any advantage.

本発明はかかる事情に鑑みてなされたものであって、
ネットワーク構成データ処理装置のネットワーク構造中
の活性化していない余剰的なニューロンがどれなのかを
特定できる手段を提供することで、ネットワーク構成デ
ータ処理装置の演算処理速度の高速化とハードウェア量
の削減を実現できる途を与えることを目的とするもので
ある。The present invention has been made in view of such circumstances,
By providing a means for identifying which extra neurons are not activated in the network structure of the network configuration data processing device, the calculation processing speed of the network configuration data processing device is increased and the amount of hardware is reduced. It is intended to provide a way to realize the above.

[Means for solving the problem]

第１図は本発明の原理構成図である。 FIG. 1 is a diagram illustrating the principle of the present invention.

図中、10はネットワーク構成データ処理装置であっ
て、ネットワーク構成に従って適応的なデータ処理を実
行するもの、11はネットワーク構成データ処理装置10の
備えるネットワーク構造部であって、１つ又は複数の入
力と該入力に対して乗算されるべき内部状態値とを受け
取って積和値を得るとともに、この積和値を所定の関数
によって変換して出力値を得るニューロンの内部結合に
より構成されて、入力パターンに対応する出力パターン
を算出して出力するもの、12はネットワーク構成データ
処理装置10の備える内部状態値格納手段であって、ネッ
トワーク構造部11がデータ変換処理の実行時に必要とす
る内部状態値を管理するもの、20は内部状態値学習装置
であって、内部状態値格納手段12に格納される内部状態
値を学習処理により求めるものである。In the figure, reference numeral 10 denotes a network configuration data processing device that performs adaptive data processing according to the network configuration, and 11 denotes a network structure provided in the network configuration data processing device 10 and includes one or more input devices. And an internal state value to be multiplied with respect to the input to obtain a sum-of-products value. The product-sum value is converted by a predetermined function to obtain an output value. A device for calculating and outputting an output pattern corresponding to the pattern, 12 is an internal state value storage means provided in the network configuration data processing device 10, and an internal state value required by the network structure unit 11 at the time of executing data conversion processing. 20 is an internal state value learning device, which obtains an internal state value stored in the internal state value storage means 12 by a learning process. It is.

30は余剰ニューロン決定装置であって、ネットワーク
構造情報管理手段31と、内部状態値読取手段32と、第１
の余剰ニューロン特定手段33と、ニューロン群特定手段
34と、同一機能ニューロン特定手段35と、第２の余剰ニ
ューロン特定手段36と、出力範囲特定手段37と、第３の
余剰ニューロン特定手段38と、余剰ニューロン出力手段
39とを備えるものである。Reference numeral 30 denotes a surplus neuron determination device, which includes a network structure information management unit 31, an internal state value reading unit 32,
Surplus neuron identification means 33 and neuron group identification means
34, the same function neuron specifying means 35, the second surplus neuron specifying means 36, the output range specifying means 37, the third surplus neuron specifying means 38, and the surplus neuron output means
39.

このネットワーク構造情報管理手段31は、ネットワー
ク構造部11のネットワーク構造情報を管理し、内部状態
値読取手段32は、内部状態値格納手段12に格納される内
部状態値を読み取るよう処理し、第１の余剰ニューロン
特定手段33は、入力に割り付けられる内部状態値の学習
値のゼロ値からの距離と、出力に割り付けられる内部状
態値の学習値のゼロ値からの距離のいずれか一方の距離
がゼロ値に近い値を持つニューロンを特定するととも
に、この特定されたニューロンを余剰ニューロンとして
決定するよう処理し、ニューロン群特定手段34は、ネッ
トワーク構造情報管理手段31を検索することでネットワ
ーク構造部11のニューロンの内の入出力先が同じ形態を
とるニューロン群を特定するよう処理し、同一機能ニュ
ーロン特定手段35は、ニューロン群特定手段34により特
定されたニューロン群の内、入出力に割り付けられる対
応の内部状態値の学習値が概略同一値をとるニューロン
群を特定するよう処理し、第２の余剰ニューロン特定手
段36は、同一機能ニューロン特定手段35により特定され
るニューロン群の内の１つ以外のニューロンを余剰ニュ
ーロンとして決定するよう処理し、出力範囲特定手段37
は、学習対象の入力パターン群の提示に対して各ニュー
ロンが出力する出力値の最大値と最小値とを特定するよ
う処理し、第３の余剰ニューロン特定手段38は、出力範
囲特定手段37により特定された最大値と最小値との差分
値がゼロ値に近い値を持つニューロンを特定するととも
に、この特定されたニューロンを余剰ニューロンとして
決定するよう処理し、余剰ニューロン出力手段39は、第
１、第２、第３の余剰ニューロン特定手段33,36,38が決
定した余剰ニューロンの識別子を出力するよう処理す
る。The network structure information management means 31 manages the network structure information of the network structure part 11, and the internal state value reading means 32 processes to read the internal state value stored in the internal state value storage means 12, The surplus neuron identification means 33 is configured to determine whether one of the distance from the zero value of the learning value of the internal state value assigned to the input and the distance from the zero value of the learning value of the internal state value assigned to the output is zero. A neuron having a value close to the value is specified, and the specified neuron is processed to be determined as a surplus neuron. The neuron group specifying means 34 searches the network structure information management means 31 to The input / output destinations of the neurons are processed so as to specify a group of neurons having the same form. Out of the neuron groups specified by the neuron group specifying means 34, the neuron groups in which the learning values of the corresponding internal state values assigned to the inputs and outputs have substantially the same value are specified. Performs processing to determine a neuron other than one of the neuron groups specified by the same function neuron specifying means 35 as a surplus neuron,
Performs processing to specify the maximum value and the minimum value of the output value output by each neuron in response to the presentation of the input pattern group to be learned, and the third surplus neuron specifying means 38 A neuron whose difference value between the specified maximum value and the minimum value is close to zero is specified, and the specified neuron is processed as a surplus neuron. , The second and third surplus neuron identification means 33, 36, and 38 output the determined surplus neuron identifier.

[Action]

本発明では、内部状態値学習装置20が学習対象の入力
パターン群をネットワーク構造部11に提示するととも
に、この提示に従って出力されるネットワーク構造部11
からの出力パターン群と、この出力パターン群のとるべ
きパターンを示す教師パターン群との間の誤差値を算出
し、この誤差値に基づいて内部状態値格納手段12の内部
状態値を順次更新していくことで、所望のデータ変換処
理を実験するネットワーク構造部11の内部状態値を学習
して内部状態値格納手段12に格納するよう処理する。In the present invention, the internal state value learning device 20 presents the input pattern group to be learned to the network structure unit 11 and outputs the network structure unit 11 according to the presentation.
, And an error value between the output pattern group and the teacher pattern group indicating the pattern to be taken by the output pattern group, and the internal state value of the internal state value storage means 12 is sequentially updated based on the error value. By doing so, the internal state value of the network structure unit 11 for experimenting with the desired data conversion processing is learned and stored in the internal state value storage unit 12.

このようにして、所望のデータ変換処理を実現するネ
ットワーク構造部11の内部状態値が内部状態値格納手段
12に格納されると、内部状態値読取手段32は、ネットワ
ーク構造情報管理手段31の管理情報を参照しつつ順次１
つのニューロンを選択して、その選択したニューロンの
入力側の内部結合に割り付けられる全内部状態値を内部
状態値格納手段12から読み出すとともに、出力側の内部
結合に割り付けられる全内部状態値を内部状態値格納手
段12から読み出すよう処理する。そして、第１の余剰ニ
ューロン特定手段33は、この読み出された入力側の内部
結合の内部状態値のゼロ値からの距離がゼロ値に近い値
を持つときには、そのニューロンは入力に応答しない非
活性な余剰ニューロンであると決定するとともに、この
読み出された出力側の内部結合の内部状態値のゼロ値か
らの距離がゼロ値に近い値を持つときには、そのニュー
ロンは出力を出さない非活性な余剰ニューロンであると
決定し、余剰ニューロン出力手段39は、その決定された
余剰ニューロンの識別子を出力する。In this way, the internal state value of the network structure unit 11 that realizes the desired data conversion processing is stored in the internal state value storage unit.
When stored in the storage unit 12, the internal state value reading unit 32 sequentially refers to the management information of the network structure information management unit 31,
One neuron is selected, and all internal state values assigned to the internal connection on the input side of the selected neuron are read out from the internal state value storage means 12, and all internal state values assigned to the internal connection on the output side are stored in the internal state. Processing to read from the value storage means 12 is performed. Then, when the distance from the zero value of the read internal state value of the internal connection on the input side has a value close to zero value, the first surplus neuron specifying means 33 determines that the neuron which does not respond to the input is When it is determined that the neuron is an active surplus neuron, and when the distance from the zero value of the internal state value of the read internal connection on the output side has a value close to zero value, the neuron does not output any inactive. The surplus neuron output means 39 outputs an identifier of the determined surplus neuron.

また、所望のデータ変換処理を実現するネットワーク
構造部11の内部状態値が内部状態値格納手段12に格納さ
れると、ニューロン群特定手段34は、ネットワーク構造
情報管理手段31の管理情報を参照することでネットワー
ク構造部11のニューロンの内の入出力先が同じ形態をと
るニューロン群を特定する。続いて、内部状態値読取手
段32は、この特定されたニューロン群の各ニューロンの
入出力の内部結合に割り付けられる全内部状態値を内部
状態値格納手段12から読み出す。そして、同一機能ニュ
ーロン特定手段35は、この読み出された内部状態値に従
い、入出力先が同じ形態をとるニューロン群の内の入出
力に割り付けられる各内部状態値が概略同一の値をとる
ニューロン群を特定し、第２の余剰ニューロン特定手段
36は、この概略同一の値をとるニューロン群は本来１つ
のニューロンにより代用できるものであって１つ以外の
ニューロンは非活性な余剰ニューロンとなるものである
と決定し、余剰ニューロン出力手段39は、その決定され
た余剰ニューロンの識別子を出力する。Further, when the internal state value of the network structure unit 11 that realizes the desired data conversion processing is stored in the internal state value storage unit 12, the neuron group identification unit 34 refers to the management information of the network structure information management unit 31. This specifies a group of neurons whose input / output destinations in the network structure unit 11 take the same form. Subsequently, the internal state value reading means 32 reads from the internal state value storing means 12 all the internal state values assigned to the internal connection of the input and output of each neuron of the specified neuron group. Then, in accordance with the read internal state value, the same-function neuron identification means 35 outputs a neuron whose internal state value assigned to the input / output of the neuron group whose input / output destination takes the same form has substantially the same value. A second surplus neuron identifying means for identifying a group
36 determines that the group of neurons having substantially the same value can be originally replaced by one neuron, and the other neurons become inactive surplus neurons. , And outputs the determined surplus neuron identifier.

また、所望のデータ変換処理を実現するネットワーク
構造部11の内部状態値が内部状態値格納手段12に格納さ
れると、出力範囲特定手段37は、ネットワーク構造情報
管理手段31の管理情報を参照しつつ順次１つのニューロ
ンを選択し、学習対象の入力パターン群の提示に対して
出力されるネットワーク構造部11のネットワーク出力デ
ータを参照することで、その選択したニューロンが学習
対象の入力パターン群の提示に対して出力する出力値の
最大値と最小値とを特定する。そして、第３の余剰ニュ
ーロン特定手段38は、この特定された最大値と最小値と
の差分値がゼロ値に近い値を持つときには、そのニュー
ロンは有効な動作を行っていない非活性な余剰ニューロ
ンであると決定し、余剰ニューロン出力手段39は、その
決定された余剰ニューロンの識別子を出力する。When the internal state value of the network structure unit 11 that realizes the desired data conversion processing is stored in the internal state value storage unit 12, the output range specifying unit 37 refers to the management information of the network structure information management unit 31. While selecting one neuron in sequence, and referring to the network output data of the network structure unit 11 that is output in response to the presentation of the input pattern group to be learned, the selected neuron presents the input pattern group to be learned. The maximum value and the minimum value of the output value to be output with respect to are specified. When the difference between the specified maximum value and the minimum value has a value close to zero, the third surplus neuron specifying means 38 determines that the neuron is an inactive surplus neuron that is not performing an effective operation. The surplus neuron output means 39 outputs the determined surplus neuron identifier.

このように、本発明によれば、ネットワーク構成デー
タ処理装置10のネットワーク構造部11中の活性化してい
ない余剰的なニューロンがとれなのかを特定できるよう
になるのである。As described above, according to the present invention, it is possible to specify whether or not extra inactive neurons in the network structure unit 11 of the network configuration data processing device 10 cannot be removed.

〔Example〕

以下、階層ネットワーク構成データ処理装置に適用し
た実施例に従って本発明を詳細に説明する。Hereinafter, the present invention will be described in detail according to an embodiment applied to a hierarchical network configuration data processing device.

第２図に、本発明の一実施例を図示する。図中、第１
図、第12図及び第13図で説明したものと同じものについ
ては同一の記号で示してある。10aは階層ネットワーク
構成に従って適応的なデータ処理を実行する階層ネット
ワーク構成データ処理装置、11aは第13図で説明したよ
うな構造をもつ階層ネットワーク構造部、12aは第１図
の内部状態値格納手段12に相当する重み値格納手段、20
aはバック・プロパゲーション法に従って階層ネットワ
ーク構造部11aの重み値の学習を実行する重み値学習装
置、30aは第１図の余剰ニューロン決定装置30に相当す
る余剰ニューロン決定装置である。FIG. 2 shows an embodiment of the present invention. In the figure, the first
The same components as those described in FIG. 12, FIG. 12, and FIG. 13 are denoted by the same reference numerals. 10a is a hierarchical network configuration data processing device for executing adaptive data processing according to the hierarchical network configuration, 11a is a hierarchical network structure unit having the structure described in FIG. 13, and 12a is an internal state value storage means of FIG. Weight value storage means equivalent to 12, 20
a is a weight value learning device for learning the weight value of the hierarchical network structure unit 11a according to the back propagation method, and 30a is a surplus neuron determination device corresponding to the surplus neuron determination device 30 in FIG.

重み値学習装置20aは、学習信号格納手段21と学習信
号提示手段22と学習収束判定手段23と重み値学習手段24
とを備える。この学習信号格納手段21は、重み値の学習
のために用いられる学習信号を格納し、学習信号提示手
段22は、学習信号格納手段21から学習信号を読み出し
て、その内の学習提示信号を階層ネットワーク構造部11
aに提示するとともに、対をなすもう一方の学習教師信
号を学習収束判定手段23及び重み値学習手段24に提示す
るよう処理し、学習収束判定手段23は、階層ネットワー
ク構造部11aから出力される出力信号と学習信号提示手
段22からの学習教師信号とを受けて、階層ネットワーク
構造部11aのデータ処理機能の誤差が許容範囲に入った
か否かを判定してその判定結果を学習信号提示手段22に
通知するように処理し、重み値学習手段24は、学習信号
提示手段22からの学習教師信号と階層ネットワーク構造
部11aのネットワーク出力データとを受けて、バック・
プロパゲーション法に従って重み値の更新量を算出し、
該更新量に従って重み値を更新していくことで重み値を
収束させるべく学習するよう処理する。The weight value learning device 20a includes a learning signal storage unit 21, a learning signal presenting unit 22, a learning convergence determining unit 23, and a weight value learning unit 24.
And The learning signal storage means 21 stores a learning signal used for learning the weight value, and the learning signal presentation means 22 reads the learning signal from the learning signal storage means 21 and stores the learning presentation signal in the hierarchy. Network Structure Department 11
a, and processing to present the other pair of learning teacher signals to the learning convergence determining means 23 and the weight value learning means 24, and the learning convergence determining means 23 is output from the hierarchical network structure unit 11a. In response to the output signal and the learning teacher signal from the learning signal presenting means 22, it is determined whether or not an error in the data processing function of the hierarchical network structure unit 11a has entered an allowable range, and the determination result is represented by the learning signal presenting means 22. Weight value learning means 24 receives the learning teacher signal from the learning signal presentation means 22 and the network output data of the hierarchical network structure unit 11a, and
Calculate the update amount of the weight value according to the propagation method,
By updating the weight value in accordance with the update amount, processing is performed to learn to converge the weight value.

そして、本発明に特徴的な余剰ユニット決定装置30a
は、重み値学習装置20aにより重み値の学習が完了する
と、プログラム手段で構成されて、階層ネットワーク構
造部11a中の非活性な基本ユニット１を特定すべく処理
する第１の余剰ユニット特定手段301、第２の余剰ユニ
ット特定手段302、第３の余剰ユニット特定手段303及び
第４の余剰ユニット特定手段304を備えるとともに、こ
れらの余剰ユニット特定手段301,302,303,304が必要と
する判断値γ及び距離関数ｆ（α_ｉ）を管理する判断デ
ータ格納手段305と、これらの余剰ユニット特定手段30
1,302,303,304の処理結果を外部機器に出力する結果出
力手段306とを備えるよう構成される。第３図に、第１
の余剰ユニット特定手段301が実行するフローチャー
ト、第４図に、第２の余剰ユニット特定手段302が実行
するフローチャート、第５図に、第３の余剰ユニット特
定手段303が実行するフローチャート、第６図に、第４
の余剰ユニット特定手段304が実行するフローチャート
を図示する。Then, the surplus unit determination device 30a characteristic of the present invention
When the weight value learning device 20a completes the learning of the weight value, the first redundant unit specifying means 301 which is constituted by the program means and processes to specify the inactive basic unit 1 in the hierarchical network structure section 11a. , A second surplus unit specifying unit 302, a third surplus unit specifying unit 303, and a fourth surplus unit specifying unit 304, and a judgment value γ and a distance function f () required by these surplus unit specifying units 301, 302, 303, 304. α _i ), a judgment data storage means 305 for managing these extra unit
And a result output means 306 for outputting the processing results of 1,302,303,304 to an external device. In FIG. 3, the first
4 is a flowchart executed by the surplus unit specifying means 301, FIG. 4 is a flowchart executed by the second surplus unit specifying means 302, FIG. 5 is a flowchart executed by the third surplus unit specifying means 303, and FIG. And the fourth
3 is a flowchart executed by the surplus unit identification means 304.

次に、この第３図ないし第６図に示すフローチャート
に従って、重み値学習装置20aにより重み値の学習が完
了することを条件にして実行されることになる余剰な基
本ユニット１の特定処理について詳細に説明する。Next, in accordance with the flowcharts shown in FIGS. 3 to 6, the process of specifying the surplus basic unit 1 which is to be executed on condition that the learning of the weight value is completed by the weight value learning device 20a will be described in detail. Will be described.

第１の余剰ユニット特定手段301は、第３図のフロー
チャートに示すように、ステップ30で活性か非活性かの
決定をしていない基本ユニット１を１つ選択し、次に、
ステップ31で、その基本ユニット１から出力されるすべ
ての内部結合に割り付けられる重み値の学習値α_ｉ（１
≦ｉ≦ｎ）を重み値格納手段12aから読み取る。続い
て、ステップ32で、その基本ユニット１に割り付けられ
ている判断値γと距離関数ｆ（α_ｉ）（基本ユニット１
毎や階層構造の段数を単位にして設定することが可能で
ある）とを判断データ格納手段305から読み出す。続い
て、ステップ33で、ステップ31で読み取った重み値（ｎ
次元空間をなす）をこの読み出した距離関数ｆ（α_ｉ）
に代入することで、読み取った重み値の原点からの距離
Ｋを算出する。ここで、距離関数ｆ（α_ｉ）は様々に定
義することが可能であって、例えば、やｆ（α_ｉ）＝MAX|α_i|但し、１≦ｉ≦ｎやの関数式に従って算出されることになる。As shown in the flowchart of FIG. 3, the first surplus unit identification means 301 selects one basic unit 1 for which the determination of active or inactive has not been made in step 30, and
In step 31, the learning value α _i (1) of the weight value assigned to all the inner connections output from the basic unit 1
≤ i ≤ n) is read from the weight value storage means 12a. Subsequently, in step 32, the judgment value γ assigned to the basic unit 1 and the distance function f (α _i ) (basic unit 1
Can be set every time or in units of the number of levels of the hierarchical structure) from the determination data storage means 305. Subsequently, in step 33, the weight value (n
The distance function f (α _i )
, The distance K from the origin of the read weight value is calculated. Here, the distance function f (α _i ) can be variously defined. And f (α _i ) = MAX | α _i | where 1 ≦ i ≦ n or Is calculated according to the following function formula.

このようにして、選択された基本ユニット１の出力側
の内部結合に割り付けられる重み値のｎ次元重み空間に
おける原点からの距離Ｋが求まると、続くステップ34
で、この距離Ｋがステップ32で読み出した判断値γより
も小さいか否かを判断する。そして、判断値γよりも小
さいと判断するときには、ステップ35に進んで選択した
基本ユニット１が余剰ユニットであると決定するととも
に、判断値γよりも大きいと判断するときには、ステッ
プ36に進んで選択した基本ユニット１が余剰ユニットで
はないと決定し、次のステップ37で、その決定結果を結
果出力手段306に通知する。そして、ステップ38で、す
べての基本ユニット１に対して活性か非活性かの決定を
していないことを検出するときには、ステップ30に戻る
よう処理することで、階層ネットワーク構造部11aのす
べての基本ユニット１に対して活性か非活性かの決定を
実行する。In this way, when the distance K from the origin in the n-dimensional weight space of the weight value assigned to the internal connection on the output side of the selected basic unit 1 is obtained, the following step 34
Then, it is determined whether or not the distance K is smaller than the determination value γ read in step 32. When it is determined that the basic unit 1 is smaller than the determination value γ, the process proceeds to step 35 to determine that the selected basic unit 1 is a surplus unit. The determined basic unit 1 is determined not to be a surplus unit, and the result of the determination is notified to the result output means 306 in the next step 37. Then, when it is detected in step 38 that all the basic units 1 have not been determined to be active or inactive, the processing returns to step 30 so that all the basic units 1 of the hierarchical network structure unit 11a are processed. A determination is made as to whether unit 1 is active or inactive.

このように、第１の余剰ユニット特定手段301は、基
本ユニット１の出力側の内部結合の全内部状態値の学習
値がゼロ値に近い値を持つときには、その基本ユニット
１は出力を出さない非活性な余剰ユニットであると決定
していくよう処理するものである。As described above, when the learning values of all the internal state values of the internal connection on the output side of the basic unit 1 have a value close to zero, the first surplus unit specifying unit 301 does not output the basic unit 1. Processing is performed to determine that the unit is an inactive surplus unit.

これに対して、第２の余剰ユニット特定手段302は、
第４図のフローチャートを実行することで、基本ユニッ
ト１の入力側の内部結合の全内部状態値の学習値がゼロ
値に近い値を持つときには、その基本ユニット１は入力
を受け付けない非活性な余剰ユニットであると決定して
いくよう処理するものである。第４図のフローチャート
と第３図のフローチャートとの違いは、第３図のフロー
チャートのステップ31では、選択された基本ユニット１
から“出力”されるすべての内部結合に割り付けられる
重み値の学習値α_ｉを読み取るのに対して、第４図のフ
ローチャートのステップ41では、選択された基本ユニッ
ト１に“入力”されるすべての内部結合に割り付けられ
る重み値の学習値α_ｉを読み取る点にあるだけである。
なお、基本ユニット１の閾値の学習値は、この読み取る
重み値の学習値に含めることを要しない。On the other hand, the second surplus unit identification means 302
By executing the flowchart of FIG. 4, when the learning values of all the internal state values of the internal connection on the input side of the basic unit 1 have a value close to zero, the basic unit 1 does not accept the input and is inactive. Processing is performed to determine that the unit is a surplus unit. The difference between the flowchart of FIG. 4 and the flowchart of FIG. 3 is that, in step 31 of the flowchart of FIG.
The learning values α _i of the weight values assigned to all the internal connections “output” from “ _i” are read, while in step 41 of the flowchart of FIG. At the point of reading the learning value α _i of the weight value assigned to the internal connection of.
The learning value of the threshold value of the basic unit 1 does not need to be included in the learning value of the weight value to be read.

第３の余剰ユニット特定手段303は、第５図のフロー
チャートに示すように、ステップ50で第２段目の中間層
の基本ユニット１のグループといったように、同じ重み
空間を持つ基本ユニット１のグループ（同一の基本ユニ
ット１群との間に内部結合するグループ）の中の１つの
グループを選択する。次に、ステップ51で、そのグルー
プに割り付けられている判断値γと距離関数ｆ（α_i,
α′_ｉ）（グループ毎に設定することが可能である）と
を判断データ格納手段305から読み出す。続いて、ステ
ップ52で、選択したグループの基本ユニット１の中から
１つの基本ユニット１を基準ユニットとして選択し、続
くステップ53で、その選択した基準ユニットを活性化さ
れた余剰でないユニットと決定する。As shown in the flowchart of FIG. 5, the third surplus unit specifying means 303 determines the group of the basic units 1 having the same weight space, such as the group of the basic units 1 of the second layer in the middle layer in step 50. One group is selected from (a group that is internally connected to the same basic unit group). Next, in step 51, the judgment value γ assigned to the group and the distance function f (α _i ,
α ′ _i ) (which can be set for each group) is read from the judgment data storage means 305. Subsequently, in step 52, one basic unit 1 is selected as a reference unit from the basic units 1 of the selected group, and in the following step 53, the selected reference unit is determined as an activated non-surplus unit. .

次に、ステップ54で、未決定未選択の基本ユニット１
の中から基本ユニット１を１つ選択し、続くステップ55
で、重み値格納手段12aから読み取られる基準ユニット
の重み値α_ｉ（ｎ次元空間をなす）とこの選択された基
本ユニット１の重み値α′_ｉ（ｎ次元空間をなす）とを
読み出した距離関数ｆ（α_i,α′_ｉ）に代入すること
で、基準ユニットとステップ54で選択された基本ユニッ
ト１との間の距離Ｋを算出する。ここで、閾値について
は重み値に含めることが好ましいが、寄与度が小さいこ
とから含めないようにするものであってもよい。また、
距離関数ｆ（α_i,α′_ｉ）は様々に定義することが可能
であって、例えば、やｆ（α_i,α′_ｉ）＝MAX|α_i,α′_i| 但し、１≦ｉ≦ｎやの関数式に従って算出されることになる。Next, in step 54, the undetermined and unselected basic unit 1
One basic unit 1 is selected from, and the following step 55
The distance at which the weight value α _{i of the} reference unit read from the weight value storage means 12a (which forms an n-dimensional space) and the weight value α ′ _{i of} this selected basic unit 1 which forms an n-dimensional space are read. By substituting into the function f (α _i , α ′ _i ), the distance K between the reference unit and the basic unit 1 selected in step 54 is calculated. Here, the threshold value is preferably included in the weight value, but may not be included because the degree of contribution is small. Also,
The distance function f (α _i , α ′ _i ) can be variously defined, for example, And f (α _i , α ′ _i ) = MAX | α _i , α ′ _i | where 1 ≦ i ≦ n or Is calculated according to the following function formula.

基準ユニットとステップ54で選択された基本ユニット
１との間の距離Ｋが求まると、続くステップ56で、この
距離Ｋがステップ51で読み出した判断値γよりも小さい
か否かを判断する。そして、判断値γよりも小さいと判
断するときには、ステップ57に進んで、選択した基本ユ
ニット１が基準ユニットと同一の機能（基準ユニットの
重み値を増加すれば省略できる機能である）を果たすも
のに過ぎないことから非活性化な余剰のユニットである
と決定し、一方、判断値γよりも大きいと判断するとき
には、何も決定せずに未決定のままにしておく。そし
て、続くステップ58で、未決定の基本ユニット１のすべ
てを選択したか否かを判断して、選択されていないもの
があるときにはステップ54に戻るよう処理する。When the distance K between the reference unit and the basic unit 1 selected in step 54 is determined, it is determined in subsequent step 56 whether or not the distance K is smaller than the determination value γ read in step 51. When it is determined that the value is smaller than the determination value γ, the process proceeds to step 57, in which the selected basic unit 1 performs the same function as the reference unit (a function that can be omitted by increasing the weight value of the reference unit). Therefore, when it is determined that the unit is an inactive surplus unit, and when it is determined that the unit is larger than the determination value γ, nothing is determined and is left undetermined. Then, in a succeeding step 58, it is determined whether or not all of the undecided basic units 1 have been selected, and if there is any unselected basic unit 1, processing is returned to step 54.

ステップ58の判断ですべて選択したと判断するときに
は、続くステップ59で、ステップ50で選択したグループ
の基本ユニット１の内に未決定の基本ユニット１が残さ
れているか否かを判断して、残されているときには、ス
テップ52に戻って、その残されているものの中から基準
ユニットを選択して、その残されているものの中でその
選択された基準ユニットと同一の機能を果たす余剰の基
本ユニット１があるか否かを調べていくよう処理する。
そして、ステップ59の判断で未決定の基本ユニット１が
残されていないことを判断するときには、ステップ60に
進んで、同じ重み空間を持つ基本ユニット１のグループ
のすべてを処理したのかを判断して、処理していないこ
とを検出するときには、ステップ50に戻るよう処理する
ことで、階層ネットワーク構造部11aのすべての基本ユ
ニット１に対して活性か非活性かの決定を実行して、ス
テップ61でその決定結果を結果出力手段306に通知す
る。When it is determined in step 58 that all of the basic units 1 have been selected, it is determined in subsequent step 59 whether or not undetermined basic units 1 are left in the basic units 1 of the group selected in step 50. If so, the process returns to step 52 to select a reference unit from the remaining ones, and a surplus basic unit that performs the same function as the selected reference unit among the remaining ones. Processing is performed to check whether or not 1 exists.
If it is determined in step 59 that no undetermined basic unit 1 is left, the process proceeds to step 60 to determine whether all groups of the basic unit 1 having the same weight space have been processed. When it is detected that the processing has not been performed, the process returns to step 50 to determine whether all the basic units 1 of the hierarchical network structure unit 11a are active or inactive. The result of the determination is notified to the result output means 306.

このように、第３の余剰ユニット特定手段303は、階
層ネットワーク構造部11aの基本ユニット１に対して同
一の機能を果たすもの同士をグルーピングして、そのグ
ルーピングした基本ユニット１の内の１つのみを余剰ユ
ニットでないものとして決定するとともに、残りの基本
ユニット１を余剰ユニットであると決定していくよう処
理するものである。この第３の余剰ユニット特定手段30
3により余剰であると判断される基本ユニット１につい
ては、代表して選んだ余剰でない基本ユニット１の重み
値をその分大きく設定することで省略することが可能と
なる。As described above, the third surplus unit specifying unit 303 groups the units that perform the same function as the basic unit 1 of the hierarchical network structure unit 11a, and only one of the grouped basic units 1 Is determined not to be a surplus unit, and the remaining basic unit 1 is determined to be a surplus unit. This third surplus unit specifying means 30
The basic unit 1 determined to be surplus according to 3 can be omitted by setting the weight value of the representatively selected non-surplus basic unit 1 large accordingly.

第４の余剰ユニット特定手段304は、第６図のフロー
チャートに示すように、ステップ70で活性か非活性かの
決定をしていない基本ユニット１を１つ選択し、次に、
ステップ71で、その基本ユニット１に割り付けられてい
る判断値γ（基本ユニット１毎や階層構造の段数を単位
にして設定することが可能である）を判断データ格納手
段305から読み出す。続いて、ステップ72で、学習信号
提示手段22に対して階層ネットワーク構造部11aへの学
習提示信号の提示を指示するとともに、この指示に応答
して出力される選択した基本ユニット１からの出力値の
最大値と最小値とを求めるよう処理する。そして、続く
ステップ73で、この求めた最大値と最小値の差分値がス
テップ71で読み取った判断値γより小さいか否かを判断
して、判断値γより小さいと判断するときには、ステッ
プ74に進んで選択した基本ユニット１が余剰ユニットで
あると決定するとともに、判断値γよりも大きいと判断
するときには、ステップ75に進んで選択した基本ユニッ
ト１が余剰ユニットではないと決定し、次のステップ76
で、その決定結果を結果出力手段306に通知する。そし
て、ステップ77で、すべての基本ユニット１に対して活
性か非活性かの決定をしていないことを検出するときに
は、ステップ70に戻るように処理することで、階層ネッ
トワーク構造部11aのすべての基本ユニット１に対して
活性か非活性かの決定を実行する。As shown in the flowchart of FIG. 6, the fourth surplus unit specifying means 304 selects one basic unit 1 which has not been determined to be active or inactive in step 70,
In step 71, the judgment value γ assigned to the basic unit 1 (which can be set for each basic unit 1 or in units of the number of levels of the hierarchical structure) is read from the judgment data storage unit 305. Subsequently, in step 72, the learning signal presenting unit 22 is instructed to present the learning presentation signal to the hierarchical network structure unit 11a, and the output value from the selected basic unit 1 outputted in response to the instruction is given. Is processed so as to obtain the maximum value and the minimum value of. Then, in a succeeding step 73, it is determined whether or not the difference value between the obtained maximum value and the minimum value is smaller than the judgment value γ read in the step 71. When it is determined that the selected basic unit 1 is a surplus unit, and when it is determined that the basic unit 1 is larger than the determination value γ, the process proceeds to step 75, where it is determined that the selected basic unit 1 is not a surplus unit. 76
Then, the result of the determination is notified to the result output means 306. When it is detected in step 77 that all the basic units 1 have not been determined to be active or inactive, the processing is returned to step 70, so that all of the hierarchical network structure units 11a are processed. A determination is made as to whether the basic unit 1 is active or inactive.

このように、第４の余剰ユニット特定手段304は、階
層ネットワーク構造部11aに入力される入力パターンに
対して殆ど出力値を変化させない基本ユニット１はデー
タ処理に寄与していない非活性な余剰のユニットである
と決定していくよう処理するものである。As described above, the fourth surplus unit specifying unit 304 determines that the basic unit 1 that hardly changes the output value with respect to the input pattern input to the hierarchical network structure unit 11a has the inactive surplus that does not contribute to the data processing. Processing is performed so as to be determined as a unit.

次に、シミュレーションに従って、余剰ユニットの特
定処理について具体的に説明する。このシミュレーショ
ンは、階層ネットワーク構造部11aが、第７図に示すよ
うに、２ユニットの入力層・９ユニットの１段構成の中
間層・１ユニットの出力層という階層ネットワーク構造
を持ち、重み値学習装置20aが、第８図に示す20個の学
習信号を使用し、学習定数εを5.0、モーメンタムζを
0.4に設定し、重み値と閾値を−0.5から0.5までの乱数
で初期化し、学習信号を5490回提示（このとき、学習教
師信号と出力信号との差は0.025以下になる）すること
で行った。Next, the surplus unit identification processing will be specifically described according to a simulation. In this simulation, as shown in FIG. 7, the hierarchical network structure section 11a has a hierarchical network structure of two input layers, nine units, a one-stage intermediate layer, and one unit output layer. The device 20a uses the 20 learning signals shown in FIG. 8 and sets the learning constant ε to 5.0 and the momentum ζ
Set to 0.4, initialize the weight value and threshold with random numbers from -0.5 to 0.5, and present the learning signal 5490 times (at this time, the difference between the learning teacher signal and the output signal is 0.025 or less) Was.

第９図（ａ）に、このシミュレーションにより得られ
た重み値の学習値、第９図（ｂ）に、このシミュレーシ
ョンにより得られた閾値の学習値を図示する。ここで、
第９図（ａ）中、“0.548750"は「i0ユニット」と「h0
ユニット」との間の内部結合に割り付けられる重み値を
示しており、また、第９図（ｂ）中、“0.049658"は「i
0ユニット」の閾値を示している。FIG. 9 (a) illustrates the learning value of the weight value obtained by the simulation, and FIG. 9 (b) illustrates the learning value of the threshold value obtained by the simulation. here,
In FIG. 9A, “0.548750” indicates “i0 unit” and “h0
9B shows the weight value assigned to the internal connection between the “unit” and “0.049658” in FIG. 9B.
0 unit ”.

このような重み値が得られると、第１の余剰ユニット
特定手段301は、中間層の「i4ユニット」の出力側の内
部結合に割り付けられる重み値が“−0.014513"とゼロ
値に近いことから、この基本ユニット１を余剰ユニット
と特定することになる。そして、他の中間層の基本ユニ
ット１の出力側の内部結合に割り付けられる重み値は比
較的大きいことから、余剰でない活性化したユニットで
あると特定することになる。When such a weight value is obtained, the first surplus unit identifying means 301 determines that the weight value assigned to the inner connection on the output side of the “i4 unit” in the intermediate layer is “−0.014513”, which is close to zero. This basic unit 1 is specified as a surplus unit. Since the weight value assigned to the internal connection on the output side of the basic unit 1 of the other intermediate layer is relatively large, it is specified that the unit is a non-surplus activated unit.

第10図に、、この第９図（ａ）の中間層の基本ユニッ
ト１の入力側の内部結合に割り付けられる重み値（２次
元の重み空間をなす）をプロットしたものを図示する。FIG. 10 shows a plot of weight values (forming a two-dimensional weight space) assigned to the internal connection on the input side of the basic unit 1 of the intermediate layer in FIG. 9 (a).

このような重み空間が得られると、第２の余剰ユニッ
ト特定手段302は、中間層の「i4ユニット」と「i8ユニ
ット」の入力側の内部結合に割り付けられる重み値の原
点からの距離が小さいことから、これらの基本ユニット
１を余剰ユニットと特定することになる。そして、他の
中間層の基本ユニット１の入力側の内部結合に割り付け
られる重み値の原点からの距離は比較的大きいことか
ら、余剰でない活性化したユニットであると特定するこ
とになる。When such a weight space is obtained, the second surplus unit specifying means 302 determines that the distance from the origin of the weight value assigned to the internal connection of the input side of “i4 unit” and “i8 unit” in the intermediate layer is small. Therefore, these basic units 1 are specified as surplus units. Then, since the distance from the origin of the weight value assigned to the internal connection on the input side of the basic unit 1 of the other intermediate layer is relatively large, the unit is specified as a non-surplus activated unit.

また、第３の余剰ユニット特定手段303は、この第10
図からも分かるように、閾値を考慮しない場合におい
て、中間層の「i2ユニット」と「i6ユニット」とが概略
同一の機能を果たし、「i4ユニット」と「i8ユニット」
とが概略同一の機能を果たすことから、「i6ユニット」
と「i8ユニット」とを余剰ユニットと特定することにな
る。Further, the third surplus unit identification means 303
As can be seen from the figure, when the threshold is not taken into account, the “i2 unit” and “i6 unit” of the intermediate layer perform substantially the same function, and the “i4 unit” and “i8 unit”
Perform the same function as the i6 unit.
And “i8 unit” are specified as surplus units.

第11図に、学習された重み値及び閾値を設定したとき
に、20個の各学習信号に対して出力する中間層の各基本
ユニット１の出力値を図示する。この第11図からも分か
るように、中間層の「i2ユニット」と「i4ユニット」と
「i6ユニット」と「i8ユニット」とが出力値を変化させ
ないことから、第４の余剰ユニット特定手段304はこれ
らの基本ユニット１を余剰ユニットと特定することにな
る。FIG. 11 illustrates the output values of the basic units 1 of the intermediate layer, which are output for each of the 20 learning signals when the learned weight value and the threshold value are set. As can be seen from FIG. 11, since the “i2 unit”, “i4 unit”, “i6 unit”, and “i8 unit” of the intermediate layer do not change the output value, the fourth surplus unit specifying means 304 Specifies these basic units 1 as surplus units.

以上図示実施例について説明したが、本発明はこれに
限定されるものでない。例えば、本発明は階層ネットワ
ーク構成データ処理装置に限られることなく、他のネッ
トワーク構成データ処理装置に対してもそのまま適用で
きるのである。また、本発明の適用を受けるニューロン
は、実施例で説明したような閾値変換処理を行う基本ユ
ニットに限られるものではなく、他の関数変換処理を行
うものであってもよいのである。Although the illustrated embodiment has been described above, the present invention is not limited to this. For example, the present invention is not limited to a hierarchical network configuration data processing device, but can be applied to other network configuration data processing devices as it is. Further, the neurons to which the present invention is applied are not limited to the basic units that perform the threshold value conversion processing as described in the embodiment, but may perform other function conversion processing.

〔The invention's effect〕

以上説明したように、本発明によれば、ネットワーク
構成データ処理装置において、ネットワーク構造中の活
性化していない余剰的なニューロンがどれなのかを特定
できるようになることから、ネットワーク構造のハード
ウェア量を不要に大きく用意しておく必要がなくなる。
これから、ネットワーク構成データ処理装置の演算処理
速度を速められるようになるとともに、ハードウェア量
を削減できるようになるのである。As described above, according to the present invention, in the network configuration data processing device, it is possible to specify which extra neurons are not activated in the network structure, and therefore, the hardware amount of the network structure is reduced. Need not be prepared unnecessarily large.
As a result, the arithmetic processing speed of the network configuration data processing device can be increased, and the amount of hardware can be reduced.

[Brief description of the drawings]

第１図は本発明の原理構成図、第２図は本発明の一実施例、第３図は第１の余剰ユニット特定手段の実行するフロー
チャート、第４図は第２の余剰ユニット特定手段の実行するフロー
チャート、第５図は第３の余剰ユニット特定手段の実行するフロー
チャート、第６図は第４の余剰ユニット特定手段の実行するフロー
チャート、第７図はシミュレーションに用いた階層ネットワーク構
造部の説明図、第８図はシミュレーションに用いた学習信号の説明図、第９図、第10図及び第11図はシミュレーション結果の説
明図、第12図は基本ユニットの基本構成図、第13図は階層ネットワーク部の基本構成図である。図中、１は基本ユニット、10はネットワーク構成データ
処理装置、10aは階層ネットワーク構成データ処理装
置、11はネットワーク構造部、11aは階層ネットワーク
構造部、12は内部状態値格納手段、12aは重み値格納手
段、20は内部状態値学習装置、20aは重み値学習装置、3
0は余剰ニューロン決定装置、30aは余剰ユニット決定装
置である。FIG. 1 is a block diagram of the principle of the present invention, FIG. 2 is an embodiment of the present invention, FIG. 3 is a flowchart executed by the first surplus unit specifying means, and FIG. FIG. 5 is a flowchart executed by a third surplus unit specifying unit, FIG. 6 is a flowchart executed by a fourth surplus unit specifying unit, FIG. 7 is a description of a hierarchical network structure used in the simulation. Fig. 8, Fig. 8 is an explanatory diagram of a learning signal used in the simulation, Figs. 9, 10, and 11 are explanatory diagrams of simulation results, Fig. 12 is a basic configuration diagram of a basic unit, and Fig. 13 is a hierarchy. It is a basic block diagram of a network part. In the figure, 1 is a basic unit, 10 is a network configuration data processing device, 10a is a hierarchical network configuration data processing device, 11 is a network structure unit, 11a is a hierarchical network structure unit, 12 is an internal state value storage unit, and 12a is a weight value. Storage means, 20 is an internal state value learning device, 20a is a weight value learning device, 3
0 is a surplus neuron determination device, and 30a is a surplus unit determination device.

フロントページの続き (72)発明者大和田有理神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (72)発明者浅川和雄神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (56)参考文献特開平２−231670（ＪＰ，Ａ) 特開平３−83157（ＪＰ，Ａ) 特開平３−83156（ＪＰ，Ａ) 萩原将文，中川正雄，「淘汰機能を有するバックプロパゲーション−ＬｏｃａｌＭｉｎｉｍｕｍからの回避アルゴリズム−」，電子情報通信学会春季全国大会（1989年），Ｐ．７−19 (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06F 15/18 G06G 7/00 ＪＩＣＳＴファイル（ＪＯＩＳ)Continuing on the front page (72) Inventor Yuri Owada 1015 Uedanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Inside Fujitsu Co., Ltd. Documents JP-A-2-231670 (JP, A) JP-A-3-83157 (JP, A) JP-A-3-83156 (JP, A) Masafumi Hagiwara, Masao Nakagawa, "Backpropagation with selection function- Evasion Algorithm from Local Minimum ", IEICE Spring National Convention (1989), P. 7-19 (58) Field surveyed (Int. Cl. ⁶ , DB name) G06F 15/18 G06G 7/00 JICST file (JOIS)

Claims

(57) [Claims]

1. A method for receiving one or more inputs and an internal state value to be multiplied with respect to the input to obtain a product sum value, converting the product sum value by a predetermined function to obtain an output value With a network structure formed by internal connections of the obtained neurons as learning targets, an output pattern group output from the network structure in response to the presentation of an input pattern group to be learned, and a pattern to be taken by the output pattern group This is a surplus neuron determination processing method used in a learning processing system that learns an internal state value by calculating an error value between the instruction pattern group and a teacher pattern group that indicates the internal state value based on the error value. Output range specifying means for specifying the maximum value and the minimum value of the output value output by each neuron in response to the presentation of the input pattern group to be learned according to the learned internal state value And a surplus neuron specifying means for specifying a surplus neuron by specifying a neuron whose difference value between the maximum value and the minimum value specified by the output range specifying means is close to zero. Characteristic surplus neuron decision processing method.