JP6433926B2

JP6433926B2 - Rebalancing device, rebalancing method, and program

Info

Publication number: JP6433926B2
Application number: JP2016003875A
Authority: JP
Inventors: 篤史外山
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2016-01-12
Filing date: 2016-01-12
Publication date: 2018-12-05
Anticipated expiration: 2036-01-12
Also published as: JP2017126131A

Description

本発明は、分散システムにおいて、複数のノード間での負荷の偏りを是正する技術に関連するものである。 The present invention relates to a technique for correcting a load imbalance among a plurality of nodes in a distributed system.

近年、クラウドコンピューティングの隆盛に伴い、多量なデータの処理や保持を効率的に行うことが求められている。そこで、複数のサーバを協調動作させることにより効率的な処理を実現する分散処理技術が発展している。分散処理を行う際には、クラスタを構成して分散システムを構築する各サーバ（以降、ノードともいう）が担当するデータを決定する必要がある。この際、分散システム全体でのデータの処理能力を高めるためには、各ノードが担当するデータ数は平均化されていることが望ましい。 In recent years, with the rise of cloud computing, it has been required to efficiently process and hold a large amount of data. Thus, distributed processing technology has been developed that realizes efficient processing by operating a plurality of servers in a coordinated manner. When performing distributed processing, it is necessary to determine data for each server (hereinafter also referred to as a node) that configures a cluster and constructs a distributed system. At this time, in order to increase the data processing capacity of the entire distributed system, the number of data handled by each node is preferably averaged.

ところで、代表的なデータの管理手法には、各データのｋｅｙ（キー）をハッシュ関数にかけた値（以降、ｈａｓｈ（ｋｅｙ）：ハッシュキーともいう）を、ノード数Ｎで割った余り、即ちｈａｓｈ（ｋｅｙ）ｍｏｄＮを番号として持つノードが、データを管理する手法がある。但し、その際、ノードに事前に０からＮ−１まで番号を割り当てている。このような管理手法を用いた場合、ノードを追加又は離脱すると、Ｎの値が変化し、多くのデータでは担当するノードが変更になるため、担当ノードの再配置が必要になる。 By the way, as a representative data management method, a remainder obtained by dividing a value obtained by multiplying the key (key) of each data by a hash function (hereinafter also referred to as hash (key): hash key) by the number of nodes N, that is, hash. There is a method in which a node having (key) mod N as a number manages data. At this time, however, numbers from 0 to N-1 are assigned to the nodes in advance. When such a management method is used, when a node is added or removed, the value of N changes, and the node in charge is changed for a lot of data, so that the node in charge needs to be rearranged.

そこで、ノードの追加・離脱に伴い担当するノードが変更になるデータ数を約1/Nに抑える方法として、コンシステント・ハッシュ[Consistent Hashing]（非特許文献１）を用いたデータ管理手法があり、Amazon Dynamo等で用いられている（非特許文献２）。コンシステント・ハッシュ法を用いたデータ管理手法では、例えば図１５に符号５で示す円形状のＩＤ空間において、ノードＡ，Ｂ，Ｃ，Ｄ，Ｅと、○及び●印で示す負荷が異なる負荷データとの双方にＩＤ(identification)を割り当てる。データのＩＤからＩＤ空間５を時計回りに辿り、最初に突き当たったノードが当該データの担当ノードになる。ノードＡ〜Ｅに対するＩＤの与え方の例としては、ＩＰ(Internet Protocol)アドレスをハッシュ関数にかけた値｛これを、ｈａｓｈ（ＩＰアドレス）ともいう｝が挙げられる。 Therefore, there is a data management method using a consistent hash [Consistent Hashing] (Non-patent Document 1) as a method of suppressing the number of data that the node in charge changes with the addition / detachment of the node to about 1 / N. , Amazon Dynamo, etc. (Non-Patent Document 2). In the data management method using the consistent hash method, for example, in the circular ID space indicated by reference numeral 5 in FIG. 15, the loads indicated by the nodes A, B, C, D, and E, and the marks indicated by ◯ and ● are different. An ID (identification) is assigned to both data and data. The ID space 5 is traced clockwise from the ID of the data, and the first node encountered becomes the responsible node for the data. As an example of how IDs are given to the nodes A to E, a value obtained by applying an IP (Internet Protocol) address to a hash function {this is also referred to as a hash (IP address)} can be given.

クラスタ構成の分散システムでは、例えば各ノードの性能が等しい場合には、各ノードＡ〜Ｅが担当するデータ量は等しい、即ち、コンシステント・ハッシュ法のＩＤ空間５における、ノード間の距離（以降、ノードの担当領域ともいう）が等しいことが望ましい。 In a distributed system having a cluster configuration, for example, when the performance of each node is equal, the amount of data handled by each node A to E is equal, that is, the distance between nodes in the ID space 5 of the consistent hash method (hereinafter, It is desirable that the nodes are in charge of each other.

この点を解決するため、各ノードＡ〜Ｅに仮想的に複数のＩＤを持たせる手法が用いられている（非特許文献３）。各ノードＡ〜Ｅが複数の仮想ＩＤを持つことで、仮想ＩＤ毎の担当領域は異なっていても、大数の法則に従い、ノードＡ〜Ｅの担当領域は平均化される。このようなコンシステント・ハッシュ法や仮想ＩＤ等の従来技術により、ノード間で担当するデータ数を均一化し、負荷を分散させることが可能となる。 In order to solve this point, a method of virtually giving a plurality of IDs to each of the nodes A to E is used (Non-Patent Document 3). Since each node A to E has a plurality of virtual IDs, even if the assigned area for each virtual ID is different, the assigned areas of the nodes A to E are averaged according to the law of large numbers. Conventional techniques such as the consistent hash method and virtual ID make it possible to equalize the number of data handled between nodes and distribute the load.

しかしながら、各ノードＡ〜Ｅの内の特定ノードにて、アクセス頻度の多いデータや、処理時間の長いデータ（高負荷データ)が偏って発生するため、各ノードＡ〜Ｅが担当するデータ数自体は均等であっても、ノード間で負荷の偏りが発生する。 However, since data with a high access frequency and data with a long processing time (high load data) are unevenly generated at specific nodes among the nodes A to E, the number of data handled by the nodes A to E itself Even if they are equal, load imbalance occurs between nodes.

このようなコンシステント・ハッシュ法の分散システムにおける負荷増大に対する対策としては、分散システムに、例えば図１５に示す新たなノードＦを増設して分散システムをスケールアウトさせ、高負荷となったノード（高負荷ノード）、例えば高負荷ノードＣが担当するデータ数を縮小させて負荷を低減する手法がとられている。 As a countermeasure against such a load increase in the distributed system of the consistent hash method, for example, a new node F shown in FIG. 15 is added to the distributed system, and the distributed system is scaled out. High load node), for example, a method of reducing the load by reducing the number of data handled by the high load node C.

また、ノードのコンシステント・ハッシュ上での空間配置変更（これを、リバランスという）を行い適切に負荷が分散されていれば、増設を行うことなく現行のノード台数で対処可能なケースもある。非特許文献４には、スケールアウト／リバランスで対処すべき状況を識別し、更に、リバランスで対処すべき状況においては、コンシステント・ハッシュ空間（ＩＤ空間５）上の隣接ノード間（例えばＥであれば、２つの矢印で示す隣のＡ又はＤ）でリバランスを実行して、ノード間の負荷の偏りを是正する手法が提案されている。また、リバランスについて、非特許文献４以外にも、種々の技術が検討されている。 In addition, there is a case where the current number of nodes can be dealt with without additional installation if the load is distributed appropriately by changing the spatial arrangement on the consistent hash of the node (this is called rebalancing). . Non-Patent Document 4 identifies the situation to be dealt with by scale-out / rebalance, and in the situation to be dealt with by rebalance, between adjacent nodes on the consistent hash space (ID space 5) (for example, In the case of E, a method has been proposed in which rebalancing is performed in the adjacent A or D) indicated by two arrows to correct the load imbalance between nodes. In addition to the non-patent document 4, various techniques for rebalancing have been studied.

David Karger(著), "Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web"David Karger (Author), "Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web" Giuseppe DeCandia (著), "Dynamo: Amazon's Highly Available Key-value Store"Giuseppe DeCandia (Author), "Dynamo: Amazon's Highly Available Key-value Store" 入江道生他，"コンシステント・ハッシュ法におけるデータの複製を意識した負荷分散手法"，信学技報，IEICE Technical Report，IN2010-77（2010-10）Michio Irie et al., "Load Balancing Method Aware of Data Replication in the Consistent Hash Method", IEICE Technical Report, IEICE Technical Report, IN2010-77 (2010-10) 鶴田泰他，「分散サーバシステムにおけるノード負荷分散最適化方式」，電子情報通信学会総合大会講演論文集，Mar．2014，B-7-84Y. Tsuruta et al., “Node Load Balancing Optimization Method in Distributed Server System”, Proceedings of the IEICE General Conference, Mar. 2014, B-7-84

既存のリバランス方法は時間経過を考慮していないため、リバランス設計のための計算に係る時間によっては、実負荷が適正値であるにも関わらず、リバランスが実行されてしまうという課題がある。 Since the existing rebalancing method does not consider the passage of time, there is a problem that depending on the time for calculation for rebalancing design, the rebalancing is executed even though the actual load is an appropriate value. is there.

例えば、図１６（ａ）に示すＩＤ空間において、ノードＢの負荷（図１６のＢとＢ´の担当領域の負荷）と、ノードＥの負荷（図１６のＥとＥ´の担当領域の負荷）が許容範囲外にある。従って、リバランスを行うことが決定される。 For example, in the ID space shown in FIG. 16A, the load on the node B (the load in the area in charge of B and B ′ in FIG. 16) and the load on the node E (the load in the area in charge of E and E ′ in FIG. 16). ) Is outside the allowable range. Therefore, it is determined to perform rebalancing.

そして、図１６（ａ）に示す状態から、リバランス設計のための計算に係る時間（Δｔ）が経過した後（リバランスをする前）、図１６（ｂ）に示す状態になったとする。図１６（ｂ）に示す状態は、負荷の高かったノードＥ（Ｅ´）における負荷が減少し、負荷の低かったノードＢにおける負荷が増加したことを示している。これにより、負荷のアンバランスは許容範囲に収まっている。 Then, assume that the state shown in FIG. 16B is reached after the time (Δt) related to the calculation for the rebalance design has elapsed from the state shown in FIG. The state shown in FIG. 16B indicates that the load at node E (E ′) having a high load has decreased and the load at node B having a low load has increased. Thereby, the load imbalance is within an allowable range.

しかし、従来技術では、図１６（ａ）の時点でリバランスを行うことを決定したら、図１６（ｂ）に示す状態になるか否かに関わらずに、リバランスを実行する。従って、例えば、図１６（ｂ）に示す状態からリバランスを実行したために、アンバランスが生じ、再度リバランスを実行しなければならない、といったことが生じ得る。 However, in the prior art, if it is decided to perform rebalancing at the time of FIG. 16A, the rebalancing is executed regardless of whether or not the state shown in FIG. Therefore, for example, since the rebalance is executed from the state shown in FIG. 16B, an unbalance may occur and the rebalance must be executed again.

本発明は上記の点に鑑みてなされたものであり、分散システムにおける複数のノード間での負荷の偏りを是正するリバランスを実施する技術において、時間の経過に伴う負荷状態に基づき、リバランスの実行可否を決定することを可能とする技術を提供することを目的とする。 The present invention has been made in view of the above points. In a technique for performing rebalancing to correct a load imbalance among a plurality of nodes in a distributed system, the rebalancing is performed based on the load state with time. It is an object of the present invention to provide a technique that makes it possible to determine whether or not to execute the above.

本発明の実施の形態によれば、通信サービスを利用する複数のクライアントマシンからの情報がネットワークを介して振り分けられる複数のノードを有する分散システムにおいて用いられるリバランス装置であって、
前記複数のノードの負荷量に基づいて、当該複数のノード間の負荷量の偏りを抑制するリバランスが必要であるか否かを判定するリバランス処理手段と、
前記リバランス処理手段により、リバランスが必要であると判定された場合において、前記リバランス後の前記複数のノードの予測負荷状態に基づいて、前記リバランスをキャンセルするか否かを判定するキャンセル処理手段とを備え、
前記キャンセル処理手段は、
前記リバランス処理手段によりリバランスが必要であると判定された第１の時点において、当該第１の時点における前記複数のノードの負荷量の平均値からの差分をノード毎に算出し、前記第１の時点から、前記リバランス処理手段によるリバランス設計にかかる時間が経過した第２の時点において、ノード毎に、当該第２の時点におけるノードの負荷量から前記差分を引くことにより、前記予測負荷状態を算出する
ことを特徴とするリバランス装置が提供される。 According to an embodiment of the present invention, there is provided a rebalancing apparatus used in a distributed system having a plurality of nodes to which information from a plurality of client machines using a communication service is distributed via a network,
Rebalancing processing means for determining whether or not rebalancing is necessary to suppress the uneven load amount between the plurality of nodes based on the load amounts of the plurality of nodes;
When the rebalancing processing unit determines that rebalancing is necessary, based on the predicted load state of the plurality of nodes after the rebalancing, canceling whether to cancel the rebalancing or not Processing means ,
The cancellation processing means
At a first time point when the rebalance processing unit determines that rebalancing is necessary, a difference from an average value of load amounts of the plurality of nodes at the first time point is calculated for each node, By subtracting the difference from the load amount of the node at the second time point for each node at the second time point when the time required for the rebalance design by the rebalance processing unit has elapsed from the time point 1, the prediction is performed. A rebalancing device is provided that calculates a load state .

また、本発明の実施の形態によれば、通信サービスを利用する複数のクライアントマシンからの情報がネットワークを介して振り分けられる複数のノードを有する分散システムにおいて用いられるリバランス装置が実行するリバランス方法であって、
前記複数のノードの負荷量に基づいて、当該複数のノード間の負荷量の偏りを抑制するリバランスが必要であるか否かを判定するリバランス判定ステップと、
前記リバランス判定ステップにより、リバランスが必要であると判定された場合において、前記リバランス後の前記複数のノードの予測負荷状態に基づいて、前記リバランスをキャンセルするか否かを判定するキャンセル判定ステップとを備え、
前記キャンセル判定ステップにおいて、前記リバランス装置は、
前記リバランス判定ステップによりリバランスが必要であると判定された第１の時点において、当該第１の時点における前記複数のノードの負荷量の平均値からの差分をノード毎に算出し、前記第１の時点から、リバランス設計にかかる時間が経過した第２の時点において、ノード毎に、当該第２の時点におけるノードの負荷量から前記差分を引くことにより、前記予測負荷状態を算出する
ことを特徴とするリバランス方法が提供される。 In addition, according to the embodiment of the present invention, the rebalancing method executed by the rebalancing apparatus used in the distributed system having a plurality of nodes to which information from a plurality of client machines using the communication service is distributed via the network Because
A rebalance determination step for determining whether or not rebalancing is required to suppress the uneven load amount between the plurality of nodes based on the load amounts of the plurality of nodes;
Cancellation for determining whether or not to cancel the rebalancing based on the predicted load state of the plurality of nodes after the rebalancing when the rebalancing determination step determines that rebalancing is necessary A determination step ,
In the cancellation determination step, the rebalance device
At a first time point when rebalancing is determined by the rebalance determining step, a difference from an average value of load amounts of the plurality of nodes at the first time point is calculated for each node, The predicted load state is calculated by subtracting the difference from the load amount of the node at the second time point for each node at the second time point when the time required for rebalance design has elapsed from the time point 1. A rebalancing method is provided.

本発明の実施の形態によれば、分散システムにおける複数のノード間での負荷の偏りを是正するリバランスを実施する技術において、時間の経過に伴う負荷状態に基づき、リバランスの実行可否を決定することを可能とする技術が提供される。 According to an embodiment of the present invention, in a technique for rebalancing to correct a load imbalance among a plurality of nodes in a distributed system, whether or not rebalancing can be performed is determined based on a load state over time. Techniques are provided that enable this to be done.

本発明の第１実施形態に係る分散システムの構成を示すブロック図である。1 is a block diagram illustrating a configuration of a distributed system according to a first embodiment of the present invention. 本実施形態の分散システムにおけるノードの構成を示し、（ａ）は制御部の構成を示すブロック図、（ｂ）は記憶部の情報を示すブロック図である。The configuration of the node in the distributed system of this embodiment is shown, (a) is a block diagram showing the configuration of the control unit, (b) is a block diagram showing the information in the storage unit. 複数のノードＡ〜Ｅで分割されたハッシュ空間を示す図である。It is a figure which shows the hash space divided | segmented by several nodes AE. （ａ）ノード識別子管理表の一例を示す図、（ｂ）振分ＩＤ表の一例を示す図である。(A) A figure showing an example of a node identifier management table, (b) A figure showing an example of a distribution ID table. （ａ）ノード負荷計測データ（ノード単位）の一例を示す図、（ｂ）ノード負荷計測データ（仮想ノード単位）の一例を示す図、（ｃ）ノード負荷計測データ（データ単位）の一例を示す図である。(A) A diagram showing an example of node load measurement data (node unit), (b) a diagram showing an example of node load measurement data (virtual node unit), and (c) an example of node load measurement data (data unit). FIG. 分散システム負荷集計データの一例を示す図である。It is a figure which shows an example of distributed system load total data. （ａ）リバランス前の振分ＩＤ表の一例を示す図、（ｂ）リバランス後の振分ＩＤ表の一例を示す図、（ｃ）リバランス後の振分ＩＤ表の他例を示す図である。(A) The figure which shows an example of the distribution ID table before rebalance, (b) The figure which shows an example of the distribution ID table after rebalance, (c) The other example of the distribution ID table after rebalance is shown FIG. （ａ）リバランス前の振分ＩＤ表の一例を示す図、（ｂ）リバランス後の振分ＩＤ表の一例を示す図である。(A) A figure showing an example of a distribution ID table before rebalancing, (b) A figure showing an example of a distribution ID table after rebalancing. リバランシングキャンセル機能部の動作を説明するための図である。It is a figure for demonstrating operation | movement of a rebalancing cancellation function part. リバランシングキャンセル機能部の動作を説明するための図である。It is a figure for demonstrating operation | movement of a rebalancing cancellation function part. リバランシングキャンセル機能部の動作の効果を説明するための図である。It is a figure for demonstrating the effect of operation | movement of a rebalancing cancellation function part. 本実施形態の分散システムの各ノードのリバランスを実行する際の動作を説明するためのフローチャートである。It is a flowchart for demonstrating the operation | movement at the time of performing the rebalance of each node of the distributed system of this embodiment. 本実施形態の分散システムの各ノードのリバランスを実行する際の動作を説明するためのフローチャートである。It is a flowchart for demonstrating the operation | movement at the time of performing the rebalance of each node of the distributed system of this embodiment. リバランシングキャンセル機能部の動作を説明するためのフローチャートである。It is a flowchart for demonstrating operation | movement of a rebalancing cancellation function part. 従来技術を説明するためのハッシュ空間を示す図である。It is a figure which shows the hash space for demonstrating a prior art. 課題を説明するための図である。It is a figure for demonstrating a subject.

以下、図面を参照して本発明の実施の形態（本実施の形態）を説明する。なお、以下で説明する実施の形態は一例に過ぎず、本発明が適用される実施の形態は、以下の実施の形態に限られるわけではない。 Hereinafter, an embodiment (this embodiment) of the present invention will be described with reference to the drawings. The embodiment described below is only an example, and the embodiment to which the present invention is applied is not limited to the following embodiment.

例えば、以下で説明する例では、クラスタを構成するノード自身が、リバランス必要性判定、リバランス設計、リバランスのキャンセル判定を行うこととしているが、リバランス必要性判定、リバランス設計・実行、リバランスのキャンセル判定を、クラスタを構成するノード以外の装置が実行してもよい。この場合、当該装置が各ノードの負荷情報を収集し、リバランス必要性判定、リバランス設計、リバランスのキャンセル判定、振分ＩＤ表の配付等を行う。 For example, in the example described below, the nodes constituting the cluster themselves perform rebalancing necessity determination, rebalancing design, and rebalancing cancellation determination. However, rebalancing necessity determination, rebalancing design and execution are performed. The rebalancing cancellation determination may be executed by a device other than the nodes constituting the cluster. In this case, the device collects load information of each node and performs rebalancing necessity determination, rebalancing design, rebalancing cancellation determination, distribution ID table distribution, and the like.

なお、リバランス処理・リバランシングキャンセル判定処理を行う主体が、ノード自身の場合、ノード以外の装置の場合のいずれの場合も、当該主体をリバランス装置と称することができる。 In addition, when the main body that performs the rebalancing process / rebalancing cancellation determination process is the node itself or an apparatus other than the node, the main body can be referred to as a rebalancing apparatus.

また、以下で説明するリバランスの方法は一例である。本発明に係るリバランシングキャンセル判定方法は、以下で説明するリバランスの方法に限らず、他のリバランスの方法（例：非特許文献４）にも適用可能である。 The rebalancing method described below is an example. The rebalancing cancellation determination method according to the present invention is not limited to the rebalancing method described below, but can be applied to other rebalancing methods (eg, Non-Patent Document 4).

ただし、以下で説明するリバランスの方法は、非特許文献４のように隣接ノード間での空間配置変更に限定されず、効率的に負荷の偏りを是正できる優れたリバランスの方法である。 However, the rebalancing method described below is not limited to changing the spatial arrangement between adjacent nodes as in Non-Patent Document 4, and is an excellent rebalancing method that can efficiently correct the load imbalance.

（システムの全体構成、ノードの構成）
図１は、本実施の形態に係る分散システムの構成例を示すブロック図である。 (Overall system configuration, node configuration)
FIG. 1 is a block diagram illustrating a configuration example of a distributed system according to the present embodiment.

図１に示す分散システム１０は、コンシステント・ハッシュ法を用いた複数のノード１５を利用し、データ管理を行うシステムである。当該分散システム１０では、分散システム１０を構成するノード１５間で負荷の偏りが生じた際に、現行ノード１５の負荷の偏り状況を踏まえて、効率的にリバランスを行い負荷の偏りを是正する。ただし、リバランスが必要であると判断した場合、所定時間経過後の負荷状態に基づき、リバランスを実行するか否か（キャンセルするか否か）を判定し、キャンセルしない場合にリバランスを実行する。 A distributed system 10 shown in FIG. 1 is a system that performs data management using a plurality of nodes 15 using a consistent hash method. In the distributed system 10, when a load bias occurs between the nodes 15 constituting the distributed system 10, the load balance of the current node 15 is efficiently rebalanced to correct the load bias. . However, if it is determined that rebalancing is necessary, it is determined whether or not rebalancing is to be executed (whether or not to cancel) based on the load state after a predetermined time has elapsed. To do.

分散システム１０は、複数のクライアントマシン（単に、クライアントともいう）１１にインターネット等のネットワーク１２を介して接続されたロードバランサ１３と、クラスタ１４を構成する複数のノード１５とを備えて構成されている。 The distributed system 10 includes a load balancer 13 connected to a plurality of client machines (also simply referred to as clients) 11 via a network 12 such as the Internet, and a plurality of nodes 15 constituting a cluster 14. Yes.

各ノード１５は、コンピュータ等の物理装置や仮想マシン等の論理装置、言い換えれば、物理的又は仮想的なサーバ等である。クライアント１１からのメッセージが、ロードバランサ１３によって各ノード１５に振り分けられる。この振り分けは、単純なラウンドロビン法等により行われる。 Each node 15 is a physical device such as a computer or a logical device such as a virtual machine, in other words, a physical or virtual server. A message from the client 11 is distributed to each node 15 by the load balancer 13. This distribution is performed by a simple round robin method or the like.

ノード１５は、制御部１８及び記憶部１９を備えて構成されている。但し、制御部１８及び記憶部１９は、ソフトウェア（プログラム）が上記装置（コンピュータ）によって実行されることにより実現されている。当該プログラムは、ネットワークを介して配信してもよいし、メモリ等の記憶媒体に記憶して配付してもよい。なお、制御部１８及び記憶部１９は、それぞれハードウェア（例：処理ロジックを組み込んだ集積回路）によって構成してもよい。 The node 15 includes a control unit 18 and a storage unit 19. However, the control part 18 and the memory | storage part 19 are implement | achieved when software (program) is run by the said apparatus (computer). The program may be distributed via a network, or may be stored and distributed in a storage medium such as a memory. Note that the control unit 18 and the storage unit 19 may each be configured by hardware (for example, an integrated circuit incorporating processing logic).

図２（ａ）に示すように、制御部１８は、ノード識別子管理部１８ａと、振分部１８ｂと、信号処理部１８ｃと、ノード負荷計測部１８ｄと、分散システム負荷リバランス部（単に、リバランス部ともいう）１８ｅと、リバランシングキャンセル機能部１８ｆとを備える。 As shown in FIG. 2A, the control unit 18 includes a node identifier management unit 18a, a distribution unit 18b, a signal processing unit 18c, a node load measurement unit 18d, a distributed system load rebalancing unit (simply, 18e and a rebalancing cancel function unit 18f.

図２（ｂ）に示すように、記憶部１９は、ノード識別子管理表１９ａと、振分ＩＤ表１９ｂと、データ１９ｃと、ノード負荷計測データ１９ｄと、分散システム負荷集計データ１９ｅと、呼制御状態フラグ１９ｆと、ノード毎負荷差分表・ノード毎予測負荷比較表１９ｇと、前回測定データ１９ｈとを記憶する。なお、ノード識別子管理表１９ａを管理表１９ａともいい、分散システム負荷集計データ１９ｅを集計データ１９ｅ、呼制御状態フラグ１９ｆをフラグ１９ｆともいう。 As shown in FIG. 2B, the storage unit 19 includes a node identifier management table 19a, a distribution ID table 19b, data 19c, node load measurement data 19d, distributed system load summary data 19e, and call control. A state flag 19f, a node-by-node load difference table / node-by-node predicted load comparison table 19g, and previous measurement data 19h are stored. The node identifier management table 19a is also called a management table 19a, the distributed system load summary data 19e is also called summary data 19e, and the call control status flag 19f is also called flag 19f.

振分部１８ｂは、クライアント１１からのメッセージ（情報）を、例えばコンシステント・ハッシュ法等に基づき、メッセージを担当するノード１５に振り分ける。 The distribution unit 18b distributes the message (information) from the client 11 to the node 15 in charge of the message based on, for example, a consistent hash method.

信号処理部１８ｃは、クライアント１１からのメッセージに応じて、所定の信号処理を行い、クライアント１１にサービスを提供する。つまり、メッセージを担当するノード１５では、信号処理部１８ｃにて所定の信号処理を行ってクライアント１１にサービスを提供する。この振分部１８ｂ及び信号処理部１８ｃの処理動作については後述で更に詳細に説明する。 The signal processing unit 18 c performs predetermined signal processing in response to a message from the client 11 and provides a service to the client 11. That is, the node 15 in charge of the message provides a service to the client 11 by performing predetermined signal processing in the signal processing unit 18c. The processing operations of the distribution unit 18b and the signal processing unit 18c will be described in detail later.

但し、分散システム１０においては、ロードバランサ１３が存在せず、クライアント１１から任意のノード１５（振分部１８ｂ）にメッセージを送信することも可能である。また、振分部１８ｂと信号処理部１８ｃは、図２のように同じノード１５上に同時に存在させてもよいし、別ノード１５上に存在させてもよい。 However, in the distributed system 10, the load balancer 13 does not exist, and a message can be transmitted from the client 11 to an arbitrary node 15 (distribution unit 18b). Further, the allocating unit 18b and the signal processing unit 18c may be simultaneously present on the same node 15 as illustrated in FIG.

制御部１８において、ノード識別子管理部１８ａ（あるいはリバランス部１８ｅ）は、分散システム１０上のノード情報をノード識別子管理表１９ａに蓄積することにより、各ノード１５が担当するＩＤ空間を管理する。このＩＤ空間は、コンシステント・ハッシュ法ではコンシステント・ハッシュ上の空間（ハッシュ空間）である。 In the control unit 18, the node identifier management unit 18a (or rebalancing unit 18e) manages the ID space handled by each node 15 by accumulating node information on the distributed system 10 in the node identifier management table 19a. This ID space is a space (hash space) on the consistent hash in the consistent hash method.

このハッシュ空間を、例えば図３に示すように、複数のノードＡ〜Ｅで分割し、各ノードＡ〜Ｅの担当領域を決めて管理する。この際、ノードＡが担当するハッシュ空間は、ノードＥから時計回りにノードＡまでの領域であり、このハッシュ空間に存在するデータを担当ノードＡが保持（もしくは処理）する。他のノードＢ〜Ｅも同様である。なお、ハッシュ空間（担当領域）のサイズが大きい程に、多くのデータを保持（処理）できるようになっている。 For example, as shown in FIG. 3, the hash space is divided by a plurality of nodes A to E, and assigned areas of the nodes A to E are determined and managed. At this time, the hash space handled by the node A is an area from the node E to the node A in the clockwise direction, and the responsible node A holds (or processes) data existing in the hash space. The same applies to the other nodes B to E. In addition, as the size of the hash space (area in charge) is larger, more data can be held (processed).

図２（ａ）に戻って、振分部１８ｂは、振分ＩＤ表１９ｂに基づき、メッセージ等のデータの振分先の決定に関する処理を行う。 Returning to FIG. 2A, the distribution unit 18b performs processing related to determination of a distribution destination of data such as a message based on the distribution ID table 19b.

信号処理部１８ｃは、ノード１５における信号処理を行う。この信号処理時のアクセス対象となるデータ１９ｃが記憶部１９に記憶される。 The signal processing unit 18 c performs signal processing at the node 15. Data 19c to be accessed at the time of this signal processing is stored in the storage unit 19.

ノード負荷計測部１８ｄは、自ノード１５の負荷を計測し、この計測結果を記憶部１９にノード負荷計測データ１９ｄとして記録すると共に、必要に応じて定められる特権ノード１５（図３に示す例えばノードＢ）に送付する。 The node load measurement unit 18d measures the load of the node 15 and records the measurement result as the node load measurement data 19d in the storage unit 19, and the privileged node 15 (for example, the node shown in FIG. 3) determined as necessary. Send to B).

分散システム負荷リバランス部１８ｅは、分散システム１０全体のノード負荷に基づいて、負荷の平均値及び標準偏差等の算出を行い、これらの算出結果である分散システム負荷集計データ１９ｅを記憶部１９に記憶する。更に、リバランス部１８ｅは、その記憶された集計データ１９ｅに基づくリバランスの必要性判定、並びにリバランス設計を行って、リバランシングキャンセル機能部１８ｆにより、リバランスのキャンセルをしないと判定された場合に、リバランスを実行する。リバランシングキャンセル機能部１８ｆについては後述する。 The distributed system load rebalancing unit 18e calculates the average value and standard deviation of the load based on the node load of the entire distributed system 10, and stores the distributed system load aggregate data 19e as the calculation result in the storage unit 19. Remember. Further, the rebalancing unit 18e performs the rebalancing necessity determination and the rebalancing design based on the stored total data 19e, and the rebalancing cancellation function unit 18f determines that the rebalancing is not canceled. If so, perform a rebalance. The rebalancing cancel function unit 18f will be described later.

また、記憶部１９に記憶される呼制御状態フラグ１９ｆは、新規呼を制御する状態か否かを判別するための情報である。 The call control state flag 19f stored in the storage unit 19 is information for determining whether or not the state is a state for controlling a new call.

（振分部１８ｂ及び信号処理部１８ｃにおける処理について）
ここで、前述した図２に示すノード１５の振分部１８ｂ及び信号処理部１８ｃによるメッセージの振分処理及び信号処理について更に詳細に説明する。 (About processing in the distribution unit 18b and the signal processing unit 18c)
Here, message distribution processing and signal processing by the distribution unit 18b and the signal processing unit 18c of the node 15 shown in FIG. 2 will be described in more detail.

振分部１８ｂは、クライアント１１から発呼されるメッセージ内の情報をもとに、信号処理を担当するノード１５を特定し、当該ノード１５にメッセージの振り分けを行う。メッセージは、新規呼（例えば、ＳＩＰ(Session Initiation Protocol)においてはＩｎｉｔｉａｌ−ＩＮＶＩＴＥ等）と後続呼（ＳＩＰにおいてはＢＹＥ等）に分けられる。 The distribution unit 18b identifies the node 15 in charge of signal processing based on the information in the message called from the client 11, and distributes the message to the node 15. The message is divided into a new call (for example, Initial-INVITE or the like in SIP (Session Initiation Protocol)) and a subsequent call (for example, BYE in SIP).

新規呼か後続呼かの識別は、呼のメッセージに後述の振分キーが埋め込まれているか否かで判定できる。例えば、ＳＩＰにおいては、Ｔｏ／ＦｒｏｍヘッダのＴａｇ等で判定できる。 Whether the call is a new call or a subsequent call can be determined based on whether or not a distribution key described later is embedded in the call message. For example, in SIP, it can be determined by Tag of To / From header or the like.

振分キーは、"データ識別子（ＳＩＰにおいてはｃａｌｌ−ｉｄ）＋ハッシュ値"で構成されている。ハッシュ値は、データ識別子からハッシュ関数をかけて導出された値である。 The distribution key is configured by “data identifier (call-id in SIP) + hash value”. The hash value is a value derived from the data identifier by applying a hash function.

一方、上述した新規呼か後続呼かの識別の判定の結果、後続呼の場合、振分部１８ｂにて、振分ＩＤ表１９ｂ上のノード１５毎の担当領域である振分ＩＤ空間{図４（ｂ）に示し後述する}と、振分キー内のハッシュ値とを比較して担当するノード１５を特定する。更に、担当するノード１５のアドレスを、後述の図４（ａ）に示すノード識別子管理表１９ａから特定し、この特定されたノード１５に転送する。 On the other hand, in the case of a subsequent call as a result of the above-described determination of whether the call is a new call or a subsequent call, the distribution unit 18b allocates a distribution ID space that is an assigned region for each node 15 on the distribution ID table 19b { 4 (b) and will be described later} is compared with the hash value in the distribution key to identify the node 15 in charge. Further, the address of the node 15 in charge is specified from the node identifier management table 19a shown in FIG. 4A described later, and transferred to the specified node 15.

一方、上述した判定の結果、新規呼の場合、振分キーが存在しないため、メッセージからＣａｌｌ−ｉｄ（データ識別子）を抽出し、これをハッシュ関数に導入してハッシュ値を導出する。更に、振分部１８ｂにて、振分ＩＤ表１９ｂ上のノード１５毎の担当領域である振分ＩＤ空間{図４（ｂ）に示し後述する}と、導出したハッシュ値とを比較して担当するノード１５を特定する。更に、担当するノード１５のアドレスを、後述の図４（ａ）に示すノード識別子管理表１９ａから特定し、この特定されたノード１５に転送する。 On the other hand, as a result of the determination described above, since a distribution key does not exist in the case of a new call, a Call-id (data identifier) is extracted from the message and introduced into a hash function to derive a hash value. Further, the distribution unit 18b compares the distribution ID space {shown in FIG. 4B and described later}, which is the assigned area for each node 15 on the distribution ID table 19b, with the derived hash value. The node 15 in charge is specified. Further, the address of the node 15 in charge is specified from the node identifier management table 19a shown in FIG. 4A described later, and transferred to the specified node 15.

新規呼を信号処理部１８ｃで受信した場合も、メッセージからＣａｌｌ−ｉｄ（データ識別子）を抽出し、これをハッシュ関数に導入してハッシュ値を導出して、振分キーを生成する。また、信号処理部１８ｃによる信号処理後に、クライアント１１（ＳＩＰにおいてはＵＡＣやＵＡＳ等）に送付するメッセージに振分キーを埋め込んで（ＳＩＰにおいてはＴｏ／ＦｒｏｍヘッダのＴａｇ）送付する。 Even when a new call is received by the signal processing unit 18c, a Call-id (data identifier) is extracted from the message, and this is introduced into a hash function to derive a hash value, thereby generating a distribution key. Further, after the signal processing by the signal processing unit 18c, a distribution key is embedded in a message to be sent to the client 11 (UAC, UAS, etc. in SIP) (To / From header Tag in SIP) and sent.

以降、クライアント１１からの後続呼には本振分キーを埋め込みの上、メッセージを送付し、振分部１８ｂにて本振分キーのハッシュ値を基に振り分けが行われることで、当該呼が処理されたノード１５に後続呼が届くことが可能となる。 Thereafter, a message is sent after embedding the real distribution key in the subsequent call from the client 11, and the distribution unit 18b distributes the call based on the hash value of the main distribution key. Subsequent calls can reach the processed node 15.

（ノード識別子管理部１８ａについて）
次に、上述したノード識別子管理部１８ａについて、より詳細に説明する。 (About the node identifier management unit 18a)
Next, the node identifier management unit 18a described above will be described in more detail.

ノード識別子管理部１８ａは、分散システム１０へのノード１５の追加や離脱が発生した際に、分散システム１０を構成するノード１５の識別子情報（ノード識別子）を更新し、これを、図４（ａ）に示すノード識別子管理表１９ａとして管理する。図４（ａ）の例においては、ノード識別子（又はノードＩＤ）（例えば、「Ｎｏｄｅ１」）に、アドレス（例えば、「１０．４５．０．１」）が対応付けられている。そのノード識別子は、特権ノードのノード識別子管理部１８ａで付与され、全ノード１５へと配信される。 The node identifier management unit 18a updates the identifier information (node identifier) of the node 15 constituting the distributed system 10 when the node 15 is added to or removed from the distributed system 10, and this is updated as shown in FIG. The node identifier management table 19a shown in FIG. In the example of FIG. 4A, an address (for example, “10.45.0.1”) is associated with a node identifier (or node ID) (for example, “Node 1”). The node identifier is assigned by the node identifier management unit 18a of the privileged node and distributed to all the nodes 15.

コンシステント・ハッシュ法においては、ノード識別子に、図４（ｂ）に示す仮想ノード識別子（又は仮想ノードＩＤ）が従属している。この仮想ノードＩＤは、振分ＩＤ空間の任意のＩＤ（ハッシュ値による）である。例えば図４（ａ）に示すノード識別子「Ｎｏｄｅ１」には、図４（ｂ）の振分ＩＤ表１９ｂに示す少なくとも１つ以上の仮想ノードＩＤ「Ｎｏｄｅ１−１」，「Ｎｏｄｅ１−２」が従属している。言い換えれば、ノード１５に１つ以上の仮想ノードが従属している。但し、これは基本構成であって、ノード１５に仮想ノードが従属しない場合もある。 In the consistent hash method, the virtual node identifier (or virtual node ID) shown in FIG. 4B is subordinate to the node identifier. This virtual node ID is an arbitrary ID (by hash value) in the distribution ID space. For example, the node identifier “Node1” shown in FIG. 4A is subordinate to at least one or more virtual node IDs “Node1-1” and “Node1-2” shown in the distribution ID table 19b of FIG. 4B. doing. In other words, one or more virtual nodes are subordinate to the node 15. However, this is a basic configuration, and the virtual node may not be subordinate to the node 15 in some cases.

このように、前述のノード識別子管理表１９ａの更新と合わせて、ノード１５が担当する振分ＩＤ空間の担当領域を更新し、これを振分ＩＤ表１９ｂとして管理する。振分ＩＤ表１９ｂには、例えば、仮想ノードＩＤ「Ｎｏｄｅ１−１」に、担当する振分ＩＤ空間の担当領域として「０〜１９９（Ｄ＝２００）」のデータサイズが対応付けられ、仮想ノードＩＤ「Ｎｏｄｅ１−２」に、担当する振分ＩＤ空間の担当領域として「６００〜９９９（Ｄ＝４００）」のデータサイズが対応付けられている。即ち、Ｄ＝２００は、担当領域のデータサイズが２００であることを示す。他のＤ＝４００等も同じである。 Thus, in conjunction with the update of the node identifier management table 19a described above, the assigned area of the distribution ID space handled by the node 15 is updated and managed as the distribution ID table 19b. In the distribution ID table 19b, for example, the virtual node ID “Node1-1” is associated with the data size of “0 to 199 (D = 200)” as the assigned area of the assigned ID space, and the virtual node The data size of “600 to 999 (D = 400)” is associated with the ID “Node1-2” as the assigned area of the assigned ID space. That is, D = 200 indicates that the data size of the assigned area is 200. The same applies to other D = 400 and the like.

（ノード負荷計測部１８ｄの処理について）
次に、上述したノード負荷計測部１８ｄにより計測される負荷の情報収集と、この収集された負荷の特権ノードへの送付について説明する。 (Regarding the processing of the node load measuring unit 18d)
Next, information collection of the load measured by the node load measurement unit 18d described above and sending of the collected load to the privileged node will be described.

ノード負荷計測部１８ｄは、所定の周期で当該ノード１５の負荷を計測し、これをノード負荷計測データ１９ｄとして記憶部１９に記録して蓄積する。また、ノード負荷計測部１８ｄは、所定の周期で特権ノード（例えば図３に示すノードＢ）に蓄積したノード負荷計測データ１９ｄを送付する。 The node load measurement unit 18d measures the load on the node 15 at a predetermined cycle, and records and accumulates the load on the storage unit 19 as node load measurement data 19d. Further, the node load measurement unit 18d sends the node load measurement data 19d accumulated in the privileged node (for example, the node B shown in FIG. 3) at a predetermined cycle.

また、特権ノードは、各ノード１５から収集した全ノードの負荷データを、全ノード１５へ配信する。各ノード１５は、この負荷データをノード負荷計測データ１９ｄとして記憶部１９に記録することができる。 Further, the privileged node distributes the load data of all nodes collected from each node 15 to all the nodes 15. Each node 15 can record this load data in the storage unit 19 as node load measurement data 19d.

上述したノード負荷計測部１８ｄにおいて所定周期で計測されるノード１５の負荷として、ＣＰＵ(Central Processing Unit)使用率、メモリ使用率、アクセス頻度等の、ノード１５にて取得可能なあらゆるパラメータを使用することができる。また、どの数値がボトルネックとなるか、更に、どの程度の値であればリバランスすべき閾値となるかは、分散システムのシステム特性に応じて異なり、複数のパラメータの組み合わせにより判断するケースもある。従って、特定のパラメータ種別に限定せず利用可能とする。 As the load of the node 15 measured at a predetermined period in the node load measuring unit 18d described above, all parameters that can be acquired by the node 15 such as a CPU (Central Processing Unit) usage rate, a memory usage rate, and an access frequency are used. be able to. Also, which numerical value is the bottleneck, and what value is the threshold to be rebalanced depends on the system characteristics of the distributed system, and there are cases where it is determined by a combination of multiple parameters. is there. Therefore, it can be used without being limited to a specific parameter type.

また、ノード負荷計測部１８ｄによる負荷の計測単位は、図５の（ａ）ノードＩＤによるノード単位、（ｂ）仮想ノードＩＤによる仮想ノードＩＤ単位、（ｃ）データ単位の内、どの単位で計測しても構わない。また、図５（ｂ）の仮想ノード単位で負荷を計測する場合、それを集計して（ａ）のノード単位を算出可能であり、（ｃ）のデータ単位で負荷を計測する場合、それを集計して（ｂ）の仮想ノード単位や（ａ）のノード単位の負荷を算出可能である。なお、図５においては、ノード負荷計測部１８ｄにより計測される負荷は、アクセス頻度（アクセス回数）を例に示してある。 Further, the unit of load measurement by the node load measuring unit 18d is measured in any one of (a) node unit by node ID, (b) virtual node ID unit by virtual node ID, and (c) data unit in FIG. It doesn't matter. In addition, when the load is measured in the virtual node unit of FIG. 5B, it is possible to calculate the node unit of (a) by summing up the load, and when measuring the load in the data unit of (c), It is possible to calculate the load in units of virtual nodes in (b) and in units of nodes in (a) by summing up. In FIG. 5, the load measured by the node load measuring unit 18 d shows an access frequency (number of accesses) as an example.

このような図５（ａ）〜（ｃ）の表は、２つのノード１５で構成される分散システム１０におけるものであり、次のような構成となっている。 Such tables in FIGS. 5A to 5C are for the distributed system 10 including two nodes 15 and have the following configuration.

図５（ａ）に示す１つのノード１５（例えばノードＩＤ＝Ｎｏｄｅ１）において、図５（ｂ）に示す２つの仮想ノードＩＤ（Ｎｏｄｅ１_１，Ｎｏｄｅ１_２）による２つの仮想ノードを保持する。更に、図５（ｃ）に示す１つの仮想ノードＩＤ（例えばＮｏｄｅ１_１）による仮想ノード当り２つのデータ（ｄａｔａ１，ｄａｔａ２）を保有する場合を想定してある。他のノードも同様である。 In one node 15 (for example, node ID = Node1) shown in FIG. 5A, two virtual nodes with two virtual node IDs (Node1_1, Node1_2) shown in FIG. 5B are held. Furthermore, it is assumed that two data (data1, data2) per virtual node with one virtual node ID (for example, Node1_1) shown in FIG. The same applies to the other nodes.

この場合に、負荷の計測単位を図５（ｂ）に示すように仮想ノード単位とし、収集する負荷としてのアクセス頻度（回数）を、１０秒周期（１０：１５：００→１０：１５：１０→１０：１５：２０）で収集して、蓄積するケースを想定してある。 In this case, the load measurement unit is a virtual node unit as shown in FIG. 5B, and the access frequency (number of times) as a load to be collected is a 10 second period (10: 15: 00 → 10: 15: 10). → 10:15:20) The case of collecting and accumulating is assumed.

（分散システム負荷リバランス部１８ｅによるリバランスの判定処理について）
次に、上述した分散システム負荷リバランス部１８ｅによるノード１５の負荷の偏り算出及びリバランス必要性判断の処理について説明する。以下で説明する分散システム負荷リバランス部１８ｅにおけるリバランス必要性判断、リバランス設計、実行（振分ＩＤ表配付）の処理は、どのノード１５が行ってもよいが、ここでは、特権ノードが行うことを想定している。 (Regarding rebalancing determination processing by the distributed system load rebalancing unit 18e)
Next, a description will be given of the process of calculating the load deviation of the node 15 and the rebalancing necessity determination by the distributed system load rebalancing unit 18e. Any node 15 may perform the rebalancing necessity determination, rebalancing design, and execution (distribution ID table distribution) processing in the distributed system load rebalancing unit 18e described below. Assumes to do.

リバランス部１８ｅは、所定の周期で各ノード１５から収集したノードの負荷データ（ノード負荷計測データ１９ｄ）に基づき、分散システム１０全体のノード１５の負荷の平均値及び標準偏差、偏差並びに偏差／標準偏差（偏差を標準偏差で除した値）の算出を行う。更に、リバランス部１８ｅは、それらの算出結果を、図６に一例を示すように集計データ１９ｅとして記録し、この記録した集計データ１９ｅに基づき、後述の３つの条件（１）〜（３）の何れか１つを満たす場合、ノード１５間の負荷の偏りを是正するリバランスが必要であると判定する。何れも満たさない場合はリバランスが不要であると判定する。 Based on the node load data (node load measurement data 19d) collected from each node 15 at a predetermined period, the rebalance unit 18e loads the average value, standard deviation, deviation, and deviation / The standard deviation (value obtained by dividing the deviation by the standard deviation) is calculated. Further, the rebalance unit 18e records the calculation results as aggregated data 19e as shown in FIG. 6, and based on the recorded aggregated data 19e, the following three conditions (1) to (3) are recorded. If any one of the above is satisfied, it is determined that rebalancing is necessary to correct the load imbalance among the nodes 15. If neither is satisfied, it is determined that rebalancing is unnecessary.

図６に示す集計データ１９ｅには、収集時の時刻、ノードＩＤ（ノード識別子）、平均値（アクセス頻度）、標準偏差、実測値（アクセス頻度）、偏差（平均値からの差分）、及び偏差／標準偏差が記録される。なお、平均値及び実測値は、アクセス頻度の平均値及び実測値である。 The total data 19e shown in FIG. 6 includes a collection time, a node ID (node identifier), an average value (access frequency), a standard deviation, an actual measurement value (access frequency), a deviation (difference from the average value), and a deviation. / Standard deviation is recorded. The average value and the actual measurement value are the average value and the actual measurement value of the access frequency.

条件（１）、リバランス部１８ｅは、集計データ１９ｅに基づき、いずれかのノード１５の負荷が、当該ノード１５が許容する負荷の上限値（予め定められた上限値）を超えていないか否かをチェックし、上限値を超えるノードが存在する場合に、リバランスが必要であると判定する。 Condition (1), the rebalance unit 18e, based on the total data 19e, whether or not the load on any of the nodes 15 exceeds the upper limit value (predetermined upper limit value) that the node 15 allows. If there is a node exceeding the upper limit value, it is determined that rebalancing is necessary.

条件（２）、リバランス部１８ｅは、集計データ１９ｅに基づき、ノード１５全体の負荷の標準偏差が所定の閾値（第１閾値）以下であるか否かを確認し、閾値を超えている場合に、リバランスが必要であると判定する。 Condition (2), the rebalance unit 18e checks whether or not the standard deviation of the load of the entire node 15 is equal to or less than a predetermined threshold (first threshold) based on the total data 19e, and exceeds the threshold Therefore, it is determined that rebalancing is necessary.

条件（３）、リバランス部１８ｅは、集計データ１９ｅに基づき、ノード１５毎の負荷の偏差／標準偏差が所定の閾値（第２閾値）以下であるか否かを確認し、閾値を超えているノード１５がある場合に、リバランスが必要であると判定する。 Condition (3), the rebalance unit 18e checks whether or not the load deviation / standard deviation for each node 15 is equal to or less than a predetermined threshold (second threshold) based on the aggregated data 19e. When there is a node 15 that is present, it is determined that rebalancing is necessary.

但し、図６に示す集計データ１９ｅにおいては、負荷の計測単位を図５（ａ）に示すノード単位（仮想ノード単位の場合もある）とし、負荷の平均値及び実測値をアクセス頻度とする。更に、平均値や標準偏差を算出する際の時間間隔を２０秒間（例えば、１０：１４：４０〜１０：１５：００）とする。図６の例は、１０：１５：００の時刻における上記算出値等である。また、各ノード１５が許容する負荷の上限値をアクセス頻度の実測値についての上限値として、これを「９０」｛条件（１）｝とし、標準偏差の閾値を「１５」｛条件（２）｝、偏差／標準偏差の閾値（乖離閾値ともいう）を「１．２」｛条件（３）｝とした際の例である。なお、平均値や標準偏差を算出する際の時間間隔は、リバランスの必要性を判定する時間間隔であってもよい。 However, in the aggregated data 19e shown in FIG. 6, the load measurement unit is the node unit shown in FIG. 5A (there may be a virtual node unit), and the load average value and the actual measurement value are the access frequency. Furthermore, the time interval for calculating the average value and the standard deviation is set to 20 seconds (for example, 10:14:40 to 10:15:00). The example of FIG. 6 is the calculated value and the like at the time of 10:15:00. In addition, the upper limit value of the load allowed by each node 15 is set as the upper limit value for the actually measured access frequency, which is “90” {condition (1)}, and the standard deviation threshold is “15” {condition (2). }, This is an example when the deviation / standard deviation threshold (also referred to as the deviation threshold) is “1.2” {condition (3)}. Note that the time interval for calculating the average value and the standard deviation may be a time interval for determining the necessity of rebalancing.

この例では、条件（１）、（２）は満たさない。しかし、図６ではノード識別子の「Ｎｏｄｅ１」、「Ｎｏｄｅ３」、「Ｎｏｄｅ４」の偏差／標準偏差が「１．５」であり、閾値「１．２」を超えており、条件（３）を満たしているため、リバランスが必要であると判定される。ただし、後述するリバランシングキャンセル機能部１８ｆにより、リバランスをキャンセルすると判定された場合には、このタイミングでのリバランスは実行されない。 In this example, the conditions (1) and (2) are not satisfied. However, in FIG. 6, the deviation / standard deviation of the node identifiers “Node 1”, “Node 3”, and “Node 4” is “1.5”, which exceeds the threshold value “1.2” and satisfies the condition (3) Therefore, it is determined that rebalancing is necessary. However, if the rebalancing cancel function unit 18f described later determines that the rebalancing is canceled, the rebalancing at this timing is not executed.

（分散システム負荷リバランス部１８ｅによるリバランスの設計、及び実行処理について）
ここで、リバランシングキャンセル機能部１８ｆによりリバランスがキャンセルされない場合における、リバランス部１８ｅが実行するリバランスの設計、及び実行処理について説明する。 (Rebalancing design and execution processing by the distributed system load rebalancing unit 18e)
Here, the rebalancing design and execution processing executed by the rebalancing unit 18e when the rebalancing is not canceled by the rebalancing cancellation function unit 18f will be described.

リバランスは、負荷の高いノード１５の担当領域（担当のＩＤ空間）中の移譲領域（後述）を、負荷の低いノード１５へ移譲することで負荷の偏りを是正する。この時、負荷の乖離を是正するために、担当領域の必要な移譲領域のサイズを推定の上、その移譲領域のみを移譲する。但し、移譲領域は、担当領域の全てであったり、担当領域の１００％未満の割合の領域であったりする。 The rebalance corrects the load bias by transferring a transfer area (described later) in the assigned area (responsible ID space) of the node 15 having a high load to the node 15 having a low load. At this time, in order to correct the load divergence, the size of the necessary transfer area of the assigned area is estimated, and only the transfer area is transferred. However, the transfer area may be all of the assigned areas or an area with a ratio of less than 100% of the assigned areas.

本実施の形態において、この移譲の方法は、次の（Ｔ１）〜（Ｔ４）のようになる。 In the present embodiment, this transfer method is as follows (T1) to (T4).

（Ｔ１）全てのノード１５の中で最も負荷の高いノード１５の担当領域中の移譲領域を、最も低いノード１５に対して移譲していくものとする。 (T1) It is assumed that the transfer area in the assigned area of the node 15 having the highest load among all the nodes 15 is transferred to the lowest node 15.

（Ｔ２）移譲領域の移譲は次の場合に終了するものとする。即ち、上記の条件（１）〜（３）の何れかを満たす要因となった偏差の全てが存在しなくなった場合（Ｔ２−１）、若しくは、その偏差の一部（予め指定の偏差解消割合を満たす偏差）を解消する移譲領域の移譲が決定した場合（Ｔ２−２）、若しくは、移譲領域の移譲を許容可能な移譲先ノード１５が存在しなくなった場合（Ｔ２−３）に終了するものとする。 (T2) The transfer of the transfer area ends in the following case. That is, when all of the deviations that cause any of the above conditions (1) to (3) no longer exist (T2-1), or a part of the deviation (previously designated deviation cancellation ratio) The process ends when the transfer of the transfer area that resolves (deviation satisfying) is determined (T2-2), or when there is no transfer destination node 15 that can transfer the transfer area (T2-3). And

（Ｔ３）移譲領域の移譲単位は、ノード単位や仮想ノード単位でも構わないし、仮想ノード単位でなく、仮想ノードの担当領域の半分を割譲する単位や、１つのハッシュ値によるデータのみの移譲単位でも構わない。 (T3) The transfer unit of the transfer area may be a node unit or a virtual node unit, not a virtual node unit, but a unit for transferring half of the area in charge of the virtual node, or a transfer unit of only data using one hash value I do not care.

（Ｔ４）リバランス部１８ｅがリバランスを行う際に事前に実行するリバランス設計は、負荷の計測単位を上述したノード単位、仮想ノード単位及びデータ単位の内、どの単位で実行していたかで、可能なリバランス設計の粒度が、次に記載するように変わる。 (T4) The rebalancing design executed in advance when the rebalancing unit 18e performs rebalancing determines in which unit the load measurement unit is executed from among the node unit, the virtual node unit, and the data unit described above. The granularity of possible rebalance designs varies as described below.

即ち、ノード単位の負荷計測の場合、後述のリバランス粒度が粗い場合のみの方式となる。 That is, in the case of load measurement in node units, the method is only used when the rebalance granularity described later is coarse.

仮想ノード単位の負荷計測の場合、後述のリバランス粒度が粗い場合及びリバランス粒度が中間（粗いと細かいとの中間）の場合の方式が可能となる。 In the case of load measurement in units of virtual nodes, it is possible to use a method in which the rebalance granularity described later is coarse and the rebalance granularity is intermediate (intermediate between coarse and fine).

データ単位の負荷計測の場合、後述のリバランス粒度が粗い場合、中間の場合及び細かい場合の３つ全ての方式が採用可能となる。 In the case of load measurement in units of data, all three methods can be employed when the rebalance granularity described later is coarse, intermediate, and fine.

まず、リバランス粒度が粗い場合について説明する。 First, the case where the rebalance particle size is coarse will be described.

ノード１５全体における負荷の総量を、ノード１５全ての仮想ノードＩＤ数で割った仮想ノード当たりの平均負荷量「Ｌｖ＿ａｖｅ」を算出する。次に、ノード１５間において最も負荷の高いノード１５に着目し、このノード１５について解消すべき負荷量の偏差（この偏差の符号は＋であることから、プラス偏差ともいう）「Ｌｔａｒｇｅｔ」を算出する。この偏差は、例えば、当該ノード１５について、図６で示されているプラスの偏差（実測値の平均値との差分）である。 An average load amount “Lv_ave” per virtual node obtained by dividing the total load amount of the entire node 15 by the number of virtual node IDs of all the nodes 15 is calculated. Next, paying attention to the node 15 with the highest load among the nodes 15, calculate the deviation of the load amount to be eliminated for this node 15 (because the sign of this deviation is +, it is also called a plus deviation) “Ltarget” To do. This deviation is, for example, the positive deviation (difference from the average value of the actual measurement values) shown in FIG.

次に、「Ｌｔａｒｇｅｔ」を「Ｌｖ＿ａｖｅ」で割った値を、解消すべき負荷量を解消するために必要な仮想ノードＩＤ数「Ｖｔａｒｇｅｔ＿ｎｕｍ」と考える。 Next, a value obtained by dividing “Ltarget” by “Lv_ave” is considered as the number of virtual node IDs “Vtarget_num” necessary for eliminating the load to be eliminated.

この最も負荷の高いノード１５の仮想ノードの中から無作為に「Ｖｔａｒｇｅｔ＿ｎｕｍ」の仮想ノードＩＤを抽出する。この時、「Ｖｔａｒｇｅｔ＿ｎｕｍ」に小数が含まれる場合は、上記（Ｔ３）にその概要を記載したように、所定の仮想ノードＩＤの仮想ノードの担当領域を例えば小数に基づき割譲してもよい。これは、例えば「１．５」の場合、仮想ノード１つの担当領域の割譲と、仮想ノード２つ目の担当領域を半分にして割譲することである。更に、小数部分を切り捨てや切り上げ、又は四捨五入する等して整数個の仮想ノードＩＤを抽出してもよい。 The virtual node ID of “Vtarget_num” is randomly extracted from the virtual nodes of the node 15 having the highest load. At this time, when “Vtarget_num” includes a decimal number, as described in the outline in (T3) above, the area in charge of the virtual node having a predetermined virtual node ID may be assigned based on the decimal number, for example. For example, in the case of “1.5”, the assigned area of one virtual node is assigned and the assigned area of the second virtual node is divided in half. Further, an integer number of virtual node IDs may be extracted by rounding down, rounding up, or rounding off the decimal part.

上述したように、無作為に抽出された仮想ノードＩＤ「Ｖｔａｒｇｅｔ＿ｎｕｍ」の仮想ノードの担当領域中の移譲領域を移譲する際に、全てのノード１５の中で、最も負荷の低いノード１５から順に移譲していく。この際、移譲によって移譲先のノード１５の負荷が高まりすぎないように、許容可能な担当領域の移譲サイズを求める必要がある。 As described above, when the transfer area in the assigned area of the virtual node with the virtual node ID “Vtarget_num” extracted at random is transferred, the transfer is performed in order from the node 15 having the lowest load among all the nodes 15. I will do it. At this time, it is necessary to determine an allowable transfer size of the assigned area so that the load on the transfer destination node 15 does not increase too much due to the transfer.

具体的には、移譲先のノード１５は、負荷量の偏差（この偏差の符号は−であることから、マイナス偏差ともいう）までは受け入れ許容可能である。このため、負荷量のマイナス偏差を平均負荷量「Ｌｖ＿ａｖｅ」で割った値である負荷量解消に必要な仮想ノードＩＤ数「Ｖｇｅｔ＿ｎｕｍ１」が、許容可能な仮想ノードＩＤ数となる。 Specifically, the node 15 as the transfer destination can accept the load amount deviation (the sign of the deviation is −, so it is also referred to as a minus deviation). For this reason, the number of virtual node IDs “Vget_num1” required to eliminate the load amount, which is a value obtained by dividing the minus deviation of the load amount by the average load amount “Lv_ave”, is the allowable number of virtual node IDs.

ここで、移譲先のノード１５の担当領域が許容量を越える場合は、次に負荷の低いノード１５について、同様の手順で許容可能な仮想ノードＩＤ数「Ｖｇｅｔ＿ｎｕｍ２」を求めていき、Ｖｔａｒｇｅｔ＿ｎｕｍ＜Ｖｇｅｔ＿ｎｕｍ１＋Ｖｇｅｔ＿ｎｕｍ２＋…となって、全ての必要な担当領域中の移譲領域の移譲が完了すれば終了となる。 Here, when the assigned area of the transfer destination node 15 exceeds the allowable amount, an allowable virtual node ID number “Vget_num2” is obtained for the node 15 having the next lowest load in the same procedure, and Vtarget_num <Vget_num1 + Vget_num2 + When the transfer of the transfer area in all the necessary assigned areas is completed, the process ends.

以降同様の処理を、次に負荷の高いノードに対しても実行し、全ての負荷乖離の解消が必要なノード１５において、負荷の乖離を是正する担当領域中の移譲領域の移譲が完了するか、若しくは、移譲領域の移譲が可能なノードが存在しなくなるまで実行する。 Thereafter, the same processing is executed for the next highest load node, and the transfer of the transfer area in the assigned area that corrects the load divergence is completed in all the nodes 15 that need to eliminate the load divergence. Or, it is executed until there is no node that can be transferred to the transfer area.

次に、リバランス粒度が中間の場合について説明する。 Next, a case where the rebalance granularity is intermediate will be described.

基本的に上述したリバランス粒度が粗い場合と同じであるため、粗い場合との差分のみを説明する。 Since it is basically the same as the case where the rebalance granularity described above is coarse, only the difference from the coarse case will be described.

上述したように、移譲元のノード１５の仮想ノードの中から無作為に仮想ノードを抽出するのではなく、解消すべき負荷量のプラス偏差を発生させている仮想ノードを選択的に抽出し、この抽出した仮想ノードの担当領域中の移譲領域を移譲するものとする。この場合、仮想ノード単位で負荷量を計測しているため、計測負荷は粒度が粗い場合よりも高くなるが、負荷の乖離を是正するための移譲領域の移譲を、より正確に行うことが可能となる。 As described above, instead of randomly extracting virtual nodes from the virtual nodes of the transfer source node 15, the virtual nodes that cause a positive deviation of the load amount to be eliminated are selectively extracted. It is assumed that the transfer area in the area in charge of the extracted virtual node is transferred. In this case, since the load amount is measured in units of virtual nodes, the measured load is higher than when the granularity is coarse, but the transfer area can be transferred more accurately to correct the load divergence. It becomes.

次に、リバランス粒度が細かい場合について説明する。 Next, the case where the rebalance granularity is fine will be described.

上述したように、移譲元のノード１５の仮想ノードの中から無作為に仮想ノードを抽出するのではなく、解消すべき負荷量のプラス偏差を発生させているデータのハッシュ値を選択的に抽出し、そのハッシュ値のみを移譲するものとする。この場合、データ単位で負荷量を計測しているため、計測負荷はリバランス粒度が中間の場合よりも高くなるが、負荷の乖離を是正するための移譲領域の移譲を、より正確に行うことが可能となる。また、移譲の単位も最小化することができる。 As described above, instead of randomly extracting virtual nodes from the virtual nodes of the transfer source node 15, a hash value of data that causes a plus deviation of the load to be eliminated is selectively extracted. Only the hash value is transferred. In this case, the load is measured in units of data, so the measured load is higher than when the rebalance granularity is intermediate, but the transfer of the transfer area to correct the load divergence should be performed more accurately. Is possible. Also, the unit of transfer can be minimized.

リバランス部１８ｅは、リバランスが必要であると判定した場合に、負荷の偏りを是正するリバランス設計を、上述した手順で実行し、振分ＩＤ表１９ｂに反映させる。ただし、振分ＩＤ表１９ｂの全ノード１５への送付については、リバランスがキャンセルされない場合に実行する。 When it is determined that rebalancing is necessary, the rebalancing unit 18e executes the rebalancing design for correcting the load bias in the above-described procedure, and reflects the rebalancing design in the distribution ID table 19b. However, the transmission to all the nodes 15 in the distribution ID table 19b is executed when the rebalance is not canceled.

例えば、図７（ａ）に示すように、リバランス前の振分ＩＤ表１９ｂは、仮想ノードＩＤ「Ｎｏｄｅ１−１」に、担当する振分ＩＤ空間の担当領域として「０〜１９９（Ｄ＝２００）」のデータサイズが対応付けられ、「Ｎｏｄｅ２−１」に「２００〜３９９（Ｄ＝２００）」、「Ｎｏｄｅ３−１」に「４００〜５９９（Ｄ＝２００）」、「Ｎｏｄｅ１−２」に「６００〜９９９（Ｄ＝４００）」のデータサイズが対応付けられているとする。 For example, as shown in FIG. 7A, the distribution ID table 19b before rebalancing is assigned to the virtual node ID “Node1-1” as “0-199 (D = 200) ”,“ Node 2-1 ”is“ 200 to 399 (D = 200) ”,“ Node 3-1 ”is“ 400 to 599 (D = 200) ”, and“ Node 1-2 ”. Are associated with a data size of “600 to 999 (D = 400)”.

ここで、例えば図７（ａ）に示す「Ｎｏｄｅ１−２」の仮想ノードの担当領域「６００〜９９９（Ｄ＝４００）」を全て、他の仮想ノードＩＤ「Ｎｏｄｅ３＿２」の仮想ノードへ移譲するものとする。この場合、図７（ｂ）に示すように、仮想ノードＩＤ「Ｎｏｄｅ３＿２」の仮想ノードの担当領域が「６００〜９９９（Ｄ＝４００）」のサイズとなる。 Here, for example, all the assigned areas “600 to 999 (D = 400)” of the virtual node “Node 1-2” illustrated in FIG. 7A are transferred to the virtual node having the other virtual node ID “Node3_2”. And In this case, as shown in FIG. 7B, the area in charge of the virtual node with the virtual node ID “Node3_2” has a size of “600 to 999 (D = 400)”.

また、図７（ａ）に示す「Ｎｏｄｅ１−２」の仮想ノードの担当領域「６００〜９９９（Ｄ＝４００）」の半分を、他の仮想ノードＩＤ「Ｎｏｄｅ３＿２」の仮想ノードへ移譲するものとする。この場合、図７（ｃ）に示すように、仮想ノードＩＤ「Ｎｏｄｅ１＿２」の仮想ノードの担当領域が「６００〜７９９（Ｄ＝２００）」のサイズとなり、仮想ノードＩＤ「Ｎｏｄｅ３＿２」の仮想ノードの担当領域が「８００〜９９９（Ｄ＝２００）」のサイズとなる。 Also, half of the assigned area “600 to 999 (D = 400)” of the virtual node “Node 1-2” illustrated in FIG. 7A is transferred to the virtual node having the other virtual node ID “Node3_2”. To do. In this case, as shown in FIG. 7C, the virtual node with the virtual node ID “Node1_2” has a size of “600 to 799 (D = 200)”, and the virtual node with the virtual node ID “Node3_2” The area in charge is “800 to 999 (D = 200)”.

＜リバランス設計の他の例＞
リバランス部１８ｅが以下で説明する処理によりリバランス設計を行うようにしてもよい。 <Other examples of rebalancing design>
The rebalance unit 18e may perform rebalance design by the process described below.

ここでは、分散システム１０の各ノード１５のリソースの総量（負荷の総量）が、使用リソース量（使用負荷量）に対して十分であるにも関わらず、使用リソース量に偏りが生じているとする。この際に、リバランス部１８ｅが、上述した処理のように担当領域中の移譲領域を移譲する対象ノードや、該当ノード１５の適切な移譲サイズを指定することなく、各ノード１５が持つ仮想ノード数を、ノード１５毎の現状の負荷の状況に応じて、ノード１５毎に必要な負荷量とする仮想ノード数に再設定するリバランスを行うようにする。 Here, although the total amount of resources (total load) of each node 15 of the distributed system 10 is sufficient with respect to the used resource amount (used load amount), the used resource amount is biased. To do. At this time, the rebalancing unit 18e does not specify the target node to which the transfer area in the assigned area is transferred as in the above-described process, or the virtual node of each node 15 without specifying the appropriate transfer size of the corresponding node 15. The number is rebalanced according to the current load status of each node 15 to the number of virtual nodes that is required for each node 15.

この際、リバランス部１８ｅは、各ノード１５の仮想ノード数を、負荷の状況に合わせて下式（１）により算出し、この算出された各ノード１５の仮想ノード数に基づきリバランスする。 At this time, the rebalancing unit 18e calculates the number of virtual nodes of each node 15 according to the following equation (1) according to the load condition, and rebalances based on the calculated number of virtual nodes of each node 15.

このリバランスにおいては、算出された仮想ノード数に基づき、各仮想ノードの振分ＩＤ空間の先頭から仮想ノードＩＤと振分ＩＤ空間の再マッピングを行う。再マッピングは、担当領域の総延長（総サイズ）を算出した仮想ノード数で除し、１仮想ノード当たりの振分ＩＤ空間サイズを求め、振分ＩＤ空間の先頭から新たな振分ＩＤ空間サイズ毎に、仮想ノードＩＤ毎の仮想ノード数を再設定していく。 In this rebalancing, the virtual node ID and the distribution ID space are remapped from the top of the distribution ID space of each virtual node based on the calculated number of virtual nodes. In the remapping, the total extension (total size) of the assigned area is divided by the calculated number of virtual nodes to obtain a distribution ID space size per virtual node, and a new distribution ID space size from the top of the distribution ID space. Each time, the number of virtual nodes for each virtual node ID is reset.

リバランス後の仮想ノード数＝現状の仮想ノード数×(全ノードの負荷の平均値／該当ノードの負荷の実測値) …（式１）
但し、式（１）中の「該当ノードの負荷の実測値」は、現状の仮想ノードを有するノードの負荷の実測値である。また、式（１）はリバランス部１８ｅの図示せぬ記憶部に保持されるものとする。 Number of virtual nodes after rebalancing = current number of virtual nodes × (average value of loads of all nodes / actual value of loads of relevant nodes) (Equation 1)
However, the “actually measured load value of the corresponding node” in the equation (1) is an actually measured value of the load of the node having the current virtual node. In addition, Expression (1) is held in a storage unit (not shown) of the rebalance unit 18e.

以下、ここでのリバランス処理について具体的に説明する。 Hereinafter, the rebalancing process here will be specifically described.

リバランス部１８ｅは、まず、各ノード１５が持つ仮想ノード数を再設定する。この再設定の処理を図８（ａ）及び（ｂ）を参照して説明する。但し、図８（ａ）及び（ｂ）に示す仮想ノードＩＤ「Ｎｏｄｅ１−１」，「Ｎｏｄｅ１−２」は、ノードＩＤ「Ｎｏｄｅ１」のノード１に従属する仮想ノード１−１，１−２に対応するものとする。他の仮想ノードＩＤにおいても同様であり、例えば、仮想ノードＩＤ「Ｎｏｄｅ５−１」，「Ｎｏｄｅ５−２」，「Ｎｏｄｅ５−３」は、ノードＩＤ「Ｎｏｄｅ５」のノード５に従属する仮想ノード５−１，５−２，５−３に対応するものとする。 First, the rebalancing unit 18e resets the number of virtual nodes that each node 15 has. The resetting process will be described with reference to FIGS. 8 (a) and 8 (b). However, the virtual node IDs “Node 1-1” and “Node 1-2” illustrated in FIGS. 8A and 8B are assigned to the virtual nodes 1-1 and 1-2 subordinate to the node 1 having the node ID “Node 1”. It shall correspond. The same applies to other virtual node IDs. For example, the virtual node IDs “Node5-1”, “Node5-2”, and “Node5-3” are virtual nodes 5 to 5 subordinate to the node 5 of the node ID “Node5”. It shall correspond to 1,5-2,5-3.

図８（ａ）に示す振分ＩＤ表１９ｂには、仮想ノードＩＤ「Ｎｏｄｅ１−１」に、担当する振分ＩＤ空間の担当領域として「０〜１９９（Ｄ＝２００）」のデータサイズが対応付けられ、「Ｎｏｄｅ１−２」に、「２００〜３９９（Ｄ＝２００）」のデータサイズが対応付けられている。他の仮想ノードＩＤにおいても図示する通りである。 In the distribution ID table 19b shown in FIG. 8A, the virtual node ID “Node1-1” corresponds to the data size “0 to 199 (D = 200)” as the assigned area of the assigned ID space. A data size of “200 to 399 (D = 200)” is associated with “Node 1-2”. The same applies to other virtual node IDs.

更に、各ノード１〜５の仮想ノード数は、ノード１の仮想ノード数が２個、ノード２の仮想ノード数が１個、ノード３の仮想ノード数が１個、ノード４の仮想ノード数が２個、ノード５の仮想ノード数が２個である。 Further, the number of virtual nodes of each of the nodes 1 to 5 is that the number of virtual nodes of node 1 is 2, the number of virtual nodes of node 2 is 1, the number of virtual nodes of node 3 is 1, and the number of virtual nodes of node 4 is Two and the number of virtual nodes of the node 5 is two.

このような条件において、リバランス部１８ｅは、ノード１〜５が持つ仮想ノード数を現状の負荷の状況に応じて、必要な負荷量に再設定する。以降、この再設定の処理について説明する。 Under such conditions, the rebalance unit 18e resets the number of virtual nodes held by the nodes 1 to 5 to a necessary load amount according to the current load state. Hereinafter, the resetting process will be described.

まず、リバランス部１８ｅは、各ノード１〜５の仮想ノード数を変更する。例えば、各ノード１〜５の現状の仮想ノード数は、図８（ａ）に示すように、ノード１が２個、ノード２が１個、ノード３が１個、ノード４が２個、ノード５が２個の合計８個である。これを、各ノード１〜５の負荷の現状に応じて、図８（ｂ）に示すように、ノード１が１個、ノード２が２個、ノード３が２個、ノード４が２個、ノード５が３個の合計１０個に変更する。 First, the rebalance unit 18e changes the number of virtual nodes of each of the nodes 1-5. For example, as shown in FIG. 8A, the current number of virtual nodes in each of the nodes 1 to 5 is two for node 1, one for node 2, one for node 3, two for node 4, 5 is a total of 8 pieces. As shown in FIG. 8 (b), this corresponds to one node 1, two nodes 2, two nodes 3, two nodes 4, according to the current state of loads of the nodes 1 to 5. Node 5 changes to a total of 10 nodes.

この仮想ノード数の変更を行う場合に上記式（１）を用いる。仮想ノード数の変更は、例えば図８（ａ）に示す各ノード１〜５の個数「２個、１個、１個、２個、２個」＝８個を、図８（ｂ）に示す各ノード１〜５の個数「１個、２個、２個、２個、３個」＝１０個に変更することである。 The above formula (1) is used when changing the number of virtual nodes. For example, the number of virtual nodes is changed as shown in FIG. 8B in which the number of nodes 1 to 5 shown in FIG. 8A is “2, 1, 1, 2, 2” = 8. The number of nodes 1 to 5 is changed to “1, 2, 2, 2, 3” = 10.

図８（ａ）の現状では、全ノード１〜５のハッシュ空間サイズ（担当領域のサイズ）は「０〜１５９９」の１６００であり、仮想ノード数は８個なので、仮想ノード当たりの担当領域のサイズは、１６００÷８＝２００である。このＤ＝２００の担当領域のサイズの内、該当ノード１の負荷の実測値は、例えば「８０」や「１５０」のようになる。このような全ノード１〜５の実測値から、全ノード１〜５の負荷の平均値が求められるので、その平均値及び実測値を上記式（１）に代入する。 In the current state of FIG. 8A, the hash space size (size of the assigned area) of all the nodes 1 to 5 is 1600 of “0 to 1599” and the number of virtual nodes is 8, so the assigned area per virtual node is The size is 1600 ÷ 8 = 200. Of the size of the assigned area of D = 200, the actual measured value of the load of the corresponding node 1 is, for example, “80” or “150”. Since the average value of the loads of all the nodes 1 to 5 is obtained from the actually measured values of all the nodes 1 to 5, the average value and the actually measured value are substituted into the above formula (1).

例えば、仮想ノードＩＤ＝「Ｎｏｄｅ１−１」の振分ＩＤ空間の担当領域（サイズＤ＝２００）では負荷の実測値が「１４０」、「Ｎｏｄｅ１−２」では負荷の実測値が「１６０」であるとすると、ノード１の負荷の実測値は「３００」である。この際、全ノード１〜５の負荷の平均値が「１５０」とする。この場合、ノード１のリバランス後の仮想ノード数は、２×（１５０／３００）＝１となる。同様に、他のノード２〜５においてもリバランス後の仮想ノード数を求め、各ノード１〜５の仮想ノード数を、その求められた仮想ノード数に変更する。但し、式（１）に当て嵌めた計算結果が、１．６等の小数点を伴う場合、切り上げ、切り捨て、四捨五入とすることを予め決めておく。 For example, in the assigned area (size D = 200) of the distribution ID space of virtual node ID = “Node 1-1”, the actual load value is “140”, and in “Node 1-2”, the actual load value is “160”. If there is, the measured value of the load of the node 1 is “300”. At this time, the average value of the loads of all the nodes 1 to 5 is “150”. In this case, the number of virtual nodes after the rebalancing of node 1 is 2 × (150/300) = 1. Similarly, the number of virtual nodes after rebalancing is obtained in the other nodes 2 to 5, and the number of virtual nodes of each node 1 to 5 is changed to the obtained number of virtual nodes. However, when the calculation result fitted to the formula (1) includes a decimal point such as 1.6, it is determined in advance that rounding up, rounding down, and rounding off are performed.

次に、リバランス部１８ｅは、仮想ノード当たりのハッシュ空間サイズを変更する。図８（ａ）に示す現状では、上述したように、仮想ノード当たりのハッシュ空間サイズＤは、１６００÷８＝２００である。 Next, the rebalance unit 18e changes the hash space size per virtual node. In the current state shown in FIG. 8A, as described above, the hash space size D per virtual node is 1600/8 = 200.

これを、上述した変更後の仮想ノード数＝１０個を用いると、仮想ノード当たりのハッシュ空間サイズは、１６００÷１０＝１６０となる。このハッシュ空間サイズを用いて、図８（ｂ）に示すように、１個当たりの仮想ノードのハッシュ空間サイズＤを「１６０」とする。 If the number of virtual nodes after change = 10 is used, the hash space size per virtual node is 1600/10 = 160. Using this hash space size, as shown in FIG. 8B, the hash space size D of each virtual node is set to “160”.

次に、リバランス部１８ｅは、その変更後のハッシュ空間サイズＤ＝「１６０」の仮想ノードを、前述で変更した後の各ノード１〜５の仮想ノード数だけ割り振って行く。即ち、ノード１では変更後の仮想ノードが１個なので、図８（ｂ）に示すように、ノード１において、変更後のハッシュ空間サイズＤ＝「１６０」の仮想ノード｛仮想ノードＩＤ「Ｎｏｄｅ１−１」｝が１個割り振られる。 Next, the rebalance unit 18e allocates the virtual nodes having the changed hash space size D = “160” by the number of virtual nodes of the respective nodes 1 to 5 after the above change. That is, since there is one changed virtual node in the node 1, as shown in FIG. 8B, in the node 1, the changed virtual node {has the virtual node ID “Node1-” having the hash space size D = “160”. 1 "} is allocated.

同様に、ノード２では変更後の仮想ノードが２個なので、サイズＤ＝「１６０」の仮想ノード｛仮想ノードＩＤ「Ｎｏｄｅ２−１，Ｎｏｄｅ２−２」｝が２個割り振られる。以降、同様に図示するように、ノード３〜５まで変更後の仮想ノード２個〜３個が割り振られる。 Similarly, since there are two virtual nodes after the change in node 2, two virtual nodes {virtual node IDs “Node2-1, Node2-2”} of size D = “160” are allocated. Thereafter, as shown in the figure, two to three virtual nodes after the change are allocated to the nodes 3 to 5.

上記のようなリバランス設計の結果、図８（ｂ）に示すようなリバランス後の振分ＩＤ表が得られ、キャンセルがない場合に、これが各ノード１５に配付される。各ノード１５は、当該振分ＩＤ表に従って、振り分け処理を実行する。 As a result of the rebalance design as described above, a post-rebalance distribution ID table as shown in FIG. 8B is obtained, and this is distributed to each node 15 when there is no cancellation. Each node 15 executes a distribution process according to the distribution ID table.

（リバランシングキャンセル機能部１８ｆの処理について）
以下、リバランシングキャンセル機能部１８ｆの処理について説明する。リバランシングキャンセル機能部１８ｆの処理についてもどのノード１５で行ってもよいが、ここでは、前述したように、リバランス設計、及びリバランス後の振分ＩＤ表の配付を行う特権ノードが行うことを想定している。 (Regarding the processing of the rebalancing cancel function unit 18f)
Hereinafter, the processing of the rebalancing cancel function unit 18f will be described. The processing of the rebalancing cancellation function unit 18f may be performed by any node 15, but here, as described above, it is performed by a privileged node that performs rebalancing design and distribution of the distribution ID table after rebalancing. Is assumed.

リバランシングキャンセル機能部１８ｆは、リバランス部１８ｅにより、リバランスが必要であると判定された場合に、まず、ノード負荷計測部１８ｄにより得られた全ノード１５の負荷データに基づき、ノード毎負荷差分表１９ｇを作成し、記憶部１９に記憶する。 When the rebalancing cancel function unit 18f determines that rebalancing is necessary by the rebalancing unit 18e, the rebalancing cancellation function unit 18f first loads each node based on the load data of all the nodes 15 obtained by the node load measuring unit 18d. A difference table 19g is created and stored in the storage unit 19.

図９の左側に、ノード毎負荷差分表１９ｇの例を示す。図９の例に示すように、ノード毎負荷差分表１９ｇは、ノード１５毎の実測負荷と、平均値（全ノードの合計負荷÷ノード数、ここでは７０）からの差分（偏差）からなる。なお、本例では、ノード単位の計算例を示しているが、ここでのノードは、「仮想ノード」であってもよい。 On the left side of FIG. 9, an example of a node-by-node load difference table 19g is shown. As shown in the example of FIG. 9, the node-by-node load difference table 19g includes an actually measured load for each node 15 and a difference (deviation) from an average value (total load of all nodes / number of nodes, here 70). In this example, a calculation example in units of nodes is shown, but the node here may be a “virtual node”.

そして、ノード負荷計測部１８ｄは、ノード毎負荷差分表１９ｇを作成した時刻（＝リバランスが必要であると判定された時刻であり、これをｔとおく）における各ノードの負荷データ（差分のみでもよい）を、前回測定データ１９ｈとして、記憶部１９に格納する。前回測定データ１９ｈが既に格納されている場合、既に格納されている「前回測定データ１９ｈ」は削除され、今回の「前回測定データ１９ｈ」が格納される。 Then, the node load measuring unit 18d loads the load data (only the difference only) of each node at the time when the node-by-node load difference table 19g is created (= the time when it is determined that rebalancing is necessary, and this is t). May be stored in the storage unit 19 as the previous measurement data 19h. When the previous measurement data 19h is already stored, the previously stored “previous measurement data 19h” is deleted, and the current “previous measurement data 19h” is stored.

次に、リバランシングキャンセル機能部１８ｆは、ｔからリバランス設計にかかる時間（Δｔ）の後（つまり、時刻ｔ＋Δｔの時点）に、現時点のノード負荷計測データ１９ｄ（各ノード１５の負荷の実測データ）を取得し、当該現時点のノード負荷計測データ１９ｄと、前回測定データ１９ｈとから、ノード毎予測負荷比較表１９ｇ（キャンセル判定に係る表という意味でノード毎負荷差分表と同じ１９ｇを付している）を作成する。なお、Δｔは、予め定めた時間であってもよいし、リバランス部１８ｅが計算に係るノード数等からΔｔを推定し、リバランス部１８ｅがリバランシングキャンセル機能部１８ｆに対してΔｔを通知することとしてもよい。なお、ノード負荷計測データ１９ｄの更新時間間隔は、Δｔよりも短い。 Next, the rebalancing cancel function unit 18f performs the current node load measurement data 19d (actual measurement data of the load of each node 15) after the time (Δt) required for the rebalance design from t (that is, at the time t + Δt). ) From the current node load measurement data 19d and the previous measurement data 19h, the node-specific predicted load comparison table 19g (19g same as the node load difference table in the meaning of the table related to the cancellation determination) is attached. Create). Δt may be a predetermined time, or the rebalancing unit 18e estimates Δt from the number of nodes involved in the calculation, and the rebalancing unit 18e notifies Δt to the rebalancing cancellation function unit 18f. It is good to do. Note that the update time interval of the node load measurement data 19d is shorter than Δt.

図９の右側に、ノード毎予測負荷比較表１９ｇの例を示す。当該ノード毎予測負荷比較表１９ｇに示されるように、ノード毎予測負荷比較表１９ｇは、現時点（ｔ+Δｔ）でのノード毎の負荷と、ｔの時刻での差分（"前差"と呼ぶ）と、予測負荷とを有する。予測負荷は、リバランスを実行したと仮定した場合における、各ノードの負荷である。リバランスにより、差分が解消されることが想定されるから、ここでの予測負荷は、「現時点での負荷‐前差」で求めている。例えば、図９の例で、ノードＡにおける現時点（ｔ＋Δｔ）での負荷は１２５であり、前差が＋６８であるから、予測負荷は１２５−６８＝５７となっている。 An example of the predicted load comparison table 19g for each node is shown on the right side of FIG. As shown in the per-node predicted load comparison table 19g, the per-node predicted load comparison table 19g is a load per node at the current time (t + Δt) and a difference at time t (referred to as “previous difference”). ) And a predicted load. The predicted load is the load on each node when it is assumed that rebalancing has been executed. Since it is assumed that the difference is eliminated by the rebalancing, the predicted load here is obtained by “current load-previous difference”. For example, in the example of FIG. 9, the load at the current time (t + Δt) in the node A is 125 and the front difference is +68, so the predicted load is 125−68 = 57.

そして、リバランシングキャンセル機能部１８ｆは、以下の判定基準（１）、（２）のうちのいずれかが満たされるか否かを判定し、いずれかが満たされる場合に、ｔのタイミングで必要であると判定されたリバランスをキャンセルする。いずれも満たされない場合は、リバランスを実行することを決定する。 Then, the rebalancing cancellation function unit 18f determines whether or not any of the following criteria (1) and (2) is satisfied, and is required at the timing t when either is satisfied. Cancel the rebalance determined to be present. If neither is satisfied, it is decided to perform rebalancing.

判定基準（１）：ノード数に増減がある。 Determination criterion (1): There is an increase or decrease in the number of nodes.

判定基準（２）：負荷予測値が、許容範囲外になるノードが存在する。 Criteria (2): There is a node whose predicted load value is outside the allowable range.

上記判定基準（１）のノード数の増減については、例えば、リバランシングキャンセル機能部１８ｆが、リバランス部１８ｅからリバランス設計結果（振分ＩＤ表）を取得することで判定できる。ここで、ノード数が増加する場合とは、例えば、負荷の増大が大きく、現状のノード数では不足し、ノードを追加することが必要になる場合等である。また、ノード数が減少する場合とは、例えば、負荷の減少が大きく、現状のノード数では大きすぎ、非効率であるため、ノードを削減する場合等である。 The increase / decrease in the number of nodes in the determination criterion (1) can be determined by, for example, the rebalancing cancellation function unit 18f acquiring the rebalance design result (distribution ID table) from the rebalancing unit 18e. Here, the case where the number of nodes increases is, for example, a case where the increase in load is large, the current number of nodes is insufficient, and it is necessary to add nodes. Also, the case where the number of nodes decreases is, for example, a case where the number of nodes is reduced because the decrease in load is large, the current number of nodes is too large and inefficient.

また、判定基準（２）について、許容範囲は予め定めておく値である。図９の例では、許容範囲を平均値から±２０％としている。そして、図９の例では、ノードＧが許容範囲外となるため、リバランスをキャンセルすると判定される。 In addition, regarding the criterion (2), the allowable range is a predetermined value. In the example of FIG. 9, the allowable range is ± 20% from the average value. In the example of FIG. 9, since the node G is out of the allowable range, it is determined to cancel the rebalance.

一方、図１０に示す例では、全ノードについて、許容範囲内であるため、リバランスを実行すると判定される。 On the other hand, in the example shown in FIG. 10, since all nodes are within the allowable range, it is determined that rebalancing is to be executed.

また、判定基準（２）に関して、許容範囲外となるノードの個数が予め定めた閾値を超えた場合に、判定基準（２）を満たす、こととしてもよい。一例として、判定基準（２）を「許容範囲外のノードの個数＞全ノード数の20%」とし、これを満たした場合にキャンセルする。この場合、図９、１０の例では、予測値が３個以上許容範囲外の場合にキャンセルする。 Further, regarding the criterion (2), the criterion (2) may be satisfied when the number of nodes outside the allowable range exceeds a predetermined threshold. As an example, the criterion (2) is “the number of nodes outside the allowable range> 20% of the total number of nodes”, and if this is satisfied, the determination is canceled. In this case, in the examples of FIGS. 9 and 10, when three or more predicted values are outside the allowable range, the cancellation is performed.

ここで、リバランスをキャンセルするとは、例えば、リバランシングキャンセル機能部１８ｆがリバランス部１８ｅに対し、リバランス設計で得られた新たな振分ＩＤ表を、各ノード１５に配付せずに破棄することを指示することである。また、リバランスを実行することを決定した場合、リバランシングキャンセル機能部１８ｆは、リバランス部１８ｅに対し、リバランス設計で得られた新たな振分ＩＤ表を、各ノード１５に配付することを指示する。 Here, canceling the rebalance means, for example, that the rebalancing cancellation function unit 18f discards the new distribution ID table obtained by the rebalancing design to the rebalance unit 18e without distributing it to each node 15. It is to instruct to do. Further, when it is determined to execute the rebalance, the rebalancing cancel function unit 18f distributes the new distribution ID table obtained by the rebalance design to each node 15 to the rebalance unit 18e. Instruct.

図１１を参照して、リバランシングキャンセル機能部１８ｆの処理の効果を説明する。図１１は、例えば、あるノードについての負荷の推移を示したものである。時刻ｔにおいてリバランスが必要であると判定される。その後、当該ノードの負荷が低下し、ｔ＋Δｔの時点では、許容範囲（リバランスを行う必要のない範囲）に入っている。しかし、従来技術では、このような負荷の時間変化を考慮しないため、リバランスを行ってしまい、負荷が許容範囲外となり、再度リバランスが実行される。 With reference to FIG. 11, the effect of the processing of the rebalancing cancel function unit 18f will be described. FIG. 11 shows, for example, the transition of load for a certain node. It is determined that rebalancing is necessary at time t. After that, the load on the node decreases, and at the time of t + Δt, it is within an allowable range (a range in which rebalancing is not required). However, the prior art does not consider such a change in load over time, so rebalancing is performed, the load falls outside the allowable range, and rebalancing is executed again.

一方、リバランシングキャンセル機能部１８ｆの判定機能により、ｔ＋Δｔの時点でリバランスをキャンセルすると判定するので、無駄なリバランスを実行することなく、許容範囲の負荷を保つことができる。 On the other hand, the determination function of the rebalancing cancellation function unit 18f determines that rebalancing is canceled at the time point t + Δt, and thus it is possible to maintain an allowable load without executing unnecessary rebalancing.

（動作例）
次に、本実施の形態に係る分散システム１０において、ノード１５間の負荷の偏りを是正するリバランスの必要性の判定、及び、キャンセル判定等を実行する際の動作例を、図１２〜図１４のフローチャートを参照して説明する。本動作例は、例えば図３に示すハッシュ空間を前提とする。また、以下の動作例は、最初に説明したリバランス設計方法に対応する。 (Operation example)
Next, in the distributed system 10 according to the present embodiment, examples of operations when performing determination of the necessity of rebalancing to correct the load imbalance between the nodes 15, cancellation determination, and the like are illustrated in FIGS. This will be described with reference to the flowchart of FIG. This operation example is based on the hash space shown in FIG. 3, for example. The following operation example corresponds to the rebalance design method described first.

まず、図１２に示すステップＳ１において、所定のノード（例：図３のノードＢ）の分散システム負荷リバランス部１８ｅは、所定の周期で各ノードＡ〜Ｅから収集したノード負荷計測データ１９ｄに基づき、各ノードＡ〜Ｅの負荷の平均値及び標準偏差、偏差並びに偏差／標準偏差の算出を行う。 First, in step S1 shown in FIG. 12, the distributed system load rebalancing unit 18e of a predetermined node (example: node B of FIG. 3) uses the node load measurement data 19d collected from each node A to E at a predetermined cycle. Based on the load average values, standard deviation, deviation, and deviation / standard deviation of each node A to E are calculated.

次に、ステップＳ２において、リバランス部１８ｅは、上記ステップＳ１での算出結果を集計データ１９ｅ（例：図６）として記憶部１９に記録する。 Next, in step S2, the rebalance unit 18e records the calculation result in step S1 in the storage unit 19 as the aggregate data 19e (example: FIG. 6).

ステップＳ３において、リバランス部１８ｅは、上記ステップＳ２で記録したデータ１９ｅに基づき、上述した３つの条件（１）〜（３）の何れか１つを満たすか否かを判定する。この結果、満たさなければ（Ｎｏ）、リバランスの処理を終了する。 In step S3, the rebalance unit 18e determines whether any one of the three conditions (1) to (3) described above is satisfied based on the data 19e recorded in step S2. As a result, if not satisfied (No), the rebalancing process is terminated.

一方、その判定の結果、何れか１つを満たす場合、ステップＳ３において、リバランシングキャンセル機能部１８ｆが、リバランシングキャンセル判定を実施する。 On the other hand, if any one of them is satisfied as a result of the determination, the rebalancing cancel function unit 18f performs the rebalancing cancel determination in step S3.

リバランシングキャンセル判定については図１４を参照する。リバランシングキャンセル機能部１８ｆは、リバランスが必要であると判定された時点（ｔ）におけるノード毎負荷差分表１９ｇを作成し、その後、ｔ＋Δｔの時点で、当該ノード毎負荷差分表１９ｇと、その時点のノード負荷計測データ１９ｄとに基づいて、ノード毎予測負荷比較表１９ｇを作成する（ステップＳ４１）。そして、ステップＳ４２において、リバランシングキャンセル機能部１８ｆは、前述した判定基準（１）、判定基準（２）のうちのいずれかを満たすかどうかを判定し、満たす場合はリバランスをキャンセルし（ステップＳ４３）、満たさない場合はキャンセルしない（ステップＳ４４）。 Refer to FIG. 14 for rebalancing cancellation determination. The rebalancing cancellation function unit 18f creates a node-by-node load difference table 19g at the time (t) when it is determined that rebalancing is necessary, and then, at time t + Δt, the node-by-node load difference table 19g and its Based on the node load measurement data 19d at the time, the node-by-node predicted load comparison table 19g is created (step S41). In step S42, the rebalancing cancel function unit 18f determines whether any of the above-described determination criterion (1) and determination criterion (2) is satisfied, and cancels the rebalance if satisfied (step S42). S43) If not satisfied, no cancellation is made (step S44).

図１２に戻り、リバランスをキャンセルする場合は、ステップＳ１に戻り、次のタイミングで、上記と同様に、リバランス実施の必要性を判定する。 Returning to FIG. 12, when canceling the rebalance, the process returns to step S <b> 1, and the necessity for rebalancing is determined at the next timing in the same manner as described above.

リバランスをキャンセルしない場合、ステップＳ５において、各ノードＡ〜Ｅの負荷量を検知し、例えば、高負荷ノードＡの担当領域中の移譲領域を他ノードＢ，Ｃに移譲することで負荷の偏りを是正する、といった判断を行う。 When the rebalance is not canceled, the load amount of each node A to E is detected in step S5. For example, by transferring the transfer area in the assigned area of the high load node A to the other nodes B and C, the load bias To make corrections.

ステップＳ６において、リバランス部１８ｅは、移譲元ノードの担当領域中の移譲領域のサイズ（＝負荷量）を求め、ステップＳ７において、移譲先ノードの担当領域の許容可能なサイズを求める。 In step S6, the rebalancing unit 18e calculates the size (= load amount) of the transfer area in the transfer area of the transfer source node, and in step S7, determines the allowable size of the transfer area of the transfer destination node.

次に、図１３に示すステップＳ８において、リバランス部１８ｅは、移譲元ノードの移譲対象の担当領域を移譲可能な移譲先ノードが有るか否かを判定する。この判定は、移譲先ノードの担当領域の許容可能なサイズが、移譲元ノードの担当領域中の移譲領域を移譲可能であるか否かを検知して行う。この結果、移譲可能な移譲先ノーが無ければ（Ｎｏ）、リバランスの処理を終了する。 Next, in step S8 shown in FIG. 13, the rebalancing unit 18e determines whether or not there is a transfer destination node that can transfer the assigned area of the transfer source node. This determination is made by detecting whether or not the allowable size of the area in charge of the transfer destination node can transfer the transfer area in the area in charge of the transfer source node. As a result, if there is no transferable transfer destination no (No), the rebalancing process is terminated.

一方、移譲可能な移譲先ノードが有れば（Ｙｅｓ）、ステップＳ９において、リバランス部１８ｅは、移譲元ノードの担当領域中の移譲領域を、移譲先ノードへ移譲する。 On the other hand, if there is a transfer destination node that can be transferred (Yes), in step S9, the rebalance unit 18e transfers the transfer area in the assigned area of the transfer source node to the transfer destination node.

この後、図１３に示すステップＳ１０において、リバランス部１８ｅは、移譲元ノードの担当領域中の移譲領域の残りが有るか否かを判定する。この結果、残りが無ければ（Ｎｏ）、言い換えれば、移譲対象の担当領域が全て移譲完遂されていれば、リバランスの処理を終了する。 Thereafter, in step S10 shown in FIG. 13, the rebalancing unit 18e determines whether or not there is a remaining transfer area in the assigned area of the transfer source node. As a result, if there is no remaining (No), in other words, if all of the transfer target areas have been transferred, the rebalancing process ends.

一方、上記ステップＳ１０の判定で残りが有れば（Ｙｅｓ）、ステップＳ１１において、リバランス部１８ｅは、移譲元ノードの残りの移譲領域が移譲可能な、移譲先ノードが有るか否かを判定する。この結果、移譲先ノードが無ければ（Ｎｏ）、リバランスの処理を終了する。 On the other hand, if there is a remaining in the determination in step S10 (Yes), in step S11, the rebalance unit 18e determines whether there is a transfer destination node to which the remaining transfer area of the transfer source node can be transferred. To do. As a result, if there is no transfer destination node (No), the rebalancing process is terminated.

上記ステップＳ１１の判定の結果、残りの移譲領域が移譲可能な移譲先ノードが有れば（Ｙｅｓ）、ステップＳ１２において、リバランス部１８ｅは、移譲元ノードの残りの移譲領域を、上記ステップＳ１０で存在が認められた移譲先ノードへ移譲する。 As a result of the determination in step S11, if there is a transfer destination node to which the remaining transfer area can be transferred (Yes), in step S12, the rebalancing unit 18e determines the remaining transfer area of the transfer source node in step S10. It is transferred to the transfer destination node whose existence is recognized in.

この後、上記ステップＳ１０に戻って、リバランス部１８ｅは、処理が終了するまでステップＳ１０〜Ｓ１２を繰り返す。また、前述したように、ステップＳ６以降の処理は、移譲元ノード毎に繰り返される。 Thereafter, returning to step S10, the rebalance unit 18e repeats steps S10 to S12 until the processing is completed. Further, as described above, the processing after step S6 is repeated for each transfer source node.

上記の処理により、リバランス後の振分ＩＤ表１９ｂが作成され、各ノードに配付されることで、リバランシング後の振り分け処理が実行され、負荷の偏りが解消される。 By the above processing, the distribution ID table 19b after rebalancing is created and distributed to each node, whereby the distribution processing after rebalancing is executed, and the load bias is eliminated.

（実施の形態のまとめ）
以上、説明したように、本実施の形態により、通信サービスを利用する複数のクライアントマシンからの情報がネットワークを介して振り分けられる複数のノードを有する分散システムにおいて用いられるリバランス装置であって、前記複数のノードの負荷量に基づいて、当該複数のノード間の負荷量の偏りを抑制するリバランスが必要であるか否かを判定するリバランス処理手段と、前記リバランス処理手段により、リバランスが必要であると判定された場合において、前記リバランス後の前記複数のノードの予測負荷状態に基づいて、前記リバランスをキャンセルするか否かを判定するキャンセル処理手段とを備えるリバランス装置が提供される。当該リバランス装置は、前記複数のノードにおけるいずれかのノードであってもよいし、前記複数のノード以外の装置であってもよい。 (Summary of embodiment)
As described above, according to the present embodiment, a rebalancing apparatus used in a distributed system having a plurality of nodes to which information from a plurality of client machines using a communication service is distributed via a network, Based on the load amounts of a plurality of nodes, the rebalance processing means for determining whether or not the rebalance that suppresses the uneven load amount among the plurality of nodes is necessary, and the rebalance processing means And a cancel processing means for determining whether or not to cancel the rebalance based on the predicted load state of the plurality of nodes after the rebalance. Provided. The rebalancing device may be any node in the plurality of nodes, or may be a device other than the plurality of nodes.

前記キャンセル処理手段は、例えば、前記リバランス処理手段によりリバランスが必要であると判定された第１の時点から、前記リバランス処理手段によるリバランス設計にかかる時間が経過した第２の時点における前記複数のノードの負荷量と、前記第１の時点における前記複数のノードの負荷量とに基づいて、前記予測負荷状態を算出する。 The cancel processing means is, for example, at a second time when a time required for rebalance design by the rebalance processing means has elapsed from a first time when the rebalance processing means determines that rebalancing is necessary. The predicted load state is calculated based on the load amounts of the plurality of nodes and the load amounts of the plurality of nodes at the first time point.

また、例えば、前記予測負荷状態が、ノード数の増減を要する負荷状態であるか、又は、前記予測負荷状態が、許容範囲外の負荷量を持つ所定数のノードが存在する負荷状態である場合に、前記キャンセル処理手段は、前記リバランスをキャンセルすると判定する。 Further, for example, when the predicted load state is a load state that requires an increase or decrease in the number of nodes, or the predicted load state is a load state in which a predetermined number of nodes having a load amount outside an allowable range exists. In addition, the cancel processing means determines to cancel the rebalance.

また、前記キャンセル処理手段は、例えば、前記第１の時点において、当該第１の時点における前記複数のノードの負荷量の平均値からの差分をノード毎に算出し、前記第２の時点において、ノード毎に、当該第２の時点におけるノードの負荷量から前記差分を引くことにより、前記予測負荷状態を算出する。 In addition, for example, at the first time point, the cancel processing unit calculates a difference from an average value of load amounts of the plurality of nodes at the first time point for each node, and at the second time point, For each node, the predicted load state is calculated by subtracting the difference from the load amount of the node at the second time point.

なお、分散システム負荷リバランス部１８ｅは、リバランス処理手段の例である。リバランシングキャンセル機能部１８ｆは、キャンセル処理手段の例である。 The distributed system load rebalancing unit 18e is an example of a rebalance processing unit. The rebalancing cancel function unit 18f is an example of a cancel processing unit.

本実施の形態に係る技術によれば、分散システムにおいて、時間経過の観点を含めてリバランスの実行が適切であるかを高速で判断し、実行すべきでない状況を特定することができる。よって、無駄なリバランスの実行を抑制できる。 According to the technique according to the present embodiment, in a distributed system, it is possible to determine at high speed whether rebalancing is appropriate, including the viewpoint of the passage of time, and to specify a situation that should not be performed. Therefore, execution of useless rebalancing can be suppressed.

（第１項）
通信サービスを利用する複数のクライアントマシンからの情報がネットワークを介して振り分けられる複数のノードを有する分散システムにおいて用いられるリバランス装置であって、
前記複数のノードの負荷量に基づいて、当該複数のノード間の負荷量の偏りを抑制するリバランスが必要であるか否かを判定するリバランス処理手段と、
前記リバランス処理手段により、リバランスが必要であると判定された場合において、前記リバランス後の前記複数のノードの予測負荷状態に基づいて、前記リバランスをキャンセルするか否かを判定するキャンセル処理手段と
を備えることを特徴とするリバランス装置。
（第２項）
前記キャンセル処理手段は、
前記リバランス処理手段によりリバランスが必要であると判定された第１の時点から、前記リバランス処理手段によるリバランス設計にかかる時間が経過した第２の時点における前記複数のノードの負荷量と、前記第１の時点における前記複数のノードの負荷量とに基づいて、前記予測負荷状態を算出する
ことを特徴とする第１項に記載のリバランス装置。
（第３項）
前記予測負荷状態が、ノード数の増減を要する負荷状態であるか、又は、前記予測負荷状態が、許容範囲外の負荷量を持つ所定数のノードが存在する負荷状態である場合に、前記キャンセル処理手段は、前記リバランスをキャンセルすると判定する
ことを特徴とする第１項又は第２項に記載のリバランス装置。
（第４項）
前記キャンセル処理手段は、
前記第１の時点において、当該第１の時点における前記複数のノードの負荷量の平均値からの差分をノード毎に算出し、前記第２の時点において、ノード毎に、当該第２の時点におけるノードの負荷量から前記差分を引くことにより、前記予測負荷状態を算出する
ことを特徴とする第２項に記載のリバランス装置。
（第５項）
通信サービスを利用する複数のクライアントマシンからの情報がネットワークを介して振り分けられる複数のノードを有する分散システムにおいて用いられるリバランス装置が実行するリバランス方法であって、
前記複数のノードの負荷量に基づいて、当該複数のノード間の負荷量の偏りを抑制するリバランスが必要であるか否かを判定するリバランス判定ステップと、
前記リバランス判定ステップにより、リバランスが必要であると判定された場合において、前記リバランス後の前記複数のノードの予測負荷状態に基づいて、前記リバランスをキャンセルするか否かを判定するキャンセル判定ステップと
を備えることを特徴とするリバランス方法。
（第６項）
前記キャンセル判定ステップにおいて、前記リバランス装置は、
前記リバランス判定ステップによりリバランスが必要であると判定された第１の時点から、リバランス設計にかかる時間が経過した第２の時点における前記複数のノードの負荷量と、前記第１の時点における前記複数のノードの負荷量とに基づいて、前記予測負荷状態を算出する
ことを特徴とする第５項に記載のリバランス方法。
（第７項）
前記キャンセル判定ステップにおいて、前記予測負荷状態が、ノード数の増減を要する負荷状態であるか、又は、前記予測負荷状態が、許容範囲外の負荷量を持つ所定数のノードが存在する負荷状態である場合に、前記リバランス装置は、前記リバランスをキャンセルすると判定する
ことを特徴とする第５項又は第６項に記載のリバランス方法。
（第８項）
コンピュータを、第１項ないし第４項のうちいずれか１項に記載のリバランス装置における各手段として機能させるためのプログラム。
以上、本発明の実施例について詳述したが、本発明は斯かる特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。
(Section 1)
A rebalancing device used in a distributed system having a plurality of nodes to which information from a plurality of client machines using a communication service is distributed via a network,
Rebalancing processing means for determining whether or not rebalancing is necessary to suppress the uneven load amount between the plurality of nodes based on the load amounts of the plurality of nodes;
When the rebalancing processing unit determines that rebalancing is necessary, based on the predicted load state of the plurality of nodes after the rebalancing, canceling whether to cancel the rebalancing or not Processing means and
A rebalancing device comprising:
(Section 2)
The cancellation processing means
The load amounts of the plurality of nodes at the second time point when the time required for the rebalance design by the rebalance processing unit has elapsed from the first time point when the rebalance processing unit determines that rebalancing is necessary. And calculating the predicted load state based on load amounts of the plurality of nodes at the first time point.
The rebalancing device according to item 1, characterized in that:
(Section 3)
If the predicted load state is a load state that requires an increase or decrease in the number of nodes, or the predicted load state is a load state in which a predetermined number of nodes having a load amount outside the allowable range exists, the cancellation is performed. The processing means determines to cancel the rebalance
The rebalancing apparatus according to item 1 or 2, characterized in that:
(Section 4)
The cancellation processing means
At the first time point, a difference from the average load amount of the plurality of nodes at the first time point is calculated for each node, and at the second time point, for each node, at the second time point The predicted load state is calculated by subtracting the difference from the load amount of the node.
The rebalancing device according to item 2, characterized in that:
(Section 5)
A rebalancing method executed by a rebalancing device used in a distributed system having a plurality of nodes to which information from a plurality of client machines using a communication service is distributed via a network,
A rebalance determination step for determining whether or not rebalancing is required to suppress the uneven load amount between the plurality of nodes based on the load amounts of the plurality of nodes;
Cancellation for determining whether or not to cancel the rebalancing based on the predicted load state of the plurality of nodes after the rebalancing when the rebalancing determination step determines that rebalancing is necessary Judgment step and
A rebalancing method comprising:
(Section 6)
In the cancellation determination step, the rebalance device
Load amounts of the plurality of nodes at the second time point when the time required for rebalance design has elapsed from the first time point when the rebalance determination step determines that rebalancing is necessary, and the first time point The predicted load state is calculated based on the load amounts of the plurality of nodes at
6. The rebalancing method according to item 5, wherein
(Section 7)
In the cancellation determination step, the predicted load state is a load state that requires an increase or decrease in the number of nodes, or the predicted load state is a load state in which a predetermined number of nodes having a load amount outside an allowable range exists. In some cases, the rebalancing device determines to cancel the rebalancing.
7. The rebalancing method according to item 5 or 6, wherein
(Section 8)
The program for functioning a computer as each means in the rebalancing apparatus of any one of Claim 1 thru | or 4.
As mentioned above, although the Example of this invention was explained in full detail, this invention is not limited to such specific embodiment, In the range of the summary of this invention described in the claim, various deformation | transformation・ Change is possible.

１０分散システム
１１クライアントマシン
１２ネットワーク
１３ロードバランサ
１４クラスタ
１５ノード
１８制御部
１８ａノード識別子管理部
１８ｂ振分部
１８ｃ信号処理部
１８ｄノード負荷計測部
１８ｅ分散システム負荷リバランス部
１８ｆリバランシングキャンセル機能部
１９記憶部
１９ａノード識別子管理表
１９ｂ振分ＩＤ表
１９ｃデータ
１９ｄノード負荷計測データ
１９ｅ分散システム負荷集計データ
１９ｆ呼制御状態フラグ
１９ｇノード毎負荷差分表、ノード毎予測負荷比較表
１９ｈ前回測定データ DESCRIPTION OF SYMBOLS 10 Distributed system 11 Client machine 12 Network 13 Load balancer 14 Cluster 15 Node 18 Control part 18a Node identifier management part 18b Distribution part 18c Signal processing part 18d Node load measurement part 18e Distributed system load rebalancing part 18f Rebalancing cancellation function part 19 Storage unit 19a Node identifier management table 19b Distribution ID table 19c Data 19d Node load measurement data 19e Distributed system load summary data 19f Call control status flag 19g Load difference table for each node, predicted load comparison table for each node 19h Previous measurement data

Claims

A rebalancing device used in a distributed system having a plurality of nodes to which information from a plurality of client machines using a communication service is distributed via a network,
Rebalancing processing means for determining whether or not rebalancing is necessary to suppress the uneven load amount between the plurality of nodes based on the load amounts of the plurality of nodes;
When the rebalancing processing unit determines that rebalancing is necessary, based on the predicted load state of the plurality of nodes after the rebalancing, canceling whether to cancel the rebalancing or not Processing means ,
The cancellation processing means
At a first time point when the rebalance processing unit determines that rebalancing is necessary, a difference from an average value of load amounts of the plurality of nodes at the first time point is calculated for each node, By subtracting the difference from the load amount of the node at the second time point for each node at the second time point when the time required for the rebalance design by the rebalance processing unit has elapsed from the time point 1, the prediction is performed. A rebalancing device that calculates a load state .

If the predicted load state is a load state that requires an increase or decrease in the number of nodes, or the predicted load state is a load state in which a predetermined number of nodes having a load amount outside the allowable range exists, the cancellation is performed. The rebalancing apparatus according to claim 1, wherein the processing unit determines to cancel the rebalancing.

A rebalancing method executed by a rebalancing device used in a distributed system having a plurality of nodes to which information from a plurality of client machines using a communication service is distributed via a network,
A rebalance determination step for determining whether or not rebalancing is required to suppress the uneven load amount between the plurality of nodes based on the load amounts of the plurality of nodes;
Cancellation for determining whether or not to cancel the rebalancing based on the predicted load state of the plurality of nodes after the rebalancing when the rebalancing determination step determines that rebalancing is necessary A determination step ,
In the cancellation determination step, the rebalance device
At a first time point when rebalancing is determined by the rebalance determining step, a difference from an average value of load amounts of the plurality of nodes at the first time point is calculated for each node, The predicted load state is calculated by subtracting the difference from the load amount of the node at the second time point for each node at the second time point when the time required for rebalance design has elapsed from the time point 1. A rebalancing method characterized by

In the cancellation determination step, the rebalancing device is configured such that the predicted load state is a load state that requires an increase or decrease in the number of nodes, or the predicted load state has a predetermined number of nodes having a load amount outside an allowable range. The rebalancing method according to claim 3 , wherein the rebalancing device determines that the rebalancing is canceled when a load state exists.

The program for functioning a computer as each means in the rebalancing apparatus of Claim 1 or 2 .